Introduction

Asthma is a common respiratory disease characterized by airway inflammation, reversible airflow obstruction and bronchial hyper-responsiveness (BHR) to various environmental stimuli. Asthma is a complex and heterogeneous disorder caused by multiple genetic and environmental factors. Candidate-gene and genome-wide linkage studies, carried out over the past 10 years, pointed to at least 20 regions linked to asthma and asthma-related phenotypes,1, 2 suggesting that many genes with relatively small effects might be involved. Although a few genes have been characterized by positional cloning,2 localizing and identifying new genes remain challenging.

Among the many environmental factors that have been reported to be associated with asthma or to exacerbate asthma symptoms, maternal smoking during pregnancy and exposure to tobacco smoke (ETS) in early life have been found to be important risk factors for asthma and BHR.3, 4, 5 G (gene) × ETS interaction underlying susceptibility to asthma and asthma-related phenotypes have been evidenced by association studies with candidate genes including ADRB2 (β2-adrenergic receptor),6 GSTM1 (glutathione S-transferase M1)7, 8 and GSTT1 (glutathione S-transferase T1)7 genes and more recently in the context of two genome-wide scans in American and Dutch families.9, 10 These studies as well as simulation studies have shown that taking into account G × E (Environment) interaction in genetic analysis cannot only lead to detection of such interaction but can also increase power to detect linkage (or association with candidate genes) which would not be revealed by classical approaches.11, 12

A genome scan for asthma and atopy-associated phenotypes13, 14 was recently carried out in a sample of 295 French families ascertained through an asthmatic subject, as part of the EGEA study (Epidemiological study on the Genetics and Environment of Asthma). This scan showed evidence for linkage of a few regions to asthma-related phenotypes but revealed weaker signals for asthma on 1p31 and for bronchial responsiveness on 3q21. Using the same data, the aim of the present study was to include information on passive smoking exposure during early life and/or in-utero in genome-wide linkage analyses of asthma and BHR and to test for G × ETS interaction. We used two statistical methods, the predivided sample test (PST)15 and the mean interaction test (MIT),16 which differed from the approaches used by the American and Dutch screens9, 10 and have been shown, through simulations, to increase power to detect linkage in presence of G × E interaction under various genetic models.11

Materials and methods

Family data

The main EGEA sample included 348 nuclear families selected through one asthmatic proband. An additional sample of 40 nuclear families was selected through two asthmatic sibs to increase the number of informative families for linkage analyses of asthma. Inclusion criteria met by the probands were described elsewhere.17 Ethical committee approved the protocol and subjects signed informed consent forms.

Phenotypes analyzed

Information on respiratory and allergic symptoms, medical history and environmental factors including exposure to tobacco smoking was collected using a standardized questionnaire.18 Various biological and physiological tests were also performed.17 Asthma was defined using the same criteria as in previous EGEA linkage scans:13, 14 positive answer to at least one of the two questions: 1/Have you ever had attacks of breathlessness at rest with wheezing? 2/Have you ever had an attack of asthma? associated with either the presence of bronchial hyper-responsiveness, hospitalization for asthma in life or asthma therapy.

The BHR phenotype, used in the present analysis, was defined by the PD20: a fall in baseline FEV1 (forced expiratory volume at 1 s)≥20% at 4 mg/ml methacholine. The protocol of the methacholine challenge test has been described in detail elsewhere.17

The passive ETS in early-childhood phenotype was defined by answers to different questions in adults and children indicated as follows: (1) for an adult, by a positive answer to the question: did your mother or your father smoke during your early-childhood? (2) for a child, by a positive answer to the question asked to the child's mother (or father): did you or the father (or the mother) of your child smoke when your child was less than 2 years old?

Our aim was also to consider an ETS phenotype defined by passive ETS in-utero and/or in early childhood. However, the distribution of exposed/unexposed siblings according to this ETS phenotype was similar to the distribution observed when using the first ETS phenotype (i.e. ignoring the in-utero exposure). Thus, only results corresponding to passive ETS in early childhood are presented.

Genotypes

Genotyping of 396 microsatellites (378 autosomal markers) was performed in EGEA families with at least two sibs with DNA available. These markers were distributed at an average marker density of 10 centimorgan (cM) and had an average heterozygosity of 75%. Rigorous genotype quality control was performed, as described previously.13, 14 Following our previous scan,13, 14 additional markers were genotyped for fine mapping in the regions that revealed potential linkage. A total of 185 additional markers, distributed across 12 chromosomes (chromosomes 1–3, 5–7, 9, 11–13, 17 and 21), were included in the present analyses. As for the original scan, these markers were evaluated for Mendelian inconsistencies using Pedcheck19 and double recombinants were identified using Mendel.20 After this genotyping quality control, the present samples included 110 families with at least two asthmatics and 89 families with at least two sibs with BHR.

Statistical methods

IBD sharing under G × E interaction

The sharing of marker alleles identical by descent (IBD) among affected sib-pairs is classically used in linkage analysis between a disease and a marker and allows to conclude in favor of linkage if the observed IBD distribution differs significantly from that expected under the null hypothesis of independence of segregation of disease and marker. This comparison can be made with the MLS statistic21 which provides maximum likelihood estimates of the proportions of sib-pairs with 2, 1, or 0 marker alleles IBD (denoted respectively z2, z1 and z0), given the observed marker genotypes. Note that the maximum likelihood statistic (MLS) method usually assumes that the IBD proportions, z2, z1 and z0, are constrained within a triangle so that 2z0z1≤0.5, referred as the triangle constraints.22, 23 Under linkage, the IBD distribution depends on the parameters of the susceptibility gene to which the marker is linked, that is the allelic frequency and the genotype relative risks. When there is G × E interaction, these latter parameters differ according to the presence or absence of E, consequently the IBD distribution may also differ according to the number of sib(s) (2, 1 or 0) exposed in the pair. Such heterogeneity in the IBD distribution among sib-pairs is used to detect G × E interaction by the two PST and MIT methods.

The predivided sample test

Under the null hypothesis of absence of G × E interaction, the IBD vectors Z (z2, z1, z0) estimated among affected sib-pairs with 2 or 0 sibs exposed (EE and UU pairs, respectively) are expected to be equal. Therefore, the null hypothesis can be tested by −2(Ln(L(ZEE)L(ZUU)/L(Z))) which follows a χ2 distribution with 2 df15 where L(ZEE), L(ZUU) and L(Z) are the likelihood of the parameter vectors, ZEE, ZUU and Z estimated respectively in EE and UU sib-pairs and in the whole sample (made of these two subsamples). Since the parameter vectors Z are estimated using the triangle constraints, the PST statistic does not follow a χ2 with 2 df as would do the classical PST.15 Empirical P-values estimated by bootstrap analyses showed that the PST distribution under such constraints was close to a χ2 distribution with 1df.

The mean interaction test

Gauderman and Siegmund16 proposed an extension of the mean sharing test to account for G × E interaction. The mean sharing test uses the proportion of alleles shared IBD among affected sib-pairs, π, which is expected to be equal to 0.5 under the null hypothesis of no linkage. In this extension, π is estimated in the different groups of affected sib-pairs (EE, EU and UU for 2, 1 and 0 exposed sibs respectively). This method is based on the regression model πi=π+β(Xi-X)+ɛi, where π is the intercept and β the regression coefficient for the exposure (Xi the covariate of exposure being centered on its mean X). Different coding schemes were proposed for X,16 but, as shown by our simulations, they led to similar results11 under most G × E interaction models. Thus, to minimize the number of tests performed, we chose one of these coding schemes that included two variables (XEE and XEU) contrasting EE and EU sib-pairs respectively to UU pairs. The null hypothesis of absence of linkage was tested by the likelihood ratio test, Tπβ=−2(Ln{L(π=0.5, β=0)}−Ln{L(π,β)}). The alternative hypothesis is assumed to be one-sided for π>0.5, and two-sided for β. The distribution of T is a (50:50) mixture distribution of χp2 and χp+12, with p being the number of regression coefficients (here equal to 2). The alternative hypothesis corresponds to linkage with or without G × E interaction.

Note that the MIT takes into account G × E interaction when testing for linkage but does not test for G × E, while the PST allows detection of both linkage and G × E. In the absence of G × E, the PST cannot detect linkage, while the MIT can but with lower power than the classical mean test, due to additional parameters for the exposure variables, and consequently higher degrees of freedom.

For both PST and MIT methods, a multipoint approach was used to estimate the IBD sharing (Z vectors), taking into account the information provided by the entire set of chromosome markers.

Results

Among the 726 genotyped siblings of the 295 EGEA families selected through one asthmatic subject, the distribution of clinical characteristics are given separately in exposed and in unexposed sibs (Table 1). Distribution of sex and age were similar in the two groups. There was also no difference of BHR frequency between exposed and unexposed siblings, while the frequency of asthma was slightly higher in unexposed sibs than in the exposed ones but not significantly. This might be explained by under-reporting of smoking habits by mothers of asthmatic children when questioned on their children's passive ETS.24

Table 1 Descriptive characteristics of the 726 genotyped siblings of the 295 EGEA families according to passive ETS in early childhood

Table 2 gives the distribution of genotyped affected sib-pairs for asthma and BHR, according to their exposure status (EE, EU, UU).

Table 2 Distribution of sib-pairs affected with asthma and BHR according to the number of sibs exposed for passive ETS in early childhood

Table 3 shows the regions detected for asthma and BHR at a P-value≤0.005 using either PST or MIT approach. For asthma, no region was revealed at that level by the PST, while the two following regions were detected by the MIT: 14q32 (P=0.0009) and 17q21 (P=0.0003). Four regions were revealed for BHR at the 0.5% level. Three of these regions were detected by both PST and MIT approaches: 1q43-q44 (P=0.005 and 0.0009), 4q34 (P=0.003 and 0.002) and 17p11 (P=0.002 and 0.002). Evidence for linkage to 5p15 (P=0.003) was found by PST only.

Table 3 Chromosomal regions with evidence for linkage to asthma and/or to BHR at P≤0.005 with at least one test (PST or MIT) taking into account gene × passive smoking interaction

In the four regions revealed by the PST approach, there was evidence for G × ETS interaction underlying susceptibility to BHR. In two of them, 1q43-q44 and 4q34, the excess of IBD sharing was only observed in affected sib-pairs with both sibs exposed (z2=0.44 and 0.34 respectively), indicating these regions may contain genetic factors that influence BHR only in presence of ETS. The opposite situation was observed for the 5p15 and 17p11 regions with an excess of IBD sharing when both sibs were unexposed (z2=0.46 and 0.54 respectively), suggesting involvement of genetic factors underlying BHR in absence of ETS.

Discussion

Taking into account gene–environment interactions in genetic analysis of a complex and heterogeneous disease such as asthma, although scarcely done, appears particularly relevant. The present genome-wide linkage analysis, that incorporated information on passive ETS during early life, led to detect two regions linked to asthma and four regions linked to BHR that were not revealed by classical linkage analysis of the same data. This was achieved by use of the PST and MIT methods, which both enabled to conduct an overall test for linkage (and heterogeneity for PST) while taking into account the exposure status of affected sib-pairs. These methods differed from the approaches used by the two previous screens9, 10 investigating G × ETS in which linkage analyses were conducted separately in exposed and unexposed families. Moreover, G × ETS interaction underlying BHR was evidenced in four regions by use of the PST. The present study took also advantage of the extensively documented information on environmental exposures in the EGEA study, regarding both adults and children at different time periods.

Four regions, 1q43-q44, 4q34, 5p15 and 17p11, potentially involved in genetic susceptibility to BHR interacting with ETS in early life, were detected by the PST approach. The PST approach indicated that two of these regions may contain genetic factors that influence BHR in presence of ETS while, in the two other regions, genes may play a role in absence of ETS. Three of these four regions, (1q43-q44, 4q34 and 17p11) were also revealed by the MIT. In contrast, the 5p15 region was not found significant by the MIT. This latter result may be due to the greater power of the PST over the MIT, particularly under some disease models in which presence (or absence) of the environmental factor changes the direction of the genetic effect, as shown by our previous simulations.11 Conversely, the 14q32 and 17q21 regions were detected for asthma by the MIT but not by the PST. These latter results might indicate presence of linkage without G × ETS interaction in these two regions. However, linkage analyses of asthma in the same EGEA data, ignoring G × E interaction, did not show evidence for linkage to these regions.14 These results might be also explained by a higher power of MIT than the PST approach and linkage analysis ignoring G × E, under some models of G × E interaction in which presence (or absence) of the environmental factor cancels the genetic effect.11 All these results underline the importance of using concomitantly different approaches to test for linkage in presence of G × E when the underlying model is unknown. Indeed, although part of the information used by PST and MIT is similar (IBD distribution among sib-pairs), which can account for some concordant results, the PST does not use the IBD distribution among sib-pairs discordant for ETS while the MIT does and the test statistics differ between these two methods.

As mentioned in the Method section, we also considered the passive ETS in-utero and/or in early childhood phenotype, in addition to passive ETS in early childhood. But this led to unchanged results. Indeed, most of the mothers who smoked during pregnancy also continued to smoke during the early childhood of their child. Analyses considering ETS ‘in-utero’ only were not possible here because of too small sample sizes. We finally also considered ETS in early childhood defined by exposure to mother's cigarette smoking only, assuming it might have a greater impact than father's smoking. However, use of this ETS variable did not lead to detect other regions than those already reported here (results not shown). As for other retrospective studies, ETS exposure may be subject to recall bias. Indeed, we found in the EGEA study, that offspring of parents who had stopped smoking underestimated their ETS during childhood. Also mothers of asthmatic children under-reported their smoking habits when questioned on their children's passive ETS.17 This may lead to misclassification of sib-pairs with respect to exposure but it would likely bias results towards the null hypothesis.

None of the regions detected by the present study has been reported by the two previous genomes scans9, 10 that investigated G × ETS interaction, except for the result evidenced by the PST in the 1q43-q44 region which was 20 cM apart from the peak load in the 1q42 region reported by the American study.9 In particular, we did not confirm linkage of 5q to asthma or BHR reported by the American and Dutch scans respectively in exposed sib-ships. Discrepancies between these results may be partly due to differences in terms of the affection status examined, asthma only in the American study9 and both asthma and BHR in the Dutch10 and French studies. Different statistical approaches were also used, a model-based approach in the Dutch study, and model-free approaches in the American and French studies but using different testing procedures. We should also note that tests of G × E interaction are generally conservative, and absence of replication may be also owing to lack of power.

A few regions revealed by the present study have been reported by previous genome screens of asthma and asthma-related phenotypes that did not take into account G × ETS interaction (including the EGEA study). The 14q32 region has shown evidence for linkage to asthma in the Hutterites population25 and to IgE in the CSGA study26 while the 17q21 was found linked to atopy in EGEA families,27 asthma in Australian families28 and to atopy in Dutch family data.29 With respect to BHR, the 1q43-q44 region was found linked to atopic asthma in a sample of European, Australian and US families,30 the 4q34 region was detected for asthma in that same family sample31 and the 5p15 region was revealed for asthma in the CSGA Afro-American families32 and for atopy in the EGEA study.13 All these results underline the complex genetic architecture of asthma for which there may be genes with pleiotropic effects on correlated phenotypes as well as gene–gene and gene–environment interactions.

Overall, all regions revealed by the PST/MIT for asthma and BHR were not detected by previous linkage analyses of these phenotypes that ignored G × E interaction in the same EGEA data. Note, however, that these previous analyses did not use the additional fine mapping markers as done here. However, when repeating linkage analyses of asthma and BHR with these additional markers on chromosomes 1, 5 and 17 (results not shown), linkage to these regions was still not detected except for the 1q region, which was revealed for BHR but in a less significant manner than presently with the MIT. These results support that taking into account G × E can increase substantially power to detect linkage. Moreover, the PST makes it possible to reveal G × E interaction. However, replication studies are needed to confirm our findings, since only one region on 1q replicated previous results in CSGA data.9 Further identification of these genes within these regions will permit to understand how the interactions of these genes with passive smoking in early life play a role in the development of asthma.