Introduction

Asthma is a multifactorial, heterogeneous disease characterized by reversible airway obstruction.1 A strong genetic basis for asthma is indicated by high estimates of heritability,2 and motivating genetics/population genomics studies aiming to identify the genetic susceptibility factors.3, 4, 5 A number of asthma susceptibility genes have been identified to date, but the findings have not been consistent across different populations/studies.3, 4, 5 This may partly relate to heterogeneity within asthma and variations across study populations.3, 4, 5 Although the understanding of asthma pathogenesis has been enhanced in recent years, further research is required for its better understanding. Among various asthma subtypes, prevalence of extrinsic/allergic/atopic asthma seems to be the highest among all, particularly in the Indian scenario.6, 7

Helper T (Th) cell responses, particularly, balance between Th1/Th2 responses are critical for onset and progression of the disease where Th1 responses are supposed to be protective toward asthma, whereas Th2 responses aid in the development of asthma phenotype(s).3, 4, 8, 9 The micro-environment in the vicinity of the naive Th-cells, antigenic dose, co-stimulators on antigen presenting cells, genetic modifiers and so on have a significant role in deciding whether naive CD4+ T cells would assume Th1 or Th2 effecter phenotype.8, 10 A number of studies suggest that polymorphisms in Th1/Th2 candidate genes may modulate the respective gene products, thereby acting as genetic modifiers of the disease.3, 4, 11 In our laboratory, a number of Th1/Th2 candidate genes have been found to be associated with asthma or related phenotypes in the Indian population.11 Technological advancements in recent times have enabled genotyping of a large number of single-nucleotide polymorphisms (SNPs) simultaneously. Using high-throughput genotyping platform, we have performed candidate gene association studies where we selected genes modulating Th1/Th2 development, differentiation and function, either directly or indirectly. Our major focus was on the key candidate genes from IL4, IFNG and IL12 signaling pathways together with genes that are known to modulate these pathways. We also included genes from microarray data, which were found to be differentially expressed in allergen-induced asthma model potentially modulating apoptosis, cell cycle and so on with indirect evidences that these pathway genes may influence the differentiation or maintenance of Th1/Th2 subsets.11

We report the association of some important Th1/Th2 candidate genes (IRF2, IL6, IFNGR2, STAT4 and IL4RA) with asthma. Importantly, the minor alleles at these loci seem to have protective effect with respect to asthma susceptibility. Also, to the best of our knowledge, this is the first report of an independent association of IL6 gene polymorphism with atopic asthma.

Materials and methods

Subjects

The inclusion criteria of subjects, families, asthmatics, normal controls have been previously described.12, 13 In brief, in the present study, asthmatic patients (n=147, 38.5±15 years) and normal controls (n=199, 37.0±13.5 years) were recruited from our collaborating centers following clinician’s diagnosed asthma (Table 1) and a set of inclusion/exclusion criteria (Supplementary Table A), after obtaining ethical clearance from our institute and the participating centers. Written consents were obtained from individuals for participating in the study, performing skin prick test and obtaining blood samples. A standard questionnaire was filled by the candidates giving clinical details, migration status, environmental history, family history of diseases and so on. The clinical test used for ascertaining asthma phenotypes were pulmonary function test (FEV1, bronchial reversibility (>15%) test using β2-agonist inhaler (albuterol/salbutamol)), skin prick test (wheal reaction >3mm diameter) to a panel of 15 local environmental allergens and total serum immunoglobulin (IgE) estimation as described earlier.12, 13 Normal controls were recruited from general population taking details of migration status, ethnicity and no self-reported history of asthma or allergic diseases. All individuals with smoking history and parasitic/helminthic infections in the past were excluded from the study. For the family based association studies, 247 families were collected as ascertained through probands (15.6±11.2 years) who were asthmatics along with both the parents affected/unaffected. Families were further extended wherever consents were obtained, and a total of 1022 individuals were recruited with average family size of 3.94 (range from 3 to 12 members). The disease status was confirmed in a manner similar to case–control study. The genetic homogeneity between patients and controls was confirmed by genotyping loci as yet unknown to be associated with asthma as described previously.12, 13

Table 1 Characteristics of the patient and the control groups (case–control study) and the probands (family based study)

Gene and SNP selection

We have selected 33 genes, which potentially modulate Th1/Th2 differentiation, development and/or functions directly or indirectly (Supplementary Table B). A total of 354 SNPs were selected for genotyping using Golden Gate Assay on Illumina platform. Detailed description of gene and SNP selection and OPA (Oligonucleotide Pool All) design has been provided in the Supplementary methods.

Genotyping and data cleaning

Samples were genotyped with Illumina Bead Array system in accordance with manufacturer’s protocol14 and as described in Supplementary methods. Gen-Call module of BeadStudio package v3 was used to generate genotype calls. All the genotype clusters were inspected and corrected manually; the threshold for GenTrain score of >0.25 was set to call a SNP successfully genotyped.14 We retained markers for further analysis if the call rate was above 90%, maximum of one reproducibility error (5% replicates were used), maximum of five Mendelian errors and showed consistency with Hardy–Weinberg equilibrium at the level of P<0.01.

Data/statistical analyses

A detailed description of the statistical methods is provided in Supplementary methods. In brief, Hardy–Weinberg calculations were performed using PLINK (http://pngu.mgh.harvard.edu/~purcell/plink/).15 Pairwise linkage disequilibrium (LD) calculations and TagSNP selection was done using Haploview v 4.2.16 Single marker and haplotypic association studies were done using PLINK and FBAT.15, 17, 18

Results

Genotyping and data cleaning

After applying the standard quality control filters to the 354 SNPs, we retained 295 SNPs for further analysis. Of the 1368 samples, 17 samples had call rate below 95% and were therefore removed. Furthermore, eight triads were consistently showing transmission errors and were discarded. In total, therefore, 1326 individuals (239 pedigrees with 993 individuals and case–control cohort (139 cases+194 controls)) finally passed the quality control. After data cleaning, at most one Mendelian error was observed for 99% of the SNPs. Furthermore, ~98% of the markers had call rate above 97%.

LD plots and TagSNPs

The SNPs that passed the quality control criteria were used for gene-wise LD calculation and TagSNP selection. We used the LD values (r2) to identify tag SNPs using the tagger as implemented in Haploview. Using tagger (gene-wise), we selected 229 tag SNPs for our further association analysis (Supplementary Table C).

Association analysis with asthma

As mentioned in Supplementary methods family based and case–control association analyses have been performed. As the individuals in the families and case–control cohorts were non-overlapping but from the same ethnic background and therefore constituted two independent observations, Fisher’s combined probability test, which is a technique, as data fusion was used to aid our decision making. Fisher’s combination of probability test is a robust test in the sense that distribution of the underlying test statistic does not have any effect on the performance of the test. It only exploits the property of the P-value obtained using any statistics. Four novel SNPs (previously unreported to be associated with asthma or related phenotypes); rs3733475A/C (IRF2), rs2069832A/G (IL6), rs2012075G/A (IFNGR2) and rs1400656G/A (STAT4) showed statistically significant association with asthma in families as well as case–control analyses (Table 2). The combined P-values for these SNPs remains significant after correction for multiple testing (P=0.002, P=0.001, P=0.004 and P=0.003, respectively) (Table 2). Among the SNPs, which have been previously shown to be associated with asthma or related phenotypes, we found evidence of association only for rs1805011C/A (IL4RA) as the combined P-value (P=0.001) was statistically significant (Table 2). The minor alleles at all the loci show a trend toward negative association (odd ratio <1; Table 2).

Table 2 Results of allelic association analyses with asthma in families and case–control cohorts together with combined P-values

Haplotype-based association tests are thought to be more powerful than single SNP-based tests.19 We performed haplotypic association analyses to further validate the results of single marker association analyses to assess whether the significant SNP(s) are present in the significant haplotypes. Also, as explained in Supplementary Methods, we have performed sliding window haplotypic analysis (with fixed window of 5 for genes with more than five SNPs) in case controls to identify region of highest significance. In families, haplotype-based association tests were performed with region (marker combinations) that were identified as regions of high significance. In the sliding window haplotypic analysis of IRF2 gene, in the region encompassing 5′ untranslated region, the five locus haplotypes, showed statistically significant association with asthma (P<0.001) (Table 3A and B). The rs3733475C/A SNP, as mentioned previously, showed significant association with asthma in single marker association analysis, whereas the rs1863314G/A SNP showed weak association signal (P=0.062) in families but not in case–control cohort (P>0.05). Interestingly, TAACG, the haplotype that has the protective alleles of rs3733475C/A as well as rs1863314G/A, is significantly under-transmitted to asthmatic offsprings in family analysis (P=1.1 × 10−6); while present with significantly higher frequency in normal controls (>7%) than cases (<1%) in case–control analysis (P=0.01) (Table 3A). The permutation procedure with one million iterations also shows statistically significant results (P<0.001). In IL6 gene as well haplotypes CAG with protective allele A was significantly under-transmitted to asthmatics in families (P=0.0008) and present with higher frequency of occurrence in normal controls (P=0.06), whereas haplotype CGG was over transmitted to asthmatics in families and present with significantly higher frequency (P=0.06) in normal controls. In other genes as well, a number of haplotypes showed association but statistical significance was lost owing to multiple testing corrections particularly in case–control analyses (Table 3A). A number of haplotypes showed association in family based analyses, however, further investigation and/or meta analysis with large sample size in future would be required to confirm these findings (Table 3A).

Table 3 Result of haplotypic analyses (with asthma) in families and case–control cohort; (A) haplotype wise association analyses and (B) global (Omnibus) haplotypic association analyses

Association analysis with log10 total serum IgE

Total serum IgE levels follow a log normal distribution and when cases and controls were compared with respect to the log serum IgE levels, a statistically significant difference was obtained (P<0.0001) (data not shown). In association analysis with log serum IgE, SNPs that showed association with asthma did provide minor or suggestive association signals, although none of them was significant after multiple testing corrections (P>0.05) (Table 4). In the haplotypic analysis, haplotypes TAACG and AAGAA that were negatively associated with asthma showed strong association signals with lower log10 total serum IgE levels in family based and case–control studies (P<0.001) (data not shown).

Table 4 Results of allelic association analysis with log10 serum total IgE in families and in case–control cohort together with combined P-values

Discussion

We have undertaken genetic association studies with 33 genes, most of which are candidates modulating Th1/Th2 differentiation/development and/or function, whereas four differentially expressed genes from microarray data regulating cellular stress pathways, such as cell cycle, apoptosis and so on, have also been included (Supplementary Table B). To the best of our knowledge, this is the first candidate gene association study, focused on a number of genes from Th1/Th2 pathway and asthma. The gene selection in this study is based on strong evidences of their functional role in modulating asthma pathogenesis, creating high prior probabilities for the existence of genetic effects, thereby strengthening the posterior probabilities of the noted effects.

We identified four novel SNPs (genes) and one previously reported SNP to be associated with asthma (Tables 2 and 3) and suggestive association with serum total IgE (Table 4). IL6 is a proinflammatory cytokine that is known to promote allergic inflammation together with IL4 and IL13, IL6 is also involved in terminal differentiation of B lymphocytes and generation of specific allergen IgE by B cells.20 IL6 could also induce the production of IL3, IL4 and IL5 by T lymphocytes, which in turn promotes differentiation of eosinophils and basophils.21, 22 Despite important role played by IL6, reports of association of this gene with atopic asthma are lacking. Interestingly, a joint effect of atopy/asthma and an IL6 polymorphism on risk of lung cancer has been previously observed.23 Also, genetic variants in IL6 and IL6R together have been shown to have interaction effect in modifying response to bronchodilators.24 Furthermore, joint effect of IL6 and TGFB1 polymorphism in atopic sensitization was noted in childhood asthmatics.25 IL6 levels are likely influenced by genetic polymorphism26 and our results further support studies that indicate that IL6 gene polymorphisms could modify the risk of developing asthma. IL4RA is the common subunit required for both IL4 and IL13 signaling, both of which are critical for Th2 differentiation and/or isotype switching IgE that have a central role in allergic inflammation.27 Several polymorphisms in this gene have been reported to be associated with asthma in different populations.27 The rs1805011 in IL4RA gene is a coding region polymorphism28 and its association in our study further highlights the critical role played by IL4/IL13 signaling pathway genes in genetic predisposition to asthma.

The other three genes (IRF2, IFNGR2 and STAT4) in which novel asthma-associated polymorphisms have been identified are essential for the differentiation and/or function of Th1 cells. The genes in this pathway have consistently been associated with autoimmune diseases such as celiac disease, multiple sclerosis and so on.29 Also, there are previous reports of associations of this cytokine signaling pathway genes with asthma or related phenotype, although these reports are scanty and inconsistent.29, 30, 31, 32, 33, 34 The same polymorphisms could not be directly tested in this study either because of technical problems of multiplexing, genotyping or since the reports came after our study was designed. Nevertheless, it is notable that our study supports the involvement of these genes in asthma susceptibility. It should be taken into account that in our study the marker selection was very dense with uniform representation across the gene; therefore, it is highly unlikely that we might have missed any significant information. The results of the sliding window haplotypic association analyses of IRF2 gene (Table 3 A and B), where a protective haplotype TAACG is identified, suggests important role for 5′ untranslated region in IRF2 gene regulation and/or in asthma.

A number of previously associated (other studies/populations) SNPs (for example, rs2069812 (IL5), rs20541 (IL13), rs2243250 (IL4) and so on (Supplementary Table C)) were not found to be associated with asthma in our study. In this regard, it would also be valuable to add that in our study design we focused only on allergic asthma, which is the dominant clinical subtype for which heritability is well established, however, this may not represent asthma in its entirety. The other reasons for non association in our study could be ethinc differences between our study population and previous studies. Furthermore, our study may be underpowered to draw conclusive inferences for some of the genes, which might confer only modest risk. The 239 trios included in our study would have ~80% power to detect an odds ratio ranging from 1.9 to 2.2; however, these odds ratios are very high for complex trait genetics. Further studies with increased sample size and/or replication studies in another cohort might be required for further validation of our results.

The bias in our results owing to population stratification is highly unlikely. The genetic homogeneity of our asthmatics and normal controls has been verified in our laboratory by genotyping loci (microsatellite markers) that are yet unlinked to asthma.12, 13 Also, recently, subsets of individuals from the cases and controls have been genotyped on genome wide association studies and no evidence of stratification (based on analysis of randomly selected 10 000 markers) was observed (unpublished data). Family based and case–control studies have their own pros and cons; family based studies are robust against population stratification but they are prone to type II or false-negative associations, whereas case–control studies are more sensitive to population stratification and prone to false-positive associations or type I error.13 Therefore, family based studies segregate genes with modest effect while case–control studies are powered for common disease common variant approach. A positive association signal in both families and case–control analyses, as seen in our study, is more likely to reject the null hypothesis. Our decision making is further strengthened by the combined probability testing method as it provides a way to integrate information from more samples and resultant increase in power.

The associated SNPs may either themselves have an independent functional role or might be in LD with neighboring functional polymorphism. The rs1805011 (IL4RA) is a coding region polymorphism, leading to a missense mutation and amino-acid change from glutamine to alanine.28 Although, the functional role of this non-synonymous mutation is yet to be discovered, the functional role of other missense mutations such as rs1805010 A/G (Ile50Val) and rs1801275 A/G (Gln551Arg) in close vicinity of this SNP have been demonstrated.35 Understanding the functional significance of the associated SNPs and/or identification of functional polymorphisms in LD with these should be the logical step forward, in unraveling further, their relevance in asthma pathogenesis or molecular mechanisms governing asthma. Of particular interest would to understand how these variations confer protection with respect to asthma susceptibility.

In summary, a number of Th1/Th2 candidate genes show association with atopic asthma in family based and case–control analyses. We identify the important role played by 5′ untranslated region gene polymorphism in IRF2 gene in asthma susceptibility. To the best of our knowledge, this is first report of independent association of IL6 gene polymorphism with atopic asthma. Understanding the functional significance of the asthma-associated polymorphisms might provide further insight into the roles of these genes in asthma pathogenesis.