The main genetic determinant of soluble interleukin 6 receptor (sIL-6R) levels is the missense variant rs2228145 that maps to the cleavage site of IL-6R. For each Ala allele, sIL-6R serum levels increase by ∼20 ng ml−1 and asthma risk by 1.09-fold. However, this variant does not explain the total heritability for sIL-6R levels. Additional independent variants in IL6R may therefore contribute to variation in sIL-6R levels and influence asthma risk. We imputed 471 variants in IL6R and tested these for association with sIL-6R serum levels in 360 individuals. An intronic variant (rs12083537) was associated with sIL-6R levels independently of rs4129267 (P=0.0005), a proxy single-nucleotide polymorphism for rs2228145. A significant and consistent association for rs12083537 was observed in a replication panel of 354 individuals (P=0.033). Each rs12083537:A allele increased sIL-6R serum levels by 2.4 ng ml−1. Analysis of mRNA levels in two cohorts did not identify significant associations between rs12083537 and IL6R transcription levels. On the other hand, results from 16 705 asthmatics and 30 809 controls showed that the rs12083537:A allele increased asthma risk by 1.04-fold (P=0.0419). Genetic risk scores based on IL6R regulatory variants may prove useful in explaining variation in clinical response to tocilizumab, an anti-IL-6R monoclonal antibody.
Asthma is a chronic lung disease characterized by airway obstruction, inflammation and hyperresponsiveness.1 It is caused by a combination of both genetic and environmental factors but their interaction is complex and still not fully understood.
Interleukin 6 (IL-6) is a cytokine known to be involved in allergic asthma2, 3, 4, 5 and it binds to its target cells through a complex of at least two distinct membrane-bound proteins, a 80-kD ligand-binding glycoprotein (IL-6R) and a 130-kD signal-transducing glycoprotein (gp130).6, 7 Although gp130 is expressed on most cells, expression of the membrane-bound form of IL-6R is limited to hepatocytes and some immune cells, such as neutrophils, monocytes, CD4+ T cells and B cells;8, 9 as such, only these cells are directly stimulated by IL-6. However, IL-6R also exists in a soluble form (sIL-6R) that is either produced by proteolytic cleavage of membrane-bound IL-6R or by alternative splicing.10, 11, 12, 13 sIL-6R can associate with free IL-6 and this complex is then recognized by the membrane-bound gp130, hence providing an alternative IL-6 signalling pathway available to cells that do not express membrane-bound IL-6R, a process known as trans-signalling.7
The main genetic determinant of sIL-6R levels is thought to be the nonsynonymous variant rs2228145 in exon 9,14 which maps to amino acid position 358 on the main cleavage site of IL-6R.15 Specifically, individuals who carry the Ala358 allele have increased sIL-6R levels14—which in turn are associated with increased levels of allergic inflammation in the lung16—and also increased risk of asthma.17 However, given that this variant explains ∼19% of the variation in sIL-6R levels18 but the total heritability of this trait has been estimated at ∼70%,19 we hypothesized that additional variants in or near the IL6R gene contribute to sIL-6R variation independently of rs2228145 and may also influence asthma risk.
To test this hypothesis, in this study we (1) used data from the 1000 Genomes Project20 to comprehensively impute common variants in or near the IL6R gene; (2) tested these variants for association with sIL-6R levels measured in 360 individuals; (3) followed-up the most associated variants in an independent sample of 354 individuals; and (4) tested the replicated variants for association with IL6R transcript levels in two independent cohorts (N=851 and 5191) and asthma risk (N=47 514).
Common variants affecting sIL-6R serum levels independently of rs4129267
To find new common variants associated with sIL-6R levels and possibly with asthma risk, we first measured sIL-6R serum levels in 360 asthmatics (Supplementary Table 1) who had been previously genotyped with Illumina 610K single-nucleotide polymorphism (SNP) arrays (Illumina Inc., San Diego, CA, USA).17 sIL-6R measurements were performed in duplicate, with good agreement between assays (r=0.95). Assay and plate effects explained 0.7% (P=0.013) and 13.6% (P=3.3 × 10−20), respectively, of the sample variation in sIL-6R levels and hence these effects were removed before association analyses. Duplicate measurements were then averaged and the overall distribution normalized using a rank-based inverse-normal transformation. There were no significant effects of age (P=0.256) or sex (P=0.932) on sIL-6R levels.
The main genetic determinant of sIL-6R levels is thought to be the nonsynonymous variant rs2228145,14 which is in high linkage disequilibrium (r2=0.99) with rs4129267 (Figure 1), a SNP that was directly genotyped in all individuals. We confirmed that the rs4129267 SNP has a significant impact on sIL-6R availability, increasing serum levels by 19.7 ng ml−1 for each copy of the T allele and explaining 30% of the interindividual variation (P=1.8 × 10−29, Supplementary Figure 1). However, given that the heritability of sIL-6R levels has been estimated at ∼70%,19 we hypothesized that additional variants in or near the IL6R gene contribute to variation in sIL-6R serum levels independently of rs4129267.
To test this hypothesis, we imputed 452 common variants in IL6R (±50 kb) in the 360 asthmatics using data from the 1000 Genomes Project as the reference panel. These variants, as well as 19 additional IL6R SNPs that were present in the Illumina 610K array, were then tested for association with sIL-6R levels, after adjustment for the effect of rs4129267. Association results for all 471 SNPs are included in Supplementary Table 2.
The variant with strongest association with rs4129267-adjusted sIL-6R levels was rs12083537 (uncorrected P=0.0005, Supplementary Table 3 and Supplementary Figure 2); the association remained significant after correcting for the number of SNPs tested through 1 00 000 permutations (P=0.0496). The rs12083537 variant is in low linkage disequilibrium with rs4129267 (r2=0.03) and has a minor allele (G) frequency of 0.23. An additional three independent (r2<0.1) SNPs had a significant (uncorrected P<0.05) association with sIL-6R levels after adjusting for both rs4129267 and rs12083537 (Supplementary Table 3). However, these associations did not remain significant after correction for multiple SNP testing (P>0.05).
Replication of the new association between rs12083537 and sIL-6R serum levels
To confirm that the observed association between rs12083537 and sIL-6R levels was real and not a technical or statistical artefact, we performed a replication study using the same experimental procedure in an additional set of 354 unrelated asthmatics (Supplementary Table 4), genotyped for both rs4129267 and rs12083537.
After adjusting for the effects of rs4129267, rs12083537 was significantly associated with sIL-6R levels (P=0.033), with the same direction of effect observed in the discovery analysis (Supplementary Figure 3). These results thus confirm that rs12083537 is a new quantitative trait locus for sIL-6R serum protein levels.
After combining data from the discovery and replication cohorts (combined N=714), rs12083537 explained 2.2% of the variation in sIL-6R levels after adjustment for the effects of rs4129267 (P=6.2 × 10−5). Before adjustment for rs4129267, rs12083537 explained 0.15% of the variation in sIL-6R levels. The rs12083537:G minor allele decreased sIL-6R levels similarly in the three rs4129267 genotype classes (Figure 2).
Association between rs12083537 with IL6R mRNA levels
The main genetic determinant of sIL-6R levels, rs2228145, is located on the main cleavage site of IL-6R and hence it is thought to affect protein levels through a post-transcriptional mechanism.15 To provide some insight into the molecular mechanisms underlying the new rs12083537 association with sIL-6R protein levels, we investigated whether this variant influenced IL6R mRNA levels measured in whole blood in a population-based study of adolescent twins and their relatives (N=851). There was no significant association between rs12083537 and IL6R mRNA levels in this cohort, either measured by a probe targeting the 3′-untranslated region (β=−0.050 for the G allele, s.e.=0.062, P=0.4234) or exon 9 (β=−0.062 for the G allele, s.e.=0.063, P=0.3209). On the other hand, we observed a significant association between rs4129267 (a proxy for rs2228145) and IL6R gene transcription (exon 9 probe, β=−0.120 for T allele, s.e.=0.051, P=0.0187). Counterintuitively, the rs4129267:T allele that increases sIL-6R protein levels was associated with decreased mRNA expression.
To confirm that the lack of association between rs12083537 and mRNA levels was not because of low power or incomplete coverage of IL6R expression patterns, we performed a similar analysis in a larger independent study (N=5191) with IL6R gene expression levels in whole blood measured by 32 individual Affymetrix probes, targeting exons 5, 7, 8 and the 3′-untranslated region. In this cohort, the rs12083537 variant was also not associated with IL6R transcription levels after a Bonferroni correction for the number of probes tested (P<0.0016, Supplementary Table 5). The strongest association was with a probe mapping to the 3′-untranslated region (P=0.0031), with the rs12083537:A allele that increases sIL-6R protein levels being associated with decreased mRNA expression, as noted above for rs4129267.
Association between rs12083537 and asthma risk
The rs2228145:C (or rs4129267:T) allele that increases sIL-6R levels14 also increases asthma risk.17 We therefore tested the hypothesis that rs12083537 is also a genetic risk factor for asthma. Specifically, we hypothesized that, as for rs2228145, the rs12083537:A allele that is associated with higher sIL-6R levels would be predisposing for asthma. Association results were available for 47 514 individuals from three independent studies, including 16 705 doctor-diagnosed asthmatics and 30 809 controls. A consistent predisposing effect was observed for the rs12083537:A allele in all three studies (Table 1). Overall, the association between rs12083537 and asthma risk was weak but statistically significant (odds ratio=1.039, 95% confidence interval=1.002–1.078, P=0.0419). Results remained largely unchanged when the association was tested after adjusting for rs4129267 (odds ratio=1.050, 95% confidence interval=1.012–1.089, P=0.0088).
We recently identified a variant (rs4129267) in the IL6R gene with a modest (odds ratio=1.09) but genome-wide significant association with asthma risk.17 This variant is in near complete linkage disequilibrium with the exon 9 missense variant rs2228145, and is associated with a 1.4-fold increase in sIL-6R levels for each copy of the T allele.22 The goal of this study was to test the hypotheses that (1) other variants in IL6R regulate gene expression and (2) that these also associate with asthma risk.
A single variant (rs12083537) located in intron 1 of IL6R, 2.9 kb away from exon 1, was found to associate with sIL-6R levels independently of rs4129267 and after correction for multiple SNP testing. A significant and consistent association for this variant was then observed in our replication panel. Furthermore, at the time of submission, we also noted that Ferreira et al. independently reported two new genetic associations for sIL-6R levels, one of which (rs1386821) is correlated with rs12083537 (r2=0.94). The alleles that increase sIL-6R levels for these two SNPs (rs12083537:A and rs1386821:T) are in phase and hence our results are consistent with those of Ferreira et al.23 Together, these results establish rs12083537, or a variant in linkage disequilibrium with it, as a new regulatory variant for sIL-6R serum levels.
The rs12083537 variant is located in a region with high H3K4Me1 lysine methylation in four cell lines, including lung fibroblasts and keratinocytes.24 This histone modification is associated with enhancers and promoters and hence it is plausible that rs12083537 may influence gene transcription. However, analysis of mRNA levels extracted from whole blood in two independent studies did not identify any significant associations between rs12083537 and IL6R transcription levels, after correcting for multiple testing. Thus, the molecular mechanism underlying the association between rs12083537 and sIL-6R levels remains to be elucidated. A potential caveat of our analysis was that we identified the association between rs12083537 and sIL-6R protein levels in asthma patients but then analysed mRNA levels in two studies that included mostly nonasthmatics. However, this difference is unlikely to explain the lack of association between rs12083537 and mRNA levels because IL6R regulatory variants have been shown to influence protein levels similarly in population-based samples,22 healthy individuals14, 25 and patients with different diseases.23, 25, 26 Gene expression studies in individual immune cell types or upon allergen stimulation may help characterize the function of this variant in greater detail.
Interestingly, we observed that the rs4129267:T allele that is associated with increased sIL-6R protein levels22 was associated with decreased IL6R mRNA levels, which may appear contradicting. The Illumina probe for which we observed an association with rs4129267 maps to exon 9 (containing the IL-6R transmembrane domain), which is skipped in the main differentially spliced IL6R isoform that directly produces sIL-6R. Therefore, this probe measures the levels of the full-length isoform, which necessarily decrease if splicing of exon 9 increases. Given that the rs4129267:T allele, or an allele associated with it (for example, rs2228145:C), strongly increases the splicing of exon 9,23 it is in fact not surprising that consequently this allele is associated with decreased mRNA levels of the full-length isoform. Unlike our results, Ferreira et al.23 did not observe a significant effect of rs2228145 on the levels of the full-length isoform. However, as the authors noted, splicing of exon 9 is a rare event. Therefore, the lack of association in that study likely reflected the low power provided by a small sample size (N=88) to detect small differences in the levels of the full-length isoform between genotype classes.
Lastly, we found rs12083537 to be significantly associated with asthma risk. Based on our analysis, each rs12083537:A allele increased sIL-6R serum levels by 2.4 ng ml−1 and increased asthma risk by 1.04-fold. For comparison, each rs4129267:T allele increased sIL-6R serum levels by 19.7 ng ml−1 and increased asthma risk by 1.09-fold.17 The association between rs12083537 and asthma risk was consistent across the three cohorts analysed but only marginally statistically significant in the overall analysis. Replication of our results by well-powered studies is therefore warranted; over 23 000 cases and 23 000 controls (or, for example, 18 000 cases and 36 000 controls) are required to achieve 80% power to detect (α=0.05) the effect we observed.
In summary, our study demonstrates that at least two independent genetic variants influence sIL-6R serum levels. Consequently, pharmacogenetic studies of clinical response to tocilizumab, an anti-IL-6R monoclonal antibody approved to treat rheumatoid arthritis27 and suggested to treat asthma,16, 17 should consider the effects of not only the major genetic determinant of sIL-6R levels (rs2228145), but also of other confirmed regulatory variants such as rs12083537. For this purpose, multi-SNP scores that summarize an individual’s combined genetic risk of having high sIL-6R levels may prove useful.
Materials and methods
Genetic variants in or near the IL6R gene were tested for association with sIL-6R protein levels, mRNA levels and asthma risk in the cohorts described below. Informed consent was obtained from all participants and the study protocols were approved by the appropriate Ethics Committees.
Association with IL-6R protein levels: discovery cohort
To find new variants in the IL6R gene that affect sIL-6R levels, we first studied 360 unrelated asthmatic subjects from the QIMR 1995–1998 Asthma and Allergy study, which is described in detail elsewhere.28 Briefly, 3073 subjects were recruited from 802 families that were registered on the Australian Twin Registry and had at least one twin who previously reported a history of wheezing in studies conducted at QIMR and by collaborators elsewhere in Australia. Participants completed a questionnaire that was designed to validate the diagnosis of asthma and to obtain data on respiratory symptoms, environmental exposures and family history of asthma. In addition, participants underwent clinical testing, including lung function and skin prick tests. For the present study, we selected 360 unrelated individuals with available DNA and a serum sample, and who answered ‘Yes, told to me by a doctor’ to the question ‘Have you ever had asthma?’. Demographics and clinical characteristics for this discovery sample are summarized in Supplementary Table 1. These individuals were included in a recent genome-wide association study (GWAS) of asthma.17
Association with IL-6R protein levels: replication cohort
To confirm associations between SNPs and IL-6R protein levels observed in the discovery sample, we studied an additional sample of 354 unrelated asthmatics recruited through the QIMR Asthma Genetics Study between 2011 and 2013. In this study, individuals with a self-reported doctor diagnosis of asthma were recruited from the general community through media appeals (newspapers, radio, television and social media) to participate in a study of genetic risk factors for asthma. Participation included completing a brief online questionnaire about asthma symptoms, medication and severity, and the provision of a blood sample for genetic and immune function testing. Blood was collected at a local pathology laboratory and shipped to QIMR by overnight courier. Demographics and clinical characteristics for this replication sample are summarized in Supplementary Table 4.
Association with IL6R mRNA levels
To provide some insight into the molecular mechanisms underlying SNP–protein associations, SNPs confirmed to associate with sIL-6R protein levels were then tested for association with mRNA levels measured in 851 individuals from 262 families who participated in a genome-wide gene expression study reported recently.29 Briefly, adolescent MZ and DZ twins, their siblings, and their parents have been recruited over a 21-year period into an ongoing study of the genetic and environmental factors influencing pigmented nevi and the associated risk of developing skin cancer and cognition (Twin Moles study).30 Results obtained in this cohort were confirmed by analysing expression levels from an additional 5191 individuals who participated in two large-scale longitudinal studies: the Netherlands Twin Register (NTR; N=3170) and the Netherlands Study of Depression and Anxiety (NESDA; N=2021); henceforth we refer to this as the NTR–NESDA expression quantitative trait locus (eQTL) study. The individual NTR and NESDA cohorts are described in detail elsewhere.31, 32
Association with asthma risk
SNPs associated with sIL-6R protein levels were also tested for association with asthma risk in 21 039 unrelated individuals of European ancestry who participated in two independent studies. First, 5967 individuals who participated in the Australian asthma GWAS,17 including 2110 doctor-diagnosed asthmatics and 3857 controls. Sample ascertainment and patient characteristics for this study are described in detail in Ferreira et al.17 Second, 15 072 customers of 23andMe, Inc., a personal genetics company, who had been genotyped as part of the 23andMe Personal Genome Service. We selected 4230 cases and 10 842 controls of European ancestry who had taken a survey about asthma. Cases (mean age 48, 54% female, 55% with onset of asthma at age ≤16 years) gave positive responses to the question ‘Have you ever been diagnosed by a doctor with asthma or bronchial asthma?’, and also checked a box indicating that they had ever had ‘allergic rhinitis (stuffed or dripping nose caused by allergies)’. Controls (mean age 49, 39% females) gave negative responses to both questions. Association results from these 6340 cases and 14 699 controls were meta-analysed with publicly available results from 10 365 asthmatics and 16 110 controls included in the GABRIEL GWAS,33 for a combined total sample size of 47 514 individuals, including 16 705 cases and 30 809 controls.
sIL-6R serum protein measurements
A 40-ml venous blood sample was collected from all consenting subjects. After spinning 10 ml of blood in a serum tube at 805 g for 10 min, the serum layer was extracted and stored at −80 °C until analysis. ELISA kits (R&D Systems, Minneapolis, MN, USA) were used to measure sIL-6R levels according to the manufacturer’s procedures. The optical density was determined using a BioTek PowerWave XS2 microplate spectrophotometer (BioTek Instruments, Inc., Winooski, VT, USA) at both 450 and 540 nm wavelengths. Results and standard curves were acquired with BioTek Gen5 2.0 Data Analysis Software. sIL-6R levels were measured in duplicate for each sample.
IL6R mRNA measurements
Whole blood for expression profiling was collected from 851 individuals from the Twin Moles study and processed as described in detail by Powell et al.29 Briefly, total RNA was extracted from PAXgene tubes using the QIAGEN whole blood gene RNA purification kit (QIAGEN, Valencia, CA, USA). RNA quality and concentration was estimated with Agilent Bioanalyzer (Agilent, Santa Clara, CA, USA) and subsequently converted to complementary DNA, amplified and purified using the Ambion Illumina TotalPrep RNA Amplification Kit (Ambion, Austin, TX, USA). Expression profiles were generated by hybridizing 750 ng of cRNA to Illumina HumanHT-12 v4.0 Beadchip according to Illumina whole-genome gene expression direct hybridization assay guide. Beadchips were then washed and stained and subsequently scanned to obtain fluorescence intensities. Samples were scanned using an Illumina Bead Array Reader. Samples were randomized across chips and chip positions, with check for balance across families, sex and generation. For this study, we restricted our analysis to two probes targeting the IL6R gene, specifically ILMN_1754753 on the 3′-end and ILMN_1696394 on exon 9 (Supplementary Table 6).
Gene expression association results were confirmed by analysing a further 5191 individuals from the NTR–NESDA eQTL study. In this study, venous blood samples were drawn in the morning after an overnight fast; heparinized whole blood was transferred within 20 min of sampling into PAXgene tubes and stored at −20 °C. RNA extraction and analysis was performed at the Rutgers University Cell and DNA Repository (Piscataway, NJ, USA). RNA was extracted using QIAGEN Universal liquid handling system (PAXgene extraction kits following the manufacturer’s protocol). Total RNA was measured by spectroscopy (Trinean DropSense, Gentbrugge, Belgium) to determine purity and concentration, whereas RNA fidelity was measured by the Agilent Bioanalyzer analysis. RNA samples were hybridized to Affymetrix (Santa Clara, CA, USA) U219 array plates (GeneTitan) that contain 5 30 467 probes, each 25 bases in length. Array hybridization, washing, staining and scanning were carried out in an Affymetrix GeneTitan System following the manufacturer’s protocol. Nonuniquely mapping probes (hg19) and probes containing a polymorphic SNP based on snp137 (UCSC) were removed. Expression values were obtained using RMA normalization implemented in Affymetrix Power Tools (APT, v 1.12.0). A total of 32 probes targeting IL6R were selected for analysis (Supplementary Table 5).
IL6R SNP genotyping
For association with sIL-6R protein levels: discovery cohort
Genome-wide SNP data for the 360 individuals were obtained using Illumina 610-Quad Beadchip as part of the Australian Asthma Genetics Consortium GWAS, which is described in detail elsewhere.17 For this study, we selected 254 post quality control (minor allele frequency >1%, call rate >95%, Hardy–Weinberg equilibrium P>10−6) SNPs in or within 1000 kb of IL6R, including rs4129267. To expand the genetic coverage of IL6R, these 254 directly genotyped SNPs were then used to impute 530 common variants (minor allele frequency >=1%) from the 1000 Genomes Project (March 2012 release, 1092 samples of all ancestries) using Impute2.34 After quality control (minor allele frequency >1%, imputation information score >0.5), there were 471 SNPs located in or within 50 kb of IL6R available for analysis, of which 19 had been directly genotyped.
For association with sIL-6R protein levels: replication cohort
Genotypes for two SNPs (rs4129267 and rs12083537) were obtained for the 354 individuals with Sequenom MassARRAY iPLEX platform; 12.5 ng of DNA isolated from buffy coats with a salt precipitation method were used per assay. SNP sequences were downloaded from the National Centre for Biotechnology Information (http://www.ncbi.nlm.nih.gov/) and cross-checked. Design of the PCR and iPLEX Extension assays was done using the Sequenom Design Suite (https://mysequenom.com/). The assays were then manually adjusted to increase the level of multiplexing. SNPs were typed using iPLEXTM Gold chemistry and analysed using a Sequenom MassARRAY Compact Mass Spectrometer (Sequenom Inc., San Diego, CA, USA). The PCR, SAP and iPLEX reactions were performed using half the volume of reagents recommended in the manufacturer’s specifications. The post-PCR products were spotted on a Sequenom SpectroChip 2, and the data were processed and analysed using Sequenom MassARRAY TYPER 4.0 software. The primers used to amplify both SNPs are included in Supplementary Table 7.
For association with IL6R mRNA levels
DNA samples from the Twin Moles study (N=851) were genotyped by the Scientific Services Division at deCODE Genetics (Reykjavik, Iceland), using the Illumina 610-Quad Beadchip. Genotypes were called with the Illumina BeadStudio software. Full details of genotyping procedures are given in Medland et al.27 DNA extraction for the NTR–NESDA eQTL study (N=5191) has been described before.31 Genotyping was performed on multiple SNP array platforms, including Perlegen 5.0, Illumina 370, Illumina 660, Illumina Omni Express 1M and Affymetrix 6.0. Standard preimputation quality control filters were applied first within and then between chip platforms; subsequently, data were merged into a single data set. Genome-wide SNP imputation was done with Impute2,34 using the 1000 Genomes phase I Interim Jun 2011 release. This included the rs12083537 variant analysed in this study, which was imputed with high confidence (information score >0.9).
For association with asthma risk
We accessed individual-level genotype data for a single SNP (rs12083537) from the Australian GWAS (N=5967) and the 23andMe cohort (N=15 072). This SNP was imputed in the Australian GWAS with high confidence (information score 0.97), using Impute2 and genotype data from the combined 1000 Genomes (60 individuals with northern and western European ancestry from the Centre d’Etude du Polymorphisme Humain collection (CEU), March 2010 release) and HapMap 3 (955 individuals from 11 populations, February 2009 release) projects as the reference panel. In the 23andMe cohort, rs12083537 was imputed with high confidence (r2=0.95) using the August 2010 release of the 1000 Genomes reference haplotypes.
Quality control of sIL-6R ELISA data
Each sample was measured in duplicate (two assays), with up to 80 serum samples included per ELISA plate. No outlier observations (located six s.d. from the mean) were observed for either assay. The same procedure was used for the discovery and replication cohorts to estimate and remove the effects of both technical and biological effects on sIL-6R levels. Briefly, we (1) used linear regression to adjust for assay and plate effects; (2) normalized the distribution using a rank-based inverse-normal transformation; and (3) tested and, if significant (P<0.05), adjusted for the effects of age and sex. Subsequent association analyses were carried out using the residuals obtained from (3).
Individual SNPs in IL6R, directly genotyped or imputed, were tested for association with sIL-6R protein levels (discovery and replication cohorts), mRNA levels and asthma risk. In all cases, we used additive allelic tests of association; for imputed SNPs, we used the inferred allelic dosage.
The association between the 471 SNPs in IL6R and protein levels was tested using linear regression in R. Association analyses were performed before and after adjusting sIL-6R levels for the effects of rs4129267, a SNP highly correlated with the missense variant rs2228145 (r2=0.99), the main genetic determinant of sIL-6R protein serum levels.22 To adjust for the effects of rs4129267, we (1) performed a linear regression of sIL-6R levels (dependent variable) on rs4129267 allelic dosage (independent variable); (2) extracted the residuals from this analysis, which represent rs4129267-adjusted sIL-6R levels; and (3) performed a linear regression of rs4129267-adjusted sIL-6R levels (dependent variable) on the allelic dosage of individual SNPs (independent variable). To correct the observed asymptotic association P-value (Pobs) of a given SNP for the number of (and correlation between) SNPs tested, we (1) permuted rs4129267-adjusted sIL-6R levels between the 360 asthmatics; (2) tested the permuted sIL-6R levels for association individually with each of the 471 SNPs and recorded the asymptotic P-value for the most significantly associated SNP (Pmin); (3) repeated steps (1) and (2) 1 00 000 times; and (4) calculated for each SNP the empirical association P-value corrected for the number of SNPs tested as the proportion of 1 00 000 replicates for which PminPobs.
Association with mRNA levels in the Twin Moles study was tested using MACH2QTL,35, 36 which takes into account the family structure of the data. In the NTR–NESDA eQTL study, association with mRNA levels was tested using a linear mixed model in R to correct for family structure (as random effects) as well as sex, age, body mass index, smoking status, technical covariates and ancestry (as fixed effects).
Lastly, association with asthma risk was tested using logistic regression in R, including covariates to adjust for sample origin in the AAGC study and, in the 23andMe study, for age, sex and ancestry. Association analyses conditional on rs4129267 were performed in both studies by including this SNP (coded as allelic dosage: 0, 1 or 2) as a covariate in the logistic regression model. Only GWAS summary statistics were available for the GABRIEL study; as such, for this study, the conditional analysis was performed using the approach of Yang et al.21 implemented in Genome-wide Complex Trait Analysis (GCTA). Fixed-effects meta-analysis of results from the Australian GWAS, 23andMe and the publicly available GABRIEL GWAS was performed with METAL.37
We thank all the participants of the Asthma and Twin moles studies: Ann Eldridge, Marlene Grace, Kerrie McAloney (sample collection); Melinda Richter, Lisa Bowdler, Steven Crooks (DNA processing); David Smyth, Harry Beeby, Daniel Park (IT support). Funding was provided by the Australian National Health and Medical Research Council (NHMRC, 613627). The NTR/NESDA data described in this paper were funded by the US National Institute of Mental Health (RC2 MH089951, PI Sullivan) as part of the American Recovery and Reinvestment Act of 2009. MARF is supported by a Career Development Fellowship from the NHMRC.
About this article
Supplementary Information accompanies this paper on Genes and Immunity website (http://www.nature.com/gene)
Influence of IL6R gene polymorphisms in the effectiveness to treatment with tocilizumab in rheumatoid arthritis
The Pharmacogenomics Journal (2018)