Introduction

Rheumatoid arthritis (RA) is a chronic autoimmune disease occurring in approximately 1% of adults.1 It is an inflammatory synovitis that can lead to joint destruction and physical disability. The presence of autoantibodies (rheumatoid factor (RF) or anticyclic citrullinated peptide), termed seropositivity, occurs in three-quarters of patients.2 RA has a complex genetic etiology,3 the strongest known genetic risk factor being HLA-DRB1, and more specifically a group of alleles referred to as the shared epitope,4 while a further 31 risk loci have also been confirmed in seropositive disease.5

Disease-modifying antirheumatic drugs (DMARDs) such as methotrexate are used to treat RA. Second-line therapies include biological agents such as an anti-tumor necrosis factor (anti-TNF) agent, the B-cell depleting agent rituximab, or tocilizumab, a humanized monoclonal antibody that targets the IL-6 receptor. Criteria from the American College of Rheumatology (ACR) define 20, 50 and 70% improvements in disease activity (ACR20, 50 and 70). Response rates to biological therapies are typically 50–60% for ACR20, 20–40% for ACR50 and 10–20% for ACR70.6 There is considerable heterogeneity of response to a given therapy. Genetic biomarkers might identify which patients are likely to respond to a particular therapy.

Genetic biomarker discovery for response to anti-TNF therapy and rituximab has focused on candidate-based genetic studies6 of known or hypothetical susceptibility loci or target-related genes. These studies have not yielded a validated biomarker predictive of response to any of these therapies. Using a genome-wide association study (GWAS) approach, we sought to identify genetic polymorphisms that might predict response to tocilizumab and replicate our findings in an independent cohort.

Materials and methods

Subjects

Individuals (N=1683) providing DNA used in this study took part in one of five pivotal phase III studies: RADIATE7 (n=178); OPTION8 (n=273); TOWARD9 (n=459); AMBITION10 (n=247); LITHE11 (n=469) or the translational study MEASURE12 (n=57) (Table 1). These trials evaluated the efficacy and safety of tocilizumab (4 or 8 mg kg-1), administered every 4 weeks over at least a 24-week period compared with DMARD therapy. Methotrexate was the most common DMARD in these trials. The study populations differed according to background therapy with methotrexate or DMARDs (OPTION,8 TOWARD,9 LITHE,11 MEASURE12), previous inadequate response to anti-TNF agents (RADIATE7), tocilizumab monotherapy in patients with no background DMARD therapy or history of methotrexate inadequate response (AMBITION10). DNA samples were collected only from patients who gave separate informed consent to analyses designed to facilitate the study of genetic contributions to specific efficacy and safety responses to tocilizumab. For the discovery of novel variants in the genic regions of SPTLC3 (serine palmitoyltransferase, long-chain base subunit 3) and MYO18B, 194 patients (98 European League Against Rheumatism good responders and 96 European League Against Rheumatism non-responders), matched for age and sex, were selected from Caucasian RA patients treated with tocilizumab (subset 1).

Table 1 Demographic and baseline characteristics of patients

Genotyping, DNA sequencing and quality control

DNA samples were genotyped using the Illumina Bead-Chip arrays (Illumina, San Diego, CA, USA) as described previously.13 RADIATE,7 OPTION8 and TOWARD9 were genotyped with the HumanHap550K version 3.0, AMBITION10 with Human1M-Duo version 3.0, LITHE11 and MEASURE12 with HumanOmni1-Quad and sequencing-derived markers with the Illumina custom GoldenGate genotyping assay. Sample quality control (QC) was conducted before genotyping, and data QC was conducted after genotyping (Supplementary Material). Assays with either >5% missing data or with a minor allele frequency <1% were excluded from the analysis. χ2 Tests of Hardy–Weinberg equilibrium were conducted in white subjects; the results were used, along with estimates of minor allele frequency, to assist in the interpretation of associations.

A sequencing approach was used to discover novel variants in two loci of particular interest identified by the Initial association analysis, SPTLC3 and MYO18B (Supplementary Material). Genotyping was performed for 1100 variants in SPTLC3 and 1333 variants in MYO18B.

Statistical analysis, principal component analysis and genotype imputation

Principal component analysis was conducted to enable correction for population stratification in the GWAS, using Eigensoft v.3.0.14, 15 Details are provided in Supplementary Material.

Phenotypic outliers were assessed iteratively by calculating the Bonferroni-adjusted P-value for the largest absolute studentized deleted residual, using the ‘car’ package in R v.2.7.2;16, 17, 18 outlying subjects were removed from analyses as appropriate. Single-point association tests were applied in white subjects and separately in subjects of all ethnicities using PLINK v.1.06.19 Linear or logistic regression modeling of the clinical end point was applied, assuming additive effects for the single-nucleotide polymorphisms (SNPs), and including other co-variates (baseline end point value, dose, ethnicity and study). In association analysis of subjects of all ethnicities, the ethnicity co-variate was represented by the first five principal components from principal component analysis. In association analysis of white subjects, the ethnicity co-variate was represented by indicators of geographic region (Western Europe; North America; South America; Rest of World). In the first hypothesis generating round of analysis, only markers from HumanHap550K v.3.0 were considered. A small number of missing genotypes were imputed using Mendel v.9.0.0,20, 21 without external reference data, in preparation for LASSO (least absolute shrinkage and selection operator) penalized regression.22 Haplotype frequencies were estimated along a sliding genomic window using a penalized likelihood approach, and the most probable SNP genotype was assigned, based on these estimates. It is noted that if the imputation step had not been included, missing autosomal genotypes would have been replaced by heterozygote calls in Mendel LASSO analysis. LASSO was applied because our univariate analysis might lack power for the discovery of polymorphisms involved in epistatic effects. As a variable selection method, which may be used when the dimensions of the data are greater than the sample size, LASSO penalized regression has been previously applied to several GWAS.23, 24 The penalty factor was adjusted so that 10 SNPs were selected in each analysis.

In the second, confirmatory round of analysis, some SNPs to be confirmed had missing genotypes owing to the different assay platform used. Missing genotypes were imputed using IMPUTE25, 26 before single-point analysis; LASSO analysis was not conducted. In the pooled exploratory analysis, ancestry analysis was repeated as described above for all available subjects and separately for white subjects. In both cases, ProbABEL27 was used for co-variate-adjusted association testing, to account for uncertainty in imputation.

Study design

Studies were genotyped and analyzed in the order they became available. First, RADIATE,7 OPTION,8 TOWARD9 and AMBITION10 (ROTA) were used to identify initial associations with six efficacy end points. We chose a continuous clinical end point, change in Disease Activity Score (cDAS28), as a primary end point, as it measures treatment response accounting for the baseline disease activity, and is more sensitive to small effects compared with dichotomous end points (the addition of the prefix ‘c’ to an abbreviation for a clinical end point denotes ‘change in’). As DAS28 is a complex end point, it was considered plausible that a genetic contribution to DAS28 may be more readily detected through association with a particular component of DAS28 or other important measures of disease impact, and thus the following end points were also examined: changes in swollen joint count (cSJC), tender joint count (cTJC), health assessment questionnaire score (cHAQ) and C-reactive protein (cCRP). As these end points, particularly the components of DAS, are not independent, the multiple testing burden is not as great compared to an assessment of multiple, independent outcomes. As week 16 is the last data time point before escape therapy, it was used to maximize subject numbers, with the exception of cCRP, where week 8 was used as a steady state has been reached by this time point. ACR20 at week 24 was also examined as it was the primary end point in all studies. Candidate SNPs were selected as meeting at least one of the following criteria: (1) P<10−5 in white subjects (White) as they were the largest ethnic group with recognized low ethnic confounding; (2) P<10−4 in White and a lower P-value in all ethnicities (All); or (3) selected by LASSO analysis of White or All. These non-conservative thresholds are above a genome-wide significance level of P10−7 and reflect a greater emphasis on power than type I error for the purpose of hypotheses generation (Supplementary Material).

Two loci of particular interest from this initial association analysis were also sequenced in a subset of patients; novel variants were then genotyped in ROTA.7, 8, 9, 10 Association candidates were selected using criteria 1 and 2 above.

Association candidates from analysis of ROTA7, 8, 9, 10 underwent confirmatory genotyping in an independent cohort consisting of LITHE11 and MEASURE12 (LM). For those candidates not directly genotyped owing to the change in assay platform, imputed genotype probabilities were used. If imputed data were not available, the proxy SNP with the highest linkage disequilibrium r2 that passed QC was utilized. Markers in either White or All subjects with P<0.05 against the same end point as the original association or P<0.05 against the primary end point cDAS in both LM11, 12 and ROTA,7, 8, 9, 10 with the same directionality, were considered to have achieved confirmation. Estimations of effect sizes are presented for DMARD inadequate responder (IR) patients represented from LITHE,11 OPTION8 and TOWARD9 (LOT), anti-TNF IR patients from RADIATE7 and methotrexate-naïve/free patients from AMBITION.10 Replicated SNPs were also examined for predictive effect size on DAS28 remission (DAS28 <2.6) in singular, pair-wise and triple combinations in both additive and multiplicative models. Predictive vs prognostic property was investigated via comparison of effect on tocilizumab- and placebo-treated patients.

To utilize the greater sample size afforded through combination of all studies, an additional analysis explored the association of cDAS28, cTJC, cSJC and cHAQ with the union of imputed genotypes from ROTA7, 8, 9, 10 and LITHE11 (ROTAL). To investigate the influence of RF on the predictive or prognostic nature of candidate SNPs, MEASURE12 was excluded from this analysis as RF data were not collected in this study.

Results

Initial association analysis

In ROTA,7, 8, 9, 10 42 patients failed genotyping QC, leaving 1157 individuals, of whom 706 were treated with tocilizumab. Of the 534 053 markers analyzed, 207 markers were identified as having 253 significant associations in tocilizumab-treated patients as defined by our selection criteria (Supplementary Table 2). Of these 207 markers, 88 were identified only through LASSO analysis. Four SNPs that met a genome-wide significance threshold of P10−7 and a summary of univariate associations generated per end point are provided in Table 2. HLA-DRB1 (shared epitope) was also genotyped in ROTA,7, 8, 9, 10 but no association with tocilizumab response was found (Supplementary Table 3).

Table 2 Numbers of markers identified as significantly associated with efficacy end points

Two SNP markers were highlighted as of particular interest. rs6078937, an intronic SNP in SPTLC3, was the only marker highlighted by all four analysis criteria for any end point (cDAS: P=4.43 × 10−6 in White, P=1.73 × 10−6 in All and selected by LASSO in both White and All). Another SNP, rs6004913, was the only marker highlighted for four different end points: in the White population, cDAS28: P=3.94 × 10−6, cTJC: P=9.71 × 10−6 (plus 3 flanking SNPs) and cHAQ: P=1.50 × 10−6 (plus 1 flanking SNP); and in the All population, ACR20: P=4.36 × 10−5 (plus 4 flanking SNPs). The variant rs6004913 is in linkage disequilibrium with rs2236006 with r2=0.62, a non-synonymous coding change in MYO18B (MYOSIN XVIIIB). To identify potential causal variants, we re-sequenced these loci in a 194 subject responder/non-responder subset (subset 1) revealing 1100 and 1333 unique variants in SPTLC3 and MYO18B, respectively. These variants were then genotyped in ROTA,7, 8, 9, 10 with 241 SPTLC3 and 167 MYO18B markers having a minor allele frequency 0.05. Of these 408 variants, five preliminary associations were identified (Table 2C).

Confirmatory analysis

In all, 253 initial associations together with five sequencing variant associations were tested in an independent group of subjects, LM.11, 12 Four LM11, 12 patients failed genotyping QC, leaving 526 individuals, of whom 338 were treated with tocilizumab. In total, 371 subjects were defined as White as described in Figure 1, of whom 234 were treated with tocilizumab. Of 208 SNPs, 127 were directly genotyped, while 79 SNPs were imputed and two proxies were used. The White tocilizumab-treated population yields 80% power to confirm preliminary ROTA7, 8, 9, 10 associations, if the variant has a minor allele frequency of at least 0.2, expresses an additive effect of at least 0.35 units of change in DAS28, using the significance threshold of 0.05 (Supplementary Table 1).

Figure 1
figure 1

Biplot of the first two principal components, utilizing 526 LITHE and MEASURE (LM)11, 12 subjects and 988 founder subjects. Five LM11, 12 subjects, self-reporting as White and non-Hispanic and represented by green crosses, fell outside the main cluster of Caucasian subjects. Furthermore, 59 subjects, self-reporting as Hispanic and represented by red triangles, fell within the main Caucasian cluster. For the purposes of analysis, a threshold of −0.02 was placed on the second principal component; subjects falling below the threshold were considered to form a homogeneous group and were incorporated into White.

PowerPoint slide

Seven markers, five directly genotyped and two imputed, achieved confirmation at P<0.05 for the same end point with which they were identified in ROTA7, 8, 9, 10 (Table 3). Three of these were also confirmed against cDAS28, while a fourth marker, rs7055107, represented by a proxy, was confirmed against cDAS28 in LM.11, 12 Interestingly, all of these eight markers originated from LASSO regression; no markers generated from sequencing were confirmed. The observed association with efficacy could potentially be characterized as predictive, that is, associated specifically with response to tocilizumab, and not to placebo plus DMARD background therapy. Alternatively, the associations could be prognostic, that is, associated with a change in disease activity through a spontaneous disease fluctuation or general treatment effect, or may be dependent on dose of tocilizumab, or specific to treatment line. To address these possibilities, we analyzed the predictive nature of these polymorphisms on cDAS28 in various genetic models (Figure 2 and Supplementary Figure 1) and found them to be largely predictive. As a major aim in the treatment of RA is to attain remission, we also analyzed the predictive nature of these polymorphisms in terms of DAS28 <2.6 both singularly and in combination (Table 4).28

Table 3 Markers achieving confirmation in LM11, 12 either (A) to the end point with which they were associated in ROTA7, 8, 9, 10 or (B) to cDAS28, either genotyped directly (direct), genotype imputed (Impute) or proxy SNP utilized (proxy)
Figure 2
figure 2

Effect of single-nucleotide polymorphism (SNP) variants (a) rs11052877 (CD69), (b) rs4910008 (GALNTL4) on change in Disease Activity Score (cDAS28) in response to tocilizumab in All. Biomarker-positive/negative patients are defined as those carrying the major or minor allele and genetic model as indicated in each forest plot. The 95% confidence intervals are indicated for the point estimate of the difference in mean cDAS28 between biomarker-positive and -negative patients. Values are presented for all subjects and separated by treatment line, dose and drug. Interpretation of effect size estimations for patients previously exposed to anti-tumor necrosis factor (anti-TNF) inadequate responder (IR) (RADIATE7) or methotrexate naïve (AMBITION10) are confounded by the low number of patients in this cohort leading to broad confidence intervals, but are presented for completeness. BM+, biomarker positive, BM−, biomarker negative.

PowerPoint slide

Table 4 DAS28 remission rates at week 16 for DMARD IR (LOT) patients stratified by SNPs achieving confirmation in LM

Pooled exploratory analysis

ROTAL7, 8, 9, 10, 11 consisted of 1626 individuals passing genotyping QC (All). Of 1091 patients treated with tocilizumab, 791 were RF positive. Principal component analysis was used to identify a 753-subject tocilizumab-treated Caucasian subgroup (White), of whom 573 were RF positive. Principal components 1–5, as well as dose, study and baseline measure of the corresponding end point, were included in the corresponding regression model as co-variates. A total of 1 470 186 markers were tested for single-point association with response in RF-positive and -negative groups for both All and White populations, 42 markers passing significance levels of P<10−6 with a frequency >1% (Supplementary Table 5). Three markers met genome-wide significance at P<10−7 and frequency >1%; all were in the White RF-negative group (Supplementary Table 6A). However, as they were identified in the smallest subgroup (n=179), the numbers of patients carrying the minor alleles (n=46, 24 and 6, respectively) were considered too small to be of further interest at this stage. Interestingly, 13 markers in linkage disequilibrium with ENOX1, a locus highlighted by rs9594987 confirmation in LM11, 12 (Table 3), were associated with cHAQ in the RF-positive All population, seven at P<10−4 and six at P<10−6 (Supplementary Table 6B). Associations were weaker in All RF-negative (P>0.05), White RF-positive (P>10−4) and White RF-negative (P>0.01) populations.

Of seven markers achieving confirmation in LM11, 12 (Table 3), three demonstrated a differential association to response in ROTAL7, 8, 9, 10, 11 according to seropositivity. (rs7055107 was not considered as X-chromosome markers were not imputed.) rs4910008 is associated more strongly with cTJC, cSJC and cDAS28, rs9594987 with cHAQ and rs703505 with cHAQ, cTJC and cDAS28, all in RF-positive patients (Supplementary Table 6C). RA risk loci5 were also examined in seropositive white patients (Supplementary Table 4). All observed associations were weak and would not surpass an adjustment for multiple testing, although rs3093023, from the CCR6 locus, did associate at P<0.05 to cDAS, cSJC and cTJC.

Discussion

We conducted the first genetic analysis of response to tocilizumab, revealing putative associations with eight loci (Table 3, Figure 2 and Supplementary Figure 1), none of which have been previously implicated as risk alleles for RA or associated with response to any other therapy.5 None of these are obviously linked to the IL-6 pathway, and there is no association of shared epitope with tocilizumab response. Several of these associations were stronger in seropositive patients (Supplementary Table 6C), in part because the majority of patients are seropositive.

There is a clear difference in DAS28 remission rates between candidate genetic biomarker-positive and -negative patients (Table 4). However, the differences are modest, capturing an insufficient portion (<2%) of heterogeneity in response to be clinically useful.

These polymorphisms may help to reveal insights into the mechanism of action of tocilizumab in RA. Of the polymorphisms confirmed in the LM data set,11, 12 SNPs from two genes encoding C-lectins, that is, part of the NK gene complex on chromosome 12,29 were highlighted. rs1560011 is an intronic SNP in CLEC2D, and rs11052877 is located in the 3′-untranslated region of CD69 (CLEC2C). Figure 2 demonstrates tocilizumab-treated patients carrying the major allele of either polymorphism have a better DAS28 response in a recessive model; this is not observed with placebo. Without functional analysis we can only hypothesize how variation within CLEC2D and CD69 might influence response. The receptor, expressed on NK cells, T cells, activated B cells and dendritic cells, induces NK cells to produce interferon-γ.30 CLEC2D blocks osteoclast formation.31, 32 The ligand of CLEC2D, KLRB1 (CD161), identifies IL-17-producing T-cell subsets.33 CD69 is expressed on a number of hematopoietic cells, and crosslinking CD69 induces proliferation, IL-2 and interferon-γ release and NO production.34

The biological significance of genes represented by the other markers is not obvious, but these polymorphisms identified may provide clues to the molecular basis for tocilizumab response in RA. With a pooled exploratory analysis utilizing the ROTAL data set,7, 8, 9, 10, 11 we have identified a further number of polymorphisms putatively associated with response. Three of these polymorphisms met genome-wide significance thresholds but were present in a small number of patients and so will require further validation.

Only analyses of efficacy end points were reported in this manuscript. Using a candidate gene approach, bilirubin elevation was previously shown to be strongly associated with genetic variations within UGT1A1.13 For other safety end points, such as neutropenia, genetic analysis was not undertaken owing to the very low event rate.

Our analyses illustrate the complexity of performing genome-wide association scan in clinical trials. Despite having over 1600 patients from six randomized, controlled clinical studies and rigorous methodology, only putative loci of weak associations with efficacy were identified. This indicates that treatment response in RA is likely to be complex, even for a targeted biologic. It is possible that association might be stronger in more homogeneous subgroups of patients (such as methotrexate-naïve RA treated with tocilizumab as monotherapy10), or by comparing only the extremely good with the extremely poor responders. However, our GWAS has limited utility in these contexts owing to small subgroup sizes (see Supplementary Material).

There are several limitations with our study. A two-stage analysis was performed. In the SNP discovery stage, we lowered the threshold below genome-wide significance and analyzed multiple efficacy outcome measures: DAS28 and core components of disease activity and response (SJC, TJC, HAQ and CRP).35 We combined selection criteria based on univariate and multivariate approaches, to increase our chances of identifying polymorphisms that contribute to any epistatic effect on overall DAS28 response. Such a multi-pronged strategy increases type I error, but interestingly, the majority of SNPs confirmed in LM11, 12 were revealed by LASSO analysis. In the confirmatory stage, the sample size of the cohort (LM) was relatively small, thus limiting power (Supplementary Table 1), and a non-conservative threshold (P=0.05) was used. Our aim to identify predictive biomarkers specific for tocilizumab response was challenged by the lack of an active comparator drug within the available studies (the comparator arms were generally on matched background therapy with a placebo). Finally, there was no independent cohort in which to confirm the combinations of SNPs selected to predict DAS28 remission rates. Taken together, these limitations suggest that the eight reported associations require confirmation in an independent cohort before they can be considered validated. Our findings demonstrate the need for very large, adequately powered cohorts to conduct robust pharmacogenetic response analyses in a heterogeneous autoimmune disease such as RA.