Introduction

Rheumatoid arthritis (RA) is a chronic inflammatory autoimmune disorder affecting synovial joints that often leads to involvement of other organ systems and to increased mortality. It is among the most common autoimmune disorders, affecting 1% of the population worldwide1. RA is treated in part with disease-modifying anti-rheumatic drugs, including those that block the inflammatory cytokine tumour necrosis factor-α (anti-TNF therapy). While anti-TNF treatment is effective in reducing disease progression, response is variable, with nearly one-third of RA patients failing to enter clinical remission2,3,4. No substantive methodology exists to identify anti-TNF non-responders a priori5.

Technological advances in DNA genotyping and sequencing have afforded the opportunity to assess the contribution of genetic variation to heterogeneity in response to anti-TNF therapy. Evidence from association analyses6,7 and theoretical heritability estimates suggested that algorithms focusing on genetic variation may be predictive of non-response. Genetic biomarkers offer a compelling opportunity to perform simple tests with high potential impact on clinical care. Although genetic information has not been found to provide clinically relevant predictions in many cases8,9,10, the high potential impact of successful genetic biomarkers, and their potential to provide biological insights, continues to inspire research in many fields, including anti-TNF response.

To this end, we perform a community-based empirical assessment of the contribution of common single-nucleotide polymorphism (SNP) data to predictions of anti-TNF treatment response in RA patients, to formally assess their utility for clinical application. Using the most comprehensive data set currently available, which we demonstrate is suitably powered to develop clinically actionable predictors, we draw on the expertise of hundreds of researchers worldwide, who collectively submit thousands of models predicting anti-TNF response. While the researchers are able to build predictive models that perform significantly better than random, formal evaluation of the best-performing models shows that common SNP variants do not meaningfully contribute to model performance within this study.

Results

Study design and challenge parameters

This study was performed as an open analytical challenge using the DREAM framework11,12,13,14 (DREAM Challenges website; www.dreamchallenges.org) as a mechanism to test predictions developed across a variety of state-of-the-art methodologies. In this manner, we were able to evaluate the accuracy of predictive models developed by dozens of research groups across a wide spectrum of modelling approaches. Challenge participants were provided with SNP data collected on 2,706 anti-TNF-treated RA patients6 (Supplementary Table 1) with which to develop predictive models of disease-modulating treatment response where treatment efficacy was measured using (a) the absolute change in disease activity score in 28 joints15 (DAS28) following 3–6 months of anti-TNF treatment and (b) categorical non-response as defined by EULAR-response criteria16. EULAR response is calculated based on the pre- and post-treatment DAS28 and is widely used in clinical research and practice. Models were evaluated based on the predictive accuracy in a held-out test data set containing 591 anti-TNF-treated RA patients from a separate cohort (Supplementary Table 1). This represents the most comprehensive set of data available to address this question.

Statistical assessment of study power

The feasibility of developing SNP-based predictive models given this collection of data was determined in three steps. First, the genetic contribution to overall variance in treatment efficacy was estimated. Significant SNP-heritability estimates were identified via variance component modelling17,18 of common SNPs (minor allele frequency (MAF)≥0.01) within the primary cohort consisting of 2,706 patients from 13 studies6 (SNP-h2=0.18, P value=0.02, Table 1). Heritability estimates were strongest in the subset of patients treated with anti-TNF monoclonal antibodies (MABs) relative to those treated with the soluble receptor fusion protein etanercept (Table 1). These heritability estimates are similar to those reported for other treatment response traits19 and of sufficient effect size to consider the use of predictive modelling methods to identify polygenic predictors of anti-TNF treatment efficacy20,21.

Second, the proportion of SNP heritability that must be captured by a predictive model to provide a clinically actionable predictor was estimated. Although the definition of an actionable predictor is highly dependent on clinical context, we set a lower bound, as defined by the area under the receiver operating characteristic curve (AUROC), of 0.75. The maximum achievable AUROC was calculated for the primary endpoint of the challenge, categorical non-response based on the EULAR criteria, assuming the SNP heritability estimated above. Given these assumptions, the achievable AUROC was estimated across a set of predictive models as a function of the percent heritability explained by the model22. These estimates indicated that clinically actionable predictors would require predictive models to explain at least 55% of the heritable component of treatment efficacy (Supplementary Table 2).

Third, statistical power to build such predictive models was estimated given these data. Statistical power was defined as the expected percent heritability that can be explained by a given model. This measure was computed over a range of models representing common risk variants (MAF≥0.01). In each case, models were consistent with the estimated SNP-h2 and non-response prevalence rate calculated within these data and assumed a fixed number of loci of equal effect8 (Supplementary Fig. 1a). Because previous analyses of these data did not reveal strongly associated individual loci, and because many teams ultimately limited multiple testing through biologically informed feature selection23 (see Supplementary Table 3), this analysis was performed across a series of significance thresholds, ranging from that appropriate for genome-wide association (5e−8) to that appropriate for testing 100 independent loci (5e−4). These estimates indicated that this study was powered to develop clinically actionable predictive models in the case where the observed SNP heritability was explained by tens of risk loci. In this range, dimensionality reduction through literature or database curation could extract information even when strong, genome-wide significant associations are not observed. More sophisticated simulations using similar sample and disease characteristics suggest that the power estimates presented here may be conservative8. Despite smaller sample sizes, we estimate slightly increased power to build clinically meaningful predictors in the subset of patients treated with anti-TNF MABs.
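For readers who want a concrete picture of the first step, the sketch below illustrates the idea behind SNP-heritability estimation using Haseman–Elston regression on a genomic relationship matrix built from synthetic genotypes. It is a simplified stand-in for the variance-component (REML) modelling actually used17,18; the sample size, marker count and simulated effect sizes are assumptions for illustration only.

```python
# Sketch: Haseman-Elston-style SNP-heritability estimate on synthetic data.
# This is not the study's REML analysis; it only illustrates the concept.
import numpy as np

rng = np.random.default_rng(3)
n, m, h2_true = 1000, 2000, 0.18                       # individuals, SNPs, simulated SNP-h2
maf = rng.uniform(0.05, 0.5, m)
G = rng.binomial(2, maf, size=(n, m)).astype(float)    # genotype dosages under HWE
Z = (G - 2 * maf) / np.sqrt(2 * maf * (1 - maf))       # standardized genotypes
beta = rng.normal(0.0, np.sqrt(h2_true / m), m)        # small, equal-variance SNP effects
y = Z @ beta + rng.normal(0.0, np.sqrt(1.0 - h2_true), n)
y = (y - y.mean()) / y.std()

A = Z @ Z.T / m                                        # genomic relationship matrix (GRM)
iu = np.triu_indices(n, k=1)                           # all unordered pairs of individuals
slope, _ = np.polyfit(A[iu], np.outer(y, y)[iu], 1)    # regress phenotype cross-products on relatedness
print(f"Haseman-Elston SNP-h2 estimate: {slope:.2f} (simulated truth {h2_true}; noisy at this sample size)")
```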
While the number of true underlying loci that contribute to genetic risk for anti-TNF response is unknown, an assumption of tens of loci is supported by observations of small numbers of loci associated with other treatment response traits24,25,26 and has the added advantage of approaching the number of loci that are practical to include on a clinical diagnostic panel. It should also be noted that our power calculations require a fixed level of statistical significance to be achieved for model inclusion of SNP predictors; however, the inclusion requirements for many machine learning approaches are not as strict and, as such, these methods may be better powered23.

Table 1 Heritability estimates within the Primary Cohort.

Open challenge study design

The open challenge was designed to assess genetic contribution to prediction of anti-TNF response in RA patients using whole-genome SNP data derived from anti-TNF-treated RA patients (Fig. 1a, Supplementary Table 1)6,27. The question of anti-TNF treatment response was addressed in two ways. The primary endpoint used in the challenge was the classification of response to anti-TNF therapy as defined by EULAR-response criteria16 (Classification subchallenge). Participants were also invited to directly predict ΔDAS28 as a continuous measure (Quantitative subchallenge). In total, 242 individuals representing 30 countries and 4 continents registered to participate in this challenge. Challenge participants were invited to train models using a data set containing whole-genome SNP data, age, sex, anti-TNF therapy, concomitant methotrexate treatment and baseline DAS28 in a subset of 2,031 individuals (Fig. 1b, Supplementary Table 1 and see the ‘Methods’ section)6. SNP data were provided as imputed (HapMap phase 2) genotype probabilities and dosages, as well as directly assayed variants for participant use.
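As a point of reference for how imputed genotype probabilities relate to the dosages that most teams modelled, the following minimal sketch shows the standard conversion (dosage as the expected count of the coded allele). The array layout and values are illustrative and do not reflect the challenge file formats.

```python
# Sketch: deriving an allelic dosage from imputed genotype probabilities (illustrative values).
import numpy as np

# probs[i, j, :] = P(AA), P(AB), P(BB) for individual i at SNP j
probs = np.array([[[0.90, 0.09, 0.01],
                   [0.10, 0.60, 0.30]],
                  [[0.02, 0.18, 0.80],
                   [0.33, 0.34, 0.33]]])

# Dosage of the B allele = expected allele count = 0*P(AA) + 1*P(AB) + 2*P(BB), in [0, 2]
dosage = probs[..., 1] + 2.0 * probs[..., 2]
print(dosage)        # shape: individuals x SNPs
```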

Figure 1: Challenge schematic.

(a) This analysis was performed in two phases. In the Competitive phase, an open competition was performed to formally evaluate and identify the best models in the world to address this research question. In all, 73 teams representing 242 registered participants joined the challenge. Organizers evaluated model performance for test-set predictions submitted by 17 teams. The 8 best-performing teams were invited to join the Collaborative phase. In this phase, a collectively designed experiment was developed, in which each team independently performed analyses and challenge organizers performed a combined analysis. (b) Two data sets were used in the analysis: the Discovery cohort and the CORRONA CERTAIN study. Participants were provided with 2.5M imputed SNP genotypes and 5 clinical covariates from the two cohorts, along with the response trait for 2,031 individuals in the Discovery cohort (‘Training Set’). At the completion of the 16-week training period, participants were required to submit a final submission containing predictions of response traits in a completely independent data set, the CORRONA CERTAIN study (‘Validation Test Set’).

Participants were provided with a leaderboard giving real-time feedback, which evaluated the performance of their predictions in the remaining 675 individuals. To reduce the potential for overfitting or reverse-engineering of treatment outcomes from the leaderboard, each team was limited to 100 leaderboard submissions. Over the course of the 16-week training period, 73 teams submitted a total of 4,874 predictions for evaluation on the leaderboard data. Upon completion of the training period, teams were allowed up to two final submissions per subchallenge, and final evaluation of algorithms was performed against a separate test data set consisting of data collected from 591 RA patients in the CORRONA CERTAIN27 study. Comparison with an independent, blinded test data set reduced the contribution of overfitting to the training data to the estimated accuracy, as indicated by comparing predictive performance between leaderboard and test data predictions for both the area under the precision-recall curve (AUPR) and the AUROC (Supplementary Fig. 2). Anti-TNF non-response rates differed between the training and test data sets (21.7% and 35.7%, respectively), likely owing to differences in inclusion criteria in the two cohorts, although demographic data were similar between the two (Supplementary Table 1). Similar methods were used for quality control (QC) and imputation of genotypes in both cohorts (see the ‘Methods’ section for details). Participants remained blinded to outcomes from both the leaderboard and test data sets throughout the experiment. Harmonized data from all cohorts are publicly available as a resource for use by the research community (doi:10.7303/syn3280809).

Performance across predictive modelling methodologies

For the classification subchallenge, 27 final submissions were received from 15 teams, and these were scored using both AUROC and AUPR. Overall rank for each submission was determined as the average of the AUROC rank and the AUPR rank among all valid submissions. AUROC and AUPR were interpolated in the case of binary classifications or tied predictions28. Of the 27 submissions, 11 performed significantly better than random for both AUPR and AUROC after Bonferroni correction for multiple submissions. The AUPR of all submissions ranged from 0.345 to 0.510 (null expectation 0.359), and the AUROC ranged from 0.471 to 0.624 (null expectation 0.5). Using bootstrap analysis of submission ranks (Fig. 2a), we determined that the top two submissions performed robustly better than all remaining solutions (Wilcoxon signed-rank test of bootstraps P value=5e−34 and 1e−66, relative to the third-ranked submission, respectively) but were not distinct from one another (P value=0.44). These submissions had AUPRs of 0.5099 and 0.5071 and AUROCs of 0.6152 and 0.6237, respectively. Both of the top-performing submissions were generated using Gaussian Process Regression (GPR)29 models. Team Guanlab selected SNP predictors based on analysis of the training data and previous analyses described in the literature and, using those features, applied a GPR model to predict non-response classification directly. Team SBI_Lab selected SNP predictors based entirely on analysis of the training data, applied a GPR model to predict ΔDAS28, and then converted these predictions into classification weights. The code and provenance for the winning algorithms have been catalogued and made available for reuse through the challenge website (see ‘Team Guanlab’ and ‘Team SBI_Lab’ in the Supplementary Notes for more details).
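The sketch below illustrates, on synthetic data, the general shape of the top-performing approach: fit a Gaussian Process Regression model to a modest set of selected features, predict a continuous response and rank patients for the classification task. It is not the teams' code (which is available through the challenge website); the feature count, kernel and dichotomisation threshold are assumptions for illustration.

```python
# Sketch: GPR-based prediction of a continuous response, then ranking for classification.
# Synthetic data only; not the teams' published pipelines.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel
from sklearn.metrics import roc_auc_score, average_precision_score

rng = np.random.default_rng(1)
n_train, n_test, n_feat = 400, 200, 25        # e.g. selected SNP dosages plus clinical covariates
X = rng.normal(size=(n_train + n_test, n_feat))
true_w = rng.normal(scale=0.2, size=n_feat)
delta_das28 = X @ true_w + rng.normal(scale=1.0, size=n_train + n_test)   # synthetic improvement scores

X_tr, X_te = X[:n_train], X[n_train:]
y_tr, y_te = delta_das28[:n_train], delta_das28[n_train:]

# Fit a GPR model to the continuous response and use its predictions to rank patients
gpr = GaussianProcessRegressor(kernel=RBF(length_scale=5.0) + WhiteKernel(), normalize_y=True)
gpr.fit(X_tr, y_tr)
pred = gpr.predict(X_te)

# Dichotomise the held-out truth into non-response (small improvement); threshold is illustrative
non_responder = (y_te < np.quantile(y_te, 0.36)).astype(int)
print("AUROC:", roc_auc_score(non_responder, -pred))   # lower predicted improvement -> higher non-response score
print("AUPR :", average_precision_score(non_responder, -pred))
```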

Figure 2: Model performance.

Competitive Phase: (a) Bootstrap distributions for each of the 27 models submitted to the classification subchallenge, ordered by overall rank. The top 11 models were significantly better than random at Bonferroni-corrected P value<0.05. Collaborative Phase: (b) Distributions of the models built with randomly sampled SNPs, by team, along with scores for their full model, which combined data-driven SNP selection with clinical variables (pink), and their clinical model, which contained clinical variables but excluded SNP data (blue). For 5 of 7 teams, the full models were nominally significantly better than the random-SNP models for AUPR, AUROC or both (enrichment P value 4.2e−5). (c) AUPR and AUROC of each Collaborative phase team’s full model, containing SNP and clinical predictors, versus their clinical model, which does not consider SNP predictors. There was no significant difference in either metric between models developed in the presence or absence of genetic information (paired t-test P value=0.85, 0.82 for AUPR and AUROC, respectively).

For the quantitative subchallenge, 28 final models from 17 teams were received, and performance was evaluated based on the correlation between predicted and observed ΔDAS28 (observed range: r=0.393 to −0.356). Of these, 18 submissions performed significantly better than random at a Bonferroni-corrected P value threshold of 0.05 (r=0.393 to 0.208). The top-performing submission, provided by Team Guanlab, was robustly better than all remaining solutions (Wilcoxon’s signed-rank test of bootstraps P value=2e−32 relative to the second-ranked submission, Supplementary Fig. 3) and used a GPR model similar to that described above to predict ΔDAS28 (see ‘Team Guanlab’ in the Supplementary Notes for more details).

Genetic contribution to model performance

Following the completion of the open challenge, the eight teams with the best predictive performances (seven in each subchallenge) were invited to join challenge organizers to perform a formalized evaluation of the contribution of genetic information to model performance across the solution space captured by these diverse methods. Challenge participants and organizers worked together in a collaborative manner to design and implement analyses to address this question. To ensure this analysis was performed across the best possible methods, teams were invited to continue to refine their individual algorithms based on the information shared across teams. Crosstalk was promoted through a webinar series for the discussion of methodological considerations between challenge teams, organizers and external experts. In addition, the eight teams were divided into three groups where intra-group discussions were encouraged. In general, we observed that teams altered their strategies regarding knowledge mining and feature selection in response to these efforts, but did not alter machine learning algorithms (Supplementary Table 3). Collaborative Phase predictions did not perform significantly better than Open Challenge predictions among Collaborative Phase participants (−0.017 ΔAUPR and −0.011 ΔAUROC for the classification subchallenge, P value=0.270 and 0.265, respectively), further supporting the hypothesis that the overall genetic contribution to predictive performance was negligible.

To explicitly test the ability of modelling techniques to detect a weak genetic contribution, we first examined the contribution of feature selection to model performance. Most teams had used a combination of knowledge-based and data-driven evidence to perform dimensionality reduction in their model development. To approximate the null distribution of the genetic models, each of the 8 teams trained 100 models using randomly sampled SNP sets matched in size to that of their best-performing model30. For 5 of 7 classification algorithms, models using knowledge-mined SNP selection significantly outperformed models using random SNPs for AUPR, AUROC or both at a nominal P value<0.05 (one-sided Kolmogorov–Smirnov test for enrichment of P values versus uniform P value=4.2e−05; Fig. 2b). Although there is uncertainty in the estimates of individual-algorithm enrichment due to the relatively small number of resamplings that were performed, a sensitivity analysis indicated that this experiment-wide significant enrichment was robust to these uncertainties (enrichment P value=0.002 at the 99.87% upper confidence bound of estimated P values). This suggested that, for these models, there was a non-zero contribution of genetic information to prediction of treatment effect. We next performed a pairwise comparison to directly assess the practical contribution of genetic information to model performance. Each team developed a model built in the absence of genetic information (clinical model), against which we compared their best model incorporating SNP data (full model). Clinical and demographic covariates were available for incorporation in both cases. Pairwise comparison across models demonstrated no statistical difference (paired t-test P value=0.85, 0.82 for classification AUPR and AUROC, respectively, and P value=0.65 for continuous prediction correlation; Fig. 2c, Supplementary Fig. 4), indicating that the contribution of SNP data was not of sufficient magnitude to detectably improve overall predictive performance. In further support of this conclusion, we note that the top-performing regression-based model, by Team Outliers, did not include any contribution from genetic information: genetic information was provided as an input but was regularized out as part of the parameter selection process (see Supplementary Information). Despite the fact that heritability estimates were highest for the MAB therapies (adalimumab and infliximab), and that the most effective approaches explicitly modelled drug-specific genetic signal, there was no evidence that genetic information contributed substantially for any drug-specific subset of the data (Bonferroni-corrected paired t-test P values for classification AUPR=1.0, 1.0, 1.0, 0.29, 1.0 for adalimumab, certolizumab, etanercept, infliximab and the combined set of all MAB therapies, respectively, and for AUROC=1.0, 1.0, 1.0, 0.59, 1.0, respectively).
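A minimal sketch of the random-SNP null comparison is given below, assuming a synthetic cohort, a logistic-regression classifier and an arbitrary curated SNP set; the actual teams used their own models and feature counts. The comparison of the curated-SNP score against the distribution of random-SNP scores yields the kind of empirical P value summarized in Fig. 2b. The paired full-versus-clinical comparison in Fig. 2c follows the same logic, replacing the random-SNP null with a model that simply omits the SNP features.

```python
# Sketch: compare a 'knowledge-mined' SNP model against 100 models built on random SNP sets
# of the same size. Data, classifier and effect sizes are illustrative assumptions.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(2)
n, n_snps, n_selected = 1500, 5000, 200
G = rng.binomial(2, 0.3, size=(n, n_snps)).astype(float)            # synthetic genotype dosages
informative = rng.choice(n_snps, 20, replace=False)                 # weak true signal at 20 SNPs
logit = G[:, informative] @ rng.normal(0.15, 0.05, 20) - 2.4
y = rng.binomial(1, 1.0 / (1.0 + np.exp(-logit)))                   # non-response status

others = rng.choice(np.setdiff1d(np.arange(n_snps), informative), n_selected - 20, replace=False)
curated = np.concatenate([informative, others])                     # the 'knowledge-mined' SNP set

def auroc_for(snp_idx):
    Xtr, Xte, ytr, yte = train_test_split(G[:, snp_idx], y, test_size=0.3, random_state=0)
    clf = LogisticRegression(C=0.1, max_iter=1000).fit(Xtr, ytr)
    return roc_auc_score(yte, clf.predict_proba(Xte)[:, 1])

observed = auroc_for(curated)
null = np.array([auroc_for(rng.choice(n_snps, n_selected, replace=False)) for _ in range(100)])
p_emp = (1 + np.sum(null >= observed)) / (1 + len(null))
print(f"curated AUROC={observed:.3f}, random-SNP mean={null.mean():.3f}, empirical P={p_emp:.3f}")
```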

The use of a diverse set of methodological approaches across teams provided the opportunity to test whether an aggregate of the individual approaches, leveraging their diversity and complementarity, might boost the overall genetic contribution to predictions. Specifically, ensemble models were learned from individual predictions submitted to the classification subchallenge using a supervised approach31. These models were trained using leave-one-out cross-validated predictions generated on the original training set using the individual methods and, as with individual submissions, analysed in a blinded manner using the test data. Two ensemble models were developed for the classification subchallenge using the stacking method32,33, one built using team predictions submitted during the open challenge phase (AUPR=0.5228, AUROC=0.622) and the second built using team predictions submitted during the collaborative phase (AUPR=0.5209, AUROC=0.6168). Performance of these supervised ensemble models was compared with the performance of the individual team model with the best overall performance, the model submitted in the collaborative phase by Team Outliers that did not contain any genetic information (Supplementary Fig. 5). The ensemble models performed only marginally better than this model (differences=0.005 and 0.0006, bootstrap P values=0.32 and 0.46 for AUPR and AUROC, respectively). This indicates that even ensembles that leverage complementary information among the individual predictions could not boost the ability to robustly predict anti-TNF response using genetic information.

Discussion

The RA Responder DREAM challenge performed a community-based open assessment of the contribution of SNP genotypes to predicting disease-modulating response to anti-TNF treatment in RA patients, and found that SNPs did not meaningfully contribute to the prediction of treatment response above the available clinical predictors (sex, age, anti-TNF drug name and methotrexate use). Given the negative nature of the findings in this report, it is important to clearly frame them within the constraints of the problem that was addressed. This study was designed to assess the ability to develop clinically actionable predictors using common SNP variants in the case where the genetic contribution to treatment efficacy is represented by tens of loci. Thorough analysis by dozens of researchers has shown that current predictive algorithms, as well as their ensembles, are not able to produce such predictors despite the estimation of a significant heritability for this trait. In fact, these researchers were not able to detect any genetic contribution to predictions, even in the subset of data for which heritability and power are predicted to be the highest. This may reflect the complex nature of genetic contribution across loci, the absence of individual, strongly associated common variants, or the presence of non-genetic sources of heterogeneity across individuals8,9,10. These findings do not provide information about the ability to use genetic data for predictive modelling of anti-TNF treatment efficacy in other cases, such as when: (1) the true number of risk loci is on the order of hundreds, or (2) the heritability is better explained by variants not assayed or tagged by variants in this study, including rare variants or copy-number variants. Given the sample sizes required to identify loci when the number of risk loci is on the order of hundreds (Supplementary Fig. 1b), and the general difficulty of explaining estimated heritability in complex traits even with large cohorts34,35,36,37,38, these observations suggest that future efforts may be better spent identifying biomarkers based on data modalities that better encapsulate both genetic and non-genetic contributions to treatment efficacy.

Although these genetic data did not provide a meaningful contribution to the predictions in this study, the methods used in this analysis were able to leverage the small set of available clinical features to develop a prediction that performed significantly better than random. These results suggest that future research efforts focused on the incorporation of a richer set of clinical information—including seropositivity, treatment compliance and disease duration—may provide opportunity to leverage these methods in clinically meaningful ways. In addition, the identification of data modalities that are more effective than genetics in capturing heterogeneity in RA disease progression—whether clinical, molecular or other—may also improve predictive performance.

This study demonstrates that a formalized evaluation of a scientific question across a wide solution space can be effectively accomplished by combining resources, both data and methodologies, across an open community of interested researchers. In research areas of high-potential impact but uncertain likelihood of success, such as described here, this community-wide approach provides an opportunity to build consensus regarding research outcomes to guide future efforts within that field. In this context, positive outcomes can highlight a rich strategy for future enquiry, while negative results can provide strong evidence in support of adjusting future paths of scientific exploration. Since the evidence that a task is implausible mounts with the number of failed attempts at solving it, making the case for implausibility requires the active contribution of multiple research groups. In this study, we demonstrate that formalized evaluation across a community of researchers provided a rapid mechanism for transparent assessment of current capabilities to assess the contribution of genetic information to prediction of anti-TNF response in RA patients.

Methods

Data sets

Two separate data sets were provided to participants to train and test the predictive models, respectively (Supplementary Table 1). In the case of the test data, only predictor variables were released, and the teams remained blinded to the response variables. The training data consisted of a previously published collection of anti-TNF-treated patients (n=2,706) of European ancestry, compiled from 13 collections6, of which the response variables from 675 patients were held out as a leaderboard test set. All patients met 1987 ACR criteria for RA or were diagnosed by a board-certified rheumatologist, and were required to have at least moderate DAS28 (ref. 15) at baseline (DAS28>3.2). Available clinical and demographic data included DAS28 at baseline and at least one time point after treatment, sex, age, anti-TNF drug name and methotrexate use. Follow-up DAS28 was measured 3–12 months after initiating anti-TNF therapy, though the precise duration of treatment was not available. Genotypes for each sample underwent quality control (QC) for sample and marker missingness, Hardy–Weinberg disequilibrium, relatedness and population outliers, and were imputed to HapMap Phase 2 (release 22) as previously described6. We note that although this data set does not represent the full spectrum of patient information that may be utilized within a clinical setting to inform treatment (including synovial tissue and novel soluble biomarkers such as MRP8/14 levels4,39,40), it did present sufficient data to explicitly assess the contribution of genetics to prediction.

The final test set was derived from a subset of patients enrolled in the CORRONA CERTAIN study27. CERTAIN is a prospective, non-randomized comparative effectiveness study of 2,804 adult patients with RA who have at least moderate disease activity (clinical disease activity index score >10) and are starting or switching biologic agents. DAS28 was provided at baseline and 3-month follow-up. At the time of challenge launch, 723 subjects had initiated anti-TNF therapy and had a 3-month follow-up visit. Of these patients, 57.4% were previously naive to biologics. Genotypes were generated on the Illumina Infinium HumanCoreExome array and imputed to HapMap Phase 2 (release 22) using IMPUTE2 (ref. 41). Before imputation, genotype QC included filtering individuals with >5% missing data, and filtering SNPs with >1% missing data, MAF<1% or departure from Hardy–Weinberg equilibrium (χ2-test P value<10−5). Sex was inferred from the X-chromosome genotypes using PLINK42, and all samples matched their reported sex. One parent–offspring relationship was identified in the data, but both samples were kept in the test set. While data for all 723 subjects were released to participants, 93 patients were excluded for the purposes of scoring because their genotyping data were not consistent with European ancestry as inferred by EIGENSTRAT43. In addition, a subset of patients in the test data set were treated with anti-TNF drugs that were not represented in the training data set: golimumab and certolizumab. The 39 patients receiving golimumab were excluded because this drug was not represented in the training data and predictions showed that participants were unable to successfully predict response in these subjects. In contrast, prediction in certolizumab-treated patients was similar to prediction for the remaining three drugs, and so these data were included in the final test set.
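For illustration, the QC thresholds listed above can be expressed as in the sketch below, which applies them to a genotype matrix with missing calls coded as NaN. This is a compact stand-in for the standard tooling (for example, PLINK) used in practice; the function names and matrix layout are assumptions.

```python
# Sketch: apply the listed QC thresholds to a (individuals x SNPs) genotype matrix with NaN for missing calls.
import numpy as np
from scipy.stats import chi2

def hwe_pvalue(geno):
    """1-d.f. chi-square test of Hardy-Weinberg equilibrium for one SNP (0/1/2 coding)."""
    g = geno[~np.isnan(geno)]
    n = len(g)
    counts = np.array([(g == k).sum() for k in (0, 1, 2)], dtype=float)
    p = (2 * counts[2] + counts[1]) / (2 * n)                     # frequency of the allele coded '2'
    expected = n * np.array([(1 - p) ** 2, 2 * p * (1 - p), p ** 2])
    stat = np.nansum((counts - expected) ** 2 / np.where(expected > 0, expected, np.nan))
    return chi2.sf(stat, df=1)

def qc_filter(G, ind_miss=0.05, snp_miss=0.01, maf_min=0.01, hwe_p=1e-5):
    """Return boolean masks of individuals and SNPs passing the thresholds described in the text."""
    keep_ind = np.isnan(G).mean(axis=1) <= ind_miss               # individual missingness <= 5%
    G = G[keep_ind]
    keep_snp = np.isnan(G).mean(axis=0) <= snp_miss               # SNP missingness <= 1%
    freq = np.nanmean(G, axis=0) / 2.0
    keep_snp &= np.minimum(freq, 1 - freq) >= maf_min             # MAF >= 1%
    hwe = np.array([hwe_pvalue(G[:, j]) if keep_snp[j] else 0.0 for j in range(G.shape[1])])
    keep_snp &= hwe >= hwe_p                                      # HWE P >= 1e-5
    return keep_ind, keep_snp
```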

Two ancillary data sets were made available for participant use. The first measured TNFα protein levels in HapMap cell lines44. The second included blood RNA-seq data and genotypes for 60 RA patients from the Arthritis Foundation-sponsored Arthritis Internet Registry, 30 of whom displayed high inflammatory levels and 30 of whom displayed low inflammatory levels. Inflammatory levels were assessed using blood concentrations of C-reactive protein (CRP); elevated disease activity was defined as CRP>0.8 mg dl−1 and low disease activity as CRP<0.1 mg dl−1. In addition to CRP levels, rheumatoid factor antibody levels and cyclic citrullinated peptide antibody levels were also assayed. Genotypes were assayed on the Illumina HumanOmniExpressExome array.

Power calculations

For combinations of a range of risk allele frequencies, p=(0.01, 0.02, 0.03, …, 0.99), and relative risks, λ=(1.1, 1.11, 1.12, …, 2.4), we computed the number, n, of such loci required to explain a heritability of 0.18, as estimated for this trait (equation (3) in the study by Wray et al.8), and the power assuming a multiplicative model using the GeneticsDesign Bioconductor package45, given a trait prevalence, K=0.217, as estimated from the discovery cohort. The expected heritability explained was estimated as the median power over all combinations of p and λ for which n rounded to a given value.
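A rough, simulation-based analogue of this calculation is sketched below: for a single locus with risk-allele frequency p and per-allele relative risk under a multiplicative model, the liability-scale variance explained is approximated via the standard observed-to-liability transformation, and the number of such loci needed to reach the estimated SNP-h2 of 0.18 follows by division. This is only an approximate cross-check, not the closed-form approach of Wray et al.8, and the grid values shown are chosen for illustration.

```python
# Sketch: approximate number of equal-effect loci needed to explain SNP-h2 = 0.18.
import numpy as np
from scipy.stats import norm

def loci_needed(p, rel_risk, K=0.217, h2_target=0.18, n_sim=500_000, seed=0):
    """Approximate per-locus liability variance and the loci count required to reach h2_target."""
    rng = np.random.default_rng(seed)
    g = rng.binomial(2, p, n_sim)                                  # genotypes under HWE
    mean_rr = (1 - p) ** 2 + 2 * p * (1 - p) * rel_risk + p ** 2 * rel_risk ** 2
    penetrance = np.clip(K * rel_risk ** g / mean_rr, 0, 1)        # multiplicative risk model, prevalence K
    y = rng.binomial(1, penetrance)
    r2_obs = np.corrcoef(g, y)[0, 1] ** 2                          # variance explained on the observed 0/1 scale
    z = norm.pdf(norm.ppf(1 - K))                                  # normal density at the liability threshold
    h2_locus = r2_obs * K * (1 - K) / z ** 2                       # observed -> liability scale (Dempster-Lerner)
    return h2_locus, int(np.ceil(h2_target / h2_locus))

for p, rr in [(0.3, 1.2), (0.1, 1.5), (0.5, 1.1)]:
    v, n = loci_needed(p, rr)
    print(f"p={p}, RR={rr}: ~{v:.4f} liability variance per locus -> ~{n} loci for h2=0.18")
```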

The AUROC corresponding to various proportions of heritability explained was computed using equation (3) in the study by Wray et al.22 after converting our estimated heritability to the liability scale. In addition, we estimated the proportion of the variance explained by clinical variables using the AUROC for the best clinical model from the collaborative phase (equation (4) in the study by Wray et al.22) and computed the AUROC corresponding to various proportions of heritability explained assuming independence between the clinical and genetic components.
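The same quantity can also be approximated by simulation under a liability-threshold model, as sketched below: a genetic score capturing a given fraction of the SNP heritability is drawn, non-response is assigned by thresholding total liability at the estimated prevalence, and the AUROC of the genetic score is computed. This is an illustrative cross-check of the closed-form calculation22; it omits the clinical component considered in the study, so the resulting AUROCs reflect the genetic component alone.

```python
# Sketch: maximum AUROC achievable from a genetic score capturing a fraction of SNP-h2,
# under a liability-threshold model. Simulation settings are illustrative.
import numpy as np
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
n = 200_000            # simulated patients
h2_snp = 0.18          # SNP heritability estimated in the primary cohort
prevalence = 0.217     # non-response rate estimated from the training data

for prop_h2 in (0.25, 0.55, 1.00):                      # fraction of SNP-h2 captured by the model
    var_g = h2_snp * prop_h2
    g = rng.normal(0.0, np.sqrt(var_g), n)              # genetic score available to the predictor
    e = rng.normal(0.0, np.sqrt(1.0 - var_g), n)        # remaining (unmodelled) liability
    liability = g + e
    threshold = np.quantile(liability, 1.0 - prevalence)
    non_responder = (liability > threshold).astype(int)
    print(f"{prop_h2:.0%} of SNP-h2 captured -> AUROC ~ {roc_auc_score(non_responder, g):.3f}")
```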

Scoring methods

For the classification subchallenge, teams were asked to submit an ordered list of patients ranked according to the predicted response to therapy. Special treatment was given to the computation of the curve statistics when the ordering was ambiguous, such as in the case of ties or binary predictions, in which case an average across all possible consistent solutions was used28. The average of the ranks of the AUPR and AUROC was used to rank solutions.

For the quantitative subchallenge, teams were asked to submit predicted ΔDAS28, and the Pearson’s correlation between the predicted and actual ΔDAS28 was used to score submissions.
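A compact sketch of both scoring rules is given below. Ties are handled here by averaging over random tie-breakings, a Monte Carlo stand-in for the exact averaging over all consistent orderings used by the organizers28; the function names, data structures and draw counts are illustrative.

```python
# Sketch: challenge-style scoring. Classification: AUROC/AUPR averaged over tie-breakings,
# then overall rank = mean of AUROC rank and AUPR rank. Quantitative: Pearson correlation.
import numpy as np
from scipy.stats import pearsonr, rankdata
from sklearn.metrics import roc_auc_score, average_precision_score

def tie_averaged_scores(y_true, y_score, n_draws=200, seed=0):
    """AUROC and AUPR averaged over random tie-breakings of the submitted ranking."""
    rng = np.random.default_rng(seed)
    aurocs, auprs = [], []
    for _ in range(n_draws):
        s = y_score + rng.uniform(-1e-9, 1e-9, len(y_score))   # perturb only enough to break exact ties
        aurocs.append(roc_auc_score(y_true, s))
        auprs.append(average_precision_score(y_true, s))
    return float(np.mean(aurocs)), float(np.mean(auprs))

def classification_ranks(y_true, submissions):
    """submissions: dict of team -> score vector. Overall rank = mean of AUROC rank and AUPR rank."""
    scores = {team: tie_averaged_scores(y_true, s) for team, s in submissions.items()}
    teams = list(scores)
    auroc_rank = rankdata([-scores[t][0] for t in teams])
    aupr_rank = rankdata([-scores[t][1] for t in teams])
    return dict(zip(teams, (auroc_rank + aupr_rank) / 2.0))

def quantitative_score(delta_das28_true, delta_das28_pred):
    """Pearson correlation between predicted and observed change in DAS28."""
    return pearsonr(delta_das28_true, delta_das28_pred)[0]
```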

Competitive phase of the challenge

The challenge was open to all individuals who agreed to the DREAM terms of use and obtained access to the challenge data by certifying their compliance with the data terms of use. The training and ancillary data were released for use on 10 February 2014. The leaderboards opened on 5 March, at which time participants were able to test their models in real-time against a held-out portion of the training data set. The prediction variables of the test data set were released to participants on 8 May and submission queues for final submissions were open between 21 May and 4 June. Only the final two submissions per team per subchallenge were scored. Participants who did not have enough computational resources in their home institutions were offered the option to use an IBM z-Enterprise cloud, with two virtual machines running Linux servers, one with 20 processors, 242 GB memory, 9 TB storage space and the other with 12 processors, 128 GB memory and 1 TB of storage space. Cloud users could access the Challenge data directly through the IBM system.

Evaluation of submissions

Predictions were evaluated using two data sets: 675 individuals from the training cohort (leaderboard test set) and all individuals from the CORRONA CERTAIN data (final test set). In both cases, response variables were withheld from participants. Participants were allowed 100 submissions to the classification subchallenge leaderboard and unlimited submissions to the quantitative subchallenge leaderboard throughout the competitive phase of the competition, and were provided near-instant results. Participants were allowed two final submissions per subchallenge, and scores were revealed after the submission deadline. A permutation test was used to assess whether the classifications or ΔDAS28 quantitative predictions were better than expected at random using a one-sided P value. To assess the robustness of the relative ranking of predictions, 1,000 bootstraps were performed by sampling subjects with replacement. Within each bootstrap iteration, evaluation scores were computed for each submission, along with the within-iteration rank. A prediction was deemed ‘robustly’ better than another if a Wilcoxon signed-rank test across the 1,000 bootstrap iteration estimates was significant at P value<0.05. While this is not the same as strict statistical significance, it was the criterion we used to differentiate models given the relatively small improvements from one to another.
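The bootstrap comparison can be sketched as follows, assuming two hypothetical submission score vectors and AUROC as the evaluation metric; in the challenge the same resampled subjects were scored for every submission within each iteration, and the procedure was repeated for each metric.

```python
# Sketch: bootstrap comparison of two submissions, with a Wilcoxon signed-rank test
# on the paired per-iteration score differences. Inputs are synthetic and illustrative.
import numpy as np
from scipy.stats import wilcoxon
from sklearn.metrics import roc_auc_score

def bootstrap_compare(y_true, score_a, score_b, n_boot=1000, seed=0):
    """Resample subjects with replacement and test whether submission A is 'robustly' better than B."""
    rng = np.random.default_rng(seed)
    n, diffs = len(y_true), []
    for _ in range(n_boot):
        idx = rng.integers(0, n, n)                    # same resampled subjects scored for both submissions
        if y_true[idx].min() == y_true[idx].max():     # skip degenerate resamples containing a single class
            continue
        diffs.append(roc_auc_score(y_true[idx], score_a[idx]) - roc_auc_score(y_true[idx], score_b[idx]))
    return float(np.mean(diffs)), wilcoxon(diffs).pvalue

rng = np.random.default_rng(4)
y = rng.binomial(1, 0.36, 591)                         # test-set size and approximate non-response rate
submission_a = 0.4 * y + rng.normal(size=591)          # two hypothetical submissions
submission_b = 0.3 * y + rng.normal(size=591)
print(bootstrap_compare(y, submission_a, submission_b))
```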

Development and scoring in the collaborative phase

One of the aims of DREAM Challenges is to foster collaborative research. As such, the collaborative phase was designed to foster cooperation between the best-performing teams in the competitive phase. Teams came together to develop research questions and analytical strategies to answer specific questions related to the ability to predict non-response to anti-TNF treatment. Each team submitted classifications/predictions, or sets of classifications/predictions, designed to answer questions about the degree to which genetic data contributed to the models, and the classifications were scored and analysed across teams by the challenge organizers. To compare across methods and approaches, we asked the collaborative phase participants to submit classifications/predictions using their own knowledge- and data-mined SNP lists, which they refined from the competitive phase after peer review from fellow participants. In addition, they were asked to submit a classification/prediction that used only clinical predictors and did not include genetic predictors. We also asked the participants to submit 100 sets of classifications/predictions in which the SNPs used as potential predictors were randomly sampled from the genome and matched the number of SNPs in their genetic model. Eight teams participated in the collaborative phase, seven in each subchallenge. Ranked results for the genetic models are shown in Supplementary Fig. 5.

Ensemble classifications

The goal of ensemble learning was to aggregate the classifications submitted by individual teams to the classification subchallenge, including 6 from the Competitive Phase and 7 from the Collaborative Phase, by effectively leveraging the consensus as well as the diversity among these predictions. We focused on learning heterogeneous ensembles31, which are capable of aggregating classifications from a diverse set of potentially unrelated base classifiers, as is the case with the submissions to this subchallenge. Specifically, we followed the stacking methodology32,33, which involves learning a meta-classifier (second-level predictor) on top of the base classifications. For the initial ensemble learning, this methodology was applied to base classifications generated on the training set through a leave-one-out cross-validation procedure. To address the potential calibration issue in this task46, we investigated using the raw base classifications as well as the output of two other normalization procedures, z-score (mean=0, s.d.=1) and Scale0–1 (maximum=1, minimum=0), applied to the raw base classifications. Next, sixteen different classification algorithms (Supplementary Table 4) were used to train ensemble models from each of the above normalized versions of the base classifications. The implementations of these algorithms were obtained from the Weka machine learning suite47, and their default parameters were used.
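A minimal sketch of this stacking procedure is shown below, using synthetic base classifications. The study's best-performing meta-classifier was Weka's kernel-density Naive Bayes (NaiveBayes_kdf); scikit-learn's GaussianNB is used here only as a readily available stand-in, and the team count and signal strength are assumptions for illustration.

```python
# Sketch: stacking with z-score normalization of base classifications and a Naive Bayes
# meta-classifier. Base predictions here are synthetic, not the actual team submissions.
import numpy as np
from sklearn.naive_bayes import GaussianNB
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(5)
n_train, n_test, n_teams = 2031, 591, 7                 # cohort sizes from the study; team count illustrative
y_train = rng.binomial(1, 0.217, n_train)
y_test = rng.binomial(1, 0.357, n_test)

# Base-level predictions: rows = patients, columns = team submissions. In the study these were the
# teams' leave-one-out cross-validated training-set predictions and their final test-set predictions.
P_train = 0.3 * y_train[:, None] + rng.normal(size=(n_train, n_teams))
P_test = 0.3 * y_test[:, None] + rng.normal(size=(n_test, n_teams))

scaler = StandardScaler().fit(P_train)                        # z-score normalization of each base classifier
meta = GaussianNB().fit(scaler.transform(P_train), y_train)   # stand-in for Weka's NaiveBayes_kdf
ensemble_score = meta.predict_proba(scaler.transform(P_test))[:, 1]
print("ensemble AUROC:", round(roc_auc_score(y_test, ensemble_score), 3))
```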

Supplementary Figure 6 shows the performance of different combinations of normalization and classification methods on the leaderboard test set in terms of (a) AUPR, (b) AUROC and (c) overall rank. Several observations can be made from these results. First, the ensemble learned with z-score normalization followed by a Naive Bayes classifier that uses kernelized probability density functions48 produced the best aggregate performance on the leaderboard test set (AUROC=0.7569, AUPR=0.49), suggesting approximate conditional independence of the base classifications and non-normality of their underlying distributions. In general, normalization (either z-score or Scale0–1) improved the performance of 14, 14 and 13 of the 16 classifiers examined in terms of AUPR, AUROC and overall rank, respectively, indicating the importance of effective calibration in such ensemble learning tasks. Of these, 10, 9 and 7 classifiers, including NaiveBayes_kdf, achieved their best performance with z-score normalization, giving this normalization method an edge over Scale0–1.

On the basis of the conclusions above, we applied the ensemble model trained using z-score and NaiveBayes_kdf to the individual team classifications submitted for the CORRONA CERTAIN test set in the competitive and collaborative phases. The ensemble of the competitive phase (AUPR=0.5228, AUROC=0.622) performed better than each of the individual classifications and slightly better than the ensemble of the collaborative phase (AUPR=0.5209, AUROC=0.6168). However, these improvements were not statistically significant.

Data availability

Data use within the scope of this challenge was performed with the approval of an internal review board for all data sets. All data used for the challenge are available through the Synapse repository (syn3280809, doi:10.7303/syn3280809).

Additional information

How to cite this article: Sieberts, S. K. et al. Crowdsourced assessment of common genetic contribution to predicting anti-TNF treatment response in rheumatoid arthritis. Nat. Commun. 7:12460 doi: 10.1038/ncomms12460 (2016).