Crowdsourcing genetic prediction of clinical utility in the Rheumatoid Arthritis Responder Challenge

Plenge, Robert M; Greenberg, Jeffrey D; Mangravite, Lara M; Derry, Jonathan M J; Stahl, Eli A; Coenen, Marieke J H; Barton, Anne; Padyukov, Leonid; Klareskog, Lars; Gregersen, Peter K; Mariette, Xavier; Moreland, Larry W; Bridges, S Louis; de Vries, Niek; Huizinga, Tom W J; Guchelaar, Henk-Jan; Friend, Stephen H; Stolovitzky, Gustavo

doi:10.1038/ng.2623

Download PDF

Correspondence
Open access
Published: 26 April 2013

Crowdsourcing genetic prediction of clinical utility in the Rheumatoid Arthritis Responder Challenge

Robert M Plenge^1,2,
Jeffrey D Greenberg³,
Lara M Mangravite⁴,
Jonathan M J Derry⁴,
Eli A Stahl⁵,
Marieke J H Coenen⁶,
Anne Barton⁷,
Leonid Padyukov⁸,
Lars Klareskog⁸,
Peter K Gregersen⁹,
Xavier Mariette¹⁰,
Larry W Moreland¹¹,
S Louis Bridges Jr¹²,
Niek de Vries¹³,
Tom W J Huizinga¹⁴,
Henk-Jan Guchelaar¹⁵,
International Rheumatoid Arthritis Consortium (INTERACT),
Stephen H Friend^4,16 &
…
Gustavo Stolovitzky^16,17

Nature Genetics volume 45, pages 468–469 (2013)Cite this article

6119 Accesses
18 Citations
29 Altmetric
Metrics details

Subjects

To the Editor:

The ability to translate large-scale genetics and genomics data into biological knowledge has not kept pace with our ability to generate these data sets. As a consequence, a major bottleneck in biomedical research has become access to data within a computational workspace that allows for robust, collaborative analyses. One innovative solution is to bring together scientific data, code, tools and disease models into an open commons or workspace, for example, the Synapse platform of Sage Bionetworks¹. This environment allows for real-time sharing of large genomic data sets, continuous peer review and rapid learning within a system constructed to provide data access in a manner aligned with the informed consent provided by patients and research participants.

This crowdsourcing approach has been used to predict breast cancer survival from clinical and omics data² and was suggested as a way to find new drugs³ by soliciting contributions from a large online community collaborating or competing to answer an inherently difficult but important question⁴. Researchers initiating an open challenge invite solutions but also incentivize the process by offering new data, a process in which the participants' methods can be assessed by testing their predictions against previously unseen data sets. This year, Sage and DREAM (Dialogue for Reverse Engineering Assessments and Methods) are running four open challenges (http://www.sagebase.org/challenges-overview/2013-dream-challenges/).

Here we announce the challenge to develop genetic predictors of response to immunosuppressive therapy in a common autoimmune disease, rheumatoid arthritis (RA). Disease-modifying antirheumatic drugs such as those that block the inflammatory cytokine tumor necrosis factor-α (known as anti-TNF therapy) are not effective in all patients with RA, with up to one-third of such patients failing to enter clinical remission after a standard course of therapy⁵. Moreover, the biological mechanisms underlying this failure are unknown, limiting the development of clinical biomarkers to guide either this therapy or the development of new drugs to target refractory cases.

The Rheumatoid Arthritis Responder Challenge is for teams to build the best genetic predictor of response to anti-TNF therapy. There are two phases to the challenge: discovery and validation (Fig. 1). In the discovery phase, teams will utilize genomic data sets—several of which will be generated for the purposes of this challenge—and a variety of analytical methods to build predictive polygenic models of treatment response. We recently published a genome-wide association study (GWAS) in ∼2,700 patients with RA treated with anti-TNF therapy⁶. Our GWAS data indicate that the genetic architecture of the anti-TNF response is probably highly polygenic, similar to what has been observed for other complex traits, such as risk of RA⁷. Importantly, our challenge will incorporate a new GWAS data set, which will be used in the validation phase, in which models built in the discovery phase are tested. The data set of ∼1,100 patients with RA treated with anti-TNF therapy will be made available though a public-private partnership between the Consortium of Rheumatology Researchers of North America, Inc. (CORRONA) and the Pharmacogenomics Research Network (PGRN) sponsored by the National Institute of General Medical Sciences (NIGMS) and the US National Institutes of Health (NIH).

**Figure 1: Overview of the Rheumatoid Arthritis Responder Challenge.**

A unique component of our Rheumatoid Arthritis Responder Challenge is the diversity of participation across a number of groups from academic institutions, private foundations and for-profit companies. In addition to support from CORRONA and PGRN, we received funding from pharmaceutical companies (see complete list on our website; link below) and a private foundation (the Arthritis Foundation) to support the public commons. We also received support from the Arthritis Internet Registry (AIR) and the Broad Institute to generate new genomic data sets, as well as in-kind support from a large number of academic collaborators from across the world to make GWAS data available in the discovery phase. We anticipate that a winning classifier could enable a follow-on prospective clinical trial within the group of appropriately consented patients in AIR.

Through Synapse, analysts who are inclined to establish collaborations will have the opportunity to see in real time the models that others are using so that each team can learn from the others (Fig. 1). A leaderboard will show the relative performance ranking of the different teams on the basis of a crossvalidation strategy designed to minimize overfitting. During the discovery phase, teams that choose to collaborate with each other will have the opportunity to check each other's algorithms for readability, speed and reproducibility. Then, during the validation phase, each team will submit computer code, which the Sage-DREAM team (http://www.sagebase.org/) will test in Synapse to establish whether it runs as expected to predict if a subject is an anti-TNF therapy responder or nonresponder on the basis of the GWAS data. Predefined performance metrics will be used to objectively determine the accuracy of the predictions, their statistical significance and the final performance ranking of the participating teams. The team that develops the most highly predictive model will be deemed the 'winner', with precise attribution of contributor roles going to all members of teams that contributed to building the final consensus model.

The best-performing models, therefore, will have passed a test of performance that is outside the realm of, and complements, traditional peer review. Indeed, this stringent test of method performance can be used as an enhanced way of publication vetting, what we call 'challenge-assisted peer review'. Traditional peer review is essential for ensuring the clarity, originality, contextualization and logical thread of a discrete set of work that is ready to be used by researchers in the form of a published article. However, the complexity of working with omics data—entailing multiple analytical decisions, computational simulations and statistical calculations—means that referees are challenged to follow and check the components of even a traditional research paper. In our Rheumatoid Arthritis Responder Challenge, we will explore the feasibility of enhancing the reliability and transparency of conventional peer review in partnership with Nature Genetics. This can be achieved if the referees and authors of the paper reporting on the best-performing methods in the challenge are willing to leave their comments openly (yet anonymously) on the Synapse platform (Fig. 1). We anticipate that the challenge-based assessment of accuracy will provide an objective metric of performance and a comparison with state-of-the-art analytical methodologies that will greatly enhance the task of refereeing a body of work with more quality control than is currently provided by conventional peer review.

In conclusion, we believe that the Rheumatoid Arthritis Responder Challenge is an apt use of crowdsourcing in human genetics to gain insight into clinical prediction and disease biology. Details of the challenge, including the rules by which the models will be judged, can be found at https://synapse.prod.sagebase.org/#!Synapse:syn1734172.

References

Derry, J.M.J. et al. Nat. Genet. 44, 127–130 (2012).
Article CAS Google Scholar
Margolin, A.A. et al. Sci. Transl. Med. 5, 181re1 (2013).
Article Google Scholar
Wadman, M. Nature 490, 15 (2012).
Article CAS Google Scholar
Costello, J.C. & Stolovitzky, G. Clin. Pharmacol. Ther. 93, 396–398 (2013)
Article CAS Google Scholar
McInnes, I.B. & Schett, G. N. Engl. J. Med. 365, 2205–2219 (2011).
Article CAS Google Scholar
Cui, J. et al. PLoS Genet. published online, http://dx.doi.org/10.1371/journal.pgen.1003394 (28 March 2013).
Stahl, E.A. et al. Nat. Genet. 44, 483–489 (2012).
Article CAS Google Scholar

Download references

Author information

Authors and Affiliations

Division of Rheumatology, Immunology and Allergy, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA
Robert M Plenge
Medical and Population Genetics Program, Chemical Biology Program, Broad Institute, Cambridge, Massachusetts, USA
Robert M Plenge
Division of Rheumatology, New York University School of Medicine, New York, New York, USA
Jeffrey D Greenberg
Sage Bionetworks, Seattle, Washington, USA
Lara M Mangravite, Jonathan M J Derry & Stephen H Friend
Division of Psychiatric Genomics and Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mt. Sinai, New York, New York, USA
Eli A Stahl
Department of Human Genetics, Radboud University Medical Centre, Nijmegen, The Netherlands
Marieke J H Coenen
Arthritis Research UK Epidemiology Unit, Musculoskeletal Research Group, University of Manchester, Manchester Academic Health Sciences Centre, Manchester, UK
Anne Barton
Department of Medicine, Rheumatology Unit, Karolinska Institutet, Stockholm, Sweden
Leonid Padyukov & Lars Klareskog
The Feinstein Institute for Medical Research, North Shore–Long Island Jewish Health System, Manhasset, New York, USA
Peter K Gregersen
Université Paris-Sud, Assistance Publique-Hôpitaux de Paris (AP-HP), Hôpitaux Universitaires Paris-Sud, Institut National de la Santé et de la Recherche Médicale (INSERM) U1012, Rheumatology, le Kremlin-Bicêtre, France
Xavier Mariette
Division of Rheumatology and Clinical Immunology, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Larry W Moreland
Division of Clinical Immunology and Rheumatology, University of Alabama at Birmingham, Birmingham, Alabama, USA
S Louis Bridges Jr
Department of Clinical Immunology and Rheumatology, Academic Medical Center (AMC), University of Amsterdam, Amsterdam, The Netherlands
Niek de Vries
Department of Rheumatology, Leiden University Medical Centre, Leiden, The Netherlands
Tom W J Huizinga
Department of Clinical Pharmacy and Toxicology, Leiden University Medical Center, Leiden, The Netherlands
Henk-Jan Guchelaar
Sage/DREAM Project, Sage Bionetworks, Seattle, Washington, USA
Stephen H Friend & Gustavo Stolovitzky
IBM Computational Biology Center, T.J. Watson Research Center, Yorktown Heights, New York, USA
Gustavo Stolovitzky

Authors

Robert M Plenge
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey D Greenberg
View author publications
You can also search for this author in PubMed Google Scholar
Lara M Mangravite
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan M J Derry
View author publications
You can also search for this author in PubMed Google Scholar
Eli A Stahl
View author publications
You can also search for this author in PubMed Google Scholar
Marieke J H Coenen
View author publications
You can also search for this author in PubMed Google Scholar
Anne Barton
View author publications
You can also search for this author in PubMed Google Scholar
Leonid Padyukov
View author publications
You can also search for this author in PubMed Google Scholar
Lars Klareskog
View author publications
You can also search for this author in PubMed Google Scholar
Peter K Gregersen
View author publications
You can also search for this author in PubMed Google Scholar
Xavier Mariette
View author publications
You can also search for this author in PubMed Google Scholar
Larry W Moreland
View author publications
You can also search for this author in PubMed Google Scholar
S Louis Bridges Jr
View author publications
You can also search for this author in PubMed Google Scholar
Niek de Vries
View author publications
You can also search for this author in PubMed Google Scholar
Tom W J Huizinga
View author publications
You can also search for this author in PubMed Google Scholar
Henk-Jan Guchelaar
View author publications
You can also search for this author in PubMed Google Scholar
Stephen H Friend
View author publications
You can also search for this author in PubMed Google Scholar
Gustavo Stolovitzky
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

International Rheumatoid Arthritis Consortium (INTERACT)

Corresponding author

Correspondence to Robert M Plenge.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

A full list of members is provided at https://synapse.prod.sagebase.org/#!Synapse:syn1734172.

Rights and permissions

This work is licensed under a Creative Commons Attribution- NonCommercial-Share Alike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/.

Reprints and permissions

About this article

Cite this article

Plenge, R., Greenberg, J., Mangravite, L. et al. Crowdsourcing genetic prediction of clinical utility in the Rheumatoid Arthritis Responder Challenge. Nat Genet 45, 468–469 (2013). https://doi.org/10.1038/ng.2623

Download citation

Published: 26 April 2013
Issue Date: May 2013
DOI: https://doi.org/10.1038/ng.2623

This article is cited by

An introduction to machine learning and analysis of its use in rheumatic diseases
- Kathryn M. Kingsmore
- Christopher E. Puglisi
- Peter E. Lipsky
Nature Reviews Rheumatology (2021)
Machine learning in rheumatology approaches the clinic
- Aridaman Pandit
- Timothy R. D. J. Radstake
Nature Reviews Rheumatology (2020)
Can machine learning predict responses to TNF inhibitors?
- Nisha Nair
- Anthony G. Wilson
Nature Reviews Rheumatology (2019)
DREAMing of benchmarks
- Irene Jarchum
- Susan Jones
Nature Biotechnology (2015)
Taking pan-cancer analysis global

Nature Genetics (2013)

Crowdsourcing genetic prediction of clinical utility in the Rheumatoid Arthritis Responder Challenge

Subjects

References

Author information

Authors and Affiliations

Consortia

International Rheumatoid Arthritis Consortium (INTERACT)

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

This article is cited by

An introduction to machine learning and analysis of its use in rheumatic diseases

Machine learning in rheumatology approaches the clinic

Can machine learning predict responses to TNF inhibitors?

DREAMing of benchmarks

Taking pan-cancer analysis global

Search

Quick links

Subjects

References

Author information

Authors and Affiliations

Consortia

International Rheumatoid Arthritis Consortium (INTERACT)

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

An introduction to machine learning and analysis of its use in rheumatic diseases

Machine learning in rheumatology approaches the clinic

Can machine learning predict responses to TNF inhibitors?

DREAMing of benchmarks

Taking pan-cancer analysis global

Search

Quick links