The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models

doi:10.1038/nbt.1665

Article
Published: 30 July 2010

MAQC Consortium

Nature Biotechnology volume 28, pages 827–838 (2010)

Abstract

Gene expression data from microarrays are being applied to predict preclinical and clinical endpoints, but the reliability of these predictions has not been established. In the MAQC-II project, 36 independent teams analyzed six microarray data sets to generate predictive models for classifying a sample with respect to one of 13 endpoints indicative of lung or liver toxicity in rodents, or of breast cancer, multiple myeloma or neuroblastoma in humans. In total, >30,000 models were built using many combinations of analytical methods. The teams generated predictive models without knowing the biological meaning of some of the endpoints and, to mimic clinical reality, tested the models on data that had not been used for training. We found that model performance depended largely on the endpoint and team proficiency and that different approaches generated models of similar performance. The conclusions and recommendations from MAQC-II should be useful for regulatory agencies, study committees and independent investigators that evaluate methods for global gene expression analysis.

**Figure 1: Experimental design and timeline of the MAQC-II project.**

**Figure 2: Model performance on internal validation compared with external validation.**

**Figure 3: Performance, measured using MCC, of the best models nominated by the 17 data analysis teams (DATs) that analyzed all 13 endpoints in the original training-validation experiment.**

**Figure 4: Correlation between internal and external validation is dependent on data analysis team.**

**Figure 5: Effect of modeling factors on estimates of model performance.**
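
The Figure 3 caption reports model performance as MCC, the Matthews correlation coefficient. As a rough illustration of what that metric measures for a binary endpoint, the sketch below computes it from confusion-matrix counts. The function names `confusion_counts` and `mcc`, the toy labels, and the choice to return 0.0 when the denominator is zero are assumptions made for this example; they are not taken from the study's own analysis code.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count (tp, tn, fp, fn) for binary labels coded as 0 and 1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Ranges from -1 (total disagreement) through 0 (no better than chance)
    to +1 (perfect prediction).
    """
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Assumed convention: report 0.0 when a row or column of the
    # confusion matrix is empty and the ratio is undefined.
    if denominator == 0:
        return 0.0
    return numerator / denominator


if __name__ == "__main__":
    # Toy labels, purely illustrative.
    y_true = [1, 1, 0, 0]
    y_pred = [1, 0, 0, 0]
    print(mcc(*confusion_counts(y_true, y_pred)))
```

Unlike raw accuracy, MCC uses all four cells of the confusion matrix, so a model that always predicts the majority class scores 0 rather than a deceptively high value.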

Accession codes

Accessions

GenBank/EMBL/DDBJ

009-00002-0010-000-3

Gene Expression Omnibus

GSE16716

Acknowledgements

The MAQC-II project was funded in part by the FDA's Office of Critical Path Programs (to L.S.). Participants from the National Institutes of Health (NIH) were supported by the Intramural Research Program of NIH, Bethesda, Maryland or the Intramural Research Program of the NIH, National Institute of Environmental Health Sciences (NIEHS), Research Triangle Park, North Carolina. J.F. was supported by the Division of Intramural Research of the NIEHS under contract HHSN273200700046U. Participants from the Johns Hopkins University were supported by grants from the NIH (1R01GM083084-01 and 1R01RR021967-01A2 to R.A.I. and T32GM074906 to M.M.). Participants from the Weill Medical College of Cornell University were partially supported by the Biomedical Informatics Core of the Institutional Clinical and Translational Science Award RFA-RM-07-002. F.C. acknowledges resources from The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine and from the David A. Cofrin Center for Biomedical Information at Weill Cornell. The data set from The Hamner Institutes for Health Sciences was supported by a grant from the American Chemistry Council's Long Range Research Initiative. The breast cancer data set was generated with support of grants from NIH (R-01 to L.P.), The Breast Cancer Research Foundation (to L.P. and W.F.S.) and the Faculty Incentive Funds of the University of Texas MD Anderson Cancer Center (to W.F.S.). The data set from the University of Arkansas for Medical Sciences was supported by National Cancer Institute (NCI) PO1 grant CA55819-01A1, NCI R33 Grant CA97513-01, Donna D. and Donald M. Lambert Lebow Fund to Cure Myeloma and Nancy and Steven Grand Foundation. We are grateful to the individuals whose gene expression data were used in this study. All MAQC-II participants freely donated their time and reagents for the completion and analyses of the MAQC-II project. The MAQC-II consortium also thanks R. O'Neill for his encouragement and coordination among FDA Centers on the formation of the RBWG. The MAQC-II consortium gratefully dedicates this work in memory of R.F. Wagner who enthusiastically worked on the MAQC-II project and inspired many of us until he unexpectedly passed away in June 2008.

Author information

Authors and Affiliations

National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas, USA
Leming Shi, Zhining Wen, Minjun Chen, Huixiao Hong, Roger G Perkins, James C Fuscoe, Weigong Ge, Stephen C Harris, Zhiguang Li, Jie Liu, Zhichao Liu, Baitang Ning, Qiang Shi, Brett Thorn, Lei Xu, Lun Yang, Min Zhang & Weida Tong
Center for Devices and Radiological Health, US Food and Drug Administration, Silver Spring, Maryland, USA
Gregory Campbell, Weijie Chen, Brandon D Gallas, Gene A Pennello, Reena Philip, Lakshmi Vishnuvajjala, Francisco Martinez-Murillo, Frank W Samuelson, Rong Tang, Zivana Tezak & Uwe Scherf
Expression Analysis Inc., Durham, North Carolina, USA
Wendell D Jones & Joel Parker
Department of Physiology and Biophysics and HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Medical College of Cornell University, New York, New York, USA
Fabien Campagne
Wake Forest Institute for Regenerative Medicine, Wake Forest University, Winston-Salem, North Carolina, USA
Stephen J Walker
Z-Tech, an ICF International Company at NCTR/FDA, Jefferson, Arkansas, USA
Zhenqiang Su, Hong Fang, Feng Qian, Dhivya Arasappan, Joseph Meehan & Joshua Xu
SAS Institute Inc., Cary, North Carolina, USA
Tzu-Ming Chu, Li Li, Wenjun Bao, Wendy Czika, Kelci Miclaus, Padraic Neville, Pei-Yi Tan & Russell D Wolfinger
Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, Maryland, USA
Federico M Goodsaid, Sue Jane Wang, Mat Soukup, Jialu Zhang & Li Zhang
Breast Medical Oncology Department, University of Texas (UT) M.D. Anderson Cancer Center, Houston, Texas, USA
Lajos Pusztai
Myeloma Institute for Research and Therapy, University of Arkansas for Medical Sciences, Little Rock, Arkansas, USA
John D Shaughnessy Jr, Bart Barlogie & Yiming Zhou
Department of Pediatric Oncology and Hematology and Center for Molecular Medicine (CMMC), University of Cologne, Cologne, Germany
André Oberthuer, Matthias Fischer, Frank Berthold & Yvonne Kahlert
The Hamner Institutes for Health Sciences, Research Triangle Park, North Carolina, USA
Russell S Thomas
National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, North Carolina, USA
Richard S Paules, Pierre R Bushel, Jeff Chou & Jennifer Fostel
Roche Palo Alto LLC, South San Francisco, California, USA
Mark Fielden
Biomedical Informatics Center, Northwestern University, Chicago, Illinois, USA
Pan Du & Simon M Lin
Fondazione Bruno Kessler, Povo-Trento, Italy
Cesare Furlanello, Giuseppe Jurman, Samantha Riccadonna & Roberto Visintainer
Department of Mathematics & Statistics, South Dakota State University, Brookings, South Dakota, USA
Xijin Ge
Department of Electrical and Computer Engineering, CMINDS Research Center, University of Massachusetts Lowell, Lowell, Massachusetts, USA
Dalila B Megherbi & Manuel Madera
Department of Pathology, UT M.D. Anderson Cancer Center, Houston, Texas, USA
W Fraser Symmans
Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, Georgia, USA
May D Wang, Richard A Moffitt, R Mitchell Parry, John H Phan & Todd H Stokes
Systems Analytics Inc., Waltham, Massachusetts, USA
John Zhang, Jun Luo, Eric Wang & Matthew Woods
Hoffmann-LaRoche, Nutley, New Jersey, USA
Hans Bitter
Department of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
Benedikt Brors, Dilafruz Juraeva, Roland Eils & Frank Westermann
Computational Life Science Cluster (CLiC), Chemical Biology Center (KBC), Umeå University, Umeå, Sweden
Max Bylesjo & Johan Trygg
GlaxoSmithKline, Collegeville, Pennsylvania, USA
Jie Cheng
Medical Systems Biology Research Center, School of Medicine, Tsinghua University, Beijing, China
Jing Cheng
Almac Diagnostics Ltd., Craigavon, UK
Timothy S Davison
Swiss Institute of Bioinformatics, Lausanne, Switzerland
Mauro Delorenzi & Vlad Popovici
Department of Biological Sciences, University of Southern Mississippi, Hattiesburg, Mississippi, USA
Youping Deng
Global Pharmaceutical R&D, Abbott Laboratories, Souderton, Pennsylvania, USA
Viswanath Devanarayan
National Center for Computational Toxicology, US Environmental Protection Agency, Research Triangle Park, North Carolina, USA
David J Dix, Fathi Elloumi, Richard Judson & Zhen Li
Department of Bioinformatics and Genomics, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
Joaquin Dopazo
HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Medical College of Cornell University, New York, New York, USA
Kevin C Dorff & Piali Mukherjee
Department of Operations Research and Financial Engineering, Princeton University, Princeton, New Jersey, USA
Jianqing Fan & Yang Feng
MOE Key Laboratory of Bioinformatics and Bioinformatics Division, TNLIST / Department of Automation, Tsinghua University, Beijing, China
Shicai Fan, Xuegong Zhang, Rui Jiang, Ying Liu & Lu Meng
Institute of Pharmaceutical Informatics, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang, China
Xiaohui Fan, Yiyu Cheng, Jianping Huang & Shao Li
Roche Palo Alto LLC, Palo Alto, California, USA
Nina Gonzaludo
Department of Biostatistics, UT M.D. Anderson Cancer Center, Houston, Texas, USA
Kenneth R Hess
Department of Electrical Engineering & Computer Science, University of Kansas, Lawrence, Kansas, USA
Jun Huan, Brian Quanz & Aaron Smalter
Department of Biostatistics, Johns Hopkins University, Baltimore, Maryland, USA
Rafael A Irizarry & Matthew N McCall
Center for Biologics Evaluation and Research, US Food and Drug Administration, Bethesda, Maryland, USA
Samir Lababidi, Jennifer G Catalano, Jing Han & Raj K Puri
Golden Helix Inc., Bozeman, Montana, USA
Christophe G Lambert
Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
Yanen Li
SABiosciences Corp., a Qiagen Company, Frederick, Maryland, USA
Guozhen Liu & Xiao Zeng
Cogenics, a Division of Clinical Data Inc., Morrisville, North Carolina, USA
Edward K Lobenhofer
Ligand Pharmaceuticals Inc., La Jolla, California, USA
Wen Luo
GeneGo Inc., Encinitas, California, USA
Yuri Nikolsky, Weiwei Shi, Richard J Brennan & Tatiana Nikolskaya
Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
Nathan D Price & Jaeyun Sung
Spheromics, Kontiolahti, Finland
Andreas Scherer
The Center for Bioinformatics and The Institute of Biomedical Sciences, School of Life Science, East China Normal University, Shanghai, China
Tieliu Shi, Chang Chang, Jian Cui, Junwei Wang & Chen Zhao
National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland, USA
Danielle Thierry-Mieg & Jean Thierry-Mieg
Rockefeller Research Laboratories, Memorial Sloan-Kettering Cancer Center, New York, New York, USA
Venkata Thodima
CapitalBio Corporation, Beijing, China
Jianping Wu, Liang Zhang, Sheng Zhu & Qinglan Sun
Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
Yichao Wu
SRA International (EMMES), Rockville, Maryland, USA
Qian Xie
Helwan University, Helwan, Egypt
Waleed A Yousef
Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
Sheng Zhong
Agilent Technologies Inc., Santa Clara, California, USA
Anne Bergstrom Lucas & Stephanie Fulmer-Smentek
F. Hoffmann-La Roche Ltd., Basel, Switzerland
Andreas Buness
Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, California, USA
Rong Chen
Department of Pathology and Laboratory Medicine and HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Medical College of Cornell University, New York, New York, USA
Francesca Demichelis
Cedars-Sinai Medical Center, UCLA David Geffen School of Medicine, Los Angeles, California, USA
Xutao Deng & Charles Wang
Vavilov Institute for General Genetics, Russian Academy of Sciences, Moscow, Russia
Damir Dosymbekov & Marina Tsyganova
DNAVision SA, Gosselies, Belgium
Laurent Gatto
École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
Darlene R Goldstein
State Key Laboratory of Multi-phase Complex Systems, Institute of Process Engineering, Chinese Academy of Sciences, Beijing, China
Li Guo
Abbott Laboratories, Abbott Park, Illinois, USA
Donald N Halbert
Nuvera Biosciences Inc., Woburn, Massachusetts, USA
Christos Hatzis
Winthrop P. Rockefeller Cancer Institute, University of Arkansas for Medical Sciences, Little Rock, Arkansas, USA
Damir Herman
Virginia Tech, Blacksburg, Virginia, USA
Roderick V Jensen
BioMath Solutions, LLC, Austin, Texas, USA
Charles D Johnson
Bioinformatic Program, University of Toledo, Toledo, Ohio, USA
Sadik A Khuder
Department of Mathematics, University of Bayreuth, Bayreuth, Germany
Matthias Kohl
Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina, USA
Jianying Li
Pediatric Department, Stanford University, Stanford, California, USA
Li Li
College of Chemistry, Sichuan University, Chengdu, Sichuan, China
Menglong Li
University of Texas Southwestern Medical Center (UTSW), Dallas, Texas, USA
Quan-Zhen Li
Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
Ignacio Medina & David Montaner
Millennium Pharmaceuticals Inc., Cambridge, Massachusetts, USA
George J Mulligan
RTI International, Atlanta, Georgia, USA
Grier P Page
Takeda Global R & D Center, Inc., Deerfield, Illinois, USA
Xuejun Peng
Novartis Institutes of Biomedical Research, Cambridge, Massachusetts, USA
Ron L Peterson
W.M. Keck Center for Collaborative Neuroscience, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
Yi Ren
Entelos Inc., Foster City, California, USA
Alan H Roter
Biomarker Development, Novartis Institutes of BioMedical Research, Novartis Pharma AG, Basel, Switzerland
Martin M Schumacher & Frank Staedtler
Genedata Inc., Lexington, Massachusetts, USA
Joseph D Shambaugh
Affymetrix Inc., Santa Clara, California, USA
Richard Shippy
Department of Chemistry and Chemical Engineering, Hefei Teachers College, Hefei, Anhui, China
Shengzhu Si
Institut Jules Bordet, Brussels, Belgium
Christos Sotiriou
Biostatistics, F. Hoffmann-La Roche Ltd., Basel, Switzerland
Guido Steiner
Lilly Singapore Centre for Drug Discovery, Immunos, Singapore
Yaron Turpaz
Microsoft Corporation, US Health Solutions Group, Redmond, Washington, USA
Silvia C Vega
Data Analysis Solutions DA-SOL GmbH, Greifenberg, Germany
Juergen von Frese
Cornell University, Ithaca, New York, USA
Wei Wang
Division of Pulmonary and Critical Care Medicine, Department of Medicine, University of Toledo Health Sciences Campus, Toledo, Ohio, USA
James C Willey
Bristol-Myers Squibb, Pennington, New Jersey, USA
Shujian Wu
OpGen Inc., Gaithersburg, Maryland, USA
Nianqing Xiao

Consortia

MAQC Consortium

Leming Shi, Gregory Campbell, Wendell D Jones, Fabien Campagne, Zhining Wen, Stephen J Walker, Zhenqiang Su, Tzu-Ming Chu, Federico M Goodsaid, Lajos Pusztai, John D Shaughnessy Jr, André Oberthuer, Russell S Thomas, Richard S Paules, Mark Fielden, Bart Barlogie, Weijie Chen, Pan Du, Matthias Fischer, Cesare Furlanello, Brandon D Gallas, Xijin Ge, Dalila B Megherbi, W Fraser Symmans, May D Wang, John Zhang, Hans Bitter, Benedikt Brors, Pierre R Bushel, Max Bylesjo, Minjun Chen, Jie Cheng, Jing Cheng, Jeff Chou, Timothy S Davison, Mauro Delorenzi, Youping Deng, Viswanath Devanarayan, David J Dix, Joaquin Dopazo, Kevin C Dorff, Fathi Elloumi, Jianqing Fan, Shicai Fan, Xiaohui Fan, Hong Fang, Nina Gonzaludo, Kenneth R Hess, Huixiao Hong, Jun Huan, Rafael A Irizarry, Richard Judson, Dilafruz Juraeva, Samir Lababidi, Christophe G Lambert, Li Li, Yanen Li, Zhen Li, Simon M Lin, Guozhen Liu, Edward K Lobenhofer, Jun Luo, Wen Luo, Matthew N McCall, Yuri Nikolsky, Gene A Pennello, Roger G Perkins, Reena Philip, Vlad Popovici, Nathan D Price, Feng Qian, Andreas Scherer, Tieliu Shi, Weiwei Shi, Jaeyun Sung, Danielle Thierry-Mieg, Jean Thierry-Mieg, Venkata Thodima, Johan Trygg, Lakshmi Vishnuvajjala, Sue Jane Wang, Jianping Wu, Yichao Wu, Qian Xie, Waleed A Yousef, Liang Zhang, Xuegong Zhang, Sheng Zhong, Yiming Zhou, Sheng Zhu, Dhivya Arasappan, Wenjun Bao, Anne Bergstrom Lucas, Frank Berthold, Richard J Brennan, Andreas Buness, Jennifer G Catalano, Chang Chang, Rong Chen, Yiyu Cheng, Jian Cui, Wendy Czika, Francesca Demichelis, Xutao Deng, Damir Dosymbekov, Roland Eils, Yang Feng, Jennifer Fostel, Stephanie Fulmer-Smentek, James C Fuscoe, Laurent Gatto, Weigong Ge, Darlene R Goldstein, Li Guo, Donald N Halbert, Jing Han, Stephen C Harris, Christos Hatzis, Damir Herman, Jianping Huang, Roderick V Jensen, Rui Jiang, Charles D Johnson, Giuseppe Jurman, Yvonne Kahlert, Sadik A Khuder, Matthias Kohl, Jianying Li, Li Li, Menglong Li, Quan-Zhen Li, Shao Li, Zhiguang Li, Jie Liu, Ying Liu, Zhichao Liu, Lu Meng, Manuel Madera, Francisco Martinez-Murillo, Ignacio Medina, Joseph Meehan, Kelci Miclaus, Richard A Moffitt, David Montaner, Piali Mukherjee, George J Mulligan, Padraic Neville, Tatiana Nikolskaya, Baitang Ning, Grier P Page, Joel Parker, R Mitchell Parry, Xuejun Peng, Ron L Peterson, John H Phan, Brian Quanz, Yi Ren, Samantha Riccadonna, Alan H Roter, Frank W Samuelson, Martin M Schumacher, Joseph D Shambaugh, Qiang Shi, Richard Shippy, Shengzhu Si, Aaron Smalter, Christos Sotiriou, Mat Soukup, Frank Staedtler, Guido Steiner, Todd H Stokes, Qinglan Sun, Pei-Yi Tan, Rong Tang, Zivana Tezak, Brett Thorn, Marina Tsyganova, Yaron Turpaz, Silvia C Vega, Roberto Visintainer, Juergen von Frese, Charles Wang, Eric Wang, Junwei Wang, Wei Wang, Frank Westermann, James C Willey, Matthew Woods, Shujian Wu, Nianqing Xiao, Joshua Xu, Lei Xu, Lun Yang, Xiao Zeng, Jialu Zhang, Li Zhang, Min Zhang, Chen Zhao, Raj K Puri, Uwe Scherf, Weida Tong & Russell D Wolfinger

Corresponding author

Correspondence to Leming Shi.

Ethics declarations

Competing interests

Many of the MAQC-II participants are employed by companies that manufacture gene expression products and/or perform testing services.

Supplementary information

Supplementary Text and Figures

Supplementary Tables 3–8, Supplementary Data and Supplementary Figs. 1–13 (PDF 4568 kb)

Supplementary Table 1

UniqueModels19779_PerformanceMetrics (XLS 14906 kb)

Supplementary Table 2

Swap_UniqueModels13287_PerformanceMetrics (XLS 12587 kb)

About this article

Cite this article

MAQC Consortium. The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models. Nat Biotechnol 28, 827–838 (2010). https://doi.org/10.1038/nbt.1665

Received: 02 March 2010
Accepted: 30 June 2010
Published: 30 July 2010
Issue Date: August 2010
DOI: https://doi.org/10.1038/nbt.1665
