To facilitate collaborative research efforts between multi-investigator teams using DNA microarrays, we identified sources of error and data variability between laboratories and across microarray platforms, and methods to accommodate this variability. RNA expression data were generated in seven laboratories, which compared two standard RNA samples using 12 microarray platforms. At least two standard microarray types (one spotted, one commercial) were used by all laboratories. Reproducibility for most platforms within any laboratory was typically good, but reproducibility between platforms and across laboratories was generally poor. Reproducibility between laboratories increased markedly when standardized protocols were implemented for RNA labeling, hybridization, microarray processing, data acquisition and data normalization. Reproducibility was highest when analysis was based on biological themes defined by enriched Gene Ontology (GO) categories. These findings indicate that microarray results can be comparable across multiple laboratories, especially when a common platform and set of procedures are used.
Access optionsAccess options
Subscribe to Journal
Get full journal access for 1 year
only $18.92 per issue
All prices are NET prices.
VAT will be added later in the checkout.
Rent or Buy article
Get time limited or full article access on ReadCube.
All prices are NET prices.
Gene Expression Omnibus
We thank J. Quackenbush from The Institute for Genomic Research, L. Hartwell from Fred Hutchinson Cancer Research Center and R. Wolfinger from the SAS Institute for their scientific contributions. We thank K.J. Yost (Science Applications International) and P. Cozart (NIEHS ITSS) for their information technology support. Research support was provided by National Institutes of Environmental Health Sciences grants ES11375, ES11384, ES11387, ES11391 and ES11399, and Contract # N01-ES-25497.
Clustering of laboratory/platform combinations based on log ratio values associated with the common genes.
Within and between laboratory median Pearson correlation coefficients of log intensities from standard array experiments.
Within and between laboratory median Pearson correlation coefficients of log ratios (LvsP) for standard array experiments using different preprocessing.
Common Gene Elements Across All Platforms (Standard and Resident Arrays): Mapping to NIA NAP Clusters.
Percent overlap of significantly induced and repressed genes across laboratories for the Dataset D and Dataset C and number of gene transcripts identified as differentially expressed across laboratories for Dataset D and Dataset C.
Percentage of the functionally-enriched GO Nodes that demonstrate different levels of concordance within and between branches of the clustering dendrogram.
About this article
BMC Bioinformatics (2011)