Abstract
Microarray technology makes it possible to simultaneously study the expression of thousands of genes during a single experiment. We have developed an information system, ArrayDB, to manage and analyse large-scale expression data. The underlying relational database was designed to allow flexibility in the nature and structure of data input and also in the generation of standard or customized reports through a web-browser interface. ArrayDB provides varied options for data retrieval and analysis tools that should facilitate the interpretation of complex hybridization results. A sampling of ArrayDB storage, retrieval and analysis capabilities is available (http://www.nhgri.nih.gov/DIR/LCG/15K/HTML/), along with information on a set of approximately 15,000 genes used to fabricate several widely used microarrays. Information stored in ArrayDB is used to provide integrated gene expression reports by linking array target sequences with NCBI's Entrez retrieval system, UniGene and KEGG pathway views. The integration of external information resources is essential in interpreting intrinsic patterns and relationships in large-scale gene expression data.
Acknowledgements
We thank M. Eisen, P. Brown and J. Hudson for stimulating discussions. We also thank J. Hudson and Research Genetics for re-arraying the 10K/15K clone sets from their original IMAGE cDNA libraries.
Cite this article
Ermolaeva, O., Rastogi, M., Pruitt, K. et al. Data management and analysis for gene expression arrays. Nat Genet 20, 19–23 (1998). https://doi.org/10.1038/1670