Nature Publishing Group, publisher of Nature, and other science journals and reference works NATURE.COM NATURE NEWS NATUREJOBS NATUREEVENTS ABOUT NPG
Help Nature.com site index  
The Pharmacogenomics Journal
SEARCH     advanced search my account e-alerts subscribe register
Journal home
Advance online publication
Current issue
Archive
Press releases
For authors
For referees
Contact editorial office
About the journal
For librarians
Subscribe
Advertising
naturereprints
Contact NPG
Customer services
Site features
NPG Subject areas
Access material from all our publications in your subject area:
Biotechnology Biotechnology
Cancer Cancer
Chemistry Chemistry
Dentistry Dentistry
Development Development
Drug Discovery Drug Discovery
Earth Sciences Earth Sciences
Evolution & Ecology Evolution & Ecology
Genetics Genetics
Immunology Immunology
Materials Materials Science
Medical Research Medical Research
Microbiology Microbiology
Molecular Cell Biology Molecular Cell Biology
Neuroscience Neuroscience
Pharmacology Pharmacology
Physics Physics
Browse all publications
 
2002, Volume 2, Number 4, Pages 259-271
Table of contents    Previous  Abstract  Next   Full text  PDF
Original Article
Pharmacogenomic analysis: correlating molecular substructure classes with microarray gene expression data
P E Blower1, C Yang1, M A Fligner2, J S Verducci2, L Yu1, S Richman3 and J N Weinstein3

1Leadscope Inc, Columbus, OH, USA

2Department of Statistics, The Ohio State University, Columbus, OH, USA

3Laboratory of Molecular Pharmacology, Division of Basic Sciences, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA

Correspondence to: P E Blower, Leadscope Inc, 1275 Kinnear Road, Columbus, OH 43212, USA Tel: 001 641 675 376 E-mail: Pblower@leadscope.com http://www.leadscope.com or J N Weinstein, Building 37, Room 5068, NIH, 9000 Rockville Pike, Bethesda, MD 20892, USA Tel: 001 301 496 9571 E-mail: jw4i@nih.gov http://discover.nci.nih.gov

Abstract

Genomic studies are producing large databases of molecular information on cancers and other cell and tissue types. Hence, we have the opportunity to link these accumulating data to the drug discovery processes. Our previous efforts at 'information-intensive' molecular pharmacology have focused on the relationship between patterns of gene expression and patterns of drug activity. In the present study, we take the process a step further¾relating gene expression patterns, not just to the drugs as entities, but to ~27 000 substructures and other chemical features within the drugs. This coupling of genomic information with structure-based data mining can be used to identify classes of compounds for which detailed experimental structure-activity studies may be fruitful. Using a systematic substructure analysis coupled with statistical correlations of compound activity with differential gene expression, we have identified two subclasses of quinones whose patterns of activity in the National Cancer Institute's 60-cell line screening panel (NCI-60) correlate strongly with the expression patterns of particular genes: (i) The growth inhibitory patterns of an electron-withdrawing subclass of benzodithiophenedione-containing compounds over the NCI-60 are highly correlated with the expression patterns of Rab7 and other melanoma-specific genes; (ii) the inhibitory patterns of indolonaphthoquinone-containing compounds are highly correlated with the expression patterns of the hematopoietic lineage-specific gene HS1 and other leukemia genes. As illustrated by these proof-of-principle examples, we introduce here a set of conceptual tools and fluent computational methods for projecting directly from gene expression patterns to drug substructures and vice versa. The analysis is presented in terms of the NCI-60 cell lines and microarray-based gene expression patterns, but the concept and methods are broadly applicable to other large-scale pharmacogenomic database sets as well. The approach (SAT for Structure-Activity-Target) provides a systematic way to mine databases for the design of further structure-activity studies, particularly to aid in target and lead identification.

The Pharmacogenomics Journal (2002) 2, 259-271. doi:10.1038/sj.tpj.6500116

Keywords

molecular class; NCI cell lines; statistical correlation

Received 7 January 2002; accepted 8 April 2002
2002, Volume 2, Number 4, Pages 259-271
Table of contents    Previous  Abstract  Next   Full text  PDF
Privacy Policy © 2002 Nature Publishing Group