DPDR-CPI, a server that predicts Drug Positioning and Drug Repositioning via Chemical-Protein Interactome

Luo, Heng; Zhang, Ping; Cao, Xi Hang; Du, Dizheng; Ye, Hao; Huang, Hui; Li, Can; Qin, Shengying; Wan, Chunling; Shi, Leming; He, Lin; Yang, Lun

doi:10.1038/srep35996

Download PDF

Article
Open access
Published: 02 November 2016

DPDR-CPI, a server that predicts Drug Positioning and Drug Repositioning via Chemical-Protein Interactome

Heng Luo¹^na1^na2^na3,
Ping Zhang²^na1^na2^na3,
Xi Hang Cao³^na1^na2^na3,
Dizheng Du¹^na1^na2^na3,
Hao Ye¹^na1^na2^na3,
Hui Huang¹^na1^na2^na3,
Can Li¹^na1^na2^na3,
Shengying Qin¹^na1^na2^na3,
Chunling Wan¹^na1^na2^na3,
Leming Shi⁴^na1^na2^na3,
Lin He¹^na1^na2^na3 &
…
Lun Yang¹^na1^na2^na3

Scientific Reports volume 6, Article number: 35996 (2016) Cite this article

4309 Accesses
18 Citations
2 Altmetric
Metrics details

Subjects

Abstract

The cost of developing a new drug has increased sharply over the past years. To ensure a reasonable return-on-investment, it is useful for drug discovery researchers in both industry and academia to identify all the possible indications for early pipeline molecules. For the first time, we propose the term computational “drug candidate positioning” or “drug positioning”, to describe the above process. It is distinct from drug repositioning, which identifies new uses for existing drugs and maximizes their value. Since many therapeutic effects are mediated by unexpected drug-protein interactions, it is reasonable to analyze the chemical-protein interactome (CPI) profiles to predict indications. Here we introduce the server DPDR-CPI, which can make real-time predictions based only on the structure of the small molecule. When a user submits a molecule, the server will dock it across 611 human proteins, generating a CPI profile of features that can be used for predictions. It can suggest the likelihood of relevance of the input molecule towards ~1,000 human diseases with top predictions listed. DPDR-CPI achieved an overall AUROC of 0.78 during 10-fold cross-validations and AUROC of 0.76 for the independent validation. The server is freely accessible via http://cpi.bio-x.cn/dpdr/.

DrugRepo: a novel approach to repurposing drugs based on chemical and genomic features

Article Open access 07 December 2022

Yinyin Wang, Jehad Aldahdooh, … Ziaurrehman Tanoli

DrugRep: an automatic virtual screening server for drug repurposing

Article 10 October 2022

Jian-hong Gan, Ji-xiang Liu, … Yang Cao

Structure-based assessment and druggability classification of protein–protein interaction sites

Article Open access 13 May 2022

Lara Alzyoud, Richard A. Bryce, … Mohammad A. Ghattas

Introduction

The cost of developing a new drug increased from $0.8 billion in 2003 to $2.6 billion in 2014¹. It was estimated that only one drug compound was approved for market use after screening, selection and trials from a large number of compounds within 10–17 years^2,3. The research and development (R&D) costs of new drugs are increasing while the number of annual approved new drugs has not changed much⁴. Therefore, it is important for drug developers in industry or academia to identify all possible indications for their pipeline molecules, i.e., positioning the molecule towards the best possible indications as early as possible. Even if there is no clinical or animal data available for the molecule, which is usually the case at early stages of the pipeline, potential indications should be identified. Here, for the first time, we propose this indication prioritization process as “drug candidate positioning”, or “drug positioning”, which differentiates with “drug repositioning” and could be one of the essential steps in the future R&D strategy. On the other hand, drug repositioning, i.e., identifying new uses for existing drugs³, also could maximize the market value of the existing drugs⁵. For both positioning and repositioning, the process of computational indication prediction is essential.

Many computational methods have been developed for drug repositioning, including structure-based prediction⁶, side-effect-based approach^7,8, networks^9,10,11, gene expression analysis^{12,13,14,15,16} and text mining¹⁷. Some studies combined various data types to get improved prediction performance^18,19. Servers that utilize descriptors^20,21, gene expressions^13,22 and multiple data types¹¹ were developed. Most of the above methods require data and knowledge that have already been generated, such as the associated drug targets, drug labels, gene expression profiles and side-effects, many of which are only applicable to the late-stage or marketed drugs but not to early pipeline molecules. Therefore, they are not available to support drug candidate positioning. During our previous studies, we address this issue by constructing the in silico chemical-protein interactome (CPI)^{6,23,24,25,26,27}, based on which the DRAR-CPI was developed⁶. The server requires the user submission of a molecular structure via the web interface, and then a CPI profile will be constructed for indication prediction. The CPI profile will be compared against the profiles of our library drugs and potential indications will be suggested based on profile similarities. It has helped different groups of researchers to identify putative targets and potential indications for their molecules^28,29,30,31. However, the server was developed five years ago and it has two major limitations: (a) the number of predicted indications are limited and biased because of the limited drug library in our server and (b) the indication prediction is based on an unsupervised method, which does not utilize a training process to optimize the prediction for each indication. Therefore, we introduce an upgraded version of the server, DPDR-CPI, to predict drug candidate positioning and drug repositioning via CPI. It can accept a small molecule in major formats, including MOL, MOL2, PDB, SDF and SMILES, and predict its potential indications across 963 diseases using machine learning models. The performances were validated using a blinded independent validation–the model was trained at one institution and validated another institution. It achieved an area under the receiver operating characteristic curve (AUROC) of 0.78 during 10-fold cross-validations. The server will also suggest putative targets and their docking conformations based on a faster and more accurate docking program so that the users can explore the rationale of the predicted indications³².

Results and Discussion

Model evaluation

The training set and the independent validation set both contain 628 drugs and 638 ICD-9 disease indications belonging to 328 ICD-9 disease families (Supplementary Tables S1 and S2). For the 10-fold cross-validations of the training set under global metrics, the models obtained an AUROC of 0.752 for the 638 ICD-9 disease indications and 0.760 for the 328 ICD-9 disease families. The server-side models were trained using the combination of both the training set and the independent validation set (called entire dataset). They reached an AUROC of 0.782 for the 638 ICD-9 disease indications and 0.783 for the 328 ICD-9 disease families. Other measurements, including accuracy, precision, sensitivity, specificity and area under the precision-recall curve (AUPR), are shown in Table 1.

Table 1 Performance evaluation of DPDR-CPI using the entire dataset versus the training set during 10-fold cross-validations.

Full size table

For the independent validation, we compared two types of prediction methods: (1) logistic regressions based on E-state, Extended Connectivity Fingerprint (ECFP)-6, Functional-Class Fingerprints (FCFP)-6, FP4, Klekota-Roth method, MACCS and PubChem structural descriptors (called LR-E-state, LR-ECFP6, LR-FCFP6, LR-FP4, LR-KR, LR-MACCS and LR-PubChem, respectively)³³, and (2) DPDR-CPI proposed in this paper that analyzes CPI profiles to predict indications. For the 638 ICD-9 disease indications as endpoints, the comparisons of receiver operating characteristic (ROC) curves and precision-recall curves under global metrics are shown in Fig. 1. All evaluation measurements including global, drug-centric and disease-centric metrics are summarized in Table 2. We see the DPDR-CPI obtained the best overall performance with an AUROC of 0.764 during the independent validation.

Table 2 Performance comparisons of the different structural descriptor-based methods and DPDR-CPI using 638 endpoints of ICD-9 disease indications on the independent validation data.

Full size table

Likewise, we used 328 ICD-9 disease families as endpoints and compared the structural descriptor-based methods and DPDR-CPI. The ROC and precision-recall curves are shown in Supplementary Figure S1 and evaluation measurements are attached in Supplementary Table S3. From either ICD-9 diseases or ICD-9 disease families, the independent validation showed that our CPI-based method generally outperformed structural descriptor-based methods. DPDR-CPI achieved a reasonably good overall performance and can be utilized for drug candidate positioning and repositioning purposes.

The CPI is an in silico atomistic prediction of drug-protein binding data. Though some studies utilized experimental drug-protein binding data to predict drug indications and demonstrated good prediction performances^18,19,34, such information is limited for new or pipeline drug candidates. Though our CPI may not be as accurate as the experimental binding data, it has the advantage to make predictions for new or pipeline drug candidates. Since obtaining the wet-lab binding data can be both costly and time-consuming, we believe our CPI provides a fast, low-cost and useful solution for drug candidate positioning.

Another advantage of our CPI approach is the consideration of potential off-target binding effects, which are important to the discovery of new indications. The 611 targets in our library consist of both pharmacokinetic (PK) and pharmacodynamic (PD) proteins serving as a reasonable distribution of off-targets. The features provided by off-target binding effects can be used to identify drug indications even if the on-target does not exist in the library. For example, Rolapitant is a neurokinin-1 (NK-1) receptor antagonist that can treat vomiting. Even though its target NK-1 is not included in our library, we submitted the molecule to our DPDR-CPI server and found its indication ranked to top second with a high confidence value of 0.85.

Since drugs in the independent validation set may have similar structures to some of the drugs used in the training set, to reduce such impact, we removed the drugs from the independent validation set which have a Tanimoto similarity >0.7¹⁸ towards any drug in the training set. The new results of independent validation are shown in Supplementary Tables S4 and S5. We see that after removing the similar drugs, the AUROC of DPDR-CPI slightly dropped by 0.02~0.03, indicating the performance of our method is not mainly contributed by structural similar drugs.

Case study 1: drug candidate positioning for parogrelil

It is important to make early decisions of the indication prioritization for the pipeline molecules, so that the developers could choose the best indications with unmet needs, clinical developability and return on investment. Here we found an investigational molecule, “NM-702”, originally developed for peripheral vascular disease³⁵ (http://www.drugbank.ca/drugs/DB05505), and submitted it the DPDR-CPI server. The server successfully picked up this indication (Table 3) as the third rated one and all the top four predictions were relevant to the same disease category (cardiovascular diseases). Among these four top predictions, we believe that the second prediction, cerebral arterial occlusion, is a highly unmet need and should be considered. Acute stroke is caused by cerebral arterial occlusion and can lead to brain infarction³⁶. Stroke is the fifth most common cause of death and the most frequent causes of disability in the US³⁷. Therefore, by using the DPDR-CPI server, the drug developer could have positioned this drug candidate into the second indication and compared efficacy for both indications in the respective animal models. We believe the server provides drug developers an opportunity to choose the most promising indication for further development, such as deciding whether to pursue it for a higher unmet need (cerebral artery occlusion) or continuing its original designated indication, along with the atomistic docking model to help make sense of the additional targets.

Table 3 Drug candidate positioning prediction for NM-702 using the DPDR-CPI server.

Full size table

From the case study, we see that the DPDR-CPI server can identify the best indications for a compound based only on its molecular structure, which is very important to the pharmaceutical industry since it supports a rapid high throughput approach. Though our work is based on the in silico docking approach, which has been extensively used for virtual screening and target identification in the past decades, the purposes of this work include drug candidate positioning as an important application.

Case study 2: drug repositioning for rosiglitazone

Rosiglitazone is an anti-diabetic drug which has been on the market for years. We would like to know whether our server is able to expand its indications for possible new uses. We submitted its structure to the server and found our server successfully identified its original indications, hypoglycemia and diabetes mellitus, as the top two predictions (Table 4). Some other reported new uses, such as disorders of fatty acid oxidation³⁸ and Alzheimer’s disease³⁹, are also prioritized by the server. Among the top predictions, retinal disorders and glaucoma are also listed. It is reported that rosiglitazone is a potential neuroprotectant for retinal cells and may increase the retinal cell survival⁴⁰. It may also delay the onset of proliferative diabetic retinopathy⁴¹. In addition, the drug was found useful after glaucoma filtration surgery for anti-fibrotic activity⁴². Therefore, in concordance with the literature reports, the atomistic based prediction results suggested that it is possible to expand rosiglitazone for eye disease treatments.

Table 4 Top disease predictions for rosiglitazone from the server.

Full size table

We also look at the binding target predictions for rosiglitazone and found monoamine oxidase A (MAO-A) is ranked in the top three. It was reported that rosiglitazone is an inhibitor for MAO-A⁴³, a drug target for neuroprotective therapy⁴⁴. Such prediction provides possible biologic clues for rosiglitazone’s neuroprotective effects towards retinal cells, and may help to discover its potential uses and mechanisms for treating eye diseases.

Conclusion

The DPDR-CPI server is able to produce indication predictions for a user molecule towards ~1,000 human diseases, providing suggestions for drug candidate positioning and drug repositioning. It has the potential to improve the drug development pipeline in terms of indication prioritization even for molecules in the early R&D stage.

Methods

Preparation of the training set

We included 2,515 drug molecules, 611 ligand-bindable target structures and their CPI from our previous study²⁴. The 2,515 molecules were collected from DrugBank⁴⁵ and STITCH⁴⁶, of which 85% are FDA-approved drugs. The 611 target structures contains 239 PK proteins and 372 PD proteins collected from Protein Data Bank (PDB)⁴⁷ and PDBBind⁴⁸. Though the targets were harvested from a project for drug-drug interaction prediction, we still believe they can serve as potential off-target binding features for drug indication prediction. The in silico interactome of these 2,515 molecules across 611 targets was generated using AutoDock Vina³².

We chose MEDication Indication resource (MEDI)⁴⁹ as a gold standard for drug indications since it contains the largest number of indications (4,352 diseases) among the existing drug-indication databases⁵⁰ and it uses International Classification of Disease (ICD)-9 codes (2014 version) to represent diseases. We mapped the 3,112 drugs from MEDI to DrugBank using DrugBank synonym rules, and identified 1,256 common drugs that exist both in MEDI and our CPI (Supplementary Table S1). The docking scores of 1,256 common drugs against the 611 targets were used as features for our machine learning models, and the disease indications are considered as endpoints.

We filtered the endpoints according to the following criteria: (a) we removed the endpoints containing ICD-9 codes from 780 to 999 since they are related to symptoms, injuries or poisoning which are less of interest; (b) we removed the endpoints that can be treated by less than five drugs due to the fact that the positive samples are too few in those cases. Afterwards, we got 963 ICD-9 disease indications which belong to 424 ICD-9 families (Supplementary Table S2). For each drug-indication pair, if the drug is reported to treat the indication in MEDI, it is labeled as “1” (positive), otherwise, “0” (negative). Finally, the dataset was converted to a matrix containing 1,256 drugs as rows and 611 target-binding features as predictor variables with 963 ICD-9 diseases and 424 ICD-9 disease families as dependent variables or endpoints.

Model training and evaluation

To evaluate an indication prediction method for multiple drugs to multiple diseases, there are three possible approaches- (1) Global metrics: one can merge the prediction scores for all drugs over all diseases, and then compute the overall evaluation result; (2) Drug-centric metrics: one can compute an evaluation result for each drug and then average the results over all drugs to obtain an overall score; (3) Disease-centric metrics: one can compute an evaluation result for each disease and then average the results over all diseases to obtain an overall score. In this study, global metrics were used during the model training and cross-validation. All three evaluation approaches were implemented during the independent validation.

The workflow of the model training and prediction is shown in Fig. 2. We randomly split the original dataset into two equal parts, one half serving as training set, and the other half as independent validation set. We filtered the diseases that have fewer than five associated drugs in the new training set to ensure each endpoint has at least five positive samples. After the filtering process, we ended up having 638 ICD-9 individual diseases and 328 disease families. We treated the indication prediction task as a binary classification problem and constructed separate classifiers for each disease. A comparison of Naïve Bayes, logistic regression and random forest models showed comparable efficiency and accuracy of predictions on our training data, so we chose logistic regression for the DPDR-CPI server. The models were set up with L2-regularization which gives an increasing penalty as model complexity increases to prevent overfitting. Models were constructed using Python 2.7 and the Scikit-Learn package⁵¹ and evaluated with 10-fold cross-validation. Cross-validation experiments were repeated 100 times to get a mean and a standard deviation of the AUROCs and the AUPRs and the accuracy, precision, sensitivity, and specificity measures were calculated based on a prediction threshold when the maximum F-score (harmonic mean of precision and recall) was achieved.

Then we assessed the models on the independent validation data by using global metrics, drug-centric metrics, and disease-centric metrics. Since this independent dataset was not included anywhere in the training, we used it as a gold standard to evaluate our method. To compare our method against structural descriptor-based methods, we generated the E-state, ECFP6, FCFP6, FP4, Klekota-Roth, MACCS and PubChem³³ fingerprints for all the drugs. The E-State, ECFP6, MACCS, and PubChem fingerprints were generated using rcdk package 3.3.2⁵² in R 3.1.3, FP4 fingerprints were produced by Open Babel 2.3.2⁵³ and FCFP6 and Klekota-Roth fingerprints were generated via RDKit version 2016-06-30 in Anaconda Python 2.7.12. We built models based on the descriptor features following the same procedure above and compared the methods during the independent validation.

We also utilized all the data, including both the training and validation sets, to train comprehensive models to run on the server-side for predictions. The parameters and thresholds were determined using the exact cross-validation procedure described above. In order to make the scores comparable across different diseases for ranking purposes, we used an Empirical Bayes method⁵⁴ to normalize prediction scores of the same drug across all endpoints (i.e., diseases). To explain this process, consider a particular drug i and divide the diseases into two groups: group 1 includes diseases which can be treated by drug i, and group 0 includes diseases which cannot be treated by drug i. For a disease j, y_j is the predicted score generated from the models. We use the confidence of disease j belonging to Group 1 (i.e. the probability of the disease belongs to Group 1 based on all predicted scores for drug i) as the normalized value.

According to the Bayes’s rule,

Here P(·) denotes the probability of an event. G₁ and G₀ denote the events of belonging to Group 1 and Group 0, respectively, and y_j|G₁ denotes the event of observing y_j when the disease belongs to Group 1. We obtain the probabilities on the right-hand side of the formula from empirical distributions. P(G₀) and P(G₁) are the prior probabilities of a disease from Group 0 and Group 1, respectively. P(G₀) is the proportion of diseases that cannot be treated by the drug from the training data, and P(G₁) is the proportion of diseases that can be treated by the drug from the training data. Let P(y_j|G₀) denotes the probability density from the distribution of predicted scores of diseases from Group 0 based on the training data. P(y_j|G₁) is the probability density from the distribution of predicted scores of diseases from Group 1 based on the training data. After obtaining all values on the right-hand side of the formula, the normalized score is calculated. Since the probabilities on the right-hand side are obtained from for each drug, the normalized scores of diseases are comparable within each drug.

Server workflow

The overall workflow of the server is shown in Fig. 3. Users can submit a molecular file in the following formats: MOL, MOL2, PDB, SDF and SMILES. A JSME Molecule Editor⁵⁵ is also provided for the user to sketch a molecule. We utilize Molconvert 14.8.18.0 from Marvin Beans (https://www.chemaxon.com) and AutoDock Tools 1.5.4⁵⁶ to convert the 2D molecular structure to 3D PDBQT file with Gasteiger charges. A small molecule, naphthylamine, is provided for a quick test of the server. Our server is designed to dock small drug-like molecules so it may fail or generate inaccurate results for molecules that are larger than 900 Daltons, such as peptides and natural products, or small inorganic molecules that do not contain any rotatable bonds. When the molecule file is submitted, it is added to the queue to be docked by AutoDock Vina³² against the 611 targets with default parameters. The docking scores and poses with the lowest energy scores are extracted and sent to the machine learning models for indication prediction. A typical calculation task usually takes minutes to hours, depending on how complicated the input molecule is. The user can choose to view the ongoing process online as it executes, bookmark the task link and return later, or leave an email address and wait for a notice.

The following results will be provided when a task is complete:

1
The predicted indications from 963 ICD-9 indications of 424 ICD-9 disease families along with confidence values. The indication table is organized as a tree-like structure based on ICD-9 code hierarchy and ranked by the ICD-9 family confidence values.
2
The binding scores and structures of the user molecule towards the 611 library targets. The interaction patterns can be visualized online via JSMol (http://www.jmol.org) and the target residues within 6.4 Å distance²³ from the ligand are highlighted.

Disclaimer

This server is only for research purposes and the authors and their organizations are excluded from all liability for any costs, claims, expenses, charges, losses, damages or penalties of any kind incurred directly or indirectly arising from the use of this server.

Additional Information

How to cite this article: Luo, H. et al. DPDR-CPI, a server that predicts Drug Positioning and Drug Repositioning via Chemical-Protein Interactome. Sci. Rep. 6, 35996; doi: 10.1038/srep35996 (2016).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Avorn, J. The $2.6 billion pill–methodologic and policy considerations. N. Engl. J. Med. 372, 1877–1879, doi: 10.1056/NEJMp1500848 (2015).
Article CAS PubMed Google Scholar
Barratt, M. J. & Frail, D. E. In Drug repositioning: Bringing new life to shelved assets and existing drugs 66 (John Wiley & Sons, 2012).
Ashburn, T. T. & Thor, K. B. Drug repositioning: identifying and developing new uses for existing drugs. Nat. Rev. Drug Discov. 3, 673–683, doi: 10.1038/nrd1468 (2004).
Article CAS PubMed Google Scholar
Riveraa, C., Trumanb, K., Parkera, T. & Palmisanoa, S. Trends in new drug approvals and clinical trial publications over a 2-decade interval. 10th Annual Meeting of International Society for Medical Publication Professionals, 33 (2014).
Li, Y. Y. & Jones, S. J. Drug repositioning for personalized medicine. Genome Med. 4, 27, doi: 10.1186/gm326 (2012).
Article CAS PubMed PubMed Central Google Scholar
Luo, H. et al. DRAR-CPI: a server for identifying drug repositioning potential and adverse drug reactions via the chemical-protein interactome. Nucleic Acids Res. 39, W492–W498, doi: 10.1093/nar/gkr299 (2011).
Article CAS PubMed PubMed Central Google Scholar
Yang, L. & Agarwal, P. Systematic drug repositioning based on clinical side-effects. PLoS One 6, e28025, doi: 10.1371/journal.pone.0028025 (2011).
Article CAS ADS PubMed PubMed Central Google Scholar
Ye, H., Liu, Q. & Wei, J. Construction of drug network based on side effects and its application for drug repositioning. PLoS One 9, e87864, doi: 10.1371/journal.pone.0087864 (2014).
Article CAS ADS PubMed PubMed Central Google Scholar
Wu, Z., Wang, Y. & Chen, L. Network-based drug repositioning. Molecular bioSystems 9, 1268–1281, doi: 10.1039/c3mb25382a (2013).
Article CAS PubMed Google Scholar
Cheng, F., Zhou, Y., Li, W., Liu, G. & Tang, Y. Prediction of chemical-protein interactions network with weighted network-based inference method. PLoS One 7, e41064, doi: 10.1371/journal.pone.0041064 (2012).
Article CAS ADS PubMed PubMed Central Google Scholar
von Eichborn, J. et al. PROMISCUOUS: a database for network-based drug-repositioning. Nucleic Acids Res. 39, D1060–D1066, doi: 10.1093/nar/gkq1037 (2011).
Article CAS PubMed Google Scholar
Wang, K. et al. Prediction of drug-target interactions for drug repositioning only based on genomic expression similarity. PLoS Comput. Biol. 9, e1003315, doi: 10.1371/journal.pcbi.1003315 (2013).
Article CAS PubMed PubMed Central Google Scholar
Brown, A. S., Kong, S. W., Kohane, I. S. & Patel, C. J. ksRepo: a generalized platform for computational drug repositioning. BMC Bioinformatics 17, 78, doi: 10.1186/s12859-016-0931-y (2016).
Article CAS PubMed Google Scholar
Iorio, F. et al. Discovery of drug mode of action and drug repositioning from transcriptional responses. Proc. Natl. Acad. Sci. USA 107, 14621–14626, doi: 10.1073/pnas.1000138107 (2010).
Article ADS PubMed PubMed Central Google Scholar
Sirota, M. et al. Discovery and preclinical validation of drug indications using compendia of public gene expression data. Sci. Transl. Med. 3, 96ra77, doi: 10.1126/scitranslmed.3001318 (2011).
Article CAS PubMed PubMed Central Google Scholar
Iskar, M. et al. Characterization of drug-induced transcriptional modules: towards drug repositioning and functional understanding. Mol. Syst. Biol. 9, 662, doi: 10.1038/msb.2013.20 (2013).
Article CAS PubMed PubMed Central Google Scholar
Bisgin, H. et al. Investigating drug repositioning opportunities in FDA drug labels through topic modeling. BMC Bioinformatics 13 Suppl 15, S6, doi: 10.1186/1471-2105-13-S15-S6 (2012).
Article PubMed PubMed Central Google Scholar
Gottlieb, A., Stein, G. Y., Ruppin, E. & Sharan, R. PREDICT: a method for inferring novel drug indications with application to personalized medicine. Mol. Syst. Biol. 7, 496, doi: 10.1038/msb.2011.26 (2011).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y., Chen, S., Deng, N. & Wang, Y. Drug repositioning by kernel-based integration of molecular structure, molecular activity, and phenotype data. PLoS One 8, e78518, doi: 10.1371/journal.pone.0078518 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Li, G. H. & Huang, J. F. CDRUG: a web server for predicting anticancer activity of chemical compounds. Bioinformatics 28, 3334–3335, doi: 10.1093/bioinformatics/bts625 (2012).
Article CAS MathSciNet PubMed Google Scholar
Sakakibara, Y. et al. COPICAT: a software system for predicting interactions between proteins and chemical compounds. Bioinformatics 28, 745–746, doi: 10.1093/bioinformatics/bts031 (2012).
Article CAS PubMed Google Scholar
Laenen, G., Ardeshirdavani, A., Moreau, Y. & Thorrez, L. Galahad: a web server for drug effect analysis from gene expression. Nucleic Acids Res. 43, W208–W212, doi: 10.1093/nar/gkv436 (2015).
Article CAS PubMed PubMed Central Google Scholar
Yang, L., Luo, H., Chen, J., Xing, Q. & He, L. SePreSA: a server for the prediction of populations susceptible to serious adverse drug reactions implementing the methodology of a chemical-protein interactome. Nucleic Acids Res. 37, W406–W412, doi: 10.1093/nar/gkp312 (2009).
Article CAS PubMed PubMed Central Google Scholar
Luo, H. et al. DDI-CPI, a server that predicts drug-drug interactions through implementing the chemical-protein interactome. Nucleic Acids Res. 42, W46–W52, doi: 10.1093/nar/gku433 (2014).
Article CAS PubMed PubMed Central Google Scholar
Yang, L. et al. Exploring off-targets and off-systems for adverse drug reactions via chemical-protein interactome–clozapine-induced agranulocytosis as a case study. PLoS Comput. Biol. 7, e1002016, doi: 10.1371/journal.pcbi.1002016 (2011).
Article CAS PubMed PubMed Central Google Scholar
Yang, L. et al. Identifying unexpected therapeutic targets via chemical-protein interactome. PLoS One 5, e9568, doi: 10.1371/journal.pone.0009568 (2010).
Article CAS ADS PubMed PubMed Central Google Scholar
Yang, L., Chen, J. & He, L. Harvesting candidate genes responsible for serious adverse drug reactions from a chemical-protein interactome. PLoS Comput. Biol. 5, e1000441, doi: 10.1371/journal.pcbi.1000441 (2009).
Article CAS ADS PubMed PubMed Central Google Scholar
Shu, M., Zai, X., Zhang, B., Wang, R. & Lin, Z. Hypothyroidism Side Effect in Patients Treated with Sunitinib or Sorafenib: Clinical and Structural Analyses. PLoS One 11, e0147048, doi: 10.1371/journal.pone.0147048 (2016).
Article CAS PubMed PubMed Central Google Scholar
Qiu, J. X. et al. Plumbagin elicits differential proteomic responses mainly involving cell cycle, apoptosis, autophagy, and epithelial-to-mesenchymal transition pathways in human prostate cancer PC-3 and DU145 cells. Drug Des. Devel. Ther. 9, 349–417, doi: 10.2147/DDDT.S71677 (2015).
Article CAS PubMed PubMed Central Google Scholar
Qi, L. & Ding, Y. Potential antitumor mechanisms of phenothiazine drugs. Science China. Life sciences 56, 1020–1027, doi: 10.1007/s11427-013-4561-6 (2013).
Article CAS PubMed Google Scholar
Faraone, S. V. & Zhang-James, Y. Can sodium/hydrogen exchange inhibitors be repositioned for treating attention deficit hyperactivity disorder? An in silico approach. Am. J. Med. Genet. B Neuropsychiatr. Genet. 162B, 711–717, doi: 10.1002/ajmg.b.32155 (2013).
Article PubMed Google Scholar
Trott, O. & Olson, A. J. AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 31, 455–461, doi: 10.1002/jcc.21334 (2010).
Article CAS PubMed PubMed Central Google Scholar
Kim, S. et al. PubChem Substance and Compound databases. Nucleic Acids Res. 44, D1202–D1213, doi: 10.1093/nar/gkv951 (2016).
Article CAS PubMed Google Scholar
Wang, W., Yang, S., Zhang, X. & Li, J. Drug repositioning by integrating target information through a heterogeneous network model. Bioinformatics 30, 2923–2930, doi: 10.1093/bioinformatics/btu403 (2014).
Article CAS PubMed PubMed Central Google Scholar
Brass, E. P. et al. The novel phosphodiesterase inhibitor NM-702 improves claudication-limited exercise performance in patients with peripheral arterial disease. J. Am. Coll. Cardiol. 48, 2539–2545, doi: 10.1016/j.jacc.2006.07.064 (2006).
Article CAS PubMed Google Scholar
Wunderlich, M. T. In Encyclopedia of Molecular Mechanisms of Disease 306–307 (2009).
Mozaffarian, D. et al. Heart disease and stroke statistics–2015 update: a report from the American Heart Association. Circulation 131, e29–322, doi: 10.1161/CIR.0000000000000152 (2015).
Article PubMed Google Scholar
Benton, C. R. et al. Rosiglitazone increases fatty acid oxidation and fatty acid translocase (FAT/CD36) but not carnitine palmitoyltransferase I in rat muscle mitochondria. J. Physiol. 586, 1755–1766, doi: 10.1113/jphysiol.2007.146563 (2008).
Article CAS PubMed PubMed Central Google Scholar
Risner, M. E. et al. Efficacy of rosiglitazone in a genetically defined population with mild-to-moderate Alzheimer’s disease. Pharmacogenomics J. 6, 246–254, doi: 10.1038/sj.tpj.6500369 (2006).
Article CAS PubMed Google Scholar
Doonan, F., Wallace, D. M., O’Driscoll, C. & Cotter, T. G. Rosiglitazone acts as a neuroprotectant in retinal cells via up-regulation of sestrin-1 and SOD-2. J. Neurochem. 109, 631–643, doi: 10.1111/j.1471-4159.2009.05995.x (2009).
Article CAS PubMed Google Scholar
Shen, L. Q., Child, A., Weber, G. M., Folkman, J. & Aiello, L. P. Rosiglitazone and delayed onset of proliferative diabetic retinopathy. Arch. Ophthalmol. 126, 793–799, doi: 10.1001/archopht.126.6.793 (2008).
Article PubMed Google Scholar
Luo, Y. H., Ouyang, P. B., Tian, J., Guo, X. J. & Duan, X. C. Rosiglitazone inhibits TGF-beta 1 induced activation of human Tenon fibroblasts via p38 signal pathway. PLoS One 9, e105796, doi: 10.1371/journal.pone.0105796 (2014).
Article CAS ADS PubMed PubMed Central Google Scholar
Binda, C. et al. Molecular Insights into Human Monoamine Oxidase B Inhibition by the Glitazone Anti-Diabetes Drugs. ACS Med. Chem. Lett. 3, 39–42, doi: 10.1021/ml200196p (2011).
Article CAS PubMed PubMed Central Google Scholar
Binda, C. et al. Lights and shadows on monoamine oxidase inhibition in neuroprotective pharmacological therapies. Curr. Top. Med. Chem. 11, 2788–2796 (2011).
Article CAS PubMed Google Scholar
Wishart, D. S. et al. DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res. 34, D668–D672, doi: 10.1093/nar/gkj067 (2006).
Article CAS PubMed Google Scholar
Kuhn, M., von Mering, C., Campillos, M., Jensen, L. J. & Bork, P. STITCH: interaction networks of chemicals and proteins. Nucleic Acids Res. 36, D684–D688, doi: 10.1093/nar/gkm795 (2008).
Article CAS PubMed Google Scholar
Berman, H. M. et al. The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000).
Article CAS ADS PubMed PubMed Central Google Scholar
Wang, R., Fang, X., Lu, Y. & Wang, S. The PDBbind database: collection of binding affinities for protein-ligand complexes with known three-dimensional structures. J. Med. Chem. 47, 2977–2980, doi: 10.1021/jm030580l (2004).
Article CAS PubMed Google Scholar
Wei, W. Q. et al. Development and evaluation of an ensemble resource linking medications to their indications. J. Am. Med. Inform. Assoc. 20, 954–961, doi: 10.1136/amiajnl-2012-001431 (2013).
Article PubMed PubMed Central Google Scholar
Salmasian, H., Tran, T. H., Chase, H. S. & Friedman, C. Medication-indication knowledge bases: a systematic review and critical appraisal. J. Am. Med. Inform. Assoc. 22, 1261–1270, doi: 10.1093/jamia/ocv129 (2015).
Article PubMed PubMed Central Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Guha, R. Chemical informatics functionality in R. Journal of Statistical Software 18, 1–16 (2007).
Article Google Scholar
O’Boyle, N. M. et al. Open Babel: An open chemical toolbox. J. Cheminform. 3, 33, doi: 10.1186/1758-2946-3-33 (2011).
Article CAS PubMed PubMed Central Google Scholar
Chen, S., Kang, J. & Wang, G. An empirical Bayes normalization method for connectivity metrics in resting state fMRI. Front. Neurosci. 9, 316, doi: 10.3389/fnins.2015.00316 (2015).
Article PubMed PubMed Central Google Scholar
Bienfait, B. & Ertl, P. JSME: a free molecule editor in JavaScript. J. Cheminform. 5, 24, doi: 10.1186/1758-2946-5-24 (2013).
Article CAS PubMed PubMed Central Google Scholar
Morris, G. M. et al. AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J. Comput. Chem. 30, 2785–2791, doi: 10.1002/jcc.21256 (2009).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Dr. Wendy Cornell for her revision, suggestions and comments. We also thank the developers of AutoDock Vina, JSME Molecular Editor, JSMol and Marvin Beans.

Author information

Present address: IBM Thomas J. Watson Research Center, 1101 Kitchawan Rd, Yorktown Heights, NY 10598, USA.
Present address: Bayer Pharma AG, Müllerstraße 178, 13353 Berlin, Germany.
Luo Heng and Zhang Ping contributed equally to this work.

Authors and Affiliations

Bio-X Institutes, Shanghai Jiao Tong University, Shanghai, 200030, China
Heng Luo, Dizheng Du, Hao Ye, Hui Huang, Can Li, Shengying Qin, Chunling Wan, Lin He & Lun Yang
Center for Computational Health, IBM T.J. Watson Research Center, Yorktown Heights, 10598, NY, USA
Ping Zhang
Center for Data Analytics and Biomedical Informatics, Temple University, Philadelphia, 19122, PA, USA
Xi Hang Cao
Collaborative Innovation Center for Genetics and Development, State Key Laboratory of Genetic Engineering and MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200438, China
Leming Shi

Authors

Heng Luo
View author publications
You can also search for this author in PubMed Google Scholar
Ping Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xi Hang Cao
View author publications
You can also search for this author in PubMed Google Scholar
Dizheng Du
View author publications
You can also search for this author in PubMed Google Scholar
Hao Ye
View author publications
You can also search for this author in PubMed Google Scholar
Hui Huang
View author publications
You can also search for this author in PubMed Google Scholar
Can Li
View author publications
You can also search for this author in PubMed Google Scholar
Shengying Qin
View author publications
You can also search for this author in PubMed Google Scholar
Chunling Wan
View author publications
You can also search for this author in PubMed Google Scholar
Leming Shi
View author publications
You can also search for this author in PubMed Google Scholar
Lin He
View author publications
You can also search for this author in PubMed Google Scholar
Lun Yang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.Y. and L.H. designed and led the project. H.L., P.Z. and X.H.C. collected the data and implemented the methods. H.L., P.Z., X.H.C., D.D., H.Y., H.H., C.L., S.Q., C.W., L.S. and L.Y. discussed the data analysis and the results. H.L., P.Z., X.H.C. and L.Y. wrote the manuscript.

Ethics declarations

Competing interests

Lun Yang is a current employee at Bayer Pharma AG and Heng Luo is a current postdoctoral researcher at IBM. However, this study was based on their previous work at Shanghai Jiao Tong University and Bayer Pharma AG was not involved.

Electronic supplementary material

Supplementary Information

Supplementary Tables

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Luo, H., Zhang, P., Cao, X. et al. DPDR-CPI, a server that predicts Drug Positioning and Drug Repositioning via Chemical-Protein Interactome. Sci Rep 6, 35996 (2016). https://doi.org/10.1038/srep35996

Download citation

Received: 01 June 2016
Accepted: 10 October 2016
Published: 02 November 2016
DOI: https://doi.org/10.1038/srep35996

This article is cited by

A deep learning framework for drug repurposing via emulating clinical trials on real-world patient data
- Ruoqi Liu
- Lai Wei
- Ping Zhang
Nature Machine Intelligence (2021)
DPubChem: a web tool for QSAR modeling and high-throughput virtual screening
- Othman Soufan
- Wail Ba-alawi
- Vladimir B. Bajic
Scientific Reports (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.