Comparative study between deep learning and QSAR classifications for TNBC inhibitors and novel GPCR agonist discovery

Tsou, Lun K.; Yeh, Shiu-Hwa; Ueng, Shau-Hua; Chang, Chun-Ping; Song, Jen-Shin; Wu, Mine-Hsine; Chang, Hsiao-Fu; Chen, Sheng-Ren; Shih, Chuan; Chen, Chiung-Tong; Ke, Yi-Yu

doi:10.1038/s41598-020-73681-1

Article
Open access
Published: 08 October 2020


Lun K. Tsou¹^na1,
Shiu-Hwa Yeh¹^na1,
Shau-Hua Ueng¹^na1,
Chun-Ping Chang¹^na1,
Jen-Shin Song¹,
Mine-Hsine Wu¹,
Hsiao-Fu Chang¹,
Sheng-Ren Chen¹,
Chuan Shih¹,
Chiung-Tong Chen¹ &
Yi-Yu Ke¹

Scientific Reports volume 10, Article number: 16771 (2020)


Abstract

Machine learning is a well-established approach for virtual screening. Recently, deep learning, a class of machine learning algorithms based on artificial neural networks, has been applied to advance precision medicine and drug discovery. In this study, we performed comparative studies between deep neural networks (DNN) and other ligand-based virtual screening (LBVS) methods and found that DNN and random forest (RF) were superior in hit prediction efficiency. Using DNN, several triple-negative breast cancer (TNBC) inhibitors were identified as potent hits from a screen of an in-house database of 165,000 compounds. To broaden the application of this method, we harnessed the predictive power of the trained model to discover G protein-coupled receptor (GPCR) agonists, for which computational structure-based design can be greatly hindered by a lack of structural information. Notably, a potent (~ 500 nM) mu-opioid receptor (MOR) agonist was identified as a hit from a small training set of 63 compounds. Our results show that DNN could be an efficient module in hit prediction and provide experimental evidence that machine learning could identify potent hits in silico from a limited training set.

Introduction

Implementation of “big data” with deep learning has created a paradigm shift in many scientific disciplines^1,2,3. From the perspective of medicinal chemistry, predicting particular functions or properties of a molecular entity, e.g., absorption, distribution, metabolism, and excretion (ADME), might greatly increase the quality of hit compounds and quicken the drug-discovery process. The use of artificial intelligence (AI) in drug design to generate prediction models, conduct virtual screening, and predict compounds’ activities has received much attention recently^4,5,6,7. Traditionally, quantitative structure–activity relationship (QSAR) models were used by medicinal chemists and statisticians to associate bioactivities with particular functional group manipulations. In particular, a linear equation was generated to correlate the features and bioactivities of each compound, while different descriptors were employed to calculate physical properties that were merged with 3D-structural information to generate 2D- or 3D-QSAR models^8,9,10. More recently, QSAR development has extended to multi-target and multi-objective approaches to assist drug design^11,12,13. These QSAR approaches can integrate multiple diverse chemical and biological data and are therefore capable of jointly making predictions ranging from in vitro and in vivo activities to ADMET properties¹⁴. Nonetheless, such QSAR models are hard to generate from random and diverse databases. In addition, properly separating the training set and the test set is time consuming. As an alternative strategy, Zhavoronkov et al. successfully used deep learning to design more potent compounds¹⁵. Incorporating machine learning for the progressive analysis of active compounds and the concurrent generation of the prediction model should address such limitations.

Lavecchia et al.¹⁶ summarized applications of machine learning algorithms, such as support vector machines (SVM)¹⁷ for ADME evaluation and decision trees (DT) for the classification of compounds¹⁸. Moreover, the Naïve Bayesian classifier is frequently used in chemoinformatics for predicting biological properties, while k-nearest neighbors (K-NN) is a simple and rough method to predict and rank molecules^19,20. Artificial neural networks (ANNs) are a popular technique for compound classification, QSAR studies, and primary virtual screening (VS) of compounds²¹. All these machine learning algorithms were programmed to pick out and reclassify important features of the molecules as instructed; their limitations stem from an intrinsic inability to “self-teach” and prioritize the features in relation to the activities. Improper combination of the compounds’ descriptors could increase the noise level in feature learning, which could swamp the classifier model and generate misleading predictions²².

Herein, we employed a deep learning algorithm to analyze compound features, generated a first-hand model from 613 descriptors for training, and validated its findings through experimental confirmation. In addition, we compared its accuracy and efficiency with those of three other virtual screening methods. After in silico screening of our in-house database of 165,000 compounds, from which different hit compounds had previously been identified^15,23,24,25, the 100 top-ranked newly identified TNBC inhibitors were subjected to the bioassay to cross-examine the model accuracy. Moreover, to extend the scope of this deep learning model in predicting meaningful hits, another case study was carried out to search for novel G protein-coupled receptor (GPCR) agonists. Using a similar model, we trained on a collection of only 63 mu-opioid receptor (MOR) agonists so that the model could learn the importance of compound features for the given bioactivities. We then identified a nanomolar MOR agonist from the in-house compound library. Our study suggests that deep learning methods could generate potent hit compounds in different disease areas for the drug discovery process.

Results and discussion

Model generation and comparative studies in efficiency

Advances in virtual screening methods have been made to reduce the burden of the drug discovery/development process in a cost-effective manner²⁶. Virtual screening can be devised by using either structure-based virtual screening (SBVS), such as docking screening methods²⁷, or LBVS, such as QSAR model screening²⁸. To investigate the application and efficiency of the DNN approach in medicinal chemistry, we compared it and another contemporary method, the RF approach²⁹, with traditional QSAR methods, namely PLS and MLR. RF has been demonstrated to have high prediction accuracy and robustness with adjustable parameters, and it has become a “gold standard” machine learning method. Meanwhile, partial least squares (PLS) and multiple linear regression (MLR) are methods used for large data manipulation and, unlike other 3D-QSAR methods, allow facile generation of the model. In the current study, the same data set and descriptors were systematically used to generate all models.

The traditional QSAR model helps to identify the relationship between activities and the descriptor variables. In addition to the QSAR methods, RF and DNN from the machine learning approach were used to generate prediction models. RF is an ensemble learning method that performs classification in a manner similar to that of the decision tree (DT). The major difference stems from the use of the bagging method (bootstrap aggregating) to generate many individual trees³⁰. Each tree is built from a fixed number of samples drawn at random from the training set and then contributes a vote. The final prediction is based on the highest score among the individually developed trees in the forest. On the other hand, DNN are mathematical methods developed to mimic the neurons (nodes) of the human brain to recognize objects and analyze progressively, improving on the efficiency of previously reported neural network algorithms^1,31. Each neuron is treated as a particular feature used to classify the complex factors. The system, in turn, learns from the training set and assigns a different weight to each neuron, so that the model eventually makes a prediction following the different clusters. Taken together, DNN increase the number of hidden layers, allowing each layer of nodes to access different features based on the previous layer’s output. Consequently, as more nodes are added in each layer, more features are recognized, enhancing the overall decision process.
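The bagging-and-voting idea described above can be illustrated with a toy sketch. This is not the R "randomForest" workflow used in this study: the one-descriptor data, the depth-1 trees (stumps), and all function names are hypothetical, and a real random forest would also draw a random subset of descriptors for each tree.

```python
import random
from collections import Counter

def bootstrap_sample(rows, rng):
    """Draw len(rows) rows with replacement (the bootstrap step of bagging)."""
    return [rng.choice(rows) for _ in rows]

def fit_stump(rows):
    """Fit a depth-1 tree on (x, label) pairs with labels 0 or 1.

    Tries every observed x as a threshold and both label orientations,
    keeping the first combination with the fewest training errors.
    """
    best = None
    for threshold in sorted({x for x, _ in rows}):
        for label_above in (0, 1):
            errors = sum(
                (label_above if x >= threshold else 1 - label_above) != y
                for x, y in rows
            )
            if best is None or errors < best[0]:
                best = (errors, threshold, label_above)
    _, threshold, label_above = best
    return lambda x: label_above if x >= threshold else 1 - label_above

def fit_forest(rows, n_trees=25, seed=0):
    """Train each stump on its own bootstrap sample of the training rows."""
    rng = random.Random(seed)
    return [fit_stump(bootstrap_sample(rows, rng)) for _ in range(n_trees)]

def predict(forest, x):
    """Return the class with the most votes across all trees."""
    votes = Counter(tree(x) for tree in forest)
    return votes.most_common(1)[0][0]

# Hypothetical one-descriptor data: label 1 marks an "active" compound.
toy_rows = [(0.1, 0), (0.2, 0), (0.3, 0), (0.4, 0),
            (0.6, 1), (0.7, 1), (0.8, 1), (0.9, 1)]
forest = fit_forest(toy_rows)
print(predict(forest, 0.05), predict(forest, 0.95))
```

Because every tree sees a different resampled training set, an individual tree may be poor, but the majority vote tends to be more stable than any single tree.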

To compare the different methods of virtual screening, a database of 7130 molecules with previously reported MDA-MB-231 inhibitory activities was collected from the ChEMBL web service. Model prediction accuracy is highly dependent on the quality of the database. In this study, these compounds were randomly separated into 6069 compounds (the training set) and 1061 compounds (the test set) to evaluate which model can more efficiently analyze the database and generate more useful models (Fig. 1). We implemented the extended connectivity fingerprints (ECFPs), which are circular topological depictions of the molecules, as the major molecular descriptors. Specifically, ECFPs are generated in a molecule-directed manner by systematically recording the neighborhood of each non-hydrogen atom in multiple circular layers up to a given diameter³². These atom-centered sub-structural features are then mapped into integer codes, and the resulting identifiers form the extended connectivity fingerprint. These identifiers capture the local information of the corresponding atom in such a way that various atom properties (e.g., atomic number, connection counts) are packed into a single integer value. The default identifier configuration of ECFP captures highly specific atomic information, enabling the representation of a large set of precisely defined structural features.

In some applications, however, different kinds of abstraction may be desirable. For example, a chlorine or a bromine substituent on a ring may be functionally equivalent but would be redundantly distinguished by ECFP. Alternatively, functional-class fingerprints (FCFPs)³² build circular fingerprints from the pharmacophore roles of atoms, yielding topological pharmacophore fingerprints. To perform the classification comparisons, the software devised a total of 613 descriptors from AlogP_count³³, ECFP, and FCFP to generate the model (Fig. 1, and supplementary data Table S1).
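The layer-by-layer construction of a circular fingerprint can be sketched as follows. This is a simplified illustration, not the ECFP or FCFP implementation in BIOVIA software: the hashing scheme, the function names, and the folding to 256 bits are assumptions made for the example, and details such as bond orders and duplicate-substructure removal are ignored. A radius of 2 corresponds to a diameter of 4 bonds, as in ECFP_4.

```python
import hashlib

def _hash(values):
    """Map a tuple of integers to a stable 32-bit identifier."""
    digest = hashlib.sha256(repr(tuple(values)).encode()).digest()
    return int.from_bytes(digest[:4], "big")

def circular_fingerprint(atom_codes, bonds, radius=2, n_bits=256):
    """Toy ECFP-like fingerprint.

    atom_codes: one integer per heavy atom (here, the atomic number)
    bonds: list of (i, j) atom index pairs
    Returns the set of bit positions that are switched on.
    """
    neighbors = {i: [] for i in range(len(atom_codes))}
    for i, j in bonds:
        neighbors[i].append(j)
        neighbors[j].append(i)
    # Layer 0: pack the atom code and connection count into one identifier.
    ids = [_hash((code, len(neighbors[i]))) for i, code in enumerate(atom_codes)]
    features = set(ids)
    # Each further layer combines an atom's identifier with its neighbors'.
    for layer in range(1, radius + 1):
        ids = [
            _hash((layer, ids[i], *sorted(ids[n] for n in neighbors[i])))
            for i in range(len(ids))
        ]
        features.update(ids)
    return {f % n_bits for f in features}

# Heavy-atom graphs of chloroethane and bromoethane (C-C-X).
chain = [(0, 1), (1, 2)]
chloro = circular_fingerprint([6, 6, 17], chain)
bromo = circular_fingerprint([6, 6, 35], chain)
print(sorted(chloro & bromo))  # bits shared by the two molecules
```

Replacing the atomic numbers with a functional-class code, for example one shared integer for all halogens, would make the two molecules above produce the same fingerprint, which is the kind of abstraction FCFP provides.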

Three different training set sizes (6069, 3035, and 303 compounds) were used to generate the models, and their efficiencies were evaluated with the fixed test set (1061 compounds). The R-square value (r²) was used to quantify the differences between training set and test set prediction for the machine learning methods (DNN and RF) and the QSAR methods (PLS and MLR) (Fig. 1). With the training set fixed at 6069 compounds, the machine learning methods (DNN or RF) exhibited higher predicted r² values, near 90%, than the traditional QSAR methods (PLS or MLR), at 65%. In general, a good model was considered to have larger r² and R²_pred (r² > 80, R²_pred > 60 is an assessable model)^34,35,36. As the training set size decreased, the machine learning methods sustained overall higher r² values: DNN and RF remained at 0.84 to 0.94, while PLS and MLR dropped from 0.69 to 0.24. Interestingly, with the significantly smaller training set, the MLR method maintained a respectable r² value near 0.93, but when run against the test set, its R²_pred was calculated to be zero. This implies that MLR could be an over-fitted model with a high false-positive rate, especially when the number of learning compounds is very limited. These results showed that the PLS and MLR methods could not efficiently distinguish the descriptors and were problematic in generating meaningful fitting equations. On the other hand, with the smaller training set the DNN method still held a higher r² value (0.94) than the RF method (0.84) (Fig. 1). Although the RF method could classify the features and select intrinsic features for the analysis, the DNN method was better at providing insight into the weighting of important features. As a result, the DNN method held a higher r² value with smaller training sets. For the machine learning methods, R²_pred improved significantly as the training set size increased, which is vastly different from the QSAR models (Fig. 1). Because routine sampling of a large number of molecular features against a target from the public domain might be limiting, the large spread or deviation of the PLS and MLR processes could greatly hinder the potential for identifying potent hits. Taken together, DNN and RF exhibited better accuracy and efficiency in the prediction of hit compounds. As shown in Fig. 1, the R²_pred values of DNN (0.26) and RF (0.24) were much lower, which implies that the database quality might not be sufficient for learning. We envision that more data might be needed, or that the structural information and activities in the datasets should be better correlated, for better learning by the algorithm.

Seminal work by Grisoni and coworkers^37,38 indicated that the R²_pred or Q² metric (Eq. 1) should be optimized to ${\mathrm{Q}}_{F3}^{2}$ (Eq. 2), as the latter is more sensitive for comparing the predictive abilities of different models built with the same training set. The original R²_pred metric is shown below:

$${\mathrm{Q}}^{2}=1-\frac{\sum_{i=1}^{{n}_{test}}{\left({y}_{i}-{\widehat{y}}_{i}\right)}^{2}}{\sum_{i=1}^{{n}_{test}}{\left({y}_{i}-{\bar{y}}_{TR}\right)}^{2}}$$

(1)

where y_i is the experimental result for the i-th compound not present in the training set, ŷ_i is the predicted result for the i-th compound, y̅_TR is the average of the training set experimental results, and n_test is the number of test set compounds. As reported by Todeschini et al., ${\mathrm{Q}}_{F3}^{2}$ is calculated as

$${\mathrm{Q}}_{F3}^{2}=1-\frac{\sum_{i=1}^{{n}_{test}}{\left({y}_{i}-{\widehat{y}}_{i}\right)}^{2}/{n}_{test}}{\sum_{j=1}^{{n}_{TR}}{\left({y}_{j}-{\bar{y}}_{TR}\right)}^{2}/{n}_{TR}}$$

(2)

where y_j is the experimental result for the j-th training set compound, y̅_TR is the average of the training set experimental results, and n_TR is the number of training set compounds. By applying this metric to our studies, DNN and RF exhibited the highest ${\mathrm{Q}}_{F3}^{2}$ values of 0.679 and 0.670, respectively (Supplementary data Table S2). In addition, Consonni et al. showed that the root-mean-square error in prediction (RMSEP) and the root-mean-square error in calculation (RMSEC) can quantify the predictive ability of a QSAR model. A higher RMSEP indicates a larger prediction error. Our calculations also showed that the DNN method had the lowest RMSEC and RMSEP values among the models compared (Supplementary data Table S2).
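As a minimal sketch of Eqs. 1 and 2, the metrics can be computed as below. The function names and the toy activity values are hypothetical and are not data from this study; RMSEP is the root-mean-square error on the test set and RMSEC the same quantity on the training set.

```python
import math

def q2_f1(y_test, y_pred, y_train):
    """Eq. 1: test-set squared error relative to the test-set spread
    around the training-set mean."""
    mean_tr = sum(y_train) / len(y_train)
    press = sum((y - p) ** 2 for y, p in zip(y_test, y_pred))
    spread = sum((y - mean_tr) ** 2 for y in y_test)
    return 1.0 - press / spread

def q2_f3(y_test, y_pred, y_train):
    """Eq. 2: mean test-set squared error relative to the
    training-set variance."""
    mean_tr = sum(y_train) / len(y_train)
    press = sum((y - p) ** 2 for y, p in zip(y_test, y_pred)) / len(y_test)
    var_tr = sum((y - mean_tr) ** 2 for y in y_train) / len(y_train)
    return 1.0 - press / var_tr

def rmse(y_true, y_pred):
    """Root-mean-square error (RMSEP on test data, RMSEC on training data)."""
    squared = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    return math.sqrt(squared / len(y_true))

# Hypothetical activities; the model always predicts the training mean.
y_train = [5.0, 6.0, 7.0, 8.0]
y_test = [6.0, 7.0]
y_pred = [6.5, 6.5]
print(q2_f1(y_test, y_pred, y_train))
print(q2_f3(y_test, y_pred, y_train))
print(rmse(y_test, y_pred))
```

In this toy case the model only ever predicts the training mean, so Eq. 1 gives zero, while Eq. 2 gives about 0.8 because the test compounds happen to lie close to that mean. The two metrics scale the same squared error against different reference spreads, which is why they can rank models differently.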

To further investigate the advantage in predictive ability of the machine learning methods (DNN and RF) over the traditional QSAR methods (PLS and MLR), we analyzed the receiver operating characteristic (ROC) curves with the fixed training set (6069 compounds) and the fixed test set (1061 compounds)^39,40. The ROC curve evaluates the performance of a binary classifier system and provides a means of selecting optimal models. The ROC curve was constructed by plotting sensitivity (Se, true positive rate) against 1 − specificity (1 − Sp, false positive rate). Se and Sp are defined as

$$\mathrm{Se}=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FN}}$$

(3)

$$\mathrm{Sp}=\frac{\mathrm{TN}}{\mathrm{TN}+\mathrm{FP}}$$

(4)

where TP is the number of active ligands correctly identified as active (true positives), TN is the number of inactive ligands correctly identified as inactive (true negatives), FP is the number of inactive ligands incorrectly identified as active (false positives), and FN is the number of active ligands incorrectly identified as inactive (false negatives). The area under the ROC curve (AUC) measures the performance of each virtual screening approach. An ideal screening method results in an AUC value of 1, while a random screening method would lead to an AUC value of 0.5. As shown in Fig. 2A, the AUC values calculated from the training set for the RF and DNN methods were 0.991 and 0.992, respectively. These values were higher than those of the PLS and MLR methods, at 0.907 and 0.922. For the test set, the AUC values of the RF and DNN methods were 0.922 and 0.924, respectively, again superior to those of the PLS and MLR methods, at 0.870 and 0.865. These ROC curve analyses further support the view that the RF and DNN screening methods might be more suitable than the traditional QSAR methods (PLS and MLR).
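Eqs. 3 and 4 and the AUC can be illustrated with a short sketch. The function names, labels, and scores below are hypothetical. The AUC is computed here through its rank interpretation, the probability that a randomly chosen active is scored above a randomly chosen inactive, which equals the area under the ROC curve.

```python
def sensitivity(tp, fn):
    """Eq. 3: fraction of true actives that are recovered."""
    return tp / (tp + fn)

def specificity(tn, fp):
    """Eq. 4: fraction of true inactives that are rejected."""
    return tn / (tn + fp)

def roc_auc(labels, scores):
    """AUC as the fraction of (active, inactive) pairs ranked correctly.

    labels: 1 for active, 0 for inactive; ties count as half a win.
    """
    actives = [s for l, s in zip(labels, scores) if l == 1]
    inactives = [s for l, s in zip(labels, scores) if l == 0]
    wins = 0.0
    for a in actives:
        for i in inactives:
            if a > i:
                wins += 1.0
            elif a == i:
                wins += 0.5
    return wins / (len(actives) * len(inactives))

# Hypothetical screen: six compounds with predicted scores.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
print(sensitivity(tp=8, fn=2), specificity(tn=9, fp=1))
print(roc_auc(labels, scores))
```

In the toy ranking, one inactive (score 0.7) outranks one active (score 0.6), so 8 of the 9 active/inactive pairs are ordered correctly.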

Virtual screening and identification of TNBC inhibitors by DNN and RF models with experimental validation

Based on the above information, the DNN and RF models were chosen as the preferred means to perform virtual screening. The identified compounds were then assayed for their corresponding bioactivities. Herein, we demonstrate two different cases for evaluating the accuracy of these models. First, we successfully identified active hits for TNBC inhibition. The DNN and RF models were used to screen the in-house database (165,000 compounds), and the selected hits were tested in the anti-TNBC cellular assay (Fig. 3A). The top 100 predicted compounds were selected and tested at a concentration of 10 μM for MDA-MB-231 cell line inhibition (Supplementary data Table S3.1 and Table S3.2). Since the compound collection was acquired based on MDA-MB-231 inhibitory activities, other TNBC cell lines were also assayed to obtain selective TNBC inhibitors. Out of the multiple hits identified through both methods, six compounds from each classification (compounds 1–12) were assayed and showed low cytotoxicity to MCF-10A, a nonmalignant mammary epithelial cell line (Fig. 3B,C). We then assayed these hits against other TNBC cell lines, BT-549 and MDA-MB-453. Compounds 3, 7, 8, and 10, which exhibited broader TNBC inhibition, were then subjected to IC₅₀ determination (Supplementary data Figure S1). Notably, from both RF and DNN, we obtained a thiazole core with a selective anti-TNBC profile over normal mammary cells (Fig. 3B,C). Synthesis of thiazole-based inhibitors was carried out, and several potent TNBC inhibitors were identified (Fig. 4). Compound 18, which showed good selectivity over the nonmalignant mammary epithelial cell line, had an IC₅₀ of 0.62 µM against MDA-MB-231. In addition, compounds 22 and 23 were synthesized as regioisomeric controls. Compound 22 did not show activity toward the TNBC cell lines, while compound 23, although it possessed moderate micromolar activity, also exhibited cytotoxicity to MCF-10A. This study serves as a good example of hit generation against an unknown target, yielding good cellular selectivity and a functionally manipulable core.

Analysis between model-identified compounds and database compounds

To address the availability of the thiazole core in the original set of 7130 compounds, we devised a principal component analysis of the database with pIC₅₀, AlogP, and polar surface area versus total surface area (Fig. 5). These properties were chosen to reflect the characteristics of a hit compound in a common drug-discovery campaign. The 7130 compounds were then mapped, which showed that compounds containing the thiazole core are clustered in the quadrant with activities ranging from pIC₅₀ 4.8–6.5 (10 µM to 0.3 µM). Moreover, the AlogP and polar surface area versus total surface area values were in a satisfactory range for a hit compound (Fig. 5). Gratifyingly, this finding correlates well with the experimental results from our SAR studies of the TNBC inhibitors (Fig. 4). Our findings suggest that both RF and DNN can be adapted to generate meaningful models and identify functional hits for later optimization.

Identification and experimental validation of novel GPCR agonists by the DNN and RF models

We envision that predicting new scaffolds with experimental validation should greatly expand the application of this deep learning approach. We therefore adapted this classification to GPCR agonist generation, where structure-based design is limited by the lack of structural information owing to the membrane-associated nature of many GPCRs (Fig. 6). To evaluate the scope of the model, MOR agonists were identified via virtual screening of the same in-house database with the DNN and RF models. In our previous studies on MOR agonists⁴¹, we synthesized 63 compounds and tested them in the FLIPR calcium assay (Supplementary data Table S4 and Figure S2). We used MOR as an example to demonstrate the predictive power of this approach. To train the learning system, we provided this small collection of 63 compounds⁴¹. All 63 compounds, divided into series A–E clusters, were used as the training set to generate the DNN and RF models (Fig. 6A). We envision that incorporating molecular diversity with a large spread of bioactivities in series A–E should minimize the deviation of r² with DNN and RF and improve the learning process. Model generation was performed with the same 613 descriptors, and new cores in the pool of 165,000 in-house compounds were then processed. The top 40 compounds predicted by RF and another top 40 predicted by DNN (Supplementary data Table S5.1 and Table S5.2) were subjected to the FLIPR calcium assay (Fig. 6B). The CHO-K1 cell line stably expressing MOR and Gα15 (GenScript) was used to evaluate the selected compounds. In the FLIPR calcium assay of CHO-K1/MOR/Gα15 cells, activation of MOR elicits intracellular calcium release, leading to an increase in relative fluorescence units (RFU). Five compounds, 24–28, were identified as potential hits by these two screening models. As shown in Fig. 6B, hit 26, identified by the DNN method, exhibited potent agonist activity (EC₅₀ = 560 nM); in addition, these models provided great molecular diversity relative to the training set compounds. To the best of our knowledge, this is the first example correlating prediction and validation in GPCR agonist discovery where structure-based design is limited. Notably, only a small training set of 63 compounds (Supplementary data Table S4) was employed, and a set of five structurally distinct hits was identified. This result provides strong support that the DNN and RF methods can still sustain a high predicted r² value with a small training data set.

An opioid receptor binding affinity assay was performed to further confirm that these compounds bind directly to MOR. Binding to MOR membranes was detected by measuring the competitive inhibition of [3H]diprenorphine binding. Ki values were calculated as Ki = IC50/(1 + L/Kd), where L is the concentration of [3H]diprenorphine used (1 nM) and the Kd value for MOR is 0.4 nM. All assays were carried out independently and at least in triplicate, and values are reported as the mean ± SEM. MOR = mu opioid receptor; ME = [Met5]Enkephalin; N.D. = not determined; SEM = standard error of the mean. As shown in Fig. 6, compounds 24–28 have no structural similarity to morphine or any other previously described opioid receptor agonist. In the receptor binding assay, membrane proteins from HEK-MOP were used to measure the binding affinity of these compounds in comparison with morphine (Table 1).
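The Ki conversion stated above can be written as a one-line function. The function name and the example IC50 value are hypothetical; the radioligand concentration (1 nM) and Kd (0.4 nM) are the values given in the text, so every IC50 is divided by the same factor of 1 + 1/0.4 = 3.5.

```python
def ki_from_ic50(ic50_nm, ligand_nm=1.0, kd_nm=0.4):
    """Ki = IC50 / (1 + L / Kd), with all concentrations in nM."""
    return ic50_nm / (1.0 + ligand_nm / kd_nm)

# Hypothetical IC50 of 350 nM from a competition binding curve.
print(ki_from_ic50(350.0))
```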

Table 1 The binding affinity assay of compounds 24, 25, 26, 27, 28 and morphine on MOR.

Conclusion

Hit identification is an important step in the early stages of drug discovery. Virtual screening is extensively used to identify suitable hits, and methods that improve the hit rate are much sought after. In this study, we report comparative studies between traditional QSAR methods and machine learning methods applied in VS. The results showed that machine learning methods could achieve a higher predicted r² value with fewer compounds in the training set. In our work, DNN and RF predicted selective TNBC inhibitors from our in-house database. In the case of identifying novel MOR agonists, 5 hit compounds were readily found from a training set of only 63 compounds. Among these 5 structurally diverse hits, the one identified by the DNN method showed good potency for a hit, with an EC₅₀ of 560 nM. This is an interesting application of deep learning classification, as structure-based design of GPCR agonists is constrained by limited structural information owing to the membrane-associated nature of many GPCRs. Taken together, this study demonstrated the efficiency of the DNN and RF machine learning methods for VS and provided experimental evidence that this application can be adapted to identify hit compounds across different diseases.

Experimental procedures

Data set collection for TNBC and MOR

For the TNBC inhibitor identification studies, 7130 compounds with MDA-MB-231 bioassay activity data were collected from the ChEMBL database (https://www.ebi.ac.uk/chembl/). The database was randomly separated into two parts. One part contained 85% of the compounds (6069 compounds), which were used as the training set; the other 15% of the compounds (1061 compounds) were used as the test set in our studies. For the MOR agonist discovery studies, 63 compounds were collected from the publication of Chen et al.⁴¹ as the training set database (Supplementary data Table S4).
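A random split of this size can be sketched as follows. The function name, the seed, and the use of integer placeholders for compound identifiers are assumptions; the actual split in this study was performed within the modeling platform, and its random seed is not reported.

```python
import random

def split_dataset(compound_ids, n_train, seed=0):
    """Shuffle the identifiers and cut them into training and test sets."""
    ids = list(compound_ids)
    random.Random(seed).shuffle(ids)
    return ids[:n_train], ids[n_train:]

# Placeholder identifiers 0..7129 stand in for the 7130 ChEMBL compounds.
train_ids, test_ids = split_dataset(range(7130), n_train=6069)
print(len(train_ids), len(test_ids))  # 6069 1061
```

Fixing the seed keeps the test set identical across the DNN, RF, PLS, and MLR runs, which is what makes their test-set metrics comparable.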

Descriptors and model generation

All models were generated on the BIOVIA Pipeline Pilot V18.1 platform with R statistical software V 3.4.1^42,43. The models were generated from the same descriptors, calculated with the Discovery Studio/Calculates ligand properties program (BIOVIA, Inc., San Diego, CA): ALogP_count (101 descriptors), ECFP_4 (256 descriptors), and FCFP_4 (256 descriptors). The RF model used a recursive partitioning (decision tree) forest model from the R package “randomForest”. The number of trees was set to 500. The fraction of descriptors used for each tree in the forest was set to 0.3. The DNN model was built with the R package “deepnet”. Three hidden layers were used, each with 80 nodes. The learning rate for every epoch was 0.1 with a momentum of 0.9, and the maximum number of iterations for network training was 5000. To prevent the model from over-fitting, the fraction of hidden-layer nodes dropped out during training was set to 0.25. The traditional QSAR model, such as multiple linear regression (MLR) analysis, is an equation that describes the dependent variable Y in terms of the independent variables X1, X2, …, Xp. For example, Y(pred)ⁱ = b0 + b1 * X1 + b2 * X2 + … + bp * Xp, where b1, b2, …, bp are the regression coefficients, Y(pred)ⁱ is the predicted bioactivity of the ith compound, and X1, X2, …, Xp are the different descriptors⁴⁴. PLS regression uses an orthogonal score matrix (T) to determine the fundamental relations between the dependent variables Y and the independent variables X. For example, Y = X × W × Q + E, T = X × W, where Y is the response matrix for the dependent variables (e.g., bioactivity results), T is the factor score matrix extracted from the independent variables (e.g., descriptors), W is the weight matrix, Q is the matrix of regression coefficients, and E is the error term of the model^45,46. The PLS and MLR models were also built on the Pipeline Pilot V18.1 platform with the default protocol and evaluated by fivefold cross-validation.
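For reference, the settings listed above can be collected in one place, together with a direct transcription of the MLR equation. The dictionary keys and function name are hypothetical labels chosen for readability; they are not the argument names of the R packages "randomForest" or "deepnet", and the coefficients in the example are made up.

```python
# Descriptor blocks and their sizes as stated in the text.
DESCRIPTOR_BLOCKS = {"ALogP_count": 101, "ECFP_4": 256, "FCFP_4": 256}

# Hypothetical key names; values are the settings reported in the text.
RF_SETTINGS = {"n_trees": 500, "descriptor_fraction_per_tree": 0.3}
DNN_SETTINGS = {
    "hidden_layers": [80, 80, 80],
    "learning_rate": 0.1,
    "momentum": 0.9,
    "max_iterations": 5000,
    "hidden_dropout": 0.25,
}

def mlr_predict(b0, coefficients, descriptors):
    """Y(pred) = b0 + b1*X1 + b2*X2 + ... + bp*Xp for one compound."""
    if len(coefficients) != len(descriptors):
        raise ValueError("need one coefficient per descriptor")
    return b0 + sum(b * x for b, x in zip(coefficients, descriptors))

n_descriptors = sum(DESCRIPTOR_BLOCKS.values())
print(n_descriptors)  # 613
print(mlr_predict(1.0, [2.0, 3.0], [1.0, 1.0]))  # 6.0
```

With 613 coefficients and only 303 training compounds, such a linear equation has more free parameters than observations, which is consistent with the over-fitting behavior of MLR noted in the results.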

Cell viability assay for TNBC inhibitors

The cells were seeded overnight in 384-well clear plates at a density of 8 × 10² cells/well for the MCF-10A and BT-549 cell lines, 1 × 10³ cells/well for MDA-MB-453, and 2 × 10³ cells/well for MDA-MB-231. The cells were then treated with the indicated concentrations of test compounds for 72 h. At the end of incubation, 5 μL of PrestoBlue Cell Viability Reagent (Invitrogen, Carlsbad, CA, USA) was added to each well containing 50 μL of medium. The plates were incubated for an additional 1.5 h at 37 °C in a humidified 5% CO₂ atmosphere; the relative fluorescence units (RFU) of the reaction mixture were then recorded (Ex560/Em590) with a Victor² V plate reader (PerkinElmer, Waltham, MA, USA). The cell lines were chosen based on the mutation status of PTEN and/or TP53: MCF-10A, the nonmalignant mammary epithelial cell line; BT-549 with mutation of PTEN and TP53; MDA-MB-453 with mutation of PTEN; MDA-MB-231 with mutation of TP53⁴⁷.

FLIPR calcium assay

Black, clear flat-bottom 96-well assay plates (Corning) were coated with a 0.1 mg/mL Poly-l-Lysine solution a day prior to the assay. CHO-K1/MOR/Gα15 cells were suspended in F12 medium and plated at a density of ~ 8 × 10⁴ cells/well in 200 μL of medium. Cells were incubated in a humidified atmosphere of 10% CO₂ at 37 °C overnight to reach an 80–90% confluent cell monolayer before the assay. On the day of the assay, 150 μL of medium/well was removed from the plate. To each well was added 50 μL of FLIPR calcium assay reagent dissolved in 1 × assay buffer (HBSS: KCl 5 mM, KH₂PO₄ 0.3 mM, NaCl 138 mM, NaHCO₃ 4 mM, Na₂HPO₄ 0.3 mM, d-glucose 5.6 mM, with an additional 20 mM HEPES and 13 mM CaCl₂, pH 7.4) containing 2.5 mM probenecid; the plate was then incubated at 37 °C for 1 h. Compounds (30 μM) and other reagents were dissolved in the assay buffer. Using a FlexStationIII (Molecular Devices Corp.), the increase in fluorescence after robotic injection of compounds or other reagents was monitored at 1.52 s intervals with an excitation wavelength of 485 nm and an emission wavelength of 525 nm. The [Ca²⁺]_i fluorescence was measured for up to 90 s after agonist injection. The relative fluorescence intensities from 2 wells of cells were averaged, and the relative amount of [Ca²⁺]_i release was determined by integrating the area under the curve (AUC) with Prism software (GraphPad). The AUC of each compound was then subtracted from the response in the presence of the MOR antagonist naloxone (20 nM) to obtain the specific MOR responses⁴⁸.
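The area-under-the-curve step can be sketched with the trapezoidal rule. This is an illustrative stand-in, not the Prism analysis used in this study: the function name and the constant toy trace are hypothetical, and only the 1.52 s sampling interval and the 90 s window come from the text.

```python
def trapezoid_auc(times_s, signal):
    """Integrate a sampled fluorescence trace with the trapezoidal rule."""
    area = 0.0
    for k in range(1, len(times_s)):
        width = times_s[k] - times_s[k - 1]
        area += width * (signal[k] + signal[k - 1]) / 2.0
    return area

# 60 reads at 1.52 s intervals span 0 to 89.68 s, within the 90 s window.
times_s = [k * 1.52 for k in range(60)]
flat_trace = [2.0] * len(times_s)  # hypothetical constant signal
print(trapezoid_auc(times_s, flat_trace))
```

For a real trace, the two replicate wells would first be averaged point by point and the resulting curve integrated in the same way.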

Radioligand binding assay

Human embryonic kidney 293 cells constitutively expressing MOR (HEK-MOR) (Dr. Ping-Yee Law; University of Minnesota Medical School) were harvested and homogenized in membrane preparation buffer (50 mM Tris–HCl at pH 7.4, containing 2 mM ethylenediaminetetraacetic acid [EDTA]) containing a fresh protease inhibitor cocktail (Roche, Basel, Switzerland) and then centrifuged at 30,000g for 30 min. The pellets were resuspended, aliquoted, and stored at − 80 °C. For the [³H]diprenorphine saturation binding assays, membranes (containing 25 μg of protein) were incubated with different concentrations (0.5–5 nM) of [³H]diprenorphine in binding buffer (50 mM Tris–HCl at pH 7.4, containing 2 mM EDTA) at 25 °C for 1 h. For the competitive binding experiments, [³H]diprenorphine (1 nM) was incubated with membranes (containing 25 μg of protein) in the absence or presence of various concentrations of compounds at 25 °C for 1 h. The samples were then rapidly filtered onto glass-fiber filters (Millipore, Billerica, MA, USA) and washed three times with ice-cold phosphate-buffered saline. The radioactivity was quantified using a liquid scintillation counter⁴⁹.

Abbreviations

DNN:: Deep neural networks
LBVS:: Ligand-based virtual screening
RF:: Random forest
TNBC:: Triple-negative breast cancer
GPCR:: G-protein-coupled receptors
AI:: Artificial intelligence
QSAR:: Quantitative structure–activity relationship
SVMs:: Support vector machines
ADME:: Absorption, distribution, metabolism, and excretion
DT:: Decision tree
K-NN:: K-nearest neighbors
ANNs:: Artificial neural networks
VS:: Virtual screening
MOR:: Mu-opioid receptor
SBVS:: Structure-based virtual screening
PLS:: Partial least squares
MLR:: Multiple linear regression
ECFPs:: Extended connectivity fingerprints
FCFPs:: Functional-class fingerprints
RFU:: Relative fluorescence units

References

LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444. https://doi.org/10.1038/nature14539 (2015).
Aliper, A. et al. Deep learning applications for predicting pharmacological properties of drugs and drug repurposing using transcriptomic data. Mol. Pharm. 13, 2524–2530. https://doi.org/10.1021/acs.molpharmaceut.6b00248 (2016).
Jing, Y., Bian, Y., Hu, Z., Wang, L. & Xie, X. S. Deep learning for drug design: An artificial intelligence paradigm for drug discovery in the big data era. AAPS J. 20, 58. https://doi.org/10.1208/s12248-018-0210-0 (2018).
Gawehn, E., Hiss, J. A. & Schneider, G. Deep learning in drug discovery. Mol. Inform. 35, 3–14. https://doi.org/10.1002/minf.201501008 (2016).
Popova, M., Isayev, O. & Tropsha, A. Deep reinforcement learning for de novo drug design. Sci. Adv. https://doi.org/10.1126/sciadv.aap7885 (2018).
Lavecchia, A. Deep learning in drug discovery: Opportunities, challenges and future prospects. Drug Discov. Today 24, 2017–2032. https://doi.org/10.1016/j.drudis.2019.07.006 (2019).
Stahl, N., Falkman, G., Karlsson, A., Mathiason, G. & Bostrom, J. Deep Reinforcement learning for multiparameter optimization in de novo drug design. J. Chem. Inf. Model. 59, 3166–3176. https://doi.org/10.1021/acs.jcim.9b00325 (2019).
Verma, J., Khedkar, V. M. & Coutinho, E. C. 3D-QSAR in drug design—A review. Curr. Top. Med. Chem. 10, 95–115. https://doi.org/10.2174/156802610790232260 (2010).
Ke, Y. Y. et al. 3D-QSAR assisted drug design: Identification of a potent quinazoline based Aurora kinase inhibitor. ChemMedChem 8(1), 136–148 (2013).
James, N., Shanthi, V. & Ramanathan, K. Drug design for ALK-positive NSCLC: An integrated pharmacophore-based 3D QSAR and virtual screening strategy. Appl. Biochem. Biotechnol. 185, 289–315. https://doi.org/10.1007/s12010-017-2650-x (2018).
Ambure, P., Halder, A. K., Diaz, H. G. & Cordeiro, M. N. D. S. QSAR-Co: An open source software for developing robust multitasking or multitarget classification-based QSAR models. J. Chem. Inf. Model. 59, 2538–2544 (2019).
Cruz-Monteagudo, M., Borges, F. & Cordeiro, M. N. D. S. Desirability-based multiobjective optimization for global QSAR studies: Application to the design of novel NSAIDs with improved analgesic, antiinflammatory, and ulcerogenic profiles. J. Comput. Chem. 29, 2445–2459 (2008).
Cruz-Monteagudo, M. et al. Desirability-based methods of multiobjective optimization and ranking for global QSAR studies. Filtering safe and potent drug candidates from combinatorial libraries. J. Comb. Chem. 10, 897–913 (2008).
Nicolaou, C. A., Kannas, C. & Loizidou, E. Multi-objective optimization methods in de novo drug design. Mini-Rev. Med. Chem. 12, 979–987 (2012).
Zhavoronkov, A. et al. Deep learning enables rapid identification of potent DDR1 kinase inhibitors. Nat. Biotechnol. 37, 1038. https://doi.org/10.1038/s41587-019-0224-x (2019).
Lavecchia, A. Machine-learning approaches in drug discovery: Methods and applications. Drug Discov. Today 20, 318–331. https://doi.org/10.1016/j.drudis.2014.10.012 (2015).
Hou, T., Wang, J. & Li, Y. ADME evaluation in drug discovery. 8. The prediction of human intestinal absorption by a support vector machine. J. Chem. Inf. Model. 47, 2408–2415. https://doi.org/10.1021/ci7002076 (2007).
Klekota, J. & Roth, F. P. Chemical substructures that enrich for biological activity. Bioinformatics 24, 2518–2525. https://doi.org/10.1093/bioinformatics/btn479 (2008).
Koutsoukas, A. et al. In silico target predictions: Defining a benchmarking data set and comparison of performance of the multiclass Naive Bayes and Parzen-Rosenblatt window. J. Chem. Inf. Model. 53, 1957–1966. https://doi.org/10.1021/ci300435j (2013).
Nigsch, F., Bender, A., Jenkins, J. L. & Mitchell, J. B. O. Ligand-target prediction using winnow and naive Bayesian algorithms and the implications of overall performance statistics. J. Chem. Inf. Model. 48, 2313–2325. https://doi.org/10.1021/ci800079x (2008).
Patel, J. L. & Goyal, R. K. Applications of artificial neural networks in medical science. Curr. Clin. Pharmacol. 2, 217–226 (2007).
Goodarzi, M., Dejaegher, B. & Vander Heyden, Y. Feature selection methods in QSAR studies. J. AOAC Int. 95, 636–651 (2012).
Wu, C. H. et al. Design and synthesis of tetrahydropyridothieno[2,3-d]pyrimidine scaffold based epidermal growth factor receptor (EGFR) kinase inhibitors: The role of side chain chirality and Michael acceptor group for maximal potency. J. Med. Chem. 53, 7316–7326. https://doi.org/10.1021/jm100607r (2010).
Yeh, J. Y. et al. Anti-influenza drug discovery: Structure–activity relationship and mechanistic insight into novel angelicin derivatives. J. Med. Chem. 53, 1519–1533. https://doi.org/10.1021/jm901570x (2010).
Ke, Y. Y. et al. Ligand efficiency based approach for efficient virtual screening of compound libraries. Eur. J. Med. Chem. 83, 226–235. https://doi.org/10.1016/j.ejmech.2014.06.029 (2014).
Ripphausen, P., Nisius, B., Peltason, L. & Bajorath, J. Quo vadis, virtual screening? A comprehensive survey of prospective applications. J. Med. Chem. 53, 8461–8467. https://doi.org/10.1021/jm101020z (2010).
Ripphausen, P., Stumpfe, D. & Bajorath, J. Analysis of structure-based virtual screening studies and characterization of identified active compounds. Future Med. Chem. 4, 603–613. https://doi.org/10.4155/fmc.12.18 (2012).
Ripphausen, P., Nisius, B. & Bajorath, J. State-of-the-art in ligand-based virtual screening. Drug Discov. Today 16, 372–376. https://doi.org/10.1016/j.drudis.2011.02.011 (2011).
Breiman, L. Random forests. Mach. Learn. 45, 5–32. https://doi.org/10.1023/A:1010933404324 (2001).
Efron, B. 1977 Rietz Lecture—bootstrap methods—another look at the Jackknife. Ann. Stat. 7, 1–26. https://doi.org/10.1214/aos/1176344552 (1979).
Ma, J. S., Sheridan, R. P., Liaw, A., Dahl, G. E. & Svetnik, V. Deep neural nets as a method for quantitative structure-activity relationships. J. Chem. Inf. Model. 55, 263–274. https://doi.org/10.1021/ci500747n (2015).
Rogers, D. & Hahn, M. Extended-connectivity fingerprints. J. Chem. Inf. Model. 50, 742–754. https://doi.org/10.1021/ci100050t (2010).
Ghose, A. K. & Crippen, G. M. Atomic physicochemical parameters for three-dimensional-structure-directed quantitative structure-activity relationships. 2. Modeling dispersive and hydrophobic interactions. J. Chem. Inf. Comput. Sci. 27, 21–35. https://doi.org/10.1021/ci00053a005 (1987).
Dearden, J. C., Cronin, M. T. D. & Kaiser, K. L. E. How not to develop a quantitative structure-activity or structure-property relationship (QSAR/QSPR). SAR QSAR Environ. Res. 20, 241–266. https://doi.org/10.1080/10629360902949567 (2009).
Ke, Y. Y. & Lin, T. H. Modeling the ligand–receptor interaction for a series of inhibitors of the capsid protein of enterovirus 71 using several three-dimensional quantitative structure–activity relationship techniques. J. Med. Chem. 49, 4517–4525. https://doi.org/10.1021/jm0511886 (2006).
Cherkasov, A. et al. QSAR modeling: Where have you been? Where are you going to? J. Med. Chem. 57, 4977–5010. https://doi.org/10.1021/jm4004285 (2014).
Todeschini, R., Ballabio, D. & Grisoni, F. Beware of unreliable Q(2)! A comparative study of regression metrics for predictivity assessment of QSAR models. J. Chem. Inf. Model. 56, 1905–1913. https://doi.org/10.1021/acs.jcim.6b00277 (2016).
Consonni, V., Todeschini, R., Ballabio, D. & Grisoni, F. On the misleading use of QF32 for QSAR model comparison. Mol. Inform. https://doi.org/10.1002/Minf.201800029 (2019).
Truchon, J. F. & Bayly, C. I. Evaluating virtual screening methods: Good and bad metrics for the “early recognition” problem. J. Chem. Inf. Model. 47, 488–508. https://doi.org/10.1021/ci600426e (2007).
Baldi, P., Brunak, S., Chauvin, Y., Andersen, C. A. F. & Nielsen, H. Assessing the accuracy of prediction algorithms for classification: An overview. Bioinformatics 16, 412–424. https://doi.org/10.1093/bioinformatics/16.5.412 (2000).
Chen, S. R. et al. Discovery, structure–activity relationship studies, and anti-nociceptive effects of N-(1,2,3,4-tetrahydro-1-isoquinolinylmethyl)benzamides as novel opioid receptor agonists. Eur. J. Med. Chem. 126, 202–217. https://doi.org/10.1016/j.ejmech.2016.09.003 (2017).
Gentleman, R., Hornik, K. & Leisch, F. R 1.5 and the Bioconductor 1.0 releases. Comput. Stat. Data An. 39, 557–558 (2002).
Warr, W. A. Scientific workflow systems: Pipeline Pilot and KNIME. J. Comput. Aid Mol. Des. 26, 801–804. https://doi.org/10.1007/s10822-012-9577-7 (2012).
Wold, S. & Dunn, W. J. Multivariate quantitative structure activity relationships (QSAR)—conditions for their applicability. J. Chem. Inf. Comput. Sci. 23, 6–13. https://doi.org/10.1021/Ci00037a002 (1983).
Hellberg, S., Wold, S., Dunn, W. J., Gasteiger, J. & Hutchings, M. G. The anesthetic activity and toxicity of halogenated ethyl methyl ethers, a multivariate QSAR modeled by PLS. Quant. Struct. Act. Rel. 4, 1–11. https://doi.org/10.1002/qsar.19850040102 (1985).
Luco, J. M. & Ferretti, F. H. QSAR based on multiple linear regression and PLS methods for the anti-HIV activity of a large group of HEPT derivatives. J. Chem. Inf. Comput. Sci. 37, 392–401 (1997).
Lehmann, B. D. et al. Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies. J. Clin. Investig. 121, 2750–2767. https://doi.org/10.1172/JCI45014 (2011).
Lin, S. Y. et al. The in vivo antinociceptive and mu-opioid receptor activating effects of the combination of N-phenyl-2ʹ,4ʹ-dimethyl-4,5ʹ-bi-1,3-thiazol-2-amines and naloxone. Eur. J. Med. Chem. 167, 312–323. https://doi.org/10.1016/j.ejmech.2019.01.063 (2019).
Chao, P. K. et al. 1-(2,4-dibromophenyl)-3,6,6-trimethyl-1,5,6,7-tetrahydro-4H-indazol-4-one a novel opioid receptor agonist with less accompanying gastrointestinal dysfunction than morphine. Anesthesiology 126, 952–966. https://doi.org/10.1097/Aln.0000000000001568 (2017).

Acknowledgements

This work was supported by grants from the Intramural Research Program of the National Health Research Institutes (06A1-BPAP-01-033) and the Ministry of Economic Affairs, Taiwan R.O.C (106-EC-17-A-22-0624).

Author information

These authors contributed equally: Lun K. Tsou, Shiu-Hwa Yeh, Chun-Ping Chang and Shau-Hua Ueng.

Authors and Affiliations

Institute of Biotechnology and Pharmaceutical Research, National Health Research Institutes, Zhunan, 35053, Miaoli County, Taiwan, ROC
Lun K. Tsou, Shiu-Hwa Yeh, Shau-Hua Ueng, Chun-Ping Chang, Jen-Shin Song, Mine-Hsine Wu, Hsiao-Fu Chang, Sheng-Ren Chen, Chuan Shih, Chiung-Tong Chen & Yi-Yu Ke

Contributions

L.T. wrote and organized the manuscript. S.H.Y. and H.F.C. carried out the FLIPR calcium assay to evaluate the MOR experiments. S.H.U. and S.R.C. helped synthesize the MOR agonist. J.S.S. and M.H.W. maintained the high-throughput screening core facility and aided in the TNBC cellular assay. C.P.C. carried out the synthesis of the TNBC inhibitors. Y.Y.K. was the team leader, carried out the computational experiments, and analyzed the results. C.S. suggested some concepts and helped revise the manuscript. C.T.C. was the Chief Investigator and guided the development of the project. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Yi-Yu Ke.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Tsou, L.K., Yeh, SH., Ueng, SH. et al. Comparative study between deep learning and QSAR classifications for TNBC inhibitors and novel GPCR agonist discovery. Sci Rep 10, 16771 (2020). https://doi.org/10.1038/s41598-020-73681-1

Received: 23 March 2020
Accepted: 18 September 2020
Published: 08 October 2020
DOI: https://doi.org/10.1038/s41598-020-73681-1
