Using predictive machine learning models for drug response simulation by calibrating patient-specific pathway signatures

Golriz Khatami, Sepehr; Mubeen, Sarah; Bharadhwaj, Vinay Srinivas; Kodamullil, Alpha Tom; Hofmann-Apitius, Martin; Domingo-Fernández, Daniel

doi:10.1038/s41540-021-00199-1

Download PDF

Article
Open access
Published: 27 October 2021

Using predictive machine learning models for drug response simulation by calibrating patient-specific pathway signatures

npj Systems Biology and Applications volume 7, Article number: 40 (2021) Cite this article

5081 Accesses
5 Citations
3 Altmetric
Metrics details

Subjects

Abstract

The utility of pathway signatures lies in their capability to determine whether a specific pathway or biological process is dysregulated in a given patient. These signatures have been widely used in machine learning (ML) methods for a variety of applications including precision medicine, drug repurposing, and drug discovery. In this work, we leverage highly predictive ML models for drug response simulation in individual patients by calibrating the pathway activity scores of disease samples. Using these ML models and an intuitive scoring algorithm to modify the signatures of patients, we evaluate whether a given sample that was formerly classified as diseased, could be predicted as normal following drug treatment simulation. We then use this technique as a proxy for the identification of potential drug candidates. Furthermore, we demonstrate the ability of our methodology to successfully identify approved and clinically investigated drugs for four different cancers, outperforming six comparable state-of-the-art methods. We also show how this approach can deconvolute a drugs’ mechanism of action and propose combination therapies. Taken together, our methodology could be promising to support clinical decision-making in personalized medicine by simulating a drugs’ effect on a given patient.

Modeling cancer drug response through drug-specific informative genes

Article Open access 23 October 2019

Gene expression based inference of cancer drug sensitivity

Article Open access 27 September 2022

Drug ranking using machine learning systematically predicts the efficacy of anti-cancer drugs

Article Open access 25 March 2021

Introduction

Applying machine learning (ML) methods to biomedical data has enormous potential for the development of personalized therapies,¹ drug repurposing,² and drug discovery.³ The data exploited by these methods can comprise multiple modalities including imaging data,⁴ chemical structure information,⁵ and natural language data.⁶ However, the widespread availability of transcriptomics data (e.g., RNA-Sequencing (RNA-Seq), microarrays, etc.) along with its capacity to provide a comprehensive overview of biological systems have made this particular modality a popular choice for various computational methods. Although this modality can reveal both molecular signatures as well as phenotypic changes that occur in altered states, pathway analyses are often performed to map measured transcripts to the pathway level due to high dimensionality and correlations present in transcriptomics datasets.^7,8 This transformation facilitates the training of ML/AI models by reducing dimensional complexity whilst enhancing interpretive power.⁹ However, such a transformation implicates the use of prior pathway knowledge¹⁰ from databases such as KEGG¹¹ and Reactome.^12,13

The transformation of data from the transcriptomics to the pathway level can be used to generate pathway features (i.e., sets of genes involved in a given pathway that are coordinately up or down-regulated), the latter of which have broad applications in drug discovery and drug response prediction.¹⁴ For instance,^15,16,17 exploited the concept of anti-similarity between drugs and disease-specific pathway signatures to identify therapeutic candidate drugs that can potentially revert disease pathophysiology. Furthermore,¹⁸ shows how pathway signatures derived from cell lines using kernelized Bayesian matrix factorization can be used for drug response prediction.

Alternatively, other methods can generate individualized pathway features from a population of patients or cell lines.¹⁹ These features, or pathway activity scores, can subsequently be used for several downstream ML applications including classification tasks and survival prediction.^8,20 In addition,²¹ showed how ML models can be used to predict drug response using pathway activity scores derived from cell lines. Furthermore, another example from²² demonstrated how modeling individualized pathway activity scores from Fanconi anemia patients can reveal potential targets for therapeutic interventions. Finally, similar approaches have been used to prioritize drug treatments in the cancer context.^23,24

While these methods have shown how pathway signatures can be used for drug discovery and drug response prediction, existing methods thus far fail to account for two important factors. First, as the response triggered by a drug in a given patient may differ if administered in another, these methods should account for patient heterogeneity which is crucial in designing individualized therapies. Second, specific indications may be improved or corrected by a drug combination approach or through the administration of multi-target drugs.

In this work, we present an intuitive methodology that exploits the predictive power of ML models to simulate drug response by calibrating pathway signatures of patients. We first trained an ML model (i.e., elastic net penalized logistic regression model) to discriminate between disease samples and controls based on sample-specific pathway activity scores. Next, we simulate drug responses in patients using a scoring algorithm that modifies a patient’s pathway signatures using existing knowledge on drug-target interactions. We hypothesize that promising drug candidates for a given condition would modify pathway activity scores of patients in such a way that they closely resemble scores of controls. Thus, using the previously trained ML model, we then evaluate whether patients with modified pathway scores are now classified as normal as a proxy for promising drug candidates. We demonstrate the scalability and generalizability of our methodology by simulating over one thousand drugs from two independent drug-target datasets on four cancer indications. Furthermore, we show how our methodology is able to recover a large proportion of clinically investigated drugs on these four indications, outperforming six comparable state-of-the-art methods. Finally, we show how the most relevant pathways identified by our methodology can be used to better understand the biology pertaining to a given condition.

Results

We present a workflow designed to approximate a drug’s effect on a patient by intentional modifications to patient-specific features, specifically, pathway activity scores, by employing highly predictive ML models trained to differentiate between normal and disease samples (Fig. 1). In the first subsection, we validate our approach by (i) evaluating its capability in retrieving FDA-approved drugs and those in clinical trials for multiple cancer datasets and, (ii) comparing the results yielded by our approach against several equivalent methods. Then, in the following two subsections, we investigate the drug candidates prioritized by our approach and the specific pathways targeted by these prioritized drugs, respectively. Finally, we show the utility of our approach in predicting the effects of a combination of drugs for applications in combination therapy and for the identification of potential adverse events associated with drug combinations.

**Fig. 1: Conceptual overview of the drug simulation workflow and case scenario on multiple datasets.**

Validation of the methodology and comparison against equivalent approaches

In this subsection, we investigate the drug candidates prioritized by our methodology in four different cancers and evaluate the ability of our approach to identify approved and clinically investigated drugs (i.e., true positives). Table 1 shows that only a minority of the drugs present in both drug-target datasets were prioritized by our methodology given that a stringent threshold was employed which required that prioritized drugs change the predictions of at least 80% of the patients (see “Materials and Methods” and Supplementary Figs. 7, and 8 for details on the selection of this threshold). Overall, our methodology is able to recover a large proportion of true positives (ranging from 13% to 32%) in all four cancers as well as in both drug-target datasets (Table 1). This wide range may be attributable to a disproportion in the number of true positives that exist for each of the cancer datasets (e.g., BRCA has more than twice as many FDA-approved drugs and drugs in clinical trials than LIHC) as well as to the size of the drug-target datasets (i.e., DrugBank contains twice as many drugs as DrugCentral).

Table 1 Number of FDA-approved and clinically tested drugs recovered for both drug-target datasets (i.e., DrugBank (DB) and DrugCentral (DC)) across the four investigated cancers.

Full size table

As a comparison, the methodology proposed by²⁵ reported lower proportions of true positives than our approach for the BRCA and PRAD datasets with 21.42% and 15.94%, respectively (Supplementary Table 1). Furthermore, four additional methods present that were benchmarked by²⁵ yielded even lower results on the same two cancer datasets (Supplementary Tables 2–8). Similarly,²⁶ also reported a lower proportion of true positives than our approach for the BRCA and PRAD datasets with 0.8% and 0.4%, respectively (Supplementary Table 9). Overall, the performance across all six methods varied from 0% to 11.53% for BRCA, and from 0.50% to 22.22% for PRAD and is summarized in Supplementary Table 10.

In addition, the proportion of true positives yielded by our methodology is significantly higher than what one would expect by chance (see “Materials and Methods”). Furthermore, we compared the number of prioritized drugs found in the original DrugBank and DrugCentral datasets to the number of prioritized drugs obtained in the robustness experiments in which we applied our methodology to drugs with randomly generated targets and target interactions (Supplementary Fig. 1). We found that all permutation experiments yielded a significantly lower number of prioritized drugs. Because our methodology can capture a much greater number of prioritized drugs on a real dataset, this validation highlights the capability of our approach to prioritize drugs with targets in relevant pathways that are key to change the predictions of patients.

As a final remark, we explored the performance of our methodology when varying one of the weights while keeping the other two constant to better understand how sensitive the results are to the selected weights (Supplementary Tables 11, 12). We have observed that the proportions of true positives recovered mainly vary between 15% and 35% in the three test disease datasets for both drug-target datasets when W₁ (i.e., the weight assigned to the quartile that contains the most dysregulated pathways) is in the range of 10–20. There are multiple cases where we found sets of weights yielding better results than the ones presented in Table 1 if exclusively looking at a single or two specific disease datasets (Supplementary Table 13). In contrast, we observed that when weights are low (e.g., W₁ = 1), our approach often does not yield any prioritized drugs (Supplementary Table 14), as in these cases, the modified pathway activity scores are not sufficient enough to change the predictions of the ML model.

In-depth investigation of the prioritized candidate drugs

Apart from the previous quantitative evaluation of our methodology, we conducted an in-depth analysis of the prioritized drugs to better understand the predictions made by our approach. Below, we focus on drugs prioritized using the DrugCentral dataset as this dataset contains a fewer number of prioritized drugs than DrugBank.

In the breast cancer dataset (BRCA), we identified a major class of drugs based on their mechanisms of action (Fig. 2a). This class targeted DNA and RNA metabolism and included commonly used anti-tumor drugs. One example of this group of drugs is fluorouracil, which targets thymidylate synthase, thereby inhibiting the formation of thymidylate from uracil.²⁷ This drug is a chemotherapy medication commonly used to treat several cancers.

**Fig. 2: Pathways targeted by prioritized drugs in DrugCentral for each of the three cancer test datasets.**

In the prostate cancer dataset (PRAD), we found that the majority of drugs were related to hormone metabolism and regulation (Fig. 2c). Due to the key role of sex steroid hormones in its initiation and progression,²⁶ this cancer is classified as hormone-dependent. Thus, current treatments are often directly targeted towards these hormones, such as androgen deprivation therapy, which represents the major therapeutic option for treatment of advanced stages of this cancer.^28,29,30

The third dataset, LIHC, corresponds to hepatocarcinoma. Interestingly, the vast majority of the candidate drugs in this dataset (14/19) are tyrosine kinase inhibitors (TKI) corresponding to anti-tumor drugs already FDA-approved for other cancers³¹ (Fig. 2b). Since these kinases act as regulatory players in several cancer signaling pathways that can be hyperactivated, TKIs are used to “switch-off” these pathways, indirectly inhibiting cell growth.³² One of the predicted drugs is sorafenib, which was the first TKI to be approved for the treatment of liver carcinoma and still remains as a first-line therapy. Similarly, another predicted drug, trametinib, is a dual-kinase inhibitor that is used in the treatment of advanced liver cancer. Finally, two of the remaining non-TKIs are also employed as chemotherapy drugs as they inhibit the synthesis of nucleotides.

Investigation of pathways targeted by the prioritized drugs

Here, we interpret and analyze the results yielded by our methodology for multiple datasets by investigating the pathways targeted by the drugs prioritized through our approach. We identified clusters of pathways belonging to several distinct classes (Fig. 2). Not surprisingly, we found that various metabolic pathways appeared in all three test datasets as the regulation of metabolism plays an important role in numerous cancers. Given that each of the three test datasets were cancer subtypes, intuitively, we also observed several disease-relevant pathways targeted by the prioritized drugs, among which were ~30 cancer-related pathways from KEGG (e.g., prostate cancer, pancreatic cancer, bladder cancer, and breast cancer).

Drugs that were prioritized by our approach (Fig. 2) were likewise clustered based on the pathways they targeted to assess whether drugs that targeted the same pathway fell within the same class of drugs. Prioritized drugs for liver cancer could be clustered into four different classes of tyrosine kinase inhibitors: (i) JAK inhibitors (i.e., sorafenib, vandetanib, erlotinib, and lapatinib), (ii) ALK inhibitors (i.e., lorlatinib), (iii) BCR–Abl (i.e., nilotinib, dasatinib, and imatinib), and (iv) and EGFR inhibitors (i.e., afatinib).³³ In addition, we found MEK kinase inhibitors, specifically trametinib and cobimetinib. Finally, we found that while some drugs were able to change the predictions by targeting only a limited number of pathways (e.g., fludarabine in breast cancer and liver cancer), other drugs could change predictions by targeting several pathways (e.g., tretinoin in prostate cancer and trametinib in liver cancer).

Among the most commonly targeted pathways by the prioritized drugs in liver carcinoma, we found Ras/Raf/MAPK and PI3K/AKT/mTOR signaling, both of which have been reported to play important roles in the development of this type of cancer.³⁴ One of the prioritized drugs, sorafenib, is a multi-kinase inhibitor that targets several kinases including RFA1, PDGFR, and FLT3, which are involved in both tumor proliferation and angiogenesis.^35,36 Sorafenib has been shown to inhibit tumor cell proliferation by blocking the Ras/Raf/MAPK pathway and to inhibit angiogenesis by blocking PDGFR signaling³⁷ (Supplementary Table 15).

Prioritizing combination therapies

Combination therapies are widely used for treating indications like cancer as they can often lead to the inhibition of the compensatory signaling pathways that maintain the growth and survival of tumor cells. Here, we demonstrate how our methodology can be extended to predict the effects of a combination of drugs. To reduce the computational complexity associated with running our methodology on all possible combinations of drug pairs from both drug-target datasets (i.e., DrugBank and DrugCentral), we exclusively applied our method on all possible pairs from the set of prioritized drugs. Table 2 lists a subset of combinations of prioritized drugs, alongside the proportion of patients that they reclassify as normal (i.e., proportion of treated patients).

Table 2 Examples of predicted combination therapies.

Full size table

For two of the three test datasets (i.e, LIHC and PRAD), nearly all drug pairs yielded better results (i.e., larger proportion of disease samples predicted as normal) than the use of a single drug alone. In the BRCA dataset, however, multiple combinations yielded worse results than those observed with single drug therapy. For example, the combination of bromocriptine with valproic acid decreased the proportion of treated patients from 80% to <10%. Specifically, bromocriptine is an adrenergic receptor agonist that stimulates the beta-adrenergic signaling pathway, which in turn prompts tumor angiogenesis and cancer development.³⁸ Similarly, valproic acid is a histone deacetylase which also induces beta-adrenergic signaling, thus promoting cancer progression.³⁹ Therefore, the combination of these two drugs not only fails to treat the cancer, but may in fact lead to the worsening of the condition.

Discussion

Here, we have presented a powerful machine learning framework to simulate drug responses for applications in drug discovery and precision medicine. We demonstrate our methodology on four different cancer datasets and two independent drug-target datasets by using patient-specific pathway signatures to train highly predictive models which we use as a proxy for drug candidate identification. Across all datasets, our results yielded a larger proportion of FDA-approved drugs as well as drugs investigated in clinical trials than six comparable approaches for the indications we studied, suggesting that other drugs prioritized by our methodology may also represent promising candidates for repurposing. In addition, in contrast to the other methodologies, our approach is able to prioritize drugs for individual patients, making it suitable for precision medicine applications. Finally, we also show how our methodology can be applied to propose drug combinations as well as to reveal sets of dysregulated pathways that could be used as possible targets.

Currently, there exist several limitations to this study; first, although our scoring algorithm used to simulate drug response has been shown to yield promising results in the four datasets analyzed, other scoring algorithms may be better suited for different datasets and/or applications. For instance, we could tailor the current scoring algorithm for drug discovery to learn pathway signatures from approved drugs and use these drugs to prioritize candidates that exhibit similar patterns of activity. Second, although we recommend the selection of weights following a similar logic to the one we have presented here (i.e., assigning larger weights to the quartile containing the most dysregulated pathways and lower weights for others), it may be the case that weights must be tuned for other datasets to yield promising candidates. Third, since our methodology relies on pathway signatures derived from transcriptomics data, it is inherently limited to indications where this modality is highly predictive. In other words, pathway activity scores must be readily separable between disease and normal samples in the disease we investigate as we require highly predictive models that can guarantee the change in the predicted class label is exclusively caused by the drug simulation step and not by the lack of accuracy of the model. Thus, it would be less effective in indications where transcriptomics have limited prediction power to discriminate between normal and disease samples, such as Parkinson’s disease.⁴⁰ Finally, while we have demonstrated our approach with a commonly used sample-wise enrichment method, ssGSEA does not take network topology into consideration. Thus, in the future, other enrichment methods that leverage the topological information of pathways can be used to generate the pathway activity scores used by our algorithm.

Beyond this proof-of-concept, our methodology can be extended to include several additional functionalities. For instance, drug administration could be simulated in an ML model that takes into consideration temporal dimensions (e.g., event-based models,⁴¹ survival analysis⁴²). Furthermore, in this paper we trained a simple ML model, nonetheless, the same strategy could be applied to more complex ML or AI models. Since the elastic net penalty encourages sparsity, one may also use the coefficients of an ML model as a preliminary method of filtering for significant features. To save time, the total set of drug candidates can be subset to only those which directly affect the features that significantly affect the prediction capabilities of the model. In addition, we restricted our analysis to a single pathway database as it was sufficient to deploy a predictive ML model for the specific classification task we presented. However, by incorporating pathway information from other databases into the ML model, we can increase the total number and coverage of pathways to potentially reveal additional pathway targets. Similarly, the use of different drug-target databases such as ExCAPE-DB⁴³ could broaden the chemical space and lead to the identification of new candidates. By combining brute-force and reverse engineering approaches, one can also identify the most effective pathway scores a drug should target for any given indication; thus, tailoring the presented methodology towards drug discovery. Finally, due to limited data for all possible responses a given patient could have to a particular drug in large cohorts, we relied upon classic drug repurposing validation strategies to demonstrate the efficacy of our approach. However, with enough training data, our methodology could be deployed to support clinical decision-making in personalized medicine by simulating the effect of drugs on individual patients.

Materials and methods

The initial step of our methodology consists of generating patient-specific features that can be used for model training. Although in this work, we employed pathway activity scores (see subsection “Calculating individualized pathway activity scores”), other features could also be used for the same purpose. Using these scores, we trained an ML model (subsection “Building a predictive classifier”) that can accurately discriminate between sample classes (e.g., disease vs normal). Next, we developed a scoring algorithm aimed to simulate the effect of a drug intervention at the pathway-level by modifying the pathway activity scores of disease samples (subsection “Scoring algorithm”). Then, the method uses the modified pathway activity scores as an input in the trained model to assess whether samples that were previously classified as “diseased” could now be classified as “normal” as a proxy for drug candidates (Subsection “Drug response prediction and prioritization”). Then we validate and evaluate our approach by presenting the datasets used for our case scenario and comparing our methodology against six equivalent approaches. Finally, we provide details on the implementation.

Datasets

Datasets from The Cancer Genome Atlas (TCGA)⁴⁴ were retrieved from the Genomic Data Commons (GDC; https://gdc.cancer.gov) portal through the R/Bioconductor package, TCGAbiolinks (version 2.16.3;⁴⁵) on 04-08-2020 (Fig. 1d). Gene expression data from RNA-Seq was quantified using the HTSeq and raw read counts were normalized using Fragments Per Kilobase of transcript per Million mapped reads upper quartile (FPKM-UQ). Gene identifiers were mapped to HUGO Gene Nomenclature Committee (HGNC) symbols where possible. The datasets downloaded include The Cancer Genome Atlas Breast Invasive Carcinoma (TCGA-BRCA), The Cancer Genome Atlas Prostate Adenocarcinoma (TCGA-PRAD), The Cancer Genome Atlas Liver Hepatocellular Carcinoma (TCGA-LIHC), and The Cancer Genome Atlas Kidney Renal Clear Cell Carcinoma (TCGA-KIRC) (Supplementary Table 16). We would like to note that due to the design of our methodology, we required the datasets to have a large sample size to conduct the hyperparameter optimization of the ML model and the cross validation strategy described below.

Calculating individualized pathway activity scores

We used single-sample GSEA (ssGSEA),⁴⁶ a commonly used tool to generate patient-specific pathway activity scores. Normalized gene expression (FPKM-UQ) and pathway definitions (i.e., gene sets) were provided as input and were converted to scores through ssGSEA (Supplementary Table 17; Supplementary Fig. 5). As a reference database, we used 337 pathways from KEGG (downloaded on 01-04-2020) as it is the most widely used pathway database and a standard for the most commonly used pathway activity scoring methods¹⁸ (Fig. 1d).

Building a predictive classifier

Patient-specific pathway activity scores generated by ssGSEA were used to generate a ML classifier to distinguish between normal and tumor sample labels for each of the four datasets. The classification was conducted using an elastic net penalized logistic regression model⁴⁷ as regularized models have been shown to be generally well suited for -omics data which typically contains a disproportionate number of features to samples, and specifically well suited for these datasets.²¹ Furthermore, we previously used this ML model on the same TCGA datasets,¹⁹ yielding AUC-ROC and AUC-PR values close to 1 (Supplementary Fig. 6), in line with Mubeen et al. (2019). Prediction performance was evaluated via 10 times repeated 10-fold stratified cross-validation and tuning of elastic net hyper-parameters (i.e., l₁, l₂ regularization parameters) via grid search was performed within the cross-validation loop to avoid over-optimism.⁴⁸

Scoring algorithm

To modify the pathway activity scores for disease samples, we developed a scoring algorithm to replicate the effect of a drug at the pathway-level. The scoring algorithm exploits interactions from drug-target datasets to modify the activity scores of pathways containing the target(s) of a drug (see example in Supplementary Fig. 10). We describe the scoring algorithm in Box 1.

For each drug-pathway association, the pathway is assigned an effect score ES which is equivalent to a drug’s effect on a protein target coming from drug-target datasets (i.e., activation and inhibition relationships given +1 and −1 labels, respectively). For pathways that contain multiple protein targets, the ES is equivalent to the mean of these effects (e.g., if a drug activates a protein in a pathway but also inhibits a second protein in the same pathway, the overall effect of the drug on the pathway (ES) would be 0). The absolute values of the mean differences between healthy and disease groups are calculated for each pathway μ_H-D(p) while their quartiles are then computed on line 2. Then, from lines 3–12, for each disease sample, if the ES of a pathway p is less than or greater than 0, the scoring algorithm calculates a calibration score CS as the product of the absolute value of the original pathway activity score PAS, the weight w, and the effect of the drug on the pathway sgn(p) (i.e., −1, 0 or 1). We assign w based on the quartile μ_H-D(p) pathway p falls into. For pathways with larger mean differences between groups, weights are assigned greater values, while pathways with smaller differences are weighted less (see example in Supplementary Text 1). On lines 13–14, if the ES of a pathway p is 0, the CS is assigned the value of the original PAS. Finally, on line 15, the CS is returned as a score that simulates the effect of a drug on a pathway for a disease sample.

Box 1 Scoring algorithm pseudocode. The pseudocode outlines the scoring algorithm used to modify the pathway activity scores of a given patient

Scoring Algorithm

Require:

Set of pathways containing the protein target(s) of the drug, $\left\{ {P|p \in P} \right\}$

Set of samples, $\left\{ {S|s \in S} \right\}$

Set of healthy and disease samples, $\left\{ {H,D|H,D \in S|\forall h \in H,d \in D} \right\}$

Set of target labels, $\left\{ {T|t \in T} \right\}$

Array consisting of effect scores for all pathways,

$$\left\{ {{{{{{\mathrm{ES}}}}}}|{{{{{\mathrm{ES}}}}}}\left( p \right) \in {{{{{\mathrm{ES}}}}}},{{{{{\mathrm{ES}}}}}}\left( p \right) = \frac{1}{N}\mathop {\sum }\limits_{j = 0}^N t_j\left( p \right)} \right\}$$

Where, N is the number of targets that are affected by a drug in pathway p

Matrix consisting of original pathway activity scores for disease samples, ${{{{{\mathrm{PAS}}}}}}$

Array consisting of the absolute values of mean differences between sample groups for each $p$, $\mu _{{{{{{\mathrm{H}}}}}} - {{{{{\mathrm{D}}}}}}} = \left| {\mu _{{{{{\mathrm{H}}}}}} - \mu _{{{{{\mathrm{D}}}}}}} \right|$

1: function SCORING_FUNCTION $\left( {D,P,{{{{{\mathrm{ES}}}}}},{{{{{\mathrm{PAS}}}}}},\mu _{{{{{{\mathrm{H}}}}}} - {{{{{\mathrm{D}}}}}}}} \right)$

2: Compute quartiles, $Q_1,Q_2,Q_3$, for all values of $\mu _{{{{{{\mathrm{H}}}}}} - {{{{{\mathrm{D}}}}}}}$

3: for all $d \in D$ do

4: for all $p \in P$ do

5: $sgn\left( p \right): = \left\{ {\begin{array}{*{20}{l}} { - 1} \hfill & {{{{{{\mathrm{if}}}}}}\;{{{{{\mathrm{ES}}}}}}\left( p \right) \, < \, 0,} \hfill \\ 0 \hfill & {{{{{{\mathrm{if}}}}}}\;{{{{{\mathrm{ES}}}}}}\left( p \right) = 0,} \hfill \\ 1 \hfill & {{{{{{\mathrm{if}}}}}}\;{{{{{\mathrm{ES}}}}}}\left( p \right) \, > \, 0.} \hfill \end{array}} \right.$

6: if $ES \, \ne \, 0$ then

7: if $\mu _{H - D}\left( p \right) \in \left( {Q_3, + \infty } \right)$ then

8: ${{{{{\mathrm{CS}}}}}}\left( {p,d} \right) = \left| {{{{{{\mathrm{PAS}}}}}}\left( {p,d} \right)} \right| \ast \left( {w_1 \ast sgn\left( p \right)} \right)$

9: else if $\mu _{H - D}\left( p \right) \in |Q_2,Q_3|$ then

10: ${{{{{\mathrm{CS}}}}}}\left( {p,d} \right) = |{{{{{\mathrm{PAS}}}}}}\left( {p,d} \right)| \ast \left( {w_2 \ast sgn\left( p \right)} \right)$

11: else

12: ${{{{{\mathrm{CS}}}}}}\left( {p,d} \right) = |{{{{{\mathrm{PAS}}}}}}\left( {p,d} \right)| \ast \left( {w_3 \ast sgn\left( p \right)} \right)$

13: else

14: ${{{{{\mathrm{CS}}}}}}\left( {p,d} \right) = {{{{{\mathrm{PAS}}}}}}\left( {p,d} \right)$

$\Rightarrow \, {\mathrm{CS}}$, Matrix consisting of calibrated pathway scores after drug treatment

15: return ${{{{{\mathrm{CS}}}}}}$

Drug response prediction and prioritization

The methodology then aims at identifying drug candidates based on the predicted response of a patient to the simulated drug treatment. To do so, we input the modified features generated by the scoring algorithm in the trained ML model and re-evaluate the new class assignment of the patient.

Since the ML model has learnt to accurately differentiate between normal and disease samples, we expect that if a drug fails to affect a set of relevant pathways, the labels of the disease samples would remain unchanged. However, if the drug were to target a set of pathways dysregulated in a disease, we expect that the scoring algorithm could modify the scores so that they resemble those observed in control samples. Thus, by inputting these modified scores into the trained ML model, we can assess whether disease samples can now be classified as normal. Finally, after re-evaluating the predictions made by the ML model, we can rank promising drugs by the proportion of disease samples that are classified as normal as a proxy of the effectiveness of the drug.

Validation and robustness analysis

Here, we outline the robustness experiments conducted to assess the ability of our methodology to identify drugs which are already FDA-approved or have been tested in clinical trials for each of the four cancer types (i.e., TCGA datasets).

First, to simulate drug treatment using the scoring algorithm described in Box 1, we used two different drug-target datasets: DrugBank (version 5.1.6)⁴⁹ and DrugCentral (version 9.18.2020).⁵⁰ For each of the datasets, we mapped drugs to DrugBank identifiers and protein targets to HGNC symbols. In total, we retrieved 1346 unique drugs and 4673 drug-target interactions from DrugBank and 638 unique drugs and 1481 drug-target interactions from DrugCentral. Here, we would like to note that both datasets are largely overlapping (Supplementary Fig. 11). We then used these drug-target interactions as the input to our methodology to simulate patient treatments (Fig. 1e).

For validation purposes, we used two ground-truth lists containing drug-disease pairs as true positives to verify the predictions made by our methodology (Fig. 1f). The first ground-truth list contained FDA-approved drugs for the four cancer types manually retrieved from the National Cancer Institute (https://www.cancer.gov/about-cancer/treatment/drugs/cancer-type) which we mapped to the two drug-target datasets previously described. The second ground-truth list contained drugs investigated in clinical trials for the four cancer datasets retrieved from the ClinicalTrials.gov website (downloaded on 16.04.2020). Table 3 lists the number of approved and clinically tested drugs present in both drug-target datasets across the four investigated cancers.

Table 3 Number of FDA-approved and clinically tested drugs present in both drug-target datasets across the four investigated cancers.

Full size table

As validation, both ground-truth lists were compared against the list of prioritized drugs that, according to our methodology, changed the predictions of 80% of the patients and subsequently classified them as normal. This threshold was selected as there were no drugs that changed the prediction for 90% or more of the patients with the parameters used by our scoring algorithm (Supplementary Figs. 7, 8). In addition, we would like to note that the vast majority of the drugs do not change the predictions for most patients. Thus, we were exclusively interested in assessing the ability of our approach to recover true positives (i.e., positive predictive value) from the list of prioritized drugs. However, since our methodology aims to prioritize drug candidates, it suffers from an early retrieval problem.⁵¹ Furthermore, only a small minority of drugs from the drug-target datasets can be used as positive labels for each of the indications, while the majority of drugs are not known to have therapeutic benefits for them, thus, creating a large imbalance between positive and negative labels. Due to these reasons, we maintain that the evaluation strategy we present is more suitable than other conventional metrics such as the receiver operating characteristic (ROC) curves.

To identify a set of weights for the three quartiles (i.e., Q₁, Q₂ and Q₃ (see Box 1)) that perform well in three cancer test datasets, we followed a similar strategy to²⁶ where we tested different weight combinations with the intention of assigning larger weights to pathways with significantly higher dysregulations between disease and normal samples. We would like to note that the purpose of using weights in the algorithm was to modify the pathway activity scores of the few but relevant pathways targeted by the drug while maintaining the underlying distribution of pathway scores (Supplementary Fig. 9). We performed the drug simulation and conducted this parameter optimization independently on the three cancer test datasets on DrugBank, the first of two drug-target datasets. Consequently, we found a set of weights (i.e., W₁ = 20, W₂ = 5, and W₃ = 10 for Q₃ (the upper quartile representing the most dysregulated pathways), Q₂ (middle quartile), and Q₁ (lower quartile), respectively), that yielded both a large proportion of true positives among the prioritized drugs and also performed better than any of the six methods we compared our methodology against, as described below. Finally, we validated whether this same set of weights could also yield a large proportion of true positives on the second drug-target dataset (i.e., DrugCentral) as well as the fourth cancer dataset (i.e, KIRC).

To test the robustness of our methodology, we replicated our experiments by generating one hundred sets of 1346 drugs (the size of the DrugBank dataset) where each drug was assigned to a randomly selected protein target (from the set of all HGNC symbols) with a random causal effect following the same distribution as the original dataset (i.e., activation or inhibition). Next, we compared the number of drugs prioritized by these permutation experiments against the number of drugs prioritized by our methodology for the DrugBank dataset in the three cancer test datasets. Since we use a method to generate pathway activity scores that ignores network topology (i.e., ssGSEA), we did not conduct a robustness analysis that focused on perturbing pathway networks.

Performance comparison against equivalent drug-repurposing approaches

To evaluate our methodology, we compared it to six similar approaches that also employ transcriptomics data and pathway information to repurpose drugs on the BRCA and PRAD datasets^25,26 (note that the LIHC dataset is not included in their analyses). In the first of the two studies,²⁵ evaluated the ability of their methodology and four additional approaches to predict known drugs (i.e., FDA-approved or in advanced clinical trials) for breast and prostate cancer. Similarly,²⁶ reported the ability of their approach to identify FDA-approved drugs on the same datasets. We were thus able to directly compare the proportion of true positives that were recovered by other approaches as reported in the aforementioned studies against the proportion recovered by our approach.

Implementation

We performed ssGSEA with the Python package, GSEApy (version 0.9.12; https://github.com/zqfang/gseapy) and generated the ML models using scikit-learn.⁵² We would like to note that ssGSEA does not take the topology of the pathways into account.

Data availability

Data used in this manuscript are available at https://github.com/sepehrgolriz/simdrugs under the Apache 2.0 License.

Code availability

Source code used in this manuscript is available at https://github.com/sepehrgolriz/simdrugs under the Apache 2.0 License.

References

Pai, S. et al. netDx: Interpretable patient classification using integrated patient similarity networks. Mol. Syst. Biol. 15, e8497 (2019).
Article PubMed PubMed Central Google Scholar
Zhao, K. & So, H. C. Using drug expression profiles and machine learning approach for drug repurposing. Computational methods for drug repurposing, 219–237. Humana Press, New York, NY (2019).
Réda, C. et al. Machine learning applications in drug development. Computational Struct. Biotechnol. J. 18, 241–252 (2020).
Article CAS Google Scholar
Liu, S. et al. Early diagnosis of Alzheimer’s disease with deep learning. IEEE 11th international symposium on biomedical imaging (ISBI) 1015–1018 (2014).
Hirohara, M. et al. Convolutional neural network based on SMILES representation of compounds for detecting chemical motifs. BMC Bioinforma. 19, 526 (2018).
Article CAS Google Scholar
Castro, V. M. et al. Large-scale identification of patients with cerebral aneurysms using natural language processing. Neurology 88, 164–168 (2017).
Article PubMed PubMed Central Google Scholar
Su, J., Yoon, B. J. & Dougherty, E. R. Accurate and reliable cancer classification based on probabilistic inference of pathway activity. PloS ONE 4, e8161 (2009).
Article PubMed PubMed Central CAS Google Scholar
Lim, S. et al. Comprehensive and critical evaluation of individualized pathway activity measurement tools on pan-cancer data. Brief. Bioinforma. 21, 36–46 (2020).
CAS Google Scholar
Reimand, J. et al. Pathway enrichment analysis and visualization of omics data using g: Profiler, GSEA, Cytoscape and EnrichmentMap. Nat. Protoc. 14, 482–517 (2019).
Article CAS PubMed PubMed Central Google Scholar
Perscheid, C. Integrative biomarker detection on high-dimensional gene expression data sets: a survey on prior knowledge approaches. Brief. Bioinforma. 22, bbaa151 (2020).
Article Google Scholar
Kanehisa, M., Furumichi, M., Tanabe, M., Sato, Y. & Morishima, K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 45, D353–D361 (2017).
Article CAS PubMed Google Scholar
Jassal, B. et al. The reactome pathway knowledgebase. Nucleic Acids Res. 48, D498–D503 (2020).
CAS PubMed Google Scholar
Nguyen, T. M., Shafi, A., Nguyen, T. & Draghici, S. Identifying significantly impacted pathways: a comprehensive review and assessment. Genome Biol. 20, 1–15 (2019).
CAS Google Scholar
Adam, G. et al. Machine learning approaches to drug response prediction: challenges and recent progress. npj Precis. Oncol. 4, 1–10 (2020).
CAS Google Scholar
Peyvandipour, A., Saberian, N., Shafi, A., Donato, M. & Draghici, S. A novel computational approach for drug repurposing using systems biology. Bioinformatics 34, 2817–2825 (2018).
Article CAS PubMed PubMed Central Google Scholar
Saberian, N., Peyvandipour, A., Donato, M., Ansari, S. & Draghici, S. A new computational drug repurposing method using established disease–drug pair knowledge. Bioinformatics 35, 3672–3678 (2019).
Article CAS PubMed PubMed Central Google Scholar
Emon, M. A. et al. PS4DR: a multimodal workflow for identification and prioritization of drugs based on pathway signatures. BMC Bioinforma. 21, 1–21 (2020).
Article Google Scholar
Ammad-ud-din, M. et al. Drug response prediction by inferring pathway-response associations with kernelized Bayesian matrix factorization. Bioinformatics 32, i455–i463 (2016).
Article CAS PubMed Google Scholar
Amadoz, A. et al. A comparison of mechanistic signaling pathway activity analysis methods. Brief. Bioinforma. 20, 1655–1668 (2019).
Article Google Scholar
Mubeen, S. et al. The impact of pathway database choice on statistical enrichment analysis and predictive modeling. Front. Genet. 10, 1203 (2019).
Article CAS PubMed PubMed Central Google Scholar
Smith, A. M. et al. Standard machine learning approaches outperform deep representation learning on phenotype prediction from transcriptomics data. BMC Bioinforma. 21, 119 (2020).
Article Google Scholar
Esteban-Medina, M. et al. Exploring the druggable space around the Fanconi anemia pathway using machine learning and mechanistic models. BMC Bioinforma. 20, 370 (2019).
Article Google Scholar
Cubuk, C. et al. Gene expression integration into pathway modules reveals a pan-cancer metabolic landscape. Cancer Res. 78, 6059–6072 (2018).
Article CAS PubMed Google Scholar
Çubuk, C. et al. Differential metabolic activity and discovery of therapeutic targets using summarized metabolic pathway models. NPJ Syst. Biol. Appl. 5, 1–11 (2019).
Article Google Scholar
Chan, J., Wang, X., Turner, J. A., Baldwin, N. E. & Gu, J. Breaking the paradigm: Dr Insight empowers signature-free, enhanced drug repurposing. Bioinformatics 35, 2818–2826 (2019).
Article CAS PubMed PubMed Central Google Scholar
Chen, H. R., Sherr, D. H., Hu, Z. & DeLisi, C. A network based approach to drug repositioning identifies plausible candidates for breast cancer and prostate cancer. BMC Med. Genomics 9, 1–11 (2016).
Article CAS Google Scholar
Zhang, N. et al. 5-Fluorouracil: mechanisms of resistance and reversal strategies. Molecules 13, 1551–1569 (2008).
Article CAS PubMed PubMed Central Google Scholar
Snaterse, G. et al. Circulating steroid hormone variations throughout different stages of prostate cancer. Endocr.-Relat. Cancer 24, R403–R420 (2017).
Article CAS PubMed Google Scholar
Harris, W. P. et al. Androgen deprivation therapy: progress in understanding mechanisms of resistance and optimizing androgen depletion. Nat. Clin. Pract. Urol. 6, 76–85 (2009).
Article CAS PubMed PubMed Central Google Scholar
Karantanos, T., Corn, P. G. & Thompson, T. C. Prostate cancer progression after androgen deprivation therapy: mechanisms of castrate resistance and novel therapeutic approaches. Oncogene 32, 5501–5511 (2013).
Article CAS PubMed PubMed Central Google Scholar
Huynh, H. Tyrosine kinase inhibitors to treat liver cancer. Expert Opin. Emerg. Drugs 15, 13–26 (2010).
Article CAS PubMed Google Scholar
Khoo T. S. W. L., Rehman A. & Olynyk J. K. Tyrosine kinase inhibitors in the treatment of hepatocellular carcinoma. Exon Publications. 127–139 (2019).
Bhullar, K. S. et al. Kinase-targeted cancer therapies: progress, challenges and future directions. Mol. Cancer 17, 1–20 (2018).
Article CAS Google Scholar
Gedaly, R. et al. PI-103 and sorafenib inhibit hepatocellular carcinoma cell proliferation by blocking Ras/Raf/MAPK and PI3K/AKT/mTOR pathways. Anticancer Res. 30, 4951–4958 (2010).
CAS PubMed PubMed Central Google Scholar
Mousa, A. B. Sorafenib in the treatment of advanced hepatocellular carcinoma. Saudi J. Gastroenterol.: Off. J. Saudi Gastroenterol. Assoc. 14, 40 (2008).
Article Google Scholar
Zhu, Y. J., Zheng, B., Wang, H. Y. & Chen, L. New knowledge of the mechanisms of sorafenib resistance in liver cancer. Acta Pharmacologica Sin. 38, 614–622 (2017).
Article CAS Google Scholar
Llovet, J. M. et al. Sorafenib in advanced hepatocellular carcinoma. N. Engl. J. Med. 359, 378–390 (2008).
Article CAS PubMed Google Scholar
Chen, H. et al. Adrenergic signaling promotes angiogenesis through endothelial cell-tumor cell crosstalk. Endocr.-Relat. Cancer 21, 783–795 (2014).
Article CAS PubMed Google Scholar
Hulsurkar, M. et al. Beta-adrenergic signaling promotes tumor angiogenesis and prostate cancer progression through HDAC2-mediated suppression of thrombospondin-1. Oncogene 36, 1525–1536 (2017).
Article CAS PubMed Google Scholar
Chen-Plotkin, A. S. Blood transcriptomics for Parkinson disease? Nat. Rev. Neurol. 14, 5–6 (2018).
Article PubMed Google Scholar
Fonteijn, H. M. et al. An event-based model for disease progression and its application in familial Alzheimer’s disease and Huntington’s disease. NeuroImage 60, 1880–1889 (2012).
Article PubMed Google Scholar
Tibshirani, R. The lasso method for variable selection in the Cox model. Stat. Med. 16, 385–395 (1997).
Article CAS PubMed Google Scholar
Sun, J. et al. ExCAPE-DB: an integrated large scale dataset facilitating Big Data analysis in chemogenomics. J. Cheminformatics 9, 17 (2017).
Article CAS Google Scholar
Weinstein, J. N. et al. The cancer genome atlas pan-cancer analysis project. Nat. Genet. 45, 1113 (2013).
Article PubMed PubMed Central CAS Google Scholar
Colaprico, A. et al. TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 44, e71–e71 (2015).
Article PubMed PubMed Central CAS Google Scholar
Barbie, D. A. et al. Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1. Nature 462, 108 (2009).
Article CAS PubMed PubMed Central Google Scholar
Zou, H. & Trevor, H. Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B 67, 301–320 (2005).
Article Google Scholar
Molinaro, A. M., Simon, R. & Pfeiffer, R. M. Prediction error estimation: a comparison of resampling methods. Bioinformatics 21, 3301–3307 (2005).
Article CAS PubMed Google Scholar
Knox, C. et al. DrugBank 3.0: a comprehensive resource for ‘omics’ research on drugs. Nucleic Acids Res. 39, D1035–D1041 (2010).
Article PubMed PubMed Central CAS Google Scholar
Ursu, O. et al. DrugCentral: online drug compendium. Nucleic Acids Res. 45, D932–D939 (2016).
Article PubMed PubMed Central CAS Google Scholar
Berrar, D. & Flach, P. Caveats and pitfalls of ROC analysis in clinical microarray research (and how to avoid them). Brief. Bioinforma. 13, 83–97 (2012).
Article Google Scholar
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Google Scholar
Kim, R. et al. A Phase I trial of trametinib in combination with sorafenib in patients with advanced hepatocellular cancer. Oncologist 25, e1893–e1899 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zhang, J. et al. Erlotinib for advanced hepatocellular carcinoma: a systematic review of phase II/III clinical trials. Saudi Med. J. 37, 1184 (2016).
Article PubMed PubMed Central Google Scholar
Di Gennaro, E. et al. Vorinostat synergises with capecitabine through upregulation of thymidine phosphorylase. Br. J. Cancer 103, 1680–1691 (2010).
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

The authors would like to thank Colin Birkenbihl and Lauren DeLong for their valuable feedback. The authors would like to thank the reviewers for their valuable feedback and suggestions. This work was developed in the Fraunhofer Cluster of Excellence “Cognitive Internet Technologies”.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin, 53757, Germany
Sepehr Golriz Khatami, Sarah Mubeen, Vinay Srinivas Bharadhwaj, Alpha Tom Kodamullil, Martin Hofmann-Apitius & Daniel Domingo-Fernández
Bonn-Aachen International Center for Information Technology (B-IT), University of Bonn, 53115, Bonn, Germany
Sepehr Golriz Khatami, Sarah Mubeen, Vinay Srinivas Bharadhwaj & Martin Hofmann-Apitius
Fraunhofer Center for Machine Learning, Sankt Augustin, Germany
Sarah Mubeen & Daniel Domingo-Fernández
Enveda Biosciences, Boulder, CO, 80301, USA
Daniel Domingo-Fernández

Authors

Sepehr Golriz Khatami
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Mubeen
View author publications
You can also search for this author in PubMed Google Scholar
Vinay Srinivas Bharadhwaj
View author publications
You can also search for this author in PubMed Google Scholar
Alpha Tom Kodamullil
View author publications
You can also search for this author in PubMed Google Scholar
Martin Hofmann-Apitius
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Domingo-Fernández
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.D.F. conceived and designed the study. S.G.K. implemented the scoring algorithm and ran the validation experiments with assistance from D.D.F. S.G.K. analyzed the case scenario and M.H.A. and D.D.F. assisted in the interpretation of the results. S.M. processed and prepared the datasets. S.M. and S.G.K. ran the datasets with the pathway enrichment method to generate the pathway activity scores. S.M. and V.S.B. trained the ML models. A.T.K., M.H.A., and D.D.F. acquired the funding. S.G.K., S.M., and D.D.F. wrote the paper. All authors have read and approved the final paper.

Corresponding authors

Correspondence to Sepehr Golriz Khatami or Daniel Domingo-Fernández.

Ethics declarations

Competing interests

D.D.F. received salary from Enveda Biosciences.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Data 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Golriz Khatami, S., Mubeen, S., Bharadhwaj, V.S. et al. Using predictive machine learning models for drug response simulation by calibrating patient-specific pathway signatures. npj Syst Biol Appl 7, 40 (2021). https://doi.org/10.1038/s41540-021-00199-1

Download citation

Received: 02 April 2021
Accepted: 21 September 2021
Published: 27 October 2021
DOI: https://doi.org/10.1038/s41540-021-00199-1

Subjects

Abstract

Similar content being viewed by others

Modeling cancer drug response through drug-specific informative genes

Gene expression based inference of cancer drug sensitivity

Drug ranking using machine learning systematically predicts the efficacy of anti-cancer drugs

Introduction

Results

Validation of the methodology and comparison against equivalent approaches

In-depth investigation of the prioritized candidate drugs

Investigation of pathways targeted by the prioritized drugs

Prioritizing combination therapies

Discussion

Materials and methods

Datasets

Calculating individualized pathway activity scores

Building a predictive classifier

Scoring algorithm

Drug response prediction and prioritization

Validation and robustness analysis

Performance comparison against equivalent drug-repurposing approaches

Implementation

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

Supplementary Data 1

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links