Abstract
We present comboFM, a machine learning framework for predicting the responses of drug combinations in preclinical studies, such as those based on cell lines or patient-derived cells. comboFM models the cell context-specific drug interactions through higher-order tensors, and efficiently learns latent factors of the tensor using powerful factorization machines. The approach enables comboFM to leverage information from previous experiments performed on similar drugs and cells when predicting responses of new combinations in so far untested cells; thereby, it achieves highly accurate predictions despite sparsely populated data tensors. We demonstrate high predictive performance of comboFM in various prediction scenarios using data from cancer cell line pharmacogenomic screens. Subsequent experimental validation of a set of previously untested drug combinations further supports the practical and robust applicability of comboFM. For instance, we confirm a novel synergy between anaplastic lymphoma kinase (ALK) inhibitor crizotinib and proteasome inhibitor bortezomib in lymphoma cells. Overall, our results demonstrate that comboFM provides an effective means for systematic prescreening of drug combinations to support precision oncology applications.
Introduction
Combination therapies are often required for treating cancer patients with advanced stages of the disease. In addition to overcoming monotherapy resistance, combinatorial treatments can also reduce toxicity of the treatment (by reduced doses of the drugs) and improve therapeutic efficacy (by multi-targeting effect)^{1,2,3}. With recent advances in high-throughput screening methods, a systematic evaluation of combinations among large collections of chemical compounds has become feasible. This typically leads to large-scale experiments, in which the combinatorial responses are tested in various doses of the individual compounds, resulting in dose–response matrices that capture the measured combination effects for every concentration pair in a particular sample (e.g., cancer cell line or patient-derived cells)^{4}. However, even with modern high-throughput instruments, experimental screening of drug combinations quickly becomes impractical, as the number of conceivable drug combinations increases rapidly with the number of drugs in consideration. In addition, the inherent heterogeneity of cancer cells poses further challenges for the experimental efforts, as the combinations need to be tested in various cell contexts and genomic backgrounds^{5,6}. Therefore, computational methods are often being used to guide the discovery of effective combinations to be prioritized for further preclinical and clinical validation^{7,8}.
In recent years, machine learning has emerged as a powerful approach to aid the drug development process by offering systematic means for the prediction of target bioactivities and drug-induced effects^{9,10,11,12,13}, thereby providing guidance for drug discovery and repositioning efforts^{14,15}. Until recently, the performance of machine learning methods in predicting drug combination effects was limited by the lack of high-quality training data^{8}. However, this is gradually changing as increasing amounts of data from preclinical drug combination screens are becoming available, therefore creating new opportunities also for the application of large-scale machine learning methods^{4,5,16}. For instance, the NCI-ALMANAC dataset generated by the US National Cancer Institute (NCI) provides over 3 million experimentally measured drug combination responses across various cell lines and tissue types^{4}. However, despite the potential value of such datasets, the high dimensionality of the underlying dose–response data and the inherent complexity of drug interaction patterns across various doses pose challenges to accurate modeling of drug combination effects.
Several computational tools have been proposed for the prediction of drug combinations^{2,7,8,17}. Many of these tools have been systematically benchmarked in two crowdsourced DREAM Challenge competitions^{18,19}, which demonstrated that computational predictions can achieve high accuracies for selected drug classes, provided there is enough drug information and training data available. However, the focus of these challenges and most of the previously proposed methods has been on directly predicting drug combination synergies (i.e., whether the combined summary effect is higher than expected). In many practical applications, however, more detailed information on dose–response effects of the combinations is required, rather than simply classifying the summary effects into synergistic or antagonistic classes. Furthermore, as noted in the recent AstraZeneca-Sanger drug combination prediction DREAM challenge^{19}, the performance of the computational methods typically relies on selective incorporation of target features and biological knowledge that is not always available for all drugs and cell models. Therefore, there is a need to develop integrative and robust models capable of generalizing and learning from large amounts of available data that facilitate the exploration of the extensive combinatorial drug and dose spaces.
Here, we present comboFM, a novel machine learning framework for systematic modeling of drug-dose combination effects in a cell context-specific manner. It is generally applicable to any preclinical model systems, such as patient-derived primary cells, but we demonstrate its performance here in cancer cell lines (Fig. 1). We base our work on the observation that the drug combination dose–response data can be compiled into a higher-order tensor indexed by drugs, drug concentrations, and cell lines. comboFM then models the cell line-specific responses to a combination of drugs as an interaction between the different modes of the tensor using a higher-order factorization machine (FM)^{20}, a recently proposed machine learning approach for nonlinear learning on large data. FMs have been shown to be compelling tools with the ability to work particularly well with high-dimensional and sparse datasets^{20,21,22}. In contrast to existing machine learning models, comboFM enables one to explore the detailed landscape of drug combination responses across various doses. We demonstrate that comboFM obtains high prediction accuracy in various practical application scenarios, significantly outperforming other approaches. Furthermore, we show the robustness and practical potential of comboFM by experimentally validating untested drug combinations predicted for specific cell lines.
Results
Overview of comboFM model
comboFM was developed for predicting drug combination responses of cancer cell lines in three practical scenarios (Fig. 1a). The first scenario of predicting new dose–response matrix entries corresponds to filling in the gaps in partially measured dose–response matrices. In the second scenario of new dose–response matrix inference, the predictions are made for completely held out dose–response matrices of untested drug–drug–cell line triplets, such that the drug pair has still been observed in other cell lines. In the third and most challenging scenario of new drug combination inference, the predictions are made for completely new drug combinations with no available combination measurements in any cell line, thereby providing guidance on repositioning of the drugs for new combinations and cell contexts.
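The three scenarios differ mainly in what is held out together. A minimal sketch of one way to build such splits is given below, assuming each measurement is a tuple `(drug_a, drug_b, cell_line, entry_id)`; the record layout, the scenario names, and the functions `group_key` and `assign_folds` are illustrative assumptions, not the authors' implementation.

```python
import random

def group_key(record, scenario):
    """Return the unit that must stay inside a single fold for a scenario."""
    drug_a, drug_b, cell_line, entry_id = record
    pair = tuple(sorted((drug_a, drug_b)))  # drug order is irrelevant
    if scenario == "new_entries":
        # Single matrix entries are held out independently.
        return (pair, cell_line, entry_id)
    if scenario == "new_matrices":
        # A whole dose-response matrix (drug pair in one cell line) is held out.
        return (pair, cell_line)
    if scenario == "new_combinations":
        # A drug pair is held out in every cell line at once.
        return pair
    raise ValueError(f"unknown scenario: {scenario}")

def assign_folds(records, scenario, n_folds=10, seed=0):
    """Assign a fold index to each record so that groups are never split."""
    keys = sorted({group_key(r, scenario) for r in records})
    random.Random(seed).shuffle(keys)
    fold_of_key = {key: i % n_folds for i, key in enumerate(keys)}
    return [fold_of_key[group_key(r, scenario)] for r in records]
```

Random grouping of this kind does not by itself guarantee that, in the second scenario, every held-out drug pair is still observed in some other cell line of the training folds; an actual split would need to check that condition explicitly.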
To capture the high-order interactions between drug combinations in different cell lines and at various doses, comboFM models the multi-way interactions between the two drugs, the cell lines and the dose–response matrices as a fifth-order data tensor X (Fig. 1b). Furthermore, comboFM makes it possible to integrate any auxiliary data of the drugs and cell lines, such as chemical descriptors in the form of molecular fingerprints of drug compounds, gene expression profiles of the cancer cell lines and concentration values tested for the drugs.
For the learning algorithm, the data tensor X is flattened into a two-dimensional array (Fig. 1c), where each row vector x identifies a single entry in the original tensor. Given the associated responses y_{i} in the training data, the comboFM model is learned using factorization machines (FMs). Higher-order FMs learn a nonlinear regression model from the input features (x) to the output (y) by estimating a regression weight \({w}_{{i}_{1},...,{i}_{t}}\) for each combination of input features \({x}_{{i}_{1}}\cdot {x}_{{i}_{2}}\cdots {x}_{{i}_{t}}\), where t is the order of the interaction. However, instead of estimating the weights \({w}_{{i}_{1},...,{i}_{t}}\) separately as in polynomial regression, FMs approximate the weights using factorized parametrization (Fig. 1d), where the weights are coupled through multiplication of latent factors learned by the FM. This approach avoids the computational and statistical problems that would result from directly estimating the weight tensor W. In addition, the coupling of the weights allows effective learning in situations where the data tensor is sparsely populated.
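The flattening step can be illustrated with a small sketch in which each tensor entry becomes a concatenation of one-hot blocks, one per tensor mode. The mode order (drug A, drug B, concentration A, concentration B, cell line), the function names, and the choice of four concentration levels are assumptions made for illustration only; auxiliary descriptors, when used, would be appended as further columns.

```python
def one_hot(index, size):
    """Indicator vector with a single 1.0 at the given position."""
    row = [0.0] * size
    row[index] = 1.0
    return row

def encode_entry(drug_a, drug_b, conc_a, conc_b, cell_line, sizes):
    """Flatten one tensor entry into a sparse feature row.

    sizes = (n_drugs, n_concs, n_cells); all arguments are integer indices.
    """
    n_drugs, n_concs, n_cells = sizes
    return (one_hot(drug_a, n_drugs) + one_hot(drug_b, n_drugs)
            + one_hot(conc_a, n_concs) + one_hot(conc_b, n_concs)
            + one_hot(cell_line, n_cells))

# 50 drugs and 60 cell lines as in the development subset;
# 4 concentration levels is a hypothetical value.
x = encode_entry(3, 7, 0, 2, 11, sizes=(50, 4, 60))
```

Each row then has exactly five non-zero entries, so a fifth-order interaction term selects one specific combination of drugs, doses, and cell line.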
Accurate drug combination response predictions by comboFM
To systematically evaluate the comboFM model, we used the anticancer drug combination response data from the NCI-ALMANAC study^{4}. To enable various splits of data into different cross-validation folds as required by the different prediction scenarios and to keep the computational complexity manageable, we considered a subset of the data consisting of 50 unique FDA-approved drugs (Supplementary Table 3) in 617 distinct combinations screened in various concentration pairs across all the 60 cell lines originating from 9 tissue types^{23}. In this data subset, a total of 333,180 drug combination response measurements and 222,120 monotherapy response measurements of single drugs are available in the form of percentage growth of the cell lines (see “Methods”). To computationally quantify the performance of comboFM in predicting drug combination responses and optimize the model parameters, we performed a 10 × 5 (10 outer folds, 5 inner folds) nested cross-validation (CV) procedure under the three prediction scenarios (see “Methods”). The order of the feature interactions modeled by the FM was set to m = 5, according to the order of the underlying tensor.
To investigate the benefit of considering higher-order feature interactions, we also performed experiments using both second-order formulation of FMs and first-order FMs (corresponding to ridge regression). To further benchmark the predictive performance of comboFM, we applied random forest (RF) as a reference model, a widely used machine learning model that is based on a rather different learning principle, and has previously been used for modeling drug combination effects^{24,25,26,27,28}, including the winning method of the recent AstraZeneca-Sanger drug combination prediction DREAM Challenge^{19}. The cross-validation folds were held fixed throughout the experiments to ensure a fair comparison. We assessed the predictive performance of the methods using root mean squared error (RMSE), as well as Pearson and Spearman correlation between original and predicted dose–response matrices.
By leveraging the multi-way interactions present in the underlying high-dimensional drug combination space across drugs, drug concentrations, and cancer cell lines, the 5th order comboFM demonstrated high predictive accuracy in all the three prediction scenarios (Fig. 2), outperforming the random forest reference (p < 10^{−10} in all prediction scenarios, two-sided Wilcoxon paired signed rank sum test, N = 666,360). In the scenarios of predicting new dose–response matrix entries and new dose–response matrices, the 5th order comboFM obtained a Pearson correlation of 0.97, and even in the new drug combination prediction scenario, the 5th order comboFM obtained a Pearson correlation of 0.95. The 5th order comboFM was also markedly more accurate than both the 1st and 2nd order comboFMs in all the three scenarios. Similar relative performance of the methods was also observed using Spearman correlation and RMSE (Fig. 2). In addition, the distribution of the predictions by 5th order comboFM followed that of the measured responses most accurately (Supplementary Fig. 1).
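For reference, the three evaluation measures can be written out in a few lines. The following is a plain-Python sketch of the standard definitions (with average ranks for ties in the Spearman correlation); the function names are illustrative, and it is not the evaluation code used in the study.

```python
from math import sqrt

def rmse(y_true, y_pred):
    """Root mean squared error between measured and predicted responses."""
    return sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def pearson(a, b):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)

def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # mean of positions start+1 .. end+1
        for pos in range(start, end + 1):
            ranks[order[pos]] = shared
        start = end + 1
    return ranks

def spearman(a, b):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(a), average_ranks(b))
```

RMSE is sensitive to the absolute scale of the percentage growth values, whereas the two correlations only assess linear and monotone agreement, which is why the three measures are reported together.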
In addition to the global predictive performance of the methods, we analyzed also their performance in different tissue types and across the various types of drug combination therapies (Fig. 3 and Supplementary Figs. 2–4 and Supplementary Table 1). In all the three prediction scenarios (Fig. 3a–c), comboFM showed the highest average prediction accuracy in each of the tissue types, and also the smallest variance across the tissue types. The combination response in colon cancer appeared marginally more difficult to predict than the other tissue types, which is likely explained by higher variation in the colon cancer response data, as the number of colon cancer cell lines was similar to the other tissue types and thus the marginally inferior performance is unlikely to stem from limited data quantity. Nevertheless, the 5th order comboFM was still the most accurate method also in colon cancer cell lines. Furthermore, comboFM was shown to provide high accuracies across various types of combination therapies (chemotherapies, targeted therapies, and other therapies, such as hormonal therapies) (Fig. 3d–f). The combination therapies involving drugs from the Other class include the smallest number of observations, explaining their reduced predictive accuracy with all the methods.
To further validate the performance of the 5th order comboFM, we also evaluated its predictive accuracy in the remaining part of the NCI-ALMANAC data that was not used in the cross-validation, consisting of 4737 distinct drug combinations. The model was trained on the full development dataset of 617 drug combinations as well as the monotherapy responses of the single drugs in the validation set, and the trained model was then used for predicting responses of the 4737 drug combinations in the validation set across the various cell lines. 5th order comboFM demonstrated high predictive accuracy also in this validation set (Supplementary Figs. 5 and 6), with Pearson correlation of 0.91 even for combinations where neither drug had previously been observed in any other combination, i.e. only the monotherapy responses of the individual drugs in the combination were available to the model.
Synergy scores can be recovered with high accuracy based on the predicted dose–response matrices
As the interest in drug combination experiments often lies in discovering the most synergistic drug combinations, we also quantified drug combination synergies based on the dose–response matrices predicted with comboFM. As a synergy quantification model, we applied NCI ComboScore (see “Methods”)^{4}, computed over the complete predicted dose–response matrices. Although drug combinations with an NCI ComboScore above zero are technically defined to be synergistic, combinations with highly synergistic effects are typically considered as more attractive candidates for further experimental validation. Therefore, we labeled the extreme synergistic drug combinations (observed NCI ComboScore value in the top 10%) as the positive class and the remaining combinations, including lowly synergistic, additive, and antagonist combinations, as the negative class.
Drug combination synergy scores were recovered with a high accuracy from the dose–response matrices predicted by the 5th order comboFM in all three prediction scenarios, significantly outperforming the other compared methods (Supplementary Fig. 7). Importantly, the drug combination synergies could be accurately computed based on the predicted dose–response matrices using 5th order comboFM even in the challenging scenario of predicting new drug combinations, with a Pearson correlation of 0.72 (p < 10^{−10}, two-sided t-test, N = 74,040) between the observed and predicted NCI ComboScores. In the task of discriminating highly synergistic drug combinations, the 5th order comboFM obtained a high area under the receiver operating characteristic curve (AUC) of 0.91 in the new drug combination prediction task (Supplementary Fig. 8). The discrimination accuracies were at high level in each prediction scenario, and when using various top% extreme synergy combinations (Supplementary Fig. 8).
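The discrimination task can be sketched as follows: combinations whose observed score falls in the top 10% form the positive class, and the AUC is the probability that a randomly chosen positive receives a higher predicted score than a randomly chosen negative. The function names and the handling of ties at the threshold are assumptions of this sketch, and the quadratic pairwise loop is chosen for clarity rather than speed.

```python
def label_top_fraction(observed_scores, fraction=0.10):
    """Mark the highest-scoring fraction of combinations as positives."""
    n_pos = max(1, round(fraction * len(observed_scores)))
    threshold = sorted(observed_scores, reverse=True)[n_pos - 1]
    # Ties at the threshold are all labeled positive (an assumption).
    return [s >= threshold for s in observed_scores]

def auc(labels, predicted_scores):
    """Area under the ROC curve via pairwise comparison of scores."""
    pos = [p for is_pos, p in zip(labels, predicted_scores) if is_pos]
    neg = [p for is_pos, p in zip(labels, predicted_scores) if not is_pos]
    # A positive ranked above a negative counts 1, a tie counts 0.5.
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))
```

Changing `fraction` corresponds to evaluating other top-% definitions of extreme synergy.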
Experimental validation of the most synergistic predicted drug combinations
To further demonstrate the ability of comboFM to predict novel and robust drug combinations, the model was trained using all the available dose–response measurements in the development dataset, and the trained comboFM was then used to predict dose–response matrices for remaining unmeasured drug combinations across all the 60 cell lines, which resulted in a total of 10,320 predicted complete dose–response matrices. Experimental validation was performed subsequently on a subset of 16 drug combinations specific for 4 cell lines (Supplementary Table 2), where high synergy was predicted by comboFM. These combinations were selected to mainly involve molecularly targeted therapies, as the recent interest has increasingly evolved toward targeted agents over the standard cytotoxic chemotherapies. In particular, we focused on cancer-specific drug combinations which were predicted to have highly synergistic effects only in a subset of all the cell lines and tissue types. This poses a more challenging task than identifying broadly toxic combinations that kill most cancer cells, but which may also induce severe toxicities in the healthy cells. As in the previous experiments, we considered as highly synergistic those combinations with observed NCI ComboScore values in the top 10% in a particular tissue type.
The results of the experimental validation of 16 drug–drug–cell line triplets are summarised in Fig. 4, using the Bliss model to quantify the observed synergy. The background histogram shows a distribution of an in-house drug combination dataset, consisting of 60 drug combinations tested against 16 KRAS-mutant pancreatic ductal adenocarcinoma cell lines. Since the combinations in the reference set were not randomly selected, the background synergy distribution shows a slight positive bias; however, since the assay was the same as the one used for the experimental validation of comboFM predictions (“Methods”), it is expected to provide a valid reference distribution for statistical evaluations. All the drug combinations predicted by comboFM were validated as synergistic, when considering positive Bliss score as evidence for a degree of synergy (p < 10^{−4}, binomial test against the background distribution). Importantly, 9 out of 16 combinations had a Bliss synergy score higher than 90% of the background distribution (p < 10^{−5}, binomial test). In addition to Bliss synergy score, we also computed the synergy scores using three other popular synergy models: Loewe, highest single agent (HSA) and zero-interaction potency (ZIP) scores (Supplementary Figs. 9 and 10). These results demonstrate the robustness of the comboFM predictions across various experimental setups and synergy scoring models.
Among others, comboFM predicted a particularly high level of synergy for the combination between anaplastic lymphoma kinase (ALK) inhibitor crizotinib and proteasome inhibitor bortezomib in lymphoma cell line SR. In addition to our in-house experimental validations, this finding was further validated in external measurements in the NCI-ALMANAC data that were not used as part of comboFM training data. The ALK inhibitors are effective against cancers harboring ALK fusions. The SR cell line carries the NPM1-ALK fusion, which is the first ever discovered ALK fusion in large-cell lymphoma^{29}. Bortezomib is approved for mantle cell lymphoma, supporting its potential in lymphoma treatment. It is likely that two even mildly effective inhibitors when used in combination may enhance the inhibition effect and potentially overcome monotherapy resistance. Notably, comboFM made this prediction without knowledge of the ALK fusion status of the SR cell line, i.e., this biological rationale was not available for the model. The prediction of high synergy between the first-generation inhibitors of ALK and proteasome for lymphoma cell lines highlights the potential of comboFM to predict biologically plausible combination effects.
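Two quantities in this paragraph lend themselves to a short worked sketch. The Bliss independence model takes the expected combined effect of two independently acting drugs, for fractional inhibitions a and b, to be a + b − ab, and scores synergy as the observed excess over that expectation. The binomial tail below assumes a one-sided test in which, under the null hypothesis, each validated combination exceeds the 90th percentile of the background with probability 0.1; this is one plausible reading of the reported test, not a description of the authors' exact procedure, and the function names are illustrative.

```python
from math import comb

def bliss_excess(inhib_a, inhib_b, inhib_ab):
    """Observed minus Bliss-expected effect, all as fractions in [0, 1]."""
    expected = inhib_a + inhib_b - inhib_a * inhib_b
    return inhib_ab - expected

def binomial_upper_tail(successes, trials, p_null):
    """P(X >= successes) for X ~ Binomial(trials, p_null)."""
    return sum(comb(trials, j) * p_null ** j * (1 - p_null) ** (trials - j)
               for j in range(successes, trials + 1))

# 9 of 16 combinations above the background 90th percentile.
p_value = binomial_upper_tail(9, 16, 0.1)
```

Under these assumptions the tail probability is on the order of 10^{−6}, which is consistent with the reported bound of p < 10^{−5}.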
The comboFM model identified also another unique drug combination effective against the SR cell line, the combination of EGFR inhibitor gefitinib with an approved chemotherapy lomustine for lymphoma treatment. One of the mechanisms inducing resistance to ALK inhibitors is activation of EGFR, as they signal through similar downstream pathways. Brigatinib, a dual ALK/EGFR inhibitor, is therefore being explored in clinical settings against lymphoma and lung cancer patients (NCT01449461). Our comboFM method predicted combination partners to extensively explored ALK and EGFR inhibitors for lymphoma, which we were able to also validate in the experimental setting (Fig. 4). These examples show the potential of comboFM to identify novel combinations of both targeted and cytotoxic treatments, that individually are already used as lymphoma treatments, and therefore are likely to have acceptable toxicity profiles in clinical applications.
Discussion
Given the enormous number of conceivable drug and dose combinations, computational approaches are needed to accelerate the experimental work by providing guidance toward identifying the most promising drug combinations for further experimental validation. While large datasets of drug combination dose–response matrices have already been tested in the lab, extensive gaps still remain in the combinatorial space among both targeted and non-targeted therapies, as well as hormonal and immunotherapies. Here, we have presented a novel machine learning framework, comboFM, for large-scale systematic prediction of drug combination effects in human cancer cell lines. The obtained results demonstrate that comboFM can leverage predictive higher-order relationships between drugs, drug concentrations, and cancer cell line responses, which were missed when using random forest and simpler approaches, including 1st and 2nd order formulation of comboFM. Importantly, comboFM can accurately generalize the predictions also for new drug combinations not observed in the training space, which enables one to systematically predict dose–response matrices also for so far untested drug combinations formed by the individual drugs in the training set. This will provide guidance on repositioning the drugs into new combinations. We also demonstrated that comboFM consistently obtains high prediction performance across various tissue types and classes of drug combination therapy. In addition, 5th order comboFM was 3 times faster to train compared to the random forest reference when run on the same CPU and considering a relatively conservative 200 training epochs for the comboFM model (Supplementary Table 2). Further performance advantages were obtained by employing a GPU for training the 5th order comboFM model (34 times faster compared to random forest).
Modeling the drug combination effects first at the level of dose–response matrices and subsequently quantifying the level of overall drug combination synergy over the full matrix provides many benefits compared to approaches that directly aim at predicting the drug combination synergies. First of all, predicting the underlying dose–response matrices enables one to leverage all the information contained in the dose–response matrices and provides detailed information of the response landscape across various dose combinations. In addition, in the second stage, one is not limited only by a single synergy quantification model, but can explore the synergies using various models, hence gaining a more comprehensive view of the synergistic drug combination landscapes^{30}. Furthermore, understanding the drug combination effects both at the dose level as well as at the synergy level provides useful guidance for precision medicine efforts. For instance, combination synergies observed at lower doses are often better tolerated in the clinical practice. Furthermore, it has been shown that for most of the FDA-approved drug combinations, only little evidence of additivity or synergy was observed in preclinical models^{31}, highlighting that synergy is not always needed for clinical treatment success. However, it has also been argued that patient stratification based on predictive markers is likely to reduce variability in clinical therapy responses, and contribute to achieving truly synergistic responses to combination treatments^{32}.
In-house experimental validations of the top-synergistic combinations predicted using the NCI-ALMANAC data demonstrated that the comboFM predictions are robust also to the experimental setup. The in-house assay had many experimental differences when compared to the combination assay used to profile the NCI-ALMANAC development dataset. In particular, the in-house assay measured the drug combination responses in the form of percentage inhibition, instead of percentage growth that is used in the NCI-ALMANAC assay. Therefore, we could not calculate the NCI ComboScore for the experimental validations, but instead scored the combinations using four popular synergy models (Supplementary Figs. 9 and 10). As an example, comboFM predicted a pivotal role of histone deacetylase (HDAC) in melanoma cell line MALME-3M, thereby suggesting potential of HDAC inhibition against melanoma. In particular, various combinations with HDAC inhibitor romidepsin were predicted to be effective against BRAF-mutant melanoma cell line MALME-3M, which also held true in the experimental settings (Fig. 4). Even though most of the drugs in the romidepsin combinations have already been explored in different combinations to target melanoma^{33,34}, the combinations predicted by comboFM have remained unexplored against melanoma, and warrant further investigation. Individually, each of these inhibitors has shown promising results in preclinical or clinical settings against melanoma, further supporting their use in combination therapies.
Even though the main objective of this work was to develop and carefully validate the comboFM model in cancer cell lines as an accurate methodology for systematic prediction of drug combination responses for biological discovery, we note that many of the drugs identified by comboFM have been or are currently being explored in clinical settings against the specific cancer type, either as single agents or in combination with other drugs (see Supplementary Table 5). For instance, HDAC inhibitor vorinostat is being tested against BRAF-mutant advanced melanoma in an ongoing clinical trial (ref. ^{35}; NCT02836548). Similarly, mTOR inhibitor everolimus is shown to selectively target BRAF-mutant melanoma in acidic condition^{36}. In an ongoing clinical trial, mTOR inhibitors everolimus or temsirolimus in combination with BRAF inhibitor are being investigated against BRAF-mutant advanced solid tumors (NCT01596140). SMO inhibitor vismodegib blocks the Hedgehog pathway, which regulates skin growth. In the case of medulloblastoma, HDAC inhibitors are active against even SMO inhibitor-resistant cell lines^{37}. Hence, concurrent use of HDAC and SMO inhibitors is a promising strategy to target melanoma, as predicted by the romidepsin and vismodegib combination (Fig. 4). In the same line of rationale, combining an HDAC inhibitor with DNA-damaging agents, such as oxaliplatin, dactinomycin, and cladribine, holds strong promise and is being explored in different preclinical and clinical settings^{33,34,38,39}.
These case examples already unveil the potential of our method for predicting combinations with translational potential, although these findings warrant further validation in proper clinical trials. Furthermore, once the model accuracy has been confirmed in the cell line resources, we envision that the carefully validated model will be applicable also to data from individual cancer patients, thereby providing means for tailoring effective combinations in precision oncology applications. For selected cancer types, such as haematological malignancies, molecular and drug response profiling data are becoming available from patient-derived primary cells that can be used for training cancer type-specific prediction models^{40,41}. Once similar data from other cancer types becomes available, comboFM will enable also pan-cancer analyses, similar to the current analyses in the NCI-ALMANAC cell lines. We found that many of the combinations predicted in the NCI-ALMANAC cell lines have actually already been tested in clinical trials (Supplementary Table 5). Interestingly, most of the combinations are tested in different indications than what was predicted based on the cell lines, suggesting further drug repurposing opportunities. The comboFM predictions require input data that start to be routinely available in many functional precision medicine studies, making it therefore broadly applicable for many cancer types and therapy classes.
In the present study, we assumed that one knows the monotherapy responses of single drugs prior to predicting the combination responses, as in practice it is often needed to know the concentration ranges and potencies of the single drugs (i.e., dose–response curves) in order to know which dose combinations should be used in combination testing, and also how potent the compounds are individually. comboFM strongly benefits from this information due to its capability to interpolate in the space of dose–response matrices through the computation of latent factors representing similarly behaving drug combinations from the response tensor alone (similarly to recommender systems grouping users by the movies they have liked in the past), while the drug and cell line descriptors merely fine-tune the predictions. It is plausible that by careful experimental design, one could minimize the number of monotherapy responses needed for accurate dose–response matrix prediction^{42} whilst maintaining the accuracy of the comboFM model, which we leave as an interesting future research topic. However, in a scenario where one would like to perform predictions for completely new molecules with no prior monotherapy or combination response data in any cell line, the computed latent factors are no longer helpful, and none of the methods could perform well with the current design (Supplementary Fig. 13). This limitation of the methodology in such scenarios could potentially be addressed by more extensive feature engineering or by developing models that are specialized for the case of predicting dose–response matrices for combinations of completely new drugs.
As with any high-throughput preclinical data, the cell line drug response profiles may show inconsistency in experimental outputs across the same cell line-treatment pairs^{43}. Therefore, we argue that it is important to develop and initially evaluate the prediction models in large enough and standardized cell line resources, such as NCI-ALMANAC, to avoid any reproducibility issues in the development phase. We further tested the model predictions using distinct experimental setups in the same cell lines to show that the predictions were robust enough against such biological and technical variability.
In conclusion, given the high cost of the experimental screening of drug combinations, comboFM has the potential to provide time- and cost-effective means toward prioritizing the most promising drug combinations for further preclinical or clinical studies. The accurate and robust drug combination response predictions provide a promising approach to streamline the development and expansion of combination therapeutics in personalized cancer treatment. This could ultimately accelerate the clinical use of combination therapeutics to combat acquired drug resistance and to increase therapeutic efficacies.
Methods
Higher-order factorization machines
comboFM uses higher-order factorization machines (HOFMs)^{20,21} for predicting the drug–drug combination responses. HOFMs are nonlinear regression models learned from a training set of examples, each consisting of a feature vector \({\bf{x}}\in {{\mathbb{R}}}^{d}\) and an output label \(y\in {\mathbb{R}}\).
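A trained HOFM models the output \(y\in {\mathbb{R}}\) as a function of single, pairwise, and higher-order interactions between input features up to order m:
$$\hat{y}({\bf{x}})=\sum_{i=1}^{d}{w}_{i}{x}_{i}+\sum_{{i}_{1}<{i}_{2}}{w}_{{i}_{1},{i}_{2}}{x}_{{i}_{1}}{x}_{{i}_{2}}+\cdots +\sum_{{i}_{1}<\cdots <{i}_{m}}{w}_{{i}_{1},\ldots ,{i}_{m}}{x}_{{i}_{1}}\cdots {x}_{{i}_{m}}$$
The first term corresponds to a linear model, and all parameters w_{i} are independently estimated. The higher-order parameters are, on the other hand, estimated in a factorized form
$${w}_{{i}_{1},\ldots ,{i}_{m}}=\langle {{\bf{p}}}_{{i}_{1}}^{m},\ldots ,{{\bf{p}}}_{{i}_{m}}^{m}\rangle$$
where \({{\bf{p}}}_{i}^{m}\in {{\mathbb{R}}}^{k}\) denotes the mth-order factor weight of feature i, k is the hyperparameter defining the rank of the factorization, and
$$\langle {{\bf{a}}}_{1},\ldots ,{{\bf{a}}}_{m}\rangle =\sum_{j=1}^{k}\prod_{i=1}^{m}{a}_{ij}$$
denotes a generalized inner product of m vectors \({{\bf{a}}}_{i}\in {{\mathbb{R}}}^{k},i=1,\ldots ,m\) that generalizes the usual pairwise inner product 〈a, b〉 = a^{T}b to sets of m vectors.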
The factor weights are collected into matrices \({{\bf{P}}}^{(m)}={({{\bf{p}}}_{1}^{m},\ldots ,{{\bf{p}}}_{d}^{m})}^{T}\in {{\mathbb{R}}}^{d\times k}\). The factorized parametrization drastically reduces the number of estimated parameters from O(d^{m}) (every feature combination has its own parameter) to O(kdm) (m − 1 factor matrices of dimension d × k). In principle, HOFMs allow a separate rank k_{t} for each order t = 2, ..., m. In the above description and in our experiments, we used a uniform rank k = k_{2} = … = k_{m}.
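To make the role of the factor matrices concrete, the following is a minimal brute-force sketch of an HOFM prediction. It is not the implementation used in the study: the function names, the dictionary layout of the factor matrices, and the toy numbers are all illustrative assumptions, and it enumerates feature combinations explicitly, which practical implementations avoid.

```python
from itertools import combinations
from math import prod


def generalized_inner_product(vectors):
    """Sum over the k latent dimensions of the elementwise product of the vectors."""
    k = len(vectors[0])
    return sum(prod(v[j] for v in vectors) for j in range(k))


def hofm_predict(x, w, P):
    """Brute-force HOFM prediction (illustrative, exponential in the order).

    x : list of d feature values
    w : list of d linear weights
    P : dict mapping each order t (2..m) to a d x k factor matrix (list of rows)
    """
    d = len(x)
    y_hat = sum(w_i * x_i for w_i, x_i in zip(w, x))
    # Only non-zero features can contribute to an interaction term.
    active = [i for i in range(d) if x[i] != 0]
    for t, P_t in P.items():
        for idx in combinations(active, t):
            weight = generalized_inner_product([P_t[i] for i in idx])
            y_hat += weight * prod(x[i] for i in idx)
    return y_hat


# Toy example (made-up numbers): d = 3 features, rank k = 2, orders 2 and 3.
x = [1.0, 1.0, 1.0]
w = [0.5, -0.5, 1.0]
P = {
    2: [[1.0, 0.0], [1.0, 1.0], [0.0, 2.0]],
    3: [[1.0, 1.0], [2.0, 1.0], [1.0, 3.0]],
}
prediction = hofm_predict(x, w, P)  # linear 1.0 + pairwise 3.0 + third-order 5.0
```

Each interaction weight is never stored explicitly; it is recomputed from the rows of the factor matrix for the participating features, which is what keeps the parameter count linear in d.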
FMs are based on the assumption that the effects of pairwise and higher-order feature interactions have a low rank, which allows FMs to estimate reliable parameters even from highly sparse data. Hence, the co-occurrence of x_{i} and \({x}_{i^{\prime} }\) does not need to be observed in order to learn \({w}_{i,i^{\prime} }\): the factors \({{\bf{p}}}_{i:}\) and \({{\bf{p}}}_{i^{\prime} :}\) can be learned from their interactions with other dimensions, and the dot product of \({{\bf{p}}}_{i:}\) and \({{\bf{p}}}_{i^{\prime} :}\) still gives \({w}_{i,i^{\prime} }\). This is extremely useful for high-dimensional drug combination data, where the input tensor is typically very sparse: it allows reliable inference of the responses to new drug combinations whose individual components have been observed in other combinations elsewhere in the training tensor. Compared to standard matrix factorization approaches, FMs provide additional flexibility by allowing integration of auxiliary data describing the drugs and cell lines, such as chemical and genomic descriptors.
The objective function of learning higher-order factorization machines is to minimize the regularized mean squared error
where β_{1}, …, β_{m} > 0 are regularization parameters. To limit the number of hyperparameter combinations to search, following the work by Blondel et al.^{20}, we set β_{1} = … = β_{m} and used a uniform rank k = k_{2} = … = k_{m}. In the experiments, we used a recent TensorFlow implementation of higher-order factorization machines^{44}.
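As a sketch only, a regularized mean squared error of this kind can be written as follows for n training examples \(({{\bf{x}}}^{(j)},{y}^{(j)})\), with \(\hat{y}\) denoting the model output, \({\bf{w}}\) the vector of linear weights, and \({{\bf{P}}}^{(t)}\) the factor matrix of order t. The training-set notation and the exact scaling of the penalty terms are assumptions in the spirit of the general HOFM formulation of Blondel et al.^{20}; only the roles of β_{1}, …, β_{m} are taken from the text.

$$\mathop{\min }\limits_{{\bf{w}},{{\bf{P}}}^{(2)},\ldots ,{{\bf{P}}}^{(m)}}\ \frac{1}{n}\sum_{j=1}^{n}{\left({y}^{(j)}-\hat{y}({{\bf{x}}}^{(j)})\right)}^{2}+{\beta }_{1}\parallel {\bf{w}}{\parallel }^{2}+\sum_{t=2}^{m}{\beta }_{t}\parallel {{\bf{P}}}^{(t)}{\parallel }_{F}^{2}$$

With β_{1} = … = β_{m} = β, the penalty reduces to a single weight β shared by the linear term and all factor matrices, so that only β and the rank k remain to be tuned.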
On the NCI-ALMANAC data, increasing the order and the rank of the factorization machine both improve the predictive performance (Pearson correlation) of the comboFM model (Supplementary Fig. 12). The predictive performance increases steeply until order 5, which matches the intrinsic order of the data tensor X (see Fig. 1b), and then continues to increase more slowly. The performance increase due to increasing rank of the factorization is rapid until around rank 50 and then continues more slowly. There is no apparent overfitting even with a factorization order as high as 10 and a rank as high as 150.
Synergy quantification
As the interest often lies in discovering the most synergistic drug combinations, we quantify the drug combination synergies based on the predicted dose–response matrices. To compute the synergy scores, we apply the NCI ComboScore, which was introduced along with the NCI-ALMANAC dataset^{4} and was originally modified from the Bliss independence score.
The NCI ComboScore for drug A and drug B is defined as the sum of the deviations between expected and observed responses over all concentrations p and q:
where y_{c}(A_{p}, B_{q}) is the combination growth fraction of the cell line exposed to drug A in concentration p and drug B in concentration q, and y_{e}(A_{p}, B_{q}) is the expected growth fraction for the combination defined based on the monotherapy effects of drug A and drug B as follows:
where y_{m}(A_{p}) and y_{m}(B_{q}) denote the monotherapy effects of drug A in concentration p and drug B in concentration q, respectively. We applied \({\tilde{y}}_{m}=\min ({y}_{m},150)\) that truncates the growth fraction at 150, with the threshold selected based on the histogram of the measured drug combination responses (Supplementary Fig. 11).
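The scoring described above can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' code: the Bliss-type case distinction for the expected growth and the sign convention (expected minus observed, so that positive scores indicate synergy) are assumed, the function names are hypothetical, and the growth values are made up. Only the truncation at 150 comes from the text.

```python
def expected_growth(y_a, y_b, cap=150.0):
    """Expected combination growth (percent) from two monotherapy growth values.

    ASSUMPTION: if either monotherapy growth is non-positive, the smaller value
    is used; otherwise the truncated values are multiplied (Bliss-type rule).
    """
    if y_a <= 0 or y_b <= 0:
        return min(y_a, y_b)
    # Truncate each monotherapy growth at `cap` (150 in the text) before multiplying.
    return min(y_a, cap) * min(y_b, cap) / 100.0


def combo_score(observed, mono_a, mono_b):
    """Sum of (expected - observed) growth over all concentration pairs (p, q).

    observed : dict {(p, q): combination growth percent}
    mono_a   : dict {p: monotherapy growth percent of drug A}
    mono_b   : dict {q: monotherapy growth percent of drug B}
    ASSUMPTION: sign chosen so that positive scores indicate synergy.
    """
    return sum(
        expected_growth(mono_a[p], mono_b[q]) - y_c
        for (p, q), y_c in observed.items()
    )


# Made-up 2 x 2 example.
mono_a = {1: 80.0, 2: 50.0}
mono_b = {1: 90.0, 2: -10.0}
observed = {(1, 1): 60.0, (1, 2): -20.0, (2, 1): 45.0, (2, 2): -10.0}
score = combo_score(observed, mono_a, mono_b)
```

In this toy case only the entries (1, 1) and (1, 2) deviate from the expected growth, contributing 12 and 10 to the score, respectively.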
Training setup
In order to evaluate the predictive performance and optimize the model parameters under the three prediction scenarios, we performed a 10 × 5 (10 outer folds, 5 inner folds) nested cross-validation procedure. For all the factorization machine models, the rank parameter was optimized over k ∈ {25, 50, 75, 100} and the regularization parameter over β ∈ {10^{2}, 10^{3}, 10^{4}, 10^{5}}. The order of the modeled feature interactions was set to 5 according to the order of the underlying tensor, as a compromise between training time and prediction accuracy. The learning rate was set to 0.001 based on preliminary experiments, and the other parameters were kept at their default values. The number of trees in the random forest model was optimized over {32, 64, 128, 512} and the fraction of features considered when looking for the best split (MaxFeatures) over {0.25, 0.5, 0.75, 1.0}.
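The nested procedure can be sketched as follows. This is a schematic under assumptions, not the study's pipeline: the helper names are hypothetical, the folds are plain random splits, and `evaluate` is a placeholder for training a model on the given indices and returning its error on the held-out indices. The grid values are those stated above.

```python
import random
from itertools import product

RANKS = [25, 50, 75, 100]
BETAS = [1e2, 1e3, 1e4, 1e5]


def k_fold_indices(n, n_folds, seed):
    """Shuffle 0..n-1 and split into n_folds nearly equal test folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[f::n_folds] for f in range(n_folds)]


def nested_cv(n, evaluate, outer=10, inner=5, seed=0):
    """Return one (best_params, outer_test_error) pair per outer fold.

    evaluate(train_idx, test_idx, rank, beta) -> error is a user-supplied
    placeholder that fits a model and returns its held-out error.
    """
    results = []
    for o, test_idx in enumerate(k_fold_indices(n, outer, seed)):
        held_out = set(test_idx)
        train_idx = [i for i in range(n) if i not in held_out]
        inner_folds = k_fold_indices(len(train_idx), inner, seed + o + 1)

        def inner_error(params):
            rank, beta = params
            errs = []
            for fold in inner_folds:
                val = [train_idx[j] for j in fold]
                val_set = set(val)
                fit = [i for i in train_idx if i not in val_set]
                errs.append(evaluate(fit, val, rank, beta))
            return sum(errs) / len(errs)

        # Hyperparameters are chosen on the inner folds only.
        best = min(product(RANKS, BETAS), key=inner_error)
        results.append((best, evaluate(train_idx, test_idx, *best)))
    return results


# Dummy evaluator, for illustration only: pretends rank 50 and beta 1e3 are optimal.
def dummy_evaluate(train_idx, test_idx, rank, beta):
    return abs(rank - 50) / 100 + abs(beta - 1e3) / 1e5


results = nested_cv(n=200, evaluate=dummy_evaluate)
```

The outer test fold is never seen during hyperparameter selection, which is what makes the outer error an unbiased estimate for the selected configuration.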
As each input sample is represented by a single feature vector, in order to take the symmetry of the drug combinations into account, the samples were duplicated such that both of the drugs in a combination were included in both positions in the feature vectors. This informs the algorithm that the combination of drug A with drug B should be considered the same as the combination of drug B with drug A. The prediction accuracy of all the models was assessed using the same performance evaluation metrics: RMSE, Pearson correlation, and Spearman correlation.
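The duplication step described above can be sketched as follows. The tuple layout, the function name, and the response value are illustrative assumptions; the drug and cell line names are used only as labels.

```python
def symmetrize(samples):
    """Duplicate each sample with the two drugs (and their doses) swapped.

    samples: list of (drug_a, drug_b, conc_a, conc_b, cell_line, response).
    """
    mirrored = [(b, a, cb, ca, cell, y) for (a, b, ca, cb, cell, y) in samples]
    return samples + mirrored


# Made-up measurement, for illustration only.
data = [("crizotinib", "bortezomib", 0.1, 0.01, "SR", 42.0)]
augmented = symmetrize(data)
```

Swapping the concentrations together with the drugs keeps each dose attached to its own compound, so both copies describe the same experiment and share the same response.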
Evaluation of the prediction performance
In this type of application, the predictive performance is significantly affected by whether the training and test sets share the different components of the modeled interactions, and it is thus important to reliably quantify the prediction accuracy under practical application scenarios. Therefore, we evaluated the predictive performance of comboFM under three prediction scenarios: (a) new dose–response matrix entry prediction, (b) new dose–response matrix prediction, and (c) new drug combination prediction (cf. Fig. 1). For each scenario, we used a dedicated nested cross-validation setup to ensure unbiased evaluation. In scenario (a), the predictions were made for individual held-out entries in dose–response matrices. The held-out entries were selected at random for each cross-validation fold. In scenario (b), the predictions were made for completely held-out (dose–response matrix, cell line) pairs, such that the same drug combination had still been measured in other cell lines. This scenario corresponds to a widely used strategy in other computational works on drug combination synergy prediction, in which the predictions are made for new drug–drug–cell line triplets. In scenario (c), the most challenging scenario of new drug combination prediction, the predictions were made for novel drug combinations outside the training space with no available combination measurements. In all prediction scenarios, we assumed that the monotherapy responses of the single drugs in the combination are known.
To computationally evaluate the prediction performance and optimize the model parameters, we performed a nested cross-validation procedure. In the first prediction scenario of new dose–response matrix entry prediction, the cross-validation folds were formed by simple random sampling from the tensor entries. In the second prediction scenario concerning new dose–response matrices, the folds were created by random sampling at the level of dose–response matrices, i.e., if a drug pair–cell line triplet (\({x}_{{d}_{1}}\),\({x}_{{d}_{2}},{x}_{c}\)) belonged to the test set, the training tensor did not include any entry involving the triplet (\({x}_{{d}_{1}}\),\({x}_{{d}_{2}},{x}_{c}\)). In the third scenario of new drug combination prediction, the random sampling was performed at the level of drug pairs, and all the entries involving the test drug pairs were held out from the training set, i.e., if a drug pair (\({x}_{{d}_{1}}\),\({x}_{{d}_{2}}\)) belonged to the test set, the training tensor did not contain any entry involving the pair (\({x}_{{d}_{1}}\),\({x}_{{d}_{2}}\)). Furthermore, we ensured that the individual drugs in the left-out drug pairs were still observed in other combinations in the training set, which enables the model to learn from the way the individual drugs in the held-out combinations act in other combinations.
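The three sampling levels differ only in which entries must stay together in a fold, which can be sketched with a grouping key. The function names, the tuple layout, and the toy data are hypothetical, and the sketch does not enforce the additional requirement that each held-out drug still appears in other training combinations.

```python
import random
from collections import defaultdict


def grouped_folds(entries, key, n_folds, seed=0):
    """Assign whole groups of tensor entries to folds.

    entries : list of (drug_1, drug_2, conc_1, conc_2, cell_line) tuples
    key     : function mapping an entry to the group it must stay with
    Returns a list of n_folds lists of entry indices.
    """
    groups = defaultdict(list)
    for i, e in enumerate(entries):
        groups[key(e)].append(i)
    names = sorted(groups)
    random.Random(seed).shuffle(names)
    folds = [[] for _ in range(n_folds)]
    for j, name in enumerate(names):
        folds[j % n_folds].extend(groups[name])
    return folds


def pair(e):
    """Order-independent drug pair, so (A, B) and (B, A) share a group."""
    return tuple(sorted((e[0], e[1])))


SCENARIO_KEYS = {
    "a_entry": lambda e: e,                  # single tensor entries
    "b_matrix": lambda e: (pair(e), e[4]),   # whole matrix: drug pair + cell line
    "c_combination": pair,                   # drug pair across all cell lines
}

# Toy tensor: 4 drug pairs x 2 cell lines x 2 x 2 concentrations.
entries = [
    (d1, d2, c1, c2, cell)
    for d1, d2 in [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D")]
    for cell in ["cell1", "cell2"]
    for c1 in (1, 2)
    for c2 in (1, 2)
]
folds_b = grouped_folds(entries, SCENARIO_KEYS["b_matrix"], n_folds=2)
folds_c = grouped_folds(entries, SCENARIO_KEYS["c_combination"], n_folds=2)
```

With the scenario (c) key, no drug pair is ever split between training and test folds, whereas with the scenario (b) key the same pair may appear in training for a different cell line.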
Drug combination anticancer activity dataset
The drug combination anticancer activity dataset was obtained from a recent NCI-ALMANAC study^{4}, which is the largest available drug combination dataset to date. The original dataset covers over 5000 combinations of roughly 100 small-molecule drugs screened against 60 cell lines in various concentrations, containing over 3 million response measurements. The drugs included in the dataset are FDA-approved oncology drugs with proven activity and established safety profiles. The cell lines represent human tumor cell lines from the NCI-60 panel, originating from 9 different tissue types.
To reduce the computational complexity, we selected a subset of the NCI-ALMANAC dataset by randomly sampling 50 drugs (Supplementary Table 3) from the original set of drugs, ensuring that the distribution of the drug combination responses in the subset matched that of the original dataset. Furthermore, we selected drug combinations for which complete measurements across all 60 cell lines were available. As a result, we obtained a dataset for our experiments consisting of 617 drug combinations of 50 unique drugs, screened in 45 unique concentrations against 60 cell lines, containing 333,180 response measurements for combinations and 222,120 measurements for monotherapies, measured as the percentage growth of the cell line with respect to a control. Each drug combination in the dataset had been screened using a 4 × 4 dose–response matrix design.
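The reported counts are consistent with the following reading of the 4 × 4 design, which is an inference from the numbers rather than something stated explicitly in the text: each matrix contains 3 × 3 = 9 entries in which both drugs are present, 3 + 3 = 6 monotherapy entries, and one untreated control entry. Under that assumption, the 617 × 60 = 37,020 dose–response matrices give

$$617\times 60\times 9=333{,}180\qquad {\rm{and}}\qquad 617\times 60\times 6=222{,}120$$

combination and monotherapy measurements, respectively, matching the figures above.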
Data representation
Defining an informative input feature representation of the underlying data is essential to take full advantage of comboFM and FMs in general. With appropriately defined input features, FMs have been shown to have the representation power to encompass a variety of matrix and tensor factorization models, from standard models to more specialized ones^{21,22}. Hence, by learning FMs, all the subsumed factorization models can also be learned.
In order to represent the structure of the tensor underlying the drug combination response data as single input feature vectors, one-hot encoding is used. Here, the input feature vectors x are divided into five different groups corresponding to the different modes of the tensor: two sets of drugs, their concentrations, and a cell line. In each group, exactly one value is set to 1 and the rest to 0, with 1 denoting the instance that is present in the corresponding interaction.
As the feature vector is nonzero only for the pair of drugs, drug concentrations, and cell line present in the corresponding interaction, all the other interactions in the FM model vanish and the model corresponds to standard factorization models involving categorical variables. However, whereas standard factorization models are limited to categorical input data only, comboFM and FMs can also incorporate auxiliary features in addition to the information of the interacting elements, which can further aid the prediction task, particularly when making predictions outside the training space. In this work, we used chemical descriptors of molecules and genomic descriptors of cell lines (see below for details).
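The five-group one-hot layout can be sketched as follows. The function names, the ordering of the groups within the vector, and the toy vocabularies are assumptions made for illustration; auxiliary descriptors would be appended to the same vector.

```python
def one_hot(value, vocabulary):
    """Binary indicator vector with a single 1 at the position of `value`."""
    return [1 if v == value else 0 for v in vocabulary]


def encode_entry(drug_1, drug_2, conc_1, conc_2, cell_line,
                 drugs, concentrations, cell_lines):
    """Concatenate five one-hot groups: drug 1, drug 2, conc 1, conc 2, cell line."""
    return (
        one_hot(drug_1, drugs)
        + one_hot(drug_2, drugs)
        + one_hot(conc_1, concentrations)
        + one_hot(conc_2, concentrations)
        + one_hot(cell_line, cell_lines)
    )


# Toy vocabularies (hypothetical labels).
drugs = ["A", "B", "C"]
concentrations = [0.1, 1.0]
cell_lines = ["cell1", "cell2"]
x = encode_entry("A", "C", 1.0, 0.1, "cell2", drugs, concentrations, cell_lines)
```

Because exactly five features are non-zero, only the interaction terms among those five features contribute to the prediction for this entry.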
Chemical descriptors
As chemical descriptors, we integrated molecular fingerprints, binary vectors designed to represent the structure of a molecule as a series of bits, each one representing the presence or absence of a particular substructure. We selected a popular fingerprint of type ‘estate’, consisting of 79 bits corresponding to the E-State atom types originally defined by Hall and Kier^{45}, obtained from the rcdk R package^{46}. Fingerprint bits with zero variance across the dataset were removed, leaving 34 bits for the two sets of drugs.
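The zero-variance filtering step can be sketched as follows; the function name and the toy fingerprints are hypothetical, and fingerprint computation itself (done with rcdk in the study) is not shown.

```python
def drop_constant_bits(fingerprints):
    """Remove bit positions that take the same value for every drug.

    fingerprints: dict {drug_name: list of 0/1 bits, all of the same length}.
    Returns (filtered fingerprints, kept bit positions).
    """
    rows = list(fingerprints.values())
    n_bits = len(rows[0])
    # A bit has zero variance exactly when only one distinct value occurs.
    keep = [j for j in range(n_bits) if len({row[j] for row in rows}) > 1]
    filtered = {name: [bits[j] for j in keep] for name, bits in fingerprints.items()}
    return filtered, keep


# Toy 4-bit fingerprints: bit 0 is always 1 and bit 2 is always 0.
toy = {"A": [1, 0, 0, 1], "B": [1, 1, 0, 0], "C": [1, 0, 0, 0]}
filtered, kept = drop_constant_bits(toy)
```

Constant bits carry no information for distinguishing the drugs in the dataset, so removing them shrinks the feature vector without changing what the model can learn.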
Genomic descriptors
As genomic descriptors, we incorporated gene expression profiles of the cancer cell lines, obtained from the rcellminer R package^{47}. The gene expression profiles were measured with five different platforms (four Affymetrix arrays and an Agilent Whole Human Genome Oligo array), and a combined average z-score was reported as the combined expression value for each gene. To reduce the dimensionality of the resulting feature matrix, we selected the 0.5% of the genes with the highest variance across the samples, resulting in 78 gene expression values for each cell line.
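Variance-based gene selection of this kind can be sketched as follows. The function name, the use of the population variance, the rounding rule, and the toy values are assumptions; the study's exact procedure may differ in such details.

```python
from statistics import pvariance


def top_variance_genes(expression, fraction):
    """Return the names of the highest-variance genes.

    expression : dict {gene: list of expression values across cell lines}
    fraction   : share of genes to keep (0.005 would correspond to 0.5%)
    """
    n_keep = max(1, round(fraction * len(expression)))
    ranked = sorted(expression, key=lambda g: pvariance(expression[g]), reverse=True)
    return ranked[:n_keep]


# Toy z-scores for four genes across three cell lines (made-up values).
toy = {
    "g1": [0.0, 0.1, -0.1],
    "g2": [2.0, -2.0, 0.0],
    "g3": [1.0, 1.0, 1.0],
    "g4": [0.5, -0.5, 0.0],
}
selected = top_variance_genes(toy, fraction=0.5)
```

Genes whose expression barely changes across cell lines, such as g3 in the toy data, cannot help distinguish cell contexts and are the first to be dropped.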
Cell lines
Early-passage cell lines purchased from ATCC (HS578T & Malme-3M) and the NCI-Frederick DCTD tumor/cell line repository (SR & IGROV1) were used for drug combination screening. The cell lines were maintained at 37 °C with 5% CO_{2} in a humidified incubator in their respective media (see Supplementary Table 4a). All the reagents were purchased from ThermoFisher Scientific. All the cell lines tested negative for mycoplasma. The test was based on the method described by Choppa et al.^{48} and was performed as a service by the sample management laboratory of THL Biobank, Helsinki, Finland.
Drug combination screening
The experimental design for drug combination testing was adopted from Gautam et al.^{49}. Seven different concentrations in log3-fold dilution of two drugs were combined with each other in an 8 × 8 matrix format. Please refer to Supplementary Tables 4b and c for the drug information and combination design, respectively. The compounds were plated on black clear-bottom 384-well plates (Corning #3764) using an Echo 550 Liquid Handler (Labcyte). 100 μM benzethonium chloride (BzCl2) and 0.1% dimethyl sulfoxide (DMSO) were used as positive and negative controls, respectively. All subsequent liquid handling was performed using a MultiFlo FX multimode dispenser (BioTek). The pre-dispensed compounds were dissolved in 5 μl of culture medium and left on a plate shaker at room temperature for 30 min. Twenty microliters of cell suspension (please refer to Supplementary Table 4a for cell line-specific seeding densities) were dispensed into the drugged plates. After 72 h of incubation, 25 μl per well of CellTiter-Glo (Promega) reagent was added, and after 10 min of incubation at room temperature, luminescence (cell viability) was measured using a PheraStar plate reader (BMG Labtech).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The NCI-ALMANAC dataset is publicly available from the National Cancer Institute (NCI) at https://wiki.nci.nih.gov/display/NCIDTPdata/NCI-ALMANAC. The preprocessed data used in the computational experiments and the in-house drug combination testing data for validating comboFM predictions are available at https://doi.org/10.5281/zenodo.4135059. Source data underlying the figures and display items are provided at https://doi.org/10.5281/zenodo.4135059 in the subdirectory source_data.
Code availability
The code is available at https://doi.org/10.5281/zenodo.4129688.
References
1. Al-Lazikani, B., Banerji, U. & Workman, P. Combinatorial drug therapy for cancer in the post-genomic era. Nat. Biotechnol. 30, 679 (2012).
2. Masui, K. et al. A tale of two approaches: complementary mechanisms of cytotoxic and targeted therapy resistance may inform next-generation cancer treatments. Carcinogenesis 34, 725–738 (2013).
3. Lehár, J. et al. Synergistic drug combinations tend to improve therapeutically relevant selectivity. Nat. Biotechnol. 27, 659–666 (2009).
4. Holbeck, S. L. et al. The National Cancer Institute ALMANAC: a comprehensive screening resource for the detection of anticancer drug pairs with enhanced therapeutic activity. Cancer Res. 77, 3564–3576 (2017).
5. O’Neil, J. et al. An unbiased oncology compound screen to identify novel combination strategies. Mol. Cancer Therapeutics 15, 1155–1162 (2016).
6. Day, D. & Siu, L. L. Approaches to modernize the combination drug development paradigm. Genome Med. 8, 115 (2016).
7. Bulusu, K. C. et al. Modelling of compound combination effects and applications to efficacy and toxicity: state-of-the-art, challenges and perspectives. Drug Discov. Today 21, 225–238 (2016).
8. Ali, S., Tonekaboni, M., Ghoraie, L. S., Satya Kumar Manem, V. & Haibe-Kains, B. Predictive approaches for drug combination discovery in cancer. Brief. Bioinforma. 19, 263–276 (2018).
9. Rampášek, L., Hidru, D., Smirnov, P. & Goldenberg, A. Dr.VAE: improving drug response prediction via modeling of drug perturbation effects. Bioinformatics 35, 3743–3751 (2019).
10. Paltun, B. G., Mamitsuka, H. & Kaski, S. Improving drug response prediction by integrating multiple data sources: matrix factorization, kernel and network-based approaches. Brief Bioinform. https://doi.org/10.1093/bib/bbz153 (2019).
11. Cichonska, A. et al. Learning with multiple pairwise kernels for drug bioactivity prediction. Bioinformatics 34, i509–i518 (2018).
12. Cichonska, A. et al. Computational-experimental approach to drug-target interaction mapping: a case study on kinase inhibitors. PLoS Computational Biol. 13, e1005678 (2017).
13. Costello, J. C. et al. A community effort to assess and improve drug sensitivity prediction algorithms. Nat. Biotechnol. 32, 1202 (2014).
14. Gertrudes, J. C. et al. Machine learning techniques and drug design. Curr. Medicinal Chem. 19, 4289–4297 (2012).
15. Lavecchia, A. Machine-learning approaches in drug discovery: methods and applications. Drug Discov. Today 20, 318–331 (2015).
16. Griner, L. A. M. et al. High-throughput combinatorial screening identifies drugs that cooperate with ibrutinib to kill activated B-cell-like diffuse large B-cell lymphoma cells. Proc. Natl Acad. Sci. USA 111, 2349–2354 (2014).
17. Sidorov, P., Naulaerts, S., Ariey-Bonnet, J., Pasquier, E. & Ballester, P. Predicting synergism of cancer drug combinations using NCI-ALMANAC data. Front. Chem. 7, 509 (2019).
18. Bansal, M. et al. A community computational challenge to predict the activity of pairs of compounds. Nat. Biotechnol. 32, 1213 (2014).
19. Menden, M. P. et al. Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen. Nat. Commun. 10, 2674 (2019).
20. Blondel, M., Fujino, A., Ueda, N. & Ishihata, M. Higher-order factorization machines. In Advances in Neural Information Processing Systems, 3351–3359 (2016).
21. Rendle, S. Factorization machines. In 2010 IEEE International Conference on Data Mining, 995–1000 (IEEE, 2010).
22. Rendle, S. Factorization machines with libFM. ACM Trans. Intell. Syst. Technol. (TIST) 3, 57 (2012).
23. Shoemaker, R. H. The NCI60 human tumour cell line anticancer drug screen. Nat. Rev. Cancer 6, 813–823 (2006).
24. Li, H., Li, T., Quang, D. & Guan, Y. Network propagation predicts drug synergy in cancers. Cancer Res. 78, 5446–5457 (2018).
25. Jeon, M., Kim, S., Park, S., Lee, H. & Kang, J. In silico drug combination discovery for personalized cancer therapy. BMC Syst. Biol. 12, 16 (2018).
26. Gayvert, K. M. et al. A computational approach for identifying synergistic drug combinations. PLoS Comput. Biol. 13, e1005308 (2017).
27. Wildenhain, J. et al. Prediction of synergism from chemical-genetic interactions by machine learning. Cell Syst. 1, 383–395 (2015).
28. Chen, L. et al. Prediction of effective drug combinations by chemical interaction, protein interaction and target enrichment of KEGG pathways. BioMed Res. Int. 2013, 723780 (2013).
29. Morris, S. W. et al. Fusion of a kinase gene, ALK, to a nucleolar protein gene, NPM, in non-Hodgkin’s lymphoma. Science 263, 1281–1284 (1994).
30. Vlot, A. H. C., Aniceto, N., Menden, M. P., Ulrich-Merzenich, G. & Bender, A. Applying synergy metrics to combination screening data: agreements, disagreements and pitfalls. Drug Discov. Today 24, 2286–2298 (2019).
31. Palmer, A. C. & Sorger, P. K. Combination cancer therapy can confer benefit via patient-to-patient variability without drug additivity or synergy. Cell 171, 1678–1691 (2017).
32. Boshuizen, J. & Peeper, D. S. Rational cancer treatment combinations: an urgent clinical need. Mol. Cell 78, 1002–1018 (2020).
33. Grazia, G., Penna, I., Perotti, V., Anichini, A. & Tassi, E. Towards combinatorial targeted therapy in melanoma: from preclinical evidence to clinical application. Int. J. Oncol. 45, 929–949 (2014).
34. Suraweera, A., O’Byrne, K. J. & Richard, D. J. Combination therapy with histone deacetylase inhibitors (HDACi) for the treatment of cancer: achieving the full therapeutic potential of HDACi. Front. Oncol. 8, 92 (2018).
35. Haas, N. B. et al. Phase II trial of vorinostat in advanced melanoma. Invest. New Drugs 32, 526–534 (2014).
36. Ruzzolini, J. et al. Everolimus selectively targets vemurafenib resistant BRAFV600E melanoma cells adapted to low pH. Cancer Lett. 408, 43–54 (2017).
37. Pak, E. et al. A large-scale drug screen identifies selective inhibitors of class I HDACs as a potential therapeutic option for SHH medulloblastoma. Neuro. Oncol. 21, 1150–1163 (2019).
38. Gerner, R. E., Moore, G. E. & Didolkar, M. S. Chemotherapy of disseminated malignant melanoma with dimethyl triazeno imidazole carboxamide and dactinomycin. Cancer 32, 756–760 (1973).
39. Rocca, A. et al. A phase I–II study of the histone deacetylase inhibitor valproic acid plus chemoimmunotherapy in patients with advanced melanoma. Br. J. Cancer 100, 28–36 (2009).
40. Tyner, J. W. et al. Functional genomic landscape of acute myeloid leukaemia. Nature 562, 526–531 (2018).
41. Friedman, A. A., Letai, A., Fisher, D. E. & Flaherty, K. T. Precision medicine for cancer with next-generation functional diagnostics. Nat. Rev. Cancer 15, 747–756 (2015).
42. Ianevski, A. et al. Prediction of drug combination effects with a minimal set of experiments. Nat. Mach. Intell. 1, 568–577 (2019).
43. Haibe-Kains, B. et al. Inconsistency in large pharmacogenomic studies. Nature 504, 389–393 (2013).
44. Trofimov, M. & Novikov, A. TFFM: TensorFlow implementation of an arbitrary order factorization machine. https://github.com/geffy/tffm (2016).
45. Hall, L. H. & Kier, L. B. The molecular connectivity chi indexes and kappa shape indexes in structure-property modeling. Rev. Comput. Chem. 367–422 (1991).
46. Guha, R. et al. Chemical informatics functionality in R. J. Stat. Softw. 18, 1–16 (2007).
47. Luna, A. et al. rcellminer: exploring molecular profiles and drug response of the NCI-60 cell lines in R. Bioinformatics 32, 1272–1274 (2015).
48. Choppa, P. C., Vojdani, A., Tagle, C., Andrin, R. & Magtoto, L. Multiplex PCR for the detection of Mycoplasma fermentans, M. hominis and M. penetrans in cell cultures and blood samples of patients with chronic fatigue syndrome. Mol. Cell. Probes 12, 301–308 (1998).
49. Gautam, P. et al. Identification of selective cytotoxic and synthetic lethal drug responses in triple negative breast cancer cells. Mol. Cancer 15, 34 (2016).
Acknowledgements
This work was supported by the Academy of Finland [ICT2023 programme grants 313268 to J.R.; 313266 to T.P. and 313267 to T.A. and grants 292611, 310507, 326238 to T.A.], the Cancer Society of Finland [T.A.], the Sigrid Jusélius Foundation [T.A.], and Orion Research Foundation sr [P.G.]. The authors thank the FIMM HTB unit and especially Laura Turunen for their great help with the drug combination assays and Aleksandr Ianevski for his great help with the synergy scoring and the background distribution data for Fig. 4. The authors also acknowledge the computational resources provided by the Aalto Science-IT project as well as CSC - IT Center for Science, Finland.
Author information
Contributions
H.J., T.A., A.C., S.S., T.P., and J.R. designed the research. H.J., A.C., T.P., J.R., and S.S. developed computational methods and evaluation protocols. A.C., J.D. and H.J. performed computational evaluations. P.G. designed and performed experimental evaluation. H.J., A.C., P.G., J.R., and T.A. wrote the paper, contributed by T.P. and S.S.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Nature Communications thanks Krishna Bulusu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Julkunen, H., Cichonska, A., Gautam, P. et al. Leveraging multi-way interactions for systematic prediction of pre-clinical drug combination effects. Nat Commun 11, 6136 (2020). https://doi.org/10.1038/s41467-020-19950-z