Bipartite network models to design combination therapies in acute myeloid leukaemia

Jafari, Mohieddin; Mirzaie, Mehdi; Bao, Jie; Barneh, Farnaz; Zheng, Shuyu; Eriksson, Johanna; Heckman, Caroline A.; Tang, Jing

doi:10.1038/s41467-022-29793-5

Download PDF

Article
Open access
Published: 19 April 2022

Bipartite network models to design combination therapies in acute myeloid leukaemia

Nature Communications volume 13, Article number: 2128 (2022) Cite this article

4299 Accesses
16 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Combination therapy is preferred over single-targeted monotherapies for cancer treatment due to its efficiency and safety. However, identifying effective drug combinations costs time and resources. We propose a method for identifying potential drug combinations by bipartite network modelling of patient-related drug response data, specifically the Beat AML dataset. The median of cell viability is used as a drug potency measurement to reconstruct a weighted bipartite network, model drug-biological sample interactions, and find the clusters of nodes inside two projected networks. Then, the clustering results are leveraged to discover effective multi-targeted drug combinations, which are also supported by more evidence using GDSC and ALMANAC databases. The potency and synergy levels of selective drug combinations are corroborated against monotherapy in three cell lines for acute myeloid leukaemia in vitro. In this study, we introduce a nominal data mining approach to improving acute myeloid leukaemia treatment through combinatorial therapy.

Designing patient-oriented combination therapies for acute myeloid leukemia based on efficacy/toxicity integration and bipartite network modeling

Article Open access 01 March 2024

Network-driven cancer cell avatars for combination discovery and biomarker identification for DNA damage response inhibitors

Article Open access 21 June 2024

A network-based trans-omics approach for predicting synergistic drug combinations

Article Open access 29 July 2024

Introduction

Studies on cases with advanced cancers have shown that less than 10% of patients have actionable mutations, and the improvement of outcomes is unobserved in a randomised trial of precision medicine based on genomic profiles¹. The current limitation of genomics-centric personalised medicine falls short of the enormous heterogeneity and lack of actionable and sustainable treatment options. With a few exceptions, patient genomic signatures with clinical pathology do not typically predict drug responses. More precisely, cancer can principally be considered a signalling disease, not a genetic disease. There is a wealth of data that has validated this hypothesis, including signalling behaviours involved in growth factor and nutrient responses, the process of entering and exiting the cell cycle, ensuring that chromosomes are segregated in an orderly, efficient and accurate manner during mitosis, and apoptosis^2,3. On the other hand, the complexity of crosstalk between signalling pathways necessitates to modify multiple targets in cancer cells; otherwise, a lack of complete response, resistance, and relapse will emerge during the course of treatment.

Despite the fact that large amounts of small molecules or drugs have been tested on many cell lines or patient-derived samples, using single drugs as monotherapies to cure cancer might not be a promising strategy, as it is known that the complex interactions of various biochemical components can induce drug resistance during the treatment of cancer^4,5,6. As a matter of fact, monotherapy, the slogan of one target one drug, is inefficient in curing complex diseases, such as cancer^7,8. Combination therapy or polytherapy with synergistic drugs may achieve a more effective and safer outcome by targeting several targets in the same or separate pathways of the complex system^4,9. To better identify the synergistic drug combination based on precision medicine, we need ex vivo drug screening to decipher the functional impact of cancer genomics at the phenotypic level and to understand their interactions in the context of biological networks^10,11,12. Therefore, understanding network biology may provide a unique opportunity to leverage the rich source of drug response data to offer network-based models for combinatorial therapy. These network models have shown promise for developing clinical decision support tools to discriminate functional patient subclasses^13,14. Even though there are networks reconstructed to model biological mechanisms of diseases and predict drug combination synergies based on molecular data^15,16,17,18, network models have not been systematically applied to patient data, such as the drug response data of patient-derived samples, to predict patient-customised drug combinations¹⁴. Instead, the ex vivo drug response data are straightforwardly translated into the clinic for patient treatment since these individualised experiments represent the efficiency of some approved drugs on patient-derived primary cultures^19,20.

In 2018, the Beat AML programme reported a cohort of 672 tumour specimens collected from 531 patients, analysing the ex vivo sensitivity for 122 drugs alongside the mutational status and the gene expression signatures of the samples²¹. Despite the dearth of large patient-related drug response datasets, some large cell line–based datasets, such as genomics of drug sensitivity in cancer (GDSC) and ALMANAC, can offer a strong source of supporting evidence for predictions. The GDSC database contains the responses of 1001 cancer cell lines to 265 anti-cancer drugs, providing a rich source of information to connect genotypes with cellular phenotypes and to identify cancer-specific therapeutic options²². The largest publicly accessible dataset for cancer combination drugs, such as ALMANAC, was recently published by the U.S. National Cancer Institute. This data collection contained more than 5,000 combinations of 104 investigational and licensed drugs, with synergies calculated against 60 cancer cell lines, resulting in more than 290,000 synergy scores²³. Moreover, DrugComb (https://drugcomb.org/), a web-based portal for storing and studying drug combination screening datasets, offers a comprehensive visualisation of drug combination susceptibility and synergy, which can significantly aid in the understanding of drug interactions at unique dosage levels. Drugcomb now has 751,498 drug combinations and 717,684 single drug screens from 37 trials, which relate to 2040 cell lines and 216 cancer forms²⁴.

In this work, we develop a network pharmacology approach to predict potential drug combinations for acute myeloid leukaemia (AML) based on the Beat AML dataset. We propose a drug combination strategy using bipartite network modelling of ex vivo drug screening data. By ex vivo drug response data, we directly access the individual phenotypes of the patients’ cancer cells, and by network modelling, we demonstrate the similarity of drugs and AML patients. Then, we use the community structures within the drug-based bipartite networks to discover effective multi-targeted drug combination regimens (Fig. 1). Our predicted drug combinations are only suggested regarding the phenotypic interactions of the cancer cells or patient samples with the drugs without prior understanding of the genetic origin or molecular understanding of the disease.

Results

Defining the edge weight of bipartite networks

In the Beat AML dataset, a set of 122 inhibitor drugs was used against 531 patient-derived AML samples. The spectra of low to high potency of drugs were observed across the patient-derived samples. However, this panel of small molecule inhibitors was selected according to their activity against the proteins involved in tyrosine and non-tyrosine kinase pathways, particularly for AML²¹. First, we determined the weight value of the drug–sample interaction to be used in the bipartite network reconstruction. This value should describe the most potent compounds for inhibiting tumour cells regarding the drug sensitivity analysis. Adding to the relative and absolute IC_50, RI value²⁵, and AUC, we calculated the median cell viability in the drug response experiments. The distribution of these measures was evaluated in terms of normality, skewness, and modality (Fig. 2) to choose the best measure as a weight in the bipartite network. The relationship of median to AUC was a high positive value (with the highest r Pearson correlation coefficient ~ 0.94). The distribution of medians was unimodal in contrast to IC₅₀ distributions, homoscedastic contrary to RI distribution, and more symmetric (non-skewed) compared to AUC distribution. In addition to investigating the linear relationship, that is, Pearson correlation analysis, we computed MIC, which measures the relationship strength, and MEV to check the closeness of the relationship to being a function. Interestingly, the relationship between median and AUC displayed higher MAS and MEV (~0.75) compared with the relationship of RI and AUC, meaning that median has a stronger association with AUC. Therefore, we have chosen the inverse of the actual median as the weight of the drug–patient interaction.

**Fig. 2: Comparison of different measures for drug response experiments in the Beat AML study.**

Analysis of bipartite networks

Furthermore, the full square submatrix (with no missing entries) of patient samples and small molecules was used as the incidence matrix of the bipartite network. Specifically, we selected the list-wise deletion strategy to remove missing values, and we used the complete cases of both variables. The downstream analysis was done on an undirected weighted bigraph comprising 176 (88 + 88) nodes and 7744 edges (Fig. 3A). The distribution of the min–max normalised edge weights indicated positive skewness, indicating that the cells were not highly sensitive to most drugs. All the performed analyses were also carried out for the GDSC dataset as proof of concept. The undirected weighted bigraph of the GDSC dataset comprised 532 (266 + 266) nodes and 70,756 edges (Fig. 3C). The distribution of the min–max normalised edge weights showed positive skewness in this dataset as well (Fig. 3D), again indicating low potency for most of the drugs. Therefore, exploring the best combination is not straightforward, and categorising drug–sample interactions seems to be required. Following the projection of these bigraphs as outlined in Fig. 3, two projected graphs, the patient similarity network (PSN) and drug similarity network (DSN), were reconstructed, such that each edge was obtained by multiplying the weighted incidence matrix. Thus, the edge weights of the projected graphs indicate the profile similarities of patient samples in PSN and small molecule inhibitors in DSN. Note that the edge weight values in DSNs and PSNs differ due to the different matrix multiplications.

**Fig. 3: Bigraphs of cancer datasets.**

The PSN and DSN of the Beat AML dataset contained 88 nodes and 3828 edges (Fig. 4), while in the GDSC projected similarity networks, there were 266 nodes and 35,378 edges. In Fig. 4, the larger the node size, the more sensitive patient-derived samples and the more potent drugs. In this subset of the Beat AML dataset, without missing data, patient 16–00627 was found to be the most sensitive and SNS-032 was the most potent inhibitor (See Supplementary Fig. 1). The community detection was subsequently done for both similarity networks via modularity score optimisation, resulting in two communities for DSN with 50 and 38 small molecules, and two communities for PSN with 39 and 49 patient samples. Alternatively, we identified two clusters of patients with distinctive drug response profiles, suggesting two subcategories of the disease. Also, we detected two clusters of small molecules, which pointed to disparate inhibiting patterns on the patient samples. In the following steps, we presented evidence of the consistency of cluster members in both networks using prior knowledge.

**Fig. 4: DSN and PSN of the Beat AML dataset.**

Intra-cluster homogeneity analysis of similarity networks

Drug similarity network

Focusing on small molecules, we presumed that inhibitory molecules with correlated effects on cell survival tended to have similar structures, purposes, and functions^26,27,28,29. Therefore, we evaluated the similarity of SMILES structures, the analogy of protein targets, and the biological pathways of the detected clusters in the DSNs against random groupings of molecules. The distribution of the Dice similarity of the SMILES structures differed significantly between the random grouping and the clusters based on network topology (Fig. 5A). The statistical test of the median difference also resulted in the lowest p-values for both the pairwise two-sample Wilcoxon and Kruskal–Wallis rank sum test (adjusted p-value < 2e − 16). Evaluating their target similarities, we explored the protein targets of the small molecule inhibitors and examined the number of target intersections of small molecule pairs within the clusters. In this analysis, DTC and OmniPath were applied to explore the binding targets of small molecules and second-order node neighbours (secondary targets) in the signalling network, respectively. Assuming that proteins usually correspond to multiple signalling pathways, the KEGG database was used to check the number of pathway intersections of the protein targets for each pair of small molecules. The median similarity measures of the intersections within the network clusters significantly exceeded those for a large set of random pairs of small molecules (Fig. 5B–D) (adjusted p-value < 2.2e − 16, Kruskal–Wallis rank sum test). Comparable findings were obtained from the analysis of the GDSC dataset (Fig. 5E–H) (adjusted p-value < 2.2e − 16, Kruskal–Wallis rank sum test), suggesting that our method is also reproducible for the analysis of cell line-based datasets.

**Fig. 5: Beat AML and GDSC intra-cluster homogeneity analysis.**

Patient and cell-line similarity network

Next, we examined the member consistency of the patient clusters in the PSN using other available data from patient samples in the Beat AML dataset. The gene expression data, including the RPKM and CPM of the samples, were utilised to check the pairwise similarity of the cluster members. The similarity measures were also computed for a large set of random pairs of patient samples to compare with our patient stratification using network clustering. When we compared the harmonic mean similarities of the RPKM values, the pairwise similarities of patients within the clusters significantly exceeded those of the randomly selected patients (adjusted p-value < 2.2e − 16, Kruskal–Wallis rank sum test) (Fig. 5I). For the CPM dataset, the distributions of Jaccard distance were shown, where the distances within the clusters were statistically lower than those in the random group (adjusted p-value = 4.655e − 05, Kruskal–Wallis rank sum test) (Fig. 5J). For the GDSC dataset, we used the expression profiles of signature genes provided by the SPEED platform³⁰. Then, differentially expressed genes were used to provide gene signatures of perturbed cancer-related pathways. In this dataset, there were 11 activity scores to represent the activity levels of 11 well-known pathways for each cell line. Therefore, we compared the distance distributions of the cell line pairs in the clusters to a set of random pairs of cell lines. Our findings indicated that the distances within the clusters were much lower than those in the random grouping (adjusted p-value = 6.94e − 08, Kruskal–Wallis rank sum test) (Fig. 5K).

The Beat AML study also provided the mutational landscape in AML. Here, we used a dataset of non-benign gene mutations to characterise both clusters of patient samples. As shown in Fig. 6, both clusters of patients demonstrate a distinct profile of gene mutations regarding the involved genes and the ranks of genes based on frequency. Previously, Tyner et al. highlighted the importance of TP53 and ASXL gene mutations, both responsible for the broad drug resistance patterns²¹. They further showed that mutations in certain genes may identify disease subgroups sensitive to certain inhibitors. For example, they found that patients with FLT3-ITD and NPM1 mutations were sensitive to SYK inhibitors. Interestingly, our molecular-independent network-based approach to characterise patient samples also captured the significance of the mutations above. Furthermore, our findings indicate that TP53, DNMT3A, and NRAS were the most frequently mutated genes in one of the patient clusters, while TET2 and NPM1 were the most frequently mutated genes in the other cluster, along with the FLT3-ITD mutation. These results suggest that the phenotype-level information in drug response data can corroborate the genotype-level information to stratify patients more effectively.

**Fig. 6: The frequently mutated genes in the clusters of Beat AML patients.**

Inter-cluster design strategy for drug combinations

We assumed that the best drug combination strategy was the selection of one drug from each cluster to block potential drug resistance mechanisms and cancer recurrence. A common drug combination design could be the use of the most effective drugs of each cluster to inhibit cancer cells more effectively. However, other pharmacologic evidence can encourage the choice of the best combination of drugs more specifically. As the focus in drug combination studies also lies in finding the most synergistic drug combinations, previously reported studies were used to explore the synergy values (i.e., the degree of interactions) of drug combinations^31,32,33,34. First, we checked if the combinations of the top five drugs (based on the median values of cell viability) of each cluster in the Beat AML and GDSC datasets (Table 1) were found in the DrugComb database. However, there were no reports regarding the 25 possible combinations of these drugs, so we aimed to compare the average synergistic values for these 10 drugs in the whole database. Figure 7 shows the distributions of synergy values in DrugComb, highlighting the mean of the synergy of the bottom and top five drugs in each network cluster. This analysis revealed the reasonably high potential of the combinations of the top five drugs according to the average median values in both Beat AML and GDSC datasets (p-value = 2.96e − 02 and p-value = 3.56e − 02, Wilcox rank sum test, respectively).

Table 1 Top five small molecules in each cluster of DSNs.

Full size table

**Fig. 7: Distribution of drug combination synergy scores in the DrugComb database.**

Synergy analysis of the inter-cluster combination of drugs

For further validation of our strategy for predicting synergistic drug combinations using network modelling, we focused on the ALMANAC dataset²³, which has 1,892,650 combinations of 103 inhibitors tested on 60 cell lines. The same procedure as described in Fig. 1 was implemented to extract the drug modules in the DSN according to the available single drug experiments in this dataset. The median inhibition values of the single-drug responses on cell lines were used as weight values in the bipartite drug-cell line network. Using the projection of the weighted DSN, clusters of drugs with similar effect profiles on cell lines were extracted.

According to our predefined assumption, the combinations of drugs from different clusters were used as the positive group and the combinations of drugs within the clusters as the negative group. Then, we retrieved the synergy and sensitivity scores of the combinations for both groups using the DrugComb computed values, especially the highest single agent (HSA), zero-interaction potency (ZIP), Bliss, Loewe, combinational sensitivity score (CSS), and S synergy. Figure 8A shows that the positive group of drug combinations exhibited a significantly higher value of drug synergy than the negative group. This result was evident for all types of synergy measures, indicating the superiority of the strategy of using inter-cluster drug combinations. These data also indicate the efficiency of our proposed network-based modelling to discern drugs with similar profile effects on biological samples. Also, our proposed strategy of drug combination using the drugs of contrary clusters is more likely to acquire higher drug synergy and potency.

**Fig. 8: Synergy of drug combinations.**

High-throughput drug screening for the proposed drug combinations in AML cell lines

To further demonstrate the ability of our model in predicting specific and robust drug combinations, experimental corroboration was conducted on a subset of 45 drug combinations for 3 AML cell lines, MOLM-16, OCI-AML3, and NOMO-1. Also, 25 out of 45 drug combinations originated from the top five drugs of the two clusters as the positive group, where higher synergy was predicted by our model, while others were the combinations of the top five drugs within each cluster, which transformed into 20 combinations as the negative group. The findings of the experimental validation of 135 drug-drug-cell line triplets are depicted in Fig. 8B using the ZIP, Bliss, HSA, and Loewe models to assess the degree of synergy. The drug combinations predicted by our model in the positive group were validated as more synergistic when considering positive scores as evidence of synergy degree (Fig. 9 and Supplementary Fig. 2). These findings were statistically more significant when using Bliss or HSA measures. These cell lines were chosen based on their genetic backgrounds and to represent a wide range of genetic variations in the Beat AML dataset’s ex vivo models. We did correlation analysis of the ten (top five drugs of two clusters) selected single drug response between ex vivo model and three cell lines to illustrate the extrapolation of drug sensitivity studies in cell lines for our prediction on ex vivo models. The majority of patient-derived samples were highly correlated with these three cell lines, according to our findings (Supplementary Fig. 3). Overall, these results demonstrate the robustness of network-based predictions across various experimental setups and synergy scoring models, and the ability of our network-based model to detect new combinations of treatments.

**Fig. 9: The top synergistic drug combinations identified in the positive group.**

Discussion

The availability of single-drug response datasets for cancer cell lines has prompted us to develop methods for predicting and selecting the most effective combination therapy. Several AI-based combination prediction approaches have recently been introduced that combine high-throughput molecular profiling data with drug response data to improve prediction and validation. To reflect the relationships between drug combinations, Narayan et al. used dose-response data from pharmacogenomic encyclopaedias and represented them as drug atlas³². Combining with the pathway/gene ontology data, their approach enables the prediction of combinatorial therapy, i.e., vulnerability when attacked by two drugs that can be related to tumour-driving mutations. They repeated the predicted synergies in several tumours, including glioblastoma, breast cancer, melanoma, and leukaemia mouse models, highlighting the cancer-independent prediction power of drug combination treatment. Ianevski et al. also showed that bulk viability single-agent screening assays had unexpectedly large predictability for AML cell subpopulation co-inhibition effects when combined with scRNA-seq transcriptomic data²⁰. They developed a machine-learning model by combining single-cell RNA sequencing with ex vivo single-agent testing for AML with a different genetic background. They displayed an accurate prediction of synergistic patient-specific combinations while avoiding the inhibition of non-malignant cells. However, while our biomarker-independent approach relies only on the phenotypic level of information, that is, drug-response data, our predictions were compatible with the molecular profiling and biochemical annotations when it came to assessing the intra-cluster homogeneity of drugs, patients, and cell lines. Based on drug response in genetically diverse patient populations, Palmer and Sorger, on the other hand, emphasised the independent drug action in combinatorial therapy rather than drug additivity or synergy³⁵. They argued that heterogeneous responses across a population or patient-to-patient variability have a greater impact on predicting effective drug combinations. In our model, we also considered the patient’s level of information when recommending drug combinations. The reconstruction of the bipartite network on a large sample of the patient population and the subsequent clustering of patients and drugs took population heterogeneity into account for drug combinations, and we also computed several synergy measures to track the synergistic behaviour of drugs rather than drug additivity.

Moreover, a training machine-learning model for predicting drug combination response, comboFM, was recently introduced using drug combination screening data as a training dataset³¹. comboFM uses a factorisation machine to model cell context-specific drug interactions through higher-order tensors. Julkunen et al. demonstrated that comboFM enables leveraging information from previous experiments performed on similar drugs and cells as training data when predicting responses of new combinations, insofar as untested cells (testing data). They displayed high predictive performance and robust applicability of comboFM in various prediction scenarios using experimental validation of a set of previously untested drugs. However, we expounded that the prediction accuracy of the inter-cluster design strategy of drug combinations based on multipartite networks can be achieved independently from the high-quality training dataset.

Strictly speaking, in the present study, we revisited the analysis of nominal variables, namely drug name and sample identity, in drug screening results for data mining using graph theory, which we termed the nominal data mining approach^36,37. We first considered data quality control, such as outlier detection, outlier treatment, biological and technical replicates. Because of the discrete explanatory independent variable (i.e., drug doses)³⁸, we assumed that regression-based measurements might even be discarded; hence, we demonstrated that median values can represent an appropriate weight score in comparing drug functionality for network reconstruction. These values were used to quantify and weight the bipartite network, which reflects the interaction strength of the drugs and biological samples. Then, two similarity networks were provided by weighted network projection to detect the topological structure of the network communities. We showed that network communities represent a rationale starting point for proposing a combinational drug regimen. Our computational and experimental validation steps amplified the logic of our proposed platform³⁹. Hence, while training datasets were not required in this method to predict drug combination, drug response data alone were adequate for the prediction, without integrating prior knowledge of biochemical profiling.

Noting that the occurrence of synergistic toxicities, which can arise from additive toxicities when targets are shared by the combined drugs, is a major barrier to applying combination therapy in the clinic⁴⁰. If drug screening data on healthy cells are available, we suggest that a similar strategy for predicting toxicity without losing efficacy is also essential before future translational experiments. Ianevski et al. previously illustrated the importance of a desired synergy-efficacy-toxicity balance for predicting patient-customised drug combinations²⁰. Hence, drug-response data on healthy cells are demanded to complement synergistic interactions of drug combinations with toxicity predictions; where drug synergy and toxicity data are optimally matched for combinatorial therapy, stronger and longer-lasting outcomes of drug combinations can be predicted. While we aimed to identify combinations with maximal synergy, we cannot discount that the effects, especially at the patient level, could be lack of toxicity rather than synergistic, and that the efficacy of the combinations may be limited to specific patient populations.

Considering these possibilities, prospective work will necessitate the provision of further patient-derived experimental validations. Despite the fact that our prediction depends solely on the drug sensitivity dataset, our suggested combinations address the common mutational assigned aetiology of AML. This combination was proposed purely on the grounds of the phenotypic response of patient samples to the drugs, with no previous knowledge of the disease’s genetic origin. In this regard, for newly diagnosed leukaemia, we recommend evaluating top combinations rather than all potential combinations (which is unfeasible) in an ex vivo drug sensitivity and resistance testing (DSRT) method reported in⁴¹ to allow rapid translational precision medicine.

Methods

Figure 1 presents the entire workflow of this study. The weighted bipartite network is constructed using the Beat AML dataset. This dataset is a collaborative research programme of 11 academic medical centres providing data on AML samples while offering genomics, clinical, and drug responses. It includes a cohort study of 672 tumour specimens collected from 531 patients and an analysis of 122 drug responses. To construct a weighted bipartite network, the best response read-out of drug potency was defined using information-based measures. Then, two unipartite networks were obtained using network projection on the samples and drugs. Next, communities of two projected networks were extracted, and intra-cluster homogeneity analysis was performed using the similarity of drugs and patients/cell members based on available gene expression profiles for patients, protein–protein interaction network, and biological pathways. The drug candidates for drug combination were selected from two different communities, and a high-throughput drug screening was used to assess their synergistic effects.

Defining the response read-out for drug screening experiments

Pharmacogenomic studies require extensive standardisation to avoid inconsistency of drug response data for further research and unbiased predictions^42,43. Therefore, first, we controlled the quality of cell viability data to select the potent compounds. To achieve this, we examined the raw datasets regarding the availability of replicated data and outlier detection, followed by assessment of distribution, pairwise correlation, and homoscedasticity analyses to select the best response read-out or measure of drug potency. This analysis was performed using information-based nonparametric measures available in the Minerva package⁴⁴ by computing the maximal information coefficient (MIC), maximum edge value (MEV), and maximum asymmetry score (MAS). Furthermore, the relative and absolute IC50 (i.e., IC50 measures, which were computed based on the top and bottom plateaus of the curve or based on the blank and the positive control values, respectively), relative inhibition (RI) value, area under curve of drug-response fitted line (AUC), and the median of cell viability in the drug response experiments were assessed to select the best measurement. The chosen measurement was later used as a weight value for the edges of the weighted bipartite network reconstruction.

Reconstruction and analysis of the bipartite network model

In our bipartite network model, one group of nodes contained drugs and the other group contained cancer cell lines (in GDSC and ALMANAC) or patient samples (in Beat AML). The edges were defined by incidence matrices derived from the min–max normalised values:

$${Normalised\; value}=\frac{{{{{{\rm{value}}}}}}-{{{{{\rm{minimum}}}}}}({values})}{{{{{{\rm{maximum}}}}}}\left({values}\right)-{{{{{\rm{minimum}}}}}}({values})}$$

(1)

This normalisation transforms these values, which indicate the potency of small molecules on cancer cell lines or patient samples, into a decimal between 0 and 1. Next, we projected the bipartite network into two similarity networks: the drug similarity network and sample similarity network. In the network projection, two unipartite graphs were derived from a bipartite graph, resulting in the deduction of a similar node’s relationships. In this study, we projected similarity networks that consider the edge weights in the bipartite network. Then, we studied the general properties of the networks, such as network heterogeneity, centralisation, and clustering coefficients. The critical step was community detection within the projected networks to discern functionally similar drugs and cells or patients regarding drug response. The modularity index was used to determine the best community detection algorithms, including infomap⁴⁵, fast greedy⁴⁶, and spinglass⁴⁷. Furthermore, we explored the network modules to propose a strategy for drug combination design.

Computational corroboration

Multiple computational methods were applied to validate the predictions of the drug combinations and patient or cell stratification. The validation of the community structures is like the general cluster quality assessment method, and we assessed the clustering performance by matching the clustering structures to prior knowledge. This validation is foundational to possible drug combination designs. Alternatively, the combination of distinct drugs in terms of chemical structure, target profile, and implicated biological pathways is likeliest more efficient than similar drugs⁷. Therefore, we used the drug–target network, protein–protein interactions, and signalling networks to justify the similarity of cluster elements. Thus, Chembl⁴⁸, drug target commons (DTC)⁴⁹, KEGG⁵⁰, and the OmniPath database⁵¹ were used to extract prior annotations about the drugs and their targets. To compare the chemical structures of the drugs, a simplified molecular input line entry system (SMILES) of the drug molecules was retrieved and transformed into an extended connectivity fingerprint (ECFP) to assess the Dice similarity of the molecules. The Dice similarity is one of the standard metrics for molecular similarity calculations in which

$${S}_{A,B}=2c/(a+b)$$

(2)

where a is the number of ON bits in molecule A, b is the number of ON bits in molecule B, and c is the number of ON bits in both A and B molecules⁵². Also, the corresponding gene expression profiles were used to assess similarity within a patient or cell line modules in the sample similarity networks. For reads per kilobase per million (RPKM) with negative values and counts per million (CPM), we used the Harmonic similarity and Jaccard distance, respectively, as follows:

$${S}_{P,Q}=2\times \mathop{\sum }\nolimits_{i=1}^{n}\left({P}_{i}\times {Q}_{i}\right)/\left({P}_{i}+{Q}_{i}\right)$$

(3)

$${D}_{P,Q}=1-\mathop{\sum }\nolimits_{i=1}^{n}\left({P}_{i}\times {Q}_{i}\right)/\left({\sum }_{i=1}^{n}{{P}_{i}}^{2}+{\sum }_{i=1}^{n}{{Q}_{i}}^{2}+{\sum }_{i=1}^{n}{P}_{i}\times {Q}_{i}\right)$$

(4)

where ${{{{{\bf{P}}}}}}=\left\{{P}_{1},{P}_{2},\cdots ,{P}_{n}\right\}$ and ${{{{{\bf{Q}}}}}}=\left\{{Q}_{1},{Q}_{2},\cdots ,{Q}_{n}\right\}$ denote the vector of gene expression values for patients or cell lines, and n is the number of genes. In all cases, the similarity or distance scores were compared with the random grouping of small molecules or biological samples to perform statistical testing.

The synergy scores provided by the DrugComb database⁵³ were used to corroborate synergistic combinations of our network-based predictions, including HSA, Bliss, Loewe, ZIP, CSS, and S. Let us assume that drug 1 at dose x₁ and drug 2 at dose x₂ are used to produce the effects of y₁ and y₂, and y_c is the effect of their combination. Drug effect is usually measured as a percentage of cell death, and a drug combination is classified as synergistic, antagonistic, or non-interactive⁵⁴. The expected effect denoted by y_e represents a non-interactive level, and it is quantified based on a reference model. Several mathematical models have been introduced to calculate the expected effect by assuming specific principles. The HSA model⁵⁵ considers the expected combination effect as the maximum of single-drug effects, that is,

$${y}_{e}={\max }\left({y}_{1},{y}_{2}\right)$$

(5)

The Loewe model⁵⁶ assumes that an individual drug produces y_e at a higher dose than in the combination. In the Bliss model⁵⁷, y_e is the effect of the two drugs acting independently. The ZIP model⁵⁴ considers the assumptions of the Loewe and Bliss models by assuming that, at the reference model, two drugs do not potentiate each other. CSS determines the sensitivity of a drug pair, and S synergy is based on the difference between the drug combination and the single drug dose–response curves²⁵.

Cell culture and reagents

AML cell lines MOLM-16,NOMO-1, and OCL-AML3 were purchased from DSMZ-German Collection of Microorganisms and Cell Cultures (DSMZ no. ACC 555: MOLM-16, ACC 542: NOMO-1, ACC 582: OCI-AML3). MOLM-16 and NOMO-1 were cultured in RPMI-1640 medium (Gibco/Thermo Fisher Scientific, Waltham, MA, USA) and OCI-AML3 in α-MEM (with nucleosides; Gibco/Thermo Fisher Scientific) supplemented with GlutaMAX (Gibco CTS/Thermo Fisher Scientific), foetal bovine serum (20% for MOLM-16 and OCI-AML3; 10% for NOMO-1), and antibiotics.

Drug combination testing

The compounds dissolved in dimethyl sulfoxide (DMSO) were plated using Beckman Coulter Echo 550 Liquid Handler (Beckman Coulter, Indianapolis, IN, USA) combined with seven concentrations for each compound in half-log dilution series with 2.5/7.5/25 nl volumes, covering a 1,000-fold concentration range on black clear-bottom TC-treated 384-well plates (Corning #3764, Corning, NY, USA). All doses were randomised across the plate to minimise any plate effects. As positive (total killing) and negative (non-effective) controls, 100 μM of benzethonium chloride and 0.2% DMSO were used, respectively.

Cells were plated on pre-administered compound plates in 25 μl (2500, 2000, or 1250 cells per well for MOLM-16, NOMO-1, and OCI-AML3 cell lines, respectively) using BioTek MultiFlo FX RAD (5 μl cassette) (Biotek, Winooski, VT, USA) and incubated for 72 h at 37 °C and 5% CO2. Cell viability was then determined by dispensing 25 μl of Cell Titre Glow 2.0 reagent (Promega, Madison, WI, USA). Plates were incubated for 5 min and centrifuged for 5 min (173 × g) before reading luminescence with a PHERAstar FS multimode plate reader (BMG Labtech, Ortenberg, Germany).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The datasets analysed during the current study are publicly available in the abovementioned repositories, i.e., Beat AML [http://vizome.org/aml/], GDSC [https://www.cancerrxgene.org/], ALMANAC [https://drugcomb.org/]. Also, the generated data in this study have been deposited in the Zenodo database under this DOI link [https://doi.org/10.5281/zenodo.5789170]. Source data are provided with this paper. The remaining data are available in the Article and Supplementary Figures.

Code availability

All analyses reported in this study used the statistical software R (v.4.0.0). All related R files are available in this link; https://doi.org/10.5281/zenodo.5789170.

References

Nussinov, R., Jang, H., Tsai, C. J. & Cheng, F. Review: Precision medicine and driver mutations: computational methods, functional assays and conformational principles for interpreting cancer drivers. Plos Comput. Biol. 15, 1–54 (2019).
Google Scholar
Yaffe, M. B. Why geneticists stole cancer research even though cancer is primarily a signaling disease. Sci. Signal. 12, 565 (2019).
Article CAS Google Scholar
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: the next generation. Cell 144, 646–674 (2011).
Article CAS PubMed Google Scholar
Kibble, M. et al. Network pharmacology applications to map the unexplored target space and therapeutic potential of natural products. Nat. Prod. Rep. 6, 1249–1266 (2015).
Article Google Scholar
Tang, J. & Aittokallio, T. Network pharmacology strategies toward multi-target anticancer therapies: from computational models to experimental design principles. Curr. Pharm. Des. 20, 23–36 (2014).
Article CAS PubMed Google Scholar
Wang, Z. et al. Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd. Nat. Commun. 7, 12846 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Barneh, F. et al. Integrated use of bioinformatic resources reveals that co-targeting of histone deacetylases, IKBK and SRC inhibits epithelial-mesenchymal transition in cancer. Brief. Bioinformatics 20, 717–731 (2019).
Article CAS PubMed Google Scholar
Barneh, F. et al. Valproic acid inhibits the protective effects of stromal cells against chemotherapy in breast cancer: insights from proteomics and systems biology. J. Cell. Biochem. 119, 9270–9283 (2018).
Article CAS PubMed Google Scholar
Gholizadeh, E. et al. Identification of celecoxib-targeted proteins using label-free thermal proteome profiling on rat hippocampus. Mol. Pharmacol. 99, 308 (2021).
Article CAS PubMed Google Scholar
Pemovska, T. et al. Individualized systems medicine strategy to tailor treatments for patients with chemorefractory acute myeloid leukemia. Cancer Discov. 3, 1416–1429 (2013).
Article CAS PubMed Google Scholar
Pauli, C. et al. Personalized in vitro and in vivo cancer models to guide precision medicine. Cancer Discov. 7, 462–477 (2017).
Article PubMed PubMed Central Google Scholar
Jafari, M., Ansari-Pour, N., Azimzadeh, S. & Mirzaie, M. A logic-based dynamic modeling approach to explicate the evolution of the central dogma of molecular biology. PLOS ONE 12, e0189922 (2017).
Article CAS PubMed PubMed Central Google Scholar
Xu, T., Pi, Z., Liu, S., Song, F. & Liu, Z. Chemical profiling combined with “omics” technologies (CP-Omics): a strategy to understand the compatibility mechanisms and simplify herb formulas in traditional Chinese medicines. Phytochemical Anal. 28, 381–391 (2017).
Article CAS Google Scholar
Shinkafi, T. S. Holistic approach to traditional and herbal medicines: the role of omics, systems biology, and computational technologies. In Plant Bioinformatics (eds. Hakeem, K., Vardar-Sukan, F. & Ozturk M.) (Springer, Cham, 2017).
Flobak, Å. et al. A high-throughput drug combination screen of targeted small molecule inhibitors in cancer cell lines. Sci. Data 6, 237 (2019).
Article CAS PubMed PubMed Central Google Scholar
Budman, D. R., Calabro, A. & Kreis, W. Synergistic and antagonistic combinations of drugs in human prostate cancer cell lines in vitro. Anticancer Drugs 13, 1011–1016 (2002).
Article CAS PubMed Google Scholar
Budman, D. R., Calabro, A., Rosen, L. & Lesser, M. Identification of unique synergistic drug combinations associated with downexpression of survivin in a preclinical breast cancer model system. Anti-Cancer drugs 23, 272–279 (2012).
Article CAS PubMed PubMed Central Google Scholar
Jaiswal, A. et al. Multi‐modal meta‐analysis of cancer cell line omics profiles identifies ECHDC1 as a novel breast tumor suppressor. Mol. Syst. Biol. 17, e9526 (2021).
Article CAS PubMed PubMed Central Google Scholar
He, L. et al. Patient-customized drug combination prediction and testing for T-cell prolymphocytic leukemia patients. Cancer Res. 78, 2407–2418 (2018).
Article CAS PubMed Google Scholar
Ianevski, A. et al. Patient-tailored design for selective co-inhibition of leukemic cell subpopulations. Sci. Adv. 7, eabe4038 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Tyner, J. W. et al. Functional genomic landscape of acute myeloid leukaemia. Nature 562, 526–531 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Iorio, F. et al. A landscape of pharmacogenomic interactions. Cancer Cell. 166, 740–754 (2016).
CAS Google Scholar
Holbeck, S. L. et al. The National Cancer Institute ALMANAC: a comprehensive screening resource for the detection of anticancer drug pairs with enhanced therapeutic activity. Cancer Res. 77, 3564–3576 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zheng, S. et al. DrugComb update: a more comprehensive drug sensitivity data repository and analysis portal. Nucleic Acid Res. 49, W174–W184 (2021).
Article CAS PubMed PubMed Central Google Scholar
Malyutina, A. et al. Drug combination sensitivity scoring facilitates the discovery of synergistic and efficacious drug combinations in cancer. PLoS Computational Biol. 15, e1006752 (2019).
Article CAS Google Scholar
Tabei, Y., Pauwels, E., Stoven, V., Takemoto, K. & Yamanishi, Y. Identification of chemogenomic features from drug–target interaction networks using interpretable classifiers. Bioinformatics 28, i487–i494 (2012).
Article CAS PubMed PubMed Central Google Scholar
Öztürk, H., Ozkirimli, E. & Özgür, A. A comparative study of SMILES-based compound similarity functions for drug-target interaction prediction. BMC Bioinformatics 17, 128 (2016).
Article CAS PubMed PubMed Central Google Scholar
Montaruli, M. et al. Accelerating drug discovery by early protein drug target prediction based on a multi-fingerprint similarity search. Molecules 24, 2233 (2019).
Article CAS PubMed Central Google Scholar
Trosset, J.-Y. & Cavé, C. In silico drug–target profiling. In Target Identification and Validation in Drug Discovery: Methods and Protocols (eds. Moll, J. & Carotta, S.) 89–103 (Springer, New York, NY, 2019).
Parikh, J. R., Klinger, B., Xia, Y., Marto, J. A. & Blüthgen, N. Discovering causal signaling pathways through gene-expression patterns. Nucleic Acids Res 38, W109–W117 (2010).
Article CAS PubMed PubMed Central Google Scholar
Julkunen, H. et al. Leveraging multi-way interactions for systematic prediction of pre-clinical drug combination effects. Nat. Commun. 11, 6136 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Narayan, R. S. et al. A cancer drug atlas enables synergistic targeting of independent drug vulnerabilities. Nat. Commun. 11, 2935 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Ianevski, A., He, L., Aittokallio, T. & Tang, J. SynergyFinder: a web application for analyzing drug combination dose-response matrix data. Bioinformatics 33, 2413–2415 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zheng, S. et al. SynergyFinder Plus: toward better interpretation and annotation of drug combination screening datasets. Genomics Proteomics Bioinformatics https://www.sciencedirect.com/science/article/pii/S1672022922000080 (2022). In press.
Palmer, A. C. & Sorger, P. K. Combination cancer therapy can confer benefit via patient-to-patient variability without drug additivity or synergy. Cell 171, 1678 (2017).
Article CAS PubMed PubMed Central Google Scholar
Jafari, M., Chen, C., Mirzaie, M. & Tang, J. NIMAA: an R/CRAN package to accomplish NomInal data Mining AnAlysis. bioRxiv https://doi.org/10.1101/2022.01.13.475835.
Jafari, M., Wang, Y., Amiryousefi, A. & Tang, J. Unsupervised learning and multipartite network models: a promising approach for understanding traditional medicine. Front. Pharmacol. 11, 1319 (2020).
Article PubMed PubMed Central Google Scholar
Montgomery, D. C., Peck, E. A. & Vining, G. G. Introduction to Linear Regression Analysis. (Wiley, 2012).
Jafari, M., Guan, Y., Wedge, D. C. & Ansari-Pour, N. Re-evaluating experimental validation in the Big Data Era: a conceptual argument. Genome Biol. 22, 71 (2021).
Article PubMed PubMed Central Google Scholar
Larkin, J. et al. Combined nivolumab and ipilimumab or monotherapy in untreated melanoma. N. Engl. J. Med. 373, 23–34 (2015).
Article CAS PubMed PubMed Central Google Scholar
Malani, D. et al. Implementing a functional precision medicine tumor board for acute myeloid leukemia. Cancer Discov. 12, 388 (2021).
Geeleher, P., Gamazon, E. R., Seoighe, C., Cox, N. J. & Huang, R. S. Consistency in large pharmacogenomic studies. Nature 540, E1–E2 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Mpindi, J. P. et al. Consistency in drug response profiling. Nature 540, E5–E6 (2016).
Article CAS PubMed Google Scholar
Reshef, D. N. et al. Detecting novel associations in large data sets. Science 334, 1518–1524 (2011).
Article ADS CAS MATH PubMed PubMed Central Google Scholar
Rosvall, M. & Bergstrom, C. T. Maps of random walks on complex networks reveal community structure. Proc. Natl Acad. Sci. 105, 1118 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Clauset, A., Newman, M. E. & Moore, C. Finding community structure in very large networks. Phys. Rev. E 70, 066111 (2004).
Article ADS CAS Google Scholar
Newman, M. E. & Girvan, M. Finding and evaluating community structure in networks. Phys. Rev. E 69, 026113 (2004).
Article ADS CAS Google Scholar
Gaulton, A. et al. The ChEMBL database in 2017. Nucleic Acids Res 45, D945–D954 (2017).
Article CAS PubMed Google Scholar
Tang, J. et al. Drug target commons: a community effort to build a consensus knowledge base for drug-target interactions. Cell Chem. Biol. 25, 224 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kanehisa, M., Sato, Y., Furumichi, M., Morishima, K. & Tanabe, M. New approach for understanding genome variations in KEGG. Nucleic Acids Res 47, D590–D595 (2019).
Article CAS PubMed Google Scholar
Türei, D., Korcsmáros, T. & Saez-Rodriguez, J. OmniPath: guidelines and gateway for literature-curated signaling pathway resources. Nat. Methods 13, 966–967 (2016).
Article CAS PubMed Google Scholar
Bajusz, D., Rácz, A. & Héberger, K. Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations?. J. Cheminformatics 7, 1–13 (2015).
Article CAS Google Scholar
Zagidullin, B. et al. DrugComb: an integrative cancer drug combination data portal. Nucleic Acids Res. 47, 43 (2019).
Yadav, B., Wennerberg, K., Aittokallio, T. & Tang, J. Searching for drug synergy in complex dose-response landscapes using an interaction potency model. Comput Struct. Biotechnol. J. 13, 504–513 (2015).
Article CAS PubMed PubMed Central Google Scholar
Berenbaum, M. C. What is synergy? Pharm. Rev. 41, 93–141 (1989).
CAS PubMed Google Scholar
Loewe, S. The problem of synergism and antagonism of combined drugs. Arzneimittelforschung 3, 285–290 (1953).
CAS PubMed Google Scholar
BLISS, C. I. The toxicity of poisons applied jointly1. Ann. Appl. Biol. 26, 585–615 (1939).
Article CAS Google Scholar

Download references

Acknowledgements

This study was financially supported by the Academy of Finland [Grant 332454 to M.J., Grant 317680 to J.T. and Grant 320131 to J.T.], and European Research Council [Grant 716063 to J.T.]. Drug screening was carried out at the FIMM High Throughput Biomedicine Unit (HTB), which is hosted by the University of Helsinki and supported by HiLIFE and Biocenter Finland. Additionally, the authors wish to acknowledge Jani Saarela and Laura Turunen of the HTB unit.

Author information

Authors and Affiliations

Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
Mohieddin Jafari, Mehdi Mirzaie, Jie Bao, Shuyu Zheng, Johanna Eriksson & Jing Tang
Prinses Maxima Center for Pediatric Oncology, 3584 CS Utrecht, Utrech, the Netherlands
Farnaz Barneh
Institute for Molecular Medicine Finland - FIMM, HiLIFE - Helsinki Institute of Life Science, iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
Caroline A. Heckman

Authors

Mohieddin Jafari
View author publications
You can also search for this author in PubMed Google Scholar
Mehdi Mirzaie
View author publications
You can also search for this author in PubMed Google Scholar
Jie Bao
View author publications
You can also search for this author in PubMed Google Scholar
Farnaz Barneh
View author publications
You can also search for this author in PubMed Google Scholar
Shuyu Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Johanna Eriksson
View author publications
You can also search for this author in PubMed Google Scholar
Caroline A. Heckman
View author publications
You can also search for this author in PubMed Google Scholar
Jing Tang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.J. and J.T. conceived of the study and supervised the project. M.J. developed the network models and led the computational analysis. M.M. and S.Z. provided computational support, while J.B. and J.E. designed and developed the experimental methods for drug sensitivity analysis. M.J., M.M., F.B., J.E. and J.T. contributed to the interpretation of the findings, and C.A.H. advised on the work. All authors contributed to the final manuscript by discussing the findings and reviewing and modifying it.

Corresponding authors

Correspondence to Mohieddin Jafari or Jing Tang.

Ethics declarations

Competing interests

C.A.H. has received research funding from BMS/Celgene, Kronos Bio, Novartis, Oncopeptides, Orion Pharma, and IMI2 projects HARMONY and HARMONYPLUS unrelated to this work. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Feixiong Cheng and the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jafari, M., Mirzaie, M., Bao, J. et al. Bipartite network models to design combination therapies in acute myeloid leukaemia. Nat Commun 13, 2128 (2022). https://doi.org/10.1038/s41467-022-29793-5

Download citation

Received: 31 May 2021
Accepted: 30 March 2022
Published: 19 April 2022
DOI: https://doi.org/10.1038/s41467-022-29793-5

This article is cited by

Designing patient-oriented combination therapies for acute myeloid leukemia based on efficacy/toxicity integration and bipartite network modeling
- Mehdi Mirzaie
- Elham Gholizadeh
- Mohieddin Jafari
Oncogenesis (2024)
Personalized tumor combination therapy optimization using the single-cell transcriptome
- Chen Tang
- Shaliu Fu
- Qi Liu
Genome Medicine (2023)
Harmonizing across datasets to improve the transferability of drug combination prediction
- Hanrui Zhang
- Ziyan Wang
- Yuanfang Guan
Communications Biology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.