Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

A new precision medicine initiative at the dawn of exascale computing

Abstract

Which signaling pathway and protein to select to mitigate the patient’s expected drug resistance? The number of possibilities facing the physician is massive, and the drug combination should fit the patient status. Here, we briefly review current approaches and data and map an innovative patient-specific strategy to forecast drug resistance targets that centers on parallel (or redundant) proliferation pathways in specialized cells. It considers the availability of each protein in each pathway in the specific cell, its activating mutations, and the chromatin accessibility of its encoding gene. The construction of the resulting Proliferation Pathway Network Atlas will harness the emerging exascale computing and advanced artificial intelligence (AI) methods for therapeutic development. Merging the resulting set of targets, pathways, and proteins, with current strategies will augment the choice for the attending physicians to thwart resistance.

Introduction

Precision medicine aims to identify patient-specific drug targets.1,2,3,4,5 To date, the approaches have largely focused on (i) identification of proteins with driver mutations; the mutations can be strong drivers, weak drivers, rare drivers, or latent drivers;2,5,6,7,8,9,10,11 (ii) decision on how to target: should the mutant protein be targeted with a combination of drugs, e.g. one orthosteric and the other allosteric, or should a second protein also be targeted, in which case should the second protein be from the same pathway (the more frequent case) or from a different pathway.12 If from a different (redundant, or parallel) pathway, the protein is generally selected based on the physician’s prior knowledge; and finally, once identified, (iii) selection of drugs targeting these. Drug discovery can be via large-scale screening, in silico docking, structure-based drug design, or drug repurposing.13,14,15,16,17,18 The approaches rely on vast quantities of data, high-resolution structural data, highly efficient state-of-the-art algorithms to sift through these and large-scale scientific computation.1 They are mapped in the diagram in Fig. 1. These approaches empowered significant progress since the launch of the precision medicine initiative, with breakthrough discoveries identifying activating mutations in key oncogenic proteins and their isoforms, their patterns and mechanisms.19 However, the complexity and challenge of identifying drug resistance targets call for broadening the current strategies and marshalling new ones from a different standpoint.

Fig. 1
figure1

A workflow illustrating an artificial intelligence-based, exascale computing framework for integrative analysis of multi-omics data for precision cancer medicine. PCO, probabilistic co-occurrence; PPMI, positive pointwise mutual information. The detailed description about a deep neural network-based multi-omics and network integration can be found in a recent study81

The framework outlined here maps a conceptually innovative complementary Precision Medicine Initiative that embraces the principles of cell proliferation to counter drug resistance. Different than current strategies it proposes (i) to identify or predict all potential proliferation pathways in the cell. Proliferation pathways constitute the ‘stockpile’ of drug resistance pathways; (ii) to investigate the chromatin accessibility of genes encoding each protein in each proliferation pathway in the specific cell to confirm pathway availability in drug resistance, as well as their cell-specific expression levels; (iii) to identify their driver mutations, and the mechanisms of pathway activation; and finally, exploiting artificial intelligence (AI) methodologies, to (iv) integrate multi-omics cancer data and networks’ perturbations for therapeutic development. Thus, it aims to stop cell proliferation by identifying all possible proliferation pathways and predicting which gene is likely to become the next driver of cancer in the specific patient cell. This innovative and comprehensive strategy is computationally intensive and proposes to exploit the emerging exascale computing in the last two steps. We expect that a Proliferation Pathway Atlas incorporating such data would be an invaluable resource to the community.

Proliferation pathways are critical in cancer

Proliferation pathways link the cellular environment to the cell cycle. Genetic (mutations) and epigenetic alterations can lead to pathway hyperactivation, fueling cancer progression, as does inactivation of tumor suppressors. Protein-protein interactions of major cancer drivers are enriched in mutations, hijacking pro-proliferative signaling networks.

Pathways crosstalk. Crosstalk emerges due to shared interactions and elements.20,21 It can influence their expression level and function. Under physiological conditions, crosstalk enables cells to cope with perturbations of homeostasis. In drug resistance, inhibition of a signaling pathway can promote activation of a survival pathway that bypasses the inhibited pathway. Insight into connections between signaling pathways and foresight into their distinct activation can be powerful in the treatment of cancer.22

What distinguishes proliferation pathways from other signaling pathways?

The Ras/phosphoinositide 3-kinase (PI3K)/Akt and Ras/extracellular signal-regulated kinase (ERK) pathways provide good examples of proliferation pathways.23,24 Ras is activated by stimulated receptor tyrosine kinases (RTKs). Ras mutants are involved in roughly a third of the cancers. The identity of the other two-third proliferation pathways is only partially known.

Proliferation involves cell growth and division. Proliferation can take place through many pathways and is particularly active during development. It is also essential in adult homeostasis. Signaling pathways that control cell proliferation25 can act by linking the cellular environment to progression through the G1 (Gap 1) phase of the cell cycle (Fig. 2). Progression through G1 is controlled by retinoblastoma protein (pRb) whose phosphorylation by the G1 cyclin-dependent kinases (CDKs) promote passage of the cell cycle to the S (Synthesis) phase. The pRb pathway (thus G1) is mainly regulated by cyclins and CDK inhibitors with inputs from major cellular signaling pathways. pRb tumor suppressor binds to the E2F1 transcription factor (TF), repressing the G1/S transition; phosphorylation of pRb proteins by CDKs liberates E2F, promoting the transition to S phase.

Fig. 2
figure2

Cyclin-dependent kinases (CDKs) cell cycle. CDKs and their cyclin partners form complexes, regulating the progression through the cell cycle (upper left panel). In early G1 phase (G1-pm), retinoblastoma protein (pRb) becomes phosphorylated by the complex, a pair of cyclin-D and CDK4/6 (upper right panel). In late G1 phase (G1-ps), pRb is hyper-phosphorylated by cyclin-E/CDK2 complex, undergoing a large conformational change. This conformational change fails to assemble with E2F, a transcription factor, which promotes to progress the G1/S transition. Cyclins bind with the dependent kinases and their concentration varies during the cell cycle (lower left panel). A crystal structure illustrates an example of cyclin/CDK complex (PDB: 2W99) (lower right panel). Apo-CDKs exhibit little kinase activity; CDKs become active kinases when interacting with the regulatory protein called a cyclin

Criteria for identifying proliferation pathways

A pathway that promotes cell proliferation (i) can lead to activation of TFs that induce expression of proteins acting in multiple pathways, including oncogenic functions such as proliferation and survival, with some of these (ii) entering the cell cycle. Cyclin-D, whose synthesis is initiated during the cell cycle G1 phase and is involved in regulating cell cycle progression provides an example (Fig. 2). The cyclin-D/CDK4 complex, which consists of cyclin-D and CDK4, or CDK6, a serine-threonine kinase, is essential for the progression of the cell from the G1 to the S phase, for the Start or G1/S checkpoint. Some proteins control operations critical for cell cycle progression. Cyclin-D transcription is activated through the growth factor-stimulated RTK proliferation pathway which expresses Myc, a TF that controls transcription of several cell cycle-regulating genes, including cyclin-D. Myc promotes the cell cycle primarily through its role in cellular growth control. c-Myc target genes include regulators of cell growth; but also, those functioning in cell division pathways. Among c-Myc target genes that regulate cell growth are those associated with ribosomal protein transcription and translation, including translation initiation factors such as eukaryotic translation initiation factor 4E (eIF4E). Active RTK signals through the two major signaling pathways; c-Myc is involved in both. Notably, a proliferation pathway, such as MAPK can also activate gene sets for immune response.26

Examples of proliferation pathways

Wnt/β-catenin, Notch, Hedgehog, transforming growth factor β (TGF-β) and Hippo are implicated in developmental processes and proliferation. Janus kinase/signal transducer and activator of transcription (JAK/STAT) is an example of a proliferation pathway through a cytokine receptor (IL7). Here we focus on development-related pathways and discuss the first three. Embryogenesis and tumorigenesis share coordinated mechanisms of proliferation, differentiation, and migration.27

Wnt/β-catenin signaling

The Wnt signaling cascade is a main regulator of development, controlling the growth of embryonic stem cells and adult cell specialization (Fig. 3). The pathway is also frequently active in cancer.28 Wnt growth factors alter gene expression by stimulating different classes of receptors. They lead to cell proliferation through their impact on the cell cycle.29 Wnt pathway components, such as β-catenin, Dishevelled (Dsh, Dvl in mammals), Frizzled (Frz, a Wnt receptor), low-density lipoprotein receptor-related protein 6 (LRP6, a Wnt co-receptor), and Axin have been associated with cell cycle regulation, centrosome biology, and cell division. Several Wnt pathway components play essential roles during mitosis, which is proposed to also regulate Wnt signaling via cyclin-Y/CDK14 phosphorylation of LRP6.30 They also control cell morphogenesis, affecting the cytoskeleton and the mitotic spindle. Wnt-stimulated signaling activates β-catenin which interacts with DNA-bound TFs of the T-cell factor (TCF) family. β-catenin switches inactive TCF into a transcriptional activator of its target genes.31 Chromatin remodeling complexes can bind β-catenin and promote transcriptional activation of TCF-responsive reporter genes. Transcriptional co-activators, such as p300 and cAMP-response element binding protein (CREB) can alter chromatin structure through histone acetyltransferase to stimulate transcriptional activity. In the absence of a Wnt signal, β-catenin is degraded by a complex which includes the Axin scaffold protein, glycogen synthase kinase 3β (GSK3β), and adenomatous polyposis coli (APC). TCF is bound to the Groucho repressor; binding of Wnt to its receptors induces dissociation of the complex. β-catenin binds TCF in the nucleus.

Fig. 3
figure3

The Wnt signaling pathways. In the absence of Wnt signal, the destruction complex constituted by Axin, adenomatosis polyposis coli (APC), and glycogen synthase kinase 3β (GSK3β) leads to phosphorylation of β-catenin by GSK3β in the cytoplasm. The phosphorylated β-catenin is subsequently ubiquitinated by β-TrCP and targeted for proteasomal degradation. The canonical Wnt/β-catenin pathway is activated by upon binding of Wnt to its Frizzled (Frz) receptor and low-density lipoprotein receptor-related protein 6 (LRP6) co-receptor. LRP6 phosphorylated by cyclin-Y/CDK14 complex, GSK3β, and casein kinase 1γ (CK1γ) recruits the destruction complex and Dishevelled (Dsh, Dvl in mammals) to the plasma membrane. Dvl is activated through polymerization, inhibiting the destruction complex. This results in accumulation of unphosphorylated β-catenin in the cytoplasm and its subsequent translocation into the nucleus, leading to cell proliferation. In the non-canonical Wnt/calcium pathway, calmodulin-bound CaMKII (calcium/calmodulin-dependent protein kinase II) negatively regulates the canonical β-catenin/TCF/LEF signaling through phosphorylation of TCF, which inhibits β-catenin/TCF-mediated transcription

Hedgehog signaling

Hedgehog communicates between cells. It is important for organ development, regeneration and homeostasis; it is frequently modulated in cancer.27 It cross-talks with e.g. transforming growth factor β (TGFβs), Wnt, Notch, and the Sonic hedgehog (Shh). The Shh pathway can involve canonical or non-canonical signaling. The first is receptor ligand-dependent when Shh binds to Ptch (a 12-transmembrane protein) at the membrane; the second is through downstream smoothened (Smo).32 Smo regulates Gli transcription factors processing and activation, which activate target genes. Non-canonical activation is Gli-independent. Hedgehog signaling upregulates multiple proteins, including N-Myc (a member of the Myc family), forkhead box M1 (FoxM1), and Cdc25B, which activates the cyclin-dependent kinase CDC2. It also upregulates CCND1, CCND2, and CCNE. Cyclin-D1, cyclin-D2, and cyclin-E which drive cell-cycle progression at the G1/S phase, while FoxM1, cyclin-B1, and Cdc25B act at the G2/M (mitotic) phase. Thus, hedgehog signals drive cell-cycle progression through multiple cell cycle regulators.

Notch signaling

Notch signaling takes place via cell-cell communication, where transmembrane ligands on one cell activate those of the other. The cleaved receptor is translocated to the nucleus.33 Notch intracellular domain (NICD) forms a trimeric complex with CSL (CBF1, Suppressor of Hairless, Lag-1; a transcription factor that activates genes downstream in the Notch pathway) and Mastermind-like (MAML) transcriptional coactivator, which converts CSL from a repressor to an activator and initiates transcription of Notch downstream target genes. In the absence of Notch signaling, CSL represses transcription; following activation by Notch, it is converted into a transcriptional activator and activates transcription of the same genes. Notch signaling with its CSL cofactor can maintain cells in an undifferentiated state, consequently associated with cancer. It controls cell lineage and tissue development, blocking differentiation thus retaining stem or progenitor cells, or governing the balance between cell fates. Notch signaling mediates G1/S cell-cycle progression in T-cells via cyclin-D3 and its dependent kinases and activates cell cycle reentry and progression in quiescent cardiomyocytes. Notch signaling acts before cell division to promote asymmetric cleavage and cell fate of neural precursor cells; its activation can inhibit proliferation of endothelial cells by delaying cyclin-D/CDK4-mediated phosphorylation of the retinoblastoma protein. It also regulates variant cell cycles to control cell size34 and more.

EGFR signaling

Epidermal growth factor receptor (EGFR) pathway is a classic example of proliferation pathway that can lead to G1 cell cycle progression, through cyclin-D expression, CDK4/6 activation, and the repression of cyclin-dependent kinase inhibitor proteins (CDKi) by EGFR signaling pathways.

Selection of the proliferation pathway to drug

Halting proliferation by drugging the pathway most likely to become the next driver in the patient cell is a powerful and compelling amplification of current therapeutic approaches. It considers cancer evolution dynamics which to date has been missing. The challenge is however in the knowledge of (i) all possible proliferation pathways, (ii) the accessibility of each gene encoding a protein in each pathway in the specific cancer cell, including (iii) expression data, and (iv) the driver mutations in each gene.

Genes of targeted pathways should be accessible

To be a good drug candidate, the proteins in the proliferation pathway should be available in the specific cell. This requires that the genes encoding the pathway proteins are accessible to the transcription machinery or can become accessible upon a ‘modest’ change in the chromatin structure. Not all proteins are expressed in all cells. Chromatin availability status is cell type, lineage and state-dependent.12,35 Genes active in developmental or embryonic pathways can become densely packed in the chromatin and inaccessible. Further, because signaling in a skin cell differs from that in a kidney cell, proliferation pathways in drug resistance are likely to differ between these cells.36 Oncogenic cells manifest tissue-specific tendencies,36,37 with distinct cells having preferred proliferation profiles. Accessibility is controlled by cell-specific chromatin-binding factors,38,39 including e.g. pioneer transcription factors that locally unfold the condensed chromatin and nucleosomes. Accessibility can also be regulated by the proliferation pathway itself, as in the case of Notch33 and its epigenetics.40

Experimental accessibility data are limited. Predicting the three-dimensional genome organization and chromatin accessibility is also challenging. High-resolution structural data provide structural detail, allow mapping of genomes, insight into effects of mutations and dysregulation that traditional methods that identify the genes with active histone modification markers, such as H3K27ac, H3K4ac3 are unable to provide. Simulations with parameterization based on the free-energy landscape theory,41,42 genomics and epigenomics data, reproduced chromosome conformation capture data (Hi-C)12,43,44,45,46,47 and super-resolution microscopy.42,48,49 They permitted predicting chromatin structures at 5 kilobase resolution starting from genomics and epigenomics data that are available for hundreds of cell types, including cancer cells.42 Integration of Hi-C data with conventional microscopy led to more accurate prediction of genome organization.50 More recently, Hi-C data and super-resolution imaging were brought together through integrative modeling of genomic regions (IMGR), thus achieving high spatial and genomic resolution, while maintaining the single-cell identity.51,52,53 IMGR can be broadly divided into three steps. In step one, models are constructed of Hi-C data.54,55 In step two, these models are rigidly fitted onto structures resolved by super-resolution microscopy. The top 5% that fit the most qualify to the next step, which is the flexible fitting. In flexible fitting, the polymer chains are allowed to swivel around TAD borders, which are expected to be more flexible. The model that best fits each super-resolved structure is chosen. Such a technology promotes optimism that a precision level that unearths the chromatin status of driver genes is reachable; genes with sparse chromatin density would suggest that they are drug resistance candidates. Integrative successes promise increasingly detailed mapping of dynamic chromatin maps of single cells.

IMGR is especially beneficial when integrating with images (Fig. 4). Here, we focus on 7 chromosomal segments out of the 9 imaged using sequential OligoSTORM, and color-code them as either active (red) or inactive (blue).51 OligoSTORM52,56 is the integration of Oligopaints Fluorescence in situ Hybridization (FISH) probes,57 with the super-resolution technology called Stochastic Optical Reconstruction Microscopy (STORM).58 Sequential OligoSTORM51,53,59 allows imaging of multiple genomic loci, going much beyond the limitations of spectral resolution (Fig. 4A). Even though these chromosomal segments were imaged at the ~ Mb scale, with IMGR, their genomic resolution can improve to 10 kb and better, which is two orders of magnitude higher for some of these segments. Interestingly, the density of the inactive chromatin in this PGP1f (Personal Genome Project, participant 1 fibroblasts) nucleus is higher than that of the active chromatin (Fig. 4B). Drugs may find active, cell-type specific, chromatin target more efficiently. OligoSTORM gene-specific visualizing technologies, or IMGR, can learn whether gene accessibility is influential in successful drug therapy. The efficiency of drug therapy might also be dependent upon the structural variation between homologous chromosomes.51

Fig. 4
figure4

OligoSTORM image of chromatin density. (a) Seven chromosomal segments (chr19:8.68–15.2; hg19) rendered as isosuurface clusters in a PGP1 fibroblast. Each color represents a different chromosomal segment. (b) The seven segments color-coded according to their transcriptional activity and epigenetics marks51 (blue—inactive, red—active). Here, the density of the inactive is higher than the active chromatin

Identifying driver mutations with exascale computing

Proliferation pathways are activated by driver mutations. Their identification involves algorithmic strategies, statistical evaluation and databases.2,19,60,61,62,63,64 Since the methods are statistics-based, the mutations are mostly identified based on their frequencies of occurrence. Recently, however, an increasing number of statistically rare mutations were identified in patients, raising the question of how to identify rare, and weak drivers which are often observed only in certain tissues thus overall infrequent.5,19,62,65,66,67,68,69,70,71,72 K-Ras4BA146T is one example where the mechanism is understood. Different than K-Ras4BG12D, a strong driver that blocks GTP hydrolysis and is expressed in many cancers, including pancreatic and colon, the weaker K-Ras4BA146T which acts by promoting guanine nucleotide exchange factor (GEF)-mediated GDP by GTP exchange, transforms colon but is not sufficiently powerful in transforming pancreatic cancer cells.73 “Latent” mutations, that need an emerging ‘helper’ mutation with additive effects for observable pathological consequences are especially challenging to identify. Mechanistically, whether frequent or rare, mutations that release autoinhibition are often driver mutations;63 clusters of mutations also tend to contain drivers, including rare, and latent.5,19,62 Identification of driver mutations, including weak, rare and latent, in each protein in all proliferation pathways requires immense computational power. These mutations are determined not based on their statistics, but by their ability to shift the protein conformation from an inactive to the active state. Identification of each mutation in each protein necessitates powerful computing to observe whether it executes this shift, expressed by conformational change. Such computing power is forecast to reach to scientific community. Exascale computing systems are capable of a billion (i.e. a quintillion) calculations per second. This scale permits executing such long timescales explicit solvent simulations which are required to capture the redistributions of the ensembles. These indicate the population time of conformations where the mutation switches the protein from the inactive to the active state. Figure 5 illustrates why massive compute time is necessary.

Fig. 5
figure5

A proposed atlas and its associated portal for precision cancer medicine using exascale computing of “big” data

Artificial intelligence, multi-omics data, and network perturbations for therapeutic development

The human genome project accelerated genetic and genomic studies such as The Cancer Genome Atlas (TCGA) to inform precision medicine drug discovery.1 The underlying hypothesis of cancer systems biology is that sub-cellular networks gradually rewire throughout disease initiation, progression, and maintenance, leading to progressive shifts of local and global network properties and systems states,74 including protein-protein interactions and gene regulatory network, all controlling cancer initiation and drug responses (Fig. 5). Genome alterations, amplification, deletion, translocation, and mutations can only be selected for in cells if they encode changes, or perturbations, in the human interactome and systems properties of the affected cells.75,76 Personalized treatment needs to be designed to deal with such perturbations; rather than only with genomic events. Analysis of over 2.5 million nonsynonymous somatic mutations derived from 6,789 tumor exomes across 14 cancer types from TCGA, showed that Individualized Network-based Co-Mutation (INCM)-inferred putative genetic interactions are correlated with patient survival and drug responses in cancer cell lines.75 Drug-target network analysis revealed candidate therapeutic pathways that target tumor vulnerabilities and identified several potential pharmacogenomics biomarkers. A Genome-wide Positioning Systems network (GPSnet) algorithm incorporated individual patient’s DNA and RNA profiles into the human protein-protein interactome network to prioritize targets and repurposed drugs for cancer.14 A GPSnet-predicted and experimentally validated drug, ouabain, revealed potential antitumor activities in lung adenocarcinoma by uniquely targeting a HIF1α/LEO1-mediated cell metabolism pathway.14

The human interactome networks already contributed to understanding tumorigenesis and rapid identification of driver genes in human cancer and drug treatment.1,15,77,78 Cancer networks, and broadly sub-cellular systems, require information and models at multi-dimensional levels, including cells, tissues, organs, and organisms, which are missing in traditional computational approaches. Cancer therapy is moving from drug-centered to patient-centered approach. This requires paradigm shifts along the entire drug development process and multi-omics data integration. The increase in data (including DNA/RNA sequencing data) and the difficulty of data analysis, will also be aided by exascale computing. Advances in AI have been applied to cancer medicine, particularly in large-scale, integrative analyses of multi-omics and biological networks. Still, development and application of AI methods in precision medicine are still in its infancy.

Cancer data come from high-dimensional sources, electronic health care records (imaging, laboratory results, diagnosis codes), genetic testing, among others. An oncologist has to evaluate vast amounts of information, including the patient’s history, family history, genomic sequences, medications, and more, to guide rapid clinical decision. Among the multiple AI techniques, deep neural networks have gained attention in precision cancer medicine, especially for imaging data analysis79,80 and complex biological network integration.16,81 Saltz et al. presented convolutional neural network (CNN) models to analyze 5,200 digital images from 13 cancer types.79 They demonstrated that tumor-infiltrating lymphocyte maps identified by CNN models were correlated with patient survival, tumor types, and immune profiles.79 A one-class logistic regression (OCLR) machine-learning algorithm incorporated transcriptomic and epigenetic profiles from cancer patients for assessing the degree of oncogenic dedifferentiation.82 OCLR identified previously undiscovered biological mechanisms associated with the dedifferentiated oncogenic state quantified by stemness indices, a key measurement of cancer progression.82 Indices predicted by OCLR revealed novel targets and possible targeted therapies by specifically targeting tumor differentiation.82

AI approaches excel at automatically recognizing complex patterns in multi-omics data and providing quantitative assessment of genetic regions, omic layers, and pathways associated with tumorigenesis and precision medicine drug discovery (Fig. 1). deepDTnet, a network-based deep learning methodology was developed for novel target identification and drug repurposing via a heterogeneous drug-gene-disease network embedding 15 types of chemical, genomic, phenotypic, and cellular network profiles.81 DCell, a visible neural network embedded in the hierarchical structure of 2526 subsystems comprising a eukaryotic cell,83 showed consistent results with laboratory observations when evaluated on several million genotypes.83 Its framework may be applied to tumor cells although they are highly complex systems with millions of components and interactions. An AI-based, exascale computing framework that incorporates genome/transcriptome/proteome data, human protein-protein interactome, public drug-target databases (Fig. 1), along with functional validation or patient data validation offers powerful tools for accelerating precision cancer medicine.

Coupled with identification of mutations, enabled by powerful exascale computing at the single protein level, can create a comprehensive and rounded computational framework, whose organization will integrate all components.

Conclusions: stop cell proliferation

The potential of precision medicine to sustain human health has captivated the imagination of the scientists and the public. The National Cancer Institute described precision medicine as “an approach to patient care that allows doctors to select treatments that are most likely to help patients based on a genetic understanding of their disease”. However, exactly how to select has been unclear. The number of possibilities is massive, and the drug combination should fit the patient status. Significant progress has been made since the launch of the precision medicine initiative. However, to date its success has been limited. A major reason is the emergence of drug resistance.

Here we map a new concept: stopping cancer cell proliferation by targeting the proliferation pathway and genes that are likely to be the next drivers in the expected emergence of drug resistance. Current technologies, which can already obtain gene-scale resolution of chromatin increasingly allow forecasting such set of drug resistance targets through identification of proliferation pathways and the accessible genes encoding them. While here we focus on proliferation pathways, for completeness, in the future survival pathways and others critical in drug resistance should also be included.

In the biological sciences, exascale computing in the next decade is expected to be dominated by hybrid modeling, molecular dynamics, free-energy simulations, drug design, and discovery, and modeling the behavior of molecular assemblies and cell actions exploiting imaging at different scales. The concept described here fits well into these capabilities aiming to arrest cell proliferation in drug resistance.

References

  1. 1.

    Cheng, F., Liang, H., Butte, A. J., Eng, C. & Nussinov, R. Personal mutanomes meet modern oncology drug discovery and precision health. Pharm. Rev. 71, 1–19 (2019).

    CAS  PubMed  Article  Google Scholar 

  2. 2.

    Nussinov, R., Jang, H., Tsai, C. J. & Cheng, F. Review: precision medicine and driver mutations: Computational methods, functional assays and conformational principles for interpreting cancer drivers. PLoS Comput. Biol. 15, e1006658 (2019).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  3. 3.

    Manem, V. S. K., Salgado, R., Aftimos, P., Sotiriou, C. & Haibe-Kains, B. Network science in clinical trials: a patient-centered approach. Semin Cancer Biol. 52, 135–150 (2018).

    PubMed  Article  Google Scholar 

  4. 4.

    Dugger, S. A., Platt, A. & Goldstein, D. B. Drug development in the era of precision medicine. Nat. Rev. Drug Discov. 17, 183–196 (2018).

    CAS  PubMed  Article  Google Scholar 

  5. 5.

    Nussinov, R., Jang, H., Tsai, C. J. & Cheng, F. Precision medicine review: rare driver mutations and their biophysical classification. Biophys. Rev. 11, 5–19 (2019).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  6. 6.

    Nussinov, R. & Tsai, C. J. ‘Latent drivers’ expand the cancer mutational landscape. Curr. Opin. Struct. Biol. 32, 25–32 (2015).

    CAS  PubMed  Article  Google Scholar 

  7. 7.

    Carter, H. et al. Cancer-specific high-throughput annotation of somatic mutations: computational prediction of driver missense mutations. Cancer Res. 69, 6660–6667 (2009).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  8. 8.

    Vogelstein, B. et al. Cancer genome landscapes. Science 339, 1546–1558 (2013).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  9. 9.

    Nussinov, R., Jang, H. & Tsai, C. J. The structural basis for cancer treatment decisions. Oncotarget 5, 7285–7302 (2014).

    PubMed  PubMed Central  Article  Google Scholar 

  10. 10.

    Tokheim, C. J., Papadopoulos, N., Kinzler, K. W., Vogelstein, B. & Karchin, R. Evaluating the evaluation of cancer driver genes. Proc. Natl Acad. Sci. USA 113, 14330–14335 (2016).

    CAS  PubMed  Article  Google Scholar 

  11. 11.

    Dimitrakopoulos, C. M. & Beerenwinkel, N. Computational approaches for the identification of cancer genes and pathways. Wiley Interdiscip. Rev. Syst. Biol. Med. 9, e1364 (2017).

  12. 12.

    Nussinov, R., Tsai, C. J. & Jang, H. Are parallel proliferation pathways redundant? Trends Biochem. Sci. 45, 554–563 (2020).

  13. 13.

    Zeng, X. et al. Network-based prediction of drug-target interactions using an arbitrary-order proximity embedded deep forest. Bioinformatics 36, 2805–2812 (2020).

    CAS  PubMed  Article  Google Scholar 

  14. 14.

    Cheng, F. et al. A genome-wide positioning systems network algorithm for in silico drug repurposing. Nat. Commun. 10, 3476 (2019).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  15. 15.

    Huang, Y. et al. A systems pharmacology approach uncovers wogonoside as an angiogenesis inhibitor of triple-negative breast cancer by targeting Hedgehog signaling. Cell Chem. Biol. 26, 1143–1158 (2019). e1146.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  16. 16.

    Zeng, X. et al. deepDR: a network-based deep learning approach to in silico drug repositioning. Bioinformatics 35, 5191–5198 (2019).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  17. 17.

    Cheng, F. In silico oncology drug repositioning and polypharmacology. Methods Mol. Biol. 1878, 243–261 (2019).

    CAS  PubMed  Article  Google Scholar 

  18. 18.

    Cheng, F. et al. Network-based approach to prediction and population-based validation of in silico drug repurposing. Nat. Commun. 9, 2691 (2018).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  19. 19.

    Gao, J. et al. 3D clusters of somatic mutations in cancer reveal numerous rare mutations as functional targets. Genome Med. 9, 4 (2017).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  20. 20.

    Rowland, M. A., Fontana, W. & Deeds, E. J. Crosstalk and competition in signaling networks. Biophys. J. 103, 2389–2398 (2012).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  21. 21.

    Adelaja, A. & Hoffmann, A. Signaling crosstalk mechanisms that may fine-tune pathogen-responsive NFkappaB. Front Immunol. 10, 433 (2019).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  22. 22.

    Prahallad, A. & Bernards, R. Opportunities and challenges provided by crosstalk between signalling pathways in cancer. Oncogene 35, 1073–1079 (2016).

    CAS  PubMed  Article  Google Scholar 

  23. 23.

    Nussinov, R., Tsai, C. J. & Mattos, C. ‘Pathway drug cocktail’: targeting Ras signaling based on structural pathways. Trends Mol. Med. 19, 695–704 (2013).

    PubMed  Article  CAS  Google Scholar 

  24. 24.

    Fernandes, M. S., Sanches, J. M. & Seruca, R. Targeting the PI3K signalling as a therapeutic strategy in colorectal cancer. Adv. Exp. Med Biol. 1110, 35–53 (2018).

    CAS  PubMed  Article  Google Scholar 

  25. 25.

    Duronio, R. J. & Xiong, Y. Signaling pathways that control cell proliferation. Cold Spring Harb. Perspect. Biol. 5, a008904 (2013).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  26. 26.

    Wei, X. et al. The evolutionarily conserved MAPK/Erk signaling promotes ancestral T-cell immunity in fish via c-Myc-mediated glycolysis. J. Biol. Chem. 295, 3000–3016 (2020).

    CAS  PubMed  Article  Google Scholar 

  27. 27.

    Carballo, G. B., Honorato, J. R., de Lopes, G. P. F. & Spohr, T. A highlight on sonic hedgehog pathway. Cell Commun. Signal 16, 11 (2018).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  28. 28.

    Nusse, R. & Clevers, H. Wnt/beta-catenin signaling, disease, and emerging therapeutic modalities. Cell 169, 985–999 (2017).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  29. 29.

    Bryja, V., Cervenka, I. & Cajanek, L. The connections of Wnt pathway components with cell cycle and centrosome: side effects or a hidden logic? Crit. Rev. Biochem Mol. Biol. 52, 614–637 (2017).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  30. 30.

    Davidson, G. The cell cycle and Wnt. Cell Cycle 9, 1667–1668 (2010).

    CAS  PubMed  Article  Google Scholar 

  31. 31.

    Franz, A., Shlyueva, D., Brunner, E., Stark, A. & Basler, K. Probing the canonicity of the Wnt/Wingless signaling pathway. PLoS Genet. 13, e1006700 (2017).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  32. 32.

    Blotta, S. et al. Canonical and noncanonical Hedgehog pathway in the pathogenesis of multiple myeloma. Blood 120, 5002–5013 (2012).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  33. 33.

    Siebel, C. & Lendahl, U. Notch signaling in development, tissue homeostasis, and disease. Physiol. Rev. 97, 1235–1294 (2017).

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  34. 34.

    Von Stetina, J. R., Frawley, L. E., Unhavaithaya, Y. & Orr-Weaver, T. L. Variant cell cycles regulated by Notch signaling control cell size and ensure a functional blood-brain barrier. Development 145, dev157115 (2018).

  35. 35.

    Ludwig, L. S. et al. Transcriptional states and chromatin accessibility underlying human erythropoiesis. Cell Rep. 27, 3228–-3240 (2019). e3227.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  36. 36.

    Sack, L. M. et al. Profound tissue specificity in proliferation control underlies cancer drivers and aneuploidy patterns. Cell 173, 499–514 (2018). e423.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  37. 37.

    Haigis, K. M., Cichowski, K. & Elledge, S. J. Tissue-specificity in cancer: the rule, not the exception. Science 363, 1150–1151 (2019).

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  38. 38.

    Monroe, T. O. et al. YAP partially reprograms chromatin accessibility to directly induce adult cardiogenesis in vivo. Dev. Cell 48, 765–779 (2019). e767.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  39. 39.

    Klemm, S. L., Shipony, Z. & Greenleaf, W. J. Chromatin accessibility and the regulatory epigenome. Nat. Rev. Genet. 20, 207–220 (2019).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  40. 40.

    Wang, H. et al. NOTCH1-RBPJ complexes drive target gene expression through dynamic interactions with superenhancers. Proc. Natl Acad. Sci. USA 111, 705–710 (2014).

    CAS  PubMed  Article  Google Scholar 

  41. 41.

    Di Pierro, M., Zhang, B., Aiden, E. L., Wolynes, P. G. & Onuchic, J. N. Transferable model for chromosome architecture. Proc. Natl Acad. Sci. USA 113, 12168–12173 (2016).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  42. 42.

    Qi, Y. & Zhang, B. Predicting three-dimensional genome organization with chromatin states. PLoS Comput. Biol. 15, e1007024 (2019).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  43. 43.

    Zhou, J. et al. Robust single-cell Hi-C clustering by convolution- and random-walk-based imputation. Proc. Natl Acad. Sci. USA 116, 14011–14018 (2019).

    CAS  PubMed  Article  Google Scholar 

  44. 44.

    Gursoy, G., Xu, Y., Kenter, A. L. & Liang, J. Computational construction of 3D chromatin ensembles and prediction of functional interactions of alpha-globin locus from 5C data. Nucleic Acids Res. 45, 11547–11558 (2017).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  45. 45.

    Baxter, J. S. et al. Capture Hi-C identifies putative target genes at 33 breast cancer risk loci. Nat. Commun. 9, 1028 (2018).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  46. 46.

    Nuebler, J., Fudenberg, G., Imakaev, M., Abdennur, N. & Mirny, L. A. Chromatin organization by an interplay of loop extrusion and compartmental segregation. Proc. Natl Acad. Sci. USA 115, E6697–E6706 (2018).

    CAS  PubMed  Article  Google Scholar 

  47. 47.

    Oluwadare, O., Highsmith, M. & Cheng, J. An overview of methods for reconstructing 3-D chromosome and genome structures from Hi-C data. Biol. Proced. Online 21, 7 (2019).

    PubMed  PubMed Central  Article  Google Scholar 

  48. 48.

    Cheng, R. R. et al. Exploring chromosomal structural heterogeneity across multiple cell lines. Elife 9, e60312 (2020).

  49. 49.

    Contessoto, V. G. et al. The Nucleome Data Bank: web-based resources to simulate and analyze the three-dimensional genome. biorXiv https://doi.org/10.1101/2019.12.20.885145 (2020).

  50. 50.

    Abbas, A. et al. Integrating Hi-C and FISH data for modeling of the 3D organization of chromosomes. Nat. Commun. 10, 2049 (2019).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  51. 51.

    Nir, G. et al. Walking along chromosomes with super-resolution imaging, contact maps, and integrative modeling. PLoS Genet. 14, e1007872 (2018).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  52. 52.

    Boettiger, A. N. et al. Super-resolution imaging reveals distinct chromatin folding for different epigenetic states. Nature 529, 418–422 (2016).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  53. 53.

    Bintu, B. et al. Super-resolution chromatin tracing reveals domains and cooperative interactions in single cells. Science 362, eaau1783 (2018).

  54. 54.

    Bau, D. & Marti-Renom, M. A. Genome structure determination via 3C-based data integration by the Integrative Modeling Platform. Methods 58, 300–306 (2012).

    CAS  PubMed  Article  Google Scholar 

  55. 55.

    Serra, F. et al. Automatic analysis and 3D-modelling of Hi-C data using TADbit reveals structural features of the fly chromatin colors. PLoS Comput Biol. 13, e1005665 (2017).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  56. 56.

    Beliveau, B. J. et al. Single-molecule super-resolution imaging of chromosomes and in situ haplotype visualization using Oligopaint FISH probes. Nat. Commun. 6, 7147 (2015).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  57. 57.

    Beliveau, B. J. et al. Versatile design and synthesis platform for visualizing genomes with Oligopaint FISH probes. Proc. Natl Acad. Sci. USA 109, 21301–21306 (2012).

    CAS  PubMed  Article  Google Scholar 

  58. 58.

    Rust, M. J., Bates, M. & Zhuang, X. Sub-diffraction-limit imaging by stochastic optical reconstruction microscopy (STORM). Nat. Methods 3, 793–795 (2006).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  59. 59.

    Mateo, L. J. et al. Visualizing DNA folding and RNA in embryos at single-cell resolution. Nature 568, 49–54 (2019).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  60. 60.

    Masica, D. L. et al. CRAVAT 4: cancer-related analysis of variants toolkit. Cancer Res 77, e35–e38 (2017).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  61. 61.

    Brown, A. L., Li, M., Goncearenco, A. & Panchenko, A. R. Finding driver mutations in cancer: elucidating the role of background mutational processes. PLoS Comput Biol. 15, e1006981 (2019).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  62. 62.

    Nussinov, R., Tsai, C. J. & Jang, H. Why are some driver mutations rare? Trends Pharm. Sci. 40, 919–929 (2019).

    CAS  PubMed  Article  Google Scholar 

  63. 63.

    Nussinov, R., Tsai, C. J. & Jang, H. Autoinhibition can identify rare driver mutations and advise pharmacology. FASEB J. 34, 16–29 (2020).

    CAS  PubMed  Article  Google Scholar 

  64. 64.

    Chen, H. et al. Comprehensive assessment of computational algorithms in predicting cancer driver mutations. Genome Biol. 21, 43 (2020).

    PubMed  PubMed Central  Article  Google Scholar 

  65. 65.

    Rogers, M. F., Gaunt, T. R. & Campbell, C. CScape-somatic: distinguishing driver and passenger point mutations in the cancer genome. Bioinformatics 36, 3637(2020).

  66. 66.

    Loganathan, S. K. et al. Rare driver mutations in head and neck squamous cell carcinomas converge on NOTCH signaling. Science 367, 1264–1269 (2020).

    CAS  PubMed  Article  Google Scholar 

  67. 67.

    Myers, M. B., McKim, K. L., Wang, Y., Banda, M. & Parsons, B. L. ACB-PCR quantification of low-frequency hotspot cancer-driver mutations. Methods Mol. Biol. 2102, 395–417 (2020).

    CAS  PubMed  Article  Google Scholar 

  68. 68.

    Guo, Y. et al. Recent progress in rare oncogenic drivers and targeted therapy for non-small cell lung cancer. Onco Targets Ther. 12, 10343–10360 (2019).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  69. 69.

    Lee-Six, H. et al. The landscape of somatic mutation in normal colorectal epithelial cells. Nature 574, 532–537 (2019).

    CAS  PubMed  Article  Google Scholar 

  70. 70.

    Harrison, P. T., Vyse, S. & Huang, P. H. Rare epidermal growth factor receptor (EGFR) mutations in non-small cell lung cancer. Semin Cancer Biol. 61, 167–179 (2020).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  71. 71.

    Allen, A. et al. Rare BRAF mutations in pancreatic neuroendocrine tumors may predict response to RAF and MEK inhibition. PLoS ONE 14, e0217399 (2019).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  72. 72.

    Song, J., Peng, W. & Wang, F. A random walk-based method to identify driver genes by integrating the subcellular localization and variation frequency into bipartite graph. BMC Bioinform. 20, 238 (2019).

    Article  Google Scholar 

  73. 73.

    Poulin, E. J. et al. Tissue-specific oncogenic activity of KRAS(A146T). Cancer Discov. 9, 738–755 (2019).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  74. 74.

    Cheng, F. et al. Studying tumorigenesis through network evolution and somatic mutational perturbations in the cancer interactome. Mol. Biol. Evol. 31, 2156–2169 (2014).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  75. 75.

    Liu, C. et al. Individualized genetic network analysis reveals new therapeutic vulnerabilities in 6,700 cancer genomes. PLoS Comput. Biol. 16, e1007701 (2020).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  76. 76.

    Liu, C. et al. Computational network biology: data, models, and applications. Phys. Rep. 846, 1–66 (2020).

    Article  Google Scholar 

  77. 77.

    Cheng, F., Jia, P., Wang, Q. & Zhao, Z. Quantitative network mapping of the human kinome interactome reveals new clues for rational kinase inhibitor discovery and individualized cancer therapy. Oncotarget 5, 3697–3710 (2014).

    PubMed  PubMed Central  Article  Google Scholar 

  78. 78.

    Cheng, F., Kovacs, I. A. & Barabasi, A. L. Network-based prediction of drug combinations. Nat. Commun. 10, 1197 (2019).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  79. 79.

    Saltz, J. et al. Spatial organization and molecular correlation of tumor-infiltrating lymphocytes using deep learning on pathology images. Cell Rep. 23, 181–193 (2018). e187.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  80. 80.

    Hosny, A., Parmar, C., Quackenbush, J., Schwartz, L. H. & Aerts, H. Artificial intelligence in radiology. Nat. Rev. Cancer 18, 500–510 (2018).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  81. 81.

    Zeng, X. et al. Target identification among known drugs by deep learning from heterogeneous networks. Chem. Sci. 11, 1775–1797 (2020).

    CAS  Article  Google Scholar 

  82. 82.

    Malta, T. M. et al. Machine learning identifies stemness features associated with oncogenic dedifferentiation. Cell 173, 338–354 (2018). e315.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  83. 83.

    Ma, J. et al. Using deep learning to model the hierarchical structure and function of a cell. Nat. Methods 15, 290–298 (2018).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

Download references

Acknowledgements

This project has been funded in whole or in part with federal funds from the National Cancer Institute, National Institutes of Health, under contract HHSN261200800001E. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the US Government. This research was supported [in part] by the Intramural Research Program of NIH, National Cancer Institute, Center for Cancer Research.

Author information

Affiliations

Authors

Corresponding author

Correspondence to Ruth Nussinov.

Ethics declarations

Competing interests

The authors declare no competing interests.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Nussinov, R., Jang, H., Nir, G. et al. A new precision medicine initiative at the dawn of exascale computing. Sig Transduct Target Ther 6, 3 (2021). https://doi.org/10.1038/s41392-020-00420-3

Download citation

Further reading

Search

Quick links