A Novel System for Functional Determination of Variants of Uncertain Significance using Deep Convolutional Neural Networks

Zimmerman, Lior; Zelichov, Ori; Aizenmann, Arie; Barbash, Zohar; Vidne, Michael; Tarcic, Gabi

doi:10.1038/s41598-020-61173-1

Download PDF

Article
Open access
Published: 06 March 2020

A Novel System for Functional Determination of Variants of Uncertain Significance using Deep Convolutional Neural Networks

Scientific Reports volume 10, Article number: 4192 (2020) Cite this article

3180 Accesses
5 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Many drugs are developed for commonly occurring, well studied cancer drivers such as vemurafenib for BRAF V600E and erlotinib for EGFR exon 19 mutations. However, most tumors also harbor mutations which have an uncertain role in disease formation, commonly called Variants of Uncertain Significance (VUS), which are not studied or characterized and could play a significant role in drug resistance and relapse. Therefore, the determination of the functional significance of VUS and their response to Molecularly Targeted Agents (MTA) is essential for developing new drugs and predicting response of patients. Here we present a multi-scale deep convolutional neural network (DCNN) architecture combined with an in-vitro functional assay to investigate the functional role of VUS and their response to MTA’s. Our method achieved high accuracy and precision on a hold-out set of examples (0.98 mean AUC for all tested genes) and was used to predict the oncogenicity of 195 VUS in 6 genes. 63 (32%) of the assayed VUS’s were classified as pathway activating, many of them to a similar extent as known driver mutations. Finally, we show that responses of various mutations to FDA approved MTAs are accurately predicted by our platform in a dose dependent manner. Taken together this novel system can uncover the treatable mutational landscape of a drug and be a useful tool in drug development.

Explainable drug sensitivity prediction through cancer pathway enrichment

Article Open access 04 February 2021

Yi-Ching Tang & Assaf Gottlieb

RefDNN: a reference drug based neural network for more accurate prediction of anticancer drug resistance

Article Open access 05 February 2020

Jonghwan Choi, Sanghyun Park & Jaegyoon Ahn

Identification of phenocopies improves prediction of targeted therapy response over DNA mutations alone

Article Open access 17 October 2022

Hamza Bakhtiar, Kyle T. Helzer, … Shuang G. Zhao

Introduction

Precision Medicine, the paradigm proposing that treatment should be tailored to patients according to the individual molecular characteristics of their tumor, is gaining more and more evidence¹. This paradigm is fueled by the rapid progress in sequencing technologies of tumor samples and the inception of several projects such as the Catalogue of Somatic Mutations In Cancer (COSMIC)² and The Cancer Genome Atlas³, all of which aim to identify actionable alterations that could lead to novel therapies, and by the growing number of FDA approved targeted therapies. One of the sparks that ignited the precision medicine paradigm was the identification of the BCR/Abl fusion event as a primary cancer driving mutation in Chronic Myeloid Leukemia⁴ and the subsequent development of imatinib⁵ in 1998 that resulted in dramatic clinical responses and FDA approval in 2001. Since the commercialization of imatinib, dozens of MTAs have been developed for various indications– from kinase inhibitors to monoclonal antibodies⁶. While many malignant genetic alterations have been thoroughly characterized and can be successfully treated⁷, tumor genetic screening frequently discovers rare mutations that have uncertain significance for disease formation and therefore pose a challenge for developing targeted therapies⁸ and establishing reliable eligibility criteria for clinical trials⁹. For example, In the TCGA database, more than 30% of lung adenocarcinoma samples with either nonsense or missense mutations in EGFR have mutations at positions that are not frequently observed¹⁰. Moreover, a similar analysis for all cancers showed that this proportion increases to 45%. The variants found in those analyses however, are both driver and passenger mutations which may not participate in oncogenesis. Indeed, most drugs are clinically validated only against a handful of mutations (for example, erlotinib for EGFR exon 19 mutations in lung cancer¹¹) and occasionally are given off-label in cases where there is some basis of actionability, with limited success¹².

Given the abundance of VUS in these datasets, a strategy that includes accurate characterization of the activity of VUS and their response to MTAs could provide significant benefit to drug development and increase the success rates of clinical trials. However, this requires methods to tackle the challenging task of deciphering the role of VUS in oncogenesis. Although VUS are frequently abundant in tumors, each individual VUS is rare¹³. Therefore, elucidating the role of each of those mutations from sequencing data alone is not feasible using computational tools due to their rarity and therefore lack of sufficient power for statistical analysis. In many genes, most of the oncogenic mutations are located on functionally important positions which are colloquially termed “hotspots”. Although the occurrence of a mutation in hotspots increases the likelihood of it being oncogenic, it is neither a necessary nor sufficient condition for determining its role in oncogenicity and therefore, serve as a sub-optimal predictor. (One such example is BRAF V600M which is located in a functionally important residue, but was found only to be an intermediate pathway activator compared to V600E/K/D¹⁴). Many methods attempting to address the challenge of VUS determination have been developed over the last few years. Those methods can be divided to experimental methods, pure computational algorithms or a combination of both, and can utilize genome sequencing, transcriptome sequencing or proteomic profiling¹⁵.

Experimental methods for variant classification have the advantage of being independent of prior knowledge and can be used to test responses to MTAs. In one example¹⁶, a quantitative proteomics analysis was used to probe the proteomics of Triple Negative Breast Cancer (TNBC) to identify cancer subtypes and biomarkers. The proteomic profile was integrated with exome sequencing data to determine how protein expression is affected by genomic aberrations to initiate tumorigenesis. Another method¹⁰ involves sequencing and analysis of gene expression data, comparing perturbations induced by mutated and wild-type (WT) gene variants to label pathogenic variants in lung adenocarcinoma. The authors showed that resistance to erlotinib treatment which is caused by rare variants is MEK dependent. A recent method¹⁷, developed by Ng and colleagues, used two cell models that are growth factor dependent in addition to functional signaling profiles to probe the effect of more than 1000 genomic aberrations. Remarkably, the authors showed that their method is able to identify weak cancer drivers such as BRAF G466A and PI3KCA M1043I.

Pure computational approaches to classify VUS and identify cancer driving mutations mostly leverage evolutionary, functional and structural data, as well as data from clinical, family history, co-occurrence and other sources^18,19. One of the first algorithms to be developed is PANTHER²⁰. It works by calculating scores derived from position specific evolutionary conservation which is based on multiple sequence alignment of homologous proteins, to predict oncogenesis. While evolutionary conservation may be an informative prior, it may misclassify passenger mutations as drivers since the model does not have any disease context. FATHMM²¹ (Functional Analysis through Hidden Markov Models) aims to solve this issue and indeed achieved higher accuracy and precision on the same data set by incorporating a dataset of mutations found in inherited diseases. However, this biased the results towards mutations that are seen in observable inherited diseases, and the number of such diseases is considerably smaller than the number of somatic mutations. Another notable approach involves a Bayesian framework constructed from a set of publicly available in-silico predictors²² that reports an AUC (area under the ROC curve) of 0.997 when their ensemble was evaluated on data of 1161 missense mutations. NIPS (Network Integrated Predictor of deleterious protein Single amino acid polymorphism) is a structure-based approach that identifies deleterious mutations in tumor samples. It works by integrating data from several sources, including 3D protein-protein interface interactions, evolutionary conservation and network topology. While the study reports AUC of 0.93, it is still limited to cases where protein structures are available. One of the most comprehensive studies⁸ to characterize cancer driver mutations used an ensemble of 26 different algorithms and identified 299 driver genes and >3400 missense driver mutations; 60–85% of mutations were validated experimentally as probable cancer drivers.

DCNNs are a class of machine learning models that have gained a considerable amount of attention recently because of their superior performance in various machine learning tasks^23,24,25. Such models have been successful at various classification, labelling and segmentation tasks in biomedical research. In cancer genetics, deepDriver²⁶ is a notable example of the application of DCNNs for the task of cancer driver genes prediction. This study used a DCNN that was trained on tensors constructed from a combination of mutation-based features (such as the fraction of silent or missense mutations in tumor samples) and gene similarity network. Although the performance of the algorithm is high (AUC of 0.984 and 0.976 on breast cancer and colorectal cancer), predicting the role of individual mutations and VUS in particular, is beyond its scope. Another tool, Mut2Vec²⁷, is an unsupervised approach for cancer driver prediction which is based on the popular Word2Vec²⁸ class of models. In Mut2Vec, the model is trained on a set of cancer profiles to generate an embedding for each mutation, showing that passenger and driver mutations can be distinguished when the embeddings are clustered. Pathology is another field of cancer research that has gone through significant transformations by the recent advances in deep learning²⁹. In one study³⁰, a DCNN that was trained on slide images of sentinel lymph node biopsies was able to classify and label tumors with exceptional accuracy. Another study³¹ developed a DCNN trained on a dataset that integrated histology images and genomic biomarkers to predict time-to-progression outcomes. The authors showed that by integrating genomic data the median concordance index was significantly improved from 0.754 to 0.801. Another field that has successfully utilized deep learning is fluorescent microscopy. Wang et al.³² developed a Generative Adversarial Network for increasing the resolution of diffraction limited fluorescent microscopy images, wide-field images taken with low numerical aperture objectives, and confocal microscopy images which were able to achieve the resolution acquired with a stimulated emission depletion (STED) microscope. Christiansen et al.³³ developed a neural network architecture that is composed of several sub-networks, each accepts as input a scaled version of the image. This network was shown to accurately predict the fluorescent labels of unlabeled microscopy images. The inputs to the network are patches of images taken with differential interference contrast, bright-field or phase contrast microscopy at multiple resolutions; the network outputs a vector of 256 intensity values for each of the pixels of the output image. This study demonstrated the effective use of a multi-scale network to create an artificial fluorescent labeling system that requires minimal experimental preparations and has much less impact on imaged cells.

To the best of our knowledge, this is the first study in which DCNNs are applied, together with a novel dataset of more than 60,000 fluorescent microscopy images to determine the role of VUS in oncogenesis. During training, the network constructs a latent representation of pathway hyperactivation and uses it to quantify the level of oncogenicity of other mutations, which the network has never seen before. Similar solutions were implemented for problems such as predicting personal traits or estimating chronological age both from facial images^34,35, where latent representations for people’s chronological age or for traits such as Intro/Extroversion are constructed during training and are used for trait quantification during inference. We trained our DCNN on a large set of fluorescent microscopy images of live cells transfected with a plasmid containing a fluorescently tagged mutant or WT gene and a fluorescently tagged downstream reporter that translocates into the nucleus upon pathway activation (Fig. 1). We show that our system accurately measures several known mutations as well as VUS activity levels. We further show that although the network has not been trained on images of MTA treated cells, it is able to predict responses of mutations to MTAs. Altogether these results establish a system that can be used for variant annotation and sensitivity to MTAs.

Materials and Methods

System overview

First (Fig. 1a), mutations are collected from different sources and are synthesized using the Q5 site directed mutagenesis kit (New England Biolabs, Cat #E0554S) and verified using Sanger sequencing. Next (Fig. 1b), HeLa cells are seeded in a 384-well Poly-L-lysine coated, transparent bottom plate. Twenty-four hours after seeding, cells are transfected with plasmids carrying the desired mutation and an EGFP tagged reporter. For the MAPK/ERK pathway the ERK2 reporter was used³⁶, for the JAK-STAT pathways the STAT3 reporter was used³⁷ in four repeats using the Fugene HD reagent (Promega, Cat. #E2312). After transfection, cells are incubated for 24-hours to allow adequate expression of the gene constructs. The plates were then fixated using 3% Paraformaldehyde, and a nuclear stain (DAPI) was performed. In the third step, (Fig. 1c) images of the plates are taken using a NIKON Ti eclipse microscope and NIS-elements software. Finally, in the last step, (Fig. 1d) images of wells seeded with cells transfected with selected known oncogenic mutations and wildtype forms of the same genes are inputted for a DCNN for training or inference.

Cell culture

HeLa cell line was obtained from ATCC (Rockville, MD) and were grown under standard condition for 14 passages at most. We used EMEM media supplied with 10% sera a (Gibco, LIFE) L-Glutamine, Sodium Pyruvate and antibiotics. FUGENE (promega) was used for transfection procedure according to manufacturer protocol. For the transfections we used Janus (PE) liquid handler system in 384 well plates, Poly-L-Lysin coated, with non-supplemented media. The raw images were obtained using automated NIKON Ti-Eclipse microscope coupled with an Andor Zyla 4.2 PLUS sCMOS camera and a LED-based SOLA light source.

The dataset

Our dataset is composed of 65,698 multi-channel images of cells from individual wells from 384 well-plates that were transfected with plasmids carrying either mutated or WT KRAS, NRAS, HRAS, BRAF, MEK, cKIT or PDGFRa genes that were transiently expressed. The image data set contains 308 different gene variants (213 images per mutation on average) that were assigned one of 3 levels of certainty (activating, predicted to activate, VUS) regarding their oncogenicity according to the JAX-CKB³⁸ database mutation classification system (See Supplementary Table S2 for a list of the tested mutations and their corresponding JAX-CKB classification). Each image in the dataset is composed of 3 color channels – red (610 nm), green (509 nm) and blue (461 nm). The green color corresponds to a GFP tagged reporter. For the KRAS, NRAS, HRAS, BRAF and MEK genes, the reporter was ERK2 and for cKIT and PDGFRa the reporter was STAT3. The red channel corresponds to mCherry which was used to tag the gene itself. The blue color channel corresponds to a DAPI stain that binds the DNA molecules in the nucleus. Overall, there were 3,543 ± 767 visible DAPI stained cells in the field of view (FOV) of each well on average, out of those 429 ± 159 (≈12%) were positive for mCherry (expressing the tested gene) indicating a successful transfection.

DCNN Architecture design and implementation

We constructed a DCNN that follows a novel multi-scale architecture (Fig. 2). This class of models was demonstrated in several studies to have superior performance in image segmentation, labeling³³ and classification³⁹ (compared to other classes of models such as pixel-wise CNN or combined SVM and RF classifier).

The DCNN was trained using Tensorflow and Keras; data preparation and analysis was done in python, matplotlib and seaborn.

The computation path is composed of 7 main steps. First, to enable the network to operate on different scales, the input images are scaled to 3 different resolutions (Fig. 2a). Then, each image is broken down into patches of 256 × 256 (Fig. 2b), which reduces complexity and regularizes the network. During training, only features that are consistently present in many patches are selected. Subsequently, for each patch, features of increasing complexity are computed. This is done by 5 rounds, each composed of 2D convolutions with a 3 × 3 kernel, and an increasing number of convolution filters at each round (4,8,16,32,64), batch normalization⁴⁰ and a 2 × 2 maximum pooling, outputting a 8 × 8 × 64 feature matrix (Fig. 2c). Next, we reduce the dimension of the feature matrix by applying global average pooling, an operation that averages features across the spatial domain (Fig. 2d), outputting a vector |v| = 64 per patch. Finally, all vectors representing all patches at all scales for one image are concatenated (Fig. 2e) and are inputted to a fully connected layer to cross correlate features across all patches and scales (Fig. 2f). The last fully connected layer is connected to an output neuron with a sigmoid activation function (Fig. 2g) that outputs values in (0,1), where 0 corresponds to images containing cells transfected with WT genes, and 1 corresponds to images containing cells transfected with a pathway activating oncogenic mutant.

Results

Training phase

Out of the data set of 7 genes, for which we have a total of 301 mutated variants and wildtype forms of each (see Materials and Methods for a description of the data set) we selected 8 mutated variants to be used as positive examples of pathway activation (one for each gene, except for cKIT for which we used 2 different mutations) and the wildtype form as negative examples for pathway activation. This subset was partitioned to training (60% of the images), validation (20% of the images), and test sets (20% of the images) with an additional stratification by well plate (all images from a plate belonged to the same set), to be able to assess the model generalization capabilities across experiments. (Summary of the data used for training, validation and test is in Table S1). An extensive hyperparameter tuning was performed and converged on the following hyperparameters - batch size of 32 images, Adam optimization method⁴¹ with a learning rate of 10⁻⁴. Following the training phase, we assessed the sensitivity (True Positive Rate) and specificity (True Negative Rate) of the network on the test set and found that it has high sensitivity and specificity across all genes and pathway reporters, with mean AUC of 0.98 (Fig. 3c) and average of 95% accuracy (Fig. 3a).

Next, we tested whether there was a difference in the pathway activation patterns induced by each gene by training the DCNN to predict with which gene the cells in the image were transfected. For that purpose, we added to the DCNN from the previous step an additional output layer with 7 neurons (the number of different genes in the data set), with a softmax activation function - \(Softmax(x)=\frac{\exp ({z}_{i})}{{\sum }_{j}\exp ({z}_{j})}\) which computes a probability distribution over multiple output neurons, and categorical cross entropy as a loss function: \(-\frac{1}{N}\sum _{i\in I}\sum _{g\in G}\,\log ({P}_{model}({y}_{i}\in {G}_{g}))\) (where G corresponds to the set of genes, and I the image dataset). Finally, we created an additional ground truth vector with one-hot encoding such that: y^g_i = 1 _g and the images were not changed. Following the training phase, we assessed the accuracy and specificity of the network on a hold-out set. Trained to identify the unique phenological properties induced by each gene, the network achieved a mean of 66% accuracy (Fig. 3c) where most cases of confusion occurred between the 3 RAS homologs, and to some extent BRAF. Similarly, cKIT and PDGFRA were also commonly confused. We hypothesize that this is because they were assayed with a different reporter gene (GFP-STAT3) than the rest of the genes in the study (N/H/K-RAS, MEK, BRAF were all assayed with GFP-ERK2 as a reporter), and that the reporter genes themselves contain intrinsic properties that differ between each other.

VUS Determination

We used the trained DCNN to annotate mutations that have not been functionally profiled (VUS), as well as known oncogenic mutations, all of which were not encountered by the network during training, test or validation. For that purpose, we used a data set of 301 mutated variants that were collected from the cBioPortal⁴² database. Each gene variant was given one of 3 labels that corresponds to the level of evidence regarding their involvement in tumorigenesis, according to the JAX Clinical Knowledgebase (JAX-CKB)³⁸: activating- peer-reviewed published literature demonstrating functional evidence that the gene alteration present results in increased intrinsic activity of the protein; predicted to be oncogenic- the specific type of gene alteration as well as its location is similar to other alterations in the same gene that have been functionally characterized as a gain of function within peer-reviewed published literature; and unknown- there is no peer-reviewed published literature demonstrating the gene alteration present affects the intrinsic activity of the protein.

We synthesized plasmids carrying each of the gene variants from the data set and the same reporter that was used for the gene during training, transfected and imaged them as was described above. The resulting images were inputted to the trained DCNN and the level of predicted pathway activation was determined. Table 1 summarizes the predictions for each label and each gene that was tested. A mutation was determined to be active if its mean prediction value, calculated over all fluorescent microscopy images was above the sigmoid middle point of 0.5. Out of the 301 tested mutations in all 7 genes, JAX-CKB classified 81 as activating, 24 as predicted to activate, and 196 as VUS. The “activating” class is the only class that can be used to validate the accuracy of our platform, since it contains only experimentally validated mutations. Remarkably, our system was able to correctly predict the pathway activation status of 75/81 (92.6%) of those experimentally validated, activating mutations (Table 1). Additionally20/24 (83.3%) of the variants labeled as “predicted to activate” (Table S2) And 63/196 (32.1%, Table 1) of the mutations that were labeled as VUS are predicted by our system to be pathway activating, hinting to their potential oncogenicity.

Table 1 Summary of CNN output.

Full size table

As an example, the output of the network for each of the surveyed variants of cKIT, 110 in total, is presented in Fig. 4. As can be seen, most cKIT mutations tested are concentrated in the juxtamembrane and protein kinase domains, resembling the relative distribution of mutations in different cancer types. Several cKIT VUS were predicted by our system to lead to pathway activation and could be novel cancer drivers. For example, cKIT Y553S and P551L are both predicted by our system to be active and lie within the juxtamembrane domain. P551L has been identified in sequencing studies⁴³ but has not been biochemically characterized, while Y553S has not been functionally analyzed but has been associated with imatinib resistance⁴⁴. Similarly, cKIT V654A which lies in the kinase domain has conflicting evidence regarding its pathway activation capabilities. It was found to lead to increased proliferation of cultured cells but not to factor independence and has been described as a secondary drug resistance mutation⁴⁵.

In the case of KRAS, 51 variants were tested most of which are concentrated in the phosphate binding loop, base binding loops and switches I,II, with G13,G12 and Q61 being the positions with the highest incidence of activating mutations (Supplementary Fig. S1). The high incidence of active VUS in KRAS (60%) compared to HRAS (9.5%) and NRAS (20%) stems from the large number of G12-G13 deletion/insertion variants that were tested only for KRAS and were predicted to be active. The VUS tested range from small deletions such as KRAS V152del, to missense mutations such as L23R, N116H and indels such as G12_G13_Del_Ins_DC, all with little to no evidence regarding their oncogenic activity. Interestingly, the KRAS mutation N116H has been known to increase the nucleotide exchange rate of KRAS⁴⁶ and therefore activate MAPK signaling. Similarly, KRAS L23R was predicted by our system to be pathway activating. Although it was identified in several sequencing studies of cancer patients^47,48, it does not lie within any known functional domains of KRAS and has not been biochemically characterized. KRAS V152del is another rare mutation which was predicted by our DCNN to be pathway activating, and although a different mutation - V152G was identified in a recent sequencing study⁴⁹ as active, the V152del variant lacks any evidence regarding its activity.

Of the six genes analyzed, BRAF had lowest concordance between literature and our results (20% of mutations predicted to be active correctly analyzed, Supplementary Table S2). We therefore analyzed these false negatives and found that the only false negative in the known activating mutation class (G466V) and 3 of the 4 false negative in the predicted to activate class (BRAF D594N, G466R, G596R) were classified recently as a distinct class of BRAF mutations (BRAF class III) that differ significantly from the V600E\K\D mutations and were found to possess basal kinase activity that is lower when compared to WT BRAF, or lack kinase activity entirely^50,51. Moreover, biochemical studies predict that this class of mutants would require upstream activation of MAPK for pathway activity and tumorigenesis⁵². Therefore, this class of mutations may have been missed since our system was trained to identify only mutations that directly lead to pathway activation and do not depend on other mutations to induce tumorigenesis. BRAF V600M is the 4^th false negative class of “predicted to be activating”. It lies within the activation segment of the kinase domain of BRAF, at the same position of other highly activating mutations such as BRAF V600E\K. However it was shown to cause only intermediate increase in kinase activity in cell culture⁵². The mean prediction value determined by our network for V600M (0.47) is in concordance with the intermediate kinase activity reported by the literature, providing an additional evidence for the accuracy of our platform.

Three additional false negatives that are known pathway activators are RAS mutations: KRAS T58I, NRAS G60E, HRAS G13S – all lie in the GTP binding domain of each of the RAS proteins and are characterized as MAPK pathway activating and proliferating inducing mutations^53,54,55. Although these mutations are below the cutoff determined for pathway activation (0.5), all have a mean pathway activation score significantly higher than their wildtype variants (0.33, 0.38, 0.18 respectively, student’s t-test p-value < 0.002 for all variants compared to wildtype mean activation scores across all well images). The last 2 false negatives are cKIT variants S628N and V530I. Both were documented as pathway activating; S628N lies within the protein kinase domain (exon 13) of the protein and results in constitutive Kit phosphorylation and activation of downstream signaling, and is transforming in cell culture⁵⁶. V530I lies within the transmembrane domain of the Kit protein and confers a gain of function on the protein, as indicated by constitutive phosphorylation of cKIT and activation of signaling in cell culture⁵⁷.

Concluding, our method shows remarkable ability in identifying pathway activating mutations, with a success rate of 92.6% over the class of known pathway activating mutations and 83.3% over mutations which are predicted to be activating based on similar or proximal alterations. The success rates increase to 93.8% and 95.4% respectively when class III BRAF mutations are excluded. Finally, almost a third (32.3%) of the alterations that were labeled as unknown were found to be active, a finding that demonstrates the importance of functionally testing all identified mutations.

Prediction of drug responses

One of the main features of our platform is the ability to test drug responses on different gene variants and pathways. To test the accuracy of this capability, we tested the response of 3 different cKIT alterations (W557R, W557_558 Del, D816V) and cKIT WT to sorafenib or dasatinib, FDA approved drugs and potent cKIT inhibitors^58,59. All the cKIT alterations are annotated by JAX-CKB as activating and identified as activating in our system (Fig. 4). Each of the 3 cKIT gene variants as well as the WT form were expressed, and the cells were incubated for 18 hours with either sorafenib or dasatinib in increasing doses. We inputted the images to the trained DCNN and for each drug concentration recorded the mean network output across all images for each mutation in each concentration (Fig. 5). As can be seen, our system clearly identified a dose dependent decrease in pathway activation level for each of the cKIT alterations and for both Dasatinib and Sorafenib, with Dasatinib showing significant drop in predicted pathway activation levels in lower concentrations than Sorafenib, which is consistent with previously published literature⁶⁰.

One mutation, D816V, (Fig. 5a) was predicted by our network to be more resistant to dasatinib than the rest of the mutations, as it shows a decrease in activity only at higher concentrations (100nM–1uM). This is consistent with previous studies showing decreased sensitivity of D816V to dasatinib⁶¹. For cKIT W557_558Del (Fig. 5b) and W557R (Fig. 5c) the mean network output reaches values lower than 0.2 at 1–10 nM for dasatinib and 100 nM for Sorafenib, while the output for the cKIT WT (Fig. 5d) remains close to 0 for both drugs at these concentrations. Interestingly, we also observed an increase in predicted pathway activity level at concentrations higher than 10 nM in the dasatinib treated cells, which was not apparent for sorafenib and only apparent to a lesser degree for D816V. We hypothesize that this outcome results from off-target effects at high concentrations of the drug. Such effects have been documented previously for dasatinib^62,63.

Discussion

We present here a novel method for determining the functional role of VUS and their response to targeted therapies, that can be used as a tool to guide the development of targeted therapies. Our method synergizes an experimental functional assay with a computational framework composed of a DCNN, that was trained on several thousands of fluorescent microscopy images of cells transfected with mutated or WT genes (BRAF, cKIT, HRAS, KRAS, NRAS, PDGFRa) to identify the activity of mutations annotated as VUS. The method involves fluorescent tagging of 3 key components: the mutated protein itself, a downstream signaling protein fused to GFP (ERK2 or STAT3) and nucleic DNA (DAPI staining). The novel network architecture we presented here has been carefully selected after considering many existing state of the art alternative architectures in the field of image recognition, such as ResNet⁶⁴ and Inception⁶⁵. Those networks are composed of millions of free parameters and are usually trained on large and diverse image datasets⁶⁶ containing more than a million images. Using this class of network architectures on a smaller dataset such as ours, most commonly leads to overfitting. There are several advantages to the architecture presented in this study: First, the number of free parameters is considerably smaller, a few hundred-thousands of parameters (compared to millions in the above-mentioned architectures). Second, similarly to ResNet, our architecture enables the cross-correlation of low-level and high-level features, learned from different resolutions of the same image. Third, our network avoids the vanishing gradient phenomenon that frequently occurs in deep neural networks by adopting an architecture that is composed of several shallow networks. A similar interpretation has been described by Veit and colleagues for residual neural networks⁶⁷.

Most state-of-the-art architectures mentioned above are frequently used to extract features from images of other domains (such as skin lesions⁶⁸) for subsequent learning tasks in an approach called transfer learning⁶⁹. The performance of a classifier resulted from transfer learning is directly related to the similarity between the domain of the source dataset, on which the network was originally trained, and the target dataset. In our case however, the degree of similarity between the source dataset (e.g. ImageNet) and the target dataset (fluorescently labeled HeLa cells) is considerably low and indeed, transfer learning approaches resulted in classifiers with degraded performance (data not shown).

We have shown that our system not only recognizes whether a gene variant leads to pathway activation but is also able to recognize in most cases the type of gene that is expressed in the cells and the type of reporter used in the assay. Features learned by deep neural networks are notoriously difficult to interpret, as they are often immensely complex tensors composed of many dimensions. Recent studies⁷⁰ in the field of explainable deep neural networks should bridge this gap and may help explain the unique changes that each variant and reporter induces on cells.

A frequently observed phenotype in tumorigenesis is pathway hyper-activation which results in a range of phenotypes⁷¹. Compared to methods which are purely computational and can identify pathway activating mutations using various in-silico approaches, our method has the advantage of being able to predict a dose dependent pathway activation level change, which is currently out of scope for purely computational approaches.

Indeed, we show that our system is currently only able to identify cancer drivers and annotate VUS that lead to pathway hyperactivation. However, the challenge of VUS determination remains, as there are modes of operations that were beyond the scope of this study. For example, BRAF class III mutations such as BRAF D594G, G466V, G596R, G466E mutations which were predicted to be inactive by our system but are known cancer drivers. These alterations constitute an entirely different class of BRAF mutations that lead to pathway activation using a different mechanism than V600 mutations⁵² that served as our training set. This class of mutations work in tandem with other aberrations to generate a malignant phenotype. Specifically, they require a dysregulated RAS in order to hyperactivate the ERK pathway¹⁴. Determining the role of such mutations without the context of its co-occurring mutations may lead to false predictions. Another aspect that should be addressed by future studies, is tissue specificity of some oncogenic gene variants, some genetic aberrations function as cancer drivers only in specific tissues, such as the loss of function of BRCA which can be found only in breast and cervical cancers⁷².

The biological mechanism that was addressed in this study included only reporters whose main property is that they shuttle between the cytoplasm and nucleus. Both ERK2 and STAT3 are translocated into the nucleus following their phosphorylation. However, there are many other biological mechanisms that have been correlated to oncogenic mutations, for example changes in expression levels or translocation of proteins between different compartments of the cell. It has long been known that HER2 overexpression, which occurs in 15–30% of breast cancers and 10–30% of gastric/gastroesophageal cancers is a cancer driving alteration⁷³. Other than HER2, there is a substantial amount of evidence that overexpression of MYC, MYCN, ER and EGFR is also involved in disease⁷⁴. Changes in subcellular localization have also been characterized as a cancer phenotype. In one example, MUC1, a membrane bound protein which is expressed at the apical borders of glandular epithelial cells, is overexpressed in the nucleus as well as the entire cell surface, cytoplasm and mitochondria. Translocation of MUC1 to the mitochondria leads to apoptosis suppression by attenuating caspase-3 activation as well as the release of cytochrome-c⁷⁵. Dysregulation of cell death signals, some of which are mediated by the BCL-2 protein family⁷⁶, may also serve as reporters in similar circumstances. For example⁷⁷, BCL-2-related ovarian killer (BOK) was found to be significantly depleted in colorectal tumors, and its levels also accurately predicted clinical outcome.

We have also demonstrated the capability of our system to predict drug responses in a dose dependent manner. This ability, coupled with the annotation of VUS activity can be leveraged for several clinically relevant uses, for example, optimizing the MTA clinical development process and improving patient’s treatment recommendations. In the case of development of novel MTA’s, it has been shown that the patient mutational landscape varies significantly⁷⁸. Moreover, current drug development processes usually consider only a handful of highly frequent mutations as a model system. However, it has been shown that the same inhibitor can have significantly different efficacies on different mutations in the same gene, with some prominent examples in BRAF⁷⁹ and cKIT⁸⁰. We therefore suggest that considering these large numbers of mutations and their differential vulnerabilities to inhibitors early in the MTA development process will allow a much higher rate of success. The second aspect in which this system can be utilized is optimizing treatments for cancer patients. It has recently been shown that more comprehensive interpretation of genetic profiles can both improve the matching of patient to available treatments⁸¹ as well as new drug combinations⁸². The annotation of the many VUS found in patient genomic profiles will increase the matching score and therefore improve patient outcomes.

In summary, we have presented in this study a system that can determine the level of pathway activation of a wide range of gene variants and predict the response of those to different MTA’s. Future work will need to focus on expanding the capabilities of this model, for example, by training on more genes, mutations and reporters, increasing the robustness of the network and using different types of reporters.

References

Martini, M., Vecchione, L., Siena, S., Tejpar, S. & Bardelli, A. Targeted therapies: how personal should we go? Nat. Rev. Clin. Oncol. 9, 87–97 (2011).
Article CAS PubMed Google Scholar
Forbes, S. A. et al. The Catalogue of Somatic Mutations in Cancer (COSMIC). In Current Protocols in Human Genetics vol. 91 355–358 (John Wiley & Sons, Inc., 2008).
Cancer Genome Atlas Research Network. Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature 455, 1061–8 (2008).
Article CAS Google Scholar
Daley, G. Q., Van Etten, R. A. & Baltimore, D. Induction of chronic myelogenous leukemia in mice by the P210bcr/abl gene of the Philadelphia chromosome. Science (80-.) 247, 824–830 (1990).
Article ADS CAS Google Scholar
le Coutre, P. et al. In vivo eradication of human BCR/ABL-positive leukemia cells with an ABL kinase inhibitor. J. Natl. Cancer Inst. 91, 163–8 (1999).
Article PubMed Google Scholar
Giovannetti, E. & Rodriguez, J. A. Targeted therapies in cancer: where are we going? Cancer Drug Resist. 1, 82–86 (2018).
Article Google Scholar
Afghahi, A. & Sledge, G. W. Targeted Therapy for Cancer in the Genomic Era. Cancer J. 21, 294–8.
Bailey, M. H. et al. Comprehensive Characterization of Cancer Driver Genes and Mutations. Cell 173, 371–385.e18 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kim, E. S., Atlas, J., Ison, G. & Ersek, J. L. Transforming Clinical Trial Eligibility Criteria to Reflect Practical Clinical Application. Am. Soc. Clin. Oncol. Educ. book. Am. Soc. Clin. Oncol. Annu. Meet. 35, 83–90 (2016).
Article CAS Google Scholar
Berger, A. H. et al. High-throughput Phenotyping of Lung Cancer Somatic Mutations. Cancer Cell 30, 214–228 (2015).
Article CAS Google Scholar
Riely, G. J. Clinical Course of Patients with Non-Small Cell Lung Cancer and Epidermal Growth Factor Receptor Exon 19 and Exon 21 Mutations Treated with Gefitinib or Erlotinib. Clin. Cancer Res. 12, 839–844 (2006).
Article CAS PubMed Google Scholar
Levêque, D. Off-label use of targeted therapies in oncology. World J. Clin. Oncol. 7, 253–7 (2016).
Article PubMed PubMed Central Google Scholar
Ng, P. K.-S. et al. Systematic Functional Annotation of Somatic Mutations in Cancer. Cancer Cell 33, 450–462.e10 (2018).
Article CAS PubMed PubMed Central Google Scholar
Yao, Z. et al. Tumours with class 3 BRAF mutants are sensitive to the inhibition of activated RAS. Nature 548, 234–238 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Carels, N., Spinassé, L. B., Tilli, T. M. & Tuszynski, J. A. Toward precision medicine of breast cancer. Theor. Biol. Med. Model. 13, 7 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lawrence, R. T. et al. The Proteomic Landscape of Triple-Negative Breast Cancer. Cell Rep. 11, 630–644 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ng, P. K.-S. S. et al. Systematic Functional Annotation of Somatic Mutations in Cancer. Cancer Cell 33, 450–462.e10 (2018).
Article CAS PubMed PubMed Central Google Scholar
Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 17, 405–24 (2015).
Article PubMed PubMed Central Google Scholar
Maxwell, K. N. et al. Evaluation of ACMG-Guideline-Based Variant Classification of Cancer Susceptibility and Non-Cancer-Associated Genes in Families Affected by Breast Cancer. Am. J. Hum. Genet. 98, 801–817 (2016).
Article CAS PubMed PubMed Central Google Scholar
Thomas, P. D. et al. PANTHER: a library of protein families and subfamilies indexed by function. Genome Res. 13, 2129–41 (2003).
Article CAS PubMed PubMed Central Google Scholar
Shihab, H. A. et al. Predicting the functional, molecular, and phenotypic consequences of amino acid substitutions using hidden Markov models. Hum. Mutat. 34, 57–65 (2013).
Article CAS PubMed Google Scholar
Qian, D. et al. A Bayesian framework for efficient and accurate variant prediction. PLoS One 13, e0203553 (2018).
Article CAS PubMed PubMed Central Google Scholar
Krizhevsky, A., Sutskever, I. & Hinton, G. E. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems (2012).
Hinton, G. et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups. IEEE Signal Process. Mag. 29, 82–97 (2012).
Article ADS Google Scholar
Mnih, V. et al. Playing Atari with Deep Reinforcement Learning (2013).
Luo, P., Ding, Y., Lei, X. & Wu, F.-X. deepDriver: Predicting Cancer Driver Genes Based on Somatic Mutations Using Deep Convolutional Neural Networks. Front. Genet. 10, 13 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kim, S., Lee, H., Kim, K. & Kang, J. Mut2Vec: distributed representation of cancerous mutations. BMC Med. Genomics 11, 33 (2018).
Article PubMed PubMed Central Google Scholar
Mikolov, T., Chen, K., Corrado, G. & Dean, J. Efficient estimation of word representations in vector space. 1st Int. Conf. Learn. Represent. ICLR 2013 - Work. Track Proc. 1–12 (2013).
Levine, A. B. et al. Rise of the Machines: Advances in Deep Learning for Cancer Diagnosis. Trends in Cancer 5, 157–169 (2019).
Article PubMed Google Scholar
Wang, D., Khosla, A., Gargeya, R., Irshad, H. & Beck, A. H. Deep Learning for Identifying Metastatic Breast Cancer. 1–6 (2016).
Mobadersany, P. et al. Predicting cancer outcomes from histology and genomics using convolutional networks. Proc. Natl. Acad. Sci. 115, E2970–E2979 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wang, H. et al. Deep learning enables cross-modality super-resolution in fluorescence microscopy. Nat. Methods 16, 103–110 (2019).
Article CAS PubMed Google Scholar
Christiansen, E. M. Silico Labeling: Predicting Fluorescent Labels in Unlabeled Images Resource In Silico Labeling: Predicting Fluorescent Labels in Unlabeled Images. Cell 1–12. 10.1016
Zhang, T. et al. Physiognomy: Personality traits prediction by learning. Int. J. Autom. Comput. 14, 386–395 (2017).
Article Google Scholar
Qawaqneh, Z., Mallouh, A. A. & Barkana, B. D. Deep Convolutional Neural Network for Age Estimation based on VGG-Face Model (2017).
Cohen-Saidon, C., Cohen, A. A., Sigal, A., Liron, Y. & Alon, U. Dynamics and Variability of ERK2 Response to EGF in Individual Living Cells. Mol. Cell 36, 885–893 (2009).
Article CAS PubMed Google Scholar
Herrmann, A. et al. Nucleocytoplasmic shuttling of persistently activated STAT3. J. Cell Sci. 120, 3249–3261 (2007).
Article CAS PubMed Google Scholar
Patterson, S. E. et al. The clinical trial landscape in oncology and connectivity of somatic mutational profiles to targeted therapies. Hum. Genomics 10, 4 (2016).
Article CAS PubMed PubMed Central Google Scholar
Hu, J., Chen, Z., Yang, M., Zhang, R. & Cui, Y. A multiscale fusion convolutional neural network for plant leaf recognition. IEEE Signal Process. Lett 25, 853–857.
Ioffe, S. & Szegedy, C. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift (2015).
Kingma, D. P. & Ba, J. Adam: A Method for Stochastic Optimization (2014).
Gao, J. et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal 6, 11 (2013).
Article CAS Google Scholar
Kong, Y. et al. Large-scale analysis of KIT aberrations in Chinese patients with melanoma. Clin. Cancer Res. 17, 1684–91 (2011).
Article CAS PubMed Google Scholar
Lim, K.-H. et al. Molecular analysis of secondary kinase mutations in imatinib-resistant gastrointestinal stromal tumors. Med. Oncol. 25, 207–13 (2008).
Article CAS PubMed Google Scholar
Roberts, K. G. et al. Resistance to c-KIT kinase inhibitors conferred by V654A mutation. Mol. Cancer Ther. 6, 1159–66 (2007).
Article CAS PubMed Google Scholar
Patel, G., MacDonald, M. J., Khosravi-Far, R., Hisaka, M. M. & Der, C. J. Alternate mechanisms of ras activation are complementary and favor and formation of ras-GTP. Oncogene 7, 283–8 (1992).
CAS PubMed Google Scholar
Zhang, J. et al. Key pathways are frequently mutated in high-risk childhood acute lymphoblastic leukemia: a report from the Children’s Oncology Group. Blood 118, 3080–3087 (2011).
Article CAS PubMed PubMed Central Google Scholar
Jones, L. et al. A review of new agents evaluated against pediatric acute lymphoblastic leukemia by the Pediatric Preclinical Testing Program. Leukemia 30, 2133–2141 (2016).
Article CAS PubMed Google Scholar
Li, J. et al. Multiregional Sequencing Reveals Genomic Alterations and Clonal Dynamics in Primary Malignant Melanoma of the Esophagus. Cancer Res. 78, 338–347 (2018).
Article CAS PubMed Google Scholar
Dankner, M., Rose, A. A. N., Rajkumar, S., Siegel, P. M. & Watson, I. R. Classifying BRAF alterations in cancer: New rational therapeutic strategies for actionable mutations. Oncogene 37, 3183–3199 (2018).
Article CAS PubMed Google Scholar
Wan, P. T. C. et al. Mechanism of activation of the RAF-ERK signaling pathway by oncogenic mutations of B-RAF. Cell 116, 855–67 (2004).
Article CAS PubMed Google Scholar
Yao, Z. et al. Tumours with class 3 BRAF mutants are sensitive to the inhibition of activated RAS. Nature 548, 234–238 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Schubbert, S. et al. Germline KRAS mutations cause Noonan syndrome. Nat. Genet. 38, 331–6 (2006).
Article CAS PubMed Google Scholar
Chung, H. H., Benson, D. R. & Schultz, P. G. Probing the structure and mechanism of Ras protein with an expanded genetic code. Science 259, 806–9 (1993).
Article ADS CAS PubMed Google Scholar
Tyner, J. W. et al. High-throughput sequencing screen reveals novel, transforming RAS mutations in myeloid leukemia patients. Blood 113, 1749–55 (2009).
Article CAS PubMed PubMed Central Google Scholar
Vita, M. et al. Characterization of S628N: a novel KIT mutation found in a metastatic melanoma. JAMA dermatology 150, 1345–9 (2014).
Article PubMed Google Scholar
Cammenga, J. et al. Extracellular KIT receptor mutants, commonly found in core binding factor AML, are constitutively active and respond to imatinib mesylate. Blood 106, 3958–61 (2005).
Article CAS PubMed Google Scholar
Araujo, J. & Logothetis, C. Dasatinib: a potent SRC inhibitor in clinical development for the treatment of solid tumors. Cancer Treat. Rev. 36, 492–500 (2010).
Article CAS PubMed PubMed Central Google Scholar
Abbaspour Babaei, M., Kamalidehghan, B., Saleem, M., Huri, H. Z. & Ahmadipour, F. Receptor tyrosine kinase (c-Kit) inhibitors: a potential therapeutic target in cancer cells. Drug Des Devel Ther 10, 2443–2459 (2016).
Article PubMed PubMed Central Google Scholar
Galanis, A. & Levis, M. Inhibition of c-Kit by tyrosine kinase inhibitors. Haematologica 100, e77–9 (2015).
Article CAS PubMed PubMed Central Google Scholar
Shah, N. P. et al. Dasatinib (BMS-354825) inhibits KIT D816V, an imatinib-resistant activating mutation that triggers neoplastic growth in most patients with systemic mastocytosis. Blood 108, 286–291 (2006).
Article CAS PubMed Google Scholar
Packer, L. M. et al. Nilotinib and MEK Inhibitors Induce Synthetic Lethality through Paradoxical Activation of RAF in Drug-Resistant Chronic Myeloid Leukemia. Cancer Cell 20, 715–727 (2011).
Article CAS PubMed PubMed Central Google Scholar
Fang, Y. et al. MEK/ERK Dependent Activation of STAT1 Mediates Dasatinib-Induced Differentiation of Acute Myeloid Leukemia. PLoS One 8, e66915 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. 2016-Decem, 770–778 (2016).
Google Scholar
Längkvist, M., Karlsson, L. & Loutfi, A. Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning. Pattern Recognit. Lett. 42, 11–24 (2014).
Article Google Scholar
Russakovsky, O. et al. ImageNet Large Scale Visual Recognition Challenge. Int. J. Comput. Vis. 115, 211–252 (2015).
Article MathSciNet Google Scholar
Veit, A., Wilber, M. & Belongie, S. Residual networks behave like ensembles of relatively shallow networks. Adv. Neural Inf. Process. Syst. 550–558 (2016).
Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 115–118 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Razavian, A. S., Azizpour, H., Sullivan, J. & Carlsson, S. CNN features off-the-shelf: An astounding baseline for recognition. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. Work. 512–519, https://doi.org/10.1109/CVPRW.2014.131 (2014).
Lundberg, S. & Lee, S.-I. A Unified Approach to Interpreting Model Predictions. 16, 426–430 (2017).
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: The next generation. Cell 144, 646–674 (2011).
Article CAS PubMed Google Scholar
Zhang, X. et al. Attenuation of RNA polymerase II pausing mitigates BRCA1-associated R-loop accumulation and tumorigenesis. Nat. Commun. 8, 15908 (2017).
Article ADS PubMed PubMed Central Google Scholar
Iqbal, N. & Iqbal, N. Human Epidermal Growth Factor Receptor 2 (HER2) in Cancers: Overexpression and Therapeutic Implications. Mol. Biol. Int. 2014, 1–9 (2014).
Article CAS Google Scholar
Santarius, T., Shipley, J., Brewer, D., Stratton, M. R. & Cooper, C. S. A census of amplified and overexpressed human cancer genes. Nat. Rev. Cancer 10, 59–64 (2010).
Article CAS PubMed Google Scholar
Ren, J. et al. Human MUC1 carcinoma-associated protein confers resistance to genotoxic anticancer agents. Cancer Cell 5, 163–75 (2004).
Article CAS PubMed PubMed Central Google Scholar
Campbell, K. J. & Tait, S. W. G. Targeting BCL-2 regulated apoptosis in cancer. Open Biol. 8 (2018).
Carberry, S. et al. The BAX/BAK-like protein BOK is a prognostic marker in colorectal cancer. Cell Death Dis. 9, 125 (2018).
Article CAS PubMed PubMed Central Google Scholar
Telenti, A. et al. Deep Sequencing of 10,000 Human Genomes. Proc. Natl. Acad. Sci. 061663, https://doi.org/10.1101/061663 (2016).
Agianian, B. & Gavathiotis, E. Current Insights of BRAF Inhibitors in Cancer. J. Med. Chem. 61, 5775–5793 (2018).
Article CAS PubMed Google Scholar
Serrano, C. et al. Complementary activity of tyrosine kinase inhibitors against secondary kit mutations in imatinib-resistant gastrointestinal stromal tumours. Br. J. Cancer 120, 612–620 (2019).
Article CAS PubMed PubMed Central Google Scholar
Rodon, J., Dienstmann, R., Serra, V. & Tabernero, J. Development of PI3K inhibitors: lessons learned from early clinical trials. Nat Rev Clin Oncol 10, 143–153 (2013).
Article CAS PubMed Google Scholar
Sicklick, J. K. et al. Molecular profiling of cancer patients enables personalized combination therapy: the I-PREDICT study. Nat. Med., https://doi.org/10.1038/s41591-019-0407-5 (2019).

Download references

Author information

Authors and Affiliations

NovellusDx, Jerusalem, 9112001, Israel
Lior Zimmerman, Ori Zelichov, Arie Aizenmann, Zohar Barbash, Michael Vidne & Gabi Tarcic

Authors

Lior Zimmerman
View author publications
You can also search for this author in PubMed Google Scholar
Ori Zelichov
View author publications
You can also search for this author in PubMed Google Scholar
Arie Aizenmann
View author publications
You can also search for this author in PubMed Google Scholar
Zohar Barbash
View author publications
You can also search for this author in PubMed Google Scholar
Michael Vidne
View author publications
You can also search for this author in PubMed Google Scholar
Gabi Tarcic
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.Z. and G.T. wrote the manuscript, L.Z. prepared the figures, A.A. and Z.B. generated the data, L.Z., M.V. and G.T. designed the work, O.Z. and M.V. reviewed the manuscript.

Corresponding author

Correspondence to Gabi Tarcic.

Ethics declarations

Competing interests

All article authors are full time employees of NovellusDx.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Dataset 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zimmerman, L., Zelichov, O., Aizenmann, A. et al. A Novel System for Functional Determination of Variants of Uncertain Significance using Deep Convolutional Neural Networks. Sci Rep 10, 4192 (2020). https://doi.org/10.1038/s41598-020-61173-1

Download citation

Received: 18 October 2019
Accepted: 24 February 2020
Published: 06 March 2020
DOI: https://doi.org/10.1038/s41598-020-61173-1

This article is cited by

Actionability classification of variants of unknown significance correlates with functional effect
- Amber Johnson
- Patrick Kwok-Shing Ng
- Funda Meric-Bernstam
npj Precision Oncology (2023)
Deep Fuzzy System Algorithms Based on Deep Learning and Input Sharing for Regression Application
- Yunhu Huang
- Dewang Chen
- Hong Mo
International Journal of Fuzzy Systems (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.