A novel network regularized matrix decomposition method to detect mutated cancer genes in tumour samples with inter-patient heterogeneity

Inter-patient heterogeneity is a major challenge for mutated cancer genes detection which is crucial to advance cancer diagnostics and therapeutics. To detect mutated cancer genes in heterogeneous tumour samples, a prominent strategy is to determine whether the genes are recurrently mutated in their interaction network context. However, recent studies show that some cancer genes in different perturbed pathways are mutated in different subsets of samples. Subsequently, these genes may not display significant mutational recurrence and thus remain undiscovered even in consideration of network information. We develop a novel method called mCGfinder to efficiently detect mutated cancer genes in tumour samples with inter-patient heterogeneity. Based on matrix decomposition framework incorporated with gene interaction network information, mCGfinder can successfully measure the significance of mutational recurrence of genes in a subset of samples. When applying mCGfinder on TCGA somatic mutation datasets of five types of cancers, we find that the genes detected by mCGfinder are significantly enriched for known cancer genes, and yield substantially smaller p-values than other existing methods. All the results demonstrate that mCGfinder is an efficient method in detecting mutated cancer genes.

elements A ij of A are equal the elements A (adj) ij of A adj divided by the square root of the product of d i and d j , i.e.
The normalized Laplacian Matrix is for L G the un-normalized Laplacian. Therefore, the diagonal elements L ij of L are equal the degree of vertex i and off-diagonal elements L ij are -1 if vertex i is adjacent to j and 0 otherwise [5], i.e. (2) 1.4 Proof: s r 2 2 I p + λ L L is an invertible matrix Proof. Note that graph Laplacian matrix L (p × p) is positive semidefinite. Thus, through eigendecomposition, it can be factorized as L = P T ΛP, where P is an orthogonal matrix and Λ is a diagonal matrix whose diagonal entries are the eigenvalues of L. Due to positive semidefinite, all diagonal entries of diagonal matrix Λ are nonnegative. Because of the matrix orthogonality P T P = I p , the matrix s r 2 2 I p + λ L L can be factorized as s r 2 2 I p + λ L L = P T s r 2 2 I p + λ L Λ P, where s r 2 2 I p + λ L Λ is also a diagonal matrix. Note that s r 2 2 is always positive, λ L is an nonnegative tuning parameter and all diagonal entries of matrix Λ are nonnegative. Consequently, all diagonal entries of s r 2 2 I p + λ L Λ are positive, suggesting that it is a positive definite matrix. Therefore, the investigated matrix s r 2 2 I p + λ L L is invertible.

Significance test through a semiexact estimation
In brief, we define X net := s r as the network influenced matrix. For the r-th component, the coefficients of gene score vector g r can be calculated by the summation of the entries of a subset of rows of the network influenced matrix X net , where the rows are indicated by the sample indicator vector s r of the investigated component. To assess which of these mutated genes are statistically significant in a subset of samples, we follow the procedure of previous studies [6,7] and identify the genes of which the scores can disprove the null hypothesis that their values of the gene score vector coefficients are only contributed by background mutations alone.
Since the random background mutations could occur anywhere in the genome, we model the null distribution by recalculating the gene score vectors across all combinations of permutations of the network influenced matrix X net within rows (samples) indicated by s r of the r-th component [6,7]. Under the null hypothesis, the arrangement of entries in X net is independent between the indicated samples (rows). Accordingly, by permuting the entries in the rows indicated by s r of the matrix X net , we can generate a conservative, high estimate of the null distribution which contains information from both the somatic mutations and the network context. Since large numbers of permutations is usually time consuming, we follow the procedure proposed in previous approaches [6,7] by using a semi-exact estimate of this null distribution instead of simulating the null distribution by performing each of these permutations in turn. The distribution of the sum of across the indicated rows equals the convolution of the distributions of entries in all the indicated rows. For the investigated component, we approximate these distributions by generating histograms h r i (x net ) for the i-th indicated row of the network influenced matrix X net , where the number of bins is set to 10 5 . The final distribution for coefficient values of the gene score vector is calculated by H r = h r 1 ⊗ h r 2 . . . ⊗ h r lr , where 1, 2, . . . , l r is the indices of rows indicated by the r-th component (totally l r samples included in the investigated component). By comparing the coefficients of the estimated vector g r to the distribution above, we can assign the p-value for each investigated gene by the sum of the tail of the null distribution estimated above.

Input data: TCGA somatic mutation data
In this study, we use TCGA somatic mutation data to evaluate the performances of mCGfinder. To prevent mutagens or carcinogens involved in cancer treatment which could cloud the origin of the cancer, TCGA have strict sample criteria in acquiring tissue samples, such as "sample from primary tumor was necessary" and "neoadjuvant treatment was not allowable": https://cancergenome.nih.gov/cancersselected/biospeccriteria Therefore, to the best of our knowledge, the underlying genomics of primary, untreated tumor samples in TCGA is not affected by chemotherapy.
In consistent with HotNet2 and ReMIC, somatic mutation data are required as the input of mCGfinder. Therefore, somatic mutations from raw data files should be filtered to remove polymorphisms as described in previous study [8]. More detailed information of the datasets of the investigated cancers are provided in Supplementary Table S5.
[4] Yoav Benjamini and Yosef Hochberg. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the royal statistical society. Series B (Methodological), pages 289-300, 1995.

Supplementary Tables and Captions
Supplementary iquitously expressed in all eukaryotic cells. 23266771; 16685646; 1734024; 15940343 AKAP9 A-Kinase Anchoring Protein 9 BRCA (IntOGen) Scaffolding protein that assembles several protein kinases and phosphatases on the centrosome and Golgi apparatus. Required to maintain the integrity of the Golgi apparatus. Recruited to the Golgi apparatus by GM130/GOLGA2 and is required for microtubule nucleation at the cis-side of the Golgi apparatusGM130/GOLGA2.

10202149;
15047863; 19242490 AKT1 AKT Serine/Threonine Kinase 1 BRCA (CGC & IntOGen) AKT1 is one of 3 closely related serine/threonine-protein kinases (AKT1, AKT2 and AKT3) called the AKT kinase, and which regulate many processes including metabolism, proliferation, cell survival, growth and angiogenesis. This is mediated through serine and/or threonine phosphorylation of a range of downstream substrates. Over 100 substrate candidates have been reported so far, but for most of them, no isoform specificity has been reported. AKT is responsible of the regulation of glucose uptake by mediating insulin-induced translocation of the SLC2A4/GLUT4 glucose transporter to the cell surface. Phosphorylation of PTPN1 at 'Ser-50' negatively modulates its phosphatase activity preventing dephosphorylation of the insulin receptor and the attenuation of insulin signaling. Phosphorylation of TBC1D4 triggers the binding of this effector to inhibitory 14-3-3 proteins, which is required for insulin-stimulated glucose transport. AKT regulates also the storage of glucose in the form of glycogen by phosphorylating GSK3A at 'Ser-21' and GSK3B at 'Ser-9', resulting in inhibition of its kinase activity. Phosphorylation of GSK3 isoforms by AKT is also thought to be one mechanism by which cell proliferation is driven. AKT regulates also cell survival via the phosphorylation of MAP3K5 (apoptosis signal-related kinase). Phosphorylation of 'Ser-83' decreases MAP3K5 kinase activity stimulated by oxidative stress and thereby prevents apoptosis. AKT mediates insulin-stimulated protein synthesis by phosphorylating TSC2 at 'Ser-939' and 'Thr-1462', thereby activating mTORC1 signaling and leading to both phosphorylation of 4E-BP1 and in activation of RPS6KB1. AKT is involved in the phosphorylation of members of the FOXO factors (Forkhead family of transcription factors), leading to binding of 14-3-3 proteins and cytoplasmic localization. In particular, FOXO1 is phosphorylated at 'Thr-24', 'Ser-256' and 'Ser-319'. FOXO3 and FOXO4 are phosphorylated on equivalent sites. AKT has an important role in the regulation of NF-kappa-B-dependent gene transcription and positively regulates the activity of CREB1 (cyclic AMP (cAMP)-response element binding protein). The phosphorylation of CREB1 induces the binding of accessory proteins that are necessary for the transcription of pro-survival genes such as BCL2 and MCL1. AKT phosphorylates 'Ser-454' on ATP citrate lyase (ACLY), thereby potentially regulating ACLY activity and fatty acid synthesis. Activates the 3B isoform of cyclic nucleotide phosphodiesterase (PDE3B) via phosphorylation of 'Ser-273', resulting in reduced cyclic AMP levels and inhibition of lipolysis. Phosphorylates PIKFYVE on 'Ser-318', which results in increased PI3P-5 activity. The Rho GTPase-activating protein DLC1 is another substrate and its phosphorylation is implicated in the regulation cell proliferation and cell growth. AKT plays a role as key modulator of the AKT-mTOR signaling pathway controlling the tempo of the process of newborn neurons integration during adult neurogenesis, including correct neuron positioning, dendritic development and synapse formation. Signals downstream of phosphatidylinositol 3-kinase (PI3K) to mediate the effects of various growth factors such as platelet-derived growth factor (PDGF), epidermal growth factor (EGF), insulin and insulin-like growth factor I (IGF-I). AKT mediates the antiapoptotic effects of IGF-I. Essential for the SPATA13-mediated regulation of cell migration and adhesion assembly and disassembly. May be involved in the regulation of the placental development. Phosphorylates STK4/MST1 at 'Thr-120' and 'Thr-387' leading to inhibition of its: kinase activity, nuclear translocation, autophosphorylation and ability to phosphorylate FOXO3. Phosphorylates STK3/MST2 at 'Thr-117' and 'Thr-384' leading to inhibition of its: cleavage, kinase activity, autophosphorylation at Thr-180, binding to RASSF1 and nuclear translocation. Phosphorylates SRPK2 and enhances its kinase activity towards SRSF2 and ACIN1 and promotes its nuclear translocation. Phosphorylates RAF1 at 'Ser-259' and negatively regulates its activity. Phosphorylation of BAD stimulates its pro-apoptotic activity. Probable Polycomb group (PcG) protein involved in transcriptional regulation mediated by ligand-bound nuclear hormone receptors, such as retinoic acid receptors (RARs) and peroxisome proliferator-activated receptor gamma (PPARG). Acts as coactivator of RARA and RXRA through association with NCOA1. Acts as corepressor through recruitment of KDM1A and CBX5 to target genes in a cell-type specific manner; the function seems to involve differential recruitment of methylated histone H3 to respective promoters. Acts as corepressor for PPARG and suppresses its adipocyte differentiation-inducing activity . Non-catalytic component of the PR-DUB complex, a complex that specifically mediates deubiquitination of histone H2A monoubiquitinated at 'Lys-119' (H2AK119ub1 Involved in double-strand break repair and/or homologous recombination. Binds RAD51 and potentiates recombinational DNA repair by promoting assembly of RAD51 onto single-stranded DNA (ssDNA). Acts by targeting RAD51 to ssDNA over double-stranded DNA, enabling RAD51 to displace replication protein-A (RPA) from ssDNA and stabilizing RAD51-ssDNA filaments by blocking ATP hydrolysis. Part of a PALB2-scaffolded HR complex containing RAD51C and which is thought to play a role in DNA repair by HR. May participate in S phase checkpoint activation. Binds selectively to ssDNA, and to ssDNA in tailed duplexes and replication fork structures. May play a role in the extension step after strand invasion at replication-dependent DNA double-strand breaks; together with PALB2 is involved in both POLH localization at collapsed replication forks and DNA polymerization activity. In concert with NPM1, regulates centrosome duplication. Interacts with the TREX-2 complex (transcription and export complex 2) subunits PCID2 and SEM1, and is required to prevent R-loop-associated DNA damage and thus transcription-associated genomic instability. Silencing of BRCA2 promotes R-loop accumulation at actively transcribed genes in replicating and non-replicating cells, suggesting that BRCA2 mediates the control of Rloop associated genomic instability, independently of its known role in homologous recombination. Checkpoint Kinase 2 BLCA (IntOGen) Serine/threonine-protein kinase which is required for checkpoint-mediated cell cycle arrest, activation of DNA repair and apoptosis in response to the presence of DNA double-strand breaks. May also negatively regulate cell cycle progression during unperturbed cell cycles. Following activation, phosphorylates numerous effectors preferentially at the consensus sequence [L-X-R-X-X-S/T]. Regulates cell cycle checkpoint arrest through phosphorylation of CDC25A, CDC25B and CDC25C, inhibiting their activity. Inhibition of CDC25 phosphatase activity leads to increased inhibitory tyrosine phosphorylation of CDK-cyclin complexes and blocks cell cycle progression. May also phosphorylate NEK6 which is involved in G2/M cell cycle arrest. Regulates DNA repair through phosphorylation of BRCA2, enhancing the association of RAD51 with chromatin which promotes DNA repair by homologous recombination. Also stimulates the transcription of genes involved in DNA repair (including BRCA2) through the phosphorylation and activation of the transcription factor FOXM1. Regulates apoptosis through the phosphorylation of p53/TP53, MDM4 and PML. Phosphorylation of p53/TP53 at 'Ser-20' by CHEK2 may alleviate inhibition by MDM2, leading to accumulation of active p53/TP53. Phosphorylation of MDM4 may also reduce degradation of p53/TP53. Also controls the transcription of pro-apoptotic genes through phosphorylation of the transcription factor E2F1. Tumor suppressor, it may also have a DNA damage-independent function in mitotic spindle assembly by phosphorylating BRCA1. Its absence may be a cause of the chromosomal instability observed in some cancer cells. Promotes the CCAR2-SIRT1 association and is required for CCAR2mediated SIRT1 inhibition.
DNA damage induced by both ionizing and UV irradiation. Adapter protein which binds to BRCA1 and the checkpoint kinase CHEK1 and facilitates the ATR-dependent phosphorylation of both proteins. Can also bind specifically to branched DNA structures and may associate with S-phase chromatin following formation of the pre-replication complex (pre-RC). This may indicate a role for this protein as a sensor which monitors the integrity of DNA replication forks.
e complexes, which mediate the ubiquitination of proteins involved in cell cycle progression, signal transduction and transcription. SCF complexes and ARIH1 collaborate in tandem to mediate ubiquitination of target proteins. In the SCF complex, serves as a rigid scaffold that organizes the SKP1-F-box protein and RBX1 subunits.
s which mediate the ubiquitination and subsequent proteasomal degradation of target proteins. BCR complexes and ARIH1 collaborate in tandem to mediate ubiquitination of target proteins. As a scaffold protein may contribute to catalysis through positioning of the substrate and the ubiquitin-conjugating enzyme. The E3 ubiquitin-protein ligase activity of the complex is dependent on the neddylation of the cullin subunit and is inhibited by the association of the deneddylated cullin subunit with TIP120A/CAND1. The functional specificity of the BCR complex depends on the BTB domain-containing protein as the substrate recognition component. BCR(KLHL42) is involved in ubiquitination of KATNA1. BCR(SPOP) is involved in ubiquitination of BMI1/PCGF4, BRMS1, H2AFY and DAXX, GLI2 and GLI3. Can also form a cullin-RING-based BCR (BTB-CUL3-RBX1) E3 ubiquitin-protein ligase complex containing homodimeric SPOPL or the heterodimer formed by SPOP and SPOPL; these complexes have lower ubiquitin ligase activity. BCR(KLHL9-KLHL13) controls the dynamic behavior of AURKB on mitotic chromosomes and thereby coordinates faithful mitotic progression and completion of cytokinesis. BCR(KLHL12) is involved in ER-Golgi transport by regulating the size of COPII coats, thereby playing a key role in collagen export, which is required for embryonic stem (ES) cells division: BCR(KLHL12) acts by mediating monoubiquitination of SEC31 (SEC31A or SEC31B). BCR(KLHL3) acts as a regulator of ion transport in the distal nephron; by mediating ubiquitination of WNK4. The BCR(KLHL20) E3 ubiquitin ligase complex is involved in interferon response and anterograde Golgi to endosome transport: it mediates both ubiquitination leading to degradation and 'Lys-33'-linked ubiquitination. The BCR(KLHL21) E3 ubiquitin ligase complex regulates localization of the chromosomal passenger complex (CPC) from chromosomes to the spindle midzone in anaphase and mediates the ubiquitination of AURKB. The BCR(KLHL22) ubiquitin ligase complex mediates monoubiquitination of PLK1, leading to PLK1 dissociation from phosphoreceptor proteins and subsequent removal from kinetochores, allowing silencing of the spindle assembly checkpoint (SAC) and chromosome segregation. The BCR(KLHL25) ubiquitin ligase complex is involved in translational homeostasis by mediating ubiquitination and subsequent degradation of hypophosphorylated EIF4EBP1 (4E-BP1). Involved in ubiquitination of cyclin E and of cyclin D1 (in vitro) thus involved in regulation of G1/S transition. Involved in the ubiquitination of KEAP1, ENC1 and KLHL41. In concert with ATF2 and RBX1, promotes degradation of KAT5 thereby attenuating its ability to acetylate and activate ATM. The BCR(KCTD17) E3 ubiquitin ligase complex mediates ubiquitination and degradation of TCHP, a down-regulator of cilium assembly, thereby inducing ciliogenesis. BLCA (IntOGen) Molecular chaperone that promotes the maturation, structural maintenance and proper regulation of specific target proteins involved for instance in cell cycle control and signal transduction. Undergoes a functional cycle that is linked to its ATPase activity which is essential for its chaperone activity. This cycle probably induces conformational changes in the client proteins, thereby causing their activation. Interacts dynamically with various cochaperones that modulate its substrate recognition, ATPase cycle and chaperone function. Engages with a range of client protein classes via its interaction with various co-chaperone proteins or complexes, that act as adapters, simultaneously able to interact with the specific client and the central chaperone itself. Recruitment of ATP and co-chaperone followed by client protein forms a functional chaperone. After the completion of the chaperoning process, properly folded client protein and co-chaperone leave HSP90 in an ADP-bound partially open conformation and finally, ADP is released from HSP90 which acquires an open conformation for the next cycle. Apart from its chaperone activity, it also plays a role in the regulation of the transcription machinery. HSP90 and its co-chaperones modulate transcription at least at three different levels. In the first place, they alter the steady-state levels of certain transcription factors in response to various physiological cues. Second, they modulate the activity of certain epigenetic modifiers, such as histone deacetylases or DNA methyl transferases, and thereby respond to the change in the environment. Third, they participate in the eviction of histones from the promoter region of certain genes and thereby turn on gene expression. Binds bacterial lipopolysaccharide (LPS) and mediates LPS-induced inflammatory response, including TNF secretion by monocytes. Antagonizes STUB1-mediated inhibition of TGF-beta signaling via inhibition of STUB1-mediated SMAD3 ubiquitination and degradation. BLCA (IntOGen) Molecular chaperone that promotes the maturation, structural maintenance and proper regulation of specific target proteins involved for instance in cell cycle control and signal transduction. Undergoes a functional cycle that is linked to its ATPase activity. This cycle probably induces conformational changes in the client proteins, thereby causing their activation. Interacts dynamically with various co-chaperones that modulate its substrate recognition, ATPase cycle and chaperone function. Engages with a range of client protein classes via its interaction with various co-chaperone proteins or complexes, that act as adapters, simultaneously able to interact with the specific client and the central chaperone itself. Recruitment of ATP and co-chaperone followed by client protein forms a functional chaperone. After the completion of the chaperoning process, properly folded client protein and cochaperone leave HSP90 in an ADP-bound partially open conformation and finally, ADP is released from HSP90 which acquires an open conformation for the next cycle. Apart from its chaperone activity, it also plays a role in the regulation of the transcription machinery. HSP90 and its co-chaperones modulate transcription at least at three different levels. In the first place, they alter the steady-state levels of certain transcription factors in response to various physiological cues. Second, they modulate the activity of certain epigenetic modifiers, such as histone deacetylases or DNA methyl transferases, and thereby respond to the change in the environment. Third, they participate in the eviction of histones from the promoter region of certain genes and thereby turn on gene expression. Antagonizes STUB1-mediated inhibition of TGF-beta signaling via inhibition of STUB1-mediated SMAD3 ubiquitination and degradation. HNSC (IntOGen) Molecular chaperone implicated in a wide variety of cellular processes, including protection of the proteome from stress, folding and transport of newly synthesized polypeptides, activation of proteolysis of misfolded proteins and the formation and dissociation of protein complexes. Plays a pivotal role in the protein quality control system, ensuring the correct folding of proteins, the re-folding of misfolded proteins and controlling the targeting of proteins for subsequent degradation. This is achieved through cycles of ATP binding, ATP hydrolysis and ADP release, mediated by co-chaperones. The co-chaperones have been shown to not only regulate different steps of the ATPase cycle of HSP70, but they also have an individual specificity such that one co-chaperone may promote folding of a substrate while another may promote degradation. The affinity of HSP70 for polypeptides is regulated by its nucleotide bound state. In the ATP-bound form, it has a low affinity for substrate proteins. However, upon hydrolysis of the ATP to ADP, it undergoes a conformational change that increases its affinity for substrate proteins. HSP70 goes through repeated cycles of ATP hydrolysis and nucleotide exchange, which permits cycles of substrate binding and release. The HSP70-associated co-chaperones are of three types: J-domain co-chaperones HSP40s (stimulate ATPase hydrolysis by HSP70), the nucleotide exchange factors (NEF) such as BAG1/2/3 (facilitate conversion of HSP70 from the ADP-bound to the ATP-bound state thereby promoting substrate release), and the TPR domain chaperones such as HOPX and STUB1. Acts as a repressor of transcriptional activation. Inhibits the transcriptional coactivator activity of CITED1 on Smad-mediated transcription. Component of the PRP19-CDC5L complex that forms an integral part of the spliceosome and is required for activating pre-mRNA splicing. May have a scaffolding role in the spliceosome assembly as it contacts all other components of the core complex. Binds bacterial lipopolysaccharide (LPS) and mediates LPS-induced inflammatory response, including TNF secretion by monocytes. Participates in the ER-associated degradation (ERAD) quality control pathway in conjunction with J domain-containing co-chaperones and the E3 ligase STUB1. Myocyte Enhancer Factor 2C HNSC (IntOGen) Transcription activator which binds specifically to the MEF2 element present in the regulatory regions of many muscle-specific genes. Controls cardiac morphogenesis and myogenesis, and is also involved in vascular development. Plays an essential role in hippocampal-dependent learning and memory by suppressing the number of excitatory synapses and thus regulating basal and evoked synaptic transmission. Crucial for normal neuronal development, distribution, and electrical activity in the neocortex. Necessary for proper development of megakaryocytes and platelets and for bone marrow B-lymphopoiesis. Required for B-cell survival and proliferation in response to BCR stimulation, efficient IgG1 antibody responses to T-cell-dependent antigens and for normal induction of germinal center B-cells. May also be involved in neurogenesis and in the development of cortical architecture . Isoform 3 and isoform 4, which lack the repressor domain, are more active than isoform 1 and isoform 2. Functions as a dual-specificity transcription factor, regulating the expression of both MAX-network and T-box family target genes. Functions as a repressor or an activator. Binds to 5'-AATTTCACACCTAGGTGTGAAATT-3' core sequence and seems to regulate MYC-MAX target genes. Suppresses transcriptional activation by MYC and inhibits MYC-dependent cell transformation. Function activated by heterodimerization with MAX. This heterodimerization serves the dual function of both generating an E-box-binding heterodimer and simultaneously blocking interaction of a corepressor .

10601024; 24362264; 11031250 MLH1
MutL Homolog 1 BLCA (IntOGen) Heterodimerizes with PMS2 to form MutL alpha, a component of the post-replicative DNA mismatch repair system (MMR). DNA repair is initiated by MutS alpha (MSH2-MSH6) or MutS beta (MSH2-MSH6) binding to a dsDNA mismatch, then MutL alpha is recruited to the heteroduplex. Assembly of the MutL-MutS-heteroduplex ternary complex in presence of RFC and PCNA is sufficient to activate endonuclease activity of PMS2. It introduces single-strand breaks near the mismatch and thus generates new entry points for the exonuclease EXO1 to degrade the strand containing the mismatch. DNA methylation would prevent cleavage and therefore assure that only the newly mutated DNA strand is going to be corrected. MutL alpha (MLH1-PMS2) interacts physically with the clamp loader subunits of DNA polymerase III, suggesting that it may play a role to recruit the DNA polymerase III to the site of the MMR. Also implicated in DNA damage signaling, a process which induces cell cycle arrest and can lead to apoptosis in case of major DNA damages. Heterodimerizes with MLH3 to form MutL gamma which plays a role in meiosis. Histone methyltransferase that plays an essential role in early development and hematopoiesis. Catalytic subunit of the MLL1/MLL complex, a multiprotein complex that mediates both methylation of 'Lys-4' of histone H3 (H3K4me) complex and acetylation of 'Lys-16' of histone H4 (H4K16ac). In the MLL1/MLL complex, it specifically mediates H3K4me, a specific tag for epigenetic transcriptional activation. Has weak methyltransferase activity by itself, and requires other component of the MLL1/MLL complex to obtain full methyltransferase activity. Has no activity toward histone H3 phosphorylated on 'Thr-3', less activity toward H3 dimethylated on 'Arg-8' or 'Lys-9', while it has higher activity toward H3 acetylated on 'Lys-9'. Required for transcriptional activation of HOXA9. Promotes PPP1R15A-induced apoptosis. Plays a critical role in the control of circadian gene expression and is essential for the transcriptional activation mediated by the CLOCK-ARNTL/BMAL1 heterodimer. Establishes a permissive chromatin state for circadian transcription by mediating a rhythmic methylation of 'Lys-4' of histone H3 (H3K4me) and this histone modification directs the circadian acetylation at H3K9 and H3K14 allowing the recruitment of CLOCK-ARNTL/BMAL1 to chromatin .
10490642; 12453419; 15960975; 19556245 MTOR Mechanistic Target Of Rapamycin BRCA (IntOGen) Serine/threonine protein kinase which is a central regulator of cellular metabolism, growth and survival in response to hormones, growth factors, nutrients, energy and stress signals. MTOR directly or indirectly regulates the phosphorylation of at least 800 proteins. Functions as part of 2 structurally and functionally distinct signaling complexes mTORC1 and mTORC2 (mTOR complex 1 and 2). Activated mTORC1 up-regulates protein synthesis by phosphorylating key regulators of mRNA translation and ribosome synthesis. This includes phosphorylation of EIF4EBP1 and release of its inhibition toward the elongation initiation factor 4E (eiF4E). Moreover, phosphorylates and activates RPS6KB1 and RPS6KB2 that promote protein synthesis by modulating the activity of their downstream targets including ribosomal protein S6, eukaryotic translation initiation factor EIF4B, and the inhibitor of translation initiation PDCD4. Stimulates the pyrimidine biosynthesis pathway, both by acute regulation through RPS6KB1-mediated phosphorylation of the biosynthetic enzyme CAD, and delayed regulation, through transcriptional enhancement of the pentose phosphate pathway which produces 5-phosphoribosyl-1-pyrophosphate (PRPP), an allosteric activator of CAD at a later step in synthesis, this function is dependent on the mTORC1 complex. Regulates ribosome synthesis by activating RNA polymerase III-dependent transcription through phosphorylation and inhibition of MAF1 an RNA polymerase III-repressor. In parallel to protein synthesis, also regulates lipid synthesis through SREBF1/SREBP1 and LPIN1. To maintain energy homeostasis mTORC1 may also regulate mitochondrial biogenesis through regulation of PPARGC1A. mTORC1 also negatively regulates autophagy through phosphorylation of ULK1. Under nutrient sufficiency, phosphorylates ULK1 at 'Ser-758', disrupting the interaction with AMPK and preventing activation of ULK1. Also prevents autophagy through phosphorylation of the autophagy inhibitor DAP. mTORC1 exerts a feedback control on upstream growth factor signaling that includes phosphorylation and activation of GRB10 a INSR-dependent signaling suppressor. Among other potential targets mTORC1 may phosphorylate CLIP1 and regulate microtubules. As part of the mTORC2 complex MTOR may regulate other cellular processes including survival and organization of the cytoskeleton. Plays a critical role in the phosphorylation at 'Ser-473' of AKT1, a pro-survival effector of phosphoinositide 3-kinase, facilitating its activation by PDK1. mTORC2 may regulate the actin cytoskeleton, through phosphorylation of PRKCA, PXN and activation of the Rho-type guanine nucleotide exchange factors RHOA and RAC1A or RAC1B. mTORC2 also regulates the phosphorylation of SGK1 at 'Ser-422'. Regulates osteoclastogensis by adjusting the expression of CEBPB isoforms . Myosin Heavy Chain 10 BLCA (IntOGen) Cellular myosin that appears to play a role in cytokinesis, cell shape, and specialized functions such as secretion and capping. Involved with LARP6 in the stabilization of type I collagen mRNAs for CO1A1 and CO1A2. During cell spreading, plays an important role in cytoskeleton reorganization, focal contacts formation (in the central part but not the margins of spreading cells), and lamellipodial extension; this function is mechanically antagonized by MYH9.

20603131; 20052411
MYH9 Myosin Heavy Chain 9 BRCA (IntOGen) Cellular myosin that appears to play a role in cytokinesis, cell shape, and specialized functions such as secretion and capping. During cell spreading, plays an important role in cytoskeleton reorganization, focal contacts formation (in the margins but not the central part of spreading cells), and lamellipodial retraction; this function is mechanically antagonized by MYH10. Tumor suppressor. Acts as a dual-specificity protein phosphatase, dephosphorylating tyrosine-, serine-and threonine-phosphorylated proteins. Also acts as a lipid phosphatase, removing the phosphate in the D3 position of the inositol ring from phosphatidylinositol 3,4,5-trisphosphate, phosphatidylinositol 3,4-diphosphate, phosphatidylinositol 3-phosphate and inositol 1,3,4,5-tetrakisphosphate with order of substrate preference in vitro PtdIns(3,4,5)P3 > PtdIns(3,4)P2 > PtdIns3P > Ins(1,3,4,5)P4. The lipid phosphatase activity is critical for its tumor suppressor function. Antagonizes the PI3K-AKT/PKB signaling pathway by dephosphorylating phosphoinositides and thereby modulating cell cycle progression and cell survival. The unphosphorylated form cooperates with AIP1 to suppress AKT1 activation. Dephosphorylates tyrosine-phosphorylated focal adhesion kinase and inhibits cell migration and integrin-mediated cell spreading and focal adhesion formation. Plays a role as a key modulator of the AKT-mTOR signaling pathway controlling the tempo of the process of newborn neurons integration during adult neurogenesis, including correct neuron positioning, dendritic development and synapse formation. May be a negative regulator of insulin signaling and glucose metabolism in adipose tissue. The nuclear monoubiquitinated form possesses greater apoptotic potential, whereas the cytoplasmic nonubiquitinated form induces less tumor suppressive ability. In motile cells, suppresses the formation of lateral pseudopods and thereby promotes cell polarization and directed movement. SET Domain Bifurcated 1 BRCA (IntOGen) Histone methyltransferase that specifically trimethylates 'Lys-9' of histone H3. H3 'Lys-9' trimethylation represents a specific tag for epigenetic transcriptional repression by recruiting HP1 (CBX1, CBX3 and/or CBX5) proteins to methylated histones. Mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes. H3 'Lys-9' trimethylation is coordinated with DNA methylation. Probably forms a complex with MBD1 and ATF7IP that represses transcription and couples DNA methylation and histone 'Lys-9' trimethylation. Its activity is dependent on MBD1 and is heritably maintained through DNA replication by being recruited by CAF-1. SETDB1 is targeted to histone H3 by TRIM28/TIF1B, a factor recruited by KRAB zinc-finger proteins. Probably forms a corepressor complex required for activated KRAS-mediated promoter hypermethylation and transcriptional silencing of tumor suppressor genes (TSGs) or other tumor-related genes in colorectal cancer (CRC) cells. Also required to maintain a transcriptionally repressive state of genes in undifferentiated embryonic stem cells (ESCs). Associates at promoter regions of tumor suppressor genes (TSGs) leading to their gene silencing. The SETDB1-TRIM28-ZNF274 complex may play a role in recruiting ATRX to the 3'-exons of zinc-finger coding genes with atypical chromatin signatures to establish or maintain/protect H3K9me3 at these transcriptionally active regions. Subunit of the splicing factor SF3B required for 'A' complex assembly formed by the stable binding of U2 snRNP to the branchpoint sequence (BPS) in pre-mRNA. Sequence independent binding of SF3A/SF3B complex upstream of the branch site is essential, it may anchor U2 snRNP to the pre-mRNA. May also be involved in the assembly of the 'E' complex. Belongs also to the minor U12-dependent spliceosome, which is involved in the splicing of rare class of nuclear pre-mRNA intron. Transmembrane serine/threonine kinase forming with the TGF-beta type I serine/threonine kinase receptor, TGFBR1, the non-promiscuous receptor for the TGF-beta cytokines TGFB1, TGFB2 and TGFB3. Transduces the TGFB1, TGFB2 and TGFB3 signal from the cell surface to the cytoplasm and is thus regulating a plethora of physiological and pathological processes including cell cycle arrest in epithelial and hematopoietic cells, control of mesenchymal cell proliferation and differentiation, wound healing, extracellular matrix production, immunosuppression and carcinogenesis. The formation of the receptor complex composed of 2 TGFBR1 and 2 TGFBR2 molecules symmetrically bound to the cytokine dimer results in the phosphorylation and the activation of TGFRB1 by the constitutively active TGFBR2. Activated TGFBR1 phosphorylates SMAD2 which dissociates from the receptor and interacts with SMAD4. The SMAD2-SMAD4 complex is subsequently translocated to the nucleus where it modulates the transcription of the TGF-beta-regulated genes. This constitutes the canonical SMADdependent TGF-beta signaling cascade. Also involved in non-canonical, SMAD-independent TGF-beta signaling pathways.

TP53
Tumor Protein P53 BLCA (IntOGen) Acts as a tumor suppressor in many tumor types; induces growth arrest or apoptosis depending on the physiological circumstances and cell type. Involved in cell cycle regulation as a trans-activator that acts to negatively regulate cell division by controlling a set of genes required for this process. One of the activated genes is an inhibitor of cyclin-dependent kinases. Apoptosis induction seems to be mediated either by stimulation of BAX and FAS antigen expression, or by repression of Bcl-2 expression. In cooperation with mitochondrial PPIF is involved in activating oxidative stress-induced necrosis; the function is largely independent of transcription. Induces the transcription of long intergenic non-coding RNA p21 (lincRNA-p21) and lincRNA-Mkln1. LincRNA-p21 participates in TP53-dependent transcriptional repression leading to apoptosis and seem to have to effect on cell-cycle regulation. Implicated in Notch signaling cross-over. Prevents CDK7 kinase activity when associated to CAK complex in response to DNA damage, thus stopping cell cycle progression. Isoform 2 enhances the transactivation activity of isoform 1 from some but not all TP53-inducible promoters. Isoform 4 suppresses transactivation activity and impairs growth suppression mediated by isoform 1. Isoform 7 inhibits isoform 1-mediated apoptosis. Regulates the circadian clock by repressing CLOCK-ARNTL/BMAL1-mediated transcriptional activation of PER2. In complex with TSC2, inhibits the nutrient-mediated or growth factor-stimulated phosphorylation of S6K1 and EIF4EBP1 by negatively regulating mTORC1 signaling. Seems not to be required for TSC2 GAP activity towards RHEB. Implicated as a tumor suppressor. Involved in microtubule-mediated protein transport, but this seems to be due to unregulated mTOR signaling. Zinc-finger RNA-binding protein that destabilizes several cytoplasmic AU-rich element (ARE)-containing mRNA transcripts by promoting their poly(A) tail removal or deadenylation, and hence provide a mechanism for attenuating protein synthesis. Acts as a 3'-untranslated region (UTR) ARE mRNA-binding adapter protein to communicate signaling events to the mRNA decay machinery. Functions by recruiting the CCR4-NOT deadenylase complex and components of the cytoplasmic RNA decay machinery to the bound ARE-containing mRNAs, and hence promotes ARE-mediated mRNA deadenylation and decay processes. Induces also the degradation of ARE-containing mRNAs even in absence of poly(A) tail . Binds to 3'-UTR ARE of numerous mRNAs. Positively regulates early adipogenesis by promoting ARE-mediated mRNA decay of immediate early genes (IEGs) . Promotes ARE-mediated mRNA decay of mineralocorticoid receptor NR3C2 mRNA in response to hypertonic stress. Negatively regulates hematopoietic/erythroid cell differentiation by promoting ARE-mediated mRNA decay of the transcription factor STAT5B mRNA. Positively regulates monocyte/macrophage cell differentiation by promoting ARE-mediated mRNA decay of the cyclin-dependent kinase CDK6 mRNA. Promotes degradation of AREcontaining pluripotency-associated mRNAs in embryonic stem cells (ESCs), such as NANOG, through a fibroblast growth factor (FGF)-induced MAPK-dependent signaling pathway, and hence attenuates ESC self-renewal and positively regulates mesendoderm differentiation . May play a role in mediating pro-apoptotic effects in malignant B-cells by promoting ARE-mediated mRNA decay of BCL2 mRNA. In association with ZFP36L2 maintains quiescence on developing B lymphocytes by promoting ARE-mediated decay of several mRNAs encoding cell cycle regulators that help B cells progress through the cell cycle, and hence ensuring accurate variable-diversityjoining (VDJ) recombination and functional immune cell formation . Together with ZFP36L2 is also necessary for thymocyte development and prevention of T-cell acute lymphoblastic leukemia (T-ALL) transformation by promoting ARE-mediated mRNA decay of the oncogenic transcription factor NOTCH1 mRNA . Participates in the delivery of target ARE-mRNAs to processing bodies (PBs). In addition to its cytosolic mRNA-decay function, plays a role in the regulation of nuclear mRNA 3'-end processing; modulates mRNA 3'-end maturation efficiency of the DLL4 mRNA through binding with an ARE embedded in a weak noncanonical polyadenylation (poly(A)) signal in endothelial cells. Also involved in the regulation of stress granule (SG) and P-body (PB) formation and fusion. Plays a role in vasculogenesis and endocardial development . Plays a role in the regulation of keratinocyte proliferation, differentiation and apoptosis. Plays a role in myoblast cell differentiation