A pan-cancer assessment of alterations of the kinase domain of ULK1, an upstream regulator of autophagy

Kumar, Mukesh; Papaleo, Elena

doi:10.1038/s41598-020-71527-4

Published: 10 September 2020


Scientific Reports volume 10, Article number: 14874 (2020)


Abstract

Autophagy is a key clearance process to recycle damaged cellular components. One important upstream regulator of autophagy is ULK1 kinase. Several three-dimensional structures of the ULK1 catalytic domain are available, but a comprehensive study, including molecular dynamics, is missing. Also, an exhaustive description of ULK1 alterations found in cancer samples is presently lacking. We here applied a framework which links -omics data to structural protein ensembles to study ULK1 alterations from genomics data available for more than 30 cancer types. We predicted the effects of mutations on ULK1 function and structural stability, accounting for protein dynamics and the different layers of changes that a mutation can induce in a protein at the functional and structural level. ULK1 is down-regulated in gynecological tumors. In other cancer types, ULK2 could compensate for ULK1 downregulation and, in the majority of the cases, no marked changes in expression have been found. 36 missense mutations of ULK1, not limited to the catalytic domain, are co-occurring with mutations in a large number of ULK1 interactors or substrates, suggesting a pronounced effect of the upstream steps of autophagy in many cancer types. Moreover, our results pinpoint that more than 50% of the mutations in the kinase domain of ULK1, here investigated, are predicted to affect protein stability. Three mutations (S184F, D102N, and A28V) are predicted to impact only kinase activity, either modifying the functional dynamics or the capability to exert effects from distal sites to the functional and catalytic regions. The framework here applied could be extended to other protein targets to aid the classification of missense mutations from cancer genomics studies, as well as to prioritize variants for experimental validation, or to select the appropriate biological readouts for experiments.

Introduction

Autophagy is a highly conserved catabolic mechanism across eukaryotes to degrade different cellular components and molecules, including organelles, proteins, and bacteria^1,2. Autophagy initiates with the formation of the autophagosome, which later fuses to the lysosome, resulting in the degradation of the cargo and the release of cellular building blocks^3,4. At a basal level, autophagy contributes to maintaining cellular homeostasis and to the recycling of cellular components. Autophagy is also induced as a response to different stresses with a cytoprotective role. Defects in the autophagy machinery are often linked to diseases, including cancer, neurodegeneration, and bacterial infections⁵.

The ULK1 (Unc-51 like autophagy activating kinase 1) complex initiates autophagy^6,7,8. The complex consists of the ULK1 kinase, the FIP200 (FAK family kinase interacting protein of 200 kDa) scaffold protein, and the ATG13-ATG101 HORMA (Hop/Rev7/Mad2) complex⁹. The ULK1 complex can integrate different signals to promote both bulk and selective autophagy⁹. A highly coordinated and conserved cascade of post-translational events of the ULK1 complex, including phosphorylation and ubiquitination, acts as a major switch for autophagy initiation^10,11,12,13. In most cases, autophagy initiation is regulated by the interplay between the mTOR (mammalian Target of Rapamycin) and AMPK (AMP-activated protein kinase) complexes. These complexes perform a series of inhibitory or activatory phosphorylations of the ULK1 complex in different physiological conditions^10,14,15,16,17,18. Upon activation, ULK1 directly phosphorylates the PI3K kinase complex, including BECLIN-1, VPS34, and ATG14, which, in turn, facilitate nucleation of the phagophore membrane at the phagophore assembly sites^19,20,21. ULK1 also phosphorylates AMBRA1, which is a positive regulator of the PI3K complex²¹. By analogy to the yeast counterpart, ULK1 has been suggested to play essential scaffolding roles for autophagosome formation and maturation⁹. Mammals have five ULK1 homologs, with ULK1 and ULK2 featuring the highest similarities and functional redundancy⁷, implying that both need to be inactivated for a substantial inhibition of autophagy²².

Different cancer types or subtypes show a deregulation of ULK1^23,24,25,26. Autophagy, in general, has a strong association with cancer and can play a dual role in a context-dependent way, acting as either tumor suppressor or promoter^27,28. AMPK-ULK1 mediated autophagy induces resistance against bromodomain and extraterminal domain inhibitors, which are novel epigenetic therapeutics for acute myeloid leukemia²⁹. Several inhibitors of ULK1 have been used to study its function in autophagy^21,30,31,32. These molecules have the potential to be used in cancer therapy in light of cancer addiction to autophagy³³. Moreover, ULK1 is involved in the first biochemical steps of autophagy, representing an amenable druggable target in the pathway. X-ray structures of ULK1 in complex with inhibitors are available (PDB entries: 4WNO³¹, 4WNP³¹, 5CI7³², 6QAS³⁴, and 6MNH³⁵).

ULK1 is a multi-domain serine-threonine kinase, enriched in disordered regions³⁶, which is a common trait of many scaffolding proteins. ULK1 has a preference for serine as the phospho-acceptor residue in the substrate and for hydrophobic residues surrounding the phosphorylation site²¹. The kinase domain is located at the N-terminal region of the protein and is conserved among yeast and mammals⁹. The N-terminal kinase domain of ULK1 includes the activation loops (165–174 and 178–191) and the catalytic loop (136–145). ULK1 also includes a proline/serine-rich region (279–828) and a C-terminal domain (829–1,051), which are involved in interactions with ATG13, FIP200, and other upstream regulators such as mTOR and AMPK^15,37. In contrast, the N-terminal kinase domain of ULK1 could interact with the LRRK2 protein³⁸. An important activatory autophosphorylation site is located in the activation loop at T180⁹ which, upon phosphorylation, can engage in salt bridges with the neighboring arginine residues R137 and R170³¹.

Despite the availability of X-ray structures of the ULK1 kinase domain, no extensive molecular dynamics (MD) simulation studies of the protein have been undertaken. The availability of a conformational ensemble of a protein is of paramount importance to better understand its function and the effects of its alterations due to mutations^39,40,41,42. Thus, we used all-atom and coarse-grain models to obtain an ensemble of conformations and account for ULK1 flexibility and dynamics. We then applied methods inspired by network theory to achieve a Protein Structure Network representation^43,44 of the conformational ensemble. This methodology can help to identify residues important for structural stability and function, and to pinpoint possible and elusive effects triggered by sites distal to the functional residues of the protein^45,46,47.

We combined these structural studies with a curation of alterations of ULK1 in more than 30 cancer studies available in The Cancer Genome Atlas (TCGA)^48,49, with attention to changes in expression level or due to mutational events of the protein itself. Indeed, TCGA is among the most important examples of large-scale cancer genomics studies, collecting clinical and molecular data for over 33 tumor types and more than 11,000 samples from cancer patients. Both expression and mutational data are available for TCGA samples. In addition, since the pool of normal samples is often underrepresented or not available in TCGA, the Recount and Recount2 initiatives^50,51 aimed at integrating TCGA data with normal healthy samples from the Genotype-Tissue-Expression (GTEx) project⁵².

The combination of analysis of -omics data and structural methods used in this study, expanding a framework that we recently applied to other proteins^46,53, provides a detailed assessment of the different effects that ULK1 mutations can cause to perturb the protein structural stability or activity. Our results can guide the selection of ULK1 as a target in certain cancer types, suggest which readouts to study for experimental research in cancer cellular biology, and provide knowledge for pharmacological or clinical-oriented efforts.

Results and discussion

Alterations in gene expression in ULK1 and ULK2 in different cancer types

In our study, we focus on the effects of mutations found in cancer samples in ULK1 kinase domain, since it is the only region with an available experimental structure. The majority of the rest of the protein has a high content of predicted disorder according to FELLS⁵⁴. Nevertheless, due to the complexity of alterations occurring in tumors, it is also important to evaluate whether other alterations are occurring, such as at the level of expression. We analyzed the changes in gene expression of ULK1 using RNASeq data from TCGA^49,55 and Recount2^51,56, for a total of 30 different cancer types (Fig. 1, Table S1). Due to the high functional redundancy of ULK1 and ULK2⁵⁷, we also monitored the changes of ULK2 in the same cancer types. We used differential expression analyses to estimate the changes in expression of all the genes in the dataset and then retrieved the estimate for ULK1 and ULK2 (Fig. 1). We observed different changes depending on the cancer types. Brain, gynecological and esophagus cancer types feature a downregulation of both genes, suggesting an impairment of their function in autophagy. In another group of tumors, compensatory effects might be in play with one of the two ULK genes downregulated and the other upregulated, as exemplified by Lung Squamous Cell Carcinoma, LUSC (Fig. 1). In other cases, only one of the two kinases is up- or downregulated, suggesting that the unaltered levels of the remaining kinase can have a partial compensatory effect. We did not observe marked changes in expression of either ULK1 or ULK2 in eleven of the cancer types used in our study, such as breast, bladder, and kidney.
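The grouping of cancer types described above (both genes downregulated, opposite changes suggesting compensation, a single gene altered, or no marked change) can be illustrated with a short sketch. It assumes that a log2 fold change is available for each gene from the differential expression analysis. The function name `expression_pattern`, the threshold of 1.0, and the example values are hypothetical and are not taken from the study.

```python
def expression_pattern(logfc_ulk1, logfc_ulk2, threshold=1.0):
    """Label the joint expression change of ULK1 and ULK2.

    logfc_*: log2 fold changes (tumor vs. normal).
    threshold: absolute cutoff for calling a change (assumed value).
    """
    def direction(value):
        if value >= threshold:
            return "up"
        if value <= -threshold:
            return "down"
        return "unchanged"

    d1, d2 = direction(logfc_ulk1), direction(logfc_ulk2)
    if d1 == d2 == "down":
        return "both downregulated"
    if d1 == d2 == "up":
        return "both upregulated"
    if {d1, d2} == {"up", "down"}:
        return "opposite changes (possible compensation)"
    if d1 == d2 == "unchanged":
        return "no marked change"
    return "single gene altered"


# Hypothetical log2 fold changes for four made-up cancer types.
examples = {
    "type_A": expression_pattern(-1.5, -2.0),
    "type_B": expression_pattern(-1.4, 1.2),
    "type_C": expression_pattern(0.2, -0.3),
    "type_D": expression_pattern(1.3, 0.1),
}
```

In a real analysis the call would also take the adjusted p-value into account; the sketch uses the fold change alone to keep the grouping logic visible.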

Functional elements of ULK1 kinase domain of interest for the study

The ULK1 kinase domain (residues 8–280, Fig. 2a) consists of catalytic and regulatory regions and belongs to the CAMK (Ca²⁺/calmodulin-dependent kinase) family based on a structurally validated alignment of the kinome⁵⁸. It can be divided into a small N-lobe (residues 8–92), a hinge region (93-EYCNGG-98), and a large C-lobe (99–280), as shown in Fig. 2b,c. Before illustrating our analysis, we aim to orient the reader on the important functional ULK1 elements that we will recall in the discussion of the results. The structural background illustrated below is needed to appreciate the effects that the mutations found in cancer samples could exert on this protein.

The ULK1 kinase domain includes a long and positively charged activation loop (165–174 and 178–191) that may play a role in substrate recognition and activity regulation (Fig. 2b,c). ULK kinases share the long disordered activation loop, a rare feature in the rest of the kinome³¹. The kinase domain can be activated by phosphorylation on Thr180 on the activation loop^9,59 (Fig. 2d). The activation loop of ULK1 also includes the invariant kinase DFG motif (Asp165-Phe166-Gly167) and extends up to the APE motif (Ala189-Pro190-Glu191). The activation loop generally forms a cleft for substrate binding when the kinase is in its active state. The bound substrates form specific interactions with a conserved HRD motif (His136-Arg137-Asp138 in ULK1) (Fig. 2e). The active state generally exhibits a salt bridge between a conserved lysine in the β3 strand (K46 in ULK1) and a glutamate residue (E63) in the C-helix (48–69, Fig. 2e). This salt bridge is conserved in the structures from the MD ensembles collected in this study (see below). A basic patch around the acetylation site K162 is also important for protein activation⁶⁰. The basic patch can be involved in interactions of ULK1 with membranes or ATG13, as well as in the binding to its own C-terminal domain³¹.

ULK1 is known to prefer serine residues in the target substrates⁶¹, a trait that we predict related to the presence of the F168 as DFG + 1 residue (Fig. 2e), in agreement with the findings by Chen et al.⁶² that large hydrophobic DFG + 1 residues promote Ser phosphorylation.

Kinases often present a gatekeeper residue in the active site⁶³, which corresponds to Met92 in ULK1 (Fig. 2e). Mutations of gatekeeper residues in other kinases are associated with the development of chemotherapeutic resistance⁶⁴. Mutations in the gatekeeper position of kinases can also improve inhibitor potency, which can be increased by large-to-small mutations at this site⁶⁵. In this context, methionine (as observed in ULK1) is among the larger, bulkier gatekeeper residues found in kinases so far. The mutation of a threonine gatekeeper to methionine is associated with the development of drug resistance in other kinases⁶⁶. A glycine-rich loop in the proximity of the DFG motif (G25 and G23 in ULK1) is also important for kinase function⁶⁷.

The regulatory (RS0: D203, RS1: H136, RS2: F166, RS3: L67 and RS4: L78, Fig. 2f) and the catalytic spines (L21, V30, A44, L145, I144, L146, I210 and C214, Fig. 2g) observed for other kinases are conserved in ULK1. A comparison of active and inactive regulatory spines (R-spines, RS) of kinases showed that the RS3 residue in the C-helix of the dormant enzyme is displaced and the spine not properly aligned when compared to the active enzyme⁶⁸. The R-spine consists of residues from both the N- and C-lobes. The histidine of the HRD motif and the phenylalanine of the DFG motif also contribute to the R-spine formation. V30 and A44 of the ULK1 catalytic spine should be the residues for the binding with the adenine group of ATP, whereas one of the leucine residues is likely to be important for the interaction with the adenine base. The catalytic and the regulatory spines control catalysis by dictating the positioning of the ATP and the substrate, respectively. Thus, their proper alignment is necessary for the assembly of an active kinase.

ULK1 microsecond dynamics

Kinases are characterized by complex conformational changes and several dynamic elements^69,70. Biomolecular simulations, which allow the description of protein dynamics over different timescales, have proved effective in studying structure–function-dynamics relationships in kinases⁶⁹. Thus, we collected all-atom Molecular Dynamics (MD) simulations in explicit solvent to provide the first description of the ensemble of conformations of the ULK1 kinase domain in solution. An ensemble of conformations, such as the one provided by MD, is also pivotal to the structural analyses required for the annotation of the ULK1 mutations found in cancer genomics studies, as we recently applied to other target proteins^46,53. To rule out dynamic patterns that depend on the physical model used in the simulations, we collected MD simulations of the ULK1 kinase domain with two different force fields (i.e., CHARMM22star and CHARMM27). Using a dimensionality reduction approach based on Principal Component Analysis, we compared the conformational sampling⁷¹ achieved with the two force fields (Fig. 3a). We find a good overlap in the subspace described by the first two principal components, which alone account for more than 40% of the atomic fluctuations, suggesting that the two simulations give a consistent view of the ULK1 dynamics. To quantify the overlap between the conformational space sampled by the two different force fields, we also calculated the root mean square inner product (RMSIP), which is a measure of the similarity of the structural space described by the first 20 principal components, with a value of unity as an indicator of identical subspaces. We obtained an RMSIP value of 0.76 for the two simulations of the ULK1 kinase domain, indicating a high similarity of the sampled conformational subspaces and confirming the results from inspection of the 2D projections.
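For reference, the RMSIP between two sets of n eigenvectors u_i and v_j is commonly defined as the square root of (1/n) times the sum over all i and j of (u_i · v_j)², with n = 20 in the comparison above. The following is a minimal sketch of that formula in plain Python, assuming each set of eigenvectors is orthonormal and supplied as a list of equal-length lists. The function name `rmsip` and the toy vectors are illustrative and are not part of the original analysis.

```python
import math


def rmsip(vecs_a, vecs_b):
    """Root mean square inner product between two sets of eigenvectors.

    vecs_a, vecs_b: lists of n orthonormal vectors (lists of floats).
    Returns 1.0 for identical subspaces and 0.0 for orthogonal ones.
    """
    if len(vecs_a) != len(vecs_b):
        raise ValueError("both sets must contain the same number of vectors")
    n = len(vecs_a)
    total = 0.0
    for u in vecs_a:
        for v in vecs_b:
            dot = sum(x * y for x, y in zip(u, v))
            total += dot * dot
    return math.sqrt(total / n)


# Toy 3D example (made-up data): two subspaces spanned by two vectors each.
xy_plane = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
xz_plane = [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]
same = rmsip(xy_plane, xy_plane)      # identical subspaces
partial = rmsip(xy_plane, xz_plane)   # subspaces sharing one direction
```

In the toy example the two planes share one of their two directions, so the value falls between 0 and 1. With MD data, the input vectors would be the leading eigenvectors of the covariance matrices from the two simulations.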

Further, we analyzed the principal motions described by the first principal component (Fig. 3b,c). The analysis highlighted concerted motions between the loop 148–158 and a part of the activation loop (in the region 172–183), along with the loop adjacent to the catalytic lysine (in the region 35–41). The two disordered regions 148–158 and 172–183 feature motions of closure towards the rest of the ULK1 structure. The conformational change seems to be triggered by the electrostatic interactions between two arginine residues (R152 and R153) and a negatively charged residue (D102) on the facing helix. Simultaneously with this motion, a conformational change in the region 35–41 occurs where a loop moves apart from the L78 residue of the regulatory spine. The analyses of these dynamic patterns will be used in the annotations of the possible functional impact of ULK1 missense mutations found in cancer samples, discussed in the sections below.

Missense mutations in cancer of ULK1 kinase domain

We retrieved the missense mutations in the coding region of ULK1 for each of the cancer studies deposited in TCGA (Table S2, Table 1). We identified the majority of the mutations in uterine, lung, colon, and stomach tumors (i.e., UCEC, LUSC, COAD and STAD). ULK1 is also predicted as a driver gene in some of these cancer types by OncodriveCLUST, a method based on positional clustering and exploiting the notion that variants in cancer-causing genes are enriched at a few specific loci⁷². We should note that these TCGA cancer types are characterized by a high mutational burden, as shown in a recent study⁷³. Thus, it is not surprising that these tumor types have a higher number of missense mutations of ULK1.

Table 1 Summary of the missense mutations of ULK1 kinase domain analyzed in this study.


ULK1 is a multi-domain protein with scaffolding functions and, as such, can interact with a multitude of other proteins. For a proper assessment of ULK1 missense mutations, it is important to gain knowledge on the alterations, in the same tumor samples, of its interaction partners. To this end, we curated the ULK1 interactome, mining the IID protein–protein interaction database⁷⁴. We then estimated the co-occurrence of mutations among each of the 30 identified interactors and ULK1 mutations for each cancer type (Table S2). We found co-occurring mutations between ULK1 and its interactors in eleven cancer types in which ULK1 has been found mutated (see GitHub repository for more details on each of them). Among these, stomach, skin, brain, colorectal, uterine and pancreas tumor samples (Fig. 4a, as an example) are characterized by co-occurring mutations in a large number of members of the ULK1 interactome, especially the ones important for the upstream regulation of autophagy. For example, we found AMBRA1, components of the mTOR and ULK1 complex, RB1CC1/FIP200, TBC1, AMPK subunits, members of the ATG8 family, ATG16L1, BECN-1, PDPK, IRGM, RAB1A, P62, MINK1, SDCBP, and ATG13. This suggests a pronounced effect of alterations related to ULK1 function and activity and, more generally, to the upstream steps of autophagy in these cancer types.
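The text does not detail how co-occurrence was computed. One simple reading is the overlap between the tumor samples carrying a ULK1 mutation and those carrying a mutation in each interactor, which the sketch below illustrates. The function name `co_occurring_samples`, the sample identifiers, and the mutation assignments are all made up for illustration, and no statistical test of co-occurrence is attempted.

```python
def co_occurring_samples(mutated_samples, gene, partners):
    """Map each partner gene to the samples mutated in both it and `gene`.

    mutated_samples: {gene name: set of sample identifiers with a mutation}.
    Partners sharing no sample with `gene` are left out of the result.
    """
    target = mutated_samples.get(gene, set())
    shared = {}
    for partner in partners:
        common = target & mutated_samples.get(partner, set())
        if common:
            shared[partner] = common
    return shared


# Made-up sample identifiers for a single cancer type.
mutated = {
    "ULK1": {"S01", "S02", "S03"},
    "AMBRA1": {"S02", "S07"},
    "ATG13": {"S03", "S02"},
    "RAB1A": {"S09"},
}
result = co_occurring_samples(mutated, "ULK1", ["AMBRA1", "ATG13", "RAB1A"])
```

Repeating the call for each cancer type would give a per-study table of interactors mutated in the same samples as ULK1.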

We also verified whether mutations of the ULK2 kinase domain were co-occurring with mutations in ULK1, and we observed this pattern in only a few isolated cases (Table 1). The mutations occurring in the ULK2 kinase domain (D73A, T102A, R254I) are predicted to have no effect on protein stability with the methods illustrated below, resulting in minor changes of free energy associated with stability (i.e., 0.2–1.6 kcal/mol). We noticed that most of the mutations occurred in cancer types where ULK1 and ULK2 gene expression is down-regulated.

Mutations in the catalytic domain of ULK1

As mentioned above, the kinase domain of ULK1 is the only part of the protein for which a structure is available and which allows us to use other methods to assess the impact of mutations, beyond the observation of co-occurrence.

In total, we collected 36 different missense mutations of the ULK1 kinase domain distributed over the whole structure (Fig. 4b), of which D138N of the HRD motif occurred in both pancreatic and lung cancer samples, and the R137, R252, and R261 sites were mutated to different residues in different cancer types.

We then turned our attention to a workflow similar to the one that we recently applied to another protein⁵³ for a more comprehensive assessment and understanding of the effects induced by ULK1 mutations. We evaluated different properties to discriminate between effects associated with the structural stability of the protein or with its function. Moreover, these analyses help to link the effects of the mutations with specific factors for ULK1 activity or regulation, which is an important piece of knowledge to guide the experimental characterization of the molecular mechanisms and phenotypes associated with each mutation. The analyses used for the assessment are described one by one in the following sections, for the sake of clarity.

Interplay of the ULK1 mutations with post-translational modifications and functional motifs

As the first factor for our assessment, we evaluated each mutation site in the context of interplay with post-translational modifications (PTMs) and overlap with functional short linear motifs (SLiMs), along with the potential of harboring new PTM sites upon mutation (Table 1). These are properties that can ultimately influence the regulation of the protein or its spectrum of interactions.

Moreover, we evaluated whether any additional mutations were found in the LC3 interaction region (LIR) of ULK1^75,76,77, which is placed in a distal region with respect to the kinase domain. This analysis was motivated by the observation that mutations in members of the ATG8 family are co-occurring with mutations of ULK1 in some of the TCGA cancer studies (Fig. 4b). LIRs are SLiMs for interaction between the LC3/GABARAP (ATG8) family members and other autophagy proteins and key mediators of autophagosome formation⁷⁸. We recently found mutations of LC3B co-occurring with mutations in its LIR-containing interactors in cancer genomic data⁵³, and we thus aimed to evaluate if the same happens for ULK1. We did not find any mutations in the surroundings of the ULK1 LIR region in the TCGA samples under investigation, thus suggesting that the recognition between ULK1 and the ATG8 family is not a major driver of its alterations in these cancer samples.

The only mutation in the proximity of a SLiM is D279N, at the C-terminal end of the kinase domain, in a region that includes an IAP (Inhibitor of Apoptosis Protein) binding motif (IBM). This motif has not been characterized in ULK1 yet, to the best of our knowledge. Interestingly, one of the ULK1 interactors, BIRC2 (Table S2), is a cellular inhibitor of apoptosis and promotes autophagy, interacting with ULK1 during mitophagy⁷⁹. We speculate that this interaction could be mediated by the IAP motif of ULK1 and that the D279N mutation could impair it. The mutation has been found in uterine cancer, where mutations of ULK1 and BIRC2 are co-occurring (see GitHub repository).

We did not identify any mutations directly altering an experimentally validated PTM site. Nevertheless, we find one solvent-exposed mutation site (S195) which is predicted as a phosphorylation site for DNAPK or ATM kinases. A substitution to proline could abolish this modification. We then analyzed the mutations to serine, threonine or tyrosine for their capability to introduce new phosphorylatable residues, along with mutations to cysteine for their possibility to introduce a redox-sensitive post-translational modification, i.e., S-nitrosylation⁸⁰ (Table 1, Table S2). P250S ULK1 could result in a serine phosphorylatable by PKC kinase, and the R137C mutation in the HRD motif could introduce a possible S-nitrosylation site.

Assessment of the impact on ULK1 protein stability of the missense mutations

One of the main effects that a mutation can have on a protein is to alter its structural stability, causing local misfolding and a higher propensity for the mutated variant to be targeted by pathways for protein clearance, such as proteasomal degradation^42,81,82. In this scenario, the function of the protein will also be affected, but mostly as the result of compromised protein levels and not necessarily an alteration of its capability to interact with the biological partners or be an active enzyme.

We used a high-throughput saturation mutagenesis approach⁸³ based on an empirical energy function implemented in FoldX to predict the effect on protein stability induced by all the possible substitutions of each position of the structure of the kinase domain of ULK1. This approach has the advantage of providing both the estimate of the damaging effect of the disease-related mutation of interest and a pre-computed list of predicted changes in stability for any other mutations of the protein. The latter is useful to identify important hotspots for the structure of ULK1, along with pre-annotated effects of amino acid substitutions that can be consulted for newly discovered mutations in future genomics studies.

To overcome the inherent issues in local sampling and lack of backbone flexibility of FoldX, we used the MD-derived ensembles for the analysis. We estimated the changes in the free energy of folding upon mutation (Tables S3–S6), as recently applied to other case studies^46,53,84. The predicted ΔΔGs, using the two different MD ensembles of the ULK1 kinase domain, are in good agreement (Table S3). Moreover, we notice that, for some mutations, the predicted damaging effect is a result of the usage of the static X-ray structure (Table S3), whereas accounting for the flexibility of the protein structure in the prediction results in a neutral effect. An example of this behavior is A28V (Table S3). Based on this observation, we used the ΔΔG predictions from the calculation on the MD ensembles to classify the ULK1 mutations found in cancer patients (Fig. 4b). Two mutations (S184F and V211I) could stabilize the protein architecture, suggesting a better packing of the protein. S184 is located in the proximity of aromatic residues, including the one of the DFG motif, which is important for catalysis, so it cannot be excluded that the S184F mutation could alter the functional dynamics of the kinase at this site.
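The idea of using an ensemble rather than a single structure can be sketched as follows: a ΔΔG is predicted on each frame extracted from the MD trajectory, and the average over frames is used for classification. The snippet assumes the per-frame values have already been produced by an external tool. The function name `classify_ddg`, the symmetric cutoff, and the numbers are hypothetical and are not taken from the study.

```python
from statistics import mean


def classify_ddg(ddg_per_frame, cutoff):
    """Average per-frame ddG values (kcal/mol) and assign a coarse label.

    cutoff: positive threshold supplied by the user; values at or above it
    are called destabilizing, values at or below -cutoff stabilizing.
    """
    if cutoff <= 0:
        raise ValueError("cutoff must be positive")
    avg = mean(ddg_per_frame)
    if avg >= cutoff:
        label = "destabilizing"
    elif avg <= -cutoff:
        label = "stabilizing"
    else:
        label = "neutral"
    return avg, label


# Made-up per-frame predictions for two made-up variants; the cutoff of
# 2.0 kcal/mol is chosen for illustration only.
avg_a, label_a = classify_ddg([0.4, 0.9, 0.2, 0.5], cutoff=2.0)
avg_b, label_b = classify_ddg([2.8, 3.1, 2.5, 3.6], cutoff=2.0)
```

Averaging damps the effect of a single unfavorable conformation, which is one way a mutation can appear damaging on a static structure yet neutral on the ensemble.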

We also notice that some mutation sites are more general hotspots for ULK1 structural stability (Fig. 5a). In these cases, the sites are sensitive to substitutions to most of the other residues (i.e., F14, S56, L78, F81, A125, R137, A169, G183, and F273). I135 is also a stability hotspot, but the I135V mutation, which was found in the cancer samples, is one of the few tolerated substitutions, suggesting a neutral effect.

Next, the availability of MD ensembles for the ULK1 kinase domain prompted us to apply a Protein Structure Network (PSN) approach based on the persistence of side-chain contacts in the conformational ensemble^85,86 to estimate hub residues, which often correspond to important residues for protein stability (Fig. 5b,c). We verified which of the mutation sites are found in correspondence or proximity of a hub, as an additional parameter to evaluate the impact of the substitution on protein stability. Moreover, to account for the fact that a mutated residue in a hub position can still retain its hub capability, we collected the same analyses on conformational ensembles derived for each of the mutant variants (Table S3). According to this parameter, we classified as damaging only the mutations for which the substitution abolishes the hub behavior. We find only one case, i.e., F14L, where the hub behavior is conserved upon mutation. Most of the mutation sites in PSN hubs also correspond to mutational hotspots associated with high ΔΔGs for protein stability, except for D138N.
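A hub search of this kind can be sketched in a few lines: contacts that persist in enough frames become edges, and residues with many edges are called hubs. In the snippet below, the persistence threshold (20% of frames), the minimum degree (3), the function name `find_hubs`, and the residue labels are assumptions made for illustration and do not reproduce the parameters of the study.

```python
from collections import defaultdict


def find_hubs(contact_persistence, min_persistence=20.0, min_degree=3):
    """Return residues with at least `min_degree` persistent contacts.

    contact_persistence: {(res_i, res_j): percent of frames in contact}.
    Both thresholds are illustrative assumptions.
    """
    degree = defaultdict(int)
    for (res_i, res_j), persistence in contact_persistence.items():
        if persistence >= min_persistence:
            degree[res_i] += 1
            degree[res_j] += 1
    return {res: k for res, k in degree.items() if k >= min_degree}


# Made-up contact persistences (percent of frames) between generic residues.
contacts = {
    ("res1", "res2"): 85.0,
    ("res1", "res3"): 64.0,
    ("res1", "res4"): 41.0,
    ("res2", "res3"): 12.0,   # below the threshold, not counted as an edge
    ("res4", "res5"): 55.0,
}
hubs = find_hubs(contacts)
```

Running the same function on the contact map of a mutant ensemble and comparing the two results would show whether a hub is lost upon substitution.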

Another way in which a mutation could impair structural stability is through the loss of electrostatic interactions in the form of salt bridges or hydrogen bonds. We calculated the persistence of the salt bridges and their network in the MD ensembles and annotated which of the mutation sites were likely to abolish these interactions (Table 2). Most of the mutations of residues involved in salt bridges are likely to have marginal effects, since they conserve the negatively charged nature of the wild-type residue or they are replaced by asparagine, which could still account for electrostatic interactions with the guanidinium group of arginine. The only mutations which could impair salt-bridge formation are R152L and D268H. R152 is involved in salt bridges in the CHARMM27 MD ensemble, whereas it shows a weak tendency to form electrostatic interactions in the CHARMM22star MD ensemble. MD force fields are known to overestimate salt-bridge contributions^87,88 and, according to a recent benchmarking⁸⁸, we selected the CHARMM22star results for the annotation of the mutations with respect to salt-bridge formation. 58% of the mutations of the ULK1 kinase domain found in the different cancer types are predicted to be damaging for protein stability using at least one of the two criteria above.
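Persistence here can be read as the percentage of frames in which a given pair of charged groups is close enough to count as a salt bridge. A minimal sketch, assuming the per-frame distances between the charged groups have already been extracted from the trajectory; the 4.5 Å cutoff, the function name `salt_bridge_persistence`, and the distances are illustrative assumptions.

```python
def salt_bridge_persistence(distances, cutoff=4.5):
    """Percent of frames in which the charged groups lie within `cutoff` (Å).

    distances: one distance per trajectory frame.
    cutoff: distance threshold in Å (assumed value).
    """
    if not distances:
        raise ValueError("no frames supplied")
    formed = sum(1 for d in distances if d <= cutoff)
    return 100.0 * formed / len(distances)


# Made-up per-frame distances (Å) for one pair of charged groups.
frames = [3.1, 3.4, 6.2, 3.0, 7.5, 3.3, 3.9, 5.1, 3.2, 3.6]
persistence = salt_bridge_persistence(frames)
```

Comparing the value obtained for the same pair in two force fields is one way to spot interactions whose stability depends on the physical model.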

Table 2 Salt-bridges involving ULK1 mutation sites characterized by charged residues, along with their persistence in the MD ensemble.


Assessment of the impact of mutations on ULK1 function

The availability of structure and dynamics of ULK1 allowed us to predict effects induced by the mutations on its function. We evaluated the occurrence of the mutations in proximity of the disordered regions affected by the functional motions identified by Principal Component Analysis (Figs. 3b,c, 5b,c). R152L and D102N are likely to impair or weaken the functional motions observed for the region 148–158, which are triggered by their electrostatic interactions. We cannot rule out that the presence of R153 in the R152L mutant variant could partially compensate for the mutation. On the other hand, different mutation sites are in the area of the lateral motion of the region 172–183 of the activation loop, such as M177 in the loop itself, S195 in the region where the loop bends, and Y171 and G183/S184, which act as hinges for the motion of these regions. It could be expected that these mutations impair ULK1 functional dynamics. In addition, the regulatory spine residue L78 and F81, in the proximity of the other disordered loop (35–41) involved in the concerted conformational changes, also correspond to mutation sites.

PSN approaches can be used to infer functionally-damaging sites if the paths of communication between the mutation sites and other important functional sites for the kinase activity are considered. This analysis can shed light on effects that are likely to be transmitted long-range, often at the base of allostery^45,47,89. We estimated all the shortest paths of communication between each mutation site and five important classes of residues for kinase function. In particular, we selected five groups of target residues: (i) residues important for activity (K46, E63, M62, T180 and K162); (ii) residues of the DFG, HRD, and APE motifs; (iii) central residues of the C-helix (55–65); (iv) the residues of the catalytic spine; and (v) the residues of the regulatory spine (Fig. 6a,b). We selected only those paths conserved in both the CHARMM22star and CHARMM27 simulations (Table S7). We did not find any communication paths to the regulatory spine or the APE motif. In contrast, a subset of mutation sites (A28, A101, D102, D138, N96, and R137) was communicating with at least two of the target areas of interest, often using multiple paths. This result suggests that substitutions at these sites could be detrimental for functional long-range communication or could enhance it (if oncogenic). This could be especially the case when mutations alter the steric hindrance or the physicochemical properties of the wild-type residue, as in the case of A28V, A101T, R137C, and R137H. We used the statistical mechanical model implemented in AlloSigMA⁹⁰ to obtain more direct evidence that these mutations could exert an allosteric effect (see GitHub repository for the outputs).
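The path search can be illustrated on a toy network. The sketch below uses a breadth-first search on an unweighted graph and returns a single shortest path between a mutation site and a target residue, whereas the analysis described above enumerates all of them and keeps only those shared by both simulations. The function name `shortest_path`, the node labels, and the edges are made up for illustration.

```python
from collections import deque


def shortest_path(graph, source, target):
    """Breadth-first search for one shortest path in an unweighted graph.

    graph: {node: iterable of neighbours}. Returns the list of nodes from
    source to target, or None when the two are not connected.
    """
    previous = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            path = []
            while node is not None:
                path.append(node)
                node = previous[node]
            return path[::-1]
        for neighbour in graph.get(node, ()):
            if neighbour not in previous:
                previous[neighbour] = node
                queue.append(neighbour)
    return None


# Made-up residue network: nodes are residues, edges are persistent contacts.
network = {
    "site": ["a", "b"],
    "a": ["site", "c"],
    "b": ["site"],
    "c": ["a", "target"],
    "target": ["c"],
    "isolated": [],
}
path = shortest_path(network, "site", "target")
no_path = shortest_path(network, "site", "isolated")
```

A site with no path to a target group, like `isolated` here, would be reported as not communicating with that group.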

General assessment and classification of ULK1 missense mutations in TCGA

We integrated the results of each of the analyses above to provide an overall view of the several properties and layers of alterations that a mutation can cause in a protein, and which are ultimately connected to alterations in its function at the cellular level. In particular, with our framework we can assess: (i) effects on protein structural stability, which will impact the protein levels and turnover in the cell; (ii) interplay with post-translational modifications and emergence of new layers of regulation; (iii) alterations of binding regions for biological partners; and (iv) long-range functional effects. We classified the mutations according to each of these properties as damaging or neutral (Fig. 7a) and then ranked them. The ranking allowed us to identify mutations that are likely to be damaging, and to determine whether the effect is driven more by destabilization of the protein product or by a stable protein variant with impaired functionality (Fig. 7a,b).

As stated above, we observed that more than 50% of the mutations of the ULK1 kinase domain found in the cancer samples are predicted to alter protein stability. This is often accompanied by a possible impact on the native functional properties of the same variant. A minority of mutations are damaging only for stability (A125T, F273V, L215P, F14L and G12D) and do not alter the functional state of the protein. The detrimental effect of these mutations on protein stability could alter the cellular level and turnover of the protein. This effect could be dominant with respect to the effects that the same mutations exert on protein activity or interactions.

ULK1 stability could also be modulated by the interaction with ATG13 and FIP200, which bind to the cytoplasmic domain of ULK1³⁷, or by chaperonin-like proteins, such as p32⁹¹. Thus, these interactions might compensate for the loss of stability induced by some of the mutant variants. Interestingly, in several cases, the cancer types where destabilizing mutations of ULK1 occurred also feature co-occurrence of mutations in ATG13 or FIP200 (Table S3), suggesting an overall alteration of ULK1 stability. Three mutations (S184F, D102N, and A28V) are predicted to have a possible impact only on kinase activity, either by altering the functional dynamics of the protein or its capability to exert long-range effects from distal sites to the functional and catalytic regions.

We searched the literature to verify whether the mutation sites under investigation have been the subject of experimental studies and whether the results of these experiments corroborate our predictions. Most of the experimental mutations that we found relate to the mouse variant of ULK1, which shares 97.5% sequence identity with the human variant in the catalytic domain. We found a study reporting a mutation of S184 to alanine⁹². S184 is mutated to phenylalanine in head and neck TCGA samples, and we predict a marginally stabilizing effect upon mutation. Moreover, we classified the substitution at S184 as functionally damaging and likely to impair ULK1 functional dynamics, due to the hinge behavior of this residue for the motions of the activation loop. S184A and S184D have been reported to inactivate the ULK1 kinase⁹², supporting the predicted functional role rather than an effect on structural stability. We collected mutations at other sites for which the functional impact has been studied experimentally, and they are summarized in Table S3. Of interest, the S174A mutation results in a hyperactive enzyme⁹² and is located in the region of the activation loop undergoing conformational changes. Moreover, we notice that the other experimental mutations with a reported effect on ULK1 activity are predicted to have marginal effects on protein stability, except for K46N, M92A, and Y89A. These mutations result either in inactivation of the kinase⁹³ or impairment of phosphorylation of the ATG13 substrate²⁵. Our calculations suggest that the detrimental effect could be due to changes in protein stability, an aspect that deserves further investigation of the cellular levels and half-life of these variants to establish which effect is predominant.

Materials and methods

All the inputs, scripts and main outputs of this study are available in the GitHub repository https://github.com/ELELAB/ULK1_mutations. The trajectories and input files for the molecular dynamics simulations have been deposited in OSF: https://osf.io/8xuaj.

Expression levels of ULK1 in TCGA datasets

We downloaded and pre-processed level 3 harmonized RNA-Seq data (HTSeq count) for all the available datasets from TCGA. We downloaded the data in June 2019 from the Genomic Data Commons (GDC) Portal using the GDCdownload function of TCGAbiolinks⁹⁴. An overview of the analysed datasets is reported in Table S1. We employed the TCGAbiolinks function GDCprepare to obtain a SummarizedExperiment object⁹⁵. We removed outlier samples with the TCGAanalyze_Preprocessing function of TCGAbiolinks using a Spearman correlation cutoff of 0.6. We normalized the datasets for GC-content⁹⁶ and library size using the TCGAanalyze_Normalization function of TCGAbiolinks. Lastly, we filtered the normalized RNA-Seq data for low counts across samples using the function TCGAanalyze_Filtering with a 0.20 cutoff for quantile filtering. For the TCGA datasets where normal samples were missing, we used the unified dataset that integrates the Genotype-Tissue Expression (GTEx) datasets⁵² of healthy samples and the TCGA data, as provided by the Recount2 protocol⁵¹. We also employed this dataset as an additional source of information for the TCGA datasets with fewer than five normal samples (see Table S1). We used the TCGAquery_Recount2 function of TCGAbiolinks⁹⁷ to query the GTEx and TCGA unified datasets. We carried out GC-content normalization and quantile filtering on the unified datasets, as described above.

Differential expression analyses (DEA) were carried out using limma-voom⁹⁸ as implemented in the TCGAanalyze_DEA function of TCGAbiolinks⁹⁷, along with edgeR to confirm the results, as we recently applied to another case study¹⁰⁰. We included in the design matrix the conditions (tumor vs normal) and the TSS (Tissue Source Site; the center where the samples are collected) or the Plates (where available) as sources of batch effects, to assess the robustness of the estimated changes in expression with respect to different correction factors. In all our DEAs, we retained as significant differentially expressed (DE) genes those with a log fold change (logFC) ≥ 0.5 or ≤ −0.5, using a cutoff of 0.05 for the False Discovery Rate (FDR). We then retrieved the estimated logFC for ULK1 (ENSEMBL ID ENSG00000177169) and ULK2 (ENSG00000083290) in the different comparisons (see Table S1).
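The cutoffs described above amount to a simple filter on the logFC and FDR of each gene. The sketch below shows that filter in Python on hypothetical values. The analyses in this study were run with R/Bioconductor functions, so this is only an illustration of the thresholds. The function name `is_significant`, the gene labels, and all numbers are assumptions made for the example, and the strict "<" comparison for the FDR is also an assumption.

```python
def is_significant(log_fc, fdr, log_fc_cutoff=0.5, fdr_cutoff=0.05):
    """Apply the |logFC| and FDR cutoffs to a single gene."""
    # Assumption: the FDR threshold is applied as a strict "<" comparison.
    return abs(log_fc) >= log_fc_cutoff and fdr < fdr_cutoff


# Hypothetical (logFC, FDR) pairs, NOT results from the study.
toy_results = {
    "gene_a": (-0.8, 0.01),   # down-regulated and significant
    "gene_b": (0.3, 0.001),   # significant FDR but small fold change
    "gene_c": (1.2, 0.20),    # large fold change but FDR too high
    "gene_d": (0.5, 0.04),    # exactly at the logFC cutoff
}

significant = sorted(
    gene for gene, (log_fc, fdr) in toy_results.items()
    if is_significant(log_fc, fdr)
)
print(significant)
```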

Curation and analyses of missense mutations of ULK1 from TCGA

We retrieved mutations for ULK1 from each TCGA cancer study using the MuTect2 pipeline¹⁰⁰ as implemented in the TCGAbiolinks function GDCquery_Maf. We retained missense mutations in the kinase domain of ULK1 for the structural analysis. For each mutation, we also collected the following additional information: (i) REVEL score¹⁰¹; (ii) interplay with post-translational modifications and functional short linear motifs using as a source of information PhosphoSite¹⁰² and ELM¹⁰³, respectively; (iii) identification of the same mutation in COSMIC¹⁰⁴. We also verified that the mutations under investigation were not found in ExAC¹⁰⁵ as natural polymorphisms with high frequency in the healthy population and, as such, not interesting in a cancer context. We also used iSNO-AAPair¹⁰⁶, SNOSite¹⁰⁷ and NetPhos¹⁰⁸ to predict S-nitrosylation or phosphorylation sites upon mutation to cysteine or phosphorylatable (serine, threonine and tyrosine) residues, respectively. We used the NetPhos predictor only for those mutations that were in solvent-exposed sites, according to the analysis of the solvent accessibility of their side chains with NACCESS (https://wolf.bms.umist.ac.uk/naccess/). Moreover, we verified that each mutation under investigation was the only one targeting the ULK1 gene in the sample where it was identified.

Interactome of ULK1 and co-occurrence of mutations

We retrieved the experimentally known ULK1 interactors through the Integrated Interaction Database (IID) version 2018-05⁷⁴. We then estimated the co-occurrence of mutations between ULK1 and each of these interactors, along with the other ULK kinases (i.e. ULK2, ULK3 and ULK4), with the somaticInteractions function of the maftools R/Bioconductor package¹⁰⁹, which performs a pairwise Fisher’s Exact test to detect significant pairs of genes.
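To make the pairwise test mentioned above concrete, the following minimal sketch computes a two-sided Fisher's exact test for a single 2×2 table of sample counts. It is a generic textbook implementation written for illustration and is not the maftools code. The function name `fisher_exact_two_sided`, the meaning assigned to the four cells, and the counts are assumptions.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Assumed cell meaning: a = samples with both genes mutated,
    b = only gene 1 mutated, c = only gene 2 mutated, d = neither mutated.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of observing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one.
    p_value = sum(
        prob(x) for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-7)
    )
    return min(1.0, p_value)


# Hypothetical counts, NOT taken from the TCGA data.
p = fisher_exact_two_sided(8, 2, 1, 5)
print(round(p, 5))
```

The small relative tolerance in the comparison guards against floating-point ties between tables that are equally probable.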

Prediction of driver genes

We used the oncodrive function of maftools¹⁰⁹ to evaluate if ULK1 was predicted as a driver gene in any of the cancer types under investigation. The function is based on the oncodriveCLUST algorithm⁷².

Free energy calculations

We employed the FoldX energy function^110,111 to perform in silico saturation mutagenesis. Calculations with this empirical energy function resulted in an average ΔΔG (difference in ΔG between the mutant and wild-type variant) for each mutation over five independent runs performed using: (i) the X-ray structure of ULK1 (PDB entry 5CI7³²), (ii) an ensemble of 20 representative conformations from the MD simulations with CHARMM22star or (iii) with CHARMM27. The protocol is detailed in our previous publication⁴⁶. We also performed a literature-based curation of mutations for which the effects have been studied experimentally and used them as a control of the quality of our predictions.
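The averaging described above can be sketched as follows. The snippet assumes that each run provides a pair of folding free energies (mutant and wild type, in kcal/mol) and that, for an ensemble, the per-conformation averages are themselves averaged. Both choices are assumptions made for illustration, and the function names and all energy values are hypothetical.

```python
from statistics import mean


def average_ddg(dg_mutant_runs, dg_wildtype_runs):
    """Mean ΔΔG (mutant minus wild type) over paired independent runs."""
    return mean(m - w for m, w in zip(dg_mutant_runs, dg_wildtype_runs))


def ensemble_average_ddg(per_conformation_runs):
    """Average the per-conformation mean ΔΔG over a structural ensemble.

    Assumption: each conformation contributes equally to the final value.
    """
    return mean(average_ddg(mut, wt) for mut, wt in per_conformation_runs)


# Hypothetical ΔG values in kcal/mol for five runs, NOT FoldX output.
mutant_runs = [12.0, 12.5, 11.5, 12.0, 12.0]
wildtype_runs = [10.0, 10.0, 10.0, 10.0, 10.0]
single_structure_ddg = average_ddg(mutant_runs, wildtype_runs)

# Two hypothetical conformations of an ensemble.
ensemble_ddg = ensemble_average_ddg([
    (mutant_runs, wildtype_runs),
    ([11.0] * 5, [10.0] * 5),
])
print(single_structure_ddg, ensemble_ddg)
```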

Molecular dynamics simulations

We carried out 1-μs molecular dynamics (MD) simulations for the human ULK1 kinase domain in explicit solvent using GROMACS software version 4.6¹¹². We used as starting structure the PDB entry 5CI7 after in silico retro-mutation of the phospho-Thr 180 to Thr, to provide a model of the unphosphorylated variant of the domain. We used two different protein force fields, CHARMM22*¹¹³ and CHARMM27¹¹⁴, in combination with the TIP3P water model¹¹⁵ to evaluate the robustness of our results with respect to different physical models.

We used a dodecahedral box applying periodic boundary conditions and a concentration of NaCl of 150 mM, neutralizing the charges of the system. The simulated system (protein + water) comprised 83,077 atoms. The system was prepared through successive steps of minimization, solvent equilibration, thermalization and pressurization. We carried out production MD simulations in the canonical ensemble at 300 K using velocity rescaling with a stochastic term¹¹⁶. We applied the LINCS algorithm¹¹⁷ to constrain the heavy atom bonds, which allowed a time step of 2 fs. We calculated long-range electrostatic interactions using the Particle-mesh Ewald (PME) summation scheme¹¹⁹, whereas we truncated van der Waals and short-range Coulomb interactions at 10 Å.

We verified the absence of artificial contacts between the periodic images of the protein in the simulations, which were always at a distance greater than 30 Å. We evaluated the quality of the conformational ensemble on 100 representative conformations equally spaced in time for each of the simulations, using the machine-learning based approach implemented in ResProx¹¹⁹ to predict the atomic resolution from structural ensembles of proteins. We obtained a predicted resolution of 1.53 ± 0.30 and 1.66 ± 0.12 Å for the CHARMM22star and CHARMM27 simulations of the ULK1 kinase domain, respectively. These values are very close to the resolution of the corresponding X-ray structure (1.74 Å), suggesting a good overall quality of the MD-based ensembles.

We used Principal Component Analysis (PCA) of the covariance matrix of Cα atomic fluctuations to extract the principal motions from the MD simulations¹²⁰. We performed the PCA on a concatenated trajectory of the two MD simulations of ULK1 to compare them in the same essential subspace.
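A minimal sketch of the PCA step described above is given below, using only the Python standard library. It builds the covariance matrix from a concatenated set of frames and extracts the first principal component by power iteration, which is a swapped-in simplification of a full eigendecomposition and not the implementation used in this study. The two-dimensional toy coordinates and the function names are hypothetical; real input would be the flattened Cα coordinates of each fitted frame.

```python
def covariance_matrix(frames):
    """Covariance matrix of the coordinate fluctuations around the mean."""
    n, dim = len(frames), len(frames[0])
    means = [sum(f[i] for f in frames) / n for i in range(dim)]
    centered = [[f[i] - means[i] for i in range(dim)] for f in frames]
    return [
        [sum(row[i] * row[j] for row in centered) / (n - 1) for j in range(dim)]
        for i in range(dim)
    ]


def first_principal_component(cov, n_iter=200):
    """Largest eigenvalue and eigenvector of cov by power iteration."""
    dim = len(cov)
    # Non-uniform start vector; a start exactly orthogonal to the leading
    # eigenvector would not converge (unlikely for real data).
    vec = [1.0 + 0.1 * i for i in range(dim)]
    for _ in range(n_iter):
        new = [sum(cov[i][j] * vec[j] for j in range(dim)) for i in range(dim)]
        norm = sum(x * x for x in new) ** 0.5
        if norm == 0.0:
            break
        vec = [x / norm for x in new]
    eigenvalue = sum(
        vec[i] * sum(cov[i][j] * vec[j] for j in range(dim)) for i in range(dim)
    )
    return eigenvalue, vec


# Hypothetical frames from two toy trajectories, concatenated before the PCA
# so that both are projected onto the same essential subspace.
trajectory_a = [(0.0, 0.0), (1.0, 2.0)]
trajectory_b = [(2.0, 4.0), (3.0, 6.0)]
concatenated = trajectory_a + trajectory_b

eigenvalue, pc1 = first_principal_component(covariance_matrix(concatenated))
print(eigenvalue, pc1)
```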

CABS_flex ensembles of selected ULK1 mutant variants

For a selection of mutant variants of ULK1 (F14L, N96D, A101T, A125T, R137C, R137H and D138N) that have hub-behavior, we also collected conformational ensembles using the coarse-grained approach implemented in CABS_flex 2.0¹²¹.

We used as starting structures for CABS_flex calculations the models generated by FoldX during the mutational scan for each of these mutations. In particular, we selected the most representative models in terms of rotameric state of the mutated residue for each mutation. We then collected an ensemble of ten different conformations for each mutant variant to be used for the contact-based PSN analyses of hub residues described below, upon reconstruction of the corresponding full-atom models.

Protein structure networks

We applied a contact-based Protein Structure Network (PSN) analysis to the MD ensemble, as implemented in PyInteraph⁸⁵. We defined as hubs those residues of the network with at least three edges⁴³. We used the node inter-connectivity to calculate the connected components, which are clusters of connected residues in the graph. We selected 5 Å as the optimal cutoff for the contact-based PSN using the PyInKnife pipeline⁸⁶. The distance was estimated between the centers of mass of the residue side chains. We removed spurious interactions during the simulations by applying a persistence cutoff of 20% (i.e., each contact was included as an edge of the PSN only if it occurred in at least 20% of the MD frames), as indicated in the original implementation of the method⁸⁵. We applied a variant of the depth-first search algorithm to identify the shortest paths of communication. We defined the shortest path as the path in which the two residues were non-covalently connected by the smallest number of intermediate nodes.
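The network construction described above can be illustrated with the following minimal sketch, which filters contacts by persistence, identifies hubs, and computes connected components. It assumes that the per-frame contacts have already been determined with the distance cutoff, and it is not the PyInteraph implementation. The residue indices, the frames, and the function names are hypothetical, and treating the 20% cutoff as "at least 20% of the frames" is an assumption.

```python
from collections import Counter


def build_psn(frames, persistence_cutoff=0.20):
    """Keep a contact as an edge if it occurs in enough frames."""
    counts = Counter()
    for contacts in frames:
        # Sort each pair so that (i, j) and (j, i) count as the same contact.
        counts.update({tuple(sorted(pair)) for pair in contacts})
    graph = {}
    for (a, b), n in counts.items():
        if n / len(frames) >= persistence_cutoff:
            graph.setdefault(a, set()).add(b)
            graph.setdefault(b, set()).add(a)
    return graph


def find_hubs(graph, min_degree=3):
    """Residues with at least min_degree edges."""
    return sorted(node for node, nbrs in graph.items() if len(nbrs) >= min_degree)


def connected_components(graph):
    """Clusters of residues connected to each other in the graph."""
    seen, components = set(), []
    for start in graph:
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(graph[node] - component)
        seen |= component
        components.append(component)
    return components


# Ten hypothetical frames; residue indices are placeholders.
frames = [{(1, 2), (1, 3), (1, 4)} for _ in range(10)]
frames[0].add((5, 6))
frames[1].add((6, 5))   # same contact, reversed order: 2/10 frames, kept
frames[2].add((2, 7))   # 1/10 frames, below the cutoff, discarded

psn = build_psn(frames)
hubs = find_hubs(psn)
components = sorted(sorted(c) for c in connected_components(psn))
print(hubs, components)
```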

We also calculated the persistence of salt bridges and hydrogen bonds, and the corresponding networks, with PyInteraph. For salt bridges, all the distances between atom pairs belonging to the charged moieties of two oppositely charged residues were calculated. The charged moieties were considered as interacting if at least one pair of atoms was found at a distance shorter than 4.5 Å. In the case of aspartate and glutamate residues, the atoms forming the carboxylic group were considered. The NH₃⁺ and guanidinium groups were employed for lysine and arginine, respectively. We also verified the consistency of the results with a cutoff of 5 Å. We applied a persistence cutoff of 20% to filter the interactions for these networks as well.
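The salt-bridge criterion described above can be sketched as follows: in each frame, two charged groups interact if any pair of their atoms is closer than the cutoff, and the persistence is the percentage of frames in which this holds. This is an illustrative sketch and not the PyInteraph code. The coordinates (in Å) and the function names are hypothetical.

```python
from math import dist


def is_salt_bridge(group_a, group_b, cutoff=4.5):
    """True if at least one atom pair of the two charged groups is within cutoff."""
    return any(dist(p, q) < cutoff for p in group_a for q in group_b)


def persistence(frames, cutoff=4.5):
    """Percentage of frames in which the salt bridge is formed."""
    hits = sum(is_salt_bridge(a, b, cutoff) for a, b in frames)
    return 100.0 * hits / len(frames)


# Hypothetical coordinates: two atoms of one charged group (fixed) and one
# atom of the oppositely charged group in four frames.
group_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
frames = [
    (group_a, [(4.0, 0.0, 0.0)]),  # minimum distance 3.0 Å
    (group_a, [(5.4, 0.0, 0.0)]),  # minimum distance 4.4 Å
    (group_a, [(5.8, 0.0, 0.0)]),  # minimum distance 4.8 Å
    (group_a, [(3.0, 0.0, 0.0)]),  # minimum distance 2.0 Å
]

persistence_45 = persistence(frames, cutoff=4.5)
persistence_50 = persistence(frames, cutoff=5.0)
print(persistence_45, persistence_50)
```

Repeating the calculation with the two cutoffs mirrors the consistency check mentioned in the text.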

Conclusions

The assessment of the different effects that a mutation can exert on a protein, as explored in this study, and the subsequent classification of the mutations can provide a useful complement to cancer genomics studies. For example, it allows the identification of mutations that are likely to be ‘driver’ or ‘passenger’, and helps to predict whether the effect is triggered more by a destabilization of the protein product or by a protein variant with impaired functionality. Moreover, our combined approach for mutation assessment could also benefit the prioritization and selection of mutant variants for experimental validation in cells. Indeed, it can suggest how to select the proper readout for experimental validation. As an example, where a mutation is predicted to be damaging for stability, experiments to estimate the cellular levels and half-life of the variant could be used, along with readouts to evaluate if the changes are due to proteasomal degradation or other clearance mechanisms. On the other hand, if a mutation is predicted to result in a variant that is as stable as the wild type but has altered function, the most suitable choices would be experiments to evaluate its interactions in the cell with the biological partners, its regulation by PTMs, or cellular assays to evaluate the effects on the pathways in which interactors of the target protein are involved. Moreover, we showed how the structural analyses used here benefit from the integration of bioinformatic tools to assess the changes in expression level of the target gene, along with changes in other genes that can have compensatory effects, as we exemplified for ULK2. In addition, extending the analyses to the interactome of the target protein, in terms of co-occurrence of alterations and the synergistic effects that can arise from them, provides a comprehensive view and helps to pinpoint interesting alterations at the molecular level.
We here showed how our workflow can help in the study of a key kinase of the autophagy pathway, ULK1. We found that in the majority of the cases the gene expression levels are not altered or can be compensated by an up-regulation of the homologous kinase ULK2, whereas more than 30 different missense mutations altering the coding region of the gene have been identified. These mutations co-occur with mutations in ULK1 interactors fundamental for the upstream regulation of autophagy, suggesting an impairment of this process in cancer types such as uterine, stomach, skin and colon cancers and glioblastoma. Moreover, our study allowed us to pinpoint that more than 50% of the mutations of the ULK1 kinase domain found in the cancer samples have an effect on protein stability, which is likely to be more pronounced than the residual effect on protein activity, especially if it cannot be compensated by interactions with regulators of cellular ULK1 stability, which are also altered in the samples under investigation. We identified three mutations (S184F, D102N, and A28V) that are predicted to impact only kinase activity, either by altering the functional dynamics of the protein or its capability to exert long-range effects from distal sites to the functional and catalytic regions. Due to the paucity of experimental data on ULK1 mutations, future studies will be required to understand if these mutations have an inhibitory or activating role on the kinase. The framework applied here could be more broadly extended to other targets of interest, as we recently started to do, to help in the classification of mutational effects, along with prioritizing the variants for experimental validation and selecting a specific biological readout.

References

Reggiori, F. & Klionsky, D. J. Autophagy in the eukaryotic cell. Eukaryot. Cell 1, 11–21 (2002).
Galluzzi, L. et al. Molecular definitions of autophagy and related processes. EMBO J. 36, 1811–1836 (2017).
Yin, Z., Pascual, C. & Klionsky, D. Autophagy: machinery and regulation. Microb. Cell 3, 457–465 (2016).
Yu, L., Chen, Y. & Tooze, S. A. Autophagy pathway: cellular and molecular mechanisms. Autophagy 14, 207–215 (2018).
Mizushima, N., Levine, B., Cuervo, A. M. & Klionsky, D. J. Autophagy fights disease through cellular self-digestion. Nature 451, 1069–1075 (2008).
Noda, N. N. & Fujioka, Y. Atg1 family kinases in autophagy initiation. Cell. Mol. Life Sci. 72, 3083–3096 (2015).
Zachari, M. & Ganley, I. G. The mammalian ULK1 complex and autophagy initiation. Essays Biochem. 61, 585–596 (2017).
Mizushima, N. The role of the Atg1/ULK1 complex in autophagy regulation. Curr. Opin. Cell Biol. 22, 132–139 (2010).
Lin, M. G. & Hurley, J. H. Structure and function of the ULK1 complex in autophagy. Curr. Opin. Cell Biol. 39, 61–68 (2016).
Antonioli, M., Di Rienzo, M., Piacentini, M. & Fimia, G. M. Emerging mechanisms in initiating and terminating autophagy. Trends Biochem. Sci. xx, 1–14 (2016).
Dorsey, F. C. et al. Mapping the phosphorylation sites of Ulk1. J. Proteome Res. 8, 5253–5263 (2009).
Alers, S., Löffler, A. S., Wesselborg, S. & Stork, B. The incredible ULKs. Cell Commun. Signal. 10, 7 (2012).
Nazio, F. et al. Fine-tuning of ULK1 mRNA and protein levels is required for autophagy oscillation. J. Cell Biol. 215, 841–856 (2016).
Egan, D. F. et al. Phosphorylation of ULK1 (hATG1) by AMP-activated protein kinase connects energy sensing to mitophagy. Science 331, 456–461 (2011).
Kim, J., Kundu, M., Viollet, B. & Guan, K.-L. AMPK and mTOR regulate autophagy through direct phosphorylation of Ulk1. Nat. Cell Biol. 13, 132–141 (2011).
Nazio, F. et al. mTOR inhibits autophagy by controlling ULK1 ubiquitylation, self-association and function through AMBRA1 and TRAF6. Nat. Cell Biol. 15, 406–416 (2013).
Wong, P.-M.M., Puente, C., Ganley, I. G. & Jiang, X. The ULK1 complex sensing nutrient signals for autophagy activation. Autophagy 9, 124–137 (2013).
Puente, C., Hendrickson, R. C. & Jiang, X. Nutrient-regulated phosphorylation of ATG13 inhibits starvation-induced autophagy. J. Biol. Chem. 291, 6026–6035 (2016).
Russell, R. C. et al. ULK1 induces autophagy by phosphorylating Beclin-1 and activating VPS34 lipid kinase. Nat. Cell Biol. 15, 741–750 (2013).
Park, J. M. et al. The ULK1 complex mediates MTORC1 signaling to the autophagy initiation machinery via binding and phosphorylating ATG14. Autophagy 12, 547–564 (2016).
Egan, D. F. et al. Small molecule inhibition of the autophagy kinase ULK1 and identification of ULK1 substrates. Mol. Cell 59, 285–297 (2015).
McAlpine, F., Williamson, L. E., Tooze, S. A. & Chan, E. Y. W. Regulation of nutrient-sensitive autophagy by uncoordinated 51-like kinases 1 and 2. Autophagy 9, 361–373 (2013).
Lu, J. et al. Overexpression of ULK1 represents a potential diagnostic marker for clear cell renal carcinoma and the antitumor effects of SBI-0206965. EBioMedicine 34, 85–93 (2018).
Wu, D. hao et al. Combination of ULK1 and LC3B improve prognosis assessment of hepatocellular carcinoma. Biomed. Pharmacother. 97, 195–202 (2018).
Zhang, L. et al. Discovery of a small molecule targeting ULK1-modulated cell death of triple negative breast cancer in vitro and in vivo. Chem. Sci. 8, 2687–2701 (2017).
Yun, M. et al. ULK1: A promising biomarker in predicting poor prognosis and therapeutic response in human nasopharygeal carcinoma. PLoS ONE 10, 1–15 (2015).
Martinet, W., Agostinis, P., Vanhoecke, B., Dewaele, M. & Meyer, G. R. Y. D. E. Autophagy in disease: a double-edged sword with therapeutic potential. Clin. Sci. 712, 697–712 (2009).
Singh, S. S. et al. Dual role of autophagy in hallmarks of cancer. Oncogene 37, 1142–1158 (2018).
Jang, J. E. et al. AMPK-ULK1-mediated autophagy confers resistance to BET inhibitor JQ1 in acute myeloid leukemia stem cells. Clin. Cancer Res. 23, 2781–2794 (2017).
Petherick, K. J. et al. Pharmacological inhibition of ULK1 kinase blocks mammalian target of rapamycin (mTOR)-dependent autophagy. J. Biol. Chem. 290, 11376–11383 (2015).
Lazarus, M. B., Novotny, C. J. & Shokat, K. M. Structure of the human autophagy initiating kinase ULK1 in complex with potent inhibitors. ACS Chem. Biol. 10, 257–261 (2015).
Lazarus, M. B. & Shokat, K. M. Discovery and structure of a new inhibitor scaffold of the autophagy initiating kinase ULK1. Bioorganic Med. Chem. 23, 5483–5488 (2015).
Galluzzi, L. et al. Autophagy in malignant transformation and cancer progression. EMBO J. 34, 856–880 (2015).
Chaikuad, A. et al. Conservation of structure, function and inhibitor binding in UNC-51-like kinase 1 and 2 (ULK1/2). Biochem. J. BCJ20190038 (2019). https://doi.org/10.1042/BCJ20190038
Nicolaou, C. A. et al. Idea2Data: toward a new paradigm for drug discovery. ACS Med. Chem. Lett. 10, 278–286 (2019).
Mercer, T. J., Gubas, A. & Tooze, S. A. A molecular perspective of mammalian autophagosome biogenesis. J. Biol. Chem. 293, 5386–5395 (2018).
Ganley, I. G. et al. ULK1·ATG13·FIP200 complex mediates mTOR signaling and is essential for autophagy. J. Biol. Chem. 284, 12297–12305 (2009).
Zhu, Y. et al. ULK1 and JNK are involved in mitophagy incurred by LRRK2 G2019S expression. Protein Cell 4, 711–721 (2013).
Nussinov, R., Tsai, C. & Jang, H. Protein ensembles link genotype to phenotype. PLoS Comput. Biol. 15, e1006648 (2019).
Nussinov, R. Precision medicine review: rare driver mutations and their biophysical classification. Biophys. Rev. 11, 5–19 (2019).
Naganathan, A. N. Modulation of allosteric coupling by mutations: from protein dynamics and packing to altered native ensembles and function. Curr. Opin. Struct. Biol. 54, 1–9 (2019).
Stein, A., Fowler, D. M., Hartmann-Petersen, R. & Lindorff-Larsen, K. Biophysical and mechanistic models for disease-causing protein variants. Trends Biochem. Sci. 1, 1–14 (2019). https://doi.org/10.1016/j.tibs.2019.01.003
Papaleo, E. Integrating atomistic molecular dynamics simulations, experiments, and network analysis to study protein dynamics: strength in unity. Front. Mol. Biosci. 2, 1–6 (2015).
Di Paola, L. & Giuliani, A. Protein contact network topology: a natural language for allostery. Curr. Opin. Struct. Biol. 31, 43–48 (2015).
Lambrughi, M. et al. DNA-binding protects p53 from interactions with cofactors involved in transcription-independent functions. Nucleic Acids Res. 44, 9096–9109 (2016).
Nygaard, M. et al. The mutational landscape of the oncogenic MZF1 SCAN domain in cancer. Front. Mol. Biosci. 3, 1–18 (2016).
Papaleo, E. et al. The role of protein loops and linkers in conformational dynamics and allostery. Chem. Rev. 116, 6391–6423 (2016).
Chang, K. et al. The cancer genome atlas pan-cancer analysis project. Nat. Genet. 45, 1113–1120 (2013).
Hutter, C. & Zenklusen, J. C. The cancer genome atlas: creating lasting value beyond its data. Cell 173, 283–285 (2018).
Frazee, A. C., Langmead, B. & Leek, J. T. ReCount: A multi-experiment resource of analysis-ready RNA-seq gene count datasets. BMC Bioinformatics 12, 449 (2011).
Collado-Torres, L. et al. Reproducible RNA-seq analysis using recount2. Nat. Biotechnol. 35, 319–321 (2017).
Carithers, L. J. & Moore, H. M. The genotype-tissue expression (GTEx) project. Biopreserv. Biobank. 13, 307–308 (2015).
Aykac Fas, B. et al. The conformational and mutational landscape of the ubiquitin-like marker for the autophagosome formation in cancer. bioRxiv (2019). https://doi.org/10.1101/635284
Piovesan, D., Walsh, I., Minervini, G. & Tosatto, S. C. E. FELLS: Fast estimator of latent local structure. Bioinformatics 33, 1889–1891 (2017).
Tomczak, K., Czerwińska, P. & Wiznerowicz, M. The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge. Współczesna Onkol. 1A, 68–77 (2015).
Collado-Torres, L., Nellore, A. & Jaffe, A. E. recount workflow: Accessing over 70,000 human RNA-seq samples with Bioconductor. F1000Research 6, 1558 (2017).
Lee, E. & Tournier, C. The requirement of uncoordinated 51-like kinase 1 (ULK1) and ULK2 in the regulation of autophagy. Autophagy 7, 689–695 (2011).
Modi, V. & Dunbrack, R. L. A structurally validated sequence alignment of all 497 typical human protein kinase domains. bioRxiv (2019).
Bach, M., Larance, M., James, D. E. & Ramm, G. The serine/threonine kinase ULK1 is a target of multiple phosphorylation events. Biochem. J. 440, 283–291 (2011).
Lin, S. Y. et al. Protein phosphorylation-acetylation cascade connects growth factor deprivation to autophagy. Autophagy 8, 1385–1386 (2012).
Papinski, D. & Kraft, C. Regulation of autophagy by signaling through the Atg1/ULK1 complex. J. Mol. Biol. 1725–1741 (2016). https://doi.org/10.1016/j.jmb.2016.03.030
Chen, C. et al. Identification of a major determinant for serine-threonine kinase phosphoacceptor specificity. Mol. Cell 53, 140–147 (2013).
Liu, Q. et al. Developing irreversible inhibitors of the protein kinase cysteinome. Chem. Biol. 20, 146–159 (2013).
Bhullar, K. S. et al. Kinase-targeted cancer therapies: progress, challenges and future directions. Mol. Cancer 17, 48 (2018).
Garske, A. L., Peters, U., Cortesi, A. T., Perez, J. L. & Shokat, K. M. Chemical genetic strategy for targeting protein kinases based on covalent complementarity. Proc. Natl. Acad. Sci. U S A 108, 1 (2011).
Suda, K., Onozato, R., Yatabe, Y. & Mitsudomi, T. EGFR T790M mutation: a double role in lung cancer cell survival? J. Thorac. Oncol. 4, 1–4 (2009).
Modi, V. & Dunbrack, R. L. Defining a new nomenclature for the structures of active and inactive kinases. Proc. Natl. Acad. Sci. U S A 116, 6818–6827 (2019).
Roskoski, R. Src protein-tyrosine kinase structure, mechanism, and small molecule inhibitors: this paper is dedicated to the memory of Prof. Donald F. Steiner (1930–2014) Advisor, mentor, and discoverer of proinsulin. Pharmacol. Res. 94, 9–25 (2015).
Saladino, G. & Gervasio, F. L. Modeling the effect of pathogenic mutations on the conformational landscape of protein kinases. Curr. Opin. Struct. Biol. 37, 108–114 (2016).
Ahuja, L. G., Taylor, S. S. & Kornev, A. P. Tuning the “Violin” of protein kinases: the role of dynamics-based allostery. IUBMB 71, 685–696 (2019).
Martín-García, F., Papaleo, E., Gomez-Puertas, P., Boomsma, W. & Lindorff-Larsen, K. Comparing molecular dynamics force fields in the essential subspace. PLoS ONE 10, e0121114 (2015).
Tamborero, D., Gonzalez-Perez, A. & Lopez-Bigas, N. OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genes. Bioinformatics 29, 2238–2244 (2013).
Colli, L. M. et al. Burden of nonsynonymous mutations among TCGA cancers and candidate immune checkpoint inhibitor responses. Cancer Res. 76, 3767–3772 (2016).
Kotlyar, M., Pastrello, C., Malik, Z. & Jurisica, I. IID 2018 update: Context-specific physical protein-protein interactions in human, model organisms and domesticated species. Nucleic Acids Res. 47, D581–D589 (2019).
Grunwald, D. S. et al. GABARAPs and LC3s have opposite roles in regulating ULK1 for autophagy induction. Autophagy. https://doi.org/10.1080/15548627.2019.1632620 (2019).
Alemu, E. A. et al. ATG8 family proteins act as scaffolds for assembly of the ULK complex: sequence requirements for LC3-interacting region (LIR) motifs. J. Biol. Chem. 287, 39275–39290 (2012).
Kraft, C. et al. Binding of the Atg1/ULK1 kinase to the ubiquitin-like protein Atg8 regulates autophagy. EMBO J. 31, 3691–3703 (2012).
Birgisdottir, A. B., Lamark, T. & Johansen, T. The LIR motif - crucial for selective autophagy. J. Cell Sci. 126, 3552–3562 (2013).
Mukhopadhyay, S., Naik, P. P., Panda, P. K., Sinha, N. & Bhutia, S. K. Serum starvation induces anti-apoptotic cIAP1 to promote mitophagy through ubiquitination. Biochem. Biophys. Res. Commun. 479, 940–946 (2016).
Bignon, E., Allega, M. F., Lucchetta, M., Tiberti, M. & Papaleo, E. Computational structural biology of S-nitrosylation of cancer targets. Front. Oncol. 8, 272 (2018).
Nielsen, S. V. et al. Predicting the impact of Lynch syndrome-causing missense mutations from structural calculations. PLOS Genet. 13, e1006739 (2017).
Scheller, R. et al. Toward mechanistic models for genotype-phenotype correlations in phenylketonuria using protein stability calculations. Hum. Mutat. https://doi.org/10.1002/humu.23707 (2019).
Tiberti, M. et al. MutateX: an automated pipeline for in-silico saturation mutagenesis of protein structures and structural ensembles. bioRxiv (2019).
Papaleo, E., Parravicini, F., Grandori, R., De Gioia, L. & Brocca, S. Structural investigation of the cold-adapted acylaminoacyl peptidase from Sporosarcina psychrophila by atomistic simulations and biophysical methods. Biochim. Biophys. Acta Proteins Proteomics 1844, 2203–2213 (2014).
Tiberti, M. et al. PyInteraph: a framework for the analysis of interaction networks in structural ensembles of proteins. J. Chem. Inf. Model 54, 1537–1551 (2014).
Viloria, J. S., Allega, M. F., Lambrughi, M. & Papaleo, E. An optimal distance cutoff for contact-based protein structure networks using side-chain centers of mass. Sci. Rep. 7, 1–11 (2017).
Jónsdóttir, L. B. et al. The role of salt bridges on the temperature adaptation of aqualysin I, a thermostable subtilisin-like proteinase. Biochim. Biophys. Acta Proteins Proteomics 1844, 2174–2181 (2014).
Ahmed, M. C., Papaleo, E. & Lindorff-Larsen, K. How well do force fields capture the strength of salt bridges in proteins? PeerJ 6, e4967 (2018).
Invernizzi, G., Tiberti, M., Lambrughi, M., Lindorff-Larsen, K. & Papaleo, E. Communication routes in ARID domains between distal residues in helix 5 and the DNA-binding loops. PLoS Comput. Biol. 10, e1003744 (2014).
Guarnera, E., Tan, Z. W., Zheng, Z. & Berezovsky, I. N. AlloSigMA: Allosteric signaling and mutation analysis server. Bioinformatics 33, 3996–3998 (2017).
Jiao, H. et al. Chaperone-like protein p32 regulates ULK1 stability and autophagy. Cell Death Differ. 22, 1812–1823 (2015).
Loska, S. L. Regulation of ULK1 in autophagy. PhD Thesis, Univ. Manchester, Fac. Life Sci. (2012).
Tomoda, T., Bhatt, R. S., Kuroyanagi, H., Shirasawa, T. & Hatten, M. E. A mouse serine/threonine kinase homologous to C. elegans UNC51 functions in parallel fiber formation of cerebellar granule neurons. Neuron 24, 833–846 (1999).
Colaprico, A. et al. TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 44, gkv1507 (2015).
Huber, W. et al. Orchestrating high-throughput genomic analysis with Bioconductor. Nat. Methods 12, 115–121 (2015).
Risso, D., Schwartz, K., Sherlock, G. & Dudoit, S. GC-content normalization for RNA-Seq data. (2011).
Mounir, M. et al. New functionalities in the TCGAbiolinks package for the study and integration of cancer data from GDC and GTEx. PLOS Comput. Biol. 15, e1006701 (2019).
Law, C. W., Chen, Y., Shi, W. & Smyth, G. K. voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 15, R29 (2014).
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2009).
Cibulskis, K. et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat. Biotechnol. 31, 213–219 (2013).
Ioannidis, N. M. et al. REVEL: an ensemble method for predicting the pathogenicity of rare missense variants. Am. J. Hum. Genet. 99, 877–885 (2016).
Hornbeck, P. V. et al. PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res. 43, D512–D520 (2015).
Van Roey, K., Dinkel, H., Weatheritt, R. J., Gibson, T. J. & Davey, N. E. The switches.ELM resource: a compendium of conditional regulatory interaction interfaces. Sci. Signal. 6, rs7 (2013).
Tate, J. G. et al. COSMIC: the catalogue of somatic mutations in cancer. Nucleic Acids Res. 47, D941–D947 (2019).
Kobayashi, Y. et al. Pathogenic variant burden in the ExAC database: an empirical approach to evaluating population data for clinical variant interpretation. Genome Med. 9, 1–14 (2017).
Xu, Y., Shao, X., Wu, L., Deng, N. & Chou, K. iSNO-AAPair: incorporating amino acid pairwise coupling into PseAAC for predicting cysteine S-nitrosylation sites in proteins. PeerJ 1–18 (2013). https://doi.org/10.7717/peerj.171
Lee, T. Y., Chen, Y. J., Lu, T. C., Huang, H. Da & Chen, Y. J. SNOSite: exploiting maximal dependence decomposition to identify cysteine S-nitrosylation with substrate site specificity. PLoS ONE 6, (2011).
Blom, N., Sicheritz-Pontén, T., Gupta, R., Gammeltoft, S. & Brunak, S. Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence. Proteomics 4, 1633–1649 (2004).
Mayakonda, A. & Koeffler, H. P. Maftools: Efficient analysis, visualization and summarization of MAF files from large-scale cohort based cancer studies. bioRxiv 052662 (2016). https://doi.org/10.1101/052662
Guerois, R., Nielsen, J. E. & Serrano, L. Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. J. Mol. Biol. 320, 369–387 (2002).
Schymkowitz, J. et al. The FoldX web server: an online force field. Nucleic Acids Res. 33, W382–W388 (2005).
Hess, B., Kutzner, C., van der Spoel, D. & Lindahl, E. GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulation. J. Chem. Theory Comput. 4, 435–447 (2008).
Piana, S., Lindorff-Larsen, K. & Shaw, D. E. How robust are protein folding simulations with respect to force field parameterization? Biophys. J. 100, L47–L49 (2011).
Mackerell, A. D., Feig, M. & Brooks, C. L. Extending the treatment of backbone energetics in protein force fields: limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. J. Comput. Chem. 25, 1400–1415 (2004).
Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R. W. & Klein, M. L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926 (1983).
Bussi, G., Donadio, D. & Parrinello, M. Canonical sampling through velocity rescaling. J. Chem. Phys. 126, 014101 (2007).
Hess, B., Bekker, H., Berendsen, H. & Fraaije, J. LINCS: A linear constraint solver for molecular simulations. J. Comput. Chem. 18, 1463–1472 (1997).
Essmann, U. et al. A smooth particle mesh Ewald method. J. Chem. Phys. 103, 8577 (1995).
Berjanskii, M., Zhou, J., Liang, Y., Lin, G. & Wishart, D. S. Resolution-by-proxy: a simple measure for assessing and comparing the overall quality of NMR protein structures. J. Biomol. NMR 53, 167–180 (2012).
Daidone, I. & Amadei, A. Essential dynamics: foundation and applications. Wiley Interdiscip. Rev. Comput. Mol. Sci. 2, 762–770 (2012).
Kuriata, A. et al. CABS-flex 2.0: a web server for fast simulations of flexibility of protein structures. Nucleic Acids Res. 46, W338–W343 (2018).

Acknowledgements

The authors would like to thank Matteo Lambrughi and Matteo Tiberti for technical assistance and fruitful discussion. The results shown here are in part based upon data generated by the TCGA Research Network: https://www.cancer.gov/tcga. The project was supported by Danmarks Grundforskningsfond (DNRF125) and a Carlsberg Foundation Distinguished Fellowship (CF18-0314). The project was also supported by a Netaji Subhash ICAR International Fellowship (Govt. of India) awarded to M.K. to work in the E.P. group. The calculations described in this paper were performed using the DeiC National Life Science Supercomputer Computerome at DTU (Denmark), and DECI-PRACE 14th and 15th HPC Grants for calculations on Archer (UK).

Author information

Authors and Affiliations

Computational Biology Laboratory, Center for Autophagy, Recycling and Disease (CARD), Danish Cancer Society Research Center, Strandboulevarden 49, 2100, Copenhagen, Denmark
Mukesh Kumar & Elena Papaleo
Translational Disease System Biology, Faculty of Health and Medical Sciences, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
Elena Papaleo

Authors

Mukesh Kumar
Elena Papaleo

Contributions

Conceptualization: E.P.; Data Curation: M.K., E.P.; Formal Analysis: M.K., E.P.; Funding Acquisition: M.K., E.P.; Investigation: M.K., E.P.; Methodology: E.P.; Project Administration: E.P.; Resources: E.P.; Supervision: E.P.; Validation: M.K., E.P.; Visualization: E.P., M.K.; Writing-Original Draft: E.P.; Writing-Review and Editing: M.K., E.P.

Corresponding author

Correspondence to Elena Papaleo.

Ethics declarations

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary file 1

Supplementary file 2

Supplementary file 3

Supplementary file 4

Supplementary file 5

Supplementary file 6

Supplementary file 7

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Kumar, M., Papaleo, E. A pan-cancer assessment of alterations of the kinase domain of ULK1, an upstream regulator of autophagy. Sci Rep 10, 14874 (2020). https://doi.org/10.1038/s41598-020-71527-4

Received: 10 December 2019
Accepted: 22 June 2020
Published: 10 September 2020
DOI: https://doi.org/10.1038/s41598-020-71527-4
