Utilizing The Cancer Genome Atlas (TCGA) and KM plotter databases we identified six heat shock proteins associated with survival of breast cancer patients. The survival curves of samples with high and low expression of heat shock genes were compared by log-rank test (Mantel-Haenszel). Interestingly, patients overexpressing two identified HSPs – HSPA2 and DNAJC20 exhibited longer survival, whereas overexpression of other four HSPs – HSP90AA1, CCT1, CCT2, CCT6A resulted in unfavorable prognosis for breast cancer patients. We explored correlations between expression level of HSPs and clinicopathological features including tumor grade, tumor size, number of lymph nodes involved and hormone receptor status. Additionally, we identified a novel signature with the potential to serve as a prognostic model for breast cancer. Using univariate Cox regression analysis followed by multivariate Cox regression analysis, we built a risk score formula comprising prognostic HSPs (HSPA2, DNAJC20, HSP90AA1, CCT1, CCT2) and tumor stage to identify high-risk and low-risk cases. Finally, we analyzed the association of six prognostic HSP expression with survival of patients suffering from other types of cancer than breast cancer. We revealed that depending on cancer type, each of the six analyzed HSPs can act both as a positive, as well as a negative regulator of cancer development. Our study demonstrates a novel HSP signature for the outcome prediction of breast cancer patients and provides a new insight into ambiguous role of these proteins in cancer development.
Despite the significant progress that has been made in recent years to improve the breast cancer diagnosis and treatment, it is still the most commonly diagnosed cancer among women and remains the second cancer-related death in women worldwide1. Breast cancer is a heterogenous group of tumors with distinct morphologies, clinical implications and response to therapy2. Patients are stratified into risk groups basing on the clinicopathological features (tumor size, lymph node stage, metastasis) combined with classical molecular features, such as the expression of estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor 2 (HER2)3. Currently, with the public availability of clinical and genomic data such as The Cancer Genome Atlas, lots of bioinformatics groups publish the multigene classifiers that could complement traditional diagnostic methods and develop more effective treatments. For example, association between physician characteristics and the use of 21-gene recurrence score genomic testing creates opportunities for breast cancer patients to receive optimal care4.
Heat shock proteins (HSPs) belong to a highly conserved family of proteins that act as molecular chaperones under stress conditions, including carcinogenesis5,6. They have been classified into the following families: HSPA (HSP70), HSPH (HSP110) HSPB (small heat shock proteins, sHSP), HSPC (HSP90), DNAJ (HSP40) and chaperonins7,8. HSPs interact with a broad range of unfolded, misfolded and semi-native proteins, assist in the acquisition of their active structures and prevent the formation of unwanted intermolecular interactions and protein aggregates9,10,11. In some cases, HSPs do not only prevent aggregation of proteins but also, using unfoldase activity, they are able to dissociate already existing protein aggregates12,13,14,15. Overexpression of HSPs has been observed in a wide range of human tumors, including breast, endometrial, ovarian, gastric, colon, lung and prostate cancers5,16,17. Most of previous studies reported correlation of high expression of HSPs with cancer aggressiveness and prognosis6,17,18,19. The overexpression of HSPs has been shown to be implicated in cancer cell proliferation, differentiation, invasion, metastasis and anti-apoptotic activity16,18. Recently, it was shown that heat shock proteins create a network which helps cells to survive stress conditions20. In cancer cells this network is remodeled to evade cell death, bypass senescence and refashion cell signaling to help highly malignant cancer cells to survive11. Moreover, it was shown recently that HSPs are involved in the evolution of cancer cells resulting in tumor heterogeneity. Such evolution could be accelerated by the chemotherapy and could lead to the acquisition of chemoresistance11,21,22. In contrast, some research has shown that reduced expression of HSPs is associated with poor prognosis in cancer patients23,24. Therefore, the biological mechanisms of HSPs and their role in cancer development still remain to be investigated.
In this study, we identified several heat shock genes which expression is crucial for breast cancer development. Interestingly, some of these HSPs work as negative regulators of cancer development and their expression is reduced in breast cancer cells, whereas other can support oncogenic activities and their expression in breast cancer cells is elevated. More importantly, identified HSPs that positively or negatively regulate breast cancer development, can play opposite role in other cancer types. The workflow of this study is presented in Supplementary Fig. S1.
Identification of heat shock proteins associated with survival of breast cancer patients
To identify the heat shock proteins (HSPs) associated with prognosis for breast cancer patients, we utilized TCGA and KM plotter datasets. We first performed the log-rank test (Mantel-Haenszel) to determine significant differences in patients’ survival depending on HSP expression. Expression of each of 96 HSPs was separated into low-expression and high-expression groups based on the median expression in each database used as a cutoff. We identified 13/96 HSPs (HSPA2, HSPA8, HSPA9, DNAJB5, DNAJC13, DNAJC20, DNAJC23, HSP90AA1, HSP90AB1, CCT1, CCT2, CCT4 and CCT6A) from the TCGA cohort and 22/96 HSPs (HSPA1A, HSPA1B, HSPA2, DNAJA1, DNAJC2, DNAJC5, DNAJC5G, DNAJC9, DNAJC16, DNAJC27, DNAJC20, HSPB1, HSPB5, HSP90AA1, CCT1, CCT2, CCT3, CCT5, CCT6A, CCT7, CCT8, HSP60) from KM plotter cohort significantly associated with overall survival (p ≤ 0,05). Six of them: HSPA2, DNAJC20, HSP90AA1, CCT1, CCT2 and CCT6A were statistically significant in both datasets (TCGA and KM plotter) and they were selected for further validation (Table 1). Comparison of the HSP expression in two independent datasets was performed to minimize the risk of false findings. With the exception of CCT6A, selected heat shock genes (HSPA2, DNAJC20, HSP90AA1, CCT1, CCT2) exhibited clinical significance also when subjected to univariate Cox regression model (Supplementary Fig. S2). Because expression of each of HSPs showed a nearly normal distribution, we divided patients into low-expression and high-expression groups by the median value (Fig. 1B). Interestingly, high expression of HSPA2 and DNAJC20 was significantly associated with better prognosis for breast cancer patients from TCGA cohort (p = 6,4e-03 and p = 4,3e-02, respectively), longer overall survival in KM plotter cohort (p = 4,5e-04 and p = 5,3e-03, respectively) and longer relapse-free survival in KM plotter cohort (p = 1,5e-07 and p = 5,3e-12, respectively) (Figs 1A, S3). In contrast, high expression of the other four HSPs (HSP90AA1, CCT1, CCT2, CCT6A) was significantly associated with reduced overall survival in TCGA cohort (p = 9,3e-04; p = 9,9e-03; p = 4,3e-02; p = 2,7e-03, respectively), reduced overall survival in KM plotter cohort (p = 9,4e-03; p = 1,6e-05; p = 1,4e-06; p = 6,7e-04, respectively) and reduced relapse-free survival in KM plotter cohort (p = 5,9e-15; p = 2e-09; p < 1e-16; p = 4,6e-15, respectively) (Figs 1A, S3). Collectively, these results suggest that high expression of HSPA2 and DNAJC20 is associated with low-risk breast cancer, whereas high expression of HSP90AA1, CCT1, CCT2 and CCT6A correlates with high-risk breast cancer.
Breast cancer survival-associated HSPs are differentially expressed in normal and tumor tissue
Differences in the expression of six HSPs between primary tumor tissue and normal solid tissue were assessed. The expression of DNAJC20 (better prognosis) was significantly lower in tumor tissue (log2 Fold Change (FC) = 0,7; p = 0,0251), whereas HSP90AA1, CCT2, CCT6A (unfavorable prognosis) were upregulated in tumors (HSP90AA1 log2 FC = 3,4; p < 0,0001; CCT2 log2 FC = 0,78; p < 0,0001; CCT6A log2 FC = 0,68; p < 0,0001, respectively). Analysis of HSPA2 and CCT1 expression did not reveal statistical difference between cancer and normal tissue (p > 0,05) (Fig. 2).
The relationship between six HSP expression and clinicopathological features
To explore the effect of six prognostic HSPs on clinical features, we performed the analysis of each of HSP mRNA expression in subgroups stratified by clinicopathological features. We have observed that patients with high expression of HSPA2 (better prognosis) were associated with smaller tumors (p = 0,0162), ER-positive and PR-positive cancers (p < 0,0001 and p < 0,0001, respectively). Similarly, high expression of DNAJC20 (better prognosis) was observed in ER-positive and HER2-positive cancers (p < 0,0198 and p < 0,0001, respectively). In contrast, patients with increased expression of other four HSPs (poor prognosis) had higher clinical stage (HSP90AA1 p < 0,0001; CCT1 p = 0,0277; CCT2 p = 0,0005; CCT6A p = 0,0185), larger tumors (HSP90AA1 p < 0,0001; CCT1 p < 0,0001; CCT2 p = 0,0003; CCT6A p < 0,0001), more lymph nodes involved (HSP90AA1 p = 0,0139; CCT2 p = 0,003), ER-negative cancers (HSP90AA1 p = 0,0015; CCT1 p < 0,0001; CCT6A p < 0,0001), PR-negative cancers (HSP90AA1 p < 0,0001; CCT1 p < 0,0001; CCT6A p < 0,0001) and HER2-positive cancers (HSP90AA1 p < 0,0001; CCT1 p < 0,0001; CCT2 p = 0,0002; CCT6A p = 0,0075). Clinicopathological data of breast cancer patients are summarized in Table 2.
It is well established that tumor suppressor encoded by TP53 gene is at the crossroads of a network of signaling pathways that prevents cancer development11. Moreover, mutated TP53 lose the oncosuppressive role and acquire new oncogenic functions. In line with this, we found significantly increased expression of HSPA2 (better prognosis) in TP53 WT cancers (log2 FC = 0,99; p < 0,0001), whereas high expression of HSP90AA1, CCT1, CCT2 and CCT6A (poor prognosis) coincided with TP53 mutations (HSP90AA1 log2 FC = 0,18; p < 0,0001; CCT1 log2 FC = 0,99; p < 0,0001; CCT2 log2 FC = 0,15; p = 0,0047; CCT6A log2 FC = 0,65; p < 0,0001, respectively) (Fig. 3).
Development of prognostic signature based on the expression of HSPs to predict the survival of breast cancer patients
To build a prediction model, evaluated previously HSPs as well as clinical candidate predictors (stage, ER status, PR status and HER2 status) were subjected to univariate Cox regression model. In total, five HSPs and stage of cancer were significantly correlated with the overall survival of breast cancer patients (p < 0,05; Table 3). Two of HSPs (HSPA2, DNAJC20) had negative coefficients, suggesting that their higher expression was observed in patients with longer survival. The positive coefficients for the remaining three significant HSPs (HSP90AA1, CCT1, CCT2) represented that the higher expression level was observed in patients with poor survival. As expected, cancer stage exhibited a positive coefficient indicating a worse prognosis. One of HSPs and receptors were not prognostically relevant for overall survival (in univariate analysis p > 0,05) and were omitted from further prognosis evaluation. The 1068 breast invasive carcinoma (BRCA) patients from TCGA dataset were randomly divided into a training set (n = 534) and a validation set (n = 534). Basing on the expression level of five prognostic HSPs, cancer stage and multivariate Cox regression coefficients for training set, we built a risk score formula for BRCA patients’ survival prediction. Expression data were converted to a binary format (low expression = 0, high expression = 1) and cancer stage data (AJCC_PATHOLOGIC_TUMOR_STAGE) were converted as follows: Stage I, IA, IB = 1; Stage II, IIA, IIB = 2; Stage III, IIIA, IIIB, IIIC = 3; Stage IV = 4. Patients with Stage X and NA were excluded from the analysis. Risk score was constructed with the formula: Risk score = (−0,4181 × HSPA2 0/1) + (−0,1813 × DNAJC20 0/1) + (0,6861 × HSP90AA1 0/1) + (0,0824 × CCT1 0/1) + (0,11 × CCT2 0/1) + (0,8427 × Stage 1/2/3/4). We next validated our signature in the validation set to confirm our findings. By calculating the risk score for each patient in the validation set based on the same risk score formula, we divided BRCA patients into a low-risk group (n = 290) and high-risk group (n = 244) using the same threshold. The risk score showed a great survival prediction in breast cancer with area under curve (AUC) equal to 0,6237 in the training set, AUC equal to 0,654 in the validation set, AUC equal to 0,659 in entire BRCA cohort and AUC equal to 0,572 in independent METABRIC dataset (Figs 4A, S4). The Kaplan-Meier curve suggested that patients in the high-risk group suffered worse prognosis than patients in the low-risk group (median survival 100,6 months vs 212,1 months, p < 0,0001 in the training set; median survival 112,3 vs 216,6 months, p < 0,0001 in the validation set; median survival 112,3 vs 212,1, p < 0,0001 in the entire TCGA dataset) (Fig. 4B). The distribution of the risk score, patients’ survival status and expression profiles of prognostic HSPs were ranked according to the risk score value (Fig. 4C). Patients with a high risk score had greater mortality than patients with low risk score (Fig. 4C, middle panel). In addition, patients with a high-risk score had higher expression of HSP90AA1, CCT1 and CCT2, whereas the expression of the remaining two HSPs (HSPA2 and DNAJC20) was downregulated (Fig. 4C, heatmap). These findings suggested that risk score calculated basing on five HSP expression and stage of cancer has a competitive performance for the survival prediction of BRCA patients (Supplementary Fig. S5). Importantly, nearly identical results of survival prediction were obtained when Stage variable was binarized (Stage I 0/1, Stage II 0/1, Stage III 0/1) and coefficients in a Cox regression model were calculated for each Stage (Supplementary Fig. S6).
Functional characteristics of HSP prognostic signature
To explore the functional implications of five-HSP signature, we performed gene set enrichment analysis (GSEA). The top 3 enriched datasets from GSEA analysis were shown in Fig. 5A. We found that the most upregulated genes in high-risk group clustered most significantly in cell-cycle associated processes including E2F targets (NES = 3,36), MYC targets (NES = 3,35), G2M checkpoint (NES = 3,24) (Fig. 5A, top panel). In contrast, the most enriched processes in low-risk group included estrogen response early (NES = −2,73), estrogen response late (NES = −2,10), UV response DN (NES = −1,82) (Fig. 5A, bottom panel). All processes enriched in high-risk or low-risk group were mentioned in Fig. 5B.
HSPs associated with breast cancer survival play dual roles in other cancer types
As heat shock proteins are mostly reported to play pro-oncogenic role in cancer development, we utilized PRECOG (PREdiction of Clinical Outcomes from Genomic Profiles) tool to investigate the association between six HSP expression and overall survival in various solid and liquid cancers. For HSPA2 we observed correlation with both good and bad prognosis depending on cancer types. The poor survival (survival Z-score > 0) associated with HSPA2 overexpression was observed for Skin Cutaneous Melanoma (SKCM), Acute Myeloid Leukemia (LAML), Lung adenocarcinoma (LUAD), Bladder Urothelial Carcinoma (BLCA), Lung squamous cell carcinoma (LUSC), Ovarian serous cystadenocarcinoma (OV), Colon adenocarcinoma (COAD), Cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC), Lymphoid Neoplasm Diffuse Large B-cell Lymphoma (DLBC), Pheochromocytoma and Paraganglioma (PCPG), Glioblastoma multiforme (GBM) and Sarcoma (SARC), whereas the favorable prognosis (survival Z-score < 0) was observed for Breast invasive carcinoma (BRCA), Adrenocortical carcinoma (ACC), Kidney Chromophobe (KICH), Uterine Carcinosarcoma (UCS), Head and Neck squamous cell carcinoma (HNSC), Pancreatic adenocarcinoma (PAAD), Kidney renal papillary cell carcinoma (KIRP), Uterine Corpus Endometrial Carcinoma (UCEC), Rectum adenocarcinoma (READ), Brain Lower Grade Glioma (LGG), Thyroid carcinoma (THCA), Liver hepatocellular carcinoma (LIHC) and Prostate adenocarcinoma (PRAD). High expression of DNAJC20 correlated with good prognosis (survival Z-score < 0) for most of cancer types excluding KICH, DLBC, KIRC, ACC, READ, HNSC, UCS, KIRP, PAAD. Conversely, the high expression of other four HSPs (HSP90AA1, CCT1, CCT2, CCT6A) was associated with poor prognosis (survival Z-score > 0) for most types of cancer. HSP90AA1 correlated with good prognosis only in PRAD, READ, THCA, DLBC, AAC, OV, KICH, PCPG, LUCS, COAD and KIRC. CCT1 also played pro-oncogenic role in most cancer types excepting KIRC, COAD, KICH, DLBC, GBM, THCA, READ, LUSC and LGG. Similar results were observed when we correlated expression of CCT2 with clinical outcome. In most cancer types, CCT2 correlated with poor prognosis, but there were some like COAD, GBM, SKCM, PCPG, LAML, READ, UCS, DLBC and LUCS which had increased survival rate when CCT2 was overexpressed. Correlation between high expression of CCT6A and good survival was observed just for 7/26 cancer types including SKCM, OV, LUSC, DLBC, GBM, LAML and READ (Fig. 6A).
In summary, we have shown that overexpression of HSPA2 and DNAJC20 in most cancer types correlates with favorable prognosis suggesting tumor suppressor activity of these gene products whereas high expression of HSP90AA1, CCT1, CCT2 and CCT6A correlates mainly with poor prognosis suggesting oncogenic activity of these gene products (Fig. 6B).
Detailed transcriptomic analysis of several different types of tumors and their normal counterparts (TCGA database) shows that in cancer cells, genes conserved with unicellular organisms were strongly up-regulated, whereas genes of metazoan origin were primarily inactivated. Moreover, the coordinated expression of strongly interacting multicellularity and unicellularity processes was lost in tumors25. It has been shown previously that HSPs belong to the highly conserved network which helps to survive both unicellular and multicellular organisms11. Several mechanisms are involved in the cytoprotective effect of HSPs: 1 – as molecular chaperones, HSPs catalyze the proper folding of new proteins and prevent formation of potential aggregates in existing structures26; 2 – expression of HSPs correlates with increased resistance to apoptosis revealing their prosurvival mechanism27,28; 3 – HSPs favour the proteasomal degradation of certain proteins under stress conditions29,30. In the case of cancer cells, these HSP networks are extensively remodeled in such a way that they become advantageous to the proliferating cells, i.e. misregulation of the stress signaling cascades, receptor blocking or hiperactivation, effective apoptosis and senescence evasion11,20. The crucial role of HSPs in cell transformation and tumor evolution leads to the consideration that HSPs are important therapeutic targets for cancer treatment11,16,17,31.
Herein, using The Cancer Genome Atlas and KM plotter database, we showed strong association between expression of at least six HSP-encoding genes (namely: HSPA2, DNAJC20, HSP90AA1, CCT1, CCT2 and CCT6A) and survival of breast cancer patients. Interestingly, among these HSPs, overexpression of HSPA2 and DNAJC20 was associated with better prognosis (tumor suppressor-like activity), whereas high expression of other heat shock genes (HSP90AA1, CCT1, CCT2 and CCT6A) correlated with poor survival (oncogene-like activity). These results are in line with the recent comprehensive study of Zoppino et al. demonstrating expression of among others: HSPA2, DNAJC20, HSP90AA1, CCT1, CCT2 with significance on survival32.
The HSPA2 belongs to the HSPA/HSP70 family of proteins possessing ATPase activity required for their molecular chaperone activity. Specificity of these activities towards different substrates is driven by DNAJ specificity factors33,34. In our study, the emergence of the tumor suppressive functions of HSPA2 was supported by the decreasing HSPA2 expression in larger and more advanced tumors. Additionally, higher expression of HSPA2 was observed in ER- and PR-positive breast cancers which are linked to better clinical outcome. Significantly elevated expression of HSPA2 was observed in tumors with no mutation in tumor suppressor TP53 preventing the tumor development and metastasis. However, according to previously reported studies, overexpression of HSPA2 was also associated with worse clinical outcome. HSPA2 (HSP70-2) is expressed at high levels in testis where it plays an essential role in spermatogenesis and has been described as an important biomarker in many cancer types35. High expression of HSPA2 have been associated with shorter overall survival in stage I-II of non-small cell lung carcinoma patients36. HSPA2 was also overexpressed in esophageal squamous cell carcinoma and was significantly associated with primary tumor, TNM stage, lymph node metastases and recurrence resulting in shorter DFS and OS37. Similarly, increased HSPA2 in pancreatic ductal adenocarcinoma and hepatocellular carcinoma was associated with more aggressive clinical features and shorter overall survival38,39. Contrary to our findings, a recent study indicated that HSPA2 might also play an essential role in breast cancer development and progression by promoting cell growth and cellular motility both in culture as well as in vivo in xenotransplanted mice40. On the other hand, overexpression of HSPA2 was found to correlate with longer overall survival in breast cancer patients basing on the TCGA data, the Netherlands Cancer Institute (NKI) data and data from several other breast cancer gene expression datasets at the Oncomine (http://www.oncomine.org/)24,32. These observations could suggest the dual role of HSPA2, both tumor suppressive and prosurvival, in different cancer types as well as the possibility of some limitations resulting from dish-based culture which does not fully develop the tumor microenvironment conditions.
Another survival-associated HSP protein identified in our study is DNAJC20. This protein belongs to the DNAJ family which functions as substrate specificity factors for HSPA/HSP70 family. DNAJC20 acts as a co-chaperone in iron-sulfur cluster biosynthesis in mitochondria41. To the best of our knowledge, no clear evidence for the involvement of DNAJC20 in cancer development has been presented before. Basing on the TCGA data from microarray analysis, we observed decreased expression of this heat shock gene in tumors when compared to normal tissues. In addition, low expression of DNAJC20 correlated with poor survival of breast cancer patients. Decreased expression of DNAJC20 was associated with ER-negative and HER2-negative tumors suggesting correlation of low expression of DNAJC20 with more aggressive basal breast cancer subtype42.
In this study, we identified HSP90AA1 gene encoding HSP90 alpha protein, the inducible isoform of HSP90, among four most significant factors of poor prognosis in breast cancer. Indeed, in previous reports it has been described that HSP90 (both HSP90α and HSP90β) increases the risk of recurrence and distant metastases in triple negative and ER+/HER2- breast cancer subtypes, as well as strongly associates with the risk of death from breast cancer43. Another studies indicated overexpression of HSP90α in human breast cancer cells associated with increased cell proliferation44. HSP90 is also involved in many cancer-associated processes like cellular transformation45, DNA double-strand break repair46,47, apoptosis48, invasion49, genetic variation50,51. Due to the complex involvement in oncogenic signaling, HSP90α has attracted much attention as a potential therapeutic target. Not surprisingly, we also observed strong correlation between HSP90AA1 expression and poor survival of breast cancer patients based on data from two databases. Consistent with previous observations, HSP90AA1 was overexpressed in tumors when compared to normal tissue. Additionally, overexpression of HSP90AA1 was observed in tumors containing mutation in TP53, one of the most frequent genetic alteration in cancer that is often associated with accelerated tumor progression. Oncogenic properties of HSP90α correlated with aggressive clinicopathological features including high clinical stage, large tumors (T3 & T4) and lymph node involvement.
Intriguingly, next three genes identified in our studies, which high expression correlates with poor prognosis of breast cancer patients, are the subunits of molecular chaperonin complex CCT/TRiC (CCT for chaperonin containing TCP1, also called TCP-1 ring complex). CCT/TRiC complex consists of two rings stacked back-to-back, each ring is composed of eight distinct subunits (CCT1-CCT8)52,53. CCT has been shown to mediate folding of approximately 10% of the eukaryotic proteome including a number of cancer-linked proteins like p5354, tumor suppressor Von Hippel-Lindau55, signal transducer and activator of transcription 3 (STAT3)56, cyclin E57, p21Ras and cyclin B58. High expression of CCT2 occurred in liver, prostate and breast cancer and correlated with cancer severity and unfavorable prognosis59,60. Another study reported that CCT1 and CCT2 were amplified in breast cancer and necessary for cell survival and growth61. CCT subunits have been also implicated in the development of hepatocellular carcinoma62,63, gastric64, esophageal65 and colon cancer66. According to our studies, three subunits of CCT complex – CCT1, CCT2 and CCT6A strongly correlated with survival of breast cancer patients. Tumor samples showed a significantly higher expression of these subunits than normal controls. Moreover, high expression of identified CCT subunits was associated with aggressive clinical features including the high stage and grade of cancer. Increased expression of CCT subunits negatively correlated with the status of estrogen (ER) and progesterone (PR) receptors indicating more aggressive cancers. The involvement of particular subunits of CCT complex in tumorigenesis observed in our studies and previously reported in different types of cancer, raise the question of whether CCT subunits exert tumorigenic effects acting as independent monomers or components of CCT complex. In fact, there are several studies showing that some individual subunits of CCT chaperonin, when monomeric, can have an oligomer-independent functions67,68,69,70,71. On the other hand, subunits of CCT complex are thought to recognize different motifs in substrates52. Therefore, even if they are bound in a CCT complex, they may recruit specific clients involved in the regulation of oncogenesis. To date, it still remains unclear whether the pro-oncogenic role of CCT complex results from the properties of its individual components or the full complex.
In this study, we revealed ambiguous role of HSPs in various types of cancer. We identified that depending on cancer type, each of the analyzed HSPs can act both as a positive as well as a negative regulator of carcinogenesis. These findings explain the semi-contradictory reports in the literature. A prosurvival role of HSPs have been reported several times11, but the positive correlation between expression of HSPs and better prognosis provides very new insight into the role of molecular chaperones in tumorigenesis. Besides our studies, there are only a few reports of tumor suppressive functions of HSPs23,24,32.
Finally, by using univariate Cox regression analysis followed by multivariate Cox regression analysis, we identified HSP expression signature combined with tumor stage that was associated with survival of breast cancer patients. Then, by calculating a risk score and performing ROC curve analysis, we found that this signature demonstrated significant prognostic performance in training, validation and entire TCGA dataset. Utilizing our risk score and GSEA analysis, we observed that high-risk patient cohort was enriched in cell cycle regulators whereas low-risk group overexpressed genes involved in estrogen response suggesting less aggressive luminal subtypes of breast cancer.
In conclusion, our study investigates the involvement of heat shock proteins in breast cancer development and contributes to the comprehension of the complex role of these proteins in other cancer type. Our unpublished results demonstrate the influence of six identified HSPs on proliferation, viability and response to chemotherapy in various breast cancer cell lines. Further functional investigations are needed to validate our studies and elucidate the molecular mechanisms underlying the role of these identified HSPs in tumorigenesis. Nevertheless, our study might be helpful to predict the survival of breast cancer patients and serves as an inspiration for seeking of potential new targets in cancer treatment.
All 1247 patients of breast invasive carcinoma (BRCA) were retrieved from The Cancer Genome Atlas (TCGA) and downloaded from the UCSC Xena browser (http://xena.ucsc.edu) or The cBioPortal for Cancer Genomics (http://cbioportal.org)72,73. Normal and metastatic samples have been excluded from analysis. Overall, 1101 TCGA samples of primary tumor have been included in our study with the corresponding clinical information and gene expression data. To validate the survival results obtained from TCGA dataset, we used an online database KM plotter (http://kmplot.com/analysis/) which contains data of 5143 breast cancer patients74. In this database, gene expression data, relapse free survival and overall survival information have been downloaded from Gene Expression Omnibus (GEO, Affymetrix microarrays only), European Genome-phenome Archive (EGA) and TCGA.
Breast cancer patients from TCGA dataset were divided into high-expression and low-expression groups by the median values of mRNA expression. Significant differences in survival were assessed by log-rank (Mantel-Haenszel) test using GraphPad Prism 6. P-value less than 0,05 was considered as statistically significant. Survival curves for BRCA patients from KM plotter were generated on the webpage. Hazard ratios (HR) and p-values (from the log-rank test) were calculated online. Then, HSPs were fitted in a univariate and multivariate Cox proportional hazards regression analysis using R software. Risk scores were estimated by involving selected HSPs and cancer stage, which where weighted by their estimated regression coefficients in the multivariate Cox regression model. Patients were divided into high-risk and low-risk groups using the median risk score as a cutoff value. Differences in patient survival between these two groups were estimated by the Kaplan-Meier survival analysis and log-rank (Mantel-Haenszel) test. The receiver operating characteristic (ROC) curve for the risk score and survival status (0 – deceased, 1 – living) was performed in XLSTAT statistical software to assess the predictive accuracy of prognostic model.
TCGA database analysis
Box plots of HSP expression in normal/cancer tissue of TCGA BRCA patients were generated using pan-cancer normalized Agilent array expression from XENA browser. Statistics was calculated using unpaired t test in GraphPad Prism 6. For box plots comparing TP53 status in low-expression and high-expression groups, we used gene expression RNAseq data (normalized_log2[norm_count + 1]) from XENA browser. Statistics was calculated using unpaired t test in GraphPad Prism 6. The association of HSP expression and clinicopathological features presented in Table 2 have been analyzed using gene expression RNAseq data (normalized_log2[norm_count + 1]) and clinical traits from XENA browser. P-value has been calculated using unpaired t test or one-way ANOVA in GraphPad Prism 6. Heatmaps were generated using ClustVis web tool (http://biit.cs.ut.ee/clustvis/) and gene expression RNAseq data (mRNA median Z-score) from cBioPortal72,73,75.
Survival Z-scores for individual genes and cancer types from TCGA were obtained from the PREdiction of Clinical Outcomes from Genomic Profiles (PRECOG) portal (http://precog.stanford.edu)76. PRECOG encompasses 165 cancer expression datasets, including overall survival data for ~26,000 patients diagnosed with 39 distinct malignancies. Survival Z-scores have been calculated for the whole TCGA dataset and for 26 individual cancer types from TCGA.
Functional enrichment analysis
Gene Set Enrichment Analysis (GSEA) software version 3.0 from the Broad Institute was used to identify significantly enriched gene sets77,78. BRCA patients from TCGA cohort were dichotomized into low-risk and high-risk groups based on the median of risk score. Our input file contained expression data for 20437 genes and 1100 patients. We used 1000 gene set permutations for the analysis and pathways with nominal p-value p ≤ 0,05 and FDR ≤ 0,25 were considered significant. We used 50 pathways in the hallmark gene sets (H) collection from MSigDB.
Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics. CA Cancer J Clin 66, 7–30 (2016).
Rakha, E. A. & Ellis, I. O. Modern classification of breast cancer: should we stick with morphology or convert to molecular profile characteristics. Adv. Anat. Pathol. 18, 255–267 (2011).
Onitilo, A. A., Engel, J. M., Greenlee, R. T. & Mukesh, B. N. Breast cancer subtypes based on ER/PR and Her2 expression: Comparison of clinicopathologic features and survival. Clin. Med. Res. 7, 4–13 (2009).
Wilson, L. E., Pollack, C. E., Greiner, M. A. & Dinan, M. A. Association between physician characteristics and the use of 21-gene recurrence score genomic testing among Medicare beneficiaries with early-stage breast cancer, 2008–2011. Breast Cancer Res. Treat. 1–11 (2018).
Nahleh, Z., Tfayli, A. & Sayed, A. El. Heat shock proteins in cancer: targeting the ‘chaperones’. Futur. Med Chem 4, 927–935 (2012).
Yang, Z. et al. Upregulation of heat shock proteins (HSPA12A, HSP90B1, HSPA4, HSPA5 and HSPA6) in tumour tissues is associated with poor outcomes from HBV-related early-stage hepatocellular carcinoma. Int. J. Med. Sci. 12, 256–263 (2015).
Kampinga, H. H. et al. Guidelines for the nomenclature of the human heat shock proteins. Cell Stress Chaperones 14, 105–111 (2009).
Kampinga, H. H. et al. Function, evolution, and structure of J-domain proteins. Cell Stress Chaperones (2018).
Richter, K., Haslbeck, M. & Buchner, J. The Heat Shock Response: Life on the Verge of Death. Mol. Cell 40, 253–266 (2010).
Lianos, G. D. et al. The role of heat shock proteins in cancer. Cancer Lett. 360, 114–118 (2015).
Wawrzynow, B., Zylicz, A. & Zylicz, M. Chaperoning the guardian of the genome. The two-faced role of molecular chaperones in p53 tumor suppressor action. Biochim. Biophys. Acta - Rev. Cancer 1869, 161–174 (2018).
Sharma, S. K., De Los Rios, P., Christen, P., Lustig, A. & Goloubinoff, P. The kinetic parameters and energy cost of the Hsp70 chaperone as a polypeptide unfoldase. Nat. Chem. Biol. 6, 914–920 (2010).
Priya, S. et al. GroEL and CCT are catalytic unfoldases mediating out-of-cage polypeptide refolding without ATP. Proc. Natl. Acad. Sci. 110, 7199–7204 (2013).
Skowyra, D., Georgopoulos, C. & Zylicz, M. The E. coli dnaK gene product, the hsp70 homolog, can reactivate heat-inactivated RNA polymerase in an ATP hydrolysis-dependent manner. Cell 62, 939–944 (1990).
Ziemienowicz, A. et al. Both the Escherichia coli Chaperone Systems, GroEL/GroES and DnaK/DnaJ/GrpE, Can Reactivate Heat-treated RNA Polymerase. J Biol Chem 268, 25425–25431 (1993).
Ciocca, D. R. & Calderwood, S. K. Heat shock proteins in cancer: diagnostic, prognostic, predictive, and treatment implications. Cell Stress Chaperones 10, 86 (2005).
Calderwood, S. K., Khaleque, M. A., Sawyer, D. B. & Ciocca, D. R. Heat shock proteins in cancer: Chaperones of tumorigenesis. Trends Biochem. Sci. 31, 164–172 (2006).
Calderwood, S. K. & Gong, J. Heat Shock Proteins Promote Cancer: It’s a Protection Racket. Trends Biochem. Sci. 41, 311–323 (2016).
Dimas, D. T. et al. The Prognostic Significance of Hsp70/Hsp90 Expression in Breast Cancer: A Systematic Review and Meta-analysis. Anticancer Res. 38, 1551–1562 (2018).
Rodina, A. et al. The epichaperome is an integrated chaperome network that facilitates tumour survival. Nature 538, 397–401 (2016).
Calderwood, S. K. Tumor heterogeneity, clonal evolution, and therapy resistance: an opportunity for multitargeting therapy. Discov. Med. 15, 188–94 (2013).
Tracz-Gaszewska, Z. et al. Molecular chaperones in the acquisition of cancer cell chemoresistance with mutated TP53 and MDM2 up-regulation. Oncotarget 5, 82123–82143 (2017).
Wang, X., Shi, X., Tong, Y. & Cao, X. The Prognostic Impact of Heat Shock Proteins Expression in Patients with Esophageal Cancer: A Meta-Analysis. Yonsei Med. J. 56, 1497 (2015).
Huang, Z., Duan, H. & Li, H. Identification of gene expression pattern related to breast cancer survival using integrated TCGA datasets and genomic tools. Biomed Res. Int. 2015 (2015).
Trigos, A. S., Pearson, R. B., Papenfuss, A. T. & Goode, D. L. Altered interactions between unicellular and multicellular genes drive hallmarks of transformation in a diverse range of solid tumors. Proc. Natl. Acad. Sci. 114 (2017).
Jego, G., Hazoumé, A., Seigneuric, R. & Garrido, C. Targeting heat shock proteins in cancer. Cancer Lett. 332, 275–285 (2013).
Bruey, J. M. et al. Hsp27 negatively regulates cell death by interacting with cytochrome c. Nat. Cell Biol. 2, 645–652 (2000).
Creagh, E. M., Sheehan, D. & Cotter, T. G. Heat shock proteins - modulators of apoptosis in tumour cells. 14, 1161–1173 (2000).
Lanneau, D., de Thonel, A., Maurel, S., Didelot, C. & Garrido, C. Apoptosis versus cell differentiation: role of heat shock proteins HSP90, HSP70 and HSP27. Prion 1, 53–60 (2007).
Parcellier, A. et al. HSP27 favors ubiquitination and proteasomal degradation of p27Kip1 and helps S-phase re-entry in stressed cells. FASEB J. 20, 1179–1181 (2006).
Soo, E. T. L., Yip, G. W. C., Lwin, Z. M., Kumar, S. D. & Bay, B.-H. Heat shock proteins as novel therapeutic targets in cancer. In Vivo 22, 311–5 (2008).
Zoppino, F. C. M., Guerrero-Gimenez, M. E., Castro, G. N. & Ciocca, D. R. Comprehensive transcriptomic analysis of heat shock proteins in the molecular subtypes of human breast cancer. BMC Cancer 18, 700 (2018).
Liberek, K., Marszalek, J., Ang, D., Georgopoulos, C. & Zylicz, M. Escherichia coli DnaJ and GrpE heat shock proteins jointly stimulate ATPase activity of DnaK. J Biol Chem 88, 2874–2878 (1991).
Brehmer, D. et al. Tuning of chaperone activity of Hsp70 proteins by modulation of nucleotide exchange. Nat. Struct. Biol. 8, 427–32 (2001).
Daugaard, M., Jäättelä, M. & Rohde, M. Hsp70-2 is required for tumor cell growth and survival. Cell Cycle 4, 877–880 (2005).
Scieglinska, D. et al. HSPA2 is expressed in human tumors and correlates with clinical features in non-small cell lung carcinoma patients. Anticancer Res. 34, 2833–2840 (2014).
Zhang, H., Chen, W., Duan, C.-J. & Zhang, C.-F. Overexpression of HSPA2 is correlated with poor prognosis in esophageal squamous cell carcinoma. World J. Surg. Oncol. 11, 141 (2013).
Zhang, H. et al. Expression and clinical significance of HSPA2 in pancreatic ductal adenocarcinoma. Diagn. Pathol. 10, 13 (2015).
Li, S.-L. et al. Expression of SCCA1 in human hepatocellular carcinoma and its clinical significance. World Chinese J. Dig. 22, 1015–1021 (2014).
Jagadish, N. et al. Heat shock protein 70-2 (HSP70-2) overexpression in breast cancer. J. Exp. Clin. Cancer Res. 35, 150 (2016).
Vickery, L. E. & Cupp-Vickery, J. R. Molecular chaperones HscA/Ssq1 and HscB/Jac1 and their roles in iron-sulfur protein maturation. Crit. Rev. Biochem. Mol. Biol. 42, 95–111 (2007).
Yersal, O. Biological subtypes of breast cancer: Prognostic and therapeutic implications. World J. Clin. Oncol. 5, 412 (2014).
Cheng, Q. et al. Amplification and high-level expression of heat shock protein 90 marks aggressive phenotypes of human epidermal growth factor receptor 2 negative breast cancer. Breast Cancer Res 14, R62 (2012).
Yano, M. et al. Expression of hsp90 and cyclin D1 in human breast cancer. Cancer Lett. 137, 45–51 (1999).
Teng, S. C. et al. Direct Activation of HSP90A Transcription by c-Myc Contributes to c-Myc-induced Transformation. J. Biol. Chem. 279, 14649–14655 (2004).
Stecklein, S. R. et al. BRCA1 and HSP90 cooperate in homologous and non-homologous DNA double-strand-break repair and G2/M checkpoint activation. Proc. Natl. Acad. Sci. 109, 13650–13655 (2012).
Pennisi, R., Antoccia, A., Leone, S., Ascenzi, P. & di Masi, A. Hsp90α regulates ATM and NBN functions in sensing and repair of DNA double-strand breaks. FEBS J. 284, 2378–2395 (2017).
Perotti, C. et al. Heat shock protein-90-alpha, a prolactin-STAT5 target gene identified in breast cancer cells, is involved in apoptosis regulation. Breast Cancer Res. 10, 1–18 (2008).
Eustace, B. K. et al. Functional proteomic screens reveal an essential extracellular role for hsp90α in cancer cell invasiveness. Nat. Cell Biol. 6, 507–514 (2004).
Queitsch, C., Sangstert, T. A. & Lindquist, S. Hsp90 as a capacitor of phenotypic variation. Nature 417, 618–624 (2002).
Rutherford, S. L. & Lindquist, S. Hsp90 as a capacitor for morphologival evolution. Nature 396, 336–342 (1998).
Joachimiak, L. A., Walzthoeni, T., Liu, C. W., Aebersold, R. & Frydman, J. The structural basis of substrate recognition by the eukaryotic chaperonin TRiC/CCT. Cell 159, 1042–1055 (2014).
Leitner, A. et al. The molecular architecture of the eukaryotic chaperonin TRiC/CCT. Structure 20, 814–825 (2012).
Trinidad, A. G. et al. Interaction of p53 with the CCT Complex Promotes Protein Folding and Wild-Type p53 Activity. Mol. Cell 50, 805–817 (2013).
McClellan, A. J., Scott, M. D. & Frydman, J. Folding and quality control of the VHL tumor suppressor proceed through distinct chaperone pathways. Cell 121, 739–748 (2005).
Kasembeli, M. et al. Modulation of STAT3 Folding and Function by TRiC/CCT Chaperonin. PLoS Biol. 12 (2014).
Won, K.-A., Schumacher, R. J., Farr, G. W., Horwich, A. L. & Reed, S. I. Maturation of human cyclin E requires the function of eukaryotic chaperonin CCT. Mol. Cell. Biol. 18, 7584–9 (1998).
Boudiaf-Benmammar, C., Cresteil, T. & Melki, R. The Cytosolic Chaperonin CCT/TRiC and Cancer Cell Proliferation. PLoS One 8, 1–10 (2013).
Carr, A. C. et al. Targeting chaperonin containing TCP1 (CCT) as a molecular therapeutic for small cell lung cancer. Oncotarget 8, 110273–110288 (2017).
Bassiouni, R. et al. Chaperonin containing TCP-1 protein level in breast cancer cells predicts therapeutic application of a cytotoxic peptide. Clin. Cancer Res. 22, 4366–4379 (2016).
Guest, S. T., Kratche, Z. R., Bollig-Fischer, A., Haddad, R. & Ethier, S. P. Two members of the TRiC chaperonin complex, CCT2 and TCP1 are essential for survival of breast cancer cells and are linked to driving oncogenes. Exp. Cell Res. 332, 223–235 (2015).
Zhang, Y. et al. Molecular chaperone CCT3 supports proper mitotic progression and cell proliferation in hepatocellular carcinoma cells. Cancer Lett. 372, 101–109 (2016).
Cui, X., Hu, Z. P., Li, Z., Gao, P. J. & Zhu, J. Y. Overexpression of chaperonin containing TCP1, subunit 3 predicts poor prognosis in hepatocellular carcinoma. World J. Gastroenterol. 21, 8588–8604 (2015).
Li, L.-J. et al. Chaperonin containing TCP-1 subunit 3 is critical for gastric cancer growth. Oncotarget 8, 111470–111481 (2017).
Yang, X. et al. Chaperonin-containing T-complex protein 1 subunit 8 promotes cell migration and invasion in human esophageal squamous cell carcinoma by regulating α-actin and β-tubulin expression. Int. J. Oncol. (2018).
Zhu, Q. L., Wang, T. F., Cao, Q. I. F., Zheng, M. H. & Lu, A. G. Inhibition of cytosolic chaperonin CCTζ-1 expression depletes proliferation of colorectal carcinoma in vitro. J. Surg. Oncol. 102, 419–423 (2010).
Sergeeva, O. A. et al. Human CCT4 and CCT5 chaperonin subunits expressed in Escherichia coli form biologically active homo-oligomers. J. Biol. Chem. 288, 17734–17744 (2013).
Brackley, K. I. & Grantham, J. Subunits of the chaperonin CCT interact with F-actin and influence cell shape and cytoskeletal assembly. Exp. Cell Res. 316, 543–553 (2010).
Grantham, J., Brackley, K. I. & Willison, K. R. Substantial CCT activity is required for cell cycle progression and cytoskeletal organization in mammalian cells. Exp. Cell Res. 312, 2309–2324 (2006).
Roobol, A., Sahyoun, Z. P. & Carden, M. J. Selected subunits of the cytosolic chaperonin associate with microtubules assembled in vitro. J. Biol. Chem. 274, 2408–15 (1999).
Kabir, M. A. et al. Physiological effects of unassembled chaperonin Cct subunits in the yeast Saccharomyces cerevisiae. Yeast 22, 219–239 (2005).
Gao, J. et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci. Signal. 6, pl1 (2013).
Cerami, E. et al. The cBio Cancer Genomics Portal: An open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2, 401–404 (2012).
Györffy, B. et al. An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Res. Treat. 123, 725–731 (2010).
Metsalu, T., Vilo, J., Science, C. & Liivi, J. ClustVis: a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap. 43, 566–570 (2015).
Gentles, A. J. et al. The prognostic landscape of genes and infiltrating immune cells across human cancers. Nat. Med. 21, 938–945 (2015).
Subramanian, A. et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. 102, 15545–15550 (2005).
Mootha, V. K. et al. PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat. Genet. 34, 267–273 (2003).
This research was supported by National Science Centre under MAESTRO grant No: DEC-2012/06/A/NZ1/00089.
The authors declare no competing interests.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.