Abstract
Aberrant activation of DNA repair is frequently associated with tumor progression and response to therapy in hepatocellular carcinoma (HCC). Bioinformatics analyses of HCC data in the Cancer Genome Atlas (TCGA) were performed to define DNA repair based molecular classification that could predict the prognosis of patients with HCC. Furthermore, we tested its predictive performance in 120 independent cases. Four molecular subgroups were identified on the basis of coordinate DNA repair cluster (CDRC) comprising 15 genes in TCGA dataset. Increasing expression of CDRC genes were significantly associated with TP53 mutation. High CDRC was significantly correlated with advanced tumor grades, advanced pathological stage and increased vascular invasion rate. Multivariate Cox regression analysis indicated that the molecular subgrouping was an independent prognostic parameter for both overall survival (p = 0.004, hazard ratio (HR): 2.989) and tumor-free survival (p = 0.049, HR: 3.366) in TCGA dataset. Similar results were also obtained by analyzing the independent cohort. These data suggest that distinct dysregulation of DNA repair constituents based molecular classes in HCC would be useful for predicting prognosis and designing clinical trials for targeted therapy.
Similar content being viewed by others
Introduction
Hepatocellular carcinoma (HCC) is the leading cause of cancer death worldwid with dismal prognosis1. Although surgical resection offers the best chance of cure of HCC, the prognosis after surgery differs considerably among patients, which has hampered both treatment and prognostic prediction. Much work has been devoted to identifying histopathological and biochemical markers or establishing prognostic models for identifying groups of HCC that differ with respect to prognosis2. However, HCC, even in the same tissue, is a very heterogeneous disease that differs widely in clinical outcome and in response to therapy3. Different molecular defects that can induce similar tumor phenotypes predispose the heterogeneity of HCC4. Therefore, genomic information from tumor tissue will refine the prognostic prediction of HCC patients and facilitate the identification of genetic determinants that are components of the specific regulatory pathways altered in cancers, providing the opportunity to precision medicine5.
The application of microarray or high-throughput technologies which rely on thousands of pieces of bio-information and provide an accurate landscape of HCC genetic alterations have enabled researchers to measure the expression of a large number of genes in HCC for identifying tumor subtypes6. Gene expression profiling studies in HCC have reported molecular signatures that were associated with prognosis7,8,9. However, most studies that have been conducted to date involved relatively few patients, which was not clear whetner results could be generalized into large clinical populations with the collection of samples at different sites making for variation in study cohort. As such, a coherent molecular explanation for the HCC heterogeneity and prognostic prediction has yet to be reported. Furthermore, laboratory-based functional analysis is an important part of the evaluation for microarray or high-throughput analysis and it remains unclear how this would be incorporated into routine clinical application of sophisticated and non-uniform gene-expression signatures in HCC prognostic prediction9,10. Therefore, gene-expression pattern of the specific HCC regulatory pathways which could predict or stratify the prognostic subgroups of HCC needed to be identified11,12.
DNA repair process is constantly active as it responds to damage in the DNA structure and proceed by several mechanisms, including base excision repair, mismatch excision repair, nucleotide excision repair, homologous recombination13. A cell that has accumulated a large amount of DNA damage, or one that no longer effectively repairs damage incurred to its DNA, can enter one of three possible states: senescence, apoptosis and cancerous14. Cancer tissues overexpress DNA repair genes and thus develop greater DNA repair capacity than normal tissues15. Therefore, multiple DNA damage signals and DNA repair pathways could have a significant impact on prognosis and response to therapy for various cancers16.
A thorough understanding of the DNA repair genes expression pattern in tumors would be of utmost importance in HCC prognostic prediction and improving therapy and in achieving the best therapeutic response17. In the present study, we investigated the possibility that the expression pattern of the HCC associated DNA repair genes obtained at diagnosis would permit the identification of distinct subclasses of HCC patients with different prognosis in the Cancer Genome Atlas (TCGA) dataset. Furthermore, we tested the expression pattern of co-regulated cluster of DNA repair genes at both protein and mRNA level to explore the relationship with prognosis of HCC patients in another separate cohort.
Results
Molecular subgroups based on DNA repair genes in hepatocellular carcinoma
Based on previous studies on molecular networks incorporating DNA repair genes linked with hepatocarcinogenesis or progression in HCC, 59 initial genes were included in the preliminary analyses13,18,19 (Supplementary Table S1). These genes refered to mismatch excision repair (MSH2, MSH3, MSH6, MLH1, PMS2, MLH3), base excision repair (MBD4, TDG, OGG1, NEIL3, APEX1), nucleotide excision repair (XPC, RAD23B, XPA, DDB1, RPA2, ERCC6), homologous recombination (RAD51, XRCC3, RAD52, BRCA1, RAD50, MRE11A, NBN, MUS81), non-homologous end-joining (XRCC6, XRCC5, PRKDC, LIG4, XRCC4), chromatin structure and modification and DNA polymerases. DNA damage response genes (ATR, CHEK1, CHEK2, TP53, TP53BP1, WRN, ATM) were also included.
Unsupervised two-way hierarchical clustering analysis was performed to assess the extent to which gene expression was coordinate or independent across the patient cohort. A 15-gene subset was identified by the visual analysis of the clustering, expression of which was coordinately regulated and formed a clear gene grouping (Fig. 1, Supplementary Table S2). The validity of this gene clustering was confirmed by the gene-tree analysis of the dendrogram. These 15 highly coordinated genes (distance threshold 1.64) associated with some different DNA repair pathways, including mismatch excision repair, base excision repair, homologous recombination and DNA damage response genes. The degree of correlation among the 15 genes was notable (Supplementary Fig. S1). The genes formed a clearly coordinated block on visual and correlation analysis. Therefore, we proceeded with the 15-gene block and termed this cluster as coordinate DNA repair cluster (CDRC). Four distinct patient groups were delineated by the CDRC (Fig. 1). The patients in the group 2 had the highest CDRC expression (mean expression 1.11), while the lowest in the group 3 (mean expression −0.60). Clinical characteristics of patients in the four groups by CDRC were shown in Fig. 2.
Molecular determinants of the coordinate DNA repair cluster
Each tumor accumulates numerous damaging mutations, therefore, three major groups of associated alterations, including the CTNNB1 cluster (CTNNB1, TERT, KMT2D, ARID2, APOB, NFE2L2), the AXIN1 cluster (AXIN1, ARID1A, RPS6KA3) and the tumour protein p53 (TP53) cluster (TP53, KEAP1, CCND1, TSC2) were identified by exome sequencing of HCC20. The mutational status of three major groups in the cohort was as follows: CTNNB1 mutant 19.0%, KMT2D mutant 7.5%, ARID2 mutant 2.2%, APOB mutant 9.0%, NFE2L2 mutant 1.5%, AXIN1 mutant 3.4%, ARID1A mutant 5.6%, RPS6KA3 mutant 2.2%, TP53 mutant 22.4%, KEAP1 mutant 2.6% and TSC2 mutant 3.4%. None variant of the TERT or CCND1 was found in this cohort. Table 1 showed the distribution of three major groups of associated alterations in each distinct patient group. The frequency of TP53 mutation was significantly lower in group 3 than other patient groups, group 3 (6.7%) < group 4 (20.2%) < group 1 (34.3%) < group 2 (37.5%), (p < 0.05). Multivariate linear regression analysis indicated that all the CDRC genes expression were significantly associated with TP53 mutation (p < 0.01). There was a strong trend for CTNNB1 mutation to be associated with low CDRC expression. Multivariate analysis revealed that decreased expression of ERCC6, FANCD2, H2AFX, MSH6, RAD51 and XRCC3 were associated with CTNNB1 mutation (p < 0.05, Supplementary Table S3).
To determine whether any other molecular characteristics were associated with CDRC expression, we analyzed the correlation of the methylation and copy number status of the CDRC with the gene expression. Interestingly, most CDRC genes had a significant negative correlation between methylation beta-values and mRNA expression Z score (Fig. 3). Furthermore, copy number statuses, including putative copy-number alterations from GISTIC and log2 copy-number values, were significantly positive associated with gene mRNA expression in most CDRC genes (Supplementary Figs S2 and S3).
Molecular subgroups correlate with clinicopathological variables
Age distribution was closely associated with four DNA repair based molecular subgroups of HCC described herein. Patients in group 2 had lower mean age compared to other CDRC clusters (p < 0.05). The gender distribution of all subtypes was presented in Fig. 2, with some significant differences between subgroups. There was a preponderance of males represented in the group 1 (male:female = 2.7:1), group 3 (male:female = 1.8:1) and group 4 (male:female = 1.5:1) subgroups, whereas in the group 2, males were in the minority (male:female = 0.47:1) (p < 0.001). The history HCC risk factor distribution in each subgroup was similar. Advanced tumor grade (grade G3–G4), advanced AJCC pathological stage (stage III–IV) and vascular invasion were observed in lower frequencies in group 3 compared to the other groups (Supplementary Fig. S4). For each individual aspect of AJCC TNM staging system, the frequencies of T1/2 and no distant metastasis were considerably higher in Group 3; the frequency of no regional lymph node involvement was also relatively lower in Group 3.
Molecular subgroups correlate with hepatocellular carcinoma progression
Survival analyses for all patients with available outcome data (n = 239) showed remarkable differences in overall survival (OS) and tumor-free survival (TFS) between the molecular subgroups. Patients in group 3 showed a favorable outcome, with 5-year OS rates of ~45% (Fig. 4A) and PFS rates of ~75% (Fig. 4B). Notably, patients comprising this subgroup had the lowest frequency of TP53 mutation, which was consistent with published literatures21,22. All other subgroups have a more dismal outcome, with 5-year OS rates ranging from 14–30% (Fig. 4A) and TFS rates ranging from 31–51% (Fig. 4B). Although a substantial proportion of patients in group 3 was classified as favorable histology (early tumor grades, early AJCC pathological stages and without vascular invasion), correlation between molecular subgroups (especially between group 2 and group 3) and OS and TFS rate in HCC patients was independent of tumor grade (Fig. 5A, Supplementary Fig. S5A), AJCC pathological stage (Fig. 5B, Supplementary Fig. S5B) and vascular invasion (Fig. 5E, Supplementary Fig. S5E). Therefore, as one example for the clinical utility of molecular subgrouping over conventional histopathology, molecular classification by CDRC in these instances revealed HCC subgroup with favorable outcome that would not be yielded by histopathological analysis.
Examining other clinical variables for subgroup-specific prognostic value showed that OS and TFS rate in HCC patients between group 2 and group 3 were independent of gender, age and HCC history risk factors (Supplementary Figs S6 and S7). Although, TP53 mutation showed no prognostic value in the present series, highly significant differences in OS and TFS between group 2 and group 3 were also seen in the patients without TP53 mutation (Supplementary Fig. S8).
According to the results above, a multivariate Cox regression analysis was performed to assess whether CDRC molecular subgrouping was independent prognostic parameters for HCC in group 2 and group 3. Results showed that in group 2 and group 3, CDRC molecular subgrouping was the independent prognostic parameter for both OS (hazard ratio (HR): 2.989, 95% confidence interval (CI) 1.431–6.243, P = 0.004) and TFS (HR: 3.366, 95% CI 1.053–11.889, P = 0.049) (Table 2). Pathological stage and T classification were also the independent prognostic parameter for OS and TFS, respectively. Furthermore, the performance of CDRC clusters for identifying the HCC subgroups was validated in our own cohort. Due to the consistent expression of the genes in CDRC clusters (Supplementary Fig. S1), we characterized the expression pattern of MSH2 which is the component of the post-replicative DNA mismatch repair system23 in HCC tissues and assessed the clinical significance of MSH2 expression in HCC patients from an independent cohort. One hundred and twenty patients were enrolled with m ean follow-up of 612.0 ± 230.2 days. The clinicopathological characteristics of the patients were summarized in Supplementary Table S4.
MSH2 protein expression was evaluated by immunohistochemical staining. According to the MSH2 expression pattern by immunohistochemical staining, HCC tissues was divided into low (n = 41), moderate (n = 48) and high (n = 31) expression groups (Fig. 6). The characteristics of patient groups by MSH2 expression pattern were shown Table 3. MSH2 protein level analyzed by Western blot (Fig. 7) was consistent with that by immunochemistry. MSH2 mRNA expression levels were examined using real-time quantitative PCR in the same tumor tissues. The results demonstrated that MSH2 mRNA expression pattern was similar to protein level (Supplementary Fig. S9). Furthermore, MSH2 expression pattern (between low expression group and high expression group) in HCC tissues was closely correlated with pathological stage (P < 0.001) and T classification (P < 0.001). No significant associations were observed between MSH2 expression pattern (between low expression group and high expression group) and age, gender, tumor size, tumor grade and vascular invasion. As shown in Supplementary Fig. S10, low MSH2 expression pattern was associated with favorable OS (P < 0.001). Kaplan-Meier analyses showed that mortality was lower in HCC patients with lower MSH2 expression, which was independent of pathological stage, pathologic T classification, tumor grade, vascular invasion, gender and age (Supplementary Fig. S10). Multivariate Cox regression analyses showed that MSH2 expression pattern (between high expression group and low expression group) could be useful as an independent predictor for the prognosis in HCC patients (HR: 3.375, 95% CI 1.483–7.681, P = 0.004, Supplementary Table S5).
Discussion
Many genetic and epigenetic disruptions in HCC indicate that it is characterized by remarkable molecular heterogeneity24. Gene expression profiles of HCC could define the molecular characteristics of the tumors and show a good prognostic value25,26,27,28. However, common laboratory practices can not address the research needs of molecular biology. To focus on HCC specific pathway molecular-stratification, cluster analysis was used for molecular stratification based on DNA repair gene expression patterns in TCGA. In current study, we identified a group of 15 tightly co-regulated DNA repair-related genes termed as CDRC and used this to assess differences in the prognosis of HCC patients. Four molecular subtypes of HCC with distinct progression were classified by CDRC expression pattern.
The relationship between CDRC expression pattern and clinicopathologic parameters was evaluated. The results indicated that increased DNA repair capacity was correlated with poor tumor differentiation and poor survival29,30, which was also validated in the independent cohort. However, interestingly, intrinsic HCC subtypes by CDRC were not contingent on tumour stage or other single clinicopathologic parameter. The subtypes differed significantly with respect to several biological and clinical parameters associated with different tumor biologic behavior. Furthermore, the major finding of the current study was that mortality in patients could be predicted accurately using our CDRC based molecular groups, especially between group 2 and group 3. Of note, expression of MSH2, as well as other DNA repair genes31,32, in tumor tissues has been used in many investigations, revealing that DNA repair capacity could be a potential predictor of prognosis in some human malignancies33,34. Down-regulation of MSH2 expression by Hsp90 inhibitor could enhance pemetrexed-induced cytotoxicity in human non-small-cell lung cancer cells35. Inhibition of p38 MAPK-dependent MSH2 expression by metformin could enhance gefitinib-induced cytotoxicity in human squamous lung cancer cells36. EXO1 up-regulated expression could protect ovarian cancer cells from cisplatin-mediated apoptosis and attenuating EXO1 expression by small interfering RNA could augment the chemotherapy efficacy against ovarian cancer37. Therefore, we speculated that prognosis might be an intrinsic property of the HCC classified by CDRC clusters.
DNA repair genes affect cell proliferation or survival indirectly by influencing the ability of the organism to repair nonlethal damage in other genes, including protooncogenes, tumor suppressor genes and genes that regulate apoptosis38,39. By removing DNA lesions in tumors, increased DNA repair is one of major mechanisms for development of resistance to therapy, which affects patient survival. Therefore, knowledge of DNA protein expression patterns in cancerous tissues may help guide development of therapeutic strategies and treatments for cancers and furthermore predict the response and survival. Numerous inhibitors of DNA repair which may help selectively kill tumors have been developed and are being tested in clinical trials. In our study, CDRC refers to mismatch excision repair, base excision repair, nucleotide excision repair and homologous recombination, which could reflect DNA repair capacity40. Therefore, CDRC clusters may potentially lead to personalized therapy for HCC patients41.
In addition, we investigated the mutational status of three major groups (CTNNB1 cluster, AXIN1 cluster and TP53 cluster) associated with CDRC expression in HCC. TP53 mutant tumors which had a high mutational burden were associated with high CDRC expression, with only 6.7% of cancers in the group exhibiting very low expression (group 3) being TP53 mutant. Genomic instability could be enhanced by TP53 mutant, which might lead to an increase DNA repair capacity. Furthermore, convincing evidence show that TP53 mutant proteins play an important gain-of-function role in promoting invasion and metastasis of tumors. Recent studies have indicated that genome-wide and cell type-specific alterations in miRNA expression during DNA damage response could be regulated by TP5342,43. In addition, TP53 mutation could affected the prognosis for patients with HCC in the total population from the TCGA dataset (data not shown), which was consistent with those from previous studies21,44. However, molecular classification by CDRC was the independent prognostic factor of HCC after adjusting TP53 mutation. CTNNB1 is a target for mutations when mismatch repair is impaired45. CTNNB1 mutation was significantly associated with lower CDRC expression, which designated a subset of low-grade, low-stage HCC and hence more favorable prognosis46. Our data complemented previous studies demonstrating widespread abnormalities of DNA repair response in mutational status of three major groups in HCC. Our findings indicated that methylation and copy number status of the CDRC also played an important role to regulate the gene expression.
Molecular characterization of HCC based on DNA repair gene expression would improve the prediction of the clinical outcome of HCC patients and selection of treatments for specific molecular subtypes of HCC. Nevertheless, our findings needed to be validated and further refined in a large prospective patient cohort. Furthermore, the efficacy of adjuvant therapies for HCC, such as chemotherapy, and/or molecular targeted therapies could be assessed in the context of specific molecular subgroups, as the response to different therapy modalities would likely differ.
Materials and Methods
Identification of initial DNA repair genes in HCC
Human DNA repair genes or genes in DNA repair pathways were extracted as initial genes13,18,19. Molecular network pathway analysis was used for expanding the gene list, by which further molecules and genes with known genetic, pathway and functional associations with our initial genes could be revealed. Finally, the genes that were relevant to the HCC were screened for inclusion into our final gene list (Supplementary Table S1).
Expression datasets preparation
HCC TCGA dataset containing Normalized Agilent microarray and RNAseq z-score data, copy-number status generated by GISTIC, log2 copy-number value and methylation (HM450) beta-values with the most anti-correlated with expression for each initial gene and mutation data in three major groups: the CTNNB1 cluster (CTNNB1, TERT, MLL2, ARID2, APOB, NFE2L2), the AXIN1 cluster (AXIN1, ARID1A, RPS6KA3) and the TP53 cluster (TP53, KEAP1, CCND1, TSC2) identified by exome sequencing of HCC were extracted20. The mRNA expression data was generated using the Illumina HiSeq 2000 RNA Sequencing platform and were normalized to sample medians as previously described. mRNA expression data for each initial gene was downloaded as z-scores from the cBioPortal (http://www.cbioportal.org/)47. Copy-number data for HCC samples were generated from array comparative genomic hybridization data acquired using the Affymetrix Genome-Wide Human SNP Array 6.0 platform. Raw data was analyzed using the GISTIC2 method to generate gene-level copy-number calls and downloaded from Memorial Sloan-Kettering Cancer Center’s cBioPortal for Cancer Genomics (http://www.cbioportal.org/). GISTIC2-generated copy-number estimates (log 2 -transformed values, not thresholded) were downloaded from the UCSC Cancer Browser (https://genome-cancer.ucsc.edu). Methylation data was generated using Illumina Infinium Human DNA Methylation 450 platform and downloaded as beta-values from the cBioPortal (http://www.cbioportal.org/)47. The genes would be excluded from the analysis, once expression data were unavailable (n = 268, up to June 1st, 2015). The full clinical dataset, including age, gender, the American Joint Committee on Cancer staging system (also be called AJCC TNM staging system), tumor status, vital status, new tumor eventrisk, risk factors and vascular invasion, were downloaded (up to June 1st, 2015) from the TCGA portal (https://tcga-data.nci.nih.gov/tcga/) and tabulated with genetic data.
Clinical tissue samples
A prospective cohort of 120 patients with HCC was recruited from the First Affiliated Hospital of Wenzhou Medical University. These patients were diagnosed with HCC between January, 2013 and December, 2014. Patients were excluded if they had a diagnosis of a concurrent cancer or cancers metastatic to the liver. Clinical information of each patient including details of pathology and outcomes with a regularly follow up was collected. The histology and clinical stages were classified according to the seventh edition of the American Joint Committee on Cancer (AJCC) staging system. The tumor grade 1–4 of HCC in pathology diagnosis is equivalent to well-differentiated, moderately-differentiated, poorly differentiated, or undifferentiated, respectively, under microscope (Supplementary Fig. 11). Grade 1 or well-differentiated: Cells appear normal and are not growing rapidly. Grade 2 or moderately-differentiated: Cells appear slightly different than normal. Grade 3 or poorly differentiated: Cells appear abnormal and tend to grow and spread more aggressively. Grade 4 or undifferentiated: features are not significantly distinguishing to make it look any different from undifferentiated cancers which occur in other organs. Vascular invasion is identified either as macroscopic, when the invasion of the vessel is visible on gross examination, or as microscopic, when the invasion is visible only on microscopy. Microscopic vascular invasion was defined as tumoral cells within a vascular space lined by endothelium that was visible only on microscopy and was assessed by several sections of non-tumoral hepatic parenchyma 1 cm away from the tumor48. The cases of HCC were selected in this study only if clinical data were available. The follow-up time was calculated from the date of surgery to the date of death, or the last known follow-up. None of them had received radiotherapy, chemotherapy, hormone therapy or other related anti-tumor therapies before surgery. Research ethics approval for this project was granted from the First Affiliated Hospital of Wenzhou Medical University and written informed consent was obtained from all patients or their guardians for the use of the biospecimens for research purposes, which were carried out in accordance with the approved guideline “Use of experimental animals and human subjects”. The samples were frozen and stored in liquid nitrogen immediately after surgically resected.
Immunochemistry
Immunohistochemical staining of MSH2 was performed on the 120 HCC sections. The sections were blocked in 3% hydrogen peroxide solution for 10 min at room temperature and then incubated with the MSH2 primary antibody (1:1000, Abcam, Cambridge, UK) overnight at 4 °C. A negative control was performed by replacing the primary antibody with PBS. The sections were then incubated with a horseradish peroxidase labeled secondary antibody (1:1000, Abcam, Cambridge, UK) at room temperature for 120 min. Finally, the signal was developed for visualization with diaminobenzidine and the sections were counterstained with hematoxylin. MSH2 expression score was conducted according to the percent of positive cells: 0–5% scored 0; 6–35% scored 1; 36–70% scored 2; more than 70% scored 3 and staining intensity: no staining scored 0, weakly staining scored 1, moderately staining scored 2 and strongly staining scored 3. The final score was determined using the ratio of positive cell score × staining intensity score as follows: “−” for a score of 0–1, “+” for a score of 2–3, “++” for a score of 4–6 and “+++” for a score of >6. Low expression was defined as a total score <4, moderate expression with a total score ≥ 4 and ≤6 and high expression with a total score > 6.
RNA extraction and real-time quantitative PCR
Total RNA was extracted from a section of fresh frozen tumor tissue using the RNeasy Mini Kit (Qiagen, Hilden, DE). Reverse transcription and PCR were executed using the RevertAid First Strand cDNA Synthesis kit (Thermo, Waltham, USA) and Power SYBR Green PCR kit (Applied Biosystems, Carlsbad, USA) according to the manufacturer’s instructions. Quantitative real-time PCR was performed using a 7500 Real-time PCR system (Applied Biosystems, Carlsbad, USA). Primer sequences for MSH2 detection were as follows, forward: 5′-AGAGACAGGTTGGAGTTGG-3′; reverse: 5′-CGGGTAAAACACATTCCTT-3′. The relative expression level of MSH2 was determined using the 2−∆∆Ct method and normalized to β-actin.
Western blotting
Total protein was extracted using a RIPA lysis buffer (Fdbio science, Hangzhou, China). The proteins were then separated by SDS-PAGE and transferred onto polyvinylidene fluoride membrane. The membrane was blocked with 5% nonfat dried milk and incubated with MSH2 primary antibody (1:5000, Abcam, Cambridge, UK) and goat anti-rabbit secondary antibody (1:10000, Abcam, Cambridge, UK). Proteins were detected by using enhanced chemiluminescence method and imaged by GelDoc™ XR+ system (Bio-Rad, Hercules, USA). GAPDH (1:5000, Abcam, Cambridge, UK) was chosen as an internal control and the expression level of MSH2 protein was normalized by GAPDH.
Statistical analysis
Unsupervised two-way hierarchical clustering of HCC samples and initial genes expression data was performed with Multiexperiment Viewer (MeV) 4.9.0 (Dana-Farber Cancer Institute, Boston, MA, USA), designed to allow the analysis of microarray data to identify patterns of gene expression and differentially expressed genes. Heatmap view of a diagram will come up that looks like lots of tiny blue and yellow boxes. Each row represents a specific gene, whereas each column represents each patient. Pearson correlation was used as the distance metric for hierarchical clustering. The number of patient and gene clusters can be adjusted by varying sample-tree or gene-tree distance-thresholds and visual analysis. The lower the distance range, the more clusters there will be since a shorter distance between similar genes means more groups. Gene cluster which could identify similar patients would be established. Normality of distributions was confirmed with the Anderson–Darling test. The correlations in gene expression within the main clusters and correlations of genes expression with their copy-number or methylation status were investigated by Pearson Coefficient of determination (R2) values using R software (v3.1.2). Multivariate linear regression models were used to analyze associations between genes expression with the mutations in three major groups: the CTNNB1 cluster (CTNNB1, TERT, MLL2, ARID2, APOB, NFE2L2), the AXIN1 cluster (AXIN1, ARID1A, RPS6KA3) and the TP53 cluster (TP53, KEAP1, CCND1, TSC2). An independent two-sample t-test was used for comparison of continuous variables for normally distributed data and Mann–Whitney U tests for non-normally distributed data. The χ2-test was used for comparison of categorical variables, if any groups contained less than 5, the Fisher’s Exact test was used in preference. Survival curves were evaluated using the Kaplan-Meier method and differences between survival curves were tested by the log-rank test in the different groups. Cox proportional hazards regression model was used to examine univariate and multivariate hazard ratios for the prognostic parameters, including molecular groups and clinicopathological variables that were dichotomized. Only significantly different variables in univariate analysis were entered into the next multivariate analysis. Furthermore, the performance of DNA repair genes clusters for identifying the HCC subgroups was validated in our own cohort. The deceased patients were excluded in the survival analysis due to overall survival more than 5 years; patients who died or were lost to follow-up within 30 days were also excluded. The analyses were carried out using the SPSS Statistics 21.0 (SPSS Inc., an IBM Company) and MedCalc version 14.8 (MedCalc, Mariakerke, Belgium). A 2-tailed P value < 0.05 was considered to be statistically significant.
Additional Information
How to cite this article: Lin, Z. et al. Prognostic value of DNA repair based stratification of hepatocellular carcinoma. Sci. Rep. 6, 25999; doi: 10.1038/srep25999 (2016).
References
El-Serag, H. B. & Kanwal, F. Epidemiology of hepatocellular carcinoma in the United States: where are we? Where do we go? Hepatology 60, 1767–1775, 10.1002/hep.27222 (2014).
Zucman-Rossi, J., Villanueva, A., Nault, J. C. & Llovet, J. M. Genetic Landscape and Biomarkers of Hepatocellular Carcinoma. Gastroenterology 149, 1226–1239 e1224, 10.1053/j.gastro.2015.05.061 (2015).
Friemel, J. et al. Intratumor heterogeneity in hepatocellular carcinoma. Clin Cancer Res 21, 1951–1961, 10.1158/1078-0432.CCR-14-0122 (2015).
Nault, J. C. & Villanueva, A. Intratumor molecular and phenotypic diversity in hepatocellular carcinoma. Clin Cancer Res 21, 1786–1788, 10.1158/1078-0432.CCR-14-2602 (2015).
Vilarinho, S. & Calvisi, D. F. New advances in precision medicine for hepatocellular carcinoma recurrence prediction and treatment. Hepatology 60, 1812–1814, 10.1002/hep.27311 (2014).
Pinyol, R., Nault, J. C., Quetglas, I. M., Zucman-Rossi, J. & Llovet, J. M. Molecular profiling of liver tumors: classification and clinical translation for decision making. Semin Liver Dis 34, 363–375, 10.1055/s-0034-1394137 (2014).
Amaddeo, G. et al. Integration of tumour and viral genomic characterizations in HBV-related hepatocellular carcinomas. Gut 64, 820–829, 10.1136/gutjnl-2013-306228 (2015).
Peveling-Oberhag, J. et al. Feasibility of global miRNA analysis from fine-needle biopsy FFPE material in patients with hepatocellular carcinoma treated with sorafenib. Clin Sci (Lond) 128, 29–37, 10.1042/CS20140007 (2015).
Miao, R. et al. Identification of prognostic biomarkers in hepatitis B virus-related hepatocellular carcinoma and stratification by integrative multi-omics analysis. J Hepatol 61, 840–849, 10.1016/j.jhep.2014.05.025 (2014).
Wei, L. et al. Application of microRNA and mRNA expression profiling on prognostic biomarker discovery for hepatocellular carcinoma. BMC Genomics 15 Suppl 1, S13, 10.1186/1471-2164-15-S1-S13 (2014).
Lachenmayer, A. et al. Wnt-pathway activation in two molecular classes of hepatocellular carcinoma and experimental modulation by sorafenib. Clin Cancer Res 18, 4997–5007, 10.1158/1078-0432.CCR-11-2322 (2012).
Kaposi-Novak, P. et al. Met-regulated expression signature defines a subset of human hepatocellular carcinomas with poor prognosis and aggressive phenotype. J Clin Invest 116, 1582–1595, 10.1172/JCI27236 (2006).
Wood, R. D., Mitchell, M., Sgouros, J. & Lindahl, T. Human DNA repair genes. Science 291, 1284–1289, 10.1126/science.1056154 (2001).
Pearl, L. H., Schierz, A. C., Ward, S. E., Al-Lazikani, B. & Pearl, F. M. Therapeutic opportunities within the DNA damage response. Nat Rev Cancer 15, 166–180, 10.1038/nrc3891 (2015).
Kirkali, G. et al. Evidence for upregulated repair of oxidatively induced DNA damage in human colorectal cancer. DNA Repair (Amst) 10, 1114–1120, 10.1016/j.dnarep.2011.08.008 (2011).
Dizdaroglu, M. Oxidatively induced DNA damage and its repair in cancer. Mutat Res Rev Mutat Res 763, 212–245, 10.1016/j.mrrev.2014.11.002 (2015).
Cardin, R. et al. Oxidative damage in the progression of chronic liver disease to hepatocellular carcinoma: an intricate pathway. World J Gastroenterol 20, 3078–3086, 10.3748/wjg.v20.i12.3078 (2014).
Wood, R. D., Mitchell, M. & Lindahl, T. Human DNA repair genes, 2005. Mutat Res 577, 275–283, 10.1016/j.mrfmmm.2005.03.007 (2005).
Lange, S. S., Takata, K. & Wood, R. D. DNA polymerases and cancer. Nat Rev Cancer 11, 96–110, 10.1038/nrc2998 (2011).
Guichard, C. et al. Integrated analysis of somatic mutations and focal copy-number changes identifies key genes and pathways in hepatocellular carcinoma. Nat Genet 44, 694–698, 10.1038/ng.2256 (2012).
Liu, J. et al. Alterations of TP53 are associated with a poor outcome for patients with hepatocellular carcinoma: evidence from a systematic review and meta-analysis. Eur J Cancer 48, 2328–2338, 10.1016/j.ejca.2012.03.001 (2012).
Villanueva, A. & Hoshida, Y. Depicting the role of TP53 in hepatocellular carcinoma progression. J Hepatol 55, 724–725, 10.1016/j.jhep.2011.03.018 (2011).
Burdova, K., Mihaljevic, B., Sturzenegger, A., Chappidi, N. & Janscak, P. The Mismatch-Binding Factor MutSbeta Can Mediate ATR Activation in Response to DNA Double-Strand Breaks. Mol Cell 59, 603–614, 10.1016/j.molcel.2015.06.026 (2015).
Li, L. & Wang, H. Heterogeneity of liver cancer and personalized therapy. Cancer Lett, 10.1016/j.canlet.2015.07.018 (2015).
Boyault, S. et al. Transcriptome classification of HCC is related to gene alterations and to new therapeutic targets. Hepatology 45, 42–52, 10.1002/hep.21467 (2007).
Villanueva, A., Toffanin, S. & Llovet, J. M. Linking molecular classification of hepatocellular carcinoma and personalized medicine: preliminary steps. Curr Opin Oncol 20, 444–453, 10.1097/CCO.0b013e328302c9e9 (2008).
Lee, J. S. et al. A novel prognostic subtype of human hepatocellular carcinoma derived from hepatic progenitor cells. Nat Med 12, 410–416, 10.1038/nm1377 (2006).
Kim, S. M. et al. Sixty-five gene-based risk score classifier predicts overall survival in hepatocellular carcinoma. Hepatology 55, 1443–1452, 10.1002/hep.24813 (2012).
Tu, Z. et al. BRCC3 acts as a prognostic marker in nasopharyngeal carcinoma patients treated with radiotherapy and mediates radiation resistance in vitro. Radiat Oncol 10, 123, 10.1186/s13014-015-0427-3 (2015).
Xu, W. et al. Enhanced expression of cohesin loading factor NIPBL confers poor prognosis and chemotherapy resistance in non-small cell lung cancer. J Transl Med 13, 153, 10.1186/s12967-015-0503-3 (2015).
Jentzsch, T., Robl, B., Husmann, M., Bode-Lesniewska, B. & Fuchs, B. Expression of MSH2 and MSH6 on a tissue microarray in patients with osteosarcoma. Anticancer Res 34, 6961–6972, (2014).
Deng, Q. et al. Prognostic value of ERCC1 mRNA expression in non-small cell lung cancer, breast cancer and gastric cancer in patients from Southern China. Int J Clin Exp Pathol 7, 8312–8321, (2014).
Butkiewicz, D. et al. Influence of DNA repair gene polymorphisms on prognosis in inoperable non-small cell lung cancer patients treated with radiotherapy and platinum-based chemotherapy. Int J Cancer 131, E1100–1108, 10.1002/ijc.27596 (2012).
Li, Y. et al. Potentially functional variants in the core nucleotide excision repair genes predict survival in Japanese gastric cancer patients. Carcinogenesis 35, 2031–2038, 10.1093/carcin/bgu142 (2014).
Tung, C. L. et al. Down-regulation of MSH2 expression by an Hsp90 inhibitor enhances pemetrexed-induced cytotoxicity in human non-small-cell lung cancer cells. Exp Cell Res 322, 345–354, 10.1016/j.yexcr.2014.02.002 (2014).
Ko, J. C. et al. Inhibition of p38 MAPK-dependent MutS homologue-2 (MSH2) expression by metformin enhances gefitinib-induced cytotoxicity in human squamous lung cancer cells. Lung Cancer 82, 397–406, 10.1016/j.lungcan.2013.09.011 (2013).
Zhou, J. et al. FOXM1 modulates cisplatin sensitivity by regulating EXO1 in ovarian cancer. Plos One 9, e96989, 10.1371/journal.pone.0096989 (2014).
Efimova, E. V. et al. Linking Cancer Metabolism to DNA Repair and Accelerated Senescence. Mol Cancer Res, 10.1158/1541-7786.MCR-15-0263 (2015).
Mjelle, R. et al. Cell cycle regulation of human DNA repair and chromatin remodeling genes. DNA Repair (Amst) 30, 53–67, 10.1016/j.dnarep.2015.03.007 (2015).
Srivastava, M. & Raghavan, S. C. DNA double-strand break repair inhibitors as cancer therapeutics. Chem Biol 22, 17–29, 10.1016/j.chembiol.2014.11.013 (2015).
Belzile, J. P., Choudhury, S. A., Cournoyer, D., Chow, T. Y. & Targeting, D. N. A. repair proteins: a promising avenue for cancer gene therapy. Curr Gene Ther 6, 111–123, (2006).
Hattori, H. et al. p53 shapes genome-wide and cell type-specific changes in microRNA expression during the human DNA damage response. Cell Cycle 13, 2572–2586, 10.4161/15384101.2015.942209 (2014).
Krell, J. et al. The role of TP53 in miRNA loading onto AGO2 and in remodelling the miRNA-mRNA interaction network. Lancet 385 Suppl 1, S15, 10.1016/S0140-6736(15)60330-0 (2015).
Zhan, P., Ji, Y. N. & Yu, L. K. TP53 mutation is associated with a poor outcome for patients with hepatocellular carcinoma: evidence from a meta-analysis. Hepatobiliary Surg Nutr 2, 260–265, 10.3978/j.issn.2304-3881.2013.07.06 (2013).
Chmara, M. et al. Multiple pilomatricomas with somatic CTNNB1 mutations in children with constitutive mismatch repair deficiency. Genes Chromosomes Cancer 52, 656–664, 10.1002/gcc.22061 (2013).
Yuan, R. H. et al. Opposite roles of human pancreatitis-associated protein and REG1A expression in hepatocellular carcinoma: association of pancreatitis-associated protein expression with low-stage hepatocellular carcinoma, beta-catenin mutation and favorable prognosis. Clin Cancer Res 11, 2568–2575, 10.1158/1078-0432.CCR-04-2039 (2005).
Cerami, E. et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov 2, 401–404, 10.1158/2159-8290.CD-12-0095 (2012).
Minguez, B. et al. Gene-expression signature of vascular invasion in hepatocellular carcinoma. J Hepatol 55, 1325–1331, 10.1016/j.jhep.2011.02.034 (2011).
Acknowledgements
The Project Supported by Natural Science Foundation of Zhejiang Province, China (LY16H160047) and Scientific Research Foundation of Wenzhou, Zhejiang Province, China (Y20130073, Y20140718 and Y20150307).
Author information
Authors and Affiliations
Contributions
Z.L., K.-Q.S. and M.-T.Z. performed the research; Z.L., H.-Q.W., Y.-J.C., M.S., L.Y. and S.-J.D. collected and analysed the data; K.-Q.S., Z.L. and M.-T.Z. analyzed and interpreted of data; Z.L. and S.-H.X. drafted the article; K.-Q.S. and M.-T.Z. Revised the article critically for important intellectual content; Guarantor of article: K.-Q.S. and M.-T.Z.; Final approval of the version to be submitted: K.-Q.S. and M.-T.Z.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Electronic supplementary material
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Lin, Z., Xu, SH., Wang, HQ. et al. Prognostic value of DNA repair based stratification of hepatocellular carcinoma. Sci Rep 6, 25999 (2016). https://doi.org/10.1038/srep25999
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep25999
This article is cited by
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.