Identification of molecular subtypes and prognostic signature for hepatocellular carcinoma based on genes associated with homologous recombination deficiency

Lin, Hongsheng; Xie, Yangyi; Kong, Yinzhi; Yang, Li; Li, Mingfen

doi:10.1038/s41598-021-03432-3

Download PDF

Article
Open access
Published: 15 December 2021

Identification of molecular subtypes and prognostic signature for hepatocellular carcinoma based on genes associated with homologous recombination deficiency

Hongsheng Lin^1,2,3,4^na1,
Yangyi Xie^1,5^na1,
Yinzhi Kong^1,5,
Li Yang^1,2 &
…
Mingfen Li^1,2

Scientific Reports volume 11, Article number: 24022 (2021) Cite this article

1897 Accesses
4 Citations
Metrics details

Subjects

Abstract

Hepatocellular carcinoma (HCC) is a rapidly developing digestive tract carcinoma. The prognosis of patients and side effects caused by clinical treatment should be better improved. Nonnegative matrix factorization (NMF) clustering was performed using 109 homologous recombination deficiency (HRD)-related of HCC genes from The Cancer Genome Atlas (TCGA) database. Limma was applied to analyze subtype differences. Immune scores and clinical characteristics of different subtypes were compared. An HRD signature were built with least absolute shrinkage operator (LASSO) and multivariate Cox analysis. Performance of the signature system was then assessed by Kaplan–Meier curves and receiver operating characteristic (ROC) curves. We identified two molecular subtypes (C1 and C2), with C2 showing a significantly better prognosis than C1. C1 contained 3623 differentially expressed genes. A 4-gene prognostic signature for HCC was established, and showed a high predicting accuracy in validation sets, entire TCGA data set, HCCDB18 and GSE14520 queues. Moreover, the risk score was validated as an independent prognostic marker for HCC. Our research identified two molecular subtypes of HCC, and proposed a novel scoring system for evaluating the prognosis of HCC in clinical practice.

A single-cell atlas enables mapping of homeostatic cellular shifts in the adult human breast

Article Open access 28 March 2024

Austin D. Reed, Sara Pensa, … Walid T. Khaled

Starfysh integrates spatial transcriptomic and histologic data to reveal heterogeneous tumor–immune hubs

Article Open access 21 March 2024

Siyu He, Yinuo Jin, … Elham Azizi

Hepatocellular carcinoma

Article 21 January 2021

Josep M. Llovet, Robin Kate Kelley, … Richard S. Finn

Introduction

Liver cancer is one of the most rapidly developing digestive tract tumors¹. Hepatocellular carcinoma (HCC), which accounts for 90% of all liver cancer types, is characterized by high mortality and poor prognosis².Only 5% to 15% of HCC tumors can be surgically removed after diagnosis. The first-line treatment options for the late stage is oral dosing with sorafenib, however, but it cannot effectively improve the condition of HCC³. Hence, improving the prognosis of patients and reducing side effects are currently the major problems to be solved in clinical practice.

Studies have shown that homologous recombination deficiency (HRD) is common in cancer development⁴. HRD as a functional defect in homologous recombinant DNA repair could result in permanent alteration in the genome in a specific, quantifiable pattern ("genomic scar")⁵. Studies showed that HRD occurs at different frequencies in many cancer types⁶. In cancer cells with HRD, double-stranded DNA breaks are repaired through error-prone pathways (i.e., non-homologous end ligations), leading to cell death and tumor shrinkage^7,8. Previous reports analyzed the role of HRD in a variety of cancers, and found that HRD is visibly associated with survival of patients with ovarian cancer and glioblastoma polymordia, and is also related to a poor prognosis of adrenal cortical carcinoma, squamous cell carcinoma of head and neck, clear cell carcinoma of kidney, renal papillary cell carcinoma, sarcoma and uterine corpus endometrial carcinoma⁶. However, the relationship between HRD and HCC prognosis has not been fully characterized.

As HCC has highly heterogeneous genomic aberrations and microenvironments, frequent recurrence is another challenge in the treatment of HCC². Therefore how targeted therapies to kill individual cells remains a major tackle to be resolved, which also points to the clinical importance of classifying patients into relative subtypes based on key characteristics⁹. A large-scale study identified three HCC subtypes using combined data from five platforms (DNA copy number, DNA methylation, mRNA expression, miRNA expression and RPPA) and simultaneous unsupervised clustering¹⁰. Although many genome-wide analyses of HCC have been performed, there is still a lack of hierarchical clustering analysis of HRD-associated genes to explore the prognostic characteristics of HCC.

In this study, we performed NMF clustering to classify HCC based on HRD-related genes, and investigated the relationship between subtypes, HCC immune infiltration and clinical characteristics. A risk score model was built to predict the prognosis of HCC patients. It is hoped that the current findings will provide new insights into personalized treatment of HCC.

Results

Characteristics of immune infiltration in HCC

To explore the immune cell infiltration in HCC, the relationship between prognosis and immune cells was analyzed according to MCP Counter, CIBERSORT and ssGSEA. The forest map showed that CD8 T cells, monocytic lineage, activated B cells, activated CD8 T cells, effector memory CD4 T cell, type 1 T helper cell, eosinophil, immature dendritic cell and natural killer T cell played a significant role in the prognosis of HCC (Fig. 1A-C). Analysis of the proportion of each immune cell in HCC demonstrated that T cells, Neutrophils, myeloid dendritic cells, monocytic lineage, fibroblasts and endothelial cells showed a high proportion in HCC (Fig. 1D).

Two molecular subtypes of HCC were identified by NMF clustering

From 109 HRD genes, 84 genes with strong prognostic significance for HCC were screened by univariate COX analysis. NMF clustering was performed on the 84 genes, and according to cophenetic, suspension and silhouette indicators, the optimal k was determined to be 2 (Fig. 2A,B). The two molecular subtypes were defined as C1 and C2. The OS and progress-free survival (PFS) between C1 and C2 were significantly different, and OS and PFS in C2 were significantly longer than C1 (Fig. 2C,D). The gene expression heat map of the two molecular subtypes demonstrated that prognosis-related HRD genes were high-expressed in the C1 subtype (Fig. 2E).

Identification of DEGs between subtypes and enrichment analysis

We also conducted association analysis between the typing results and published immunotyping results¹¹ from the TCGA cohort. The results indicated that C1 subtype mainly included the wound healing(C1), IFN-γ dominant (C2), inflammatory (C3) and Lymphocyte (C4) subtypes identified by Vesteinn Thorsson et al., and that C2 subtypes mainly include C2, C3 and C4 subtypes defined by Vesteinn Thorsson (Fig. 3A,B). Difference analysis (at the threshold of FDR < 0.05 and | | FC > 1.5) showed that a total of 3623 genes were differentially expressed in C1 in comparison to C2 (Supplementary Table S1). Specifically, among the 3623 genes, 3,301 were upward DEGS and 322 were downward DEGS (Fig. 4A). The heat map of top 100 DEGs were shown in Fig. 4B. GO analysis of up-regulated DEGs demonstrated that histone modification and covalent chromatin modification (biological processes, BP), condensed chromosome and spindle (cellular components, CC), nucleosome binding and damaged DNA binding (molecular function, MF) were the most significantly enriched terms (Fig. 5A). The most prominently enriched GO terms in down-regulated DEGs were regulation of protein processing and regulation of protein maturation (BP), high-density lipoprotein particle and lipoprotein particle (CC), monooxygenase activity and lipid transporter activity (MF) (Fig. 5C). KEGG analysis showed that the main enrichment pathways for upregulation of DEGs were mismatch repair, DNA replication, and homologous recombination (Fig. 5B). Supplementary S1.txt showed other up-regulated DEGs significantly enriched in GO terms and KEGG pathway in addition to Fig. 3A and B. The down-regulated DEGs were significantly enriched in metabolism-related pathways, including retinol metabolism, steroid hormone biosynthesis and drug metabolism (Fig. 5D). Supplementary S2.txt showed down-regulated DEGs significantly enriched in GO terms and KEGG pathway in addition to Fig. 3C and D mentioned. Moreover, GSEA also identified significant enrichment pathways for the C1 subtype, including mismatch repair, DNA replication, homologous recombination, cell cycle and base excision repair, while the C2 subtype was related to fatty acid metabolism, complement and coagulation cascades, retinol metabolism, drug metabolism cytochrome p450, and retinol metabolism (Fig. 5E). Therefore, the C1 subtype was more associated with tumor development, while the C2 subtype was more associated with metabolism.

Relationship between molecular subtypes, immune infiltration and clinical features

Immune score reflects immune infiltration based on lymphocyte gene expression¹². Hence, the immune score was applied to estimate the state of immune cell infiltration in HCC samples. The C1 subtype exhibited a significantly higher immune score of activated CD4 T cell, central memory CD4 T cell, effector memory CD4 T cell, memory B cell, type 2 T helper cell, activated dendritic cell, myeloid-derived suppressor cell (MDSC) and natural killer T cell (Fig. 6A). The analysis results of MCP counter demonstrated that T cells, CD8 T cells, cytotoxic lymphocytes, NK cells, B lineage, monocytic lineage, myeloid dendritic cells, neutrophils, fibroblasts obtained significantly high immune scores in C1 subtype of HCC (Fig. 6B). ESTIMATE was used to evaluate the infiltration status of immune cells in HCC subtype of TCGA. Compared with C2 subtype, the infiltration degree of activated memory T cells, CD4 follicular helper T cells, regulatory T cells (Tregs), M0 macrophages and activated mast cells in C1 subtype was significantly higher, while the infiltration degree of resting NK cells, monocytes, M2 macrophages and resting mast cells was sharply lower (Fig. 6C). These results suggested that tumors from different subtypes showed considerable variation in their immunoinfiltrative status. Clinical analysis of C1 and C2 subtypes indicated that C1 patients with a poor prognosis had a higher mortality, T staging, grade, and AJCC staging than those with C2 subtype, and there were no significant differences in gender, N stage, or M stage between the two subtypes (Fig. 6D).

Risk models were constructed based on prognosis-related HRD genes

Univariate Cox analysis identified 33 genes significantly associated with HCC prognosis (Supplementary Table S2). LASSO- Penalized Multivariate Cox analysis (Fig. 7A,B was performed on these 33 genes to establish a four-gene signature with the formula as follow:

$${\text{Risk}}\;{\text{score}} = 0.0{64}*{\text{FEN1}} + 0.{416}*{\text{HDAC2}} + 0.{149}*{\text{RECQL4}} + 0.{137}*{\text{TIPIN}}.$$

The risk score for each case in the TCGA training cohort was calculated and ranked from the lowest to the highest. The risk score was positively related to the number of death cases. The heat map demonstrated that the expression of four genes differed considerably between the high-risk group (n = 79) and the low-risk group (n = 103) (Fig. 7C). The 5-year OS of the high-risk and low-risk groups also showed significant differences (Fig. 7D). The area under the ROC curve (AUC) of 1, 3 and 5-year OS was 0.82, 0.65 and 0.65, respectively (Fig. 7E). In the TCGA validation cohort and the entire cohort, the risk scores of cases were calculated according to the risk scoring formula to divide the samples into high-risk and low-risk groups (Fig. 7F,I). The difference in prognosis between the two risk groups remained significant (Fig. 7G,J). The ROC curves of 1-year, 3-year and 5-year OS showed that the signature had a high OS sensitivity and predicting accuracy (Fig. 7H,K).

The prognostic model was validated in separate cohorts

We validated the robustness of the risk model in two independent external validation sets. In the GSE14520 cohort, 103 cases were assigned into the high-risk group, while 118 cases were in the low-risk group. In the HCCDB18 data set, 91 samples were in the high-risk group and 112 samples were in the low-risk group. This suggested that a higher risk score may indicate a higher death risk for HCC patients. The expression of four genes was also associated with increased risk scores in patients (Fig. 8A,D). In both cohorts, the high-risk group had distinctly shorter OS than the low- risk group (Fig. 8B,E). In the external validation set of GSE14520, the AUC for 1-year, 3-year, and 5-year OS predicted by the 4-gene signature was 0.72, 0.69, and 0.59, respectively (Fig. 8C). In the HCCDB18 validation cohort, the 4-gene signature predicted that 1-year, 3-year, and 5-year OS was 0.72, 0.79, and 0.72, respectively (Fig. 8F). Overall, the 4-gene signature showed a satisfactory prediction of the prognosis of HCC.

The 4-gene signature was a stand-alone prognostic factor for HCC

Analysis on the risk scores with different clinicopathological characteristics (gender, T stage, N stage, M stage, AJCC stage and grade) showed that the risk scores of patients between male and female, N0 and N1, M0 and M1 were not significantly different. However, significant differences were detected in the risk scores among T1, T2, T3 and T4, and the risk scores increased with the increase of T stages. In addition, grade and risk score also showed an increasing trend with a higher grade. The risk scores of patients with different AJCC stages also presented significant differences (Fig. 9). Stratified analysis was performed for all cases in the TCGA dataset according to age, sex, T stage, AJCC stage and grade, and we observed that the risk scores of patients calculated by the 4-gene signature were correlated with the survival time of age ≤ 65and age > 65, male or female, T1-T2 stage or T3-T4 stage, stage I-II or stage III-IV, G1-G2 or G3-G4 (Fig. 10). The results of univariate Cox and further multivariate Cox analyses validated that risk score was an independent prognostic factor for HCC (Fig. 11A,B). These results suggested that the risk score was an accurate model for predicting prognosis of patients with HCC.

Assessment of the risk models in predicting prognosis of HCC

To validate the reliability of the risk model in predicting prognosis, the risk model was compared with other previously reported signatures in HCC. 3 reports were selected through a small-scale literature search ^13,14,15. According to the HCC prognostic model reported in each literature, the risk assessment of TCGA-LIHC samples showed that the 5-year survival rate of patients with high-risk was much worse than that of patients with low-risk (Fig. 12A-C). Figure 12D showed the C-index of each model, and the model we developed had the highest C-index. In addition, the risk model built by this study had slightly better performance in predicting long-term prognosis of HCC than the other three models (Fig. 12E-G).

Discussion

HCC can adapt to high genomic stress conditions resulted from overactive DNA replication and genotoxic drug therapy, and the underlying mechanisms may involve enhanced DNA damage response/repair procedures¹⁶. Homologous recombinant DNA repair plays an important role in DNA repair. HRD has been found to be implicated in a variety of human cancers, especially in ovarian, breast, prostate, and pancreatic cancers¹⁷. In this study, we performed NMF clustering on HCC cases according to HRD-related genes, and identified two molecular subtypes of HCC, C1 and C2. C2 had longer OS and PFS in both subtypes, possibly because the C1 subtype was more associated with many oncology features, including mismatch repair, DNA replication, homologous recombination, cell cycle and base excision repair^18,19,20,21. Another important reason was the large number of T cells, such as the high degree of infiltration of activated CD4 T cell, central memory CD4 T cell, type 2 T helper cell, natural killer T cell, T cells, CD8 T cells, effector memory CD4 T cell, activated memory T cells, CD4 follicular helper T cells and regulatory T cells in the C1 subtype tissues^22,23.

Integrating various independent prognostic variables into a single formula could significantly improve prognostic prediction ability. To address the heterogeneity between and within various HCC subtypes, the use of multiple genes rather than one single gene or pathway have been applied to define the risk for HCC initiation, progression and recurrence²⁴. The joint analysis of Arjun Sarathi and Ashok Palaniappan showed that there were different stage-specific genes in different AJCC stages of HCC, specifically, they identified 2 genes specific to stage I and II, 10 specific to stage III, and 35 specific to stage IV ²⁵. Recently, a five-gene predictive signature was developed and highlighted potential prediction feasibility of recurrence of early-stage HCC ²⁶. In our work, 3623 DEGs between C1 and C2 subtypes were identified, and a risk score was constructed using univariate Cox analysis and LASSO-Cox regression analysis. Patients were ranked according to their risk score, and the number of death cases was found to be positively related to the risk score. Subgroup analysis revealed that the model was suitable for identifying and predicting HCC patients with different clinical characteristics. More importantly, our scoring model could accurately assess the prognostic risk of HCC cases in two independent external validation cohorts. We also noted that the risk scores of patients calculated according to the 4-gene signature were significantly correlated with age ≤ 65 and age > 65, male or female, T1-T2 stage or T3-T4 stage, stage I-II or stage III-IV, G1-G2 or G3-G4, indicating that the risk scoring model had a strong applicability. Additionally, the risk score was established as a stand-alone prognostic marker for HCC. All these evidence suggested a great potential of the risk scoring model for bedside application.

To conclude, study classified two molecular subtypes of HCC with different immune-infiltrating states and clinical characteristics. In addition, we established a 4-gene signature that showed high specificity and sensitivity in evaluating the prognosis of HCC and can be used as a stand-alone prognostic factor. To the best of our knowledge, our model was the first prognostic model constructed based on these four genes, and it could facilitate personalized treatment of HCC, as the signature showed a high stability and general applicability.

Methods

Organization and processing of original data

Original expression profile information and clinical data of HCC downloaded were from the TCGA-LIHC, HCCDB18 (http://lifeome.net/database/hccdb/home.html) and Gene Expression Omnibus (GEO) database. For the TCGA-LIHC dataset, batch number of each sample was obtained from UCSC Xenabrowser (https://xenabrowser.net/datapages/?dataset=TCGA-LIHC.GDC_phenotype.tsv&host=https%3A%2F%2Fgdc.xenahubs.net&removeHub=https%3A%2F%2Fxena.treehouse.gi.ucsc.edu%3A443), and the combat function of R software package SVA was used for batch effect removal. The expression of genes with multiple probes in the TCGA dataset was the median value of these probes. When a probe corresponding to multiple genes in the HCCDB18 dataset, it was removed. After processing, 365, 203 and 221 HCC samples with complete clinical data from TCGA, HCCDB18 and GSE14520 were obtained. Table 1 presents the clinicopathological statistics of the samples from the 3 datasets. 108 HRD-related genes were collected from other studies^{4,11,27,28,29,30,31,32,33,34,35,36,37}. Supplementary Fig. S1 shows the study design.

Table 1 Clinicopathological statistical information of HCC patients in the TCGA, HCCDB18, and GSE14520 datasets.

Full size table

Molecular subtypes were identified by nonnegative matrix factorization (NMF) clustering

The expression of 109 HRD genes was obtained from TCGA, and univariate COX analysis was performed with coxph function in R. The HCC samples were clustered by NMF³⁸. Specifically, the standard "Brunet" was used for 100 iterations. The number of clusters k was set to 2–10, the average contour width of the common member matrix was determined using R package "NMF". In addition, Kaplan–Meier curve and log-rank test were used for survival analysis.

Difference analysis and enrichment analysis

The differences of different molecular subtypes were analyzed by R-package Limma³⁹. Differentially expressed genes (DEGs) were analyzed using WebGestalt ⁴⁰(V0.4.2) for KEGG pathway enrichment and GO function enrichment analysis. In addition, to analyze the enrichment of different molecular subtypes in different pathways, the cp.kegg.v7.0.symbols.gmt gene set was used as the reference gene set for Gene Set Enrichment Analysis (GSEA). The pathways with P < 0.05 and false discovery rate (FDR) < 0.25 threshold were considered as significantly enriched.

Immune scores and clinical characteristics of HCC patients with different molecular subtypes

R software package single-sample gene set enrichment analysis (ssGSEA)⁴¹, MCP counter⁴², CIBERSORT⁴³ were used to measure the immune score of each sample. Clinical characteristics of different molecular subtypes, including survival status, sex, TNM stage and AJCC stage, were analyzed using ggplot2⁴⁴.

Construction and evaluation of a prognostic scoring system

The 365 samples in the TCGA dataset were grouped into a training set (n = 182) and a verification set (n = 183). Chi-square test showed no significant differences in overall survival (OS), TNM stage, clinical stage, grade, gender or age between the training set and the validation set (Table 2). In the training set, the relationship between HRD gene and HCC was determined by univariate Cox regression analysis. Least absolute correlation and selection operator (LASSO) and multivariate Cox regression analysis were performed to filter the HRD genes significantly associated with HCC prognosis and to establish a risk scoring system. Patients were grouped into high-risk and low-risk groups based on their standardized risk scores. R software package timeROC was used to generate receiver operating characteristic curve (ROC). Finally, univariate and multivariate Cox regression analyses were conducted to evaluate the independence of the prognostic model.

Table 2 TCGA training set and validation set sample information table.

Full size table

Statistical analysis

The statistical analysis in this study was performed in R software, and the differences in clinicopathological and molecular features between different subtypes were calculated by student t- test and chi-square test. P < 0.05 was seen to be statistically significant.

Data availability

The datasets generated and/or analysed during the current study are available in the [GSE14520] repository, [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE14520].

Code availability

The code for generating the risk score model discussed in this study is available at Supplementary code.R.

References

Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics, 2020. CA Cancer J. Clin. 70, 7–30. https://doi.org/10.3322/caac.21590 (2020).
Article PubMed Google Scholar
Shimada, S. et al. Comprehensive molecular and immunological characterization of hepatocellular carcinoma. EBioMedicine 40, 457–470. https://doi.org/10.1016/j.ebiom.2018.12.058 (2019).
Article PubMed Google Scholar
Anwanwan, D., Singh, S. K., Singh, S., Saikam, V. & Singh, R. Challenges in liver cancer and possible treatment approaches. Biochim. Biophys. Acta Rev. Cancer 1873, 188314. https://doi.org/10.1016/j.bbcan.2019.188314 (2020).
Article CAS PubMed Google Scholar
Hoppe, M. M., Sundar, R., Tan, D. S. P. & Jeyasekharan, A. D. Biomarkers for homologous recombination deficiency in cancer. J. Natl. Cancer Inst. 110, 704–713. https://doi.org/10.1093/jnci/djy085 (2018).
Article CAS PubMed Google Scholar
Stover, E. H., Fuh, K., Konstantinopoulos, P. A., Matulonis, U. A. & Liu, J. F. Clinical assays for assessment of homologous recombination DNA repair deficiency. Gynecol. Oncol. 159, 887–898. https://doi.org/10.1016/j.ygyno.2020.09.029 (2020).
Article CAS PubMed Google Scholar
Knijnenburg, T. A. et al. Genomic and molecular landscape of DNA damage repair deficiency across the cancer genome atlas. Cell Rep. 23, 239–254. https://doi.org/10.1016/j.celrep.2018.03.076 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ledermann, J. A. & Pujade-Lauraine, E. Olaparib as maintenance treatment for patients with platinum-sensitive relapsed ovarian cancer. Ther. Adv. Med. Oncol 11, 1758835919849753. https://doi.org/10.1177/1758835919849753 (2019).
Article CAS PubMed PubMed Central Google Scholar
Tumiati, M. et al. A functional homologous recombination assay predicts primary chemotherapy response and long-term survival in ovarian cancer patients. Clin. Cancer Res. 24, 4482–4493. https://doi.org/10.1158/1078-0432.CCR-17-3770 (2018).
Article CAS PubMed Google Scholar
Wu, Y., Liu, Z. & Xu, X. Molecular subtyping of hepatocellular carcinoma: a step toward precision medicine. Cancer Commun. (Lond) 40, 681–693. https://doi.org/10.1002/cac2.12115 (2020).
Article Google Scholar
Wheeler, D. A., Roberts, L. R., & Cancer Genome Atlas Research Network. Comprehensive and integrative genomic characterization of hepatocellular carcinoma. Cell 169, 1327–1341, https://doi.org/10.1016/j.cell.2017.05.046 (2017).
Thorsson, V. et al. The immune landscape of cancer. Immunity 48, 812–830. https://doi.org/10.1016/j.immuni.2018.03.023 (2018).
Article CAS PubMed PubMed Central Google Scholar
Pan, H. et al. Immunological analyses reveal an immune subtype of uveal melanoma with a poor prognosis. Aging (Albany NY) 12, 1446–1464. https://doi.org/10.18632/aging.102693 (2020).
Article CAS Google Scholar
Liu, G. M., Zeng, H. D., Zhang, C. Y. & Xu, J. W. Identification of a six-gene signature predicting overall survival for hepatocellular carcinoma. Cancer Cell Int. 19, 138. https://doi.org/10.1186/s12935-019-0858-2 (2019).
Article CAS PubMed PubMed Central Google Scholar
Ke, K. et al. Evaluation and prediction of hepatocellular carcinoma prognosis based on molecular classification. Cancer Manag. Res. 10, 5291–5302. https://doi.org/10.2147/CMAR.S178579 (2018).
Article CAS PubMed PubMed Central Google Scholar
Zheng, Y. et al. Large-scale analysis reveals a novel risk score to predict overall survival in hepatocellular carcinoma. Cancer Manag. Res. 10, 6079–6096. https://doi.org/10.2147/CMAR.S181396 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, J. et al. Nucleostemin modulates outcomes of hepatocellular carcinoma via a tumor adaptive mechanism to genomic stress. Mol. Cancer Res. 18, 723–734. https://doi.org/10.1158/1541-7786.MCR-19-0777 (2020).
Article CAS PubMed PubMed Central Google Scholar
Oza, J. et al. Homologous recombination repair deficiency as a therapeutic target in sarcoma. Semin. Oncol. 47, 380–389. https://doi.org/10.1053/j.seminoncol.2020.10.002 (2020).
Article CAS PubMed Google Scholar
Ijsselsteijn, R., Jansen, J. G. & de Wind, N. DNA mismatch repair-dependent DNA damage responses and cancer. DNA Repair Amst 93, 102923. https://doi.org/10.1016/j.dnarep.2020.102923 (2020).
Article CAS PubMed Google Scholar
Tubbs, A. & Nussenzweig, A. Endogenous DNA damage as a source of genomic instability in cancer. Cell 168, 644–656. https://doi.org/10.1016/j.cell.2017.01.002 (2017).
Article CAS PubMed PubMed Central Google Scholar
Icard, P., Fournel, L., Wu, Z., Alifano, M. & Lincet, H. Interconnection between metabolism and cell cycle in cancer. Trends Biochem Sci 44, 490–501. https://doi.org/10.1016/j.tibs.2018.12.007 (2019).
Article CAS PubMed Google Scholar
Wallace, S. S., Murphy, D. L. & Sweasy, J. B. Base excision repair and cancer. Cancer Lett 327, 73–89. https://doi.org/10.1016/j.canlet.2011.12.038 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zheng, C. et al. Landscape of infiltrating T cells in liver cancer revealed by single-cell sequencing. Cell 169, 1342–1356. https://doi.org/10.1016/j.cell.2017.05.035 (2017).
Article CAS PubMed Google Scholar
Tanaka, A. & Sakaguchi, S. Targeting Treg cells in cancer immunotherapy. Eur. J. Immunol. 49, 1140–1146. https://doi.org/10.1002/eji.201847659 (2019).
Article CAS PubMed Google Scholar
Tu, T. et al. Novel aspects of the liver microenvironment in hepatocellular carcinoma pathogenesis and development. Int. J. Mol. Sci. 15, 9422–9458. https://doi.org/10.3390/ijms15069422 (2014).
Article CAS PubMed PubMed Central Google Scholar
Sarathi, A. & Palaniappan, A. Novel significant stage-specific differentially expressed genes in hepatocellular carcinoma. BMC Cancer 19, 663. https://doi.org/10.1186/s12885-019-5838-3 (2019).
Article CAS PubMed PubMed Central Google Scholar
Wang, Z. et al. A five-gene signature for recurrence prediction of hepatocellular carcinoma patients. Biomed. Res. Int. 2020, 4037639. https://doi.org/10.1155/2020/4037639 (2020).
Article CAS PubMed PubMed Central Google Scholar
Cancer Genome Atlas Research, N. Integrated genomic analyses of ovarian carcinoma. Nature 474, 609–615, doi:https://doi.org/10.1038/nature10166 (2011).
Coleman, R. L. et al. Rucaparib maintenance treatment for recurrent ovarian carcinoma after response to platinum therapy (ARIEL3): a randomised, double-blind, placebo-controlled, phase 3 trial. Lancet 390, 1949–1961. https://doi.org/10.1016/S0140-6736(17)32440-6 (2017).
Article CAS PubMed PubMed Central Google Scholar
Foulkes, W. D., Knoppers, B. M. & Turnbull, C. Population genetic testing for cancer susceptibility: founder mutations to genomes. Nat. Rev. Clin. Oncol. 13, 41–54. https://doi.org/10.1038/nrclinonc.2015.173 (2016).
Article CAS PubMed Google Scholar
Konstantinopoulos, P. A., Ceccaldi, R., Shapiro, G. I. & D’Andrea, A. D. Homologous recombination deficiency: exploiting the fundamental vulnerability of ovarian cancer. Cancer Discov. 5, 1137–1154. https://doi.org/10.1158/2159-8290.CD-15-0714 (2015).
Article CAS PubMed PubMed Central Google Scholar
Mateo, J. et al. DNA-repair defects and olaparib in metastatic prostate cancer. N. Engl. J. Med. 373, 1697–1708. https://doi.org/10.1056/NEJMoa1506859 (2015).
Article CAS PubMed PubMed Central Google Scholar
Park, W. et al. Genomic methods identify homologous recombination deficiency in pancreas adenocarcinoma and optimize treatment selection. Clin. Cancer Res. 26, 3239–3247. https://doi.org/10.1158/1078-0432.ccr-20-0418 (2020).
Article CAS PubMed PubMed Central Google Scholar
Park, W. et al. Homologous recombination deficiency (HRD): a biomarker for first-line (1L) platinum in advanced pancreatic ductal adenocarcinoma (PDAC). J. Clin. Oncol. 37, 4132–4132. https://doi.org/10.1200/JCO.2019.37.15_suppl.4132 (2019).
Article Google Scholar
Riaz, N. et al. Pan-cancer analysis of bi-allelic alterations in homologous recombination DNA repair genes. Nat. Commun. 8, 857. https://doi.org/10.1038/s41467-017-00921-w (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Sharma, P. et al. Impact of homologous recombination deficiency biomarkers on outcomes in patients with triple-negative breast cancer treated with adjuvant doxorubicin and cyclophosphamide (SWOG S9313). Ann. Oncol. 29, 654–660. https://doi.org/10.1093/annonc/mdx821 (2018).
Article CAS PubMed Google Scholar
Swisher, E. M. et al. Rucaparib in relapsed, platinum-sensitive high-grade ovarian carcinoma (ARIEL2 Part 1): an international, multicentre, open-label, phase 2 trial. Lancet Oncol. 18, 75–87. https://doi.org/10.1016/S1470-2045(16)30559-9 (2017).
Article CAS PubMed Google Scholar
Vanderstichele, A., Busschaert, P., Olbrecht, S., Lambrechts, D. & Vergote, I. Genomic signatures as predictive biomarkers of homologous recombination deficiency in ovarian cancer. Eur. J. Cancer 86, 5–14. https://doi.org/10.1016/j.ejca.2017.08.029 (2017).
Article CAS PubMed Google Scholar
Ang, A. M. S. & Gillis, N. Accelerating nonnegative matrix factorization algorithms using extrapolation. Neural Comput. 31, 417–439. https://doi.org/10.1162/neco_a_01157 (2019).
Article MathSciNet PubMed MATH Google Scholar
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47. https://doi.org/10.1093/nar/gkv007 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wang, J., Vasaikar, S., Shi, Z., Greer, M. & Zhang, B. WebGestalt 2017: a more comprehensive, powerful, flexible and interactive gene set enrichment analysis toolkit. Nucleic Acids Res. 45, W130–W137. https://doi.org/10.1093/nar/gkx356 (2017).
Article CAS PubMed PubMed Central Google Scholar
Finotello, F. & Trajanoski, Z. Quantifying tumor-infiltrating immune cells from transcriptomics data. Cancer Immunol. Immunother. 67, 1031–1040. https://doi.org/10.1007/s00262-018-2150-z (2018).
Article CAS PubMed PubMed Central Google Scholar
Becht, E. et al. Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression. Genome Biol. 17, 218. https://doi.org/10.1186/s13059-016-1070-5 (2016).
Article CAS PubMed PubMed Central Google Scholar
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12, 453–457. https://doi.org/10.1038/nmeth.3337 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ito, K. & Murphy, D. Application of ggplot2 to Pharmacometric Graphics. CPT Pharmacometr. Syst. Pharmacol. 2, e79. https://doi.org/10.1038/psp.2013.56 (2013).
Article CAS Google Scholar

Download references

Funding

This work was supported by National Natural Science Foundation of China [No.81960840] and Guangxi Nature Science Foundation [No. 2018GXNSFBA281092].

Author information

These authors contributed equally: Hongsheng Lin and Yangyi Xie.

Authors and Affiliations

Guangxi University of Chinese Medicine, Nanning, 530200, China
Hongsheng Lin, Yangyi Xie, Yinzhi Kong, Li Yang & Mingfen Li
Department of Laboratory, The First Affiliated Hospital of Guangxi University of Chinese Medicine, Nanning, 530023, China
Hongsheng Lin, Li Yang & Mingfen Li
Guangxi Medical University, Nanning, 530021, China
Hongsheng Lin
Department of Microbiology, School of Basic Medical Sciences, Guangxi Medical University, Nanning, 530021, China
Hongsheng Lin
The First Clinical Faculty of Guangxi University of Chinese Medicine, Nanning, 530200, China
Yangyi Xie & Yinzhi Kong

Authors

Hongsheng Lin
View author publications
You can also search for this author in PubMed Google Scholar
Yangyi Xie
View author publications
You can also search for this author in PubMed Google Scholar
Yinzhi Kong
View author publications
You can also search for this author in PubMed Google Scholar
Li Yang
View author publications
You can also search for this author in PubMed Google Scholar
Mingfen Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.S.L. and M.F.L. designed the study. Y.Y.X. and Y.Z.K. analyzed and interpreted the data. L.Y. wrote the initial draft of the manuscript. M.F.L. reviewed and edited the manuscript. All authors read and approved the manuscript.

Corresponding author

Correspondence to Mingfen Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Figure S1.

Supplementary Information 4.

Supplementary Information 5.

Supplementary Table S1.

Supplementary Table S2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lin, H., Xie, Y., Kong, Y. et al. Identification of molecular subtypes and prognostic signature for hepatocellular carcinoma based on genes associated with homologous recombination deficiency. Sci Rep 11, 24022 (2021). https://doi.org/10.1038/s41598-021-03432-3

Download citation

Received: 20 July 2021
Accepted: 03 December 2021
Published: 15 December 2021
DOI: https://doi.org/10.1038/s41598-021-03432-3

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.