Bioinformatic identification of genomic instability-associated lncRNAs signatures for improving the clinical outcome of cervical cancer by a prognostic model

Zhang, Jian; Ding, Nan; He, Yongxing; Tao, Chengbin; Liang, Zhongzhen; Xin, Wenhu; Zhang, Qianyun; Wang, Fang

doi:10.1038/s41598-021-00384-6

Open access
Published: 22 October 2021

Scientific Reports volume 11, Article number: 20929 (2021)

Abstract

This study analyzed the connection between genomic instability-associated long non-coding RNAs (lncRNAs) and the prognosis of cervical cancer patients. We built a prognostic model and explored the features of the different risk groups. Clinical datasets and gene expression profiles of 307 patients were downloaded from The Cancer Genome Atlas database. We established a prognostic model that combined somatic mutation profiles and lncRNA expression profiles in a tumor genome and identified 35 genomic instability-associated lncRNAs in cervical cancer as a case study. We then stratified patients into low-risk and high-risk groups, and the stratification was further checked in multiple independent patient cohorts. Patients were separated into two sets: the training set and the testing set. The prognostic model was built using three genomic instability-associated lncRNAs (AC107464.2, MIR100HG, and AP001527.2). Patients in the training set were divided into a high-risk group with shorter overall survival and a low-risk group with longer overall survival (p < 0.001); comparable results were found in the testing set (p = 0.046) and the whole set (p < 0.001). Significant differences were also found among patients grouped by histological grade, FIGO stage, and age (p < 0.05). The prognostic model based on genomic instability-associated lncRNAs could predict the prognosis of cervical cancer patients, paving the way for further research into the function and resource of lncRNAs and offering a key approach to customizing individual care decision-making.

Introduction

Cervical cancer (CC) is a major cause of cancer mortality among women worldwide and ranks fourth among the most commonly diagnosed cancers. Patients with early CC have been screened with ThinPrep cytologic tests (TCT) and given human papillomavirus (HPV) vaccines, but mortality rose by 19% between 2007 and 2017¹. Particularly in developing countries, the long-term survival and prognosis of patients with advanced-stage CC remain poor. Patient features (such as age, high-risk HPV infection, and cancer grade) are already used to evaluate recurrence or progression in patients with CC. CC is considered a complex, clinically heterogeneous cancer. Surgery, radiotherapy, and chemotherapy are often used for CC, but such treatments do not necessarily work². Therefore, there is evident interest in finding new bioinformatic markers and novel therapeutic targets that can reliably predict the clinical outcomes of CC.

Genomic instability, marked by an increased incidence of gene disruption and loss of genomic integrity, is an established feature of tumorigenesis³. More importantly, genomic instability correlates with tumor development and survival and serves as a prognostic factor^4,5,6. Although the mechanisms that disrupt genomic stability remain uncertain, numerous studies have confirmed that long noncoding RNAs (lncRNAs) are functional in this process^3,7,8,9.

In this study, we established a computational model that integrates lncRNA expression profiles and somatic mutation profiles in a tumor genome to better explore the lncRNA signature as an indicator of genomic stability in CC, which might help improve its prognostic utility.

Materials and methods

Data collection

The data were collected from The Cancer Genome Atlas (TCGA) database and included clinical features, transcriptome profiling data, and somatic mutation information of CC patients. For 307 female samples, the Fragments Per Kilobase of transcript per Million mapped reads (FPKM) values of the lncRNA and mRNA expression profiles were paired with somatic mutation data and clinical survival data for further analysis and validation. Data are deposited in the TCGA database (https://portal.gdc.cancer.gov/repository).

The training set was used to identify the prognostic lncRNA signature and build a prognostic risk model. The testing set was used to validate the efficiency of the prognostic risk model independently. In addition, somatic mutation information and the corresponding lncRNA expression data of 294 CC patients were downloaded from the TCGA database. The clinical and pathological characteristics are summarized in Table 1.

Table 1 Clinical information for the three cervical cancer patient sets in this study.

Identification of genomic instability-associated lncRNAs

Briefly, we followed the method of Bao et al. 2019 to identify genomic instability-associated lncRNAs using a mutator hypothesis-derived computational model¹⁰. The model combines lncRNA expression profiles and somatic mutation profiles in a tumor genome to screen for lncRNAs significantly associated with genomic instability (Fig. 1): (1) the cumulative number of somatic mutations was computed for each patient and ranked in decreasing order; (2) the top 25% of patients were defined as the genomic unstable (GU)-like group, and the last 25% were defined as the genomically stable (GS)-like group; (3) expression profiles of lncRNAs in the GU-like group and the GS-like group were compared using the significance analysis of microarrays (SAM) method; (4) differentially expressed lncRNAs (|log fold change| > 0.3 and false discovery rate (FDR) adjusted p < 0.05) were defined as genomic instability-associated lncRNAs¹¹.
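Steps (1), (2), and the fold-change part of step (4) can be sketched in a few lines. The following is a minimal Python illustration, not the authors' R code: the function names, the rounding rule for the 25% group size, the base-2 logarithm, and the pseudocount are all assumptions, and the SAM test with FDR adjustment from steps (3) and (4) is not modeled.

```python
import math

def split_gu_gs(mutation_counts, fraction=0.25):
    """Rank patients by cumulative somatic mutation count (descending) and
    return the top `fraction` (GU-like) and bottom `fraction` (GS-like)."""
    ranked = sorted(mutation_counts, key=mutation_counts.get, reverse=True)
    k = int(len(ranked) * fraction)  # group size; the rounding rule is an assumption
    if k == 0:
        return [], []
    return ranked[:k], ranked[-k:]

def log2_fold_change(expression, group_a, group_b, pseudocount=1.0):
    """log2 ratio of mean expression in group_a over group_b for one lncRNA."""
    mean_a = sum(expression[p] for p in group_a) / len(group_a)
    mean_b = sum(expression[p] for p in group_b) / len(group_b)
    return math.log2((mean_a + pseudocount) / (mean_b + pseudocount))

def passes_fold_change_filter(log_fc, threshold=0.3):
    """Fold-change criterion only; the FDR-adjusted p-value criterion is omitted."""
    return abs(log_fc) > threshold
```

A lncRNA would be kept only if it passes this filter and also meets the FDR-adjusted p < 0.05 criterion from a separate differential expression test.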

Establishment of the prognostic model and validation

For the construction of the prognostic model, CC patients with overall survival of < 30 days were excluded. To select prognostic genes, we applied univariate Cox regression analysis with the R package survival (https://github.com/therneau/survival) at a cut-off of p < 0.05. The whole data set was randomly separated into the training set and the testing set using the R package caret (https://github.com/topepo/caret).
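The exclusion and splitting steps above can be illustrated as follows. This is a hedged Python sketch with hypothetical names (`exclude_short_survival`, `random_split`, the `os_days` field); it uses a plain seeded shuffle, whereas the R package caret may stratify its partitions, so it is a stand-in rather than a reproduction of the authors' split.

```python
import random

def exclude_short_survival(patients, min_days=30):
    """Keep only patients whose overall survival is at least `min_days`."""
    return [p for p in patients if p["os_days"] >= min_days]

def random_split(patients, train_fraction=0.5, seed=0):
    """Shuffle with a fixed seed and cut into (training, testing) lists."""
    rng = random.Random(seed)          # fixed seed keeps the split reproducible
    shuffled = list(patients)          # copy so the caller's list is untouched
    rng.shuffle(shuffled)
    n_train = int(len(shuffled) * train_fraction)
    return shuffled[:n_train], shuffled[n_train:]
```

Fixing the seed matters here because the median cut-off and the Cox coefficients derived later depend on which patients land in the training set.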

We evaluated outcome prediction by using a lncRNA signature (LncSig) formula as follows: \(\mathrm{LncSig}(\text{patient}) = \sum_{i = 1}^{n} \mathrm{coef}(\mathrm{lncRNA}_{i}) \times \mathrm{expr}(\mathrm{lncRNA}_{i})\). LncSig (patient) represents the prognostic risk score of a patient, and expr (lncRNA_i) is the expression level of the ith prognostic lncRNA for that patient. coef (lncRNA_i) represents the regression coefficient of the ith prognostic lncRNA and was calculated by multivariate Cox analysis. Cox regression and stratified analysis were used to evaluate the link between the LncSig and some important clinical factors. We determined the risk score for each patient based on the expression of the outcome-related genes, the prognosis model coefficients, and the patient's survival status. We calculated the hazard ratio (HR) and 95% confidence interval (CI) by Cox analysis. The samples were then assigned to the low-risk or high-risk group using the median risk score as the cut-off. The R package survivalROC and time-dependent receiver operating characteristic (timeROC) curves were used to evaluate the prognostic performance of the LncSig model. All statistical analyses were carried out using R version 4.0.2 (https://www.R-project.org).

Functional enrichment analysis

The functional enrichment analysis was conducted using the R package clusterProfiler. We used Pearson correlation to determine 15 co-expressed lncRNA-associated mRNA partners and thereby link the expression of paired lncRNAs and protein-coding genes (PCGs) in CC. To improve the reliability and credibility of the results, we applied Gene Ontology (GO) enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis to the co-expressed lncRNA-associated mRNA partners to further explore the potential functions and molecular mechanisms of the lncRNAs, using thresholds of FDR < 0.05 and p < 0.05.
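The co-expression step above can be sketched as a ranking of mRNAs by their Pearson correlation with one lncRNA. This Python sketch is illustrative only: `top_coexpressed` is a hypothetical helper, `top_k=15` mirrors the 15 mRNA partners mentioned in the text, and ranking by absolute correlation is an assumption about how partners were chosen.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # a constant profile carries no correlation signal
    return cov / (sd_x * sd_y)

def top_coexpressed(lncrna_expr, mrna_expr, top_k=15):
    """Return the `top_k` (gene, r) pairs with the largest |r| to the lncRNA.

    `lncrna_expr` is a list of expression values across patients;
    `mrna_expr` maps gene name -> list of values in the same patient order.
    """
    scored = [(gene, pearson(lncrna_expr, values)) for gene, values in mrna_expr.items()]
    scored.sort(key=lambda item: abs(item[1]), reverse=True)
    return scored[:top_k]
```

The resulting gene list is what would then be passed to GO and KEGG enrichment.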

Results

Identification of genomic instability-related lncRNAs in cervical cancer patients

We collected 309 samples (306 tumor and 3 adjacent tissues) from the TCGA database to analyze differences in gene expression between tumor and adjacent samples, and then identified the lncRNAs related to genomic instability in CC patients. The cumulative number of somatic mutations per patient was computed and ranked in decreasing order; the top 25% (n = 73) and the last 25% (n = 74) of patients were assigned to the GU-like group and the GS-like group, respectively. Based on the SAM approach, 35 lncRNAs were differentially expressed (|log fold change| > 0.3 and FDR-adjusted p < 0.05). We performed hierarchical clustering analysis on the 147 samples of the whole set using these 35 differentially expressed lncRNAs, which clustered the samples into GU-like and GS-like groups according to their expression levels (9 lncRNAs were upregulated and 26 were downregulated in the GU-like group; R-package: limma, sparcl and pheatmap, Fig. 2A). The median cumulative number of somatic mutations differed significantly between the GU-like group (57.3) and the GS-like group (42.7) (p < 0.001, Mann–Whitney U test; R-package: limma and ggpubr, Fig. 2B). We next compared the expression levels of KRAS, PIK3CA, ARID1A, and UBQLN4 (a set of newly discovered drivers of genomic instability) between the GS-like group and the GU-like group^12,13. Expression of these genes was higher in the GU-like group than in the GS-like group (p < 0.05, Mann–Whitney U test; R-package: limma and ggpubr, Fig. 2C).
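The group comparisons above rely on the Mann–Whitney U test. As a rough illustration of what that test computes, here is a Python sketch with hypothetical function names; the p-value uses a normal approximation without tie or continuity correction, which is a simplification and will differ slightly from the values R reports.

```python
import math

def average_ranks(values):
    """1-based ranks of `values`, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def mann_whitney_u(x, y):
    """Return (U, two-sided p) where U is the smaller of the two U statistics."""
    n1, n2 = len(x), len(y)
    ranks = average_ranks(list(x) + list(y))
    u1 = sum(ranks[:n1]) - n1 * (n1 + 1) / 2
    u = min(u1, n1 * n2 - u1)
    mean_u = n1 * n2 / 2
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)  # no tie correction (assumption)
    z = (u - mean_u) / sd_u
    p = math.erfc(abs(z) / math.sqrt(2))            # two-sided normal approximation
    return u, min(p, 1.0)
```

Called with the mutation counts of the GU-like and GS-like groups, a small U with a small p would indicate that one group tends to carry more mutations than the other.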

We performed functional enrichment analysis to predict possible roles and pathways and to further understand the relationship between the expression of the 35 differentially expressed lncRNAs and PCGs. We calculated the expression correlation between the 35 lncRNAs and PCGs to find lncRNA-correlated PCGs. A lncRNA–mRNA co-expression network was built with 35 nodes, each containing 1 lncRNA and 15 mRNAs, in which related lncRNAs and mRNAs are connected (R-package: limma and igraph, Table 2, Fig. 2D). GO analysis of the lncRNA-correlated PCGs showed that mRNAs in this network were substantially linked with genomic instability, including rRNA catabolic process, deoxyribonucleotide catabolic process, and transcriptionally active chromatin (R-package: clusterProfiler, org.Hs.eg.db, enrichplot and ggplot2, Fig. 2E). KEGG pathway analysis identified 15 highly enriched pathways, several of which were associated with transcriptional misregulation in cancer (Fig. 2E). While analyzing the 35 differentially expressed lncRNAs, we found that their altered expression might affect transcription-related genes, which may in turn disturb genomic stability in CC cells (Table 2). Changes in the cell microenvironment can compromise normal gene damage repair and thereby promote genomic instability, as can changes in the molecular and metabolic functions of the lncRNA-related PCG regulatory network. Taken together, these findings indicate that the 35 differentially expressed lncRNAs are potential genomic instability-associated lncRNAs (GIlncRNAs).

Table 2 Differentially expressed lncRNAs and related mRNAs.

Establishing and validating the 3-lncRNA-based prognostic signature in the training set

The prognostic model was constructed from a group of 304 patients with a survival duration of more than 1 month, together with CC-related genes. The R package caret was used to randomly separate the whole data set into a training set (n = 152) and a testing set (n = 152). The baseline features are summarized in Table 1. The clinical parameters did not differ significantly between the training set and the testing set. Univariate Cox proportional hazards regression analysis of the 35 genomic instability-associated lncRNAs was then used to establish a prognostic signature of 5 candidate lncRNAs (R-package: survival, caret, glmnet, survminer and timeROC, Fig. 3A). After analyzing the training set using the Cox model, we found 3 of the 5 candidate lncRNAs (AP001527.2, AC107464.2, and MIR100HG) to be independent prognostic lncRNAs (p < 0.05). The genomic instability-derived lncRNA signature (LncSig) was constructed as follows: LncSig score = (− 1.4997 × expression level of AC107464.2) + (0.3111 × expression level of MIR100HG) + (0.0802 × expression level of AP001527.2). In this LncSig score, the positive coef of AP001527.2 and MIR100HG suggested that they might be risk factors for a poor prognosis, while the negative coef of AC107464.2 indicated that it could be a protective factor for survival.
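The LncSig score above is a weighted sum, so it is straightforward to compute once expression values are available. The Python sketch below uses the three coefficients reported in the text; the function names, the expression values in the usage note, and the choice to assign scores exactly equal to the cut-off to the low-risk group are assumptions for illustration.

```python
# Coefficients as reported for the LncSig score in the article.
LNCSIG_COEFFICIENTS = {
    "AC107464.2": -1.4997,
    "MIR100HG": 0.3111,
    "AP001527.2": 0.0802,
}

def lncsig_score(expression, coefficients=LNCSIG_COEFFICIENTS):
    """Weighted sum of lncRNA expression levels for one patient.

    `expression` maps lncRNA name -> expression level, in the same units
    that were used to fit the coefficients.
    """
    return sum(coef * expression[name] for name, coef in coefficients.items())

def risk_group(score, cutoff):
    """Label a patient by comparing the score with a cut-off such as the
    training-set median; ties go to low-risk here (an assumption)."""
    return "high-risk" if score > cutoff else "low-risk"
```

For a hypothetical patient with expression levels 0.5 (AC107464.2), 4.0 (MIR100HG), and 10.0 (AP001527.2), the score is −0.74985 + 1.2444 + 0.802 = 1.29655, which lies above the reported training-set median of 1.1467 and would therefore be labelled high-risk.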

The median risk score (1.1467) was used to divide the training set into high-risk and low-risk groups based on the LncSig. Kaplan–Meier analysis showed that survival outcomes were significantly better in the low-risk group than in the high-risk group (median survival 1.633 years versus 1.323 years, p < 0.001, log-rank test; R-package: survival and survminer, Fig. 3B). The survival rate at 3 years was 13.8% in the high-risk group and 17.1% in the low-risk group. Time-dependent ROC curve analysis of the LncSig yielded an area under the curve (AUC) of 0.783 at 3 years (R-package: survival, survminer and timeROC, Fig. 3C). We then observed how the somatic mutation count and the expression level of KRAS changed as the LncSig score increased. In the high score group, expression of the risk factors (AP001527.2 and MIR100HG) was upregulated and expression of the protective factor (AC107464.2) was downregulated. Conversely, the low score group showed the opposite expression pattern for the 3 lncRNAs (R-package: limma and pheatmap, Fig. 3D). Compared with the low-risk group, the somatic mutation count was reported to be greater in the high-risk group, although the difference did not reach statistical significance (median 166.5 versus 177, p = 0.077, Mann–Whitney U test, R-package: limma and ggpubr, Fig. 3E). The expression levels of newly identified drivers of genomic instability (KRAS, PIK3CA, ARID1A, and UBQLN4) were analyzed; KRAS expression was significantly higher in the high-risk group than in the low-risk group (median 7.221 versus 7.036, p = 0.04, Mann–Whitney U test, Fig. 3F). The other drivers showed no significant differences.
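The survival rates quoted above come from Kaplan–Meier analysis, which the authors ran with the R packages survival and survminer. To show what the estimator does, here is a small Python sketch of the product-limit calculation; the function names are hypothetical, and the log-rank test used for the group comparison is not included.

```python
def kaplan_meier(times, events):
    """Product-limit survival estimate.

    `times` are follow-up times; `events` are 1 for death and 0 for censoring.
    Returns [(time, survival_probability)] at each distinct death time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Gather everyone whose follow-up ends at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving  # deaths and censored patients both leave the risk set
    return curve

def survival_at(curve, horizon):
    """Survival probability at `horizon`, read off the step function."""
    prob = 1.0
    for t, s in curve:
        if t > horizon:
            break
        prob = s
    return prob
```

Running `kaplan_meier` separately on the high-risk and low-risk groups and reading `survival_at(curve, 3.0)` from each would give 3-year survival rates of the kind reported in the text, with times expressed in years.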

Independent validation of LncSig in the testing set and whole set

To examine the applicability of the LncSig, its prognostic performance was tested in the testing set (152 patients). Applying the median risk score of the training set (1.1467), the 152 patients of the testing set were assigned to the high-risk group (n = 90) and the low-risk group (n = 62). Kaplan–Meier analysis showed that survival outcomes were significantly better in the low-risk group than in the high-risk group (median survival 1.737 years versus 1.611 years, p = 0.046, log-rank test; Fig. 4A). The survival rate of the high-risk group was 12.5% at 5 years and that of the low-risk group was 13.8% in the training set. Validation in the whole set gave results consistent with those above. The patients of the whole set were categorized into the high-risk group (n = 166) and the low-risk group (n = 138), and median survival was longer in the low-risk group than in the high-risk group (1.701 years versus 1.485 years, p < 0.001, log-rank test; Fig. 4B). At 5 years, the survival rate was 13.8% in the high-risk group, below the 14.8% in the low-risk group.

Time-dependent ROC curve analysis of the LncSig in the testing set yielded an AUC of 0.663 at 3 years (Fig. 4C). Consistent results were observed in the whole set, with an AUC of 0.687 at 3 years (Fig. 4D).
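To give an intuition for an AUC at a fixed horizon such as 3 years, the Python sketch below compares risk scores of patients who died by the horizon with those of patients still followed beyond it. It is a deliberately naive stand-in: patients censored before the horizon are simply dropped, whereas the R package timeROC used by the authors corrects for censoring, so the two will not give identical numbers. The function name is hypothetical.

```python
def auc_at_horizon(scores, times, events, horizon):
    """Naive AUC at `horizon`.

    Cases: patients with an event (events == 1) at or before `horizon`.
    Controls: patients whose follow-up exceeds `horizon`.
    Patients censored before `horizon` are excluded (a simplification).
    """
    cases = [s for s, t, e in zip(scores, times, events) if e and t <= horizon]
    controls = [s for s, t in zip(scores, times) if t > horizon]
    if not cases or not controls:
        raise ValueError("need at least one case and one control at this horizon")
    concordant = 0.0
    for case_score in cases:
        for control_score in controls:
            if case_score > control_score:
                concordant += 1.0   # higher risk score for the patient who died
            elif case_score == control_score:
                concordant += 0.5   # ties count as half
    return concordant / (len(cases) * len(controls))
```

A value of 0.5 corresponds to a score that ranks cases and controls no better than chance, and 1.0 to a score that always ranks the patient who died above the survivor.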

We examined how the somatic mutation count and KRAS expression changed with increasing LncSig score in the testing set and the whole set. The distributions of somatic mutation count and KRAS expression in the testing set and the whole set are illustrated in Fig. 4E,F. The results in both sets were consistent with those of the training set. In the testing set, the somatic mutation count was higher in the high-risk group than in the low-risk group, although the difference was not statistically significant (median 158 versus 146, p = 0.41). KRAS expression was likewise higher in the high-risk group than in the low-risk group without reaching significance (median 7.469 versus 7.212, p = 0.44, p = 0.084, Mann–Whitney U test; Fig. 4G). In the whole set, the somatic mutation count was again higher in the high-risk group than in the low-risk group, but not significantly so (median 149 versus 146, p = 0.31). KRAS expression in the high-risk group was also higher than in the low-risk group, again without reaching significance (median 7.615 versus 7.605, p = 0.22, Mann–Whitney U test; Fig. 4H).

Validation of the LncSig model in different clinical groups

To observe whether the LncSig model was suitable for different clinical groups of patients, we performed multivariate Cox regression analyses on age, histological grade, and FIGO stage. The clinical information for the 3 CC patient sets showed no significant difference in age, histological grade, FIGO stage, tumor TNM stage, or vital status between the testing set and the training set (p > 0.05, Chi-square test, Table 1). Stratification analysis was performed to determine whether the LncSig had prognostic value independent of age, histological grade, and FIGO stage. Patients in the whole set were stratified into a younger group (n = 154) and an older group (n = 150) according to the median age (46 years). Patients in each age group were further divided into high-risk and low-risk groups using the LncSig model. Kaplan–Meier analysis showed a significant difference in overall survival between the high-risk and low-risk groups in the younger group (p = 0.035, Fig. 5A). There was also a significant difference in the older group (p < 0.001, Fig. 5B). Patients in the whole set were then stratified into a well-to-moderately differentiated group (histological grade 1–2, n = 153) and a poorly or undifferentiated group (histological grade 3, n = 118). The LncSig model further classified patients in each grade group into high-risk and low-risk groups. There was a significant difference between the high-risk and low-risk groups in the well-to-moderately differentiated group (p = 0.014, Fig. 5C). There was also a significant difference in the poorly or undifferentiated group (p = 0.008, Fig. 5D). Finally, according to FIGO stage and treatment method, patients in the whole set were stratified into an earlier stage group (FIGO stage I–IIA, n = 188) and a later stage group (FIGO stage IIB–IVB, n = 109)¹⁴. The LncSig model further classified patients in each stage group into high-risk and low-risk groups. There was a significant difference between the high-risk and low-risk groups in the earlier stage group (p = 0.001, Fig. 5E). There was also a significant difference in the later stage group (p = 0.017, Fig. 5F). The results suggested that the LncSig model was an independent prognostic factor for overall survival in CC patients.

The predictive performance of the LncSig model is greater than that of KRAS mutation status

To further verify the reliability of the LncSig model, we compared it with KRAS mutation status. Samples were classified into a wild-type group and a mutation group according to KRAS mutation status. We further classified the mutation group by somatic mutation count into two subgroups, GU-like and GS-like, and the wild-type group was classified in the same way. As shown in Fig. 6A, this produced four groups: KRAS Mutation/GS-like, KRAS Mutation/GU-like, KRAS Wild/GS-like, and KRAS Wild/GU-like. Overall survival was lower with KRAS mutation than with KRAS wild type (R-package: survival and survminer). KRAS Mutation/GU-like patients had marginally shorter survival than those with KRAS wild type (p = 0.067, log-rank test). The KRAS mutation and wild-type samples were then divided by the LncSig into high-risk and low-risk groups. As shown in Fig. 6B, KRAS Mutation/high-risk patients had significantly shorter overall survival than those with KRAS wild type (p < 0.001, log-rank test). The survival curve of the KRAS Mutation/GU-like group (Fig. 6A) was not similar to that of the KRAS Mutation/high-risk group (Fig. 6B). Our results provide a more detailed analysis of the prognosis of patients with KRAS mutations. This significant difference suggested that the LncSig may predict outcome better than KRAS mutation status alone.

Survival performance prediction comparison of the LncSig with existing lncRNA-related signatures

We further compared the prediction performance of the LncSig with two recently published lncRNA signatures using the same TCGA patient cohort: a 3-lncRNA signature (H19, MALAT1, and CCHE1) derived from Cáceres’ study (hereinafter referred to as CácereslncSig)¹⁵ and a 2-lncRNA signature (HOTAIR and SNHG1) derived from Aalijahan’s study (hereinafter referred to as AalijahanlncSig)¹⁶. As shown in Fig. 7, the AUC at 3 years for the LncSig is 0.687, which is higher than that of CácereslncSig (AUC = 0.569) and AalijahanlncSig (AUC = 0.580) (R-package: limma, survival, survminer and timeROC). These ROC comparisons indicate that the LncSig predicted survival better than the two recently published lncRNA signatures.

Discussion

Cervical cancer poses a serious threat to women's health. Statistics show that the age at diagnosis is gradually decreasing, with 80% of patients developing aggressive cancer. Although traditional tumor grade and pathologic stage are the most important prognostic factors in CC patients, it remains difficult to predict the clinical outcome accurately^17,18. Reliable and specific biomarkers for the diagnosis and prognosis of cervical cancer are scarce and underexplored. Earlier research focused on single biomarkers, which might reduce prognostic performance^19,20,21. Therefore, more reliable prognostic models for CC patients are urgently needed.

Recently, genomic instability has drawn growing attention from researchers. Genomic instability can initiate cancer, accelerate its progression, and influence the overall prognosis and survival of affected patients, including those with CC^3,5,22,23. Recent studies have shown that epigenetic modifications and DNA damage from endogenous and exogenous sources can affect genomic instability^24,25,26,27. An increasing number of reports have revealed that lncRNAs are implicated in the control of disease progression in various cancers^28,29,30. Although studies of the functional mechanisms of lncRNAs have shown that they are also crucial for genomic stability, the systematic exploration of genomic instability-associated lncRNAs and their clinical significance in cancers is still in its infancy. Accumulating evidence has identified lncRNAs as functional regulators of cervical cancer oncogenesis and progression that play critical roles in regulating complex cellular behaviors^31,32,33. We used a mutator hypothesis-derived computational model, which combined lncRNA expression profiles and somatic mutation profiles in a tumor genome, to screen lncRNAs.

A three-lncRNA signature based on the TCGA database has been identified and validated in this report. We then explored the potential mechanism of the 35 lncRNAs with GO enrichment, KEGG pathway, and co-expression analysis. Our results suggested that the genes co-expressed with the 35 lncRNAs were enriched in rRNA catabolic process, deoxyribonucleotide catabolic process, and transcriptionally active chromatin. rRNA genes, essential housekeeping genes found in all organisms, help maintain genome integrity^34,35. Regulation of the intracellular deoxynucleoside triphosphate (dNTP) pool is critical to genomic stability and cancer development, and imbalanced deoxyribonucleotide catabolism can lead to genomic instability and cell-cycle progression, thus promoting the proliferation of cancer cells³⁶. During transcriptional activation, specific DNA structures such as R-loops and topoisomerase-induced DNA double-strand breaks (DSBs) cause genotoxic stress and may lead to genome instability and consequently to cancer³⁷. According to KEGG pathway analysis, the 35 lncRNAs were involved in the transcriptional misregulation in cancer and ribosome pathways, which are associated with genomic instability^38,39,40.

Furthermore, we examined whether genomic instability-related lncRNAs could predict the outcome of CC patients, which resulted in a lncRNA signature (LncSig) comprising three genomic instability-related lncRNAs (AP001527.2, AC107464.2, and MIR100HG). The LncSig classified patients into a high-risk and a low-risk group with significantly different survival in the training set, and this was verified in the testing set. After a careful literature search, we found that AP001527.2 was associated with the immune microenvironment of cervical cancer⁴¹. MIR100HG was associated with promoter methylation in cervical cancer^42,43. The biological function of lncRNA AC107464.2 has not been reported until now. These validation results in multiple data sets indicated that the LncSig could predict the prognosis and genomic instability of CC patients.

Some studies suggested that activating KRAS mutation is the major oncogenic driver regardless of the specific site of origin^12,44,45. We found that the expression level of KRAS tended to be higher in the high-risk group than in the low-risk group. In different clinical groups, the LncSig also separated CC patients with significantly different clinical outcomes. Furthermore, the LncSig could distinguish survival outcomes between patients with KRAS mutation and the other groups: KRAS mutation/high-risk patients had significantly shorter survival than those with KRAS wild type. These findings suggested that the predictive performance of the LncSig model might be greater than that of KRAS mutation status alone.

Some limitations remain and require further study. Although the LncSig has been validated in the TCGA data set, more independent data sets are required to verify it and to establish its reliability and replicability. The regulatory mechanisms of genomic instability in CC patients also need to be clarified through extensive verification experiments.

Conclusion

In summary, we established a signature model based on 3 genomic instability-associated lncRNAs to evaluate progression and prognosis in CC. The high-risk and low-risk groups showed distinct survival outcomes, suggesting that genomic instability-associated lncRNAs can help predict the survival of patients. The LncSig provides an approach and resource for further studies. We expect the LncSig model to pave the way for further research into the function and resource of lncRNAs and to offer a key approach to customizing individual care decision-making.

Data availability

The data used to support the findings of this study are available from the corresponding author upon request. The underlying data and materials are available from the TCGA database (https://portal.gdc.cancer.gov/repository).

References

1. Fitzmaurice, C. et al. Global, regional, and national cancer incidence, mortality, years of life lost, years lived with disability, and disability-adjusted life-years for 29 cancer groups, 1990 to 2017: A systematic analysis for the global burden of disease study. JAMA Oncol. 5, 1749–1768 (2019).
2. Dong, J. et al. Long non-coding RNAs on the stage of cervical cancer (review). Oncol. Rep. 38, 1923–1931 (2017).
3. Guo, F. et al. Long noncoding RNA: A resident staff of genomic instability regulation in tumorigenesis. Cancer Lett. 503, 103–109 (2021).
4. D’amico, A. M. & Vasquez, K. M. The multifaceted roles of DNA repair and replication proteins in aging and obesity. DNA Repair (Amst.) 99, 103049 (2021).
5. King, L. et al. Survival outcomes are associated with genomic instability in luminal breast cancers. PLoS ONE 16, e0245042 (2021).
6. Meier, T. et al. Gene networks and transcriptional regulators associated with liver cancer development and progression. BMC Med. Genom. 14, 41 (2021).
7. Ding, L. et al. The emerging role of small non-coding RNA in renal cell carcinoma. Transl. Oncol. 14, 100974 (2021).
8. Liu, W., Zhang, Y. & Luo, B. Long non-coding RNAs in gammaherpesvirus infections: Their roles in tumorigenic mechanisms. Front. Microbiol. 11, 604536 (2020).
9. Zhou, M. et al. The patterns of antisense long non-coding RNAs regulating corresponding sense genes in human cancers. J. Cancer 12, 1499–1506 (2021).
10. Bao, S. et al. Computational identification of mutator-derived lncRNA signatures of genome instability for improving the clinical outcome of cancers: A case study in breast cancer. Brief Bioinform. 21, 1742–1755 (2020).
11. Sarmiento, M. E. et al. Comparative transcriptome profiling of horseshoe crab Tachypleus gigas hemocytes in response to lipopolysaccharides. Fish Shellfish Immunol. 117, 148–156 (2021).
12. Lin, D. I. et al. Molecular profiling of mesonephric and mesonephric-like carcinomas of cervical, endometrial and ovarian origin. Gynecol. Oncol. Rep. 34, 100652 (2020).
13. Mine, K. L. et al. Gene network reconstruction reveals cell cycle and antiviral genes as major drivers of cervical cancer. Nat. Commun. 4, 1806 (2013).
14. Koh, W. J. et al. Cervical cancer, version 3.2019, NCCN clinical practice guidelines in oncology. J. Natl. Compr. Cancer Netw. 17, 64–84 (2019).
15. Cáceres-Durán, M., Ribeiro-Dos-Santos, Â. & Vidal, A. F. Roles and mechanisms of the long noncoding RNAs in cervical cancer. Int. J. Mol. Sci. 21, 9742 (2020).
16. Aalijahan, H. & Ghorbian, S. Long non-coding RNAs and cervical cancer. Exp. Mol. Pathol. 106, 7–16 (2019).
17. Dai, F. et al. Identification of candidate biomarkers correlated with the diagnosis and prognosis of cervical cancer via integrated bioinformatics analysis. Oncol. Targets Ther. 12, 4517–4532 (2019).
18. Yang, S. et al. Identification of diagnostic and prognostic lncRNA biomarkers in oral squamous carcinoma by integrated analysis and machine learning. Cancer Biomark. 29, 265–275 (2020).
19. Ding, X. Z. et al. Serum exosomal lncRNA DLX6-AS1 is a promising biomarker for prognosis prediction of cervical cancer. Technol. Cancer Res. Treat. 20, 1533033821990060 (2021).
20. Gu, X. et al. The dual functions of the long noncoding RNA CASC15 in malignancy. Biomed. Pharmacother. 135, 111212 (2021).
21. Shimomura, M. et al. PRMT1 expression predicts response to neoadjuvant chemotherapy for locally advanced uterine cervical cancer. Oncol. Lett. 21, 150 (2021).
22. Cortés-Gutiérrez, E. I. et al. 1p36 is a chromosomal site of genomic instability in cervical intraepithelial neoplasia. Biotech. Histochem. 95, 137–144 (2020).
23. Gashi, G. et al. Genomic instability in peripheral blood lymphocytes of patients diagnosed with high-grade squamous intraepithelial lesions: CIN 2 versus CIN 3. Mutat. Res. 854–855, 503202 (2020).
24. Ferguson, L. R. et al. Genomic instability in human cancer: Molecular insights and opportunities for therapeutic attack and prevention through diet and nutrition. Semin. Cancer Biol. 35(Suppl), S5–S24 (2015).
25. Jilderda, L. J., Zhou, L. & Foijer, F. Understanding how genetic mutations collaborate with genomic instability in cancer. Cells 10, 342 (2021).
26. Suzuki, R. et al. The fragility of a structurally diverse duplication block triggers recurrent genomic amplification. Nucl. Acids Res. 49, 244–256 (2021).
Article CAS PubMed Google Scholar
Tayoun, T. et al. Tumor evolution and therapeutic choice seen through a prism of circulating tumor cell genomic instability. Cells 10, 337 (2021).
Article CAS PubMed PubMed Central Google Scholar
Guh, C. Y., Hsieh, Y. H. & Chu, H. P. Functions and properties of nuclear lncRNAs-from systematically mapping the interactomes of lncRNAs. J. Biomed. Sci. 27, 44 (2020).
Article CAS PubMed PubMed Central Google Scholar
Tsagakis, I. et al. Long non-coding RNAs in development and disease: Conservation to mechanisms. J. Pathol. 250, 480–495 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zhang, H. et al. Progress of long noncoding RNAs in anti-tumor resistance. Pathol. Res. Pract. 216, 153215 (2020).
Article CAS PubMed Google Scholar
Luo, F. et al. Roles of long non-coding RNAs in cervical cancer. Life Sci. 256, 117981 (2020).
Article CAS PubMed Google Scholar
He, J. et al. Long non-coding RNA in cervical cancer: From biology to therapeutic opportunity. Biomed. Pharmacother. 127, 110209 (2020).
Article CAS PubMed Google Scholar
Galvão, M. & Coimbra, E. C. Long noncoding RNAs (lncRNAs) in cervical carcinogenesis: New molecular targets, current prospects. Crit. Rev. Oncol. Hematol. 156, 103111 (2020).
Article PubMed Google Scholar
Ide, S. et al. Abundance of ribosomal RNA gene copies maintains genome integrity. Science 327, 693–696 (2010).
Article ADS CAS PubMed Google Scholar
Ye, C. et al. BCCIP is required for nucleolar recruitment of eIF6 and 12S pre-rRNA production during 60S ribosome biogenesis. Nucl. Acids Res. 48, 12817–12832 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kohnken, R., Kodigepalli, K. M. & Wu, L. Regulation of deoxynucleotide metabolism in cancer: Novel mechanisms and therapeutic implications. Mol. Cancer 14, 176 (2015).
Article PubMed PubMed Central CAS Google Scholar
Ui, A., Chiba, N. & Yasui, A. Relationship among DNA double-strand break (DSB), DSB repair, and transcription prevents genome instability and cancer. Cancer Sci. 111, 1443–1451 (2020).
Article CAS PubMed PubMed Central Google Scholar
Lee, T. I. & Young, R. A. Transcriptional regulation and its misregulation in disease. Cell 152, 1237–1251 (2013).
Article CAS PubMed PubMed Central Google Scholar
Khatami, M. Cancer; An induced disease of twentieth century! Induction of tolerance, increased entropy and “Dark Energy”: Loss of biorhythms (anabolism v. catabolism). Clin. Transl. Med. 7, 20 (2018).
Article PubMed PubMed Central Google Scholar
Sulima, S. O. et al. Ribosomal lesions promote oncogenic mutagenesis. Cancer Res. 79, 320–327 (2019).
Article CAS PubMed Google Scholar
Chen, P. et al. A prognostic model based on immune-related long non-coding RNAs for patients with cervical cancer. Front. Pharmacol. 11, 585255 (2020).
Article CAS PubMed PubMed Central Google Scholar
Roychowdhury, A. et al. Deregulation of H19 is associated with cervical carcinoma. Genomics 112, 961–970 (2020).
Article CAS PubMed Google Scholar
Shang, C. et al. Characterization of long non-coding RNA expression profiles in lymph node metastasis of early-stage cervical cancer. Oncol. Rep. 35, 3185–3197 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ding, Z. et al. MiR-16 inhibits proliferation of cervical cancer cells by regulating KRAS. Eur. Rev. Med. Pharmacol. Sci. 24, 10419–10425 (2020).
CAS PubMed Google Scholar
Fukahori, M. et al. Relationship between cervical esophageal squamous cell carcinoma and human papilloma virus infection and gene mutations. Mol. Clin. Oncol. 14, 41 (2021).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank The Cancer Genome Atlas (TCGA) for providing the publicly available data.

Funding

This work was funded by the National Natural Science Foundation of China (Grant No. 81960515).

Author information

Authors and Affiliations

Department of Reproductive Medicine, Lanzhou University Second Hospital, Lanzhou, 730030, China
Jian Zhang, Nan Ding, Chengbin Tao, Zhongzhen Liang, Wenhu Xin, Qianyun Zhang & Fang Wang
School of Life Sciences, Lanzhou University, Lanzhou, 730000, China
Yongxing He

Contributions

The study conception and design were performed by J.Z., Y.H., and F.W. Material preparation, data collection, and analysis were performed by C.T., Z.L., W.X., and Q.Z. The first draft of the manuscript was written by J.Z. and N.D. The illustration was drawn by J.Z. All authors commented on previous versions of the manuscript and read and approved the final manuscript.

Corresponding author

Correspondence to Fang Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Zhang, J., Ding, N., He, Y. et al. Bioinformatic identification of genomic instability-associated lncRNAs signatures for improving the clinical outcome of cervical cancer by a prognostic model. Sci Rep 11, 20929 (2021). https://doi.org/10.1038/s41598-021-00384-6

Received: 25 June 2021
Accepted: 05 October 2021
Published: 22 October 2021
DOI: https://doi.org/10.1038/s41598-021-00384-6
