Introduction

Cervical cancer is one of the four most common gynecological tumors1. Every year, at least 569,847 women in the world are diagnosed with cervical cancer, and more than 311,365 people are killed2. In recent years, the incidence rate of cervical cancer has decreased significantly through universal screening and health knowledge. However, the incidence rate of cervical cancer is still high in developing countries3. For women with low education in less developed areas, the coverage rate of cervical cancer screening is still very low4. Squamous cell carcinoma is the most common type of cervical cancer, accounting for 75% of cervical cancer cases, while adenocarcinoma only accounts for about 20%. In developing countries, 70% of cervical cancer patients have local infiltration or metastasis, which has led to the high mortality of cervical cancer in developing countries. Early cervical cancer is usually treated by radical hysterectomy. When there are risk factors such as lymph node metastasis and endometriosis that may lead to recurrence, they will be treated with chemotherapy5. The standard treatment for patients with locally advanced cervical cancer is conventional radiation therapy(CRT)6. The five-year survival rate of patients with locally advanced cervical cancer can be as high as 75ā€“85% after surgical resection, radiotherapy, chemotherapy, CRT, and so on7. However, at present, all treatment methods are not effective for patients with paraaortic lymph node metastasis, and their three-year progression free survival time (PFS) and total survival time (OS) are 34% and 39%8,9, respectively. The five-year survival rate of cervical cancer patients with recurrence and metastasis was as low as 15%. Limited treatment is the main reason for this situation. Now, palliative chemotherapy is the most commonly used for patients with metastatic and recurrent cervical cancer10. The median survival time of patients with metastatic or recurrent cervical cancer treated with platinum/taxane chemotherapy and bevacizumab can be extended to 17Ā months11. However, these treatments are far from enough for most locally advanced and metastatic cervical cancer patients with positive lymph node metastasis.

In recent years, immunotherapy has been developed and increasingly used in cancer patients. For example, PD-L1 is overexpressed in a variety of tumor cells, including liver cancer cells and lung cancer cells, and plays an important role in regulating the immune response of tumor cells12,13,14,15. Currently, there are several clinical trials involving FDA-approved immunosuppressive checkpoint inhibitors, which attack tumor cells expressing PD-L1 by blocking the PD-L1/PD-1 signaling pathway, so as to improve the treatment and prognosis of patients. From the current situation, immunosuppressive therapy has achieved good results in many solid tumors16. The results of PD-1/PD-L1 inhibition in cervical cancer are also satisfactory. However, at present, immunoassay sites with a therapeutic effect are scarce, and the research on tumor immunotherapy is far from sufficient. In this study, we screened immune genes that are significantly related to the prognosis of cervical cancer, constructed an immune gene pair (IRGP) model based on these genes, and used it to verify the unique prognostic markers of cervical cancer.

Method

Data acquisition

Gene expression profile data of 255 patients with cervical squamous cell carcinoma were obtained from cancer and tumor gene map (TCGA https://www.tcga.org), and gene expression profile data set gse4400116 was obtained from gene expression compilation (GEO https://www.ncbi.nlm.nih.gov/geo/) database, including 300 samples of cervical cancer patients.

Acquisition of sample immune gene expression

2,498 immune related genes were downloaded from immport (https://www.immport.org/home), including antigen presenting cells, chemokines and their receptors, cytokines and their receptors, interferon, interleukin, etc. Using limma package in R (3.61), we compared the gene expression data of cervical cancer samples downloaded from TCGA database and geo database as training set and verification set with immune related genes, and extracted the expression amount of immune related genes in cervical cancer samples.

Construction of immune related gene pairs (IRGPs)

In the two groups of data processed in the previous step, the IRGP of the sample is calculated, and the relatively high change is selected according to the standard of media absolute deviationā€‰>ā€‰0.5. IRGP values are calculated by comparing gene expression levels in specific samples or profiles in pairs. The immune related genes are matched to compare the IRGPs. If the first IRG is larger than the second IRG, the output of the IRGP is 1; otherwise, the output is 0. If the ratio of IRGP score of 0 or 1 in training set and verification set is higher than 80%, then remove the IRGP and retain the remaining IRGP as candidate IRGP for prognosis prediction. The logistic rank test was used to screen the prognosis IRGP (FDRā€‰<ā€‰0.01). Cox risk regression analysis and glment in R (3.61) were used to perform tenfold cross validation to analyze the candidate IRGP and obtain the IRGP index. We constructed the best 29 gene pairs as immune gene pair model. We use ROC to calculate the optimal cutoff value of IRGP index, and use it as the basis to distinguish high and low risk groups.

IRGPs model validation

The single factor and multi factor Cox proportional risk analysis and survival analysis of TCGA and gse44001 cervical cancer samples were carried out with IRGPs model.

Infiltration of immune cells in cervical cancer samples.

In order to study the infiltration of immune cells in the high and low risk groups of cervical cancer, we used CIBERSORT17 to evaluate and predict the enrichment of immune cells in the samples. CIBERSORT is a tool for deconvolution of the expression matrix of immune cell subtypes based on the principle of linear support vector regression. RNA SEQ data can be used to estimate the infiltration of immune cells. CIBERSORT can analyze the relative abundance of 22 immune infiltrating cells in each sample, including NK cells, T cells, B cells and macrophages.

Functional enrichment analysis of GSEA, go and KEGG

Gene set enrichment analysis(GSEA) enrichment analysis was carried out for each gene related to immune prognosis using the fgsea package in R (3.61)18 .Cluster profiler19 was used to enrich Gene ontology(GO)function and KEGG pathway. Significant enrichment criteria: the absolute value of NES is greater than 1, the nomp value is less than 0.05, and the fdrq value is less than 0.25.

Expression of immune gene in cervical cancer

Ualcan 20 were used to analyze the expression of immune genes in cervical cancer.

Statistical analysis

Measured data were expressed as meanā€‰Ā±ā€‰standard deviation (xā€‰Ā±ā€‰s) and data were compared using t test. Kaplan Meier method was used for survival analysis. The receiver operating characteristic curve (ROC curve) and ROC analysis were completed by survivalROC(1.0.3). Single factor and multi factor analysis using Cox proportional risk regression model. Pā€‰<ā€‰0.05 was statistically significant, Pā€‰<ā€‰0.Ā 01 as the difference has very significant statistical significance.

Ethical approval and consent to participate

This article does not contain any studies with patients or animals performed by any of the authors.

Results

Expression of immune related genes and construction of IRGPs in cervical cancer samples

We obtained gene expression data of 255 cervical cancer samples from TCGA database as training set, 300 cervical cancer samples from gse44001 as verification set, 2,498 immune related genes from immport, and 479 immune related genes from cervical cancer samples by comparing the two. Through these 479 immune related genes, we constructed 23,355 IRGPs. We remove more than 80% of the IRGP with a score of 0 or 1 from the training set and validation set, leaving 12,379 IRGP as candidates. Combine TCGA clinical data with training set data (Table 1). 73 prognosis related IRGPs were screened by lasso Cox proportional risk regression analysis. After 1,000 iterations, we selected 29 optimal IRGPs to build the immune prognosis model (Table 2).

Table 1 TCGA clinical data.
Table 2 Model information about IRGPI.

IRGPs model validation

The immune prognosis model was applied to the training set, and the patients in each training set were scored. According to ROC curve analysis, the optimal cutoff value for patients to be divided into high and low risk groups is 1.124 (Fig.Ā 1A). After evaluating the model, we found that AUC value of model 1, 3 and 5Ā years is 0.913 (Fig.Ā 1B), 0.913 (Fig.Ā 1C) and 0.912 (Fig.Ā 1D). The results show that our immune prognosis gene has a high reliability for the model. The training set was divided into high-risk group (Fig.Ā 2A) and high-risk group (Fig.Ā 2B). The results showed that the overall survival rate (OS) of high-risk group was significantly lower than that of low-risk group. For TCGA training set data, single factor and multi factor Cox risk regression analysis showed that only IRGPs model showed significant prognostic effect in single factor Cox (Fig.Ā 2C), while age and IRGPs could be significant independent prognostic factors in multi factor Cox (Fig.Ā 2D). Applying this model to the validation set of gse44001 (Fig.Ā 3A) (Table 3), survival analysis showed that the OS of patients in the high-risk group was significantly lower than that in the low-risk group (Fig.Ā 3B). In the single factor and multi factor Cox analysis, IRGPs model and tumor size were significantly correlated with prognosis (Fig.Ā 3C,D).

Figure 1
figure 1

(A) Time-dependent ROC curve for IRGPI in the training cohort. (B) Time-dependent ROC curve for IRGPI in 1Ā year. (C) Time-dependent ROC curve for IRGPI in 3Ā year. (D) Time-dependent ROC curve for IRGPI in 5Ā year.

Figure 2
figure 2

(A) The model divides the training set patients into low-risk or high-risk groups. (B) Kaplan Meier curve between high and low risk groups. (C) Training set single factor Cox regression analysis forest map. (D) Training set multivariate Cox regression analysis forest map.

Figure 3
figure 3

(A) The model divides the validation set patients into low-risk or high-risk groups. (B) Kaplan Meier curve between high and low risk groups. (C) Validation set single factor Cox regression analysis forest map. (D) Validation set multivariate Cox regression analysis forest map.

Table 3 GSE44001 clincial data.

Infiltration of immune cells in cervical cancer samples

Most studies believe that the occurrence and development of tumor are closely related to immune cells, so it is an ideal method to study the infiltration of immune cells in tumor. We used CIBERSORT to analyze the infiltration of 22 kinds of immune cells in patients with high and low risk groups. FigureĀ 4A shows the expression of immune cells in different risk groups. Macrophage M0 (Fig.Ā 4B), activated mast cells (Fig.Ā 4C) were significantly overexpressed in the high-risk group, while stationary dendritic cells (Fig.Ā 4D), stationary mast cells (Fig.Ā 4E), activated CD4T cells (Fig.Ā 4F), and cd8t cells (Fig.Ā 4G) were overexpressed in the low-risk group.

Figure 4
figure 4

(A) Immune infiltration status within IRGPI risk groups. (B) Expression of Macrophage M0. (C) Expression of Mast cells activated. (D) Expression of Dendritic cells resting. (E) Expression of Mast cells resting. (F) Expression of T cells CD4 memory activated. (G) T cells CD8.

Functional enrichment analysis of GSEA, go and KEGG

We analyzed the function enrichment of IRGP in the model. The results of go analysis showed that IRGP in the model was mainly enriched in the binding of cytokines and their receptors, the binding of chemokines and their receptors, the binding of growth factors and their receptors, the binding of epidermal growth factor receptors, the binding of fibroblast growth factors and the activity of tyrosine kinase (Fig.Ā 5A,B). KEGG results showed that these IRGP were involved in cytokine cytokine receptor interaction, chemokine signaling, tumor necrosis factor signaling, MAPK signaling, NF kappa B signaling, natural killer cell-mediated cytotoxicity, viral proteins and cytokines, and Th1 and Th2 cell differentiation (Fig.Ā 5C,D). The results of GSEA (Fig.Ā 6A) showed that these IRGP were significantly enriched in trace ribonucleoprotein complex (Fig.Ā 6B), neurotransmitter transporter activity (Fig.Ā 6C), endopeptidase activity (Fig.Ā 6D), fibroblast growth factor receptor binding (Fig.Ā 6E), hormone activity (Fig.Ā 6F), fibroblast cell proliferation (Fig.Ā 6G), and growth factor receptor binding (Fig.Ā 6H).

Figure 5
figure 5

(A) Histogram graph of Immune-related genes GO analysis results. (B) Point graph of Immune-related genes GO analysis results. (C) Histogram graph of Immune-related genes KEGG pathway analysis results. (D) Point graph of Immune-related genes KEGG pathway analysis results.

Figure 6
figure 6

(A) GSEA analysis of 47 immune signature genes. (Bā€“H) In the high immune risk group of cervical cancer, 7 cancer marker genes were abundant (Pā€‰<ā€‰0.05, FDRā€‰<ā€‰0.25).

Expression of immune gene in cervical cancer

We explored the expression of IRGP in cervical cancer using the ualcan model (Table 4). There were 6 low-level expression of IRGP in cervical cancer (Fig.Ā 7) and 8 high-level expression of IRGP (Fig.Ā 8). There were differences in the expression of 5 low expression IRGP and 8 high expression IRGP in different age groups (Fig.Ā 9), and there were differences in the expression of 14 IRGP in different stages of cervical cancer (Fig.Ā 10).

Table 4 P value of IRGPs expression in different ages and stages.
Figure 7
figure 7

(A) Expression of CLCF1 in cervical cancer and normal tissues. (B) Expression of DLL4 in cervical cancer and normal tissues. (C) Expression of INHBA in cervical cancer and normal tissues (D) Expression of NOD1 in cervical cancer and normal tissues. (E) Expression of NRP1 in cervical cancer and normal tissues. (F) Expression of RBP7 in cervical cancer and normal tissues.

Figure 8
figure 8

(A) Expression of CXCR3 in cervical cancer and normal tissues. (B) Expression of DUOX1 in cervical cancer and normal tissues. (C) Expression of FGFR3 in cervical cancer and normal tissues. (D) Expression of HLA-DQA2 in cervical cancer and normal tissues. (E) Expression of LTB4R2 in cervical cancer and normal tissues. (F) Expression of STC2 in cervical cancer and normal tissues. (G) Expression of TNFSF10 in cervical cancer and normal tissues. (H) Expression of CESC in cervical cancer and normal tissues.

Figure 9
figure 9

(A) Expression of CLCF1 in cervical cancer and normal tissues. at different ages. (B) Expression of DLL4 in cervical cancer and normal tissues at different ages. (C) Expression of NOD1 in cervical cancer and normal tissues at different ages. (D) Expression of NRP1 in cervical cancer and normal tissues at different ages. (E) Expression of RBP7 in cervical cancer and normal tissues at different ages. (F) Expression of CXCR3 in cervical cancer and normal tissues at different ages. (G) Expression of DUOX1 in cervical cancer and normal tissues at different ages. (H) Expression of FGFR3 in cervical cancer and normal tissues at different ages. (I) Expression of HLA-DQA2 in cervical cancer and normal tissues at different ages. (J) Expression of LTB4R2 in cervical cancer and normal tissues at different ages. (K) Expression of STC2 in cervical cancer and normal tissues at different ages. (L) Expression of TNFSF10 in cervical cancer and normal tissues at different ages. (M) Expression of VAV3 in cervical cancer and normal tissues at different ages.

Figure 10
figure 10

(A) Expression of CLCF1 in cervical cancer and normal tissues. at different stages. (B) Expression of CXCR3 in cervical cancer and normal tissues at different stages. (C) Expression of DLL4 in cervical cancer and normal tissues at different stages. (D) Expression of DUOX1 in cervical cancer and normal tissues at different stages. (E) Expression of FGFR3 in cervical cancer and normal tissues at different stages. (F) Expression of HLA-DQA2 in cervical cancer and normal tissues at different stages. (G) Expression of INHBA in cervical cancer and normal tissues at different stages. (H) Expression of LTB4R2 in cervical cancer and normal tissues at different stages. (I) Expression of NOD1 in cervical cancer and normal tissues at different stages. (J) Expression of NRP1 in cervical cancer and normal tissues at different stages. (K) Expression of RBP7 in cervical cancer and normal tissues at different stages. (L) Expression of STC2 in cervical cancer and normal tissues at different stages. (M) Expression of TNFSF10 in cervical cancer and normal tissues at different stages. (N) Expression of VAV3 in cervical cancer and normal tissues at different stages.

Discussion

Cervical cancer is one of the most common gynecological malignancies. HPV infection is considered to be the main cause of cervical cancer21,22 although the incidence rate of cervical cancer has been significantly decreased due to the development and promotion of HPV vaccine23. But incidence rate of cervical cancer is still high in developing countries and China's low income countries24. At present, for cervical cancer patients without invasion and lymphatic metastasis, the effect of surgery combined with radiotherapy and chemotherapy is better. If metastasis and infiltration occur, the treatment effect of cervical cancer patients will become very unsatisfactory. In recent years, immunotherapy has performed well in a variety of cancers including cervical cancer25,26,27. Blocking PD-L1 / PD-1 signaling pathway to attack tumor cells expressing PD-L1 is the current mainstream method28. Although the anticancer activity of PD-1 and PD-L1 inhibitors is exciting, such immunotherapy is not effective for all patients, and a meta-analysis shows that patients who receive PD-1 / PD-L1 inhibitors have a higher risk of rash, thyroid dysfunction, pruritus, pneumonia and colitis29,30,31. Therefore, it is of great significance for the detection and treatment of cervical cancer to predict and find more biomarkers that may be related to immune prognosis.

At present, most of the prognostic genes need to be standardized to reduce the errors caused by sequencing platform and samples. In this study, the scores of IRGPs constructed by us are calculated from the gene expression data of the same sample, which can not only ignore the impact of different platforms, but also do not need to standardize and scale the data. This method has been used in many studies, including cancer molecular classification, with high reliability32,33.

In this study, we screened 29 pairs of IRGP to construct the immune prognosis model related to the overall survival rate of cervical cancer patients. The AUC values of the model in 1, 3 and 5Ā years were all greater than 0.9. According to these 29 pairs of IRGP, they were divided into high-risk group and low-risk group. In TCGA training group and GSE44001 verification group, the OS of high-risk group was significantly lower than that of low-risk group (Pā€‰<ā€‰0.01). These 29 pairs of IRGP have a good effect on sample discrimination. We found that macrophage Mo and activated mast cells were significantly over expressed in high-risk group by immunocyte infiltration analysis of samples. The existing research shows that mast cells and macrophages play an important role in cervical cancer, which can promote the development of cervical cancer by promoting lymphangiogenesis and angiogenesis34,35,36. However, in the low-risk group, the expression of static dendritic cells, static mast cells, activated CD4T cells and cd8t cells is high. Although the effect of CD4T cells on cervical cancer has not been agreed, the cd8t cells are closely related to the better prognosis of cervical cancer patients37,38,39, there is evidence that dendritic cells will decrease in patients with high HPV infection, which indicates that high expression of dendritic cells is beneficial to resist cervical cancer40, which is consistent with our results. The enrichment analysis of go and GSEA showed that these immune genes were mainly involved in the binding of cytokines and their receptors, the binding of chemokines and their receptors, the binding of growth factors and their receptors, the binding of epidermal growth factor receptors, the activity of metalloendopeptidase, the binding of fibroblast growth factors and their receptors, hormone activity, fibroblast proliferation, and the binding process of growth factor receptors.As we all know, cytokines and chemokines are the key factors in the immune response. for example: In cervical cancer, IL-10 can interfere with the differentiation of dendritic cells and thus play a strong immunosuppressive effect,TGFā€”Ī² 1 can inhibit T cell proliferation and attenuate immune response41. Research shows that growth factors and epidermal growth factors are closely related to the growth of cervical cancer and the survival rate of cervical cancer patients. High expression of growth factors and epidermal growth factors often predict poor prognosis42,43,44. Growth of fibroblasts can stimulate angiogenesis at the early stage of tumor The proliferation and invasion of cancer cells and the remodeling of extracellular matrix promote the growth of cervical cancer45,46. KEGG results showed that these immune genes were mainly enriched in chemokine signaling pathway, tumor necrosis factor signaling pathway, MAPK signaling pathway, NF kappa B signaling pathway, natural killer cell-mediated cytotoxicity, viral protein and cytokine, and Th1 and Th2 cell differentiation. Th1 and Th2 may be involved in the pathogenesis and growth of cervical cancer. Th1 may be the target of predicting chemotherapy response of advanced cervical cancer47,48,49,50, while other pathways are classical signal pathways related to cancer. Immune cytokines play an important role in cervical lesions. Torres et al. Found that IL-10 is highly expressed in the cervix of women with persistent HPV, which may be related to the persistence of HPV and the promotion of disease progression. Further research by their team showed that copy individuals of IL-4, IL-6, IL-10 and TGFB1 were significantly associated with cervical cancer, and could be used as biomarkers for susceptibility to the disease51,52.These 29 pairs of IRGP have 47 different immune genes, most of which are cytokines, antimicrobial agents and natural killer cells, which are involved in various stimulation reactions and play a key role. In cervical cancer, HPV can inhibit the apoptosis of cervical cancer cells by down regulating NOD153. In our sample, we also found that the expression of NOD1 in tumor tissue is low and there are differences in different ages and stages (Figs.Ā 7D, 9C, 10I). Sang Yeon Cho et al. Found that duox1 is highly expressed in cervical squamous cell carcinoma and can play a good prognostic role by increasing the amount of innate immune cells54. The analysis also showed that DUOX1 is highly expressed in tumor tissues and related to age and grade (Figs.Ā 8B, 9G, 10D). Stc2 can promote the proliferation of cervical cancer cells and increase the resistance to cisplatin55, while high expression of DDL4 is usually associated with low pelvic lymph node metastasis and survival rate of cervical cancer56. Therefore, we believe that the IRGP constructed in this study plays an important role in the development and prognosis of cervical cancer.

There are also some deficiencies in our research. Although we select data samples from two databases for analysis, and use more advanced methods to reduce the errors caused by platforms, samples, etc., this is still a retrospective analysis. If we can carry out a prospective study or obtain clinical samples and evaluate them with Western blot or immunohistochemistry, it will be more convincing.

Conclusion

We constructed an immune gene pair model which is closely related to the prognosis of cervical cancer patients. The model contains 29 IRGP and 47 immune-related genes. The biological functions of these 47 immune-related genes are closely related to the occurrence and development of cervical cancer. Therefore, we think that these IRGPs may be the target of predicting or diagnosing cervical cancer, and suggest that immunotherapy can improve the prognosis of cervical cancer patients by regulating these IRGPs.