Establish a novel tumor budding-related signature to predict prognosis and guide clinical therapy in colorectal cancer

Li, Qixin; Liu, Gaixia; Qiu, Quanpeng; Zhang, Jiaqi; Li, Ruizhe; Zhao, Jiamian; She, Junjun; Chen, Yinnan

doi:10.1038/s41598-024-52596-1

Download PDF

Article
Open access
Published: 25 January 2024

Establish a novel tumor budding-related signature to predict prognosis and guide clinical therapy in colorectal cancer

Qixin Li^1,2,3,
Gaixia Liu^1,2,3,
Quanpeng Qiu^1,2,3,
Jiaqi Zhang^2,3,
Ruizhe Li^1,2,3,
Jiamian Zhao^1,2,3,
Junjun She^1,2,3 &
…
Yinnan Chen^2,3

Scientific Reports volume 14, Article number: 2180 (2024) Cite this article

750 Accesses
Metrics details

Subjects

Abstract

Tumor budding is a long-established independent adverse prognostic marker for colorectal cancer (CRC), yet assessment of tumor budding was not reproducible. Therefore, development of precise diagnostic approaches to tumor budding is in demand. In this study, we first performed bioinformatic analysis in our single-center CRC patients’ cohort (n = 84) and identified tumor budding-associated hub genes using the weighted gene co-expression network analysis (WGCNA). A machine learning methodology was used to identify hub genes and construct a prognostic signature. Nomogram model was used to identified hub genes score for tumor budding, and the receiver operating characteristic (ROC) curve and calibration plot indicated high accuracy and stability of hub gene score for predicted the prognosis of CRC. The association between budding-associated hub genes and score and prognosis of CRC were further verified in TCGA CRC cohort (n = 342). Then gene set enrichment analysis (GSEA) and gene set variation analysis (GSVA) were applied to explore the signaling pathways related to the tumor budding and validated by immunohistochemistry (IHC) of our clinical samples. Subsequently, immune infiltration analysis demonstrated that there was a high correlation between hub genes score and M2-like macrophages infiltrated in tumor tissue. In addition, somatic mutation and chemotherapeutic response prediction were analyzed based on the risk signature. In summary, we established a tumor budding diagnostic molecular model, which can improve tumor budding assessment and provides a promising novel molecular marker for immunotherapy and prognosis of CRC.

Identification of potential core genes in colorectal carcinoma and key genes in colorectal cancer liver metastasis using bioinformatics analysis

Article Open access 14 December 2021

A novel disulfidptosis and glycolysis related risk score signature for prediction of prognosis and ICI therapeutic responsiveness in colorectal cancer

Article Open access 16 August 2023

A prognostic index based on an eleven gene signature to predict systemic recurrences in colorectal cancer

Article Open access 02 October 2019

Introduction

Colorectal cancer (CRC) is among one of the leading causes of cancer-related mortality and morbidity worldwide, responsible for almost 2,000,000 new cases yearly and over 900,000 deaths globally¹. For decades, cancer staging has relied on anatomy based TNM staging and positive rates of lymph node metastasis ². In the most recent TNM (2017) and WHO (2019) classification schemes, the tumor budding in CRC was emphasized as an additional prognostic factor for this CRC patients^3,4,5. Tumor budding, defined as a single cancer cell of up to four cancer cells at the tumor-invasive margin, has emerged as a promising independent prognostic biomarker in CRC^6,7,8. Standardization of tumor budding assessment was achieved by the International Tumor Budding Consensus Conference (ITBCC) recommendations in 2016: identification of the budding hotspot (measuring 0.785 mm2) at the invasive front of the tumor and counting of the number of buds results in a score that can be classified as Bd1 (0–4 buds), Bd2 (5–9 buds) or Bd3 (10 or more buds)⁹. The prognostic role of Bd has been widely investigated and currently influences decision making in CRC patients. In patients with pT1 CRC, intermediate to high-grade Bd (Bd2-3) is an independent predictor of lymph node metastasis and is increasingly considered^10,11. In stage II colorectal cancer, high-grade Bd (Bd3) represents a poor prognostic factor that should warrant consideration of adjuvant chemotherapy^12,13.

In general, tumor budding has been evaluated by hematoxylin–eosin (H&E) staining, but does not always reproducible¹⁴. Moreover, positive rates of tumor budding are affected by the selection of slide and pathologists. Therefore, it is of great clinical importance to find additional biomarkers for evaluation and characterization of tumor budding.

Despite the application of machine learning approaches based on deep learning has improved the accuracy of detecting tumor budding, there are still some limitations in those semi-automatic and automatic detection methods^15,16. The development of a gene signature with an exceptional ability to represent budding levels would help assess the tumor budding in collaboration with pathological sections and overcome poor objectivity in budding judgment.

In the present article, we employed multiple bioinformatic approaches to pick up signature genes of tumor budding in CRC to construct a multigene signature for molecular diagnosis of tumor budding. These genes displayed remarkable diagnostic performance and the prognostic predictive value of the gene signatures was validated in the TCGA cohort. In addition, we investigated the underlying mechanism and immune cell infiltration of tumor budding, and preliminary experimental validation was also performed. Combining traditional methods with molecular markers for evaluation of tumor budding will be helpful for clinical doctors to make correct decision on therapeutic strategies and prevent possible under and overtreatment.

Results

Characteristics of patients

The research flow chart depicts the study design (Fig. 1). After the retrieval of diagnostic glass slides, a total of 84 surgically treated patients diagnosed with CRC were included in the study. Low-grade tumor budding (BD = 1,2) was observed in 70.2% of all cases (n = 59), and high-grade (BD = 3) budding was found in 29.8% (n = 25). Examples of tumor budding assessment in standard H&E staining are presented in Fig. 2. The clinicopathologic characteristics of the 84 patients are summarized in Table1. The significant associations were found between lymph node metastasis (p = 0.025), pathological stage (p = 0.036) and the grade of tumor budding. In addition, there were no significant differences in distributions of sex, age, smoking status, tumor size and primary site between the two groups.

Table 1 The clinicopathological parameters between CRC patients with high- and low-grade tumor budding.

Full size table

Screening for tumor budding-related genes by weighted gene co-expression network analysis (WGCNA) in CRC

Correlation networks are used for identifying clusters of highly correlated genes across colorecta tumor tissues. WGCNA was performed using the expression profiles of the top 50% of variance in the 84 CRC patients cohort, and no outlier was detected in the cohort (Fig. 3A). The adjacency matrix was constructed based on the criterion of gene distribution that conformed to a scale-free network when setting the soft-threshold power of β = 16 (R2 = 0.80), retaining high connectivity information (Fig. 3B). The dynamic cut tree was made after merging similar gene modules, with the number of genes per module not less than 50 (Fig. 3C, D). Among the 18 gene module, the green module is associated most significantly with tumor budding level (correlation = 0.45, p < 0.001). Furthermore, we performed a Gene Ontology (GO) enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) signaling pathway analysis for the green module. The GO enrichment results indicated that the genes in green model were primarily related to telomeric DNA binding and regulation of cell cycle checkpoint, among others (Fig. 3E). The analysis of KEGG pathway suggested that the genes were involved in the Nucleocytoplasmic transport and Base excision repair (Fig. 3F).

Construction and validation of a 4-gene risk signature and nomogram for tumor budding

To build an efficient model to identify tumor budding in CRC,38 genes were determined by univariate logistic regression with p < 0.01 filtering from genes in green module (Fig. 4A). The least absolute shrinkage and selection operator (LASSO) regression analysis was employed to reduce the number of candidate genes using the lowest value of lambda λ to avoid the risk of over-fitting (Fig. 4B,C), and 4 candidate genes with the most powerful predictive features were identified (MTIF3,POLR1D,MZT1 and SAP18). We then constructed a nomogram and scored risk of tumor budding based on the 4 candidate genes (Fig. 4D). We also observed that the mRNA expression levels of the four genes in tumor budding high-grade group generally exhibited higher compared to low-grade group (Fig. 4E). Meanwhile, we performed qPCR to validated the results from RNA-seq (low-grade budding n = 59, high-grade budding n = 25) (Fig. 4E).

In the Sankey diagram, we identified the interrelationship between the risk score ,tumor budding and pN stage, and the patients with high risk score were mainly concentrated in high grade tumor budding (Fig. 5A). The heatmap visualized the detailed expression of budding-related genes in the high and low risk score group (Fig. 5B). To further assess the predictive efficiency of the diagnostic column line graph model, the receiver operating characteristic (ROC) curve was established to assess the diagnostic specificity and sensitivity of each gene and the nomogram. We calculated the area under the curve (AUC) for each item. The results were as follows: MTIF3 (AUC 0.677), MZT1 (AUC 0.661), SAP18 (AUC 0.680), POLR1D (AUC 0.666), and nomogram (AUC 0.702) (Fig. 5C, D). These findings imply that all major genes are involved in tumor budding, and the constructed nomogram had the highest diagnostic value. According to the calibration curve, the error between the actual risk and the predicted risk was very small (Fig. 5E). The decision curve analysis (DCA) revealed that the nomogram had a higher accuracy that can provide evidence for clinical decisions (Fig. 5F). Based on the results of DCA, we further plotted clinical impact curves to evaluate the clinical utility of the nomogram. The ‘Number high risk’ curve was close to the ‘Number high risk with event’ curve at the risk threshold from 0.4 to 1, which demonstrated that the nomogram owned powerful predictive ability (Fig. 5G). We compared risk scores between high-grade and low-grade tumor budding groups in the 84-CRC cohort. The results showed that the risk score of the high-grade budding group was significantly higher than that of the low-grade group (Supplementary Figure S1). Subsequently, we evaluated the correlation between gene signature of tumor budding (risk score) and prognosis of the colon adenocarcinoma (COAD) and rectum adenocarcinoma (READ) patients from the TCGA. We classified CRC patients into high (n = 143) and low-risk (n = 199) subpopulations with the mean value of risk score. Survival analysis uncovered that low-risk CRC individuals displayed a remarkable survival advantage (Fig. 5H). Additionally, we conducted a comparison of clinical characteristics between the high- and low-risk tumor budding groups in the TCGA cohort and observed significant differences in lymph node metastasis (p = 0.001) and pathological stage (p = 0.002), consistent with our findings from our 84-CRC cohort (Supplementary Table S1).

Wnt beta catenin signaling and hedgehog signaling are highly correlation with tumor budding

To further elucidate the possible mechanisms underlying tumor budding, we performed gene set enrichment analysis (GSEA) and gene set variation analysis (GSVA) using HALLMARK gene sets and KEGG signaling pathways in our 84-CRC cohort (low-grade budding n = 59, high-grade budding n = 25). The GSEA enrichment analysis revealed six pathways with significant enrichment (p < 0.05), including E2F targets, epithelial mesenchymal transition, hedgehog signaling, MYC targets V1, unfolded protein response, and Wnt beta-catenin signaling, as depicted in Fig. 6A. Furthermore, we performed GSVA to analyze differentially enriched pathways between low- and high-grade budding groups. The results revealed a significant enrichment of the primary bile acid biosynthesis, renin angiotensin system, wnt beta catenin signaling, as well as hedgehog signaling and RNA polymerase (Fig. 6B). We discovered that wnt beta catenin and hedgehog signaling enriched in high-grade tumor budding were detected in GSEA and GSVA. Subsequently, we verified that the risk score and 4-gene was related to the enrichment scores (ESs) of the GSVA in wnt beta catenin signaling and hedgehog signaling. As shown in Fig. 6C, D, the risk score was positively correlated with wnt beta catenin and hedgehog signaling. Several studies have reported that epithelial mesenchymal transition (EMT) plays a pivotal role in tumor budding. Therefore, the EMT scores calculated by gene set variation analysis (GSVA) were compared between the high-grade and low-grade tumor budding groups (Supplementary Figure S2). The results demonstrated a significantly higher EMT score in the high-grade tumor budding group compared to the low-grade group (p < 0.05). To further validate the association between tumor budding and wnt beta catenin signaling, we analyzed the expression of positive regulators (CK1 and wnt5a) and the negative regulator (DKK1) of wnt beta catenin signaling in CRC tissues (n = 84) by immunohistochemistry (IHC) staining (Fig. 6E, Supplementary Figure S3 and Table S2).The results revealed significant elevation of CK1 and wnt5a in high-grade tumor budding tissues compared with low-grade tumor budding tissues, while DKK1 was highly expressed in low tumor budding group. Overall, these results indicated that high risk-scores of the tumor budding were associated with activation of the wnt beta catenin and hedgehog signaling pathways.

M2-like macrophages cells were enriched in CRC tissue with high-grade tumor budding

The relationship between the tumor immunological microenvironment (TIM) and tumor development is well-established. To explore the landscape of the immunological microenvironment in patients with high and low grade of tumor budding in CRC, we used the CIBERSORT R script to assess the relative proportions of diverse immune cell populations within each CRC sample (Fig. 7A). Notably, a pronounced disparity between the high-grade and low-grade groups in the infiltration of M2-like macrophages cells was identified (Fig. 7B, C). Simultaneously, we stratified the cohort into high and low groups based on the calculated M2-like macrophages infiltration fraction and performed a chi-square analysis with respect to tumor budding grade (Supplementary Table S3). The results revealed a persistent significant difference (p = 0.05). Subsequently, we found a positive correlation between the proportion of M2-like macrophages cells and the risk score (R = 0.22, p < 0.05) (Fig. 7D). To further understand the role of 4-genes signature in immune infiltration, we used Spearman correlation analysis to determine whether they are associated with immune cell infiltration (Fig. 7E). We observed that genes SAP18 and MZT1 were positively correlated with M2-like macrophages, while all of 4-genes negatively correlated with memory B cells and activated NK cells. To further validate the association between tumor budding and infiltration of M2-like macrophages, we analyzed the densities of M2-like macrophages in CRC tissues by IHC staining (Fig. 7F, Supplementary Figure S3 and Table S2). IHC images showed that colorectal carcinoma tissue from patient with low-grade tumor budding possess lower density of M2-like macrophages compared with CRC tissue from high-tumor budding group. Collectively, these results indicated that tumor budding was associated with infiltration of M2-like macrophages in CRC.

CRC tissue with high-grade tumor budding was characterized by higher frequencies of APC and TP53 mutations and drug resistance

To identify the difference in cancer-related gene mutations between the high-risk and low-risk groups, we analyzed the distribution variations of the somatic mutations between two risk score groups in the TCGA COAD/READ cohort (Fig. 8A, B). Comparison of the mutant genes in the two groups revealed that patients with a high-risk score had markedly higher frequencies of APC and TP53 mutations. Our analysis of the mutation data from the TCGA COAD/READ cohort demonstrated that a lower tumor mutation burden (TMB) was observed in the sets of high risk score than that in the sets of low risk score (Fig. 8C). Accumulative evidence shows that patients with a high TMB may benefit from immune checkpoint inhibitors (ICIs)¹⁷. In addition, cancer stem cells (CSCs) have been recognized as promising therapeutic targets for cancer therapy¹⁸. As a result, we first assess the potential correlation between the 4-gene risk score and the cancer stem cell (CSC) in TCGA COAD/READ cohort. Figure 8D showed the results of the positive linear correlation between risk score and CSC index (R = 0.214, p < 0.001), indicating that CRC cells with higher risk score had more distinct stem cell properties and a lower degree of cell differentiation. We next use drug information from the GDSC database to calculate the half-maximal inhibitory concentration (IC50) values of 198 chemotherapy drugs or inhibitors. We found that the patients in the high-risk score group had lower IC50 value for Camptothecin, Dactolisib, Gemcitabine and Rapamycin, while IC50 values of chemotherapeutics such as Epirubicin and Foretinib were significantly lower in the patients with a low-risk score (Fig. 8E–J).

Discussion

Accumulating evidence has demonstrated that tumor budding has been associated with tumors with aggressive biology, and has been independently associated with poor outcome, lymph node metastasis, and high recurrence¹⁹. Identifying reliable and effective biomarkers for tumor budding is indispensable. However, studies on gene signature of tumor budding in colon cancer remain limited.

In this study, we used the WGCNA method to identify tumor budding-related genes. Simultaneously, LASSO-Cox was used to filter genes to construct the 4-gene signature containing: MTIF3, POLR1D, MZT1 and SAP18. The RNA-seq data utilized in this study were exclusively derived from tumor bulk tissues rather than tumor budding cells. Our hypothesis posits that tumor budding represents not merely a cellular variation, but rather an indication of the entire tumor tissue being predisposed to bud and subsequently metastasize. Obtaining transcriptome data from tumor bud cells is considerably more challenging and costly in clinical practice compared to acquiring such data from tumor bulk tissues. Therefore, the 4-gene signature may possess enhanced clinical significance in augmenting the diagnostic efficacy of tumor budding. However, the relation between the 4-gene signature and tumor budding in CRC remains unknown. RNA polymerase I subunit D (POLR1D), encoding a subunit of both RNA polymerase I and III, has been shown to be overexpressed in several human cancer types. It has been reported that POLR1D was positively correlated to tumor size and poor survival of CRC patients²⁰. Aberrant expression of POLR1D significantly promoted cell proliferation and migration in vitro, and influenced tumor growth in vivo. In our study, we also found that POLR1D was highly expressed in the high tumor budding group. These suggested that POLR1D may function as a risk factor for predicting the outcome of CRC patients, but the relationship between POLR1D and tumor budding needs to be further explored. Furthermore, to our knowledge, the role of the remaining gene signature in tumor budding has not been reported in the literature. Given that our primary research objective is to screen biomarkers capable of predicting tumor sprouting, this study did not delve into this issue extensively. In order to enhance the comprehension of the underlying mechanisms, a more comprehensive investigation will be conducted in subsequent studies.

We then established and verified a tumor budding-related diagnostic model that can accurately anticipate the risk of tumor budding in two CRC cohorts. Furthermore, four hub genes were utilized to construct a nomogram model and scored risk of tumor budding. The analysis of diagnostic value indicated that our diagnostic model possessed high accuracy and stability for tumor budding diagnosis, which implied great potential for clinical translation. Notably, the constructed nomogram model showed higher accuracy than four candidate biomarker genes alone. In summary, the combination of our genetic diagnostic models and H&E staining sections may reduce the low miss rate and improve the precision in tumor budding assessment, which will contribute to the development of more personalized and precise individualized therapy. Next, we explore the molecular characteristic of CRC with high grade tumor budding.

For signaling pathway related to tumor budding, we used GSEA and GSVA to explore the signaling pathways affecting the tumor budding. We found that wnt β-catenin and hedgehog signaling were simultaneously enriched in both analyses. Tumor budding has long been hypothesized to be comprised of cells undergoing the epithelial–mesenchymal transition (EMT), while the wnt pathway is considered to be the key modulator of EMT²¹. However, more experiments required to explore the relationship between wnt β-catenin signaling and tumor budding in CRC.

For immunological microenvironment, we carried out a differentiation of the contents of immune cells in high and low-grade tumor budding groups utilizing the CIBERSORT. The findings determined that the high-grade group had substantially elevated infiltrative levels of M2-like macrophages. Investigations have shown that in most tumor microenvironments, M2-like macrophages are engaged in inflammation resolution and suppress tumor cell immunity, thereby promoting cancer progression and metastasis^22,23. This suggests that M2-like macrophages may play a facilitative role during the development of tumor budding.

Furthermore, our study showed that CRC tissue with high-grade tumor budding was characterized by higher frequencies of APC and TP53 mutations and drug resistance. This finding, although not unexpected, substantiates the robustness of our gene model and contributes valuable insights for further investigation into the underlying mechanisms governing tumor sprouting.

In conclusion, we first established a tumor budding diagnostic molecular model based on our CRC cohort, 4-gene risk score was conducted to evaluation and characterization for tumor budding. The diagnostic model possessed high accuracy and stability for tumor budding diagnosis was validated in our CRC cohort and TCGA CRC cohort. We also explore the signaling pathway (wnt β-catenin signaling, hedgehog signaling) and immune cells (M2-like macrophages) enriched in CRC with high-grade tumor budding, which indicate the underlying mechanism of tumor budding. Our study improves tumor budding molecular assessment and provides a promising novel molecular marker for immunotherapy and prognosis of CRC.

Materials and methods

Study design

We first conducted a CRC patients’ cohort from The First Affiliated Hospital of Xi’an Jiao tong University, which enrolled all consecutive CRC patients with histologically confirmed adenocarcinoma from October 2022 to June 2023. Of these, 97 patients had diagnostic glass slides available for analysis and were included in the study. Our exclusion criteria included: 1. CRC Patients with age ≤ 18 years old (n = 4); 2. CRC Patients with hereditary syndromes (e.g., Lynch syndrome and familial adenomatous polyposis) (n = 4); 3. Patients with any malignant disease history, inflammatory bowel disease (IBD) history before the diagnosis of colorectal cancer (n = 2); 4. Patients with multiple primary colorectal carcinoma (n = 3). Finally, 84 CRC patients were included in this study for further analysis. The study was approved by the institutional review board of The First Affiliated Hospital of Xi'an Jiao Tong University. All patients signed an Institutional Review Board-approved written informed consent.

Data collection

The clinical data of all included patients were obtained from Biobank of the First Affiliated Hospital of Xi’an Jiao Tong University. Paraffin-embedded tissue blocks of all 84 patients were retrieved from the archives of the Department of Pathology, The First Affiliated Hospital of Xi'an Jiao Tong University. All histomorphological data were reviewed from the corresponding H&E-stained slides. This study was conducted in accordance with the Declaration of Helsinki and was approved by the institutional review board of The First Affiliated Hospital of Xi'an Jiao Tong University. All clinical data involved in the present study was anonymized and de-identified prior to analysis. Informed consent was obtained from all subjects involved in the study.

Definition and evaluation of tumor budding

Tumor cell clusters of four or less tumor cells infiltrating the adjacent parenchyma were defined as budding as previously described. The grade of budding was reviewed with the same microscope by two independent pathologists blinded to the clinical and outcome data through hematoxylin–eosin (H&E) staining. Evaluation of tumor buds was performed as described by the ITBCC 2016. In case of a discrepancy between the two pathologists, the overall result was decided by consensus. Patients were subsequently grouped into two budding grades as Low Grade (budding categories 1) and High Grade (budding category 2 and 3) for subsequent analyses.

RNA sequencing

Total RNA was extracted from human colorecta tumor bulk tissues. The generation and sequencing of cDNA libraries were performed on the Illumina sequencing platform (Nova) to generate 150 bp paired-end reads. Clean RNA-seq reads were mapped to human reference genome (hg38) along with the gene annotation data (genecode v29) from the Genecode database using STAR (v2.5.3a). For the TCGA cohort, clinical features, RNA-seq expression data [fragments per kilobase million (FPKM) value] were downloaded from the TCGA database (https://cancergenome.nih.gov/).

Weighted gene coexpression network analysis

We used WGCNA which can convert coexpression correlation into connection weights or topological overlap values, to identify coexpressed genes in tumor budding. Network type was set as the “unsigned” type. We used standard deviation (SD > 50%) to screen highly variable genes in the WGCNA expression data. A total of 84 samples was used as an expression matrix for further analysis. The soft thresholding power was chosen as 5 to construct a gene network and calculate coexpression similarity and adjacency, which was transformed into a topological overlap matrix (TOM). Hierarchical clustering based on TOM was used to cluster the modules. We used Pearson’s test to calculate the correlation between module eigengenes (MEs) and tumor budding. When p < 0.05, the module was considered to be significantly related to tumor budding²⁴.

Identification of distinct genes

We identified candidate hub genes by the key module genes. Then, univariate logistic regression and LASSO–Cox regression analysis was applied to these prognostic candidates. Lasso regression is a method based on linear regression that can be used to select the features that are most relevant to the target variable. It compresses the value of certain coefficients by punishing the L1 norm, thereby reducing unnecessary features in the model. Finally, by choosing the optimal penalty parameter λ correlated with the minimum tenfold cross-validation, we established a four-gene optimal prognostic model. Subsequently, the rms algorithm was used to construct a nomogram model that predicted the probability of tumor budding.

Diagnostic column line graph construction and validation

We created a column line graph model to predict the tumor budding using the “rms” program. In this process, we can obtain a tumor budding risk score based on four hub genes for each sample, and then we verified the diagnostic value of this tumor budding risk score for tumor budding in the ROC curve and calibration curves. Finally, decision curve analysis and clinical impact curves were used to assess the clinical utility of the model. The prognostic risk model was validated using TCGA-COAD-READ cohorts²⁵.

Functional enrichment analysis

Biological functional enrichment was analyzed using GO analysis and the KEGG pathway based on The Database for Annotation, Visualization and Integrated Discovery (DAVID) database (https://david.ncifcrf.gov/ (accessed on 29 September 2022)). The cutoff criterion was defined as p < 0.05²⁶. KEGG and Hallmaker pathways were analyzed using the R packages "GSVA" and "msigdbr" to explore biological process differences between high-grade and low-grade tumor budding groups in our 84-CRC cohort.

Tumor immune analysis

The proportion of 22 different types of immune cells in our CRC cohort (n = 84) was calculated through Cibersort. A Spearman-related analysis of infiltrating immune cells with Hub genes was calculated using “Corrplot” in R. P < 0.05 was considered statistically significant.

Mutation and drug susceptibility analysis

To identify somatic mutations in CRC patients belonging to high- and low-risk groups, the 'maftools' R package was utilized for the analysis of whole genome re-sequencing data from the CRC cohort (n = 337) in the TCGA database. Additionally, we computed the tumor mutational burden (TMB) score for each CRC patient in both risk groups. Furthermore, we investigated the association between these risk groups and cancer stem cells (CSCs). In order to assess potential differences in therapeutic efficacy of commonly used chemotherapeutic drugs for CRC patients in these two risk groups, we determined the IC50 values using the "oncoPredict" package²⁸.

Real-time polymerase chain reaction (qRT–PCR)

Total RNA was extracted using TRIzol reagent (Invitrogen, Thermo Fisher Scientific, Inc.) from tumor tissue. The PrimeScript™ RT reagent kit (TaKaRa) was used for reverse transcriptase reaction. RNA levels were determined by quantitative real-time PCR (qRT-PCR) in triplicate on a Bio-Rad CFX96 using the SYBR Green method (RR420A, Takara, Mountain View, CA, USA).The primers of the five genes were listed in Table S3. Fold differences were calculated for each group using normalized CT values²⁹. The primer sequences are listed in Supplementary Table S4.

Immunohistochemical analysis

Through IHC experiments, we detected the protein expression of CK1, DKK1, WNT 5a and CD163 in paraffin sections of human colon tissues. A total of 84 CRC tissues were obtained from The First Affiliated Hospital of Xi’an Jiao tong University. All the patients signed an informed consent form. The IHC staining results were interpreted by both the intensity of staining and the staining positive area, and were independently reviewed by two independent pathologists. The percentage of positive cells was scored as follows: < 5%, 0 point; 5–25%, 1 point; 26–50%, 2 points; 51–75%, 3 points; 76–100%, 4 points. The staining intensity evaluation criteria were as follows: 0 point for colorless; 1 point for pale yellow; 2 points for tan; 3 points for brown. The final IHC staining score for each tissue was obtained by multiplying the scores of positively stained cells and the scores of staining intensities.

Statistical analysis

R (version 4.0.5) was applied to perform statistical analysis in our study. χ2 test was used to obtain p-values when comparing categorical variables. Continuous variables were compared, and p-values were obtained by T test. The correlation matrices were conducted using Pearson or Spearman correlation. Wilcoxon test was conducted for the comparisons between the two groups. Survival differences were compared using K–M curves with a Log-rank test. All statistical tests were two-sided, and p < 0.05 was considered as statistical difference³⁰.

Data availability

The raw sequence data reported in this paper have been deposited in the Genome Sequence Archive (Genomics, Proteomics & Bioinformatics 2021) in National Genomics Data Center (Nucleic Acids Res 2022), China National Center for Bioinformation/Beijing Institute of Genomics, Chinese Academy of Sciences (GSA-Human: HRA005817) that are publicly accessible at https://ngdc.cncb.ac.cn/gsa-human.

References

Bray, F. et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 68, 394–424. https://doi.org/10.3322/caac.21492 (2018).
Article PubMed Google Scholar
Weiser, M. R., Gönen, M., Chou, J. F., Kattan, M. W. & Schrag, D. Predicting survival after curative colectomy for cancer: Individualizing colon cancer staging. J. Clin. Oncol. 29, 4796–4802. https://doi.org/10.1200/JCO.2011.36.5080 (2011).
Article PubMed PubMed Central Google Scholar
Cree, I. A., White, V. A., Indave, B. I. & Lokuhetty, D. Revising the WHO classification: Female genital tract tumours. Histopathology 76, 151–156. https://doi.org/10.1111/his.13977 (2020).
Article PubMed Google Scholar
Nagtegaal, I. D. et al. The 2019 WHO classification of tumours of the digestive system. Histopathology 76, 182–188. https://doi.org/10.1111/his.13975 (2020).
Article PubMed Google Scholar
Tan, P. H. et al. The 2019 World Health Organization classification of tumours of the breast. Histopathology 77, 181–185. https://doi.org/10.1111/his.14091 (2020).
Article PubMed Google Scholar
Cappellesso, R. et al. Tumor budding as a risk factor for nodal metastasis in pT1 colorectal cancers: A meta-analysis. Hum. Pathol. 65, 62–70. https://doi.org/10.1016/j.humpath.2017.04.013 (2017).
Article CAS PubMed Google Scholar
Lai, Y. H. et al. Tumour budding is a reproducible index for risk stratification of patients with stage II colon cancer. Colorectal. Dis. 16, 259–264. https://doi.org/10.1111/codi.12454 (2014).
Article PubMed Google Scholar
Zlobec, I. et al. Intratumoural budding (ITB) in preoperative biopsies predicts the presence of lymph node and distant metastases in colon and rectal cancer patients. Br. J. Cancer 110, 1008–1013. https://doi.org/10.1038/bjc.2013.797 (2014).
Article CAS PubMed Google Scholar
Lugli, A. et al. Recommendations for reporting tumor budding in colorectal cancer based on the International Tumor Budding Consensus Conference (ITBCC) 2016. Mod. Pathol. 30, 1299–1311. https://doi.org/10.1038/modpathol.2017.46 (2017).
Article PubMed Google Scholar
Bosch, S. L., Teerenstra, S., de Wilt, J. H. W., Cunningham, C. & Nagtegaal, I. D. Predicting lymph node metastasis in pT1 colorectal cancer: A systematic review of risk factors providing rationale for therapy decisions. Endoscopy 45, 827–834. https://doi.org/10.1055/s-0033-1344238 (2013).
Article PubMed Google Scholar
Ueno, H. et al. Risk factors for an adverse outcome in early invasive colorectal carcinoma. Gastroenterology 127, 385–394 (2004).
Article PubMed Google Scholar
van Wyk, H. C. et al. The role of tumour budding in predicting survival in patients with primary operable colorectal cancer: A systematic review. Cancer Treat Rev. 41, 151–159. https://doi.org/10.1016/j.ctrv.2014.12.007 (2015).
Article PubMed Google Scholar
Wang, L. M. et al. Tumor budding is a strong and reproducible prognostic marker in T3N0 colorectal cancer. Am. J. Surg. Pathol. 33, 134–141. https://doi.org/10.1097/PAS.0b013e318184cd55 (2009).
Article CAS PubMed Google Scholar
Haddad, T. S. et al. Improving tumor budding reporting in colorectal cancer: A Delphi consensus study. Virchows Arch. 479, 459–469. https://doi.org/10.1007/s00428-021-03059-9 (2021).
Article PubMed PubMed Central Google Scholar
Bokhorst, J.-M. et al. Fully automated tumor bud assessment in hematoxylin and eosin-stained whole slide images of colorectal cancer. Mod Pathol 36, 100233. https://doi.org/10.1016/j.modpat.2023.100233 (2023).
Article PubMed Google Scholar
Bokhorst, J.-M. et al. Semi-supervised learning to automate tumor bud detection in cytokeratin-stained whole-slide images of colorectal cancer. Cancers (Basel) 15, 2079. https://doi.org/10.3390/cancers15072079 (2023).
Article CAS PubMed Google Scholar
Jardim, D. L., Goodman, A., de Melo Gagliato, D. & Kurzrock, R. The challenges of tumor mutational burden as an immunotherapy biomarker. Cancer Cell 39, 154–173. https://doi.org/10.1016/j.ccell.2020.10.001 (2021).
Article CAS PubMed Google Scholar
Yang, L. et al. Targeting cancer stem cell pathways for cancer therapy. Signal Transduct. Target Ther. 5, 8. https://doi.org/10.1038/s41392-020-0110-5 (2020).
Article CAS PubMed PubMed Central Google Scholar
Lugli, A., Zlobec, I., Berger, M. D., Kirsch, R. & Nagtegaal, I. D. Tumour budding in solid cancers. Nat. Rev. Clin. Oncol. 18, 101–115. https://doi.org/10.1038/s41571-020-0422-y (2021).
Article PubMed Google Scholar
Wang, M. et al. POLR1D promotes colorectal cancer progression and predicts poor prognosis of patients. Mol. Carcinog. 58, 735–748. https://doi.org/10.1002/mc.22966 (2019).
Article CAS PubMed Google Scholar
Zlobec, I. et al. Role of APAF-1, E-cadherin and peritumoral lymphocytic infiltration in tumour budding in colorectal cancer. J. Pathol. 212, 260–268 (2007).
Article CAS PubMed Google Scholar
Grivennikov, S. I., Greten, F. R. & Karin, M. Immunity, inflammation, and cancer. Cell 140, 883–899. https://doi.org/10.1016/j.cell.2010.01.025 (2010).
Article CAS PubMed PubMed Central Google Scholar
Mantovani, A., Sozzani, S., Locati, M., Allavena, P. & Sica, A. Macrophage polarization: Tumor-associated macrophages as a paradigm for polarized M2 mononuclear phagocytes. Trends Immunol. 23, 549–555 (2002).
Article CAS PubMed Google Scholar
Zhu, Y. et al. M2 macrophage-related gene signature in chronic rhinosinusitis with nasal polyps. Front. Immunol. 13, 1047930. https://doi.org/10.3389/fimmu.2022.1047930 (2022).
Article CAS PubMed PubMed Central Google Scholar
Yan, P., Ke, B., Song, J. & Fang, X. Identification of immune-related molecular clusters and diagnostic markers in chronic kidney disease based on cluster analysis. Front. Genet. 14, 1111976. https://doi.org/10.3389/fgene.2023.1111976 (2023).
Article CAS PubMed PubMed Central Google Scholar
Chen, Y., Huang, W., Ouyang, J., Wang, J. & Xie, Z. Identification of anoikis-related subgroups and prognosis model in liver hepatocellular carcinoma. Int. J. Mol. Sci. 24, 2863. https://doi.org/10.3390/ijms24032862 (2023).
Article CAS Google Scholar
Zhang, C., Zeng, C., Xiong, S., Zhao, Z. & Wu, G. A mitophagy-related gene signature associated with prognosis and immune microenvironment in colorectal cancer. Sci. Rep. 12, 18688. https://doi.org/10.1038/s41598-022-23463-8 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Song, W., Ren, J., Xiang, R., Kong, C. & Fu, T. Identification of pyroptosis-related subtypes, the development of a prognosis model, and characterization of tumor microenvironment infiltration in colorectal cancer. Oncoimmunology 10, 1987636. https://doi.org/10.1080/2162402X.2021.1987636 (2021).
Article PubMed PubMed Central Google Scholar
Tang, Y. et al. Prognosis and dissection of immunosuppressive microenvironment in breast cancer based on fatty acid metabolism-related signature. Front. Immunol. 13, 843515. https://doi.org/10.3389/fimmu.2022.843515 (2022).
Article CAS PubMed PubMed Central Google Scholar
Xiong, Z. et al. The relationship between perianal fistula activity and abdominal adipose tissue in Crohn’s disease: An observational study. Insights Imaging 13, 156. https://doi.org/10.1186/s13244-022-01293-6 (2022).
Article PubMed PubMed Central Google Scholar
Kanehisa, M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 28, 1947–1951. https://doi.org/10.1002/pro.3715 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kanehisa, M., Furumichi, M., Sato, Y., Kawashima, M. & Ishiguro-Watanabe, M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 51, D587–D592. https://doi.org/10.1093/nar/gkac963 (2023).
Article CAS PubMed Google Scholar
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
Article CAS PubMed PubMed Central Google Scholar

Download references

Funding

This work was supported by grants from the National Natural Science Foundation of China, grant numbers 82003807, 82173394, and Shaanxi Province Science Foundation (No. 2023-GHZD-19). Medical Foundation-Clinical Integration Program of Xi’an Jiaotong University (Grant No. YXJLRH2022043).

Author information

Authors and Affiliations

Department of General Surgery, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, Shaanxi, China
Qixin Li, Gaixia Liu, Quanpeng Qiu, Ruizhe Li, Jiamian Zhao & Junjun She
Center for Gut Microbiome Research, Med-X Institute, The First Affiliated Hospital of Xi’an Jiao Tong University, Xi’an, Shaanxi, China
Qixin Li, Gaixia Liu, Quanpeng Qiu, Jiaqi Zhang, Ruizhe Li, Jiamian Zhao, Junjun She & Yinnan Chen
Department of High Talent, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, Shaanxi, China
Qixin Li, Gaixia Liu, Quanpeng Qiu, Jiaqi Zhang, Ruizhe Li, Jiamian Zhao, Junjun She & Yinnan Chen

Authors

Qixin Li
View author publications
You can also search for this author in PubMed Google Scholar
Gaixia Liu
View author publications
You can also search for this author in PubMed Google Scholar
Quanpeng Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Jiaqi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Ruizhe Li
View author publications
You can also search for this author in PubMed Google Scholar
Jiamian Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Junjun She
View author publications
You can also search for this author in PubMed Google Scholar
Yinnan Chen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Q.L. and G.L. analyzed the data and wrote the manuscript; J.Z. and R.L. collected the data; Q.Q. and J.Z. prepared figures; Y.C. and J.S. designed the study and revised the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Junjun She or Yinnan Chen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figure S1.

Supplementary Figure S2.

Supplementary Figure S3.

Supplementary Table S1.

Supplementary Table S2.

Supplementary Table S3.

Supplementary Table S4.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, Q., Liu, G., Qiu, Q. et al. Establish a novel tumor budding-related signature to predict prognosis and guide clinical therapy in colorectal cancer. Sci Rep 14, 2180 (2024). https://doi.org/10.1038/s41598-024-52596-1

Download citation

Received: 12 October 2023
Accepted: 21 January 2024
Published: 25 January 2024
DOI: https://doi.org/10.1038/s41598-024-52596-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.