Introduction

Colorectal cancer (CRC) is universally acknowledged as the second cause of cancer-related death worldwide [1]. Similar to other types of malignant tumors, CRC is also characterized by tumor heterogeneity that poses great challenges to its treatment. Although the targeted therapy and immunotherapy could prolong the survival time to some extent [2], the long-term survival of the CRC population remains poor with a 5-year survival rate of nearly 60% in early staged patients [3]. Unfortunately, about half of sufferers will eventually progress to distant metastases. Despite the fact that extensive researches are related to the prognostic biomarkers of CRC patients, accurate prediction of the survival time of CRC individuals is still a tough task for oncologists [4]. Therefore, to optimize the survival prediction of the CRC population more accurately, it’s imperative to design effective prognostic biomarkers for them.

Chronic inflammation is closely related to the initiation of multiple cancers, including CRC [5]. The chronic inflammatory response can present in different tumor stages and lead to genetic modification as well as genomic instability [6]. The systemic inflammatory response could be well reflected by some serum inflammatory markers derived from complete blood counts [7]. There have been quite a few clinical investigations related to pre-treatment serum inflammatory markers which help in predicting post-operative survival of CRC patients, such as SII, PNI, ALRI, NLR and PLR. Recently, Jin et al. [8]. put forward a novel inflammation index based on peripheral neutrophils, monocytes and lymphocytes count, called the systemic inflammation response index (SIRI) in cardiovascular diseases, and exhibited talented prognostic ability in most malignant tumors [9,10,11,12,13]. Moreover, a recent meta-analysis included 10754 cancer patients from 38 clinical cohorts demonstrated that serum SIRI is a universal prognostic biomarker in individuals with cancer [14]. However, there is no clinical evidence indicating whether SIRI can also serve as a survival indicator to precisely predict CRC patient outcome.

The human gut microbiota includes >100 billion bacteria, viruses andparasites that maintain symbiotic interactions with the host [15]. Dysbiosis may contribute to the occurrence and progression of CRC. Colibactin, secreted by Escherichia coli, provides a prior advantage to compete with other bacteria to locate in cancerous lesions [16] and also causes DNA damage that may lead to the development of CRC [17]. Parasites are the important composition of gut microbiota. Among them, Schistosomal is the common infestation of the colorectal tract, and has been implicated in the occurrence and progression of CRC [18]. Gut eukaryotic virome is a new research area, and a recent study demonstrates that alterations in enteric virome are correlated with the progression and prognosis of CRC [19]. Increasing evidence proves that changes in the tumor-associated bacteria could affect the body’s metabolic and immune function, allowing environmental factors to initiate and promote CRC [20,21,22]. Therefore, modulation of tumor-associated bacteria might be one of the most promising new strategies to prevent and conquer CRC [23]. Recent researches have identified fusobacterium nucleatum, enterococcus faecalis as confirmed pathogens of CRC [24]. Chen et al. [24]. put forward that gut microbiota could provide some bacterial metabolites and inhibit intestinal inflammation. The dysbiosis occupies a critical role in the pathogenesis of CRC, causing initial inflammation response via modulating different inflammatory signaling pathways [25, 26]. Although quite a few microbiologists attempt to uncover the potential association between tumor-associated bacteria and chronic inflammation among CRC individuals, the relationship between the tumor-associated bacteria and serum inflammatory biomarkers as reflected by SIRI is still unknown in patients with CRC.

In the present study, we used propensity score matching (PSM) and survival analysis to investigate whether SIRI could be utilized for risk stratification among CRC individuals who experienced surgical intervention. Then, we explored the difference in tumor-associated bacteria between low and high SIRI groups based on paraffin-embedded tumor tissues and frozen tissues. As SIRI is a novel index that could reflect the systemic inflammatory response, our analysis is designed to reveal the potential association between tumor-associated bacteria and systemic inflammatory response among CRC patients.

Materials and methods

Cohort selection

We screened patients with CRC from Wuhan Union hospital between July 2013 and September 2017. Most of them underwent radical tumor resection, and some sufferers with advanced TNM stage experienced partial resection. The inclusion criteria: (1) The confirmed diagnosis of CRC via pathological reports; (2) Patients with intact laboratory data and follow-up information; (3) The tumor tissue is large enough for 16 S rRNA sequencing. The exclusion criteria: (1) Patients used antibiotics due to the acute bacterial infection before surgical resection; (2) Patients were complicated with systemic inflammatory disease; (3) Patients were reluctant to take part in this clinical research. Finally, 298 cases of CRC individuals who received surgical therapy were included in our research and all of them provided their informed consent to this research. Our research plan was approved by the clinical ethics committee before the initiation of this study (No. 2018-S377). In order to validate the reliability of the 16 S rRNA sequencing based on paraffin-embedded tissues, we prospectively collected the fresh tissues of 24 newly enrolled CRC individuals with the same inclusion criteria during November 2022 for 16 S rRNA sequencing analysis. For the exploration of the immune microenvironment between the low and high SIRI groups, we also collected the paraffin-embedded tissues of the 24 newly enrolled CRC individuals for subsequent immunohistochemistry assay.

Data collection

We collected the following clinical information, such as gender, body mass index (BMI), age at diagnosis, primary site, T stage, tumor grade, N stage, TNM stage, tumor size, M stage, postoperative chemotherapy, and laboratory data (liver, renal function; inflammatory indexes; serum tumor markers) and follow-up information. The inflammatory indexes consist of SIRI, systemic immune-inflammation index (SII), prognostic nutritional index (PNI), neutrophil to lymphocyte ratio (NLR), platelets to lymphocyte ratio (PLR), lymphocyte to monocyte ratio (LMR), and aspartate aminotransferase to lymphocyte ratio index (ALRI). Serum tumor markers consist of CEA, CA125, CA724, and CA199. Overall survival (OS) is defined as the interval between the first day of surgical resection and the date of death or last visit, and disease-free survival (DFS) is defined as the interval between the first day of surgical resection and the date of any type of tumor progression, recurrent or last visit. SIRI [27] is calculated as the formula of neutrophil count * monocyte count/lymphocyte count, and other inflammatory indexes are defined according to the previous study [28]. In addition, we also collected the paraffin-embedded tumor tissues and frozen tissues for 16 S rRNA sequencing.

PSM analysis

These patients were strictly matched with 1:1 between low and high SIRI groups via the nearest neighbor algorithm. We matched age at diagnosis, gender, BMI, TNM stage, histological grade, tumor size, tumor site, and chemotherapy to adjust for confounding indexes, and to facilitate the balanced comparison between the low SIRI and high SIRI groups. We applied an inverse probability of treatment weighting algorithm to further eliminate the potential imbalance between the low SIRI and high SIRI groups. A Cox proportional-hazards model was selected for the survival analysis by including the significant features in univariate Cox analysis. We also carried out sensitivity analyses in the primary cohort as well as the PSM cohort to further validate the conclusion of the univariate Cox analysis.

DNA extraction and 16 S rRNA sequencing

We used the Omega Mag-Bind soil DNA kit (Omega Bio-Tek, Norcross, GA, USA) to abstract tumor-associated bacteria DNA from the selected samples. Agarose gel electrophoresis was utilized to quantitatively measure the purity of the total tumor-associated bacteria DNA. V3-V4 bacterial genome of 16 S rRNA gene was further amplified via a polymerase chain reaction. The forward primer sequence was 5'-ACTCCTACGGGAGGCAGCA-3' and the reverse primer sequence was 5'-GGACTACHVGGGTWTCTAAT-3'. The high-throughput sequencing library was constructed by using the Illumina TruSeq Nano DNA LT library prep kit (Illumina, San Diego, CA, USA).

Analysis of the sequencing data

Quantitative Insights into Microbial Ecology2 (QIIME2) software was applied to preliminarily manage the raw sequences. Sequences with >97% similarity were automatically assigned to one operational taxonomic unit (OTU) via Uparse software. We also operate QIIME2 software to allocate the representative sequences taxonomically by the exploration of the Greengenes database (http://greengenes.secondgenome.com/). Alpha diversity, including Chao1, Shannon, Goods_coverage, Simpson, and observed spices, was measured using QIIME2 to compare the species diversity between low and high SIRI groups. Beta diversity was assessed to compare the differences in microbial community composition between low SIRI and high SIRI groups using principal-coordinate analysis (PCoA). Linear discriminant analysis (LDA) effect size (LEfSe) analysis was carried out to identify significant taxa between low and high SIRI groups at the levels of phylum and genus. The MetaCyc database and Kyoto Encyclopedia of Genes and Genomes (KEGG) database were explored to perform KEGG Orthology (KO) analysis.

Immunohistochemistry

Four serial sections of 5 μm per paraffin block are obtained for the following immunohistochemistry staining. These sections were first baked at 60 °C and then deparaffinized in xylene and ethanol. After hydration, 3% hydrogen peroxidase was utilized to block endogenous peroxidase activity. Standard antigen retrieval was conducted via heating the sections immersed in citric acid solution (pH = 6.0) in a pressure boiler. Subsequently, these slides were incubated with the primary antibodies [CD20 (60271-1-Ig, Proteintech, 1:5000); CD4(ab133616, Abcam, 1:500), CD8(ab85792, abcam, 1:400), CD68(ab959, Abcam, 1:6000)] at 4°Covernight, and then incubated with second antibody. After 3,3'-diaminobenzidine tetrahydrochloride staining and hematoxylin counterstaining, the slides were scanned for further quantitative analysis. The density of CD4 + , CD20,CD68 and CD8 + T cells both invasive margin (IM) and in the core of the tumor (CT) were automatically calculated using ImageJ software (version 1.48). The software generally contains positive cells and a positive nucleus, and we used its ratio (positive cells/positive nucleus) to represent the expression status of four immune cells in CRC tissues.

Statistical analysis

All the statistical analyses were performed via R software (version 3.0), Graphpad Prism 9, and SPSS 20.0. Availability of R codes involved in our analysis is available upon request. Accessibility of the SIRI threshold that may stratify the CRC patients into two gatherings with distinctive OS results was evaluated by using the X-tile software (version 3.6). Continuous data were presented as mean with standard deviation, and compared by t-test or nonparametric test, while categorical indexes were summarized as the frequency with percent and compared by chi-square or Fisher exact test. We plotted survival curves and compared the survival time between the low SIRI and high SIRI groups by log-rank test. ROC curves were drawn to compare the predictive ability of SIRI, SII, PNI, NLR, PLR, LMR, and ALRI for survival rates among CRC patients. The correlation between SIRI and other inflammatory biomarkers was quantified with Spearman analysis. The comparison of alpha diversity between the low SIRI and high SIRI groups using the Kruskal–Wallis test.

Results

Baseline features of included CRC individuals

A total of 298 CRC patients receiving surgical resection met the inclusion criteria, and were thus included in this research. Based on the optimal threshold of SIRI (1.4) measured by X-tile (Fig. S1), we divided these individuals into the low SIRI group (N = 192) and a high SIRI group (N = 106). As shown in Table 1, we found that histological grade (P = 0.01), tumor size (P = 0.005), N stage (P = 0.033), M stage (P = 0.005), TNM stage (P = 0.017), count of WBC (P < 0.0001), PLT(P < 0.001), serum ALB (P = 0.006), LDH (P < 0.001), CA72-4(P = 0.017), the death rate (P < 0.001) and recurrent rate (P < 0.001) are significantly different between the low and high SIRI groups. Hence, we used PSM analysis based on the ratio of 1:1 to balance these confounding factors between the two groups. The correlation between SIRI and clinical metrics in the PSM cohort and the weighted cohort is also shown in Table 1.

Table 1 Comparisons of clinical characteristics between low SIRI and high SIRI groups in the original, matched and weighted cohorts.

In the crude cohort, we assessed the correlation between SIRI and other common inflammatory indexes, such as SII, NLR, PLR, PNI, and ALRI. As shown in Fig. 1A, we discovered that SIRI exhibited a positive correlation with SII (r = 0.798), NLR (r = 0.869), and PLR (r = 0.517), while SIRI exhibited a negative association with PNI (r = −0.345). As for serum tumor markers, we found a positive relationship (r = 0.290) between SIRI and serum CA125, while SIRI showed a weak association with other serum tumor markers, such as CEA, CA199, and CA724. Then we used the ROC analysis to measure the predictive accuracy of common inflammatory biomarkers. Preoperative SIRI showed better AUC not only for the prediction of OS rate (Fig. 1B) but also for the DFS rate (Fig. 1C) among operative CRC individuals. The detailed comparison of each inflammatory index is listed in Table S1.

Fig. 1: Clinical correlation and predictive ability of serum inflammatory biomarkers.
figure 1

A The relationships between SIRI with other serum inflammatory biomarkers and serum tumor biomarkers. B ROC curves of serum inflammatory biomarkers for the prediction of overall survival among colorectal cancer patients. C ROC curves of serum inflammatory biomarkers for the prediction of disease-free survival in individuals with CRC.

Prognostic value of SIRI in CRC patients after PSM

In the crude CRC cohort, we used subgroup analysis to make sure whether the level of SIRI is a potent factor that independently affects the survival of CRC patients who received surgical removal. As exhibited in the forest plot, a strong relationship between high SIRI and less favorable OS existed in most subgroups (Fig. 2A), such as age, gender, and M stage. Similarly, a strong correlation between high SIRI and less favorable DFS existed in most subgroups (Fig. 2A), such as age, gender, primary site, N stage, M stage, and TNM stage.

Fig. 2: Subgroup analysis and survival analysis of SIRI in individuals with CRC.
figure 2

A The forest plot revealed the results of subgroup analysis for overall survival and disease-free survival. Kaplan–Meier plots of survival outcomes based on low and high SIRI groups in the crude cohort (B, C), PSM cohort (D, E) and weighted cohort (F, G).

Survival analysis was executed to assess the significance of SIRI in the stratification of CRC individuals with different survival risks. In the crude population, HR showed that a high SIRI group was correlated with a less favorable OS rate (HR = 3.21, 95%CI:1.79-5.15, P < 0.0001. Figure 2B) and DFS rate (HR = 3.31, 95%CI:2.03-5.38, P < 0.0001). Figure 2C among CRC individuals. This strong association also existed in the PSM population (Fig. 2D, E) and weighted cohort (Fig. 2F, G).

We adopted the univariate Cox model to explore the effects of SIRI on survival outcomes (OS & DFS) in the whole population, PSM population, and weighted cohort. As listed in Table 2, we noticed that high SIRI is the risk factor for inferior survival outcomes among CRC individuals. We also employed sensitivity analysis to confirm the positive relationship between SIRI and prognosis. After the adjustment for potential covariates in the three models, this association remained significant (P < 0.05).

Table 2 Results of clinical outcomes and sensitivity analysis.

Composition of tumor-associated bacteria between low and high SIRI groups

We drew the rarefaction curve to assess whether the sample size is enough for our analysis. As shown in Fig. 3A, the end of the curve lines for the low SIRI group and high group tend to be flat, implicating that the number of 16 S rRNA sequencing is almost reasonable. The rarefaction curve shows that more CRC cases will only add a few new OTUs, indicating the CRC cases are enough to cover most gut bacteria. A total of 15651 OTUs were identified in our analysis, including 11502 in the low SIRI group, 7452 in the high SIRI group, and 3303 OTUs in both low and high groups (Fig. 3B), indicating that OTUs were significantly more in low SIRI group than that in high SIRI groups. Microbial taxon assignment was used to compare the composition of tumor-associated bacteria between low and high SIRI groups at the levels of phylum and genus. As shown in Fig. 3C, the most abundant phyla between low and high SIRI groups at the phylum level were Proteobacteria, Thermi, Firmicutes, Bacteroidetes, Actinobacteria, Verrucomicrobia, Fusobacteria, Planctomycetes and TM7. We noticed that the abundance of Fusobacteria was significantly richer in the high SIRI group than that in the low SIRI group. At the genus level (Fig. 3D), the most abundant bacteria between low and high SIRI groups were Cupriavidus, Acinetobacter, Sphingomonas, Thermus, Sphingobium, Pseudomonadaceae_Pseudomonas, Brevundimonas, Massilia, Ochrobactrum and Lactobacillus. We could observe that the abundance of Acinetobacter was remarkably richer in the high SIRI group than that in the low SIRI group. Then, we further identified the most significant microbiota between the low and high SIRI groups. At the phylum level, we found that Proteobacteria, Synergistetes, WPS-2, Thermil, Fusobacteria were enriched in patients with the high SIRI group, while Cyanobacteria, Armatimonadetes, Acidobacteria, Gemmatimonadetes, Planctomycetes, Actinobacteria, Chloroflesxi, OD1, Tenericutes, Deferribacteres, Nitrospire, TM7, Fimicutes and Verrucomicrobia were enriched in low SIRI group (Fig. 3E). At the genus level, the abundance of Cupriavidus, Thermus, Ochrobactrum, Cupriavidus, Acidovorax, Janthinobacterium, Sphingomonas, Sphingobium, Shigella, Sphingobium and Pelomonas were up-regulated in high SIRI group, while Ralstonia, Brevundimonas, Bacteroides, Lactobacillus, Massilia, Anoxybacillus, Arthrobacter, Herbaspirillum, Acinetobacter and Flavobacterium, were up-regulated in low SIRI group (Fig. 3F).

Fig. 3: Rarefaction curve and gut microbiota composition between low and high SIRI groups.
figure 3

A Rarefaction curve. B The Venn plots reveal the unique and common taxa between low and high SIRI groups. Compositions of gut microbial taxonomic at the phylum (C) and genus (D) levels. Up-regulated and downregulated microbial taxonomic between low and high SIRI groups at the phylum (E) and genus (F) levels.

As alpha diversity is the general reflection of species richness, we applied this analysis to determine the richness of the tumor-associated bacteria between low and high SIRI groups. As listed in Fig. 4A, indexes of Chao 1(P = 9.9e-11), Goods coverage (P = 4.2e-8), Shannon (P = 3e-24), Simpson (P = 1.2e-27) and observed spices (P = 2e-11) were remarkably different between the low and high SIRI groups, indicating that species richness was significantly richer in low SIRI group than that in high SIRI group. Based on principal-coordinate analysis (PCoA), we found that PCo1 is 12.3% and PCo2 is 8.8% (Fig. 4B). LEfSe analysis revealed that there was a remarkable difference in species diversity between the two groups (Fig. 4C). When the LDA threshold was set at 3, a total of 42 species were identified in the low SIRI and high SIRI groups. Seven species were enriched in the high SIRI group, and 35 species were enriched in the low SIRI group. The 16 S rRNA sequencing results of CRC tissues were analyzed by KEGG and KEGG Orthology analyses between SIRI low and high groups. Figure S2 shows the relative abundances of functional pathways. The top 5 pathways of biosynthesis were amino acid biosynthesis; cofactor, prosthetic group, electron carrier, vitamin biosynthesis; nucleoside and nucleotide biosynthesis; fatty acid and lipid biosynthesis, and carbohydrate biosynthesis. In addition, the KO analysis uncovered eight significant metabolic pathways between SIRI low and high groups (Table S2).

Fig. 4: Diversity of microbiota and LEfSe analyses of the low and high SIRI groups.
figure 4

A Alpha diversity reveals that species richness was different between the two groups. B Beta diversity analysis revealed by PCoA. C LEfSe analysis revealed that there was a remarkable difference in species diversity between the two groups.

Validation with frozen CRC tissues

We also used the 24 cases of frozen tissues from newly enrolled CRC individuals for 16 S rRNA sequencing. The most abundant phyla between the low SIRI group and high SIRI group at the phylum level (Figure S3A) are Proteobacteria, Thermi, Firmicutes, Actinobacteria, Bacteroidetes, Cyanobacteria and TM7. At the genus level, the most abundant bacteria between low and high SIRI groups were Cupriavidus, Acinetobacter, Sphingomonas, Thermus, Psesudomonadaceae_Pseudomonas, Brevundimonas and Sphingobium (Figure S3B). Hence, the abundance between the low SIRI group and high SIRI group both at the phylum and genus levels based on frozen tissues is similar to the abundance based on paraffin tissue samples. Moreover, alpha diversity is also different between the low SIRI group and the high SIRI group based on fresh tissues(Figure S3C). Results of LEfSe analysis showed that there is a significant difference in species diversity between the low SIRI group and the high SIRI group based on frozen tissues (Figure S3D).

Comparison of immune cells in the low and high SIRI groups

We used the immunohistochemistry assay to determine the expression of four immune cells in the low and high SIRI groups. The density of CD4 + T cells regarding the ratio of positive cells to the positive nucleus in the IM and CT seem to be higher in the high SIRI group than that in the low SIRI group (Fig. S4A), but the difference between the low SIRI group and the high SIRI group is statistically insignificant (Fig. S4B). Moreover, CD8 + T cells (Fig. S4C, D), CD20 B cells (Fig. S4E, F) and macrophages (Fig. 4G, H) exhibited no significant difference neither in IM nor CT of CRC tissues. The representative staining figures of CD4 T cells, CD8 T cells, CD20 B cells and macrophages are listed in Fig. 5A–D. The insignificant difference in immune cells between the low SIRI and high SIRI is more likely due to the small sample size of newly enrolled CRC patients.

Fig. 5: The representative staining images of immune cells in the low and high SIRI groups.
figure 5

A CD4 + T cells; B CD8 + T cells; C CD20 + B cells; D CD68 + macrophages.

Discussion

Large metagenomic evidence highlights an essential role of the intestinal microbiota in chronic gut inflammation and CRC [29]. Tumor-associated bacteria plays a pivotal role in shaping inflammatory environments, which could enhance the tumor growth and metastasis of CRC [30]. Intestinal dysbiosis will lead to the decrease of beneficial bacteria-derived metabolites, enhanced release of toxic metabolites secreted by bacteria, and the disruption of the epithelial barrier, which could incur the aberrant activation of the immune response with chronic inflammation and thus promote the progression of CRC [31]. Hence, gaining deep insights into the correlation between tumor-associated bacteria and systemic inflammation in CRC is of great significance.

In the present study, we initially explored the prognostic significance of SIRI for stratifying CRC individuals with different survival risks, and we observed that high SIRI was not only connected with the worse OS but also linked to poorer DFS in CRC sufferers. ROC curves demonstrated that SIRI possessed a superior predictive ability for the survival rate of CRC patients to other common inflammatory biomarkers, such as SII, PNI, NLR, and PLR. Then, we especially compared the composition of gut microbiota between low SIRI and high SIRI patients via 16 S rRNA gene sequencing, and we noticed that there existed significant differences in the diversity and compositions of tumor-associated bacteria between the low and high SIRI groups, indicating that high levels of inflammation reduced the diversity of gut microbiota in CRC patients.

A clinical trial explored the correlation between SIRI and pathological complete response in patients with 241 cases of breast cancer receiving chemotherapy and concluded that serum SIRI could predict pathological complete response in breast cancer individuals receiving chemotherapy [12]. This conclusion was proved by another clinical trial containing 262 cases of breast cancer individuals [32]. Sun et al. [9] demonstrated that a high level of SIRI (≥0.89) is an independent predictor of worse prognosis among gallbladder cancer patients. Huang et al. [33] found that SIRI is useful in helping the differentiation of malignant and benign ovarian tumors, while this study did not assess the prognostic value of SIRI among ovarian tumor patients. Moreover, a recent study [34] investigated the prognostic influence of the SIRI on the survival outcomes of lung cancer patients receiving concurrent chemoradiotherapy and concluded that a high level of SIRI could independently affect the survival outcomes among those patients. However, the prognostic influence of SIRI on patients with CRC is still unknown. Our study focused on the clinical and prognostic significance of SIRI among CRC individuals, and we found that a high level of SIRI was correlated with less favorable survival outcomes of CRC patients not only in the entire cohort but also in the PSM cohort.

Recent studies have revealed that tumor-associated bacteria is pervasive among malignant tumors and a significant factor in cancer immunotherapy [35,36,37]. Yu et al. [38] collected fecal samples from 49 matched healthy individuals, 23 cases of primary gastric cancer, 26 metastatic gastric cancer patients, and the results of 16 S rRNA gene sequencing revealed that Streptococcus alteration was significantly correlated with liver metastasis of gastric cancer. Erick et al. [39] used 16 S rRNA gene sequencing to compare the composition of tumor-associated bacteria in pancreatic cancer patients with different survival times, and found that higher alpha diversity of tumor tissue is more likely to correlate with longer survival time. Another study applied The Microbe Identification Microarray to test for the presence of 272 bacterial species from 333 upper digestive tract tissues and found that decreased microbial abundance in the upper digestive tract was closely associated with both cancer-predisposing states [40]. Although most of these studies included similar sample sizes between different groups, the two groups are still not balanced regarding baseline features, which will cause bias in their study conclusions. As our current study was a retrospective cohort analysis, we determined propensity scores for low and high SIRI groups to adjust for confounding variables. We believe that PSM analysis will facilitate well-balanced comparability between the low and high SIRI groups. Gut microbiota is easily affected by many clinical factors, such as race, age, gender, and TNM stage, we employed a 1:1 PSM analysis to balance low and high SIRI groups, and we found that high levels of SIRI correlated well with the worse survival outcomes in CRC individuals experiencing surgical resection. After PSM, compositions of tumor-associated bacteria between low SIRI and high SIRI were significantly different.

Dysbiosis promotes chronic inflammation and carcinogenesis, and a high level of systemic inflammation is correlated with worse survival outcomes in CRC individuals. However, the potential relationship between systemic inflammation and gut microbiota is largely unknown in CRC. Bacteroidetes are reported to be closely linked to chronic intestinal inflammation [41]. Moreover, an increased abundance of Fusobacterium was detected in the intestinal tracts of individuals with CRC [42], and F. nucleatum subsp. polymorphism secrets outer membrane vesicles, which could produce NF-κB and TLR4 to activate pro-inflammatory pathways [43]. Our analysis utilized PSM to balance the low and high SIRI groups, to identify the most significant bacteria between the two groups in CRC individuals. At the phylum level, we noticed that Fusobacteris was enriched in the high SIRI group, indicating Fusobacteris plays a roinflammatory role in CRC occurrence and progression. We found that cupriavidus, acinetobacter, and sphingomomas are the top three bacteria among CRC patients with high SIRI. An experimental study reveal that a high-fat diet produced a pro-inflammatory microenvironment characterized by the increased abundance of sphingomomas [44]. Our study also pointed out that the abundance of sphingomomas was enriched in high inflammatory tissues in CRC.

Our study not only includes noteworthy strengths but also contains two limitations. We investigated the characteristic differences in microbiota profiles between low SIRI (N = 83) and high SIRI (N = 83) groups in CRC with PSM analysis. However, two obvious drawbacks also existed in our analysis. First, this was a retrospective analysis and we could only collect paraffin tissue samples, which may somewhat affect the composition of gut microbiota. Although we validated the reliability of 16 S rRNA sequence with frozen CRC tissues, the sample size of newly collected tissues is small due to the limited time. Next, the exact mechanism of how inflammation status reflected by high SIRI affected changes in tumor-associated bacteria composition in CRC is still unknown, and metagenomics along with metabolomics might be deductive to solve this clinical issue.

Conclusion

Our results in the PSM cohort of 166 cases of CRC patients treated with surgical removal showed that a high level of baseline SIRI was a robust biomarker to predict remarkably worse survival outcomes. Moreover, we detected significant differences in the compositions of tumor-associated bacteria between low and high SIRI groups and found that the diversity of microbiota in the low SIRI group was significantly richer than that in the high SIRI group. Our study roughly revealed the potential correlation between systemic inflammation and tumor-associated bacteria in CRC patients.