Introduction

Approximately 10% of patients with colorectal cancer with submucosal invasion (SM-CRC) experience lymph node metastasis. For such patients, the Japanese Society for Cancer of the Colon and Rectum guidelines recommend surgical resection with lymph node dissection1.

For patients diagnosed with SM colorectal cancer after endoscopic treatment, additional surgical resection is performed based on pathological findings. The guidelines recommend additional resection in cases with one or more of the following pathologic factors related to lymph node metastasis: depth of submucosal invasion greater than 1000 μm, positive lymphovascular invasion, positive endoscopic vertical margin, poorly differentiated adenocarcinoma, signet-ring cell carcinoma, or mucinous carcinoma, and grade 2/3 budding at the site of the deepest invasion1. These pathological risk factors for lymph node metastasis have varying sensitivities and specificities. For example, a depth of submucosal invasion ≥ 1000 μm has a sensitivity of 94.8% and specificity of 19.0% for positive lymph node metastasis, while an undifferentiated histological type has a sensitivity of 13.3% and specificity of 97.9%2. The combination of these risk factors results in lymph node risks ranging from 7.4 to 46.9% in patients with SM-CRC3. Therefore, to predict the risk of lymph node metastasis, various methods, such as nomograms, have been utilized4.

The identification of new risk factors for lymph node metastasis in patients with SM-CRC is crucial5,6. In recent years, cancer immunity has been one of the most important factors associated with tumor prognosis6, and tumor-infiltrating T cells (TILs) have been reported to improve the prognosis of many solid tumors7. T cells can be classified as CD8+ or CD4+ T cells8. CD8+ T cells are cytotoxic T cells that contribute significantly to tumor prognosis9. Nevertheless, the role of CD8+ TILs in predicting lymph node metastasis in patients with SM-CRC has not been well evaluated. Although TILs in colorectal cancer have prognostic value10, the relationship between TILs and lymph node metastasis in patients with SM-CRC has not been reported.

Recently, CD103+ CD8+ T cells have been found to play an important role in cancer immunity, particularly in cancers of epithelial origin11.

The ligand for CD103 is the epithelial cell surface molecule E-cadherin, an epithelial-mesenchymal transition (EMT) marker12. Loss of E-cadherin in tumors is the hallmark of EMT and has been reported to be associated with poor prognosis for patients with colorectal cancer13,14,15. Therefore, we hypothesized that the number of infiltrating CD103+ cells is indirectly related to the EMT of the tumor and examined the relationship between the number of CD103+ TILs and lymph node metastasis.

Thus, the purpose of this study was to examine the relationship between lymph node metastasis and TILs in patients with SM-CRC.

Results

Patient’s characteristics and immunohistochemical staining of CD8 + TILs

We identified CD8+ T cells in surgically resected specimens from patients with SM-CRC using immunohistochemical staining with CD103 and CD8 antibodies. Of the total 180 eligible patients enrolled in this study, 15 patients treated with local excision that lacked lymph node dissection data and 67 patients without available tumor tissue were excluded. Finally, CD8+ TILs were evaluated in 98 cases (Fig. 1).

Figure 1
figure 1

Scheme of the study. A total of 180 patients with surgically treated SM-CRC were enrolled in this study; 15 patients treated with local excision that lacked pathological lymph node metastasis data and 67 patients without available specimen of the tumor area were excluded.

The evaluation site of the lymphocytes is shown in Fig. 2a. As shown in the figure, the invasive margin was set up, and the deepest part of the invasive margin with a large amount of tumor epithelial tissue was evaluated at × 200 magnification. Immunohistochemical staining showed variation among cases in the number of CD8+ TILs (Fig. 2b).

Figure 2
figure 2

(a) The number of CD8+ TILs was counted at the invasive margin. The dotted line indicates the invasive margin. (b) Immunohistochemical staining for CD8 at the invasive margins of resected CRC tissue (representative images of a highly invasive tumor and low invasive tumor).

The characteristics of the 98 patients are presented in Table 1. Only two cases displayed undifferentiated histology, and lymph node metastasis was observed in 11 patients (11.2%). The median number of CD8+ TILs was 185 in 10 high-power fields (HPFs).

Table 1 Clinicopathologic factors of all patients.

The number of CD8+ TILs and lymph node metastases

The number of CD8+ TILs decreased as the number of lymph node metastases increased (Fig. 3a), with the exception of one case with three lymph node metastases and a high number of CD8+ TILs.

Figure 3
figure 3

(a) The number of CD8+ TILs and lymph node metastases from SM-CRC. (b) The number of CD8+ TILs and presence of lymph node metastasis from SM-CRC (P = 0.042).

The number of CD8+ TILs was significantly higher in cases without lymph node metastasis (median: 180/10 HPF, range 10–1042) than in those with lymph node metastasis (median: 42/10 HPF, range 15–394; P = 0.042; Fig. 3b).

The relationship between the specificity and sensitivity of the number of CD8+ TILs for the purpose of lymph node metastasis is represented by a receiver operating characteristic (ROC) curve (Fig. 4). The area under the ROC curve (AUC) was 0.689 for the number of CD8+ TILs.

Figure 4
figure 4

The receiver operating characteristic (ROC) curve of the relationship between the specificity and the sensitivity of the number of CD8+ TILs for lymph node metastasis.

The number of CD8+ TILs with the cutoff value identified by ROC curve analysis was compared with other clinicopathological risk factors for lymph node metastasis in univariate and multivariate analyses. Univariate analysis revealed that lymphatic invasion (+), budding grade (3), and CD8+ TILs (low) were risk factors for lymph node metastasis. Multivariate analysis confirmed that lymphatic invasion ( +) and the number of CD8+ TILs (low) were independent risk factors for lymph node metastasis.

The sensitivity, specificity, PPV, and NPV of each risk factor in the present dataset are shown in Table 2.

Table 2 Variables of lymph node metastasis (univariate and multivariate analyses).

The combination of the number of CD8+ and CD103+ TILs and lymph node metastases

To identify a better indicator of lymph node metastasis, we evaluated the presence of CD103, another T cell activation marker.

Immunohistochemical staining showed that the number of CD103+ TILs varied among cases (Fig. 5a). The median number of CD103+ TILs was 116/10 HPFs. Furthermore, the number of CD103+ TILs was correlated with the number of CD8+ TILs (r = 0.524, P < 0.001).

Figure 5
figure 5

(a) Immunohistochemical staining for CD103 TILs at invasive margins of resected CRC tissue (left: high invasion; right: low invasion). (b) The number of CD103+ TILs and presence of lymph node metastasis from SM-CRC (P = 0.134). (c) The number of CD103+ TILs and lymph node metastases from SM-CRC.

The number of CD103+ TILs decreased as the number of lymph node metastases increased (Fig. 5b), except for one case with three lymph node metastases and a high number of CD103+ TILs. Moreover, the number of CD103+ TILs tended to be low in patients with lymph node metastasis; however, the difference was not statistically significant (P = 0.314; Fig. 5c). The AUC for the number of CD103+ TILs was 0.640 (Fig. 6).

Figure 6
figure 6

The receiver operating characteristic (ROC) curve of the relationship between the specificity and the sensitivity of the number of CD103+ TILs for lymph node metastasis.

The number of CD103+ TILs with the cutoff value identified by ROC curve analysis was compared with other clinicopathological risk factors for lymph node metastasis in univariate and multivariate analyses (Table 3). Univariate and multivariate analyses confirmed that CD103+ TILs (low) were an independent risk factor for lymph node metastasis.

Table 3 Factors evaluated for lymph node metastasis (univariate and multivariate analyses).

Clinicopathological risk factors for lymph node metastasis and the number of CD8+ TILs, CD103+ TILs, and CD8+ CD103+ TILs were compared using sensitivity, specificity, PPV, and NPV.

The NPVs of CD8+ TILs and CD103+ TILs were higher than those of other risk factors, except for lymphatic invasion (Table 4). The PPV of CD8+ CD103+ TILs was also higher than that of other risk factors (Table 4).

Table 4 Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of risk factors.

We added some examinations under different conditions. First, we decided to examine pT1b cases, which are considered more likely to have lymph node metastasis. 78 pT1b cases were included, and 10 cases were associated with lymph node metastasis. The relationship between CD8+ TILs and CD103+ TILs and lymph node metastasis was almost the same as in the 98 cases including pT1a (Figure, Table). Risk factors for lymph node metastasis were similar to those in the 98 cases, including pT1a. Univariate analysis revealed that lymphatic invasion (+), budding grade (3), and CD8+ TILs (low) were risk factors for lymph node metastasis. Multivariate analysis confirmed that lymphatic invasion (+) and the number of CD8+ TILs (low) were independent risk factors for lymph node metastasis. (Supplementary Fig. 1a-d, Supplementary Table 1a, b).

Second, because of the strong impact of lymphatic invasion on the results, we examined the relationship between lymphatic invasion and tumor-infiltrating CD8+ T cells and found no predominant difference in lymphatic invasion and the number of CD8+ cells (P = 0.63). We examined whether the number of CD8+ cells is a risk factor for lymph node metastasis in cases without lymphatic invasion. Since there was only one case of lymph node metastasis in patients without lymphatic invasion, we did not find a statistically significant difference, but the results showed that CD8+ TILs are possibly a risk factor even in patients without lymphatic invasion (P = 0.08). (Supplementary Fig. 2 a, b).

Discussion

In this study, we examined CD8+ TILs as a risk factor for lymph node metastasis in patients with SM-CRC. Our findings indicate low CD8+ TIL levels, as determined by the cutoff value from ROC curve analysis, to be a new, independent risk factor for lymph node metastasis. Furthermore, by combining the numbers of CD8+ and CD103+ TILs, we found a high PPV for lymph node metastasis in patients with low levels of both types of TILs and a high NPV in patients with high levels of both types of TILs. Thus, we most likely identified an effective subset of CD8+ T cells.

The few existing reports on the connection between TILs and lymph node metastasis have focused on breast cancer16 and melanoma17. In these reports, low TIL density was associated with lymph node metastasis. In breast cancer, CD8+ TILs have been found to have a negative correlation with lymph node metastasis18. Another study from 2020 reported a relationship between lymph node metastasis and TILs in early-stage breast cancer16. Conversely, TIL density and lymph node metastasis have been reported to have a low correlation in patients with esophageal squamous cell carcinoma19. One report has examined the relationship between TILs and lymph node metastasis in colorectal cancer; however, it examined CD3+ TILs and reported a low correlation20, likely because CD3+ cells include immunosuppressive T cells, such as Tregs21. Tregs are involved in the development and progression of cancer by inhibiting anticancer immunity22 and express CD3, CD4, CD25, and Foxp3, but not CD8, which is a cytotoxic T-cell marker23. The number of tumor-infiltrating CD8+ T cells is associated with tumor prognosis in many solid tumors24. Therefore, we thought that evaluating CD3+ T cells, including Tregs and cytotoxic T cells, may obscure the relationship between TILs and colorectal cancer lymph node metastasis, and evaluation of CD8 mostly expressed on cytotoxic T cells, alone does not lead to a major loss of direction. The results of a single cell analysis of immune cells from colorectal cancer (GSE188381) indicate that CD8 is almost expressed on cells expressing CD3. Single cell RNA-seq analysis of tumor infiltrating lymphocytes in colorectal cancer. The t-distributed stochastic neighbor embedding (t-SNE) plot showed that CD8A is almost expressed on CD3D, CD3E expressing cells (Supplementary Fig. 3). We hypothesized that evaluating only cytotoxic T cells would better clarify the relationship between lymph node metastasis and TILs. In this study, we did not examine pan-T cell markers such as CD3 and CD4 but only evaluated markers of cytotoxic T cells such as CD8 and CD103. We focused on CD8 cytotoxic T cells and found a relationship between CD8+ TILs and lymph node metastasis.

This study did not examine how CD8+ TILs or CD103+ TILs suppress lymph node metastasis, and this is one of the limitations of the present study.

Because tumors are heterogeneous, the site of assessment of the number of tumor-infiltrating T cells is important25. For example, previous reports have shown that the number of TILs in the tumor infiltrate is a more sensitive indicator of tumor metastasis than that in the center of the tumor10. Therefore, in this study, we evaluated the TIL count at the tumor invasion site.

We further evaluated a subset of cytotoxic T cells that express CD103. CD103+ CD8+ T cells are associated with an epithelial-derived carcinoma prognosis11 and CD103 alone contributes to prolonged survival in patients with bladder cancer26, ovarian cancer27, non-small cell lung cancer28, and melanoma29. Moreover, CD103 is expressed by the recognition of tumor antigens and is a marker of tumor-specific TILs30. However, in this study, CD103 alone was not a more accurate risk factor for lymph node metastasis than CD8. Additionally, this study featured only two cases of undifferentiated tumors, neither of which had lymph node metastasis. As CD103 was expected to predict lymph node metastasis to some extent, we would like to re-evaluate CD103 as a risk factor for lymph node metastasis by accumulating more cases.

We have a limitation of this study. The limitation is sample size was small. The present study was single-centered and included only recent cases from 2014 to 2019, after budding was included as a risk factor for lymph node metastasis in cancer with submucosal invasion, according to the guidelines of the Japanese Society for Cancer of the Colon and Rectum. We included the 98 cases of SM-CRC at our hospital for whom resected pathological specimens were available. Additionally, 11 cases (11.2%) of lymph node metastasis were observed, which is consistent with previously established rates of lymph node metastasis in SM-CRC of approximately 10%. However, the small number of cases with lymph node metastasis is a limitation of this study, and future studies with more cases are warranted.

In the present study, both the sensitivity and specificity of lymphatic invasion were high. The data set showed a strong relationship between lymphatic invasion and lymph node metastasis, as 10 of the 11 cases with lymph nodes had lymphatic invasions. In a previous report of 293 cases, the sensitivity was 0.6 (21/37) and the specificity 0.7 (188/256)4. This indicates that lymphatic invasion is significant in lymph node metastasis, but in the current data set, the relationship between lymphatic invasion and lymph node metastasis was particularly strong.

In conclusion, we examined CD8+ TIL levels as a risk factor for lymph node metastasis in patients with SM-CRC and showed that low CD8+ TIL levels are positively correlated with lymph node metastasis. The number of CD8+ TILs using a cutoff value identified by receiver operating characteristic curve analysis was an independent risk factor for lymph node metastasis.

Material and methods

This study was approved by the Research Ethics Committee of Osaka University on October 9th, 2018 (approval no. 17448). The study was conducted in accordance with the principles of the 1964 Declaration of Helsinki and its subsequent amendments, or equivalent ethical standards. Written informed consent was obtained from all patients.

Study design and patients

In this study, we retrospectively evaluated 180 patients with SM-CRC who underwent surgical resection between 2014 and 2019 at Osaka University Hospital.

Immunohistochemical staining

Immunohistochemical staining was performed on formalin-fixed paraffin-embedded tissue Sects. (4.0 mm). After deparaffinization, antigen retrieval was performed using 10 mmol/L citrate buffer (pH 6), and intrinsic peroxidase activity was blocked using 3% H2O2 for 20 min, followed by nonspecific interaction blocking with a background sniper (Biocare Medical, Pacheco, CA, USA) for 10 min. Tissue sections were stained with anti-CD8 (ab75129, mouse, diluted 1:50) and CD103 (ab129202, rabbit, diluted 1:2,000) antibodies using the Vectastain ABC kit (Vector Laboratories, Burlingame, CA, USA). The number of CD8+ and CD103+ T cells in 10 HPFs at the tumor-invasive margins was counted using ImageJ software version 1.8.0 (NIH, Bethesda, MD, USA; http://imagej.nih.gov/ij). Specifically, we evaluated the image in the field of view with the highest number of CD8+ TILs in the most infiltrated area. We also used the cell count function in Image J to import the image and count the number of cells stained with DAB. The three most positive cell locations were identified in the 20 × field of view, and each was observed in the higher magnification field of view and measured with Image J. We measured the number of cells stained with DAB via Color Threshold with settings of Hue(200–250), Saturation(0–255), and Brightness(0–130). We counted the areas pointed out in red as shown in Supplementary Fig. 4.

Statistical analysis

Statistically significant differences between categorical variables were determined using the χ2 test and between continuous variables using the Mann–Whitney test.

All statistical analyses were performed using R version 4.0.2 and JMPpro 14.0.0 (SAS Institute, Cary, NC, USA). Statistical significance was set at P < 0.05.

Single cell RNA-seq analysis

The GSE108989 single-cell RNA seq data were download in Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/).

The GSE108989 gene sets were analyzed using the R package Seurat version 3.1.5. The cited gene expression matrices from GSE108989 were read into R version 4.0.1 and converted to Seurat objects. To reduce dimensionality, principal component analysis was performed based on highly variable genes after scaling the data with respect to unique molecular identifier counts. Principle components were selected for downstream clustering based on the heatmap, jackstraw plot, and elbow plot of principal components in order to further reduce dimensionality by using the UMAP algorithm.