Prediction of lymph node metastasis in early colorectal cancer based on histologic images by artificial intelligence

Takamatsu, Manabu; Yamamoto, Noriko; Kawachi, Hiroshi; Nakano, Kaoru; Saito, Shoichi; Fukunaga, Yosuke; Takeuchi, Kengo

doi:10.1038/s41598-022-07038-1

Download PDF

Article
Open access
Published: 22 February 2022

Prediction of lymph node metastasis in early colorectal cancer based on histologic images by artificial intelligence

Manabu Takamatsu^1,2,
Noriko Yamamoto^1,2,
Hiroshi Kawachi^1,2,
Kaoru Nakano^1,2,
Shoichi Saito³,
Yosuke Fukunaga⁴ &
…
Kengo Takeuchi^1,2,5

Scientific Reports volume 12, Article number: 2963 (2022) Cite this article

5065 Accesses
26 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Risk evaluation of lymph node metastasis (LNM) for endoscopically resected submucosal invasive (T1) colorectal cancers (CRC) is critical for determining therapeutic strategies, but interobserver variability for histologic evaluation remains a major problem. To address this issue, we developed a machine-learning model for predicting LNM of T1 CRC without histologic assessment. A total of 783 consecutive T1 CRC cases were randomly split into 548 training and 235 validation cases. First, we trained convolutional neural networks (CNN) to extract cancer tile images from whole-slide images, then re-labeled these cancer tiles with LNM status for re-training. Statistical parameters of the tile images based on the probability of primary endpoints were assembled to predict LNM in cases with a random forest algorithm, and defined its predictive value as random forest score. We evaluated the performance of case-based prediction models for both training and validation datasets with area under the receiver operating characteristic curves (AUC). The accuracy for classifying cancer tiles was 0.980. Among cancer tiles, the accuracy for classifying tiles that were LNM-positive or LNM-negative was 0.740. The AUCs of the prediction models in the training and validation sets were 0.971 and 0.760, respectively. CNN judged the LNM probability by considering histologic tumor grade.

Deep neural network trained on gigapixel images improves lymph node metastasis detection in clinical settings

Article Open access 10 June 2022

A promising deep learning-assistive algorithm for histopathological screening of colorectal cancer

Article Open access 09 February 2022

Predicting gastric cancer outcome from resected lymph node histopathology images using deep learning

Article Open access 12 March 2021

Introduction

Endoscopic resection for submucosal invasive (T1) colorectal cancer (CRC) has rapidly increased over the last decade^1,2. Approximately 10% of T1 CRC can metastasize to lymph nodes, making histologic evaluation for T1 CRC critical for determining the indications for additional surgery^3,4,5. Previous studies described five independent histologic risk factors for lymph node metastasis (LNM), as follows; depth of submucosal invasion (≥ 1000 µm), poorly differentiated or mucinous carcinoma, lymphatic invasion, venous invasion, and tumor budding^3,4,5. The 2020 Japanese Society for Cancer of the Colon and Rectum guidelines recommend surgical intervention for T1 CRC patients after endoscopic resection when at least one histologic risk factor is present⁶. While this approach minimizes the risk for relapse after endoscopic resection, it may also result in overtreatment because many T1 CRC patients without LNM undergo additional surgery. Indeed, overall relapse rate after endoscopic resection was reported to be 2.3–7.3%⁷, while the incidence of LNM for additional surgery was reported to be 9.7–11.9%^8,9,10. Many studies have reported the utility of various histologic risk factors for predicting LNM^11,12,13, but the interobserver disagreement among pathologists cannot be completely resolved¹⁴. To address these issues, a more reproducible and precise prediction system must be developed.

The application of machine-learning techniques including neural networks has expanded in the field of medicine^15,16,17,18. Several machine-learning models for histologic images have been proposed for histologic diagnosis or predicting prognosis^{19,20,21,22,23,24,25}. Most of these models utilize convolutional neural networks (CNN) for classifying histologic images because of their strong and stable tissue classification ability. One concern, however, is the lack of clarity regarding the relationship between histologic features and the decision process of CNN. When predicting prognosis for cancer patients, it is important for both clinicians and patients to understand why the machine made such a decision. A previous study revealed the relationship between histologic images and prognostic risk stratification of renal cancer by CNN²⁶, but no similar studies have been performed for CRC concerning the relationship between histologic patterns and the predictive values of artificial intelligence.

Here we introduce a machine-learning algorithm for predicting LNM for T1 CRC with H&E-stained histologic whole-slide images (WSI), and clarify the decision processes for the predictive values by combining CNN and random forests (RF).

Results

Neural network machine-learning models and characteristic tile images

The accuracy and loss of image classifier #1 for classifying cancer and others (2 classes) were 0.980 and 0.039, respectively (Supplementary Fig. 1). When classifying all histologic-type classes, the accuracy and loss were 0.920 and 0.247, respectively. The accuracy and loss of image classifier #2 were 0.740 and 0.524, respectively. (Supplementary Fig. 2).

The number of tiles sorted by predictive probability for classifier #2 showed a normal distribution (Fig. 1), and the most frequent tile probabilities for the training and validation sets were 0.53 and 0.52 in LNM-negative cases, and 0.73 and 0.57 in LNM-positive cases, respectively (Fig. 1). Histologic characteristics of representative Group A tiles showed large cancer glands with little fibrotic stroma, and Group B tiles resembled those of Group A with more obvious structural atypia. In contrast, Group D tiles showed fused or cribriform cancer glands, and Group E tiles showed severe structural atypia with some isolated cancer cells embedded in desmoplastic stroma, resulting in an architectural complexity. In addition, the grade of nuclear atypia tended to be increased in Groups D and E compared with that in Groups A and B. Group C tiles, unconfidently judged by the classifier, showed a variety of histologic patterns. Importantly, both the training and validation datasets showed similar histologic trends, suggesting that classifier #2 returned the predictive value on the basis of an understandable histologic theory.

Predictive power of the random forest machine-learning model

The area under the receiver operating characteristics curve (AUC) for predicting LNM in the training and validation sets were 0.971 and 0.760, respectively (Fig. 2a). The specificity and sensitivity were balanced when the RF score was 0.70, and the accuracy of the model for the training and validation sets was 0.91 and 0.75, respectively. The AUCs were highest when the depth and number of RF trees were 6 and 60, respectively (Fig. 2b). The important parameters for predicting LNM were the proportion of tiles predicted as LNM-negative or LNM-positive, average predictive probability of tiles, and number of Group D and E tiles (Fig. 2c). Tumor location was also important for predicting LNM. The differences in these parameters between LNM-negative and LNM-positive cases were statistically significant in the training set, but some differences were not statistically significant in the validation set (Table 1). The RF scores in the training set showed good risk stratification; 418 cases (76.3%) were classified as very-low risk with no metastatic cases (Table 2). In addition, 6 LNM-positive cases (5 training and 1 validation cases) with minimal conventional histologic risk factors (only deep submucosal invasion, > 1 mm) showed moderate to high risk, with RF scores ranging from 0.814 to 0.900. On the other hand, in the validation set, 4 LNM-positive cases were classified as very-low risk, resulting in a 2.4% LNM-positive rate (Table 2). For endoscopically resected cases, all of the LNM-positive cases in the training set (n = 5) were classified as moderate- or high-risk, while those in validation set (n = 1) were classified as very low-risk (Supplementary Table 1). These results indicated that the model successfully predicted LNM, but some validation cases were difficult to predict.

Table 1 Comparison of important parameters for random forests.

Full size table

Table 2 Proportion of random forest (RF) scores.

Full size table

Whole-slide mapping of representative cases

Cases with low RF scores had a small number of positive predicted tiles, while those with high RF scores had predominantly positive predicted tiles (Fig. 3). Cases that were LNM-negative but classified as high risk (i.e., false-positive) showed predominantly positive predicted tiles (Fig. 3b). In contrast, cases that were LNM-positive but classified as very-low risk (i.e., false-negative) tended to have many Group A and B tiles and few Group D and E tiles, resulting in low RF scores (Fig. 3c). Positive or negative predicted tiles tended to form clusters, but some single tiles existed within either cluster.

Relationships between RF scores and conventional histologic risk factors

Cases were stratified by combining conventional histologic risk factors and their average RF scores are shown in Table 3. The lower the number of histologic risk factors, the lower the RF score. The important point is that the RF scores of LNM-positive cases with few conventional risk factors were consistently higher than those of LNM-negative cases, and some were statistically significant (Table 3). All of the LNM-positive cases harbor at least one risk factor, especially the deep submucosal invasion. There were 6 LNM-positive cases after endoscopic resection and all but one case showed moderate to high-risk (Supplementary Table 2A). On the other hand, LNM-positive cases with RF scores less than 0.8 harbor at least two histologic risk factors (Supplementary Table 2B). The average RF score was significantly higher in moderately and poorly differentiated tumors than in well differentiated tumors (Supplementary Table 3). When we compared cases with an RF score cutoff value of 0.7 and at least one histologic risk factor for conventional risk evaluation, the odds ratios for LNM were higher in both the training and validation sets than when using the conventional method (Supplementary Table 4A and 4B1). Though there were some false-negative cases in the machine learning model, the number of false-positive cases was low, resulting in better predictive odds. Similarly, the odds ratios of RF scores were higher than those when evaluating histologic-grade or poorly differentiated clusters. (Supplementary Table 4A, 4B2 and B3) Regarding conventional histologic-grade evaluation, interobserver disagreement was observed in some cancer images, while the machine learning model outputted correct predictive values for these cases (Supplementary Fig. 3).

Table 3 Correlation between conventional histologic risk factors and RF scores.

Full size table

Discussion

We present an LNM prediction model for T1 CRC with explanatory histologic images for visualization of the basis of the predictive values. We combined CNN and RF to visualize the learning and deciding algorithms for predicting LNM, providing persuasive logic for pathologists and clinicians.

Prognostic prediction of CRC by artificial intelligence has been attempted using several approaches. Some researchers developed a machine-learning algorithm based on the conventional clinicopathologic information, including histologic factors of CRC^27,28,29. These studies were performed using several machine-learning algorithms. A possible advantage of utilizing conventional clinicopathologic information is the readability of the analysis results, which clinicians can use to explain to patients why the machine returned certain results. This point is critical for the application of machine-learning techniques in the field of clinical medicine because understandable evidence is the most important factor for both patients and clinicians in making a clinical decision. On the other hand, diagnostic reproducibility is also critical for a stable prediction model. In particular, interobserver disagreement of histopathologic risk factors for predicting LNM of T1 CRC has remained problematic and may not be completely resolved. Because artificial intelligence can manage a huge amount of information, direct analysis of histologic images may contribute to developing a stable prediction model. Interestingly, our LNM prediction model mimicked histologic-grade evaluation for tile images. Since the interobserver disagreement for histologic grades is difficult to solve as described in previous reports^30,31,32, and we also demonstrated in Supplementary Fig. 3, our model can provide more stable and reliable prediction results by skipping this evaluation step.

CNN is an ideal method for classifying histologic images into various categories that has been used for classifying histologic findings in many previous studies to predict gene alterations and patient prognosis^18,21,23,33. Supervised machine-learning with appropriately labeled data enables complex histologic analyses by extracting features of colored images. Skrede et al.²⁵ revealed that a combination of 2 neural networks for tile images can predict the prognosis of CRC through analyses of a large multicenter cohort. Most histologic analyses with CNN have been performed for small tile images with hundreds of pixels squares. Though it is difficult to understand the thorough process of CNN for returning its values, a visualizing method such as Grad-CAM may help to understand the histologic machine-learning theory^33,34. This method may clarify the importance of the pixels within a tile image. Because most carcinomas, however, including CRC, show intra-tumoral histologic heterogeneity³⁵, it is not enough to understand whole tumor characteristics by analyzing each tile. Learning how to extract features from a set of tile images is important to give clinicians and patients valuable information, such as a cancer prognosis.

RF has been used for predicting the prognosis of various cancers on the basis of radiologic and histopathologic images^15,17,20. The classification ability of the RF score depends on the given parameters, such as number of tile images and predictive probabilities, and differs from that of classification by CNN, which requires only input images. RF can output the importance of parameters of any type, which plays critical role for converting characteristics of various tile images into a simple score through the understandable logic of decision trees. Histologic WSIs contain a lot of information, and most solid tumors exhibit histologic heterogeneity; thus, visualization of the decision process for predicting prognosis may provide a novel approach for histologic classification. In this study, CNN calculated and extracted characteristic images affecting the risk of metastasis (Group D and E), which contained poorly differentiated cancer clusters and desmoplastic stroma. Furthermore, RF indicates the proportion of LNM-positive or LNM-negative tiles, and their average probabilities were important for predicting LNM for cases. RF scores of our model tended to be higher in the LNM positive cases, indicating that this method may produce different results from the conventional method. Moreover, RF can calculate variables other than image features, such as tumor location, which provides researchers some additional options to improve the accuracy of the model. Tumor location is a risk stratification factor for T1 CRC^13,36,37. Rectal cancers are more likely to metastasize than colon cancers³⁶, and our results revealed similar tendency (Table 1). According to this fact, in our prediction model, RF indicated the importance of the tumor location, but no local predilection was demonstrated in the validation set (Table 1). This phenomenon may be attributed to the small number of LNM-positive cases in the validation set, having a negative effect on the LNM prediction. It is important to note that the large amount of outcome data for predicting LNM cases with RF was much smaller than the amount of data from millions of tile images, which require oversampling the data of LNM-positive cases to fill the tenfold gap against those of LNM-negative cases and avoid inappropriate predilection to the majority group. In contrast, the cancer tile images for the LNM-positive group numbered nearly 50 thousand, which seemed to be enough to avoid overfitting in CNN without over- or under-sampling, as indicated by the learning curves (Supplementary Fig. 1 and 2).

Because most T1 CRC patients are curable by surgical treatment even if overtreated, it could be worse if the tumor relapsed locally or by distant metastases⁷. In this context, false-negative cases should be minimized to avoid life-threatening relapse. In this study, we successfully developed a prediction model extracting 76.3% of truly negative case with low RF scores in the training set. On the other hand, 4 LNM-positive cases in the validation set were predicted to be very-low risk. All 4 of these cases contained many negative-prediction tiles; therefore, the most probable cause of the false-negative would be a lack of training tile images of LNM-positive cases with the features of these cases. To solve this matter, additional images of LNM-positive T1 CRC should be provided. The variability of input data among different institutions is also important. The quality of histological images depends on several factors such as the resection procedure, preparation of paraffin sections, staining procedure, and digitizing steps. In this study, we analyzed assembled tile images originating from H&E stained whole-slide images. The image quality or color balance may differ from that in other institutions, even if the staining procedures and digitizing steps are fully automated. These factors may decrease the performance of the prediction model and should be carefully evaluated before applying this method in clinical practice. Because the factors are quite complex and some of their variabilities are difficult to resolve, a multicenter validation study should be performed to clarify the differences of the factors, and to understand how to maintain the precision of the model in any situation. These are limitations of this single institutional study, and further investigation should be performed in a large-scale multicentral cohort study. Another limitation of this study is that the followed up patients with no LNM by periodic colonoscopy and/or abdominal computed tomography scanning for 5 years after initial resection may be misdiagnosed relapsing LNM, will result in false-negative or false-positive that the ground truth data can be changed in such cases.

In conclusion, we introduced a machine-learning model for predicting LNM of T1 CRC, which enabled risk-stratification of cases without any conventional histologic risk assessment. Our method will be a useful tool for clinicians and patients to understand the decision process of artificial intelligence through visualizing characteristic histologic images.

Methods

Case recruitment

A total of 855 consecutive T1 CRC resected either endoscopically or surgically at The Cancer Institute Hospital between 2005 and 2015, were initially recruited in this study. Of these 855 cases, 68 were excluded because of inadequate sample conditions (e.g., non-enbloc resections), and 787 cases underwent further analyses. Among the patients, 61 cases (7.8%) showed metastasis in dissected lymph nodes (LNM-positive). Among the patients treated with endoscopic resection, 119 patients (43.1%) underwent additional intestinal resection with lymph node dissection, and 157 (56.9%) patients were followed up for 5 years by periodic colonoscopy and/or abdominal computed tomography scanning, based on the guidelines of Japanese Society for Cancer of the Colon and Rectum⁶. The cases were randomly sorted by computer to divide the cases into a training set (n = 551) and test set (n = 236). The LNM rate was equivalent for both datasets. Patient characteristics is described in Supplementary Table 5. To develop the histologic-type classifier, we randomly recruited 22 T1 CRC cases and an additional 100 T2/3 CRC cases that were surgically resected in 2008 and 2009.

Histologic evaluation and digitizing

Serial 2-mm- to 5-mm-thick tissue sections of the whole lesion were cut from resected specimens fixed with 20% buffered formalin and embedded in paraffin; 3-μm-thick sections were then prepared for staining. Each section was stained with H&E. For each case, all cancer-containing sections showing submucosal invasion were included. Conventional histologic evaluation of the depth of submucosal invasion (≥ 1000 µm), poorly differentiated or mucinous carcinoma, lymphatic invasion, venous invasion, and tumor budding was performed by 2 pathologists (M.T, H.K), to disclose the clinicopathologic characteristics of the cases (Supplementary Table 5), basically according to the Japanese Society for Cancer of the Colon and Rectum criteria⁶. In addition, we also evaluated histologic grades considering predominant tumor component, as follows: well, moderately and poorly differentiated. We also performed histologic grading by 3 pathologists (M.T, N.Y, H.K) for representative cases to demonstrate interobserver disagreement. Deep submucosal invasion was defined as tumor invasion of the submucosal layer with a depth of at least 1000 µm. The presence of poorly differentiated clusters was evaluated according to the criteria of Ueno et al.³⁸.

Whole-slide images (WSIs) of the sections were obtained by a digital slide scanner (NanoZoomer, Hamamatsu Photonics, Hamamatsu, Japan). The WSIs were cut into non-overlapping small tile images of 299-pixel squares (equal to a 273-µm square). To ensure that the image contained at least one nucleated cell, each tile image was temporarily converted to gray-scale (0–255), and the image containing the minimum pixel value of 110 was adopted for further image analysis (pixel thresholding, Fig. 4b).

Statistical analysis

The primary endpoint of this study was LNM, and 5-year disease-free survival was defined as no metastases for cases without surgical treatment. Receiver operating characteristics (ROC) curves were used to estimate the predictive power of the model. The balanced error rate of the ROC curve was considered an optimal cut-off value for determining the accuracy of the model. The area under the ROC curve (AUC) was defined as the predictive ability of the models. Qualitative factors were analyzed with the Fisher exact test and odds ratios were calculated. Quantitative factors were analyzed with the Student t-test. A P-value of less than 0.05 was considered statistically significant in both analyses. We conducted all analyses using R version 3.6.2³⁹.

Neural network machine-learning models

First, we created a histologic-type classifier by CNN to enable the efficient selection of cancer tiles. The tile images from 122 WSIs of randomly selected CRCs were labeled with “cancer” or 9 other labels, including non-tumoral mucosa, hyperplastic mucosa, adenoma, lymphoid tissue, smooth muscle, vessels, fat, nerve, and non-material background. The number of cancer-labeled images was 48,503, and that of the others was 200,259. A deep-learning pre-trained model, MobileNet V2⁴⁰, was re-trained to recognize 10 different tissue types to develop image classifier #1 (Fig. 4a). Labeling of tile images originating from WSIs was performed by 1 pathologist (M.T), using OpenSlide library running on Python 3.7⁴¹.

Second, the tile images from 1282 WSIs of T1 CRC underwent pixel thresholding and were classified with image classifier #1. Each image was re-labeled by the classifier with the probable histologic type and probability for classifying. Four cases were excluded because no cancer-class tiles were detected, therefore 548 training cases and 235 validation cases were applied to further analyses. Among the cancer-class images, we adopted images with a probability > 0.8 to create the LNM prediction model. These tile images were re-labeled again with the primary endpoint and re-trained to develop image classifier #2, which classified whether or not a cancer tile image was likely to metastasize.

Then, cancer-class tile images of both the training set and validation set with a probability > 0.5 (840 558 tiles) were classified by image classifier #2 and re-labeled with the probable LNM status and probability for metastasis. We defined the tile images according to the probability as follows: those with non-metastatic probability > 0.9 as Group A, those with non-metastatic probability between 0.8 and 0.9 as Group B, those with metastatic probability between 0.8 and 0.9 as Group D, those with metastatic probability > 0.9 as Group E, and those with either probability < 0.8 as Group C. We also defined probability score summary of CNN, which can be obtained by multiplying the cancer-class probability with the metastatic probability for each tile, and summarized for case. We tuned the classifiers by learning iteration and learning rate to obtain maximum accuracy and minimum loss while re-training. The loss was evaluated by cross entropy. The workflow from WSIs to complete classifying tile images is shown in Fig. 4b.

Random-forest machine learning model

To establish the LNM prediction model for the cases, we loaded data from the tile images of the training set onto a machine-learning tool, the Scikit-learn on Python 3.6. We selected a random forest (RF) classifier machine-learning algorithm to minimize the effect of overfitting. Because the number of LNM-positive (n = 43) and LNM-negative (n = 505) cases in the training dataset had a tenfold gap, we randomly oversampled LNM-positive cases to equalize the number of cases applied to the RF model. The data contained 18 parameters as follows: tumor location; total number of cancer-class tiles; number of tiles classified as metastatic or non-metastatic; number of Group A, B, D, E tiles; percentages of tiles classified as metastatic or non-metastatic; average probabilities; standard deviations of cancer-class probabilities and metastatic or non-metastatic probabilities; and probability score summary for each tile (Fig. 4c). Tumor locations were grouped into 3 parts as follows; (1) cecum, ascending and transverse colon, (2) descending and sigmoid colon, and (3) rectum. The RF algorithm used was based on a previous report^42,43. In brief, the RF randomly collected these parameters for creating dozens of decision trees. Teacher data of the primary endpoint were given to the RF, and the importance of parameters that minimize the impurity for separating LNM-positive or LNM-negative cases was calculated. The importance of each parameter was averaged in all the trees, and finally, the output predictive value of LNM for the case was determined. To reduce false-negative metastatic cases, we established 500 RFs to obtain a higher AUC for each RF and adopted the top 20 RFs for case analyses. The importance of the parameters of these RFs was averaged to show how the algorithm determined the score. The RFs returned values of metastatic probability between 0 and 1, and the maximum value of each case was adopted as the RF score. We tuned the RF to maximize the AUC for predicting LNM in all cases by adjusting the number and depth of the trees. We defined the RF scores as follows: 0–0.7, very-low risk; 0.7–0.8, low risk; 0.8–0.9, moderate risk; and 0.9–1.0, high risk.

Ethics

The experimental protocols in this study were approved by the Ethics Committee of the Cancer Institute, Japanese Foundation for Cancer Research (approved number: 2020-1045). The written informed consent was obtained from all participants. This study was performed in accordance with the Declaration of Helsinki.

References

Urabe, Y. et al. Impact of revisions of the JSCCR guidelines on the treatment of T1 colorectal carcinomas in Japan. Z. Gastroenterol. 53, 291–301 (2015).
Article CAS Google Scholar
Marin-Gabriel, J. C., Fernandez-Esparrach, G., Diaz-Tasende, J. & Herreros de Tejada, A. Colorectal endoscopic submucosal dissection from a Western perspective: today’s promises and future challenges. World J. Gastrointest. Endosc. 8, 40–55 (2016).
Article Google Scholar
Tateishi, Y., Nakanishi, Y., Taniguchi, H., Shimoda, T. & Umemura, S. Pathological prognostic factors predicting lymph node metastasis in submucosal invasive (T1) colorectal carcinoma. Mod. Pathol. 23, 1068–1072 (2010).
Article Google Scholar
Kawachi, H. et al. A three-tier classification system based on the depth of submucosal invasion and budding/sprouting can improve the treatment strategy for T1 colorectal cancer: a retrospective multicenter study. Mod. Pathol. 28, 872–879 (2015).
Article Google Scholar
Egashira, Y. et al. Analysis of pathological risk factors for lymph node metastasis of submucosal invasive colon cancer. Mod. Pathol. 17, 503–511 (2004).
Article CAS Google Scholar
Hashiguchi, Y. et al. Japanese Society for Cancer of the Colon and Rectum (JSCCR) guidelines 2019 for the treatment of colorectal cancer. Int. J. Clin. Oncol. 25, 1–42 (2020).
Article Google Scholar
Saitoh, Y. et al. Management of colorectal T1 carcinoma treated by endoscopic resection. Dig. Endosc. 28, 324–329 (2016).
Article Google Scholar
Nishimura, T. et al. Clinical significance of immunohistochemical lymphovascular evaluation to determine additional surgery after endoscopic submucosal dissection for colorectal T1 carcinoma. Int. J. Colorectal. Dis. 36, 949–958 (2021).
Article Google Scholar
Iguchi, K. et al. Additional surgical resection after endoscopic resection for patients with high-risk T1 colorectal cancer. In Vivo 33, 1243–1248 (2019).
Article Google Scholar
Yamashita, K. et al. Preceding endoscopic submucosal dissection for T1 colorectal carcinoma does not affect the prognosis of patients who underwent additional surgery: a large multicenter propensity score-matched analysis. J. Gastroenterol. 54, 897–906 (2019).
Article CAS Google Scholar
Harris, E. I. et al. Lymphovascular invasion in colorectal cancer: an interobserver variability study. Am. J. Surg. Pathol. 32, 1816–1821 (2008).
Article Google Scholar
Puppa, G. et al. Diagnostic reproducibility of tumour budding in colorectal cancer: a multicentre, multinational study using virtual microscopy. Histopathology 61, 562–575 (2012).
Article Google Scholar
Takamatsu, M. et al. Immunohistochemical evaluation of tumor budding for stratifying T1 colorectal cancer: optimal cut-off value and a novel computer-assisted semiautomatic method. Mod. Pathol. 32, 675–683 (2019).
Article Google Scholar
Kai, K. et al. Cytokeratin immunohistochemistry improves interobserver variability between unskilled pathologists in the evaluation of tumor budding in T1 colorectal cancer. Pathol. Int. 66, 75–82 (2016).
Article CAS Google Scholar
Liu, Z. et al. Survival prediction in gallbladder cancer using CT based machine learning. Front. Oncol. 10, 604288 (2020).
Article Google Scholar
Klauschen, F. et al. Scoring of tumor-infiltrating lymphocytes: From visual estimation to machine learning. Semin. Cancer Biol. 52, 151–157 (2018).
Article CAS Google Scholar
Cheng, L. et al. A random forest classifier predicts recurrence risk in patients with ovarian cancer. Mol. Med. Rep. 18, 3289–3297 (2018).
CAS PubMed PubMed Central Google Scholar
Bychkov, D. et al. Deep learning based tissue analysis predicts outcome in colorectal cancer. Sci. Rep. 8, 3395 (2018).
Article ADS Google Scholar
Ma, B. et al. Artificial intelligence-based multiclass classification of benign or malignant mucosal lesions of the stomach. Front. Pharmacol. 11, 572372 (2020).
Article Google Scholar
Montazeri, M., Montazeri, M., Montazeri, M. & Beigzadeh, A. Machine learning models in breast cancer survival prediction. Technol. Health Care 24, 31–42 (2016).
Article Google Scholar
Yu, K. H. et al. Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features. Nat. Commun. 7, 12474 (2016).
Article ADS CAS Google Scholar
Mobadersany, P. et al. Predicting cancer outcomes from histology and genomics using convolutional networks. Proc. Natl. Acad. Sci. USA 115, E2970–E2979 (2018).
Article CAS Google Scholar
Campanella, G. et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat. Med. 25, 1301–1309 (2019).
Article CAS Google Scholar
Chen, P. C. et al. An augmented reality microscope with real-time artificial intelligence integration for cancer diagnosis. Nat. Med. 25, 1453–1457 (2019).
Article CAS Google Scholar
Skrede, O. J. et al. Deep learning for prediction of colorectal cancer outcome: a discovery and validation study. Lancet 395, 350–360 (2020).
Article CAS Google Scholar
Wulczyn, E. et al. Deep learning-based survival prediction for multiple cancer types using histopathology images. PLoS ONE 15, e0233678 (2020).
Article CAS Google Scholar
Kudo, S. E. et al. Artificial intelligence system to determine risk of T1 colorectal cancer metastasis to lymph node. Gastroenterology 160, 1075 (2020).
Article Google Scholar
Jiang, D. et al. A machine learning-based prognostic predictor for stage III colon cancer. Sci. Rep. 10, 10333 (2020).
Article ADS CAS Google Scholar
Xu, Y., Ju, L., Tong, J., Zhou, C. M. & Yang, J. J. Machine learning algorithms for predicting the recurrence of stage IV colorectal cancer after tumor resection. Sci. Rep. 10, 2519 (2020).
Article ADS CAS Google Scholar
Komuta, K. et al. Interobserver variability in the pathological assessment of malignant colorectal polyps. Br. J. Surg. 91, 1479–1484 (2004).
Article CAS Google Scholar
Ueno, H. et al. New criteria for histologic grading of colorectal cancer. Am. J. Surg. Pathol. 36, 193–201 (2012).
Article Google Scholar
Barresi, V. et al. Histologic grading based on counting poorly differentiated clusters in preoperative biopsy predicts nodal involvement and pTNM stage in colorectal cancer patients. Hum. Pathol. 45, 268–275 (2014).
Article Google Scholar
Iizuka, T., Fukasawa, M. & Kameyama, M. Deep-learning-based imaging-classification identified cingulate island sign in dementia with Lewy bodies. Sci. Rep. 9, 8944 (2019).
Article ADS Google Scholar
Dai, G. et al. Exploring the effect of hypertension on retinal microvasculature using deep learning on East Asian population. PLoS ONE 15, e0230111 (2020).
Article CAS Google Scholar
Jones, H. G. et al. Genetic and epigenetic intra-tumour heterogeneity in colorectal cancer. World J. Surg. 41, 1375–1383 (2017).
Article Google Scholar
Aytac, E. et al. Impact of tumor location on lymph node metastasis in T1 colorectal cancer. Langenbecks Arch. Surg. 401, 627–632 (2016).
Article Google Scholar
Kobayashi, H. et al. Characteristics of recurrence after curative resection for T1 colorectal cancer: Japanese multicenter study. J. Gastroenterol. 46, 203–211 (2011).
Article Google Scholar
Ueno, H. et al. Proposed objective criteria for “grade 3” in early invasive colorectal cancer. Am. J. Clin. Pathol. 134, 312–322 (2010).
Article Google Scholar
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. (2013).
Mark Sandler, A.H., Menglong, Z., Andrey, Z., Liang-Chieh, C. MobileNetV2: inverted residuals and linear bottlenecks. In The IEEE Conference on Computer Vision and Pattern Recognition, pp. 4510–4520 (2018).
OpenSlide library for Python, version 1.1.2. https://openslide.org/download/. 2020.
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Takamatsu, M. et al. Prediction of early colorectal cancer metastasis by machine learning using digital slide images. Comput. Methods Prog. Biomed. 178, 155–161 (2019).
Article Google Scholar

Download references

Acknowledgements

The authors thank Tomoyo Kakita, Motoyoshi Iwakoshi, and Hiroshi Yoko-o for their excellent technical assistance.

Author information

Authors and Affiliations

Division of Pathology, Cancer Institute, Japanese Foundation for Cancer Research, 3-8-31, Ariake, Ko-to-ku, Tokyo, 135-8550, Japan
Manabu Takamatsu, Noriko Yamamoto, Hiroshi Kawachi, Kaoru Nakano & Kengo Takeuchi
Department of Pathology, Cancer Institute Hospital, Japanese Foundation for Cancer Research, Tokyo, Japan
Manabu Takamatsu, Noriko Yamamoto, Hiroshi Kawachi, Kaoru Nakano & Kengo Takeuchi
Department of Endoscopy, Cancer Institute Hospital, Japanese Foundation for Cancer Research, Tokyo, Japan
Shoichi Saito
Department of Colorectal Surgery, Cancer Institute Hospital, Japanese Foundation for Cancer Research, Tokyo, Japan
Yosuke Fukunaga
Pathology Project for Molecular Targets, Cancer Institute, Japanese Foundation for Cancer Research, Tokyo, Japan
Kengo Takeuchi

Authors

Manabu Takamatsu
View author publications
You can also search for this author in PubMed Google Scholar
Noriko Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Kawachi
View author publications
You can also search for this author in PubMed Google Scholar
Kaoru Nakano
View author publications
You can also search for this author in PubMed Google Scholar
Shoichi Saito
View author publications
You can also search for this author in PubMed Google Scholar
Yosuke Fukunaga
View author publications
You can also search for this author in PubMed Google Scholar
Kengo Takeuchi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.T. conceived the study; and managed the data acquisition, digitizing whole-slides, and data labeling. N.Y. and M.T. contributed to design the machine learning program and methodology, classifier model, data analysis, and manuscript drafting. H.K. and K.N. supervised the conventional histologic assessments. S.S. and Y.F. provided clinical information including patients’ outcome. K.T., N.Y. and H.K. supervised all the analyses and contributed to manuscript editing. All authors approved the manuscript.

Corresponding author

Correspondence to Manabu Takamatsu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figure 1.

Supplementary Figure 2.

Supplementary Figure 3.

Supplementary Table 1.

Supplementary Table 2.

Supplementary Table 3.

Supplementary Table 4.

Supplementary Table 5.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Takamatsu, M., Yamamoto, N., Kawachi, H. et al. Prediction of lymph node metastasis in early colorectal cancer based on histologic images by artificial intelligence. Sci Rep 12, 2963 (2022). https://doi.org/10.1038/s41598-022-07038-1

Download citation

Received: 23 July 2021
Accepted: 08 February 2022
Published: 22 February 2022
DOI: https://doi.org/10.1038/s41598-022-07038-1

This article is cited by

Colorectal cancer prognosis based on dietary pattern using synthetic minority oversampling technique with K-nearest neighbors approach
- S. Thanga Prasath
- C. Navaneethan
Scientific Reports (2024)
Quantitative analysis of prion disease using an AI-powered digital pathology framework
- Massimo Salvi
- Filippo Molinari
- Daniele Imperiale
Scientific Reports (2023)
Prediction of disease recurrence or residual disease after primary endoscopic resection of pT1 colorectal cancer—results from a large nationwide Danish study
- Ilze Ose
- Katarina Levic
- Tine Plato Kuhlmann
International Journal of Colorectal Disease (2023)
“Pathologist-independent” strategy for T1 colorectal cancer after endoscopic resection
- Katsuro Ichimasa
- Shin-ei Kudo
- Khay Guan Yeoh
Journal of Gastroenterology (2022)
Preoperative prediction of lymph node status in patients with colorectal cancer. Developing a predictive model using machine learning
- Morten Hartwig
- Karoline Bendix Bräuner
- Ismail Gögenur
International Journal of Colorectal Disease (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Neural network machine-learning models and characteristic tile images

Predictive power of the random forest machine-learning model

Whole-slide mapping of representative cases

Relationships between RF scores and conventional histologic risk factors

Discussion

Methods

Case recruitment

Histologic evaluation and digitizing

Statistical analysis

Neural network machine-learning models

Random-forest machine learning model

Ethics

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links