Introduction

According to the International Agency for Research on Cancer, gastric cancer accounted for 5.6% of all new cancer cases and 7.7% of cancer deaths globally in 20201. Despite declining global incidence and mortality rates, the absolute numbers of cases and deaths are still increasing, particularly in developing countries with aging populations2,3. Gastric cancer has a poor overall prognosis, with a 5-year survival rate of only 10–30%, whereas patients with early gastric cancer (EGC) have a survival rate of 70–90%, highlighting the importance of early detection and treatment4,5. Although white-light endoscopy is the standard method for detecting EGC, its accuracy relies heavily on the expertise and experience of endoscopists, resulting in an accuracy rate of only 70–80%6. Furthermore, the heavy workload of medical image analysis also affects diagnostic results.

In recent years, artificial intelligence (AI) has made remarkable progress in various fields, including healthcare7. Researchers increasingly use AI techniques, such as deep-learning-based computer vision methods, to assist in detecting EGC8. For example, Hirasawa et al.9 developed a CNN diagnostic system based on the Single Shot MultiBox Detector architecture that processes endoscopic images quickly and accurately, achieving an overall sensitivity of 92.2% and a positive predictive value of 30.6%. Another study, by Sakai et al.10, proposed a convolutional neural network scheme that employs transfer learning to automatically detect EGC in endoscopic images with an accuracy of 87.6%. These results demonstrate the potential of AI-assisted diagnosis to improve the efficiency and accuracy of endoscopic practice.

Interest in applying AI, and computer vision methods in particular, to EGC detection has surged recently. Ishioka et al.11 developed a CNN system for detecting and recognizing gastric cancer in video images with an impressive accuracy of 94.1%. Similarly, Yoon et al.12 developed an optimized model for EGC detection and prediction, which achieved high sensitivity (91%) and a high overall AUC (98.1%). Ueyama et al.13 constructed an AI-based diagnosis system, "CNN-CAD", for detecting EGC in ME-NBI images, which demonstrated an overall accuracy, sensitivity, and specificity of 98.7%, 98%, and 100%, respectively. Song et al.14 developed a clinical application system that achieved near-perfect sensitivity and an average specificity of 80.6% on a self-built dataset, suggesting that the system can help specialists improve diagnostic efficiency and prevent misdiagnosis. Oura et al.15 developed a Double-Check Support System (DCSS) for esophagogastroduodenoscopy (EGD) still images, which outperformed endoscopists with an accuracy, sensitivity, and positive predictive value of 83.3%, 89.2%, and 93.3%, respectively. Wu et al.16 developed the ENDOANGEL system, which demonstrated an average accuracy, sensitivity, and specificity of 84.7%, 100%, and 84.3%, respectively, for detecting gastric cancer and improving the quality of EGD. Finally, Shibata et al.17 investigated EGC detection and lesion segmentation using Mask R-CNN (MR-CNN), achieving high sensitivity (96.0%) and a low false-positive rate (0.10 FP/image). These findings suggest that AI-based detection systems have the potential to significantly improve the accuracy and efficiency of EGC detection and diagnosis.

Analysis of recent deep-learning research on EGC detection shows that object detection algorithms are used predominantly, owing to their compact size, high efficiency, and fast inference. However, such detectors output only a rectangular bounding box around the lesion region, which covers both lesion and non-lesion tissue. This cannot accurately delineate the infiltrated region of gastric cancer, which hampers physicians in making accurate diagnoses. Our research objective is to improve the accuracy of EGC detection and lesion segmentation in gastroscopic images using a deep learning model. Since the clinical features of EGC are not always apparent, we started from the classical instance segmentation algorithm MR-CNN18 and extended it into IMR-CNN by adding a “Bi-directional feature extraction and fusion module” and a “Purification module for feature channel and space”, making it better suited to detecting EGC. Our IMR-CNN model not only detects EGC precisely but also segments lesions with high precision, providing additional information about the EGC lesions beyond a bounding box. The model was validated on a self-built dataset and the publicly available Kvasir polyp segmentation dataset, and IMR-CNN showed significant improvement over MR-CNN on the primary evaluation metrics.

Materials and method

Deep‑learning models

MR-CNN is a simple and efficient instance segmentation network proposed by He et al.18. Based on Faster R-CNN19, it adds a parallel segmentation branch to achieve both object detection and instance segmentation. The overall structure of MR-CNN can be divided into three parts: (i) a convolutional backbone for feature extraction from the input image, (ii) a region proposal network (RPN) for generating regions of interest, and (iii) a network head with three branches for classification, bounding box regression, and segmentation mask prediction. The structure of MR-CNN is shown in Fig. 1.

Figure 1. The overall structure of MR-CNN.
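For orientation, the sketch below exercises this three-part structure using torchvision's reference Mask R-CNN implementation as a stand-in; the model variant, weights, and image size are illustrative and not the configuration used in this paper.

```python
import torch
from torchvision.models.detection import maskrcnn_resnet50_fpn

# Reference Mask R-CNN: ResNet-50 backbone + FPN, an RPN, and a three-branch
# head (classification, box regression, mask prediction), pre-trained on COCO.
model = maskrcnn_resnet50_fpn(weights="DEFAULT")
model.eval()

# The model accepts a list of 3xHxW tensors; one dummy gastroscopic-sized image.
image = torch.rand(3, 1080, 1080)
with torch.no_grad():
    outputs = model([image])

# Each output dict mirrors the three head branches described above.
print(outputs[0].keys())  # dict_keys(['boxes', 'labels', 'scores', 'masks'])
```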

In this paper, the convolutional backbone network of MR-CNN was improved to enhance EGC feature extraction from gastroscopic images and thereby obtain high-quality object detection and segmentation results. The improvement comprises two parts: (i) a Bi-directional Feature Extraction and Fusion Module and (ii) a Purification Module for Feature Channel and Space. The structure of the improved convolutional backbone network is shown in Fig. 2. To examine how the two modules interact, another model, named IBMR-CNN, was formed by adding only the “Bi-directional Feature Extraction and Fusion Module”.

Figure 2. The backbone network of IMR-CNN.

Bi-directional feature extraction and fusion module

Generally, when image features are extracted by a CNN, the low-level feature maps are rich in spatial information but carry little semantic information, while the high-level feature maps are the opposite. Spatial information aids object localization, and semantic information aids object classification; appropriate use of both types therefore improves the accuracy of object detection and segmentation. The Feature Pyramid Network (FPN)20 was introduced into the backbone network of MR-CNN: multi-scale feature maps are extracted by the bottom-up CNN, and high-level and low-level features are fused layer by layer through the top-down FPN pathway and lateral connections. This generates multi-scale feature maps that fuse semantic and spatial information, enhancing the model’s ability to detect small objects, as sketched below.
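As a concrete illustration, the following minimal PyTorch sketch implements the top-down FPN fusion described above; the channel widths are ResNet-50-like assumptions, not the paper's exact configuration.

```python
import torch.nn as nn
import torch.nn.functional as F

class TopDownFPN(nn.Module):
    """Top-down FPN fusion over bottom-up feature maps C2..C5."""

    def __init__(self, in_channels=(256, 512, 1024, 2048), out_channels=256):
        super().__init__()
        # 1x1 lateral convolutions align channel widths across levels.
        self.lateral = nn.ModuleList(
            nn.Conv2d(c, out_channels, kernel_size=1) for c in in_channels
        )
        # 3x3 convolutions smooth the fused maps.
        self.smooth = nn.ModuleList(
            nn.Conv2d(out_channels, out_channels, kernel_size=3, padding=1)
            for _ in in_channels
        )

    def forward(self, feats):  # feats: [C2, C3, C4, C5], low to high level
        laterals = [conv(f) for conv, f in zip(self.lateral, feats)]
        # Top-down pathway: upsample each higher level and add it to the
        # level below, fusing semantic into spatial information.
        for i in range(len(laterals) - 2, -1, -1):
            laterals[i] = laterals[i] + F.interpolate(
                laterals[i + 1], size=laterals[i].shape[-2:], mode="nearest"
            )
        return [conv(p) for conv, p in zip(self.smooth, laterals)]  # [P2..P5]
```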

To further enhance the accuracy of EGC detection in gastroscopic images, the FPN structure is improved. Following the Path Aggregation Network (PANet)21, a bottom-up feature fusion path is added to the FPN structure. This path allows the spatial information of low-level features to be fused more effectively into the semantic information of high-level features, improving the feature extraction capability of the model. The module is shown within the red dotted box in Fig. 2.
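The added bottom-up path can be sketched as follows. This is a hedged approximation of the PANet-style augmentation (spatial sizes are assumed to halve between pyramid levels), not the paper's exact layers.

```python
import torch.nn as nn

class BottomUpPath(nn.Module):
    """PANet-style bottom-up augmentation over FPN outputs P2..P5."""

    def __init__(self, channels=256, num_levels=4):
        super().__init__()
        # Stride-2 convolutions carry low-level spatial detail upward.
        self.down = nn.ModuleList(
            nn.Conv2d(channels, channels, kernel_size=3, stride=2, padding=1)
            for _ in range(num_levels - 1)
        )
        self.fuse = nn.ModuleList(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1)
            for _ in range(num_levels - 1)
        )

    def forward(self, pyramid):  # pyramid: [P2, P3, P4, P5]
        outs = [pyramid[0]]  # N2 = P2
        for down, fuse, higher in zip(self.down, self.fuse, pyramid[1:]):
            # Downsample the previous output and fuse it with the next level.
            outs.append(fuse(down(outs[-1]) + higher))
        return outs  # [N2, N3, N4, N5]
```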

Purification module for feature channel and space

An attention mechanism focuses on important features and suppresses unnecessary ones, so as to obtain valuable information as efficiently as possible with limited computing resources22. Attention mechanisms can be classified into channel attention, spatial attention, and hybrid attention. Channel attention focuses on the key features of the object, as in SENet23, BAM24, etc. Spatial attention pays more attention to the key locations of the object, as in STNet25, DCNet26, OPAM27, etc. Hybrid attention combines the capabilities of both, as in CBAM22, DANet28, Coordinate Attention29, etc. Given our research objective, we need both the key features captured by channel attention and the key positions captured by spatial attention. Therefore, the idea of CBAM was used to improve our model. As displayed in the purple dashed box in Fig. 2, an attention mechanism is introduced before the fusion of feature layers from different paths, aiming to enhance the features related to the object. We refer to this as the “Purification module for feature channel and space”, and its structure is shown in Fig. 3. The attention operates as follows. The input feature map is max-pooled and average-pooled per channel, generating two 1D vectors, which are passed through a shared fully connected layer and summed to produce a 1D channel attention map. The channel attention map is then multiplied by the input feature map, yielding a channel-attention-adjusted feature map F'. Next, F' is max-pooled and average-pooled along the channel axis, generating two 2D maps; these two maps are concatenated and convolved by a standard convolution layer, producing a 2D spatial attention map, which is multiplied element-wise with F'. Finally, multi-scale feature maps adjusted by both channel and spatial attention are obtained.

Figure 3. The structure of the “Purification module for feature channel and space”.
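Because the purification module follows CBAM, a minimal CBAM-style sketch captures its computation; the reduction ratio and kernel size below are the common CBAM defaults, assumed rather than taken from the paper.

```python
import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    def __init__(self, channels, reduction=16):
        super().__init__()
        # Shared MLP applied to both pooled vectors.
        self.mlp = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
        )

    def forward(self, x):
        b, c, _, _ = x.shape
        avg = self.mlp(x.mean(dim=(2, 3)))  # average-pooled 1D vector
        mx = self.mlp(x.amax(dim=(2, 3)))   # max-pooled 1D vector
        scale = torch.sigmoid(avg + mx).view(b, c, 1, 1)
        return x * scale                    # F' in the text

class SpatialAttention(nn.Module):
    def __init__(self, kernel_size=7):
        super().__init__()
        self.conv = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2)

    def forward(self, x):
        # Pool along the channel axis, concatenate, convolve, then rescale.
        avg = x.mean(dim=1, keepdim=True)
        mx = x.amax(dim=1, keepdim=True)
        scale = torch.sigmoid(self.conv(torch.cat([avg, mx], dim=1)))
        return x * scale

class PurificationModule(nn.Module):
    """Channel attention followed by spatial attention, as in CBAM."""

    def __init__(self, channels):
        super().__init__()
        self.ca = ChannelAttention(channels)
        self.sa = SpatialAttention()

    def forward(self, x):
        return self.sa(self.ca(x))
```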

Evaluation metrics

To evaluate the performance of the models quantitatively and objectively, several evaluation metrics30,31 are introduced. Following prior research, the confusion matrix32 and evaluation metrics used in the experiments are listed in Tables 1 and 2, respectively.

Table 1 Confusion matrix of the experiments.
Table 2 Evaluation metrics.
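Table 2 is not reproduced here, but assuming the standard confusion-matrix notation of Table 1 (TP, TN, FP, FN), the metrics reported in the Results follow the usual definitions:

```latex
\mathrm{Precision} = \frac{TP}{TP + FP}, \quad
\mathrm{Recall} = \frac{TP}{TP + FN}, \quad
\mathrm{Specificity} = \frac{TN}{TN + FP},
\mathrm{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}, \quad
F_1 = \frac{2 \cdot \mathrm{Precision} \cdot \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}}
```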

Datasets

In this work, two datasets were prepared for the experiments: a self-built EGC dataset and a public polyp dataset. Because no public EGC dataset is available, the public Kvasir-SEG33 polyp dataset was selected, solely for the segmentation task. To compare the performance of the three models, all were trained and tested on both datasets. Although the two datasets differ significantly in content, this does not bias the comparison between models and helps verify their generalization performance. We confirm that all methods were performed in accordance with the relevant guidelines and regulations, including the Declaration of Helsinki, the International Ethical Guidelines for Human Biomedical Research (Council for International Organizations of Medical Sciences), and the Guidelines for the Construction of Clinical Research Ethics Review Committees Involving Human Beings (National Health Commission of the People’s Republic of China, 2019 Edition).

The self-built EGC dataset

In the experiment, a total of 1,120 images were collected from 26 healthy subjects (140 images) and 180 gastric cancer cases (980 images) undergoing preoperative examination between January 2020 and December 2021 at Shandong Second Provincial General Hospital. All images are 1080 × 1080 pixels. We were authorized to obtain the gastroscopic images anonymously, and this research was approved by the Ethics Committee of Shandong Second Provincial General Hospital (ethics review approval number 20220703).

The gastroscopic images were analyzed using a multi-view k-proximal plane clustering algorithm34 and verified to be valid data. The images were then strictly annotated following the recommendations of professional gastroscopists, and the annotations were verified by professional physicians. The 1120 images were randomly divided into training, validation, and testing sets at a ratio of 12:3:5, as in the sketch below.
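A minimal sketch of this 12:3:5 split (60% / 15% / 25%) follows; the helper name and random seed are illustrative, not from the paper.

```python
import random

def split_12_3_5(items, seed=42):
    # Shuffle reproducibly, then cut at 12/20 and 15/20 of the list.
    items = list(items)
    random.Random(seed).shuffle(items)
    n = len(items)
    n_train, n_val = round(n * 12 / 20), round(n * 3 / 20)
    return items[:n_train], items[n_train:n_train + n_val], items[n_train + n_val:]

# For the 1120-image EGC dataset this yields 672 / 168 / 280 images.
train, val, test = split_12_3_5(range(1120))
print(len(train), len(val), len(test))  # 672 168 280
```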

The public Kvasir-SEG dataset

The Kvasir-SEG dataset33 (available from https://datasets.simula.no/kvasir-seg/) is an extension of the Kvasir dataset32, the first multi-class dataset for gastrointestinal disease. The Kvasir-SEG dataset contains 1000 annotated polyp images and their segmentation masks. Guided by these masks, the polyp images were strictly re-annotated and new mask images were generated to meet the experimental requirements. As with the self-built dataset, the Kvasir-SEG dataset was randomly divided into training, validation, and testing sets at a ratio of 12:3:5.

Training and testing

In this work, the MR-CNN, IMR-CNN, and IBMR-CNN models were trained and tested on the two datasets to clearly demonstrate the ability of the improved model to detect EGC and segment lesions.

First, the three models were trained on the self-built dataset using transfer learning. COCO pre-trained weights were loaded, the learning rate was set to 1.0e−5, the weight decay was set to 1.0e−8, the threshold was set to 0.7, and the number of training iterations was set to 100; all other training parameters were kept consistent across models. The optimal weights for each model were selected through a comprehensive evaluation of several major metrics, including Precision, Recall, Specificity, and F1-Score. The models were then tested on the test split of the self-built dataset, and their performance was evaluated from the test results.
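A hedged sketch of this transfer-learning setup is given below, using torchvision's reference Mask R-CNN as a stand-in; the optimizer choice (Adam), the reading of 100 iterations as epochs, and the train_loader are assumptions not specified in the paper.

```python
import torch
from torchvision.models.detection import maskrcnn_resnet50_fpn

# COCO pre-trained weights, learning rate 1.0e-5, weight decay 1.0e-8.
model = maskrcnn_resnet50_fpn(weights="DEFAULT")
optimizer = torch.optim.Adam(model.parameters(), lr=1.0e-5, weight_decay=1.0e-8)

model.train()
for epoch in range(100):  # "number of training iterations ... 100"
    for images, targets in train_loader:  # assumed DataLoader of (images, targets)
        loss_dict = model(images, targets)  # per-branch losses in train mode
        loss = sum(loss_dict.values())
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```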

Furthermore, to verify the robustness and reliability of the three models, fivefold cross-validation17 was performed on the self-built EGC dataset.
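The cross-validation loop can be sketched as follows; train_and_evaluate is a hypothetical placeholder standing in for the full training and testing pipeline described above.

```python
import numpy as np
from sklearn.model_selection import KFold

def train_and_evaluate(train_idx, test_idx):
    # Hypothetical placeholder: a real implementation would train the model
    # on train_idx and return the AP (or IoU) measured on test_idx.
    return 0.0

indices = np.arange(1120)
ap_scores = []
for fold, (train_idx, test_idx) in enumerate(
    KFold(n_splits=5, shuffle=True, random_state=0).split(indices)
):
    ap = train_and_evaluate(train_idx, test_idx)
    ap_scores.append(ap)
    print(f"fold {fold}: AP = {ap:.3f}")

# Per-fold values are averaged, as reported in Table 4.
print("mean AP:", np.mean(ap_scores))
```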

Finally, the three models were trained on the public dataset with the same training method and parameter settings as on the self-built dataset, and then tested on its test split. This serves to verify the generalization performance of the models and to demonstrate the superior performance of our model objectively.

Ethics approval and consent to participate

This research was approved by the Ethics Committee of the Shandong Second Provincial General Hospital, and the Ethics Committee waived the requirement for informed consent of the subjects. The number of Ethics Review Approval Statement is 20220703.

Results and discussion

The three models were tested on the self-built dataset, yielding the PR curves, AP values, and IoU values summarized in Fig. 4 and Table 3. The Break-Even Point (BEP) on the PR curve, the value at which Precision equals Recall, is a valuable evaluation metric that considers Precision and Recall jointly. As shown in Fig. 4, the BEP value of IMR-CNN is larger than those of MR-CNN and IBMR-CNN. As displayed in Table 3, compared with MR-CNN, the AP and IoU values of IMR-CNN increase by 2.8% and 0.8%, and those of IBMR-CNN increase by 0.8% and 0.5%, respectively. These results demonstrate that IMR-CNN is more accurate than MR-CNN in both EGC detection and lesion segmentation. To strengthen these findings, a rigorous fivefold cross-validation was performed on the self-built dataset to verify the reliability of the models. The AP and IoU values were recorded in each run, and their averages were taken as the result of the experiment, as listed in Table 4. The AP and IoU of IMR-CNN are higher than those of IBMR-CNN and MR-CNN for both EGC detection and lesion segmentation.
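For reference, the sketch below computes a BEP from a sampled PR curve by locating the sample where Precision and Recall are closest; the curve values are illustrative, not the paper's, and interpolation between samples is omitted.

```python
import numpy as np

def break_even_point(precision, recall):
    # Find the PR sample minimizing |Precision - Recall| and average the pair.
    precision, recall = np.asarray(precision), np.asarray(recall)
    i = np.argmin(np.abs(precision - recall))
    return (precision[i] + recall[i]) / 2.0

# Toy PR curve (illustrative values only).
p = [1.00, 0.95, 0.90, 0.85, 0.80]
r = [0.60, 0.75, 0.88, 0.93, 0.97]
print(break_even_point(p, r))  # 0.89
```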

Figure 4. The PR curves of the three models tested on the self-built dataset.

Table 3 The values of AP and IoU for different models.
Table 4 The results of fivefold cross-validation on the self-built dataset.

Examples of the test results on the test split of the self-built dataset are presented in Fig. 5, and the EGC detection results are listed in Table 5. The values of TP, TN, FP, and FN were counted for each model, and Precision, Recall, Accuracy, Specificity, and F1-Score were calculated. Comparing the three models in Table 5, all metrics of IMR-CNN exceed those of IBMR-CNN and MR-CNN: its Precision, Recall, Accuracy, Specificity, and F1-Score are 92.9%, 95.3%, 93.9%, 92.5%, and 94.1%, respectively. These results indicate that the IMR-CNN model performs better at EGC detection and lesion segmentation from gastroscopic images. This enhanced performance can be attributed to two primary factors. First, the bi-directional feature extraction and fusion module improves feature learning and precise localization of the target object. Second, the purification module for feature channel and space suppresses irrelevant features and strengthens relevant ones.

Figure 5. Examples of test results on the self-built dataset for the three models: (a–c) EGC detection and lesion segmentation were performed successfully; (d) FP for IBMR-CNN and MR-CNN; (e) FN for MR-CNN; (f) FP for IMR-CNN, FN for IBMR-CNN, and FN and FP for MR-CNN; (g) FP for IMR-CNN.

Table 5 The test results of EGC detection on the self-built dataset.

The three models were also trained and tested on the public dataset; the test results are shown in Fig. 6 and Table 3. The PR curves show that IMR-CNN achieves the highest BEP value, further substantiating its superior performance over MR-CNN. In Table 3, the AP and IoU values of IMR-CNN are higher than those of MR-CNN, again indicating improved detection and segmentation ability. In summary, IMR-CNN not only improves the accuracy of EGC detection and lesion segmentation in gastroscopic images but is also suitable for polyp detection and segmentation in endoscopic images. In other words, IMR-CNN outperforms MR-CNN and generalizes better.

Figure 6. The PR curves of the three models tested on the public dataset.

Conclusion

In this paper, we present an enhanced IMR-CNN model that builds upon the MR-CNN architecture by incorporating two novel modules: the "Bi-directional feature extraction and fusion module" and the "Purification module for feature channel and space". The experimental results demonstrate that IMR-CNN outperforms the original MR-CNN on several major evaluation metrics, including Precision, Recall, Specificity, and F1-Score. The IMR-CNN model can accurately detect EGC and segment lesions in gastroscopic images, helping physicians diagnose the disease more reliably. This should improve the detection rate of EGC and reduce the risk of missed and false detections.

In future work, new methods such as graph learning35 and multi-view clustering algorithms36 will be introduced into the model to obtain richer image information and thereby improve its overall performance. Assisted medical diagnosis is an important research field, so we will continue to monitor its development and further explore the application of deep learning to EGC detection.