Introduction

Colonoscopy is considered the gold standard for the detection of colorectal cancer. Screening colonoscopy significantly reduces colorectal cancer incidence and cancer-related mortality among screened individuals1,2,3,4. However, colonoscopy is an invasive examination; 16.7% of patients report moderate or severe abdominal pain after colonoscopy5, which seriously hampers the successful completion of colon examinations.

Capsule colonoscopy was introduced in 2006 as a minimally invasive technique for examining the colon6. However, the movement of the capsule is passive, proceeding with the help of gastrointestinal peristalsis and gravity, and it generates a large number of images that colonoscopists must spend a tremendous amount of time reviewing after the examination7,8. External control of a capsule colonoscope by means of an applied magnetic field is a possible solution to this maneuverability problem9,10. We have previously reported the feasibility and safety of a novel magnetic-assisted capsule endoscope system for examination of the upper gastrointestinal tract11,12.

We further developed a magnetic capsule colonoscope (MCC) and a magnetic-assisted capsule colonoscope (MACC) system based on a magnetic-navigated endoscope system. Compared with traditional colonoscopy, the MACC system can control the movement and orientation of the MCC by using a magnetic field navigator (MFN). Furthermore, magnetic assistance is a promising locomotion methodology with advantages in navigating and posing the capsule effectively during diagnostic tasks13,14. Nevertheless, an unknown front viewing angle, unpredictable rotation of the capsule endoscope, and unintuitive operation may cause confusion and inefficiency during the procedure15,16.

Several studies have used computer-assisted diagnosis (CAD), an artificial intelligence (AI) auxiliary system17, to assist gastroenterologists in performing colonoscopy. Object detection17,18,19,20,21 using AI and deep learning is a key computer vision component of CAD systems. With its speed and accuracy, the proposed lumen detection method can provide clues that can be used not only to reorient the MCC but also to align it with the gastrointestinal tract in real time. Hence, we integrated a computer vision-based object detection and alignment control scheme into the MACC system.

In this study, we developed a lumen detection and alignment algorithm that enhances the efficiency of lumen identification and navigation of the capsule.

Results

Lumen detection: inferencing with the endoscopic dataset

The purpose of this experiment was to perform lumen detection inference on endoscopic images. Two models, A and B, were used in this part. Table 1 presents the results of the IoU comparison. Model B had better testing results than model A. For inference by model B at an IoU threshold of 0.3, R was 0.678, P was 0.757, mAP was 0.614, and the F1 score was 0.715. The P-R curves for the experiments are depicted in Fig. 1. The areas under the P-R curves (area under the curve, AUC) were 0.718 and 0.744 for models A and B, respectively. Model B was 3.62% better than model A in predictive performance.

Table 1 Results of the recall, precision, mAP, and F1 score in the lumen detection experiments for 4 models with images from endoscopic and simulated datasets.
Figure 1
figure 1

P-R curves for different models; (a) model A, (b) model B, (c) model C, and (d) model D. The AUCs for models A, B, C, and D were 0.718, 0.744, 0.935, and 0.982, respectively.

Lumen detection: inferencing with the simulated dataset

We tested models C and D, which were trained with and without negative samples, respectively. These models were intended for the lumen alignment experiments in the MACC system. Model D outperformed model C by approximately 5% on every validation index (Table 1). For model D at an IoU threshold of 0.3, R was 0.964, P was 0.961, mAP was 0.961, and the F1 score was 0.963. The P-R curves for models C and D are shown in Fig. 1c,d. The AUCs of the P-R curves for models C and D were 0.935 and 0.982, respectively. Model D outperformed models A, B, and C by 36.77%, 31.99%, and 5.03%, respectively.

MCC alignment control using the simulated dataset

Model D demonstrated the best R, P, mAP, and F1 score. Therefore, we applied model D to the MFN for the lumen alignment experiment. Figure 2 depicts a single alignment test. The MFN rotated the pitch and yaw angles of the MCC by 26.89° and 56.16°, respectively. The procedure required 1.14 s. Then, experiments were conducted in 8 directions, and each direction was performed 20 times for a total of 160 trials. Detailed results are listed in Table 2. The mean degrees of adjustment for yaw and pitch of the MCC were 21.70° and 13.78°, respectively. The mean alignment control time of the trials was 0.902 s.

Figure 2
figure 2

Record of the MCC rotated by angles of yaw and pitch. The position of the lumen was initially located in the northwest area of the image. The MFN completed the alignment in 1.14 s with the PI control. deg, adjustment of yaw and pitch in degrees; sec, time required to adjust the angle in seconds.

Table 2 Results of yaw and pitch adjustments in lumen alignment experiments with 8 directions using model D.

Intubation time of manual navigation and semi-automated navigation

To confirm the performance of the MACC system, we performed 20 trials of manual navigation and semi-automated navigation in the same colonoscopy training simulator. The alignment rates of the automated alignment system at the rectum, sigmoid, descending, transverse, and ascending colon were 10.61%, 95.45%, 76.47%, 84.31%, and 86.11%, respectively. The average cecal intubation time of manual navigation was 9 min 28.41 s. In semi-automated navigation, the average cecal intubation time was 7 min 23.61 s. The cecal intubation time of semi-automated navigation was 21.96% shorter than that of manual navigation. Detailed results are shown in Fig. 3.

Figure 3
figure 3

Intubation time from the rectum to the cecum. Manual navigation and semi-automated navigation are represented by blue and brown lines, respectively. The average cecal intubation times of manual navigation and semi-automated navigation were 9 min 28.41 s and 7 min 23.61 s, respectively.

Discussion

Common applications of AI in endoscopy are the detection and analysis of inflammatory lesions, polyps, and cancer. Detection of gastrointestinal bleeding is the most common application in capsule endoscopy17. The concept of lumen detection with AI has been proven in several studies18,22,23. In this study, we used 2 image datasets to train 4 deep learning-based lumen detection models. We presented the performance of models A and B, trained on an endoscopic dataset, and models C and D, trained on a simulated dataset. Our approach consisted of 2 steps. First, we developed 4 CNN models for locating the lumen. Second, we took the center of the predicted bounding box as a reference position during the testing phase in the lumen alignment experiment. Once the reference position was acquired, the MACC system aligned it to the center of the screen (Supplementary video). In a comparison of the AUCs of the P-R curves, the predictive performance of model D was better than that of models A, B, and C by 36.77%, 31.99%, and 5.03%, respectively. We then applied model D to an alignment experiment with the MACC system. MCC alignment was controlled in 8 directions by using a strong radially magnetized permanent magnet on the MFN.

Overfitting is a common concern in deep learning and statistics. It occurs when a constructed model matches the training dataset too closely but performs poorly on external testing datasets24,25. To prevent overfitting, we implemented several techniques when training the models, such as data augmentation, BN, and weight decay. BN has several benefits, such as removing the need for dropout and accelerating learning rate decay, that make networks train faster26,27. Previous studies28,29 have determined that weight decay is a regularizer that avoids overfitting; it also reduces the squared error during training. Restated, penalizing the neural network during training according to the magnitude of its weights minimizes overfitting.
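To make this penalty concrete, the short PyTorch sketch below adds an explicit L2 penalty on the weights to the data loss. It is only an illustration of the general technique; in our training the equivalent effect was obtained through the optimizer's weight decay setting described in the Methods, and the coefficient shown here is merely an example.

```python
import torch

def loss_with_weight_decay(data_loss, parameters, decay=0.0005):
    """Add an L2 penalty on the network weights to the data loss."""
    l2_penalty = sum((p ** 2).sum() for p in parameters)
    return data_loss + decay * l2_penalty
```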

Regarding the object detector, two-stage detectors, such as R-CNN30, Fast R-CNN31, Faster R-CNN32, and Mask R-CNN33, use a region proposal stage to produce regions of interest. In the second stage, the region proposals are sent for object classification and bounding box regression. By contrast, one-stage detectors, such as YOLO v1-v334,35,36 and the single shot multibox detector37, treat object detection as a regression problem and skip the region proposal stage to detect objects directly. Because of this design, a one-stage detector is generally superior to a two-stage detector in inference speed but suffers in detection accuracy. However, YOLO v3 not only outperforms other conventional one-stage object detectors in speed but is also comparable to two-stage object detectors in accuracy. In addition, the model architecture of YOLO v3 uses Darknet-53 instead of Darknet-19 as the feature extractor. Darknet-53 requires fewer floating-point computations, making calculation more efficient and prediction faster. Furthermore, YOLO v3 uses multi-label classification with independent logistic classifiers, which perform better than softmax, and binary cross-entropy loss to produce normalized class probabilities during training. In our experiment, however, we labeled only the region of the lumen for single-class detection. We evaluated the model with the P-R curve because no negative label was present in any image in our training set.
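To make the distinction concrete, the following PyTorch sketch (an illustration of the general technique, not the authors' implementation; the tensor sizes are arbitrary) contrasts softmax scoring with the independent sigmoid classifiers and binary cross-entropy loss used in YOLO v3 class prediction:

```python
import torch
import torch.nn.functional as F

# Hypothetical raw class logits for 3 predictions over 4 classes.
logits = torch.randn(3, 4)
targets = torch.tensor([[1., 0., 0., 0.],
                        [0., 1., 1., 0.],   # multi-label target: two classes active
                        [0., 0., 0., 1.]])

# Softmax couples the class scores so they sum to 1 (single-label assumption).
softmax_probs = F.softmax(logits, dim=1)

# YOLO v3 style: an independent logistic (sigmoid) score per class, trained with
# binary cross-entropy, so several labels can be active for one box.
sigmoid_probs = torch.sigmoid(logits)
bce_loss = F.binary_cross_entropy_with_logits(logits, targets)
```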

In a previous study, Zabulis et al. detected the lumen with the mean shift algorithm18. The algorithm runs several times with various data points. For each point, mean shift defines a region around it and computes the mean of the data points inside. It then shifts the center of the region to the mean and repeats this process until convergence. In the end, the region shifts to a denser part of the dataset. In their experiments, the frame rate was 0.33 fps, which is not fast enough to apply to video during colonoscopy. By contrast, our proposed method performed inference on MCC images at an average rate of 30 fps. Gallo et al. proposed a boosting classification-based method for lumen detection38. Their best classification results for R and P were approximately 0.9 and 0.7, respectively. Wang et al. utilized Bayer-format downsampling, adaptive threshold segmentation, and radial texture detection techniques to identify the intestinal lumen. The precision and sensitivity of lumen detection were reported as 95.5% and 98.1%, respectively23. The speed was 0.02 s per frame, but they used low-resolution images (64 × 64) to reduce computational complexity. In our results, precision and sensitivity were 96.1% and 96.4%, respectively, even at a high resolution (1920 × 1080). With YOLO v3, inference speed was 0.033 s per frame.
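A minimal NumPy sketch of the generic mean shift procedure described above (our own illustration with an assumed flat kernel and bandwidth, not the implementation of Zabulis et al.):

```python
import numpy as np

def mean_shift(points, start, bandwidth=1.0, tol=1e-3, max_iter=100):
    """Shift a window center toward the densest region of `points` (flat kernel)."""
    center = np.asarray(start, dtype=float)
    for _ in range(max_iter):
        # Collect all data points inside the region around the current center.
        in_window = points[np.linalg.norm(points - center, axis=1) < bandwidth]
        if len(in_window) == 0:
            break
        new_center = in_window.mean(axis=0)            # mean of points in the region
        if np.linalg.norm(new_center - center) < tol:  # converged
            break
        center = new_center                            # shift the region to the mean
    return center
```

Because the window must be shifted repeatedly for many starting points, this family of methods tends to be much slower per frame than a single forward pass through a detection network.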

Two navigation scenarios were designed to prove the effectiveness and feasibility of the MACC system. In semi-automated navigation, the MACC system manipulated the MFN through the integrated automated alignment system. Although this system can align automatically, there were two situations that might require operator intervention during the intubation process. The first was a lumen image with an unclear contour because of lubricant sticking to the camera. The second was a poor alignment rate when the capsule passed through a sharp angle, which most frequently occurred at the rectosigmoid junction.

This study has several limitations. First, the models trained on the endoscopic dataset were less precise than those trained on the simulated dataset. The reason for this may be that the endoscopic dataset contained many lumen images with unclear contours or even with contours covered by stool, mucus, or bubbles, making it more difficult for the neural network to extract the features it was supposed to learn. Second, the variation of endoscopic images was higher than we expected. A possible solution for this may be additional data cleaning, scrubbing, and augmentation. Removing similar images from the endoscopic dataset and relabeling the lumen as the proper region could solve the problem. Finally, a clinical trial is required to prove that this MACC system with automatic lumen alignment shortens cecal intubation time.

Conclusion

This study used a deep learning algorithm and an automatic lumen detection model that demonstrated high precision and recall with endoscopic and simulated datasets. By coordinating the lumen detection model with alignment control, this integrated method may increase the performance and efficiency of capsule colonoscopy. The MACC system shows promise for increasing the navigation efficacy of capsule colonoscopy.

Methods

MACC system

The proposed MACC system consists of an MFN, MCC, image receiving decoder, and joystick (Fig. 4). The MFN is capable of 5-degree-of-freedom operation within a working space of 650 × 650 × 410 mm³. The MFN has a radially magnetized ring-shaped magnet (NdFeB alloy) driven by a servo motor through a belt to locomote the MCC inside the colon lumen. The MCC measures 25.5 mm × 9.9 mm and weighs 4.64 g. Its components have been described elsewhere11. Briefly, it has an internal permanent magnet, 4 white light-emitting diodes, optical modules, including a lens and a complementary metal-oxide semiconductor (CMOS) sensor, and a thin cable. Images are transmitted at 30 frames per second (fps) from the CMOS sensor, and the image resolution is 640 × 480 pixels. An Extreme 3D Pro joystick (Logitech International S.A., Lausanne, Switzerland) is used to control the movement and direction of the MFN.

Figure 4
figure 4

Apparatus of the MACC system, consisting of the MFN, MCC, image receiving decoder, and joystick.

Colonoscopy image dataset

This study was approved by the Joint Institutional Review Board of Taipei Medical University. Owing to the retrospective review of colonoscopic images in this study, the Taipei Medical University ethics committee waived the need for patient informed consent. All methods were carried out in accordance with relevant guidelines and regulations. Two datasets were used in this study: an endoscopic dataset containing no patient information, and a simulated dataset acquired from a colonoscopy training model simulator (Kyoto Kagaku Co., Ltd., Japan), a previously validated physical simulator39. The endoscopic dataset contained a total of 9080 colonoscopic images. Of these, 3934 images contained a visible lumen. The remaining 5146 images, in which the lumen was unrecognizable or concealed, served as negative samples for lumen detection. We randomly split the 9080 images into training and testing sets at a ratio of 4 to 1. The simulated dataset comprised 8414 images in total. We randomly chose 6731 images for training and 1683 images for testing, also a ratio of 4 to 1.
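A minimal sketch of such a 4:1 random split is shown below; the file names and the fixed seed are hypothetical placeholders used only for illustration.

```python
import random

def split_dataset(image_paths, train_ratio=0.8, seed=42):
    """Randomly split image paths into training and testing sets at a 4:1 ratio."""
    paths = list(image_paths)
    random.Random(seed).shuffle(paths)
    cut = int(len(paths) * train_ratio)
    return paths[:cut], paths[cut:]

# Example with the endoscopic dataset size reported above (9080 images).
train_set, test_set = split_dataset([f"image_{i:05d}.png" for i in range(9080)])
```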

Models A and B were trained on the endoscopic dataset. The purpose of these models was to prove that our lumen detection method works well with real endoscopic images. Models C and D used the simulated dataset taken from the simulator. We chose the best-performing model to provide inference results as input for the lumen alignment experiment in the MACC system. Models A and C were trained and tested with images that included the negative samples, meaning that some images contained no instance of the lumen. By contrast, models B and D were trained and tested without negative samples, meaning that every image contained at least one instance of the lumen. All 4 models were trained with the same parameter settings on the same personal computer. No training images were used as inference sources.

We implemented several techniques while training the models, including data augmentation, batch normalization (BN), and weight decay, to prevent overfitting. BN normalizes the inputs of each layer; a BN layer has a regularizing effect similar to that of a dropout layer but makes networks train faster. Weight decay is another technique used to avoid overfitting by limiting the size of neuron weights. For data augmentation, images were flipped vertically and horizontally, and each image was randomly perturbed with salt-and-pepper noise and Gaussian noise to enhance the robustness of the models.
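A minimal NumPy sketch of these augmentations (flips plus salt-and-pepper and Gaussian noise) is given below; the noise parameters shown are illustrative rather than the exact values used in training.

```python
import numpy as np

def augment(image, salt_pepper_frac=0.01, noise_sigma=5.0, rng=np.random):
    """Randomly flip an H x W x C uint8 image and add salt-and-pepper and Gaussian noise."""
    out = image.copy()
    if rng.rand() < 0.5:
        out = out[:, ::-1]                      # horizontal flip
    if rng.rand() < 0.5:
        out = out[::-1, :]                      # vertical flip
    # Salt-and-pepper noise: set a random fraction of pixels to black or white.
    mask = rng.rand(*out.shape[:2]) < salt_pepper_frac
    out[mask] = rng.choice([0, 255], size=(int(mask.sum()), 1))
    # Gaussian noise added to every pixel value.
    noisy = out.astype(np.float32) + rng.normal(0.0, noise_sigma, out.shape)
    return np.clip(noisy, 0, 255).astype(np.uint8)
```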

Training and testing of the convolutional neural network model

All experiments were conducted with Python 3.6.8 and PyTorch 1.3.1. Python is an interpreted, object-oriented, high-level dynamic programming language. PyTorch is a Python package that provides tensor computation with graphics processing unit (GPU) acceleration. All models were trained and evaluated on a personal computer with an Nvidia GTX 1080Ti GPU (NVIDIA Corporation, Santa Clara, CA, USA) with 11 GB of memory. The operating system was Windows 10. For transfer learning, the models were initialized with weights pretrained on ImageNet, a large visual database intended for research on visual object recognition. The 4 models were developed based on You Only Look Once version 3 (YOLO v3), an end-to-end convolutional neural network (CNN) able to infer multiple rectangular box locations and classes. Restated, YOLO v3 allows one-stage, simultaneous object detection and localization. With regard to bounding box prediction, YOLO v3 uses dimension clusters as anchor boxes and predicts 4 coordinates for each bounding box. The labeling work was performed by 4 investigators (Chu, HE Huang, WM Huang, and Yen), and all annotations were confirmed by an experienced gastroenterologist (Suk).
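For reference, the YOLO v3 coordinate transform can be sketched as follows (a paraphrase of the published formulation rather than the authors' code): the network predicts 4 offsets relative to a grid cell and an anchor box derived from dimension clusters.

```python
import torch

def decode_box(t, cell_xy, anchor_wh, stride):
    """Decode YOLO v3 offsets t = (tx, ty, tw, th) into a box center and size.

    cell_xy  : (cx, cy) grid-cell indices of the prediction
    anchor_wh: (pw, ph) anchor (dimension cluster) width and height in pixels
    stride   : width of one grid cell in pixels
    """
    tx, ty, tw, th = t
    bx = (torch.sigmoid(tx) + cell_xy[0]) * stride   # box center x in pixels
    by = (torch.sigmoid(ty) + cell_xy[1]) * stride   # box center y in pixels
    bw = anchor_wh[0] * torch.exp(tw)                # box width in pixels
    bh = anchor_wh[1] * torch.exp(th)                # box height in pixels
    return bx, by, bw, bh
```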

In preprocessing, we saved the bounding box of the lumen as ground truth in text files, which were read during the training process. Next, the model parameters were updated by a stochastic gradient descent (SGD) optimizer according to the training loss. Predictions were classified as the presence of the lumen if the output confidence was > 0.5. The learning rate was 0.001 in our experiments, and an exponentially decaying weight penalty (0.0005) was applied; this method effectively suppressed model overfitting. The batch size was 4, and training stopped when the epoch count exceeded 150. We applied the non-maximum suppression (NMS)40 method to the predicted results to remove redundant bounding boxes and find the best location for object prediction. Finally, the best lumen detection bounding box was drawn on the original image. The developed model architecture for lumen detection is presented in Fig. 5.
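A minimal PyTorch sketch of this training and post-processing configuration is given below; only the learning rate, weight decay, batch size, epoch limit, and use of NMS follow the text, while the placeholder model, momentum value, and NMS threshold are assumptions.

```python
import torch
import torch.nn as nn
from torchvision.ops import nms

model = nn.Conv2d(3, 255, 1)  # placeholder standing in for the YOLO v3 network

# Learning rate 0.001, weight decay 0.0005, batch size 4, and 150 epochs follow the text;
# the momentum value is an assumption.
optimizer = torch.optim.SGD(model.parameters(), lr=0.001,
                            momentum=0.9, weight_decay=0.0005)

def train(train_loader, compute_loss, epochs=150):
    """Update model parameters with SGD according to the training loss."""
    for _ in range(epochs):
        for images, targets in train_loader:             # batches of 4 images
            optimizer.zero_grad()
            loss = compute_loss(model(images), targets)  # detection loss (placeholder)
            loss.backward()
            optimizer.step()

def postprocess(boxes, scores, iou_threshold=0.5):
    """Remove redundant bounding boxes with non-maximum suppression (NMS)."""
    keep = nms(boxes, scores, iou_threshold)             # threshold is illustrative
    return boxes[keep], scores[keep]
```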

Figure 5
figure 5

Workflow and model architecture of lumen detection. The input was the endoscopic image. The neural network inferred the presence and position of the lumen. The output was the image with the bounding box marking the detected lumen. FPN Feature Pyramid Network, Conv convolution layers, FC fully connected layers.

Magnetic lumen alignment control

An overview of the alignment control framework is depicted in Fig. 6. We read the coordinates of the bounding box center and used them to calculate the calibration error relative to the center of the image. The calibration error was defined as the Euclidean distance between the bounding box center and the image center. Based on real-time posture angles measured by an inertial measurement unit sensor (Freescale Semiconductor, Tempe, AZ, USA) built into the MCC, the posture angle was passed through a rotation matrix to keep the camera view horizontally level for the operator. The results were then sent to a proportional-integral (PI) controller to manipulate the servo motor on the MFN. PI control was used to quickly minimize the alignment bias (steady-state error). The MFN then adjusted the yaw and pitch angles on the basis of the instructions given by the PI controller. The alignment stopped when the calibration error was smaller than 50 pixels (1 pixel = 0.265 mm); the image resolution was 640 × 480 pixels. We implemented the alignment control for 160 trials in 8 orientations (north, northeast, east, southeast, south, southwest, west, and northwest) and recorded the calibration time for each trial.
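A simplified sketch of one alignment step is shown below; the 50-pixel stopping threshold, image size, and PI structure follow the text, whereas the gains, sampling interval, and function interface are hypothetical.

```python
import math

IMAGE_CENTER = (320, 240)   # center of the 640 x 480 image
STOP_THRESHOLD = 50         # pixels (1 pixel = 0.265 mm)

def pi_alignment_step(bbox_center, integral, kp=0.05, ki=0.01, dt=1 / 30):
    """One PI control step: returns (yaw_cmd, pitch_cmd, new_integral, done)."""
    ex = bbox_center[0] - IMAGE_CENTER[0]      # horizontal error drives yaw
    ey = bbox_center[1] - IMAGE_CENTER[1]      # vertical error drives pitch
    if math.hypot(ex, ey) < STOP_THRESHOLD:    # Euclidean calibration error
        return 0.0, 0.0, integral, True        # aligned: stop adjusting
    integral = (integral[0] + ex * dt, integral[1] + ey * dt)
    yaw_cmd = kp * ex + ki * integral[0]       # PI law for the yaw servo
    pitch_cmd = kp * ey + ki * integral[1]     # PI law for the pitch servo
    return yaw_cmd, pitch_cmd, integral, False
```

The integral term accumulates the residual error over successive frames, which is what allows the controller to drive the steady-state alignment bias toward zero.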

Figure 6
figure 6

Control block diagram for the alignment experiment with the MACC system. Using the calibration error between the lumen position and the image center, real-time posture information of the MCC was sent to the orientation correction and PI controller. The MFN adjusted the yaw and pitch angles according to the control signal from the PI controller until the calibration error reached the predefined threshold. P proportional, I integral, Kp proportional gain, Ki integral gain.

Semi-automated navigation and manual navigation

In semi-automated navigation, lumen alignment was controlled by the automated alignment system during navigation from the rectum to the cecum. During navigation, the automated alignment system took full control of reorienting the MCC in the correct direction instead of relying on manual operation. If the capsule became stuck in the lumen, operators could intervene as needed. For comparison with semi-automated navigation, we performed manual navigation in the same colonoscopy training simulator. In this scenario, we used the joystick to control the MFN to navigate the MCC from the rectum to the cecum.

Statistical analysis

A prediction was considered a true positive (TP) if the area of overlap divided by the area of union (IoU) with the ground truth was greater than the predefined IoU threshold. Otherwise, the prediction was a false positive (FP). That is, for a TP the model placed the bounding box at the correct place and moment in the image during the testing phase, and the lumen actually appeared in the image. The prediction was a false negative (FN) if the model failed to detect the presence of a lumen when it actually appeared in the image.

The models were evaluated using several validation indexes41. Recall (R) was defined as the proportion of actual positives identified correctly (R = TP/[TP + FN]). Precision (P) was defined as the proportion of positive identifications that were actually correct (P = TP/[TP + FP]). Average precision (AP) was defined as the area under the P-R curve. The mean average precision (mAP) was the average of the AP over all classes; this was used to evaluate the precision of bounding box localization. The F1 score ([2 × R × P]/[R + P]) was a measure of a test's accuracy; its purpose was to balance P and R. The intersection over union (IoU = area of overlap/area of union) was defined as the overlap between the predicted bounding box and the ground truth bounding box. Therefore, IoU was a parameter used to test how accurately the bounding box was drawn relative to the ground truth. The P-R curve is a graph with P on the y-axis and R on the x-axis; it indicates the trade-off between P and R at different thresholds for recognizing a data point as belonging to the positive class. The AUC was used in the classification evaluation to determine which model best predicted the classes; a high AUC reflected high R and P. All statistical analyses were conducted using MATLAB 2019a numerical software.
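As an illustration of how these indexes can be computed for a single class, a short Python sketch (our own illustration, not the evaluation script used in this study) is:

```python
import numpy as np

def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    x1, y1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    x2, y2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

def recall_precision_f1(tp, fp, fn):
    """Recall, precision, and F1 score from TP/FP/FN counts."""
    r = tp / (tp + fn) if (tp + fn) else 0.0
    p = tp / (tp + fp) if (tp + fp) else 0.0
    f1 = 2 * r * p / (r + p) if (r + p) else 0.0
    return r, p, f1

def average_precision(recalls, precisions):
    """Area under the P-R curve (AP), integrating precision over recall."""
    order = np.argsort(recalls)
    return float(np.trapz(np.asarray(precisions)[order], np.asarray(recalls)[order]))
```

A prediction would be counted as a TP here whenever its IoU with the ground truth exceeds the chosen threshold (for example, 0.3 as reported in the Results).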

Ethical approval

This study was approved by the Joint Institutional Review Board of Taipei Medical University.