Introduction

Intussusception is an acquired invagination of the proximal segment of the intestine into the distal segment and is the most common cause of intestinal obstruction among children aged 3 to 36 months1,2,3. This disease is a relatively common cause of emergency room visits in children. Rapid diagnosis and treatment with air enema within 24 h from the onset can alleviate symptoms in approximately 84% of patients; however, prolonged cases can develop ischaemia, necrosis, and perforation4,5.

There are several imaging studies available for diagnosing intussusception. Hydrostatic or pneumatic enemas have been considered the gold standard for both diagnosing and treating intussusception6. However, these are invasive radiologic procedures that must be performed by radiologists and are not always readily available7. Conversely, ultrasonography has been proven to be a reliable first-line diagnostic modality for patients suspected to have intussusception8,9,10. However, the utility of this procedure is affected by the skill of the operator and variations in equipment, the availability of which may be limited in certain areas. Plain abdominal radiography is inexpensive and is commonly used as a first-line screening test for intussusception in patients with gastrointestinal signs and symptoms11,12. Despite its low sensitivity (< 50%) and poor rate of inter-observer agreement in diagnosing intussusception, it remains an important diagnostic modality and has long been used to screen for other conditions such as constipation, ileus, and peritoneal air6,12,13.

Deep convolutional neural networks (CNN) are widely used for image detection and classification and have been utilised in the fields of radiology and medical image analysis14,15,16,17. An automated method for screening plain abdominal radiographs and prioritising positive images for rapid review and diagnosis may minimise possible delays in diagnosing intussusception and reduce the incidence of misdiagnoses; this is especially important in medical environments, such as primary care institutions, where there is little or no experience with intussusception during emergency situations. Deep CNN models (1) require large and well-curated training data sets that contain significant visual heterogeneity, (2) must be tested through external validation, and (3) must undergo optimisation of equipment and settings to ensure high accuracy and performance in various clinical environments17. There are no previous studies on the availability and external validity of deep learning in diagnosing intussusception using large data sets of plain abdominal radiographs. This study aimed to create a human-annotated data set of plain abdominal X-rays of children with intussusception for internal and external validation, and to verify the feasibility of a deep CNN in detecting intussusception using this data set.

Results

A total of 11,384 images consisting of 1449 positive images and 9935 negative images were collected (Fig. 1). The baseline characteristics of participants who provided these images are shown in Table 1. Significant differences between the two groups (positive and negative image groups) were observed regarding age and sex in the sets gathered from hospitals B and C but not from the set provided by hospital A.

Figure 1
figure 1

Flow chart of data collection and analysis. ED, emergency department.

Table 1 Baseline characteristics of participants who provided images for the data sets.

Phase 1: Training evaluation and internal validation tests using two data sets and external validation tests using the excluded data set

The diagnostic performance matrix of the internal and external validation tests, including the optimal cut-off values, is shown in Table 2. The values of the internal validation test after training with data sets A + B, B + C, and C + A were: 0.966 (0.955, 0.975), 0.971 (0.959, 0.980), and 0.946 (0.926, 0.961), respectively, for the AUC (95% CI); 0.952, 0.943, and 0.927, respectively, for the highest accuracy; and 0.818, 0.848, and 0.764, respectively, for the highest Youden index. The values of the external validation test using the excluded sets (external validation of set C as the counterpart of the internal validation test using sets A + B, etc.) were: 0.811 (0.784, 0.835), 0.895 (0.874, 0.913), and 0.844 (0.828, 0.858), respectively, for the AUC; and 0.421, 0.431, and 0.493, respectively, for the Youden indices. All values had a P-value < 0.001 (Table 3, Fig. 2).

Table 2 Diagnostic performance matrix of the internal and external validation tests with optimal cut-off values (Phase 1).
Table 3 Outcomes of the internal validation test after the training with two data sets and of the external validation test using the excluded data set (Phase 1).
Figure 2
figure 2

Receiver operating characteristic (ROC) curves of internal and external validation tests in Phase 1 and 2 experiments. (A) External validation with set C after training and internal validation with sets A + B, (B) External validation with set A after training and internal validation with sets B + C, (C) External validation with set B after training and internal validation with sets C + A, (D) Internal validation after training with sets A + B + C.

Phase 2: Internal validation test using all data sets

The results of the internal validation tests are summarised in Table 4. The mean values (95% CI) gathered from the internal validation tests were 0.935 (0.928, 0.941) and 0.743 (0.722, 0.763) for the AUC and the highest Youden index, respectively. The mean values for sensitivity and specificity at the highest Youden index were 0.816 (0.789, 0.842) and 0.925 (0.893, 0.957), respectively (Table 4, Fig. 2).

Table 4 Outcomes on the internal validation test after training with all data sets (Phase 2).

We visualised the feature maps of images from the second internal validation test where intussusception was detected with the highest Youden index (0.731). From the visualisation of 292 images, the correct area was chosen by the network in 255 cases, which indicates that the network has learned how to detect and classify intussusception. True positive images are shown in Fig. 3.

Figure 3
figure 3

Class activation map (CAM) for images that were true positive in the 2nd internal validation test using all data sets. The images in the odd rows are the original images, while those in the even rows are the same images with the CAM applied. The CAM highlighted unidentified areas in the images in the 6th row, whereas it highlighted the correct areas in the 2nd and 4th rows.

Discussion

The classic triad of intussusception—red currant jelly stools, colicky abdominal pain, and vomiting—was seen in less than 40% of the children in this study; these nonspecific signs and symptoms make the diagnosis of intussusception challenging and force clinicians to rely only on the patient’s history and physical examination findings18,19,20. Point-of-care ultrasound, when performed by an emergency medicine physician, has a high diagnostic accuracy for intussusception, with sensitivity and specificity values of 0.94 and 0.98, respectively; these results are similar to those of radiologist-performed ultrasounds21. Ultrasound is easy for other physicians—even novice ones—to perform, and it allows minimisation of radiation exposure for the patient. However, because the mean annual intussusception incidence rate is approximately 30 per 100,000 live births in the first 3 years of life3, using ultrasound as a screening exam to rule out intussusception in all children who present with nonspecific signs and symptoms is difficult.

In a study on the use of risk stratification in evaluating intussusception in children, it was found that abdominal radiography could be used as the initial diagnostic modality to identify children at risk, with sensitivity and specificity values of 0.77 and 0.79, respectively22. However, these radiographs were interpreted by paediatric radiologists using predefined criteria such as small bowel obstruction, target or crescent signs, and findings consistent with ileocolic intussusception. Kim et al. reported that drawing rectangular region-of-interest (ROI) indicators on abdominal radiographs could allow deep learning-based algorithms to aid in screening for right upper quadrant ileocolic intussusception in young patients. According to a 75-image internal validation test, the sensitivity and specificity values of their algorithm were 0.76 and 0.96, respectively, which are better than those of a radiologist, who was found to have sensitivity and specificity values of 0.56 and 0.92, respectively23. In our study, we drew a rectangular ROI that encompassed the entire abdomen; the ranges of the sensitivity and specificity values after conducting training and internal tests using two data sets were 0.913–0.943 and 0.851–0.905, respectively. In a study on the use of deep learning for diagnosing small bowel obstruction using plain abdominal radiography, the detection accuracy was found to improve significantly with the number of positive training radiographs used24. We believe that the large volume of data used by our algorithm improved the outcomes of using deep learning to detect intussusception. The application of this deep learning-based algorithm as a screening tool in the hospitals that provided the data sets could decrease the unnecessary use of abdominal ultrasonography.

The AUC and Youden index values from all three external validations were lower by approximately 0.15 and 0.4, respectively, than the values from the internal tests. Possible explanations for these findings include differences in data volume, variations in the proportion of positive and negative images, and differences in the quality of each data set. However, the sensitivity in the external validation tests remained at least 0.65; this indicates that the completed model, which was trained using two hospital data sets, can be transferred to other hospitals and used as a screening tool for diagnosing intussusception. In the internal validation tests with fivefold cross-validation after training with all sets, all values of the Youden index, as well as sensitivity and specificity, were higher than the values from the external validation tests with the excluded set after training and internal validation with two sets. To optimise performance in specific environments, hospitals that will use the model must train it using their own positive and negative images. In our study, using CAM for visualisation, we showed which parts of the plain abdominal X-rays the model focused on.

There are several limitations to this study. First, we did not compare the performance of our model against that of physicians with respect to key factors such as clinical outcomes, the time required to arrive at a diagnosis, and the equipment needed to use the model as a screening tool. Second, we did not annotate the actual location of intussusception on the X-ray images. Thus, we trained the deep CNN under weak supervision using only the presence or absence of intussusception. Better performance can be expected with full supervision and coordinate information regarding the location of the intussusception. Third, there was a difference in resolution between the medical images and the input images of the deep CNN. The resolution of the extracted medical images in our data set was approximately 3000 × 4000, while the resolution of the input images for our model was only 224 × 224. Therefore, information loss may occur when attempting to detect intussusception, since the medical images were downsampled. However, if the image size is too large, both the number of computations and the amount of memory consumed increase exponentially; this might render the operation too slow or even impossible to perform. Therefore, further studies that minimise information loss by appropriate resizing of images or selection of only the ROI are needed. Fourth, the differences in age and sex between the negative and positive groups could introduce other information into the images, including body shape and bone growth, and thereby influence the training and detection of intussusception with the deep CNN. Fifth, we stored the images in an 8-bit JPEG grayscale format; this process could degrade the data, since image intensity levels and the contrast needed for fine details are reduced or removed. Finally, although positive and negative cases were equally represented during training through balanced mini-batch sampling, the imbalanced testing data set would decrease the reliability of the testing results.

In conclusion, we verified the feasibility of a deep CNN algorithm, consisting of abdominal detection and intussusception classification networks, that uses plain abdominal X-rays to help physicians screen for intussusception. This algorithm can be trained by hospitals that can provide images before being transferred to other hospitals and used to screen for intussusception in children.

Methods

Study design

We conducted a retrospective study at three tertiary academic hospitals (Seoul and Gyeonggi-Do, Republic of Korea) between October 2019 and January 2020 to evaluate the role of deep learning in diagnosing intussusception using plain abdominal X-rays. This study was approved by the Institutional Review Board (IRB) of Hanyang University Hospital (ref. no. HYUH 2019-06-015), the IRB of Hanyang University Guri Hospital (ref. no. GURI 2020-01-006), and the IRB of Seoul National University Bundang Hospital (ref. no. B-1907-555-102). The requirement for informed consent was waived by the IRBs of Hanyang University Hospital, Hanyang University Guri Hospital, and Seoul National University Bundang Hospital. All methods and procedures were carried out in accordance with the Declaration of Helsinki.

Data set

Plain abdominal X-rays of patients diagnosed with intussusception (positive images)

We gathered data on patients who were diagnosed with intussusception and treated with hydrostatic or pneumatic enema at the emergency room from the medical records of Hanyang University Hospital (set A) and Hanyang University Guri Hospital (set B) from January 2005 to August 2019, and from Seoul National University Bundang Hospital (set C) from January 2010 to August 2019. The inclusion criterion was age ≤ 6 years. We obtained the supine and erect views of plain abdominal X-rays in all eligible patients; these images were validated, and a diagnosis of intussusception was made by radiologists before an abdominal ultrasound was performed.

Plain abdominal X-rays of patients not diagnosed with intussusception (negative images)

The candidate images for inclusion in the negative group were identified using X-rays of patients of the same age group who visited the emergency room with complaints of abdominal pain, vomiting, or diarrhoea that were not indicative of intussusception. Their radiographs were reported by radiologists as ‘unremarkable study’, ‘non-specific finding’, ‘rule out paralytic ileus’, or ‘rule out gastroenteritis’. We collected these images from the same hospitals and within the same time period.

The collected images had a positive-to-negative ratio of approximately 1:3–1:12. All candidate images were extracted in the Digital Imaging and Communications in Medicine (DICOM) format used by the picture archiving and communication system (PACS, Centricity, GE Healthcare, Milwaukee, WI, USA), using a custom-built automated image retrieval system. We stored the images in an 8-bit JPEG grayscale format.
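The retrieval system itself was custom-built and is not described in detail here; purely as an illustration of the storage step, the following sketch (assuming the pydicom and Pillow libraries, and hypothetical `dicom_path` and `jpeg_path` file names) shows one way a DICOM radiograph could be rescaled and saved as an 8-bit grayscale JPEG.

```python
import numpy as np
import pydicom
from PIL import Image

def dicom_to_jpeg8(dicom_path, jpeg_path):
    """Convert a DICOM radiograph to an 8-bit grayscale JPEG (illustrative sketch)."""
    ds = pydicom.dcmread(dicom_path)
    pixels = ds.pixel_array.astype(np.float32)

    # Linearly rescale the full dynamic range to 0-255; a clinical pipeline
    # might instead apply the DICOM window/level settings before quantising.
    pixels -= pixels.min()
    if pixels.max() > 0:
        pixels /= pixels.max()
    img8 = (pixels * 255).astype(np.uint8)

    # MONOCHROME1 images are stored inverted (high pixel values appear dark).
    if getattr(ds, "PhotometricInterpretation", "") == "MONOCHROME1":
        img8 = 255 - img8

    Image.fromarray(img8, mode="L").save(jpeg_path, format="JPEG")
```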

Abdominal detection and intussusception classification

The overall workflow of the proposed intussusception screening system is shown in Fig. 4. Our architecture consists of (1) an abdominal detection model that detects the abdominal region and (2) an intussusception classification model that detects intussusception.

Figure 4
figure 4

Intussusception screening system architecture. The proposed architecture consists of the abdomen detection model (top) and the intussusception classification model (bottom). The abdomen detection model detects the abdominal region from the entire X-ray image. The intussusception classification model detects intussusception on X-ray images that were cropped by the abdomen detection model.

Abdominal detection model

We used the Single Shot Multibox Detector (SSD) for the abdominal detection model25. The SSD generates default boxes with various ratios and scales from multiple feature maps to learn the regression model for object coordinates and the classification model for object label confidence. As we needed to detect the abdominal region, we changed the last fully connected layer to predict two classes: the abdomen and the background. Moreover, we retrained the last fully connected layer to compute the coordinates and confidence values for the abdominal region and the background. To train the abdominal detection model, we manually annotated the abdominal regions using Python 3.7 (https://www.python.org). Using the images of the patients’ abdomens, we selected rectangular regions of interest (ROI) spanning from the diaphragm to the upper margin of the acetabulum, along with the corresponding lateral borders.
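The detector itself was implemented with SSD as described above; as a minimal sketch of how any detector's output could be turned into an abdominal crop for the downstream classifier, the following assumes generic `boxes`, `scores`, and `labels` arrays and is not the actual implementation.

```python
import numpy as np

def crop_abdomen(image, boxes, scores, labels, abdomen_label=1, min_score=0.5):
    """Crop the abdominal ROI given detector outputs (illustrative sketch).

    image  : H x W (or H x W x C) numpy array
    boxes  : N x 4 array of [x_min, y_min, x_max, y_max] in pixel coordinates
    scores : N confidence values; labels : N predicted class indices
    """
    keep = (labels == abdomen_label) & (scores >= min_score)
    if not np.any(keep):
        return image  # fall back to the full radiograph if no abdomen is found

    # Keep the single highest-confidence abdomen box.
    idx = np.argmax(np.where(keep, scores, -np.inf))
    x0, y0, x1, y1 = boxes[idx].astype(int)

    h, w = image.shape[:2]
    x0, y0 = max(x0, 0), max(y0, 0)
    x1, y1 = min(x1, w), min(y1, h)
    return image[y0:y1, x0:x1]
```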

Intussusception classification model

Among the deep learning CNN models for classification, which include AlexNet, VGG, ResNet, and DenseNet, we used ResNet (Residual Network) as the intussusception classifier26,27,28,29. ResNet uses a skip connection that adds the input feature to the output of the residual layer. Because the skip connection allows the model to learn the difference between the input and output features, it mitigates the gradient vanishing problem that occurs as the layers become deeper. Furthermore, we modified the last fully connected layer to predict the class probability of intussusception. A sigmoid activation function placed after the last fully connected layer normalised the class probability values to [0, 1]. The network weights were updated by the binary cross-entropy loss,

$$\mathrm{BCE}\left(x\right)=-\sum_{i=1}^{C=2}\left[y_{i}\log p\left(Y=i\mid X\right)+\left(1-y_{i}\right)\log\left(1-p\left(Y=i\mid X\right)\right)\right],$$
(1)

where \(y_{i}\) is the ground-truth label of the \(i\)th class in \(C=\{\text{Intussusception}, \text{Normal}\}\), and \(p(Y=i\mid X)\) denotes the probability that the proposed method predicts for the \(i\)th class given the input X-ray image \(X\).
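Our models were implemented in MatConvNet/MATLAB (see below); purely as an illustration of the modification described in this subsection, a PyTorch sketch of a ResNet whose final layer is replaced by a two-class, sigmoid-normalised output trained with binary cross-entropy might look as follows. The choice of ResNet-50 and the dummy batch are assumptions made only for the example.

```python
import torch
import torch.nn as nn
from torchvision import models

# Illustrative PyTorch sketch (the study itself used MatConvNet/MATLAB):
# a ResNet backbone whose last fully connected layer is replaced so that it
# predicts per-class probabilities for {intussusception, normal}.
model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)
model.fc = nn.Sequential(
    nn.Linear(model.fc.in_features, 2),  # two classes, as in Eq. (1)
    nn.Sigmoid(),                        # normalise class probabilities to [0, 1]
)

criterion = nn.BCELoss()  # binary cross-entropy over the two class outputs

# Example forward/backward pass with a dummy 224 x 224 batch and one-hot labels.
images = torch.randn(16, 3, 224, 224)
targets = torch.tensor([[1.0, 0.0]] * 16)  # [intussusception, normal]
loss = criterion(model(images), targets)
loss.backward()
```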

We used the MatConvNet deep learning library (version 1.0-beta25, https://www.vlfeat.org/matconvnet/) from MATLAB R2019b (https://mathworks.com/) to implement our detection and classification models. The trainings and tests were performed using a GTX Titan Xp GPU (NVIDIA, Santa Clara, CA, USA). The network weights were initialised from a pre-trained model on ImageNet30, and the network was trained end-to-end using stochastic gradient descent (SGD). We trained the model in batches of 16 with an initial learning rate of 0.001 that was linearly decreased over 100 epochs to 0.00001.
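Continuing the previous sketch, the optimisation schedule described above (SGD, batches of 16, learning rate linearly decreased from 0.001 to 0.00001 over 100 epochs) could be transcribed roughly as follows; `train_loader` stands for a batch loader such as the one sketched in the next subsection, and the loop is only an approximation of the MatConvNet solver actually used.

```python
import torch

# Sketch of the optimisation schedule described above, transcribed to PyTorch;
# `model` and `criterion` come from the earlier classifier sketch.
EPOCHS, LR_START, LR_END = 100, 1e-3, 1e-5
optimizer = torch.optim.SGD(model.parameters(), lr=LR_START)

for epoch in range(EPOCHS):
    # Linearly decay the learning rate from 0.001 to 0.00001 over 100 epochs.
    lr = LR_START + (LR_END - LR_START) * epoch / (EPOCHS - 1)
    for group in optimizer.param_groups:
        group["lr"] = lr

    for images, targets in train_loader:  # batches of 16, as described above
        optimizer.zero_grad()
        loss = criterion(model(images), targets)
        loss.backward()
        optimizer.step()
```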

Data augmentation and balanced training

Due to difficulties in acquiring large-scale medical images, effective augmentation of the training data was needed to conduct robust training for the deep learning CNN. Although we collected 11,384 images, which is not a small data size for evaluating the diagnostic capability of the algorithm, there remained considerable potential to improve diagnostic performance through data augmentation. Therefore, we performed elaborate augmentations on the images by applying random rotation and translation changes. Because the proportion of negative images was much higher than that of positive images, overfitting could degrade diagnostic performance. Thus, we sampled mini-batch training data that included the same number of positive and negative images to balance the training.
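A minimal sketch of this augmentation and class-balancing strategy, assuming a PyTorch-style `train_dataset` that yields (image, label) pairs with label 1 for positive images, is given below. The rotation and translation magnitudes are placeholder values, and the weighted sampler only balances classes in expectation rather than enforcing exact equality within each mini-batch.

```python
import torch
from torch.utils.data import DataLoader, WeightedRandomSampler
from torchvision import transforms

# Augmentation applied to each training image (placeholder magnitudes);
# `augment` would be passed to the dataset as its transform.
augment = transforms.Compose([
    transforms.RandomRotation(degrees=10),                       # random rotation
    transforms.RandomAffine(degrees=0, translate=(0.05, 0.05)),  # random translation
    transforms.ToTensor(),
])

# Weight each image inversely to its class frequency so that, on average,
# each mini-batch of 16 contains positives and negatives in equal numbers.
labels = torch.tensor([train_dataset[i][1] for i in range(len(train_dataset))])
class_counts = torch.bincount(labels)
weights = 1.0 / class_counts[labels].float()
sampler = WeightedRandomSampler(weights, num_samples=len(weights), replacement=True)

train_loader = DataLoader(train_dataset, batch_size=16, sampler=sampler)
```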

Data experiments

We validated the performance of our method through two experimental phases. First, we used images from two of the three hospitals as the sets for training and internal validation tests, while the images from the remaining hospital were used as the external validation test set. The data from the two hospitals were split into training (80%) and internal validation test (20%) data to determine the optimal cut-off value for the external validation test. Since there were three hospital data sets, three cases of external validation were examined. Second, we performed training and internal tests using data from all three sets (A, B, and C). Eighty percent and 20% of each data set were used for training and internal validation tests, respectively, via fivefold cross-validation. Any data used in these tests were excluded from the training data set.
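The two phases can be expressed schematically as follows; this sketch assumes hypothetical per-hospital lists `images_a/b/c` and `labels_a/b/c`, and the use of stratified splits is an illustrative assumption rather than a detail reported in this study.

```python
from sklearn.model_selection import StratifiedKFold, train_test_split

# Phase 1 (one of three cases): train/internal test on A + B, external test on C.
images_ab = images_a + images_b
labels_ab = labels_a + labels_b
train_x, internal_x, train_y, internal_y = train_test_split(
    images_ab, labels_ab, test_size=0.2, stratify=labels_ab, random_state=0
)
external_x, external_y = images_c, labels_c  # excluded hospital as external set

# Phase 2: fivefold cross-validation on the pooled data from all three hospitals.
images_all = images_a + images_b + images_c
labels_all = labels_a + labels_b + labels_c
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
for train_idx, test_idx in cv.split(images_all, labels_all):
    ...  # train on train_idx, run the internal validation test on test_idx
```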

The proposed method is a computer-aided diagnosis (CAD) system that assists radiologists and emergency physicians in analysing medical images. Therefore, it is better to show the areas that are suspicious for intussusception rather than simply determining whether the input X-ray image is a case of intussusception or not. To intuitively identify intussusception, we visualised which areas of the X-ray image were predicted to contain the diagnosis using class activation maps (CAMs)31. To generate the CAMs, we extracted the activation map \(f_{k}\) before the last global average pooling layer of the intussusception classification model. When the intussusception classification model determined the input X-ray image to be intussusception, the CAM was obtained by multiplying the extracted activation map \(f_{k}\) by the weight \(w_{k}\) in the final classification layer for feature map \(k\) leading to the predicted pathology:

$$M_{c}\left(x,y\right)=\sum_{k}w_{k}f_{k}\left(x,y\right).$$
(2)
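As a sketch of Eq. (2), and again using the illustrative PyTorch model from the earlier sketches rather than our MatConvNet implementation, the CAM for a single input image could be computed by hooking the activations before the global average pooling layer and weighting them with the classification-layer weights; `class_idx=0` corresponds to the intussusception output in the earlier example.

```python
import torch
import torch.nn.functional as F

# Sketch of Eq. (2): weight the last convolutional feature maps f_k with the
# classification-layer weights w_k for the intussusception class. `model` is
# the modified ResNet from the earlier sketch; layer names follow torchvision.
feature_maps = {}

def hook(module, inputs, output):
    feature_maps["f_k"] = output  # activations before global average pooling

model.layer4.register_forward_hook(hook)
model.eval()

def class_activation_map(image, class_idx=0):
    """Return a CAM upsampled to the input size for one image tensor (C, H, W)."""
    with torch.no_grad():
        model(image.unsqueeze(0))
    f_k = feature_maps["f_k"][0]                # (K, h, w) feature maps
    w_k = model.fc[0].weight[class_idx]         # (K,) weights for the chosen class
    cam = torch.einsum("k,khw->hw", w_k, f_k)   # M_c(x, y) = sum_k w_k f_k(x, y)
    cam = F.interpolate(cam[None, None], size=image.shape[1:], mode="bilinear")[0, 0]
    return (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)  # normalise for display
```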

Outcomes and validation

Our primary outcome was a favourable performance in detecting intussusception in our data sets. In the internal validation test, we used the AUC, highest accuracy, and highest Youden index to measure performance32. Accuracy measures the fraction of correct predictions over the total number of predictions. The Youden index is defined as sensitivity + specificity – 1, that is, the vertical distance between the 45° line and the point on the ROC curve. In the external validation tests, we selected the optimal cut-off value based on the highest Youden index value33 from the internal validation tests; this was done because plain abdominal radiography is commonly used as a first-line screening test for intussusception in patients with gastrointestinal signs and symptoms. Furthermore, we applied the cut-off values in the external validation to determine the AUC and Youden index values.
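For illustration, the AUC, the highest Youden index, and the corresponding optimal cut-off could be computed from the internal validation predictions as in the following sketch (using scikit-learn; `y_true` and `y_score` are hypothetical label and probability arrays, not outputs reported in this study).

```python
import numpy as np
from sklearn.metrics import roc_auc_score, roc_curve

def youden_cutoff(y_true, y_score):
    """Return the AUC, the highest Youden index, and its optimal cut-off value."""
    auc = roc_auc_score(y_true, y_score)
    fpr, tpr, thresholds = roc_curve(y_true, y_score)
    youden = tpr - fpr              # sensitivity + specificity - 1 at each threshold
    best = np.argmax(youden)
    return auc, youden[best], thresholds[best]

# The cut-off selected here on the internal validation set is then fixed and
# applied to the external validation set to compute its sensitivity,
# specificity, and Youden index.
```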

Statistical analysis

All the data were compiled using a standard spreadsheet application (Excel 2016; Microsoft, Redmond, WA, USA) and analysed using NCSS 12 (Statistical Software 2018, NCSS, LLC, Kaysville, Utah, USA, ncss.com/software/ncss). The Kolmogorov–Smirnov test was used to verify whether the data sets had a normal distribution. We generated descriptive statistics and presented them as frequencies and percentages for categorical data, and as medians and interquartile ranges (IQR) (non-normal distribution), means and standard deviations (SD) (normal distribution), or 95% confidence intervals (95% CI) for continuous data. The independent t-test or the Kruskal–Wallis test was used to compare the positive and negative groups. Categorical variables were presented as numbers and percentages and analysed using a chi-square test. We used a single ROC curve and cut-off analysis for the internal tests and two ROC curves with the independent-groups design for comparing the ROC curves of the external and internal validation tests. Two-tailed p-values < 0.05 were considered statistically significant.
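The analyses themselves were performed in NCSS; purely as an illustration of the tests named above, approximate scipy equivalents are sketched below with hypothetical toy data (`age_pos`, `age_neg`, and `sex_table` are invented for the example).

```python
import numpy as np
from scipy import stats

# Hypothetical ages (months) for the positive and negative image groups.
age_pos = np.array([14.0, 20.0, 9.0, 31.0])
age_neg = np.array([25.0, 8.0, 40.0, 18.0, 12.0])

# Normality check (Kolmogorov–Smirnov against a fitted normal distribution).
normal = stats.kstest(
    age_pos, "norm", args=(age_pos.mean(), age_pos.std(ddof=1))
).pvalue > 0.05

# Independent t-test for normally distributed data, Kruskal–Wallis otherwise.
if normal:
    p_age = stats.ttest_ind(age_pos, age_neg).pvalue
else:
    p_age = stats.kruskal(age_pos, age_neg).pvalue

# Chi-square test for the categorical sex distribution between the two groups.
sex_table = np.array([[60, 40], [300, 250]])  # rows: positive/negative; cols: male/female
chi2, p_sex, dof, expected = stats.chi2_contingency(sex_table)
```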