Introduction

With about 1.7 million deaths in 2018, lung cancer is one of the most common causes of cancer death1. As early diagnosis improves outcomes2, regular screening with imaging methods is beneficial. For screening, low-dose computed tomography (CT) in particular has shown promising results in reducing mortality3. While a regular clinical chest CT requires a dose of 4–18 mSv4, low-dose CT applies an effective dose of around 1.5 mSv3. Compared to a posteroanterior study of the chest, which requires around 0.02 mSv4, this dose is still significantly higher. Hence, due to its wider availability and the possible avoidance of radiation-induced long-term effects, chest X-ray (CXR) is a potential alternative to chest CT for lung cancer screening. However, the interpretation of CXR images is often challenging, as small lung nodules can easily be missed. For successful lung cancer screening, it is mandatory to keep the rate of false-negatives low.

False-positive cancer diagnoses, on the other hand, may lead to substantial psychological consequences for patients, such as changes in self-perception or anxiety, as investigated for colorectal cancer5. Thus, for successful lung cancer screening, keeping the rates of false-negatives and false-positives as low as possible is mandatory.

With the rise in computing power, deep-learning-based computer-aided diagnosis (CAD) systems have gained interest in the research community. Only recently, the performance of human readers in disciplines such as breast cancer screening6 and dermoscopic melanoma image classification7, 8 was met or even exceeded. For mammography and chest X-ray classification, networks trained with case-level labels have shown promising results9,10,11,12. However, such systems can only provide disease locations through techniques such as saliency maps13. As these usually provide only inaccurate location boundaries, it is of interest to train such systems with detailed annotations such as box coordinates or segmentations. It is also possible to train such networks in a semi-supervised manner, e.g. where part of the data is labeled on pixel level and the remaining radiographs are annotated on case level14. For deep learning applied to CXR images with pixel-level annotations, U-Net-like architectures can be employed for segmentation tasks15, 16. Current state-of-the-art methods for pneumothorax detection17 or mammography screening6 make use of box annotations, which can be derived from pixel-wise annotations. Both aforementioned studies use a RetinaNet architecture, a one-stage detector18, 19, which is characterized by a faster inference time than two-stage detectors20, 21. The aim of this study was to train a RetinaNet detector for the task of pulmonary nodule detection that is robust to foreign bodies. We evaluated its accuracy for screening and nodule detection tasks. Furthermore, we compared its performance to the participants of a reader study.

Results

Nodule location detection

For nodule localisation, the assessed RetinaNet architecture achieved 43 true-positives, 26 false-positives and 22 false-negatives. In comparison, the performance of the two readers was 42 ± 2 true-positives, 28 ± 0 false-positives and 23 ± 2 false-negatives. Detailed results are shown in Table 1. If not otherwise stated, all results in this paper are given as mean ± standard deviation. The nodule detection performance of RetinaNet can be inspected visually (Fig. 1). Lung segmentation was used to exclude extrathoracic detections; for lung segmentation, a Dice score of 0.97 was achieved.

Table 1 Results for the nodule detection task for radiologists and the RetinaNet model. Evaluation was performed with respect to true-positives (TP), false-positives (FP) and false-negatives (FN).
Figure 1

Chest radiographs with nodules detected by RetinaNet. The ground-truth is marked in green and predictions are indicated by red rectangles. Predicted lung segmentation masks are marked in cyan. (A) True-positive prediction (score 0.938) marked with a red rectangle by the CNN and an undetected, false-negative nodule in the left lung. (B) True-positive prediction within the right lung. (C) False-positive prediction outside of the chest.

Figure 2

(A) Nodules per radiograph plotted against detection score with regression fit. (B) Nodule size plotted against detection score.

To investigate whether larger nodules are detected more easily, and whether nodules in radiographs containing many nodules are detected more easily, we plotted these parameters against the detection score and performed a linear regression fit for the number of nodules per radiograph (Fig. 2A) and the nodule size (Fig. 2B). Furthermore, a free-response receiver operating characteristic (FROC) curve is shown in Fig. 3B.
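For illustration, such a regression fit can be computed as follows (a minimal sketch using scipy; the array names are hypothetical placeholders for the per-nodule data described above):

```python
from scipy.stats import linregress

# Hypothetical arrays with one entry per detected nodule:
# nodule_sizes: nodule size as a fraction of the radiograph area
# detection_scores: RetinaNet confidence score for each detection
result = linregress(nodule_sizes, detection_scores)
print(f"slope={result.slope:.3f}, r={result.rvalue:.3f}, p={result.pvalue:.3g}")
```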

Screening

The classification performance was also assessed on case level using a ROC (receiver operating characteristic) curve, in which the true-positive rate is plotted over the false-positive rate. For RetinaNet, an AUC (area under the ROC curve) of 0.87 with a confidence interval (CI) from 0.80 to 0.94 was found. The performance of the two radiologists for the case-level screening task was 26 TP/4 FP and 31 TP/11 FP, respectively. The ROC curve for the model, together with the radiologist scores, is shown in Fig. 3A.

Figure 3

(A) ROC curve for the screening task. The blue diagonal line marks the performance of a classifier assigning equal prediction scores to healthy and unhealthy cases. (B) FROC curve plotting the average sensitivity per radiograph against the average number of false-positives per radiograph.

Investigation of foreign body detections

We investigated false-positive detections for five different types of foreign bodies (Table 2): ports, electrocardiography (ECG) devices, surgical clips, sternal cerclages and pacemakers. For RetinaNet, one false-positive detection due to foreign bodies occurred for ECG devices and one for ports. Examples of foreign body detections predicted by the RetinaNet architecture are shown in Fig. 4A and B. Overlapping boxes (Fig. 4B) occur rarely, probably when the network suspects a bigger nodule behind two smaller ones.

Table 2 Number of radiographs with false-positive (FP) detections due to foreign bodies (FB) made by the RetinaNet architecture.
Figure 4

Chest radiographs with foreign bodies wrongly detected as pulmonary nodules. (A) ECG device electrode detected as a nodule (false-positive). (B) Port detected as a nodule (false-positive). Overlapping boxes represent multiple detected nodules within a small area (blue box).

Discussion

In this study, we trained and investigated a CNN for pulmonary nodule detection and compared its predictions to the results of two professional radiologists. Besides nodule localisation, the usability of the algorithm for case-level screening was investigated. An important aspect of this study was to evaluate whether foreign bodies contribute to wrong decisions in CNN-based nodule detection systems. As this was not investigated in previous work9, 10, 22, we evaluated whether foreign bodies are wrongly detected as nodules by a CNN.

For the nodule detection task, the CNN was able to outperform one radiologist. The CNN was found to identify larger nodules more easily, but also performed well on smaller nodules (Fig. 2B).

For the screening task, the underlying question was whether a radiograph contains nodules or not. Case-level predictions were derived from the predicted box annotations, similar to the approach used for mammography classification by McKinney et al.6. Compared to CNNs trained with case-level annotations9, CNNs trained with box annotations, such as the investigated RetinaNet architecture, differ substantially. On the one hand, box-level annotations have to be generated by an expert radiologist and cannot be retrieved easily from existing PACS data. While the availability of annotated data is a common limitation for training deep learning systems in a clinical setting, the investigated technology relies on the availability of such annotations; where they are not available, weakly-supervised training23,24,25 may be a possible alternative. On the other hand, CNNs trained with box annotations allow a more detailed evaluation of the retrieved results: they provide an independent score and an accurate location for each lesion, whereas CNNs trained with case-level annotations can only provide a score for the whole image.

In the literature, Wang et al.9 and Rajpurkar et al.10 reported ROC AUCs of 0.72 and 0.78, respectively, for weakly supervised CNN-based lung nodule screening. In our experiments, we achieved a ROC AUC of 0.87 for screening.

Several studies have reported CAD performance for lung nodule detection. Reported sensitivities of CAD systems vary widely, whereby a higher sensitivity usually comes with a higher false-positive rate. Li et al.26 reported that 47 of 66 nodules were correctly marked by a CAD system for CXR lung nodule detection, giving a sensitivity of 0.71 at a mean false-positive rate of 1.3. For nodule detection, Kim et al.22 reported a sensitivity of 0.83 at a false-positive rate of 0.2 per radiograph. For CT-based nodule detection CAD systems, Jacobs et al.27 reported an average sensitivity of 0.82 at a cutoff of 3 false-positives per scan; at a false-positive cutoff of about 0.2, their FROC curve yielded a sensitivity of around 0.53. To compare these results to our experiments, we can select a specific point on the FROC curve (Fig. 3B), e.g. a sensitivity of 0.59 at a false-positive rate of 0.2 per radiograph.

Comparison of performance across the literature is difficult, however, as results depend strongly on the dataset (e.g. nodule count and nodule size) and the objective of the study. Quekel et al.28 reported lesion miss rates for lung cancer on chest radiographs between 25 and 90% across different studies involving human observers. Therefore, comparisons of ROC and FROC results between different papers should be interpreted with caution.

Besides the potential of deep learning systems, this study also shows that before clinical use of a deep learning system, it has to be carefully assessed how uncommon image characteristics can contribute to false decisions of CNNs. While we could have designed an ideal dataset without foreign bodies, a notable feature of our dataset is that we intentionally included foreign bodies in the training and test sets. Our dataset is therefore closer to clinical routine and tests the robustness of the detection algorithm. An important finding of this study was that the trained CNN produced only a few false-positive nodule detections due to foreign bodies (in two out of 59 radiographs). This low number should not be neglected, however: in clinical routine, many patients have foreign bodies such as ECG electrodes, typically seen in inpatient treatment, or ports in oncological patients with a history of chemotherapy.

The present study has some limitations. Above all, the size of the dataset was rather small: we used 411 radiographs for training, whereas other studies use larger datasets (e.g. 11,734 images for training by McKinney et al.6 for mammography), which could further improve performance. Next, we only utilized PA radiographs for the detection task. The use of additional lateral chest radiographs could increase performance further, but would require additional segmentations from human experts.

Moreover, we only used a single-center dataset from our institution, which may limit the ability to transfer the model to different populations and devices29. Last, we limited the CNN input resolution to \(512 \times 512\) pixels in order to reduce the computational workload.

Conclusion

In this study, we trained and evaluated a RetinaNet-based CNN and conducted a reader study. In summary, the presented CNN has the potential to support radiologists in clinical routine and is robust to foreign bodies. The CNN's decisions can be followed by inspecting individual lesion scores and box predictions, which is an advantage over other CNN architectures.

As a few foreign body detections still occur, future work has to investigate whether it is sufficient to train with a larger dataset or whether an auxiliary CNN30 is needed to identify abnormal cases and react correspondingly. With advances in healthcare digitisation, information about foreign bodies may soon also be available in machine-readable form. Such information, stored in patient records, could be used to alter the CNN's decisions (e.g. to invalidate lesion scores in the region of a known pacemaker). Furthermore, to enhance classification performance, we plan to collect and annotate more data. Additionally, the CNN could be trained to detect multiple pathologies, as done for case-level annotations in prior studies10.

Materials and methods

Dataset

Data access was approved by the institutional ethics committee at Klinikum Rechts der Isar (Ethikvotum 87/18 S) and the data was anonymized. The ethics committee waived the need for informed consent. All research was performed in accordance with relevant guidelines and regulations. A dataset of 391 CXRs (chest PA) was collected from our institution's picture archiving and communication system (PACS). Patient demographics for the training and test sets are shown in Table 3. Additional clinical information from the medical report, such as follow-up CT scans, was available to verify the diagnosis. Case-level ground-truth labels (unsuspicious or nodulous) were assigned based on the diagnosis of two radiologists: the first radiologist made the diagnosis in clinical routine and a second radiologist (JB, 3 years of experience in chest imaging) verified and segmented the nodules retrospectively using our in-house web-based platform. For the reader study test set, one more radiologist (DP, 12 years of experience) verified the segmented nodules.

From the segmentations, bounding boxes were extracted based on the segmentation boundaries. Of the radiographs with nodules, 257 were used for training. The training data was supplemented by the Japanese Society of Radiological Technology (JSRT) dataset31, from which 154 additional radiographs with annotated nodules were obtained. The total number of radiographs used for training was therefore 411.

Additionally, lung segmentations for all 247 JSRT files were obtained from the segmentation in chest radiography (SCR) database32 in order to train a lung segmentation network. Note that the data for lung segmentation also includes 93 additional non-nodulous images from the JSRT database. For lung segmentation, the train, validation and test set sizes were set to 157, 40 and 50, respectively.

Table 3 Patient demographics for training and test subsets. Mass size is given as a fraction of the radiograph size (1.0 would indicate that every pixel of the radiograph belongs to a nodule). As segmentations were unavailable for the screening and foreign body (FB) test sets, nodule mass and location are not provided for these sets. Multiple foreign bodies may occur within a single radiograph. Secondary pathologies (SP) were excluded from the reader study. AM indicates acromastinum-induced artifacts, which often show nodule-like morphological characteristics.
Figure 5

General workflow for training and test phase.

Network training

The general workflow for network training is illustrated in Fig. 5. For nodule detection, we employed a RetinaNet architecture18. This architecture has been successfully utilized in prior literature6, 22 for lesion detection in radiographs. It takes a preprocessed radiograph as input and outputs multiple box coordinates of nodule locations with corresponding confidence scores.

For preprocessing, images were resampled to \(512 \times 512\) pixels. Afterwards, histogram equalization was performed, and the resulting intensities were normalized to values between 0 and 1.
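A minimal sketch of this preprocessing pipeline, assuming scikit-image is used (the original implementation details are not specified):

```python
import numpy as np
from skimage import exposure
from skimage.transform import resize

def preprocess(radiograph: np.ndarray) -> np.ndarray:
    """Resample to 512 x 512, apply histogram equalization, scale to [0, 1]."""
    img = resize(radiograph.astype(np.float32), (512, 512), preserve_range=True)
    img = exposure.equalize_hist(img)  # histogram equalization, output already in [0, 1]
    return (img - img.min()) / (img.max() - img.min() + 1e-8)  # explicit normalization
```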

Training was performed using a batch size of 1 with 1000 steps per epoch. For the utilized loss function (focal loss), the hyperparameters were set to \(\alpha = 0.25, \gamma = 2.0\). The initial learning rate was set to \(10^{-5}\) and reduced by a factor of 0.1 after 3 epochs of stagnating loss (\(\delta = 0.0001\)). The network was trained for 50 epochs in total. Data augmentation transformations included contrast, brightness, shear, scale, flip and translation. From the training set, 80 percent of the radiographs were used for training and 20 percent for validation. None of the training or validation data was part of the reader study or foreign body test sets. Models were implemented based on keras-retinanet33 using Tensorflow34 and Keras35. As the RetinaNet backbone, ResNet-10136 was used.
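A sketch of this configuration with the keras-retinanet API is given below; the choice of optimizer and the training generator are assumptions, as they are not stated above:

```python
from keras.callbacks import ReduceLROnPlateau
from keras.optimizers import Adam
from keras_retinanet import losses, models

# ResNet-101 backbone with a single object class ("nodule").
model = models.backbone('resnet101').retinanet(num_classes=1)
model.compile(
    loss={
        'regression': losses.smooth_l1(),
        'classification': losses.focal(alpha=0.25, gamma=2.0),  # focal loss hyperparameters
    },
    optimizer=Adam(lr=1e-5),  # initial learning rate; Adam itself is an assumption
)

# Reduce the learning rate by a factor of 0.1 after 3 epochs of stagnating loss.
reduce_lr = ReduceLROnPlateau(monitor='loss', factor=0.1, patience=3, min_delta=1e-4)

# train_generator (assumed) yields augmented batches of size 1.
model.fit_generator(train_generator, steps_per_epoch=1000, epochs=50,
                    callbacks=[reduce_lr])
```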

To invalidate extrathoracic nodule detections made by RetinaNet, an additional lung segmentation network was developed. For lung segmentation, a U-Net15-like architecture was applied, as illustrated in Fig. 6. U-Net-like architectures have been successfully applied for lung segmentation in previous literature16. Training masks were generated by combining the left lung, right lung and heart masks from the SCR dataset. An Adam optimizer with a learning rate of \(10^{-4}\) was used, and the total number of epochs was set to 30. Augmentation operations included zoom, height shift and rotation. As loss function, the Dice loss according to37 was used.
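A common soft formulation of the Dice loss in Keras is sketched below; the exact formulation of37 may differ in details such as the smoothing term:

```python
import keras.backend as K

def dice_loss(y_true, y_pred, smooth=1.0):
    """Soft Dice loss: 1 minus the Dice coefficient over the flattened masks."""
    y_true_f = K.flatten(y_true)
    y_pred_f = K.flatten(y_pred)
    intersection = K.sum(y_true_f * y_pred_f)
    dice = (2.0 * intersection + smooth) / (K.sum(y_true_f) + K.sum(y_pred_f) + smooth)
    return 1.0 - dice
```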

Figure 6

Utilized U-Net architecture for lung segmentation.

Metrics

In each radiograph, we evaluated the number of true-positives (TP), false-positives (FP) and false-negatives (FN). True-negatives were not counted, as these would include all possible remaining boxes within the radiograph. To determine the aforementioned numbers, a distance measurement is required; we utilized a method similar to Shapira et al.25. Within a single radiograph, first the centers of mass of all ground-truth and predicted lesions are determined. Next, for each pair \((G_i, P_j)\) of a ground-truth lesion \(G_i\) and a predicted lesion \(P_j\), the Euclidean distance is calculated. If this distance is below a certain threshold D, the nodule counts as a true-positive. If there is no neighbour within the distance D for a \(G_i\) or \(P_j\), the nodule counts as a false-negative or false-positive, respectively. For a 512 × 512 pixel image we set D to 23, meaning that a distance below 23 pixels between the ground-truth and predicted lesion centers yields a TP. To determine this value, we asked the radiologist who annotated the ground-truth to mark the lesion centers in an additional experiment and calculated the distances between the ground-truth segmentation centers of mass and the marked lesion centers; the maximum of these distances was 23 pixels. Furthermore, the sensitivity of the lesion detection can be controlled by ignoring predictions below a certain lesion score: a lower threshold usually increases sensitivity, but also the number of false-positives. This trend was visualized for different thresholds using a FROC curve, similar to Kim et al.22. For the absolute true-positive and false-positive numbers of the RetinaNet results, we set a threshold of 0.6, which yields a lower false-positive rate than that of the radiologists and therefore makes the absolute number of true-positives comparable. For the evaluation of the nodule location detection task, the returned boxes were analyzed with respect to the ground-truth annotations using the described metric.
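A sketch of this matching procedure is given below; greedy nearest-neighbour pairing is an assumption, as the exact pairing strategy is not fully specified in the text:

```python
import numpy as np

def match_lesions(gt_centers, pred_centers, threshold=23):
    """Count TP/FP/FN by center-of-mass distance, as described above.

    gt_centers, pred_centers: lists of (row, col) lesion centers
    for a single 512 x 512 radiograph.
    """
    unmatched_gt = list(gt_centers)
    tp = fp = 0
    for p in pred_centers:
        # Euclidean distance to every still-unmatched ground-truth lesion.
        dists = [np.linalg.norm(np.subtract(p, g)) for g in unmatched_gt]
        if dists and min(dists) < threshold:
            tp += 1
            unmatched_gt.pop(int(np.argmin(dists)))  # each GT lesion matches once
        else:
            fp += 1
    fn = len(unmatched_gt)  # ground-truth lesions with no prediction nearby
    return tp, fp, fn
```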

Additionally, a case-level score is required for the screening task; it indicates whether there are one or more nodules in the radiograph. As we retrieved individual nodule scores from the RetinaNet predictions, we chose the maximum of all nodule scores within a radiograph as the case-level score.
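In code, this reduction is a one-liner (lesion_scores is a hypothetical list of per-nodule scores for one radiograph):

```python
# Case-level score for screening: the maximum lesion score in the radiograph,
# or 0.0 if no nodule was detected at all.
case_score = max(lesion_scores, default=0.0)
```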

Reader study setup

For the reader study, two radiologists (CML and JA, with 4 and 6 years of experience, respectively) interpreted 75 chest PA (posterior-anterior) radiographs. In order to simulate a clinical setting, each radiologist was given a time constraint of 10 seconds per radiograph. The assignment was to mark all nodules with a mouse click using our in-house web-based platform. At least one nodule occurred in 36 radiographs; the total nodule count was 65 and the average nodule count in radiographs with nodules was 1.8 ± 1.6.

Statistical analysis

The bootstrap approach38 was used to calculate the CI of the ROC AUC retrieved in the screening task for the RetinaNet architecture. We conducted the following experiment with 1000 replications: in each replication, we drew 75 random samples from the test set (with replacement) and calculated the ROC AUC from these samples. To retrieve the 95% confidence interval, we sorted the resulting AUC values in increasing order and took the values at the 2.5% and 97.5% percentiles as the minimum and maximum of the CI, respectively.
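A sketch of this procedure, assuming scikit-learn for the AUC computation:

```python
import numpy as np
from sklearn.metrics import roc_auc_score

def bootstrap_auc_ci(y_true, y_score, n_boot=1000, seed=0):
    """95% bootstrap CI for the ROC AUC, as described above."""
    rng = np.random.RandomState(seed)
    y_true, y_score = np.asarray(y_true), np.asarray(y_score)
    aucs = []
    for _ in range(n_boot):
        idx = rng.randint(0, len(y_true), len(y_true))  # resample with replacement
        if len(np.unique(y_true[idx])) < 2:
            continue  # AUC is undefined without both classes in the sample
        aucs.append(roc_auc_score(y_true[idx], y_score[idx]))
    return np.percentile(aucs, 2.5), np.percentile(aucs, 97.5)
```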