Identification of kidney stones in KUB X-ray images using VGG16 empowered with explainable artificial intelligence

Ahmed, Fahad; Abbas, Sagheer; Athar, Atifa; Shahzad, Tariq; Khan, Wasim Ahmad; Alharbi, Meshal; Khan, Muhammad Adnan; Ahmed, Arfan

doi:10.1038/s41598-024-56478-4

Download PDF

Article
Open access
Published: 14 March 2024

Identification of kidney stones in KUB X-ray images using VGG16 empowered with explainable artificial intelligence

Fahad Ahmed¹,
Sagheer Abbas²,
Atifa Athar³,
Tariq Shahzad⁴,
Wasim Ahmad Khan¹,
Meshal Alharbi⁵,
Muhammad Adnan Khan^6,7,8 &
…
Arfan Ahmed⁹

Scientific Reports volume 14, Article number: 6173 (2024) Cite this article

1562 Accesses
2 Altmetric
Metrics details

Subjects

Abstract

A kidney stone is a solid formation that can lead to kidney failure, severe pain, and reduced quality of life from urinary system blockages. While medical experts can interpret kidney-ureter-bladder (KUB) X-ray images, specific images pose challenges for human detection, requiring significant analysis time. Consequently, developing a detection system becomes crucial for accurately classifying KUB X-ray images. This article applies a transfer learning (TL) model with a pre-trained VGG16 empowered with explainable artificial intelligence (XAI) to establish a system that takes KUB X-ray images and accurately categorizes them as kidney stones or normal cases. The findings demonstrate that the model achieves a testing accuracy of 97.41% in identifying kidney stones or normal KUB X-rays in the dataset used. VGG16 model delivers highly accurate predictions but lacks fairness and explainability in their decision-making process. This study incorporates the Layer-Wise Relevance Propagation (LRP) technique, an explainable artificial intelligence (XAI) technique, to enhance the transparency and effectiveness of the model to address this concern. The XAI technique, specifically LRP, increases the model's fairness and transparency, facilitating human comprehension of the predictions. Consequently, XAI can play an important role in assisting doctors with the accurate identification of kidney stones, thereby facilitating the execution of effective treatment strategies.

Vision transformer and explainable transfer learning models for auto detection of kidney cyst, stone and tumor from CT-radiography

Article Open access 06 July 2022

Improving explainable AI with patch perturbation-based evaluation pipeline: a COVID-19 X-ray image analysis case study

Article Open access 09 November 2023

Self-evolving vision transformer for chest X-ray diagnosis through knowledge distillation

Article Open access 04 July 2022

Introduction

Urolithiasis, or kidney stones, is one of the most common urological conditions worldwide¹. Kidney stones are hard concretion or stone-like pieces that form in the kidneys due to dietary minerals in the urine². Symptoms, including flank pain, nausea, and vomiting, can indicate kidney stones³. While they can manifest in individuals of any gender, prevalence is higher in males, with approximately 7% of females and 13% of males experiencing them in their lifetime⁴. Factors such as dietary habits, sedentary lifestyle, diabetes mellitus, obesity, hypertension, and metabolic syndrome elevate the risk of stone formation⁵.

Medical professionals use imaging techniques to identify kidney stones removed by surgical intervention. After treatment, kidney stones may recur and develop into a chronic condition after treatment and kidney malfunctions can be life-threatening⁶. The ureter can become blocked depending on the size of the stone, causing significant pain, particularly in the lower back, although it can hurt the groin⁷. Older people are more likely to report atypical or no pain when passing a stone, making diagnosing kidney stone disease challenging in this demographic⁸. Different stages of disease evaluation employ various imaging techniques. The typical imaging techniques for examining kidney stones are sonography⁹, computed tomography (CT)¹⁰, and KUB X-ray imaging¹¹. Sonography, also known as ultrasonography or simply ultrasound, is a quick, safe, and easy procedure that can provide valuable evidence for a kidney stone diagnosis. Still, its sensitivity for detecting kidney stones is limited. CT can identify kidney stones and determine their number, location, and size; however, it involves exposure to ionizing radiation. KUB X-ray can also detect kidney stones and provide essential information regarding their classification, shape, number, position, and size. In this context, the most popular method is two plain KUB X-ray imaging, which is already available, less expensive, and exposes patients to less radiation than CT¹².

One of the most crucial stages in locating, measuring, and identifying the composition of kidney stones before and during treatment involves using KUB x-ray imaging, which is also employed to evaluate prognosis. Figure 1 displays samples of KUB X-ray images depicting kidney stones and normal images used in this article.

Nephrologists typically use KUB X-ray images to identify kidney stones. This information helps determine whether the individual is healthy or a patient requiring treatment. Directing an X-ray beam through the body obtains a KUB radiograph. The resulting image appears in shades of black and white, depending on the varying densities and X-ray absorption of different body parts. Muscles and fat appear grey due to medium densities, while bones appear white due to their high density. The low density of the air in the lungs makes them appear black on the radiograph.

Given the increasing prevalence and the complexity of diagnosing kidney stones, there is an urgent need for innovative diagnostic techniques. Traditional imaging techniques may produce low-quality images, making interpreting results easier. This limitation has led to the exploration of new methods, including using artificial intelligence (AI) to improve diagnostic accuracy. Integrating AI into novel diagnostic methodologies holds significant promise for refining diagnostic accuracy and facilitating therapeutic interventions¹³.

Machine learning (ML), a subfield of AI, is widely considered a powerful tool for enhancing disease prediction and diagnosis^14,15. Recently, there has been a substantial increase in the quantity and quality of research focused on utilizing ML for automatic disease identification. However, effective feature extraction methods are essential for improving ML models. The need for the manual formulation of complex hypotheses in traditional ML classifiers constitutes a disadvantage¹⁶. In contrast, deep neural networks (DNNs) can autonomously generate complex hypotheses, rendering them practical for learning nonlinear correlations¹⁶. This autonomous capability is part of why deep learning (DL), a subset of which includes DNNs, has historically diverged from traditional ML methods¹⁷. Due to their enhanced efficacy in processing large-scale data sets, their ability to extract hidden valuable knowledge from data, and to employ specific pre-trained networks, DL models are therefore frequently used in medical imaging systems.

DL can learn from and model vast amounts of data¹⁸. Due to their advanced information processing capabilities, DL models can effectively represent complex, high-dimensional datasets¹⁹. Deep models have been effectively applied in various applications, including lesion detection^20,21, classification^22,23,24,25, object tracking²⁶, image super-resolution reconstruction²⁷, image inpainting^{28,29,30,31,32}, and segmentation of medical images^33,34. Autoencoder (AE), Recurrent Neural Networks (RNN), Deep Belief Networks (DBN), Direct Deep Reinforcement Learning, Recursive Neural Networks, and Convolutional Neural Networks (CNN) are standard DL techniques³⁵. CNNs are frequently used in DL to automatically learn features, which are then used for classification and detection^36,37.

DL approaches find extensive use within the healthcare sector. However, these models, often called “black boxes,” challenge our understanding of the rationale behind their decisions or predictions, and the absence of interpretability can create issues. In this context, implementing explainable AI (XAI) techniques can enhance transparency and improve understanding of its decisions. This article introduces a model that suggests identifying kidney stones by applying transfer learning (TL) empowered by XAI.

The development of medical image processing methods has accelerated the introduction of smart prediction and diagnosis tools³⁸. AI can assist doctors in making better clinical decisions in specific functional areas of healthcare, such as radiography, or may even replace human judgment in certain circumstances³⁹. AI employs DL, ML, and other learning-based methods⁴⁰. Recent research has shown the utilization of DL techniques in specific applications, including an enhanced rime optimization-driven multi-threshold segmentation for COVID-19 X-ray images⁴¹, high-precision multiclass classification of lung diseases using customized MobileNetV2⁴², phase retrieval for X-ray differential phase contrast radiography with knowledge transfer learning⁴³, attention-based VGG-16 model for COVID-19 chest X-ray image classification⁴⁴, and pre-trained VGG-16 with CNN architecture for classifying X-ray images into normal or pneumonia categories⁴⁵. Researchers in the field of AI have created numerous ML and DL algorithms for detecting kidney stones over the past few decades.

For the Computer-aided diagnosis (CAD) of kidney stones, Ishioka et al.⁴⁶ employed a CNN (ResNet) method utilizing over 1000 KUB x-ray images from three hospitals. The researchers used 190 as testing data and 827 as training data. The test dataset’s precision, sensitivity, and F1 score were 0.49, 0.72, and 0.58, respectively. Chiang et al.⁴⁷ introduced an algorithm for detecting kidney stones using an artificial neural network (ANN) and discriminant analysis (DA) in conjunction with genetic polymorphisms and environmental factors such as milk consumption, water consumption, and outdoor activities. The research revealed that considering only genetic factors does not produce noticeable distinctions in the success of the models. However, considering the environmental and genetic factors, the ANN model outperforms the DA model with 89% accuracy.

Dussol et al.⁴⁸ implemented ANN models to examine 11 clinical and biochemical markers in 119 males with kidney stone formation and 96 males in the control group. Using linear discriminant analysis (LDA), they accurately identified 75.8% of the cases. Multivariate discriminant analysis (MVDA) accurately classified 74.4% of the patients.

In a parallel investigation, Cauderella et al.⁴⁹ implemented the ANN models in conjunction with traditional statistical methodologies to predict the recurrence of incidents within a five-year timeframe post-initial clinical diagnosis and metabolic assessment. They based their model on a dataset from 80 patients with kidney stone disease. Owing to its established reliability as a traditional statistical technique, logistic regression (LR) was selected as a comparison tool for ANN. The same training and testing sets as for ANN were used to create and test LR. The statistical software Statistical Package for the Social Sciences (SPSS) was used to develop LR. The ANN model demonstrated a predictive accuracy of 88.8%, significantly outperforming the LR model, which yielded an accuracy rate of 67.5%.

In a separate investigation conducted by Kumar and Abhishek⁵⁰, researchers made a comparative analysis to evaluate the diagnostic efficacy of three distinct neural network algorithms: Learning Vector Quantization (LVQ), Multilayer Perceptron (MLP), and Radial Basis Function (RBF). They compared the algorithms in terms of their level of accuracy, training dataset size, and the time required to construct a model. The MLP algorithm emerged as the most productive, with an accuracy of 92%, thereby establishing itself as an optimal tool for the early detection of kidney stones in patients and reducing the time required for diagnosis.

Ebrahimi and Mariano⁵¹ created a semi-automated program to enhance kidney stone detection in KUB computed tomography (KUB CT) analysis using image processing techniques and geometry principles. The program outlines and segments the kidney area, identifies kidney stones, and determines their size and position using pixel count metrics. An evaluation of the framework's performance on KUB CT scans from a cohort of 39 patients yielded a detection accuracy of 84.61%, indicating its potential to augment diagnostic precision in kidney stone identification. Kazemi and Mirroshandel⁵² proposed a novel method for predicting the chance of a kidney stone using ensemble learning. They sourced data from 936 patients diagnosed with nephrolithiasis at the Renal Center of the Razi Hospital in Rasht between 2012 and 2016. The ensemble-based model's final accuracy was 97.1%. Li and Elliot⁵³ conducted a study to assess the accuracy of natural language processing (NLP) in recognizing a group of patients (n = 1874) with positive CT KUB results for renal stones. The NLP achieved an accuracy rate of 85%.

De Perrot et al.⁵⁴ developed an ML algorithm that employs radiomics feature extraction from low-dose CT (LDCT) images to differentiate between kidney stones and phleboliths. This ML classification model, trained on radiomics characteristics, achieved an overall accuracy of 85.1% on the independent testing set. In another study, Kahani et al.⁵⁵ presented a classification technique for urinary stones utilizing KUB x-ray images. They employed the least absolute shrinkage and selection operator (LASSO) algorithm with ML classifiers. This methodology yielded a classification accuracy of 96% for kidney stones. Jungmann et al.⁵⁶ created an NLP technique trained on subjective assessment to automatically collect positive hit rates and clinical information to evaluate 1714 narrative LDCT reports. In 38% of occurrences, there was a minimum of one kidney stone, and in 45%, there was a minimum of one ureter stone.

Annameti Rohith et al.⁵⁷ developed a technique employing median and rank filters to increase the detection rate of identifying kidney stones in ultrasound images regarding accuracy and sensitivity. They evaluated the median and rank filters for their accuracies and sensitivities using a MATLAB simulation tool with a sample size 114 and a p value of 0.8. The median filter achieved an accuracy of 86.4%, the rank filter attained an accuracy of 82.2%, the median filter's sensitivity was 87.7%, and the rank filter's sensitivity was 82.5%. The median filter significantly outperformed the rank filter in both accuracy and sensitivity. Suresh and Abhishek⁵⁸ proposed image-processing techniques to detect kidney stones in KUB ultrasound images, including pre-processing, segmentation, and morphology. Their model achieved an accuracy of 92.57% in kidney stone detection.

To discriminate between distal ureteric calculi and phleboliths using the characteristics of non-contrast CT (NCCT) images, Jendenber et al.⁵⁹ trained and created a CNN model. They then compared their findings to the assessments of seven professional radiologists. The radiologists' accuracy was 86%, whereas the CNN model's was significantly higher at 92%. Cui et al.⁶⁰ proposed a DL and threshold-based model for detecting kidney stones. They performed experiments employing a small dataset of 625 CT images and achieved an accuracy of 90.30% and a sensitivity of 95.9%.

Yildirim et al.⁶¹ proposed a DL model for automated kidney detection utilizing 1799 coronal CT images. For kidney stone detection, they used XResNet-50. Using CT images to identify kidney stones, the designed automated model obtained a 96.82% identification rate. Tsitsiflis et al.⁶² constructed an ANN to evaluate extracorporeal shockwave lithotripsy (ESWL) parameters in patients with urinary lithiasis. Medical data from 716 patients were collected. 549 were used for training, 167 for testing, and 12 nodes were used as inputs for the ANN. The ANN achieved a testing accuracy of 81.43%.

Valencia et al.⁶³ introduced an image-processing methodology for detecting kidney stones in CT scans. The study comprised four steps: image preprocessing with a median filter, segmentation using the k-means clustering algorithm, kidney stone detection, and classification. The team gathered data from approximately 40 patients diagnosed with kidney stone diseases, utilizing CT scans in a clinical setting. The novel approach in this study aimed to detect boundaries and segment areas and enhance kidney stone detection through pixel-level analysis. This methodology enables both the localization of kidney stones and the quantification of affected patients. The algorithm achieved an accuracy rate of 92.5%.

While existing literature has made valuable contributions to the field, some areas could benefit from further exploration (Table 1 outlines gaps identified in previous research). Given the identified research gaps, our proposed method aims to overcome these limitations and drive progress in kidney stone identification. The main motivations and innovations of our work are outlined below:

1.
The studies encompassed in the review, ranging from references^{47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63}, have not incorporated data augmentation methodologies. Data augmentation methodologies improve model performance, reduce overfitting, and enhance the ability of the model to generalize to new, unseen data.
2.
Previous literature^{47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63} may have yet to attain optimal accuracy in identifying and predicting kidney stones. Improved accuracy increases the chances of identifying kidney stones.
3.
Current models could benefit from enhanced transparency and fairness to improve the interpretability of their predictions. A deeper understanding of the decision-making process and contributing factors is essential for achieving more transparent, fair, and effective diagnostic outcomes.

Table 1 Limitations and outcomes of previous work.

Full size table

For this paper, the main contributions are as follows:

1.
The proposed research introduces a novel deep TL model that autonomously extracts relevant features from KUB X-ray images. This model successfully identifies the presence of kidney stones in these images.
2.
The proposed model uses various performance measures, including accuracy, misclassification rate, precision, sensitivity, specificity, false positive rate (FPR), false negative rate (FNR), and F1 Score. The evaluations show that the model performs reliably and commendably.
3.
The study conducts a comparative analysis between the proposed model and existing methodologies documented in the literature^{47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63}. This evaluation reveals that the proposed model achieves higher accuracy than previous approaches, thus showcasing its superiority in kidney stone identification.
4.
The research includes a technique called XAI, specifically layer-wise relevance propagation (LRP), to improve the transparency and fairness of the model's predictions. LRP helps clarify the reasoning behind the model's predictions, thereby promoting transparency and fairness in the kidney stone identification process.

The rest of the article is divided into the following sections: The proposed model's methodology is described in Section “Methodology”. Simulation and results are presented in Section “Simulation and results”. The conclusion is presented in Section “Conclusion”. Limitations and future work are briefly discussed in Section “Limitations and future work”.

Methodology

The proposed kidney stone identification model employs DL empowered with XAI (Fig. 2). The model consists of five layers and two phases: training and validation. During the training phase, Layer 1 is dedicated to acquiring raw kidney-ureter-bladder (KUB) x-ray images, categorized as either 'kidney stone' or 'normal.' These images are high-resolution JPEG files with dimensions exceeding 2000 × 2000 pixels. In Layer 2, raw data undergoes preprocessing per the requirements of the DL model. The images are resized to dimensions of 224 × 224 × 3 and converted into PNG format. In this context, '224 × 224' signifies length and width, and '3' denotes the number of channels. Following preprocessing, data is separated between training and testing, with 70% allocated for training and 30% for testing. The pre-trained VGG16 model is imported and customized to the DL model.

Layer 3 describes the predictions made by the DL model. While these predictions hold potential utility in decision-making, they do not offer insights into the model's reasoning, thus making it a 'black box.' To mitigate this, Layer 4 incorporates XAI into the model. This feature compares the DL model's predictions with the preprocessed data to furnish explanations. If the explanations are unfair, the model is retrained; otherwise, it is stored on the cloud.

Layer 5 represents the validation phase of the model, wherein the trained model is imported from the cloud to validate the pre-processed data acquired from various sources. The proposed model intelligently classifies the KUB x-ray images into two classes with explanations. Following the successful identification of kidney stones, the system saves the corresponding data.

Table 2 represents the pseudocode for the proposed kidney stone identification model.

Table 2 Pseudocode for proposed kidney stone identification model.

Full size table

KUB x-ray images dataset

KUB X-ray images were acquired from the Department of Urology and Kidney Transplantation at MAYO Hospital in Lahore, Pakistan. The dataset consists of 500 KUB X-ray images selected from patients who had undergone radiographic examinations for kidney stones between February 2021 and October 2022. The images were obtained through the anteroposterior (AP) view. Two radiology specialists examined the collected KUB X-ray images and determined the presence or absence of kidney stones. Of the 500 images, 250 were identified as exhibiting kidney stones, while the remaining 250 were not. Subsequently, the images were augmented into 14,265 KUB x-ray images. Within this augmented dataset, 8941 images displayed instances of kidney stones, while 5324 images represented normal cases. As mentioned in the methodology section, the dataset is divided into 70:30. The number of training images is 9986 (kidney stone 6259, normal 3727), while the number of testing images is 4279 (kidney stone 2682, normal 1597).

TL

TL is a technique for applying a model's previously acquired knowledge to a new dataset⁶⁴. TL enables the utilization of highly competent, pre-trained networks rather than creating CNNs for each application. The core idea is that specific applications can be modeled by training a large model on a diverse and broad dataset. The initial layers will learn generic properties such as color, while later layers will serve particular applications. A pre-trained model, VGG16, is employed in this article to identify and predict kidney stones.

VGG16

In 2014, Simonyan and Zisserman introduced VGG16, a TL-based CNN model characterized by a sequential network structure⁶⁵. VGG16 is a deep CNN architecture with a total of 16 layers^65,66, which includes 13 convolutional layers and 3 fully connected dense layers (Fig. 3).

The original VGG16 model was initially trained to classify 1000 different object classes. However, the two classes of KUB x-ray images used in this study cannot be directly classified by the original VGG16 model. The current study introduces a model to classify KUB x-rays using a modified version of the VGG16 model (Fig. 4). This modified version of the VGG16 model enables the direct classification of the two KUB x-ray classes.

XAI

According to⁶⁷, explainability means the capacity to communicate how an AI decision has reached a broader range of end users in ways humans can comprehend. Many AI models, particularly those based on DL, have the potential to be challenging to understand. These models often involve millions of parameters and rely on complex patterns and correlations that are difficult to decipher. This complexity can raise concerns about bias, privacy, ethics, fairness, and transparency.

To address these concerns, XAI refers to the capability of AI systems to provide understandable and interpretable explanations for their decisions and actions, techniques that aim to enhance the comprehensibility and transparency of AI models. In this study, the LRP technique is used to determine which features of the DL model are responsible for specific predictions.

Layer-wise relevance propagation

For enhancing the explainability of networks utilizing the back-propagation algorithm, one of the principal algorithms employed is LRP⁶⁸. A backward propagation technique called LRP gives relevance scores to a model's input features based on how much they contribute to the output. The most crucial neurons for the prediction are then identified through the model layers using the relevance scores. Additionally, LRP deals with the shortcomings of shattered gradients in gradient methods (Grad-CAM) and perturbation methods (occlusion maps)⁶⁹.

Simulation and results

We utilized Google Colab and PyTorch for simulation and obtaining results. Google Colab furnished the necessary computational resources, while PyTorch was an efficient framework for constructing and training DL models. Our performance assessment employed the metrics derived from Eqs. (1–8)^70,71, wherein Kp/Sp represents true positives, Km/Sm denotes true negatives, Ke/Se signifies false positives, and Kn/Sn indicates false negatives. The computed metrics encompassed accuracy, misclassification rate, precision, sensitivity, specificity, FPR, FNR, and F1 Score.

Accuracy Accuracy is the proportion of correctly classified instances out of the total predictions made by a model, often represented as a percentage.

$${\text{Accuracy}} = \frac{{({\text{K}}_{{\text{p}}} /{\text{S}}_{{\text{p}}} ) + \left( {{\text{K}}_{{\text{m}}} /{\text{S}}_{{\text{m}}} } \right){ }}}{{({\text{K}}_{{\text{p}}} /{\text{S}}_{{\text{p}}} ) + \left( {{\text{K}}_{{\text{m}}} /{\text{S}}_{{\text{m}}} } \right){ } + ({\text{K}}_{{\text{e}}} /{\text{S}}_{{\text{e}}} ) + \left( {{\text{K}}_{{\text{n}}} /{\text{S}}_{{\text{n}}} } \right){ }}}{*}100$$

(1)

Misclassification rate The misclassification rate is the proportion of incorrectly classified instances out of the total predictions, usually expressed as a percentage or a fraction.

$${\text{Misclassification rate}} = \frac{{({\text{K}}_{{\text{e}}} /{\text{S}}_{{\text{e}}} ) + \left( {{\text{K}}_{{\text{n}}} /{\text{S}}_{{\text{n}}} } \right){ }}}{{({\text{K}}_{{\text{p}}} /{\text{S}}_{{\text{p}}} ) + \left( {{\text{K}}_{{\text{m}}} /{\text{S}}_{{\text{m}}} } \right){ } + ({\text{K}}_{{\text{e}}} /{\text{S}}_{{\text{e}}} ) + \left( {{\text{K}}_{{\text{n}}} /{\text{S}}_{{\text{n}}} } \right){ }}}{*}100$$

(2)

Precision Precision measures the ratio of true positive predictions to the total positive predictions made by a model, emphasizing the accuracy of positive classifications.

$${\text{Precision}} = \frac{{({\text{K}}_{{\text{p}}} /{\text{S}}_{{\text{p}}} ){ }}}{{({\text{K}}_{{\text{p}}} /{\text{S}}_{{\text{p}}} ) + \left( {{\text{K}}_{{\text{e}}} /{\text{S}}_{{\text{e}}} } \right){ }}}{*}100$$

(3)

Sensitivity Sensitivity calculates the proportion of true positive predictions relative to all actual positive instances, indicating a model's ability to identify positives correctly.

$${\text{Sensitivity}} = \frac{{({\text{K}}_{{\text{p}}} /{\text{S}}_{{\text{p}}} ){ }}}{{({\text{K}}_{{\text{p}}} /{\text{S}}_{{\text{p}}} ) + \left( {{\text{K}}_{{\text{n}}} /{\text{S}}_{{\text{n}}} } \right){ }}}{*}100$$

(4)

Specificity Specificity quantifies the ratio of true negative predictions to all actual negative instances, measuring a model's capacity to identify negatives correctly.

$${\text{Specificity}} = \frac{{\left( {{\text{K}}_{{\text{m}}} /{\text{S}}_{{\text{m}}} } \right)}}{{\left( {{\text{K}}_{{\text{m}}} /{\text{S}}_{{\text{m}}} } \right) + \left( {{\text{K}}_{{\text{e}}} /{\text{S}}_{{\text{e}}} } \right)}}{*}100$$

(5)

FPR FPR is the proportion of false positive predictions relative to all actual negative instances, demonstrating the model's tendency to misclassify negatives as positives.

$${\text{FPR}} = \frac{{\left( {{\text{K}}_{{\text{e}}} /{\text{S}}_{{\text{e}}} } \right)}}{{\left( {{\text{K}}_{{\text{e}}} /{\text{S}}_{{\text{e}}} } \right) + \left( {{\text{K}}_{{\text{m}}} /{\text{S}}_{{\text{m}}} } \right)}}{*}100$$

(6)

FNR FNR calculates the ratio of false negative predictions to all actual positive instances, illustrating the model's likelihood to misclassify positives as negatives.

$${\text{FNR}} = \frac{{\left( {{\text{K}}_{{\text{n}}} /{\text{S}}_{{\text{n}}} } \right)}}{{\left( {{\text{K}}_{{\text{n}}} /{\text{S}}_{{\text{n}}} } \right) + ({\text{K}}_{{\text{p}}} /{\text{S}}_{{\text{p}}} ){ }}}{*}100$$

(7)

F1 Score The F1 Score is the harmonic mean of precision and sensitivity, providing a single metric that balances both aspects of classification accuracy.

$${\text{F}}1{\text{ Score}} = \frac{{2{*}\left( {\text{Precision*Sensitivity}} \right)}}{{{\text{Precision}} + {\text{Sensitivity}}}}$$

(8)

For the model's training hyperparameters, we maintained the mini-batch size at 32, determined the optimal training epoch to be 10, applied a learning rate of 0.00001 during network training, and utilized the Adam optimization algorithm for the training process (Table 3 outlines each hyperparameter, accompanied by an explanatory note).

Table 3 Training hyperparameters.

Full size table

Subsequently, we tested the modified VGG16 model to analyze a dataset comprising 4279 KUB X-rays, aiming to distinguish between X-rays featuring kidney stones and those categorized as normal (Fig. 5; Table 4). Regarding kidney stone X-rays from KUB, the model identified 2612 X-rays as kidney stones (true positives). While mistakenly labeling 70 X-rays as normal (false positives). For normal X-rays of KUB, the model correctly identified 1556 X-rays as normal (true negatives) and erroneously labeled 41 X-rays as kidney stones (false negatives).

Table 4 Statistical significance of each criterion for the modified VGG16 model.

Full size table

Table 4 illustrates the statistical significance of each criterion for the modified version of the VGG16, including accuracy, misclassification rate, precision, sensitivity, specificity, FPR, FNR, and F1 Score.

Employing the LRP technique on the modified VGG16 model allowed us to pinpoint the regions in the KUB X-ray image that significantly contribute to the model's prediction of kidney stone presence. Notably, highlighted areas in KUB X-rays indicate the presence of kidney stones, while normal X-rays exhibit clarity and lack visible indications (Fig. 6).

Numerous ways have been utilized to detect kidney stones; nevertheless, TL is a revolutionary method for identifying the presence of kidney stones. Table 5 compares the proposed model's performance to previously reported state-of-the-art literature. The proposed model integrates modified VGG16 architecture with the XAI technique, significantly advancing kidney stone identification. This model distinguishes itself through exceptional performance, achieving a remarkable testing accuracy of 97.41% and an impressively low misclassification rate of 2.59%. Utilizing the XAI technique enhances the model's transparency and interpretability, addressing critical concerns related to the opacity of DL models. Additionally, the model benefits from a substantial dataset of 14,265 KUB x-ray images, enabling it to capture intricate patterns effectively.

Table 5 Comparison of the proposed model with state-of-the-art literature.

Full size table

Conclusion

Kidney stone formation can lead to a significant obstruction in renal function, consequently affecting human health and survival. As a result, the prompt identification and prediction of kidney stones assume critical importance. Recent technological advancements have enabled the broad integration of ML and DL methodologies into diagnosing kidney stones. In this study, we introduced and used a modified VGG16 model to identify kidney stones in KUB x-ray images. The results of our experiments show that the modified VGG16 model has an accuracy of 97.41% in identifying kidney stones within KUB x-ray images.

DL models like VGG16 can be perceived as “black boxes” because they lack transparency or prediction fairness. In addressing this issue, the study employs the XAI technique LRP to elucidate the model's predictions, enhancing users’ comprehension of the rationale behind the decision-making process. This approach provides a transparent and effective solution for arriving at definitive diagnostic conclusions, reducing the time needed for diagnosis and enhancing diagnostic accuracy.

Limitations and future work

One of the critical limitations of our research is the availability of high-quality and diverse medical image data of KUB X-rays of kidney stones. The quality and diversity of the dataset are crucial in identifying kidney stones. In the future, overcoming this limitation will require continued efforts to collect, curate, and make a broader range of medical image data more readily available to improve model performance.

Even using XAI techniques such as LRP, the model’s interpretation may still be inconspicuous or might not give meaningful insight into the model's decision-making. In the future, further research in advanced XAI techniques and methodologies will have the potential to visually enhance the transparency, fairness, and interpretability of the model’s predictions, allowing users to understand better and trust the model.

The development of AI-based medical diagnosis enables personalized and science-based approaches to medical care. However, ethical considerations must be carefully weighed; strategies must be developed to mitigate patient privacy data security and algorithmic bias and to minimize unintended consequences of AI-based medical diagnosis. Blockchain technology can address patient privacy and data security in future work by providing decentralized storage and secure access controls for patient data. With blockchain, the training of the AI models is transparent and auditable, improving algorithmic bias and enabling accountability, which is a cornerstone of trusted AI-based medical diagnosis.

The current study focused on developing and evaluating the proposed model. In the future, the proposed model's computational complexity and resource requirements will be analyzed to determine its size.

Data availability

The dataset & Simulation files used during the current study are available from the corresponding author upon reasonable request.

Code availability

Current code version: V1.0. Permanent link to code/repository used for this code version: https://github.com/deepfindr/xai-series/blob/master/05_lrp.py. Software code languages, tools and services used: Python 3 (Used in Google Colab). Compilation requirements, operating environments, and dependencies: CUDA GPU, Microsoft Windows. Support email for questions: ara4013@qatar-med.cornell.edu , adnan@gachon.ac.kr.

References

Lang, J. et al. Global trends in incidence and burden of urolithiasis from 1990 to 2019: An analysis of global burden of disease study data. Eur. Urol. Open Sci. 35, 37–46. https://doi.org/10.1016/j.euros.2021.10.008 (2022).
Article PubMed PubMed Central Google Scholar
Vineela, T., Akhila, R. V. G. L., Anusha, T., Nandini, Y. & Bindu, S. Kidney stone analysis using digital image processing. Int. J. Res. Eng. Sci. Manag. 3(3), 275–278 (2020).
Google Scholar
Alelign, T. & Petros, B. Kidney stone disease: An update on current concepts. Adv. Urol. 2018, 1–12 (2018).
Article Google Scholar
Solie, I. & Situm, M. Kidney stones: Is there a way to see them better? In 2022 7th International Conference on Smart and Sustainable Technologies (SpliTech), Split/Bol, Croatia 9–11 (2022).
Caglayan, A., Horsanali, M. O., Kocadurdu, K., Ismailoglu, E. & Guneyli, S. Deep learning model-assisted detection of kidney stones on computed tomography. Int. Braz. J. Urol. 48(5), 830–839. https://doi.org/10.1590/S1677-5538.IBJU.2022.0132 (2022).
Article PubMed PubMed Central Google Scholar
Vinoth, R. & Bommannaraja, K. FPGA design of efficient kidney image classification using algebric histogram feature model and sparse deep neural network (SDNN) techniques. In 2017 Conference on Emerging Devices and Smart Systems (ICEDSS), Mallasamudram, India 1–6 (2017). https://doi.org/10.1109/ICEDSS.2017.8073687.
Kidney Stones: Symptoms, Diagnosis & Treatment - Urology Care Foundation. Accessed Sep 19, 2023. [Online]. https://www.urologyhealth.org/urology-a-z/k/kidney-stones
Krambeck, A. E. et al. Effect of age on the clinical presentation of incident symptomatic urolithiasis in the general population. J. Urol. 189(1), 158–164. https://doi.org/10.1016/j.juro.2012.09.023 (2013).
Article PubMed Google Scholar
Ulusan, S., Koc, Z. & Tokmak, N. Accuracy of sonography for detection renal stone: Comparison with CT. J. Clin. Ultrasound 35(5), 256–261. https://doi.org/10.1002/jcu (2007).
Article PubMed Google Scholar
Odenrick, A., Kartalis, N., Voulgarakis, N., Morsbach, F. & Loizou, L. The role of contrast-enhanced computed tomography to detect renal stones. Abdom. Radiol. 44(2), 652–660. https://doi.org/10.1007/s00261-018-1778-7 (2019).
Article Google Scholar
Sandhu, C., Anson, K. M. & Patel, U. Urinary tract stones—Part I: Role of radiological imaging in diagnosis and treatment planning. Clin. Radiol. 58(6), 415–421. https://doi.org/10.1016/S0009-9260(03)00103-X (2003).
Article CAS PubMed Google Scholar
Thomson, J. M. Z., Glocer, J., Abbott, C., Maling, T. M. J. & Mark, S. Computed tomography versus intravenous urography in diagnosis of acute flank pain from urolithiasis: A randomized study comparing imaging costs and radiation dose. Diagnostic Radiol. 45(3), 291–297 (2001).
CAS Google Scholar
Kim, H. S., Kim, E. J. & Kim, J. Y. Emerging trends in artificial intelligence-based urological imaging technologies and practical applications. Int. Neurourol. J. 27(Suppl 2), S74–S81. https://doi.org/10.5213/inj.2346286.143 (2023).
Article Google Scholar
Kaur, M., Gianey, H. K., Singh, D., Sabharwal, M. & Science, C. Multi-objective differential evolution based random forest for e-health applications. Mod. Phys. Lett. B 33(5), 1–13. https://doi.org/10.1142/S0217984919500222 (2019).
Article CAS Google Scholar
Rahmani, A. M. et al. Machine learning (Ml) in medicine: Review, applications, and challenges. Mathematics 9(22), 1–52. https://doi.org/10.3390/math9222970 (2021).
Article ADS Google Scholar
Dong, S., Wang, P. & Abbas, K. A survey on deep learning and its applications. Comput. Sci. Rev. 40, 1–22. https://doi.org/10.1016/j.cosrev.2021.100379 (2021).
Article MathSciNet Google Scholar
Abdullah, A. A., Hassan, M. M. & Mustafa, Y. T. A review on bayesian deep learning in healthcare: Applications and challenges. IEEE Access 10, 36538–36562. https://doi.org/10.1109/ACCESS.2022.3163384 (2022).
Article Google Scholar
Rao, T. V. N., Gaddam, A., Kurni, M. & Saritha, K. Reliance on artificial intelligence, machine learning and deep learning in the era of industry 4.0. Smart Healthc. Syst. Des. https://doi.org/10.1002/9781119792253.ch12 (2022).
Article Google Scholar
Alaskar, H. et al. Deep learning approaches for automatic localization in medical images. Comput. Intell. Neurosci. 2022, 1–17. https://doi.org/10.1155/2022/6347307 (2022).
Article Google Scholar
Yan, K., Wang, X., Lu, L. & Summers, R. M. DeepLesion: Automated mining of large-scale lesion annotations and universal lesion detection with deep learning. J. Med. Imaging 5(03), 1–11. https://doi.org/10.1117/1.jmi.5.3.036501 (2018).
Article Google Scholar
Kijowski, R., Liu, F., Caliva, F. & Pedoia, V. Deep learning for lesion detection, progression, and prediction of musculoskeletal disease. J. Magn. Reson. Imaging 52(6), 1607–1619. https://doi.org/10.1002/jmri.27001 (2020).
Article PubMed Google Scholar
Talo, M., Baloglu, U. B., Yıldırım, Ö. & Rajendra Acharya, U. Application of deep transfer learning for automated brain abnormality classification using MR images. Cogn. Syst. Res. 54, 176–188. https://doi.org/10.1016/j.cogsys.2018.12.007 (2019).
Article Google Scholar
Ozturk, T. et al. Automated detection of COVID-19 cases using deep neural networks with X-ray images. Comput. Biol. Med. 121, 1–11. https://doi.org/10.1016/j.compbiomed.2020.103792 (2020).
Article CAS Google Scholar
Kott, O. et al. Development of a deep learning algorithm for the histopathologic diagnosis and gleason grading of prostate cancer biopsies: A pilot study. Eur. Urol. Focus 7(2), 347–351. https://doi.org/10.1016/j.euf.2019.11.003 (2021).
Article PubMed Google Scholar
Shkolyar, E. et al. Augmented bladder tumor detection using deep learning. Eur. Urol. 76(6), 714–718. https://doi.org/10.1016/j.eururo.2019.08.032 (2019).
Article PubMed PubMed Central Google Scholar
Zhang, J., He, Y., Chen, W., Kuang, L.-D. & Zheng, B. CorrFormer: Context-aware tracking with cross-correlation and transformer. Comput. Electr. Eng. 114, 109075. https://doi.org/10.1016/j.compeleceng.2024.109075 (2024).
Article Google Scholar
Chen, Y., Xia, R., Yang, K. & Zou, K. MICU: Image super-resolution via multi-level information compensation and U-net. Expert Syst. Appl. 245, 123111. https://doi.org/10.1016/j.eswa.2023.123111 (2024).
Article Google Scholar
Chen, Y., Xia, R., Yang, K. & Zou, K. GCAM: lightweight image inpainting via group convolution and attention mechanism. Int. J. Mach. Learn. Cybern. https://doi.org/10.1007/s13042-023-01999-z (2023).
Article PubMed PubMed Central Google Scholar
Chen, Y., Xia, R., Yang, K. & Zou, K. DGCA: High resolution image inpainting via DR-GAN and contextual attention. Multimed. Tools Appl. 82(30), 47751–47771. https://doi.org/10.1007/s11042-023-15313-0 (2023).
Article Google Scholar
Chen, Y., Xia, R., Yang, K. & Zou, K. DARGS: Image inpainting algorithm via deep attention residuals group and semantics. J. King Saud Univ. Comput. Inf. Sci. 35(6), 101567. https://doi.org/10.1016/j.jksuci.2023.101567 (2023).
Article Google Scholar
Chen, Y., Xia, R., Yang, K. & Zou, K. MFMAM: Image inpainting via multi-scale feature module with attention module. Comput. Vis. Image Underst. 238(January 2024), 103883. https://doi.org/10.1016/j.cviu.2023.103883 (2024).
Article Google Scholar
Chen, Y., Xia, R., Yang, K. & Zou, K. DNNAM: Image inpainting algorithm via deep neural networks and attention mechanism. Appl. Soft Comput. 154, 111392. https://doi.org/10.1016/j.asoc.2024.111392 (2024).
Article Google Scholar
Hesamian, M. H., Jia, W., He, X. & Kennedy, P. Deep learning techniques for medical image segmentation: Achievements and challenges. J. Digit. Imaging 32(4), 582–596. https://doi.org/10.1007/s10278-019-00227-x (2019).
Article PubMed PubMed Central Google Scholar
Roth, H. R. et al. Deep learning and its application to medical image segmentation. Med. IMAGING Technol. 36(2), 63–71. https://doi.org/10.11409/mit.36.63 (2018).
Article Google Scholar
Shinde, P. P. & Shah, D. S. A review of machine learning and deep learning applications. In 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA), Pune, India 1–6 (IEEE, 2018).
Zhang, J., Lv, Y., Tao, J., Huang, F. & Zhang, J. A robust real-time anchor-free traffic sign detector with one-level feature. IEEE Trans. Emerg. Top. Comput. Intell. https://doi.org/10.1109/TETCI.2024.3349464 (2024).
Article Google Scholar
Islam, U., Al-atawi, A. A. & Alwageed, H. S. Detection of renal cell hydronephrosis in ultrasound kidney images: A study on the efficacy of deep convolutional neural networks. PeerJ Comput. Sci. 10, 1–28. https://doi.org/10.7717/peerj-cs.1797 (2024).
Article Google Scholar
Kaur, M. & Singh, D. Fusion of medical images using deep belief networks. Clust. Comput. 23(2), 1439–1453. https://doi.org/10.1007/s10586-019-02999-x (2020).
Article Google Scholar
Jiang, F. et al. Artificial intelligence in healthcare: Past, present and future. Stroke Vasc. Neurol. 2(4), 230–243. https://doi.org/10.1136/svn-2017-000101 (2017).
Article PubMed PubMed Central Google Scholar
Abualigah, L. et al. Wind, solar, and photovoltaic renewable energy systems with and without energy storage optimization: A survey of advanced machine learning and deep learning techniques. Energies 15(2), 1–26. https://doi.org/10.3390/en15020578 (2022).
Article CAS Google Scholar
Li, Y. et al. CDRIME-MTIS: An enhanced rime optimization-driven multi-threshold segmentation for COVID-19 X-ray images. Comput. Biol. Med. 169(February 2024), 107838. https://doi.org/10.1016/j.compbiomed.2023.107838 (2024).
Article PubMed Google Scholar
Shamrat, F. J. M. et al. High-precision multiclass classification of lung disease through customized MobileNetV2 from chest X-ray images. Comput. Biol. Med. 155(January), 106646. https://doi.org/10.1016/j.compbiomed.2023.106646 (2023).
Article PubMed Google Scholar
Tao, S. et al. Phase retrieval for X-ray differential phase contrast radiography with knowledge transfer learning from virtual differential absorption model. Comput. Biol. Med. 168(January 2024), 107711. https://doi.org/10.1016/j.compbiomed.2023.107711 (2024).
Article CAS PubMed Google Scholar
Sitaula, C. & Hossain, M. B. Attention-based VGG-16 model for COVID-19 chest X-ray image classification. Appl. Intell. 51, 2850–2863. https://doi.org/10.1007/s10489-020-02055-x (2021).
Article Google Scholar
Naveen, P. & Diwan, B. Pre-trained VGG-16 with CNN Architecture to classify X-Rays images into Normal or Pneumonia. In 2021 International Conference on Emerging Smart Computing and Informatics (ESCI). IEEE, Pune, India 102–105 (2021). https://doi.org/10.1109/ESCI50559.2021.9396997.
Ishioka, J. et al. Computer-aided diagnosis with a convolutional neural network algorithm for automated detection of urinary tract stones using kub. J. Urol. 201(4), e845 (2019).
Google Scholar
Chiang, D., Chiang, H. C., Chen, W. C. & Tsai, F. J. Prediction of stone disease by discriminant analysis and artificial neural networks in genetic polymorphisms: A new method. BJU Int. 91(7), 661–666. https://doi.org/10.1046/j.1464-410X.2003.03067.x (2003).
Article CAS PubMed Google Scholar
Dussol, B., Verdier, J. M., Le Goff, J. M., Berthezene, P. & Berland, Y. Artificial neural networks for assessing the risk of urinary calcium stone among men. Urol. Res. 34(1), 17–25. https://doi.org/10.1007/s00240-005-0006-4 (2006).
Article PubMed Google Scholar
Caudarella, R., Tonello, L., Rizzoli, E. & Vescini, F. Predicting five-year recurrence rates of kidney stones: An artificial neural network model. Arch. Ital. Urol. Androl. 83(1), 14–19 (2011).
PubMed Google Scholar
Kumar, K. & Abhishek, A. Artificial neural networks for diagnosis of kidney stones disease. Int. J. Inf. Technol. Comput. Sci. 4(7), 20–25. https://doi.org/10.5815/ijitcs.2012.07.03 (2012).
Article Google Scholar
Ebrahimi, S. & Mariano, V. Y. Image Quality Improvement in Kidney Stone Detection on Computed Tomography Images. J. Image Graph. 3(1), 40–46. https://doi.org/10.18178/joig.3.1.40-46 (2015).
Article Google Scholar
Kazemi, Y. & Mirroshandel, S. A. A novel method for predicting kidney stone type using ensemble learning. Artif. Intell. Med. 84, 117–126. https://doi.org/10.1016/j.artmed.2017.12.001 (2018).
Article PubMed Google Scholar
Li, A. Y. & Elliot, N. Natural language processing to identify ureteric stones in radiology reports. J. Med. Imaging Radiat. Oncol. 63(3), 307–310. https://doi.org/10.1111/1754-9485.12861 (2019).
Article PubMed Google Scholar
De Perrot, T. et al. Differentiating kidney stones from phleboliths in unenhanced low-dose computed tomography using radiomics and machine learning. Eur. Radiol. 29, 4776–4782 (2019).
Article PubMed Google Scholar
Kahani, M., Hariri Tabrizi, S., Kamali-Asl, A. & Hashemi, S. A novel approach to classify urinary stones using dual-energy kidney, ureter and bladder (DEKUB) X-ray imaging. Appl. Radiat. Isot. 164(May 2019), 109267. https://doi.org/10.1016/j.apradiso.2020.109267 (2020).
Article CAS PubMed Google Scholar
Jungmann, F. et al. Towards data-driven medical imaging using natural language processing in patients with suspected urolithiasis. Int. J. Med. Inform. 137, 1–21. https://doi.org/10.1016/j.ijmedinf.2020.104106 (2020).
Article Google Scholar
Rohith, A. Detection of kidney stones in ultrasound images using median filter compared with rank filter. Rev. Gestão Inovação e Tecnol. 11(4), 1096–1111. https://doi.org/10.47059/revistageintec.v11i4.2171 (2021).
Article Google Scholar
Suresh, M. B. & Abhishek, M. R. Kidney stone detection using digital image processing techniques. In Proceedings of the 3rd International Conference on Inventive Research in Computing Applications, ICIRCA 2021, Coimbatore, India 556–561 (IEEE, 2021). https://doi.org/10.1109/ICIRCA51532.2021.9544610.
Jendeberg, J., Thunberg, P. & Lidén, M. Differentiation of distal ureteral stones and pelvic phleboliths using a convolutional neural network. Urolithiasis 49(1), 41–49. https://doi.org/10.1007/s00240-020-01180-z (2021).
Article PubMed Google Scholar
Cui, Y. et al. Automatic detection and scoring of kidney stones on noncontrast CT images using S.T.O.N.E. nephrolithometry: Combined deep learning and thresholding methods. Mol. Imaging Biol. 23(3), 436–445. https://doi.org/10.1007/s11307-020-01554-0 (2021).
Article CAS PubMed Google Scholar
Yildirim, K. et al. Deep learning model for automated kidney stone detection using coronal CT images. Comput. Biol. Med. 135, 1–7. https://doi.org/10.1016/j.compbiomed.2021.104569 (2021).
Article Google Scholar
Tsitsiflis, A. et al. The use of an artificial neural network in the evaluation of the extracorporeal shockwave lithotripsy as a treatment of choice for urinary lithiasis. Asian J. Urol. 9(2), 132–138. https://doi.org/10.1016/j.ajur.2021.09.005 (2022).
Article PubMed Google Scholar
Valencia, F. A. C., Muñoz, J. J. A. M. & Montaño, F. M. J. Stone detection in kidney with image processing technique: CT images. J. Posit. Sch. Psychol. 6(6), 7643–7653 (2022).
Google Scholar
Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C. & Liu, C. A survey on deep transfer learning. In 27th International Conference on Artificial Neural Networks, Rhodes, Greece 270–279 (Springer, 2018). https://doi.org/10.1007/978-3-030-01424-7.
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. In Published as a Conference Paper at ICLR 1–14 (2015).
Althubiti, S. A., Alenezi, F., Shitharth, S., Sangeetha, K. & Reddy, C. V. S. Circuit manufacturing defect detection using VGG16 convolutional neural networks. Wirel. Commun. Mob. Comput. 2022, 1–10. https://doi.org/10.1155/2022/1070405 (2022).
Article Google Scholar
Doshi-Velez, F. & Kim, B. Towards A rigorous science of interpretable machine learning. arXiv Prepr., 1–13 (2017).
Bach, S. et al. On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS ONE 10(7), 1–46. https://doi.org/10.1371/journal.pone.0130140 (2015).
Article CAS Google Scholar
Huang, X., Jamonnak, S., Zhao, Y., Wu, T. H. & Xu, W. A visual designer of layer-wise relevance propagation models. Eurographics Conf. Vis. 40(3), 227–238 (2021).
Google Scholar
Seliya, N., Khoshgoftaar, T. M. & Van Hulse, J. A study on the relationships of classifier performance metrics. In 21st IEEE International Conference on Tools with Artificial Intelligence, Newark, NJ, USA 59–66 (2009). https://doi.org/10.1109/ICTAI.2009.25.
Ahmed, F., Khan, W. A., Iqbal, M., Riad, A., Abazeed, A., Alrababah, H. & Khan, M. F. Rock-paper-scissors image classification using transfer learning. In 2023 International Conference on Business Analytics for Technology and Security (ICBATS), Dubai, United Arab Emirates 1–6 (IEEE, 2023). https://doi.org/10.1109/ICBATS57792.2023.10111433.

Download references

Funding

This research work is supported by Qatar National Library.

Author information

Authors and Affiliations

School of Computer Science, National College of Business Administration and Economics, Lahore, 54000, Pakistan
Fahad Ahmed & Wasim Ahmad Khan
Department of Computer Sciences, Bahria University, Lahore Campus, Lahore, 54000, Pakistan
Sagheer Abbas
Department of Computer Science, Comsats University Islamabad, Lahore Campus, Lahore, 54000, Pakistan
Atifa Athar
Department of Computer Sciences, COMSATS University Islamabad, Sahiwal Campus, Sahiwal, 57000, Pakistan
Tariq Shahzad
Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, 11942, Alkharj, Saudi Arabia
Meshal Alharbi
School of Computing, Skyline University College, University City Sharjah, 1797, Sharjah, UAE
Muhammad Adnan Khan
Department of Software, Faculty of Artificial Intelligence and Software, Gachon University, Seongnam-si, 13120, Republic of Korea
Muhammad Adnan Khan
Riphah School of Computing and Innovation, Faculty of Computing, Riphah International University, Lahore Campus, Lahore, 54000, Pakistan
Muhammad Adnan Khan
AI Center for Precision Health, Weill Cornell Medicine-Qatar, Doha, Qatar
Arfan Ahmed

Authors

Fahad Ahmed
View author publications
You can also search for this author in PubMed Google Scholar
Sagheer Abbas
View author publications
You can also search for this author in PubMed Google Scholar
Atifa Athar
View author publications
You can also search for this author in PubMed Google Scholar
Tariq Shahzad
View author publications
You can also search for this author in PubMed Google Scholar
Wasim Ahmad Khan
View author publications
You can also search for this author in PubMed Google Scholar
Meshal Alharbi
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Adnan Khan
View author publications
You can also search for this author in PubMed Google Scholar
Arfan Ahmed
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

F.A., A.A., and T.S. have collected data from different resources. S.A., F.A., and M.A.K. performed formal analysis and Simulation, S.A., W.A.K., M.A., and T.S. contributed to writing—original draft preparation, A.A., M.A.K., A.A., and W.A.K.; writing—review and editing, S.A., M.A.K., and A.A.; performed supervision, A.A., M.A., and F.A.; drafted pictures and tables, A.A., F.A., and S.A.; performed revisions and improve the quality of the draft. All authors have read and agreed to the published version of the manuscript.

Corresponding authors

Correspondence to Muhammad Adnan Khan or Arfan Ahmed.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ahmed, F., Abbas, S., Athar, A. et al. Identification of kidney stones in KUB X-ray images using VGG16 empowered with explainable artificial intelligence. Sci Rep 14, 6173 (2024). https://doi.org/10.1038/s41598-024-56478-4

Download citation

Received: 19 October 2023
Accepted: 06 March 2024
Published: 14 March 2024
DOI: https://doi.org/10.1038/s41598-024-56478-4

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.