A cognitive deep learning approach for medical image processing

Fakhouri, Hussam N.; Alawadi, Sadi; Awaysheh, Feras M.; Alkhabbas, Fahed; Zraqou, Jamal

doi:10.1038/s41598-024-55061-1

Download PDF

Article
Open access
Published: 24 February 2024

A cognitive deep learning approach for medical image processing

Hussam N. Fakhouri¹,
Sadi Alawadi^2,3,
Feras M. Awaysheh⁴,
Fahed Alkhabbas^5,6 &
…
Jamal Zraqou⁷

Scientific Reports volume 14, Article number: 4539 (2024) Cite this article

1424 Accesses
3 Altmetric
Metrics details

Subjects

Abstract

In ophthalmic diagnostics, achieving precise segmentation of retinal blood vessels is a critical yet challenging task, primarily due to the complex nature of retinal images. The intricacies of these images often hinder the accuracy and efficiency of segmentation processes. To overcome these challenges, we introduce the cognitive DL retinal blood vessel segmentation (CoDLRBVS), a novel hybrid model that synergistically combines the deep learning capabilities of the U-Net architecture with a suite of advanced image processing techniques. This model uniquely integrates a preprocessing phase using a matched filter (MF) for feature enhancement and a post-processing phase employing morphological techniques (MT) for refining the segmentation output. Also, the model incorporates multi-scale line detection and scale space methods to enhance its segmentation capabilities. Hence, CoDLRBVS leverages the strengths of these combined approaches within the cognitive computing framework, endowing the system with human-like adaptability and reasoning. This strategic integration enables the model to emphasize blood vessels, accurately segment effectively, and proficiently detect vessels of varying sizes. CoDLRBVS achieves a notable mean accuracy of 96.7%, precision of 96.9%, sensitivity of 99.3%, and specificity of 80.4% across all of the studied datasets, including DRIVE, STARE, HRF, retinal blood vessel and Chase-DB1. CoDLRBVS has been compared with different models, and the resulting metrics surpass the compared models and establish a new benchmark in retinal vessel segmentation. The success of CoDLRBVS underscores its significant potential in advancing medical image processing, particularly in the realm of retinal blood vessel segmentation.

FIVES: A Fundus Image Dataset for Artificial Intelligence based Vessel Segmentation

Article Open access 04 August 2022

State-of-the-art retinal vessel segmentation with minimalistic models

Article Open access 13 April 2022

A lightweight deep learning model for automatic segmentation and analysis of ophthalmic images

Article Open access 20 May 2022

Introduction

Artificial intelligence (AI) and cognitive computing have revolutionized various sectors, including medicine, pharmacy, and healthcare, in the realm of big data¹. While AI operates on algorithms and patterns, cognitive computing takes it further, mimicking the human brain’s reasoning processes and adaptability to deliver more nuanced solutions. Together, their transformative power is redefining modern medical image processing, streamlining processes, enhancing accuracy, and potentially saving lives. The synergy between AI, cognitive computing, and medical imaging offers robust tools that extend and amplify human capabilities in diagnosing and treating various diseases².

In this regard, recent medical imaging technology advancements allow us to capture retina details with unprecedented clarity³. Nevertheless, research is still seeking more accuracy and efficiency due to the domain’s natural complexities. For instance, retinal blood vessel segmentation demands precise differentiation of these vessels, which presents numerous challenges⁴. Traditional manual and semi-automatic methods, plagued by inefficiency and proneness to errors, falter, especially when faced with the voluminous data churned out by modern imaging systems⁵.

The convergence of AI’s algorithmic and cognitive computing ushers a transformative shift using the amalgamation of hybrid models. With the capacity to learn, adapt, and discern complex patterns from vast datasets, this confluence is reshaping healthcare⁶. Tasks like image segmentation, disease identification, and prognosis prediction have recently seen a significant infusion of AI and cognitive computing principles. AI, specially trained on extensive retinal image datasets, can refine its accuracy in blood vessel delineation, converting raw data into actionable insights⁷. Also, convolutional neural networks (CNNs) stand out for their adeptness in image analysis due to their hierarchical data learning⁶.

Deep learning, enriched with cognitive approaches, can surpass traditional methods in accuracy and efficiency for retinal blood vessel segmentation⁸. However, exploring the full capability of combining AI and cognitive computing in diagnostics and mitigating vision loss is still far from its potential. Various deep learning architectures have been used for retinal blood vessel segmentation, such as convolutional neural networks (CNNs)⁹, fully convolutional networks (FCNs)¹⁰, and U-nets¹¹. These architectures are trained on different datasets of retinal images labeled with the location of blood vessels, allowing them to learn to identify the vessels in new images⁸ accurately. Automatic analysis of the retinal vascular tree by image processing techniques is essential for many clinical investigations and constitutes a field of scientific research leading¹². Detection and characterization of small blood vessels on retina images are essential in diagnosing certain diseases, such as diabetes or hypertension. The current state of the art indicates that methods based on supervised learning currently have the best performance. However, accurate segmentation of retinal blood vessels is a crucial step in diagnosing and monitoring various ocular diseases like diabetic retinopathy and glaucoma.

Nevertheless, the task is challenging due to the intricate and varying structure of the retinal blood vessels, the presence of pathologies, and differences in image quality. While demonstrating promise, current methods still need to be revised regarding sensitivity, specificity, and robustness to image variations. Dash et al.¹³ highlights a pressing need for an advanced, reliable, and more precise retinal blood vessel segmentation technique to address these challenges. Thus, this research proposes a new hybrid model for retinal blood segmentation and tries to answer the following questions.

This research aims to address the previous concern and propose an innovative approach called CoDLRBVS to improve the detection and segmentation of fine retinal vessels by integrating deep learning methods(i.e., U-Net architecture) with diverse image processing techniques, such as matched filter (MF), multi-scale line detection, scale space representation, and morphological operations. Each technique contributes to creating an approach that emulates human cognition and adaptability in processing complex visual information such as medical images. In our proposed approach, the matched filter and multi-scale line detection are calibrated to amplify the initial segmentation of blood vessels, integrating cognitive principles to interpret intricate patterns and enhance contrasting features, much like the human visual system. Subsequently, the U-Net architecture, revered for its efficacy in biomedical image segmentation, refines these results by learning hierarchical features and making informed decisions, reflecting human-like analytical reasoning.

Moreover, scale space representation is implemented to analyze blood vessels at varied scales, mirroring the human ability to perceive objects at different distances and sizes, and morphological operations are employed to refine the segmentation results by eliminating noise and filling gaps, emulating the human brain’s inherent ability to filter out irrelevant information and focus on the essential. The culmination of these techniques is expected to yield a system that transcends the current paradigms in segmentation methods regarding accuracy, sensitivity, and specificity, facilitating early detection and intervention for retinal diseases. Hence, CoDLRBVS provides a unique integration of a pre-processing phase using the MF for feature enhancement. Also, CoDLRBVS includes a post-processing phase employing morphological techniques (MT) to refine the segmentation output. Finally, it incorporates multi-scale line detection and scale space methods to enhance segmentation capabilities.

The main contribution of this paper is CoDLRBVS, an innovative approach to improve the detection and segmentation of fine retinal vessels. Compared to our approach, most existing approaches fail to handle retinal images’ intrinsic variability and complexity. CoDLRBVS integrates deep learning methods, mainly U-Net architecture, and diverse image processing techniques to address such shortcomings and achieve better results. The developed model exhibits robust performance across different retinal image datasets and under various image quality conditions, as proved by our experiments. CoDLRBVS achieved high segmentation performance with a mean accuracy of 96.7% across all datasets. Additionally, the model supports the adaptation to various sizes of retinal blood vessels by effectively incorporating adaptive techniques like scale space and multi-scale line detection into the U-Net architecture.

The remainder of this paper is organized as follows: “Background and related work” provides background on retinal imaging, cognitive computing, image pre-processing techniques, and convolutional neural networks and explores the related work in the field. Next, we introduce the proposed approach and its main phases in “CoDLRBVS approach”. The datasets used in the study, experimental settings, approach evaluation, and the obtained results are described in “Result analysis and experiment description”. Finally, we conclude the paper in “Conclusion”, draw the main finding, and outline potential future work direction.

Background and related work

In this section, we present background about closely related fields, including retinal imaging, cognitive computing, and image pre-processing techniques in medical image processing. Additionally, we discuss studies related to our work.

Background

Retinal imaging

Ophthalmologists rely on retinal imaging because it enables them to diagnose and treat eye disorders at an early stage. Retinal imaging plays a crucial role in health prediction enabled by deep learning techniques¹⁴. There are multiple types of retinal imaging methods, including the following ones¹⁴: (1) fundus photography. It is the most frequent form of retinal imaging, which results in a color picture of the retina. Its primary applications are in the early detection and follow-up of retinal disorders. (2) optical coherence tomography. It produces cross-sectional images of the retina, enabling the measurement of retinal thickness and the identification of minor structural changes that may not be detectable with conventional imaging techniques. The macular hole, macular pucker, and macular edema are all disorders that benefit greatly from this diagnosis and treatment method. (3) Adaptive optics in imaging. This cutting-edge method provides real-time correction for optical flaws in the eye and results in cellular-level retina imaging.

Cognitive computing

It simulates human thinking processes in computers, employs self-learning algorithms for utilizing data mining, pattern recognition, and natural language processing. Gudivada et al.¹⁵ provided insights into its relevance in diagnosis assistance, drug discovery, and patient management. The authors emphasized its potency in processing complex medical imaging datasets, making it suitable for retinal vessel segmentation.

Beyond data analysis, cognitive computing holds transformative potential for patient care, research, and healthcare operations in many areas, including RNA sequencing^16,17. Srivani et al.¹⁸ illustrated its pivotal role in patient-centric care, spotlighting its capability in predicting patient needs and shaping care plans, especially in managing chronic diseases. Similarly, Kumar et al.¹⁹ elaborated on system’s efficacy in adaptability in dynamic healthcare settings for predicting patient inflow, optimizing hospital resources, staffing, and bed allocation. Sathananthavathi and Indumathi showcased potential of retinal imaging in^20,21. In²⁰, the authors integrated cognitive systems with image processing for retinal vessel segmentation. Whereas in²¹, the authors utilized cognitive computing to discern anomalies in retinal images linked to diabetic retinopathy, highlighting the adaptive evolution of cognitive models for improved accuracy over time.

Deep learning techniques, especially CNNs, have made remarkable strides in medical imaging. With recent progress in CNN architectures, there has been a surge in precise image segmentation. For instance, Chen et al.²² enhanced the U-Net structure tailored for retinal blood vessel segmentation by integrating with conventional image processing. Traditionally, vessel segmentation has utilized techniques like adaptive thresholding and morphological operations and notably, the outcomes were enhanced when integrated with machine learning techniques. Wang et al.²³ demonstrated innovative use of cognitive systems with generative adversarial networks for medical image augmentation. Similarly, Nahiduzzaman et al.²⁴ illustrated the benefits of integrating cognitive systems with CNNs in chest X-ray imaging, using cognition to understand anomalies and direct the CNNs more accurately. Furthermore, Shrma et al.²⁵ spotlighted the advantages of integrating cognitive systems and neural architectures in ultrasound imagery, demonstrating cognitive systems’ potential in aiding deep learning models to discern ambiguous regions, and consequently honing segmentation and classification.

Image pre-processing techniques

Image pre-processing techniques play prominent role in enhancing the quality of the images, which is crucial for accurate diagnosis and treatment planning^26,27. The normalization techniques enable the adjustment of pixel intensity of medical images to a standard range, resulting in improving the contrast and making the details more visible. This technique is useful in medical scenarios where images suffer from poor contrast mainly due to the imaging environment²⁸. It exploits linear or non-linear adjustments to improve images’ clarity for medical analysis purposes²⁹.

Color space conversion is another important pre-processing step in medical imaging. This step is important in scenarios where color information is crucial, such as in histology images or stained tissue samples³⁰. Converting images into appropriate color spaces (e.g., from RGB to the CIE Lab* color space) would enable medical professionals to study specific features more effectively³¹. Moreover, matched filters is another technique applied in the medical imaging to enhance specific patterns, such as the detection of microcalcifications in mammograms or blood vessels in retinal images. Using the cross-correlation between the image and a predefined pattern, matched filters can suppress the noise and detect and spotlight areas of interest³².

To identify and analyze structures of various sizes and orientations in medical images (e.g., such as blood vessels, neural pathways, or skeletal structures), multi-scale line detection technique cab be applied³³. For this purpose, this technique involves examining images at multiple scales or resolutions³⁴. This technique is useful in complex and critical medical scenarios such as detecting tumors and classifying tissues³⁵.

Convolutional neural networks

CNN is a class of deep learning algorithms designed to automatically and adaptively learn spatial hierarchies of features. Thus, CNN are suitable for analyzing visual data³⁶. U-Net is one of the most notable CNN architectures for medical image segmentation¹¹. The U-Net is useful to achieve more accurate segmentations in case less training data is available. It is commonly used in fields, including magnetic resonance imaging, computed tomography, and microscopy. Other commonly used CNN architectures applied in the medical imaging domain include SegNet³⁷, V-Net³⁸, and DeepLab³⁹.

Related work

Soares et al.⁴⁰ developed a supervised technique for the segmentation of vessels. The technique exploits a two-dimensional Gabor wavelet and a selection of morphological variables. The experiments results indicated that the technique was effective in separating the vessels from the backdrop. Staal et al.⁴¹ proposed an unsupervised and automated method for vessel segmentation. The method achieved reliable performance by utilizing a combination of line detectors and the hysteresis thresholding of the vessel’s likelihood map.

Dash et al.⁴² present a method to enhance the performance of curvelet transform. To improve retinal blood vessel segmentation, the method enables the fusion of curvelet transform and the Jerman filter, while the Mean-C threshold is used for the segmentation purpose. Further, in⁴³, the authors developed an automated method to extract the blood vessels from fundus. For this purpose, the method integrates discrete wavelet transform and Tyler Coye algorithm. Additionally, the methods exploits the gamma correction to enhance the images contrast. Furthermore, in⁴⁴, the authors proposed a model for enhancing abnormal retinal images containing low vessel contrasts. For this purpose, the proposed approach exploits both a fast guided filter and a matched filter for improving the performance measures for vessel extraction.

The current state of art indicates that methods based on supervised learning currently have the best performance⁸. Those methods are based on patches of real-size images and use CNNs and start with classifying each pixel of the image according to a fixed centered pixel neighborhood¹¹. Unsupervised learning algorithms represent another approach to segment retinal vessels. They are capable of automatically segmenting vessels without requiring annotated training data. Frangi vesselness filter is a well-known unsupervised learning method designed to enhance vessel-like structures in images⁴⁵. Fraz et al.⁴⁶ developed an ensemble classification method that exploits a combination of boosting and random forests ML algorithms with a novel set of rotating invariant features. The method achieved an improved performance over the other state-of-the-art techniques. Recently, several deep learning-based methods have been introduced to address the issue of retinal vessel segmentation. Orlando et al.⁴⁷ proposed deep learning based approach that exploits a U-Net architecture. The proposed model was trained end-to-end on a large dataset and showed superior performance compared to other techniques. Melinscak et al.⁴⁸ developed an approach based on multi-scale line detection and used it for segmentation of retinal blood vessels. The authors validated their approach by conducting experiments on four publicly available datasets and reported competitive results.

Further, CNNs have been extensively applied in retinal blood vessel segmentation⁴⁹. They are designed to automatically and adaptively learn spatial hierarchies of features, making them exceptionally suited for image recognition tasks. Their architecture, which allows for the simultaneous examination of several scales, improves the detection of both small and large blood vessels⁴⁵. Furthermore, the FCNs have also been utilized for the same purpose⁵⁰. The main advantage of applying FCNs is that they support end-to-end and pixels-to-pixels learning without the need for patch extraction and selection. Consequently, this makes the segmentation process more efficient¹⁰. The U-net architecture is specifically designed for biomedical image segmentation. This makes it one of the most successful implementations of FCNs¹¹. The U-net architecture consists of two paths where the first is a contracting path that capture the context. Whereas, the second is a symmetric expanding path that enables precise localization. Consequently, this unique architecture enables an effective retinal blood vessel segmentation process.

Moreover, methods based on the integration of deep learning and classical image processing techniques have also been proposed. For instance, Li et al.⁵¹ developed a framework that integrates CNNs and scale-space theory for blood vessel segmentation. Similarly, Zhang et al.⁵² introduced the LCU-Net, a novel low-cost U-Net based approach for the environmental microorganism image segmentation task. The approach extends and improves the traditional U-Net architecture by integrating inception and concatenate operations, which enables it to address the single receptive field’s limitations and high memory cost. In⁵³, the authors conducted a comprehensive review considering different techniques, including conventional multilayer perceptrons, convolutional neural networks, and visual transformers. The findings of the review highlights the critical role of neural networks in various applications, including environmental pollution control and disease prevention.

Chen et al.⁵⁴ developed IL-MCAM, a framework for colorectal histopathology image classification. The framework exploits interactive learning and multi-channel attention mechanism to enhance the images’ classification accuracy. Li et al.⁵⁵ conducted a comprehensive review of automated image analysis techniques. The study discusses the progression of ML techniques and their integration into whole-slide image analysis. Additionally, it outlines the developments and challenges in feature extraction, segmentation, and classification methods. Finally, the optimization of retinal blood vessel segmentation has recently been widely used in many fields^56,57. It also has been used for tuning the hyper-parameter of deep learning algorithms³⁶, which can further enhance the blood vessel model.

To summarize, although several approaches have been proposed to enable the automated retinal blood vessel segmentation, they often fall short in handling the intrinsic variability and complexity of retinal images. Many existing models lack the adaptability to accurately segment vessels of varying sizes and shapes, or they do not adequately address the issues of noise and fine structure preservation in the images. Moreover, the integration of advanced image processing techniques with deep learning methods has not been fully explored or optimized, leaving potential improvements in accuracy and efficiency untapped. The reliance on single-scale methods or non-adaptive techniques often results in suboptimal performance, particularly in the presence of pathological changes or varied image qualities. Furthermore, while some models demonstrate decent performance on specific datasets, their generalizability across different datasets and under diverse imaging conditions remains a significant challenge. In light of these gaps, we propose the CoDLRBVS model, a novel hybrid approach that combines the strengths of image processing and deep learning. Our method addresses these shortcomings by introducing a more adaptable, robust, and precise segmentation solution, setting a new standard for accuracy and performance in retinal blood vessel segmentation.

CoDLRBVS approach

The CoDLRBVS introduces an innovative approach to improve the detection and segmentation of fine retinal vessels by integrating deep learning algorithms, precisely the U-Net architecture model, with various image processing techniques. These techniques include matched filter, multi-scale line detection, scale space, and morphological operations as shown in algorithm 1. While each method contributes distinct advantages, their combined utilization significantly influences the final output of the approach.

For instance, applying the matched filter significantly improves the visibility and contrast of blood vessel structures’ essential details by amplifying the visibility of the vessels’ patterns and enhancing their detectability. The multi-scale line detection method allows the model to analyze the image at different scales. It ensures accurate detection and segmentation of vessels with various thicknesses that address the inherent diversity in vessel diameters. The U-Net model is designed explicitly for semantic segmentation tasks. Its encoder–decoder architecture enables it to capture local and global contextual information, which contributes to accurately segmenting the blood vessels.

Additionally, the scale-space analysis allows the model to effectively capture the vessel structures at distinct scales, considering both vessel size variations and thickness. Ultimately, leveraging morphological techniques refines the segmented vessel map by eliminating noise, filling gaps, enhancing both the overall connectivity and smoothness of the segmented vessels. These methods improve the CoDLRBVS’s robustness against noise, anomalies, and image variations, which are well-known issues in medical images.

By seamlessly incorporating these techniques with cognitive computing attributes, there is a notable increase in the accuracy and robustness of blood vessel segmentation. Moreover, the approach exhibits enhanced resilience against common challenges in medical images, such as noise and anomalies. This human-centred integrated approach holds transformative potential across various medical domains, spanning disease diagnosis, monitoring, surgical planning, and developing innovative treatment methodologies. Figure 1 illustrates the CoDLRBVS approach abstract diagram, while algorithm 1 shows the CoDLRBVS detailed steps.

The proposed approach involves three distinct phases; each phase comprises various steps as described below:

Preprocessing phase: This phase consists of five steps, each one associated with specific techniques to preprocess the retinal images and then use them in the segmentation phase, However, an illustration of the model preprocessing steps output result is shown in Fig. 2. The individual preprocessing steps are explained below:
1. 1.
  Initialize the environment parameters and load the medical images. Then, to ensure uniformity in feature range and have a common scale, each retinal image is preprocessed and normalized using Eq. (1).
  $$\begin{aligned} I_{norm} = \frac{I - \mu }{\sigma } \end{aligned}$$
  (1)
  where $ I $ is the original image, $ \mu $ represent the mean pixels value, $ \sigma $ is the standard deviation, and $ I_{norm} $ represents the normalized image.
2. 2.
  Perform the color space conversion to extract relevant channels that contain blood vessels.
3. 3.
  Apply the matched filter technique, where the match filter bank is designed based on typical blood vessel profiles, such as line shapes of varying widths. The input image undergoes convolution with this filter, and the maximum response across the filter is computed for each pixel using Eqs. (2 and 3). To design a matched filter bank, let’s suppose $ F = \{f_1, f_2, \ldots , f_N\} $ be a set of $ N $ matched filters designed using blood vessel profiles. Convolve the input image using the filter bank; suppose $ R(i, j, k) $ represents the response of the $ k $-th filter for pixel $ (i, j) $ in the convolved image as shown in Eq. (2).
  $$\begin{aligned} R(i, j, k) = I_{norm} * f_k(i, j) \end{aligned}$$
  (2)
  where $ * $ represent the convolution operation. Then, compute the maximum response $ M(i, j) $ for pixel $ (i, j) $ across the filter bank as shown in Eq. (3):
  $$\begin{aligned} M(i, j) = \max _k R(i, j, k) \end{aligned}$$
  (3)
  where $ k = 1 $ to $ N $. This process generates a “matched filter response” image, highlighting potential blood vessel locations.
4. 4.
  Integrate a multi-scale line detection technique, such as a Hessian-based method or a Steerable filter, as shown in Eq. (4). The detections from each scale are combined into a “line detection response” image, which further helps identify blood vessel locations. Let’s suppose $ L(i, j, s) $ represent the line detection response at pixel $ (i, j) $ for scale $ s $.
  $$\begin{aligned} L(i, j, s) = \text {LineDetection}(I_{norm}, s) \end{aligned}$$
  (4)
  where $\text {LineDetection}()$ denotes the line detection algorithm applied to the normalized image $ I_{norm} $.
5. 5.
  Create the scale-space representation of the image according to Eq. (5), by generating a series of images that represent the original image at various scales. These representations are combined to highlight blood vessels, enhancing the visibility of their structures.
  $$\begin{aligned} S_m(i, j) = \text {ScaleSpace}(I_{norm}, m) \end{aligned}$$
  (5)
  where $ S = \{S_1, S_2, \ldots , S_M\} $ represents a set of $ M $ scale-space images representing the original image at different scales, and $\text {ScaleSpace}()$ generates the $ m $-th scale-space image for the normalized image $ I_{norm} $.
Segmentation phase: In this phase, we train the U-Net model with residual connections using the images generated from the prior steps, where the ground truth vessel segmentation is known. The U-Net model’s convolution operation helps the model recognize patterns and features indicative of blood vessels using Eq. (6). Incorporating the residual connections enhances the approach’s capacity to learn more complex features and representations of the data, including detecting small blood vessels.
$$\begin{aligned} I'_{x,y} = \sum _{i=-k}^{k} \sum _{j=-k}^{k} I_{x+i,y+j} \cdot K_{i,j} \end{aligned}$$
(6)
where $ I' $ is the convolved image, $ I $ is the original image, $ K $ is the convolution kernel, and $ k $ is the size of the kernel. After completing the U-Net training process, we evaluate the model’s performance on unseen images using several metrics, including accuracy, precision, F1 measure, Kappa, and others. This evaluation offers valuable insights into the model’s ability to identify and segment retinal blood vessels accurately.
Postprocessing phase: In this phase, we use morphological techniques to enhance the output of the segmentation obtained from the U-Net model (${\textbf {V}} $), including removing noise from vessels or filling gaps to improve the quality and connectivity of the segmented vessels. Subsequently, the postprocessing stage involves applying a threshold to the final output to create a binary segmentation image, as shown in Eq. (7). This thresholding step transforms the segmented vessel probabilities into a binary map, classifying pixels as either vessel or non-vessel.
$$\begin{aligned} B(i, j) = \text {Threshold}(V(i, j)) \end{aligned}$$
(7)
where $\text {Threshold}()$ converts the pixel value $ V(i, j) $ to a binary value based on a predefined threshold.

Result analysis and experiment description

Datasets

We evaluated the CoDLRBVS approach in terms of performance and effectiveness using several metrics over widely known benchmark datasets for blood vessel segmentation, such as DRIVE (Digital Retinal Images for Vessel Extraction) dataset⁵⁸, CHASE_DB1 dataset⁵⁹, High-Resolution Fundus (HRF) Image Database², STARE (STructured Analysis of the Retina) dataset⁶⁰, and the Retina Blood Vessel⁶¹, Table 1 provides more details about the used datasets, such as the number of training and testing samples, image height and width. However, the DRIVE database contains 20 retinal images captured using a Canon CR5 camera in a 24-bit color space. Each image comes with expert annotations of vascular segmentation, serving as ground truth for performance evaluation⁵⁸. Similarly, the CHASE_DB1 dataset offers 28 high-resolution images of multi-ethnic children’s retinas, each with two sets of manual segmentations, providing a rich ground for algorithm testing and validation⁵⁹. The HRF image database, known for its detailed imagery used in various comparative studies, is a valuable resource for algorithm evaluation⁶⁰. With its 20 retinal fundus images, the STARE dataset provides additional variance and challenges, including images with and without pathology, crucial for assessing the adaptability and robustness of the segmentation algorithm⁶⁰. Further, the 100 retinal images from⁶¹ offer additional challenges for retinal blood vessel segmentation, contributing significantly to developing and evaluating advanced segmentation algorithms⁶¹.

Table 1 Overview of the compared with retina vessel segmentation datasets.

Full size table

Algorithm evaluation and discussion

The CoDLRBVS approach has been validated from different angles using various metrics across all earlier-mentioned datasets, such as (1) the Jaccard index (Eq. 8), which evaluates the similarity between the predicted segmentation and the ground truth to give a clear understanding of the overlap between segmented and actual vessels. (2) The F1 score (Eq. 9) harmonizes precision and recall, reflecting the accuracy and thoroughness of covering vessel pixels. (3) Sensitivity (Eq. 10) is vital for ensuring no vessel regions are missed, significant in diagnostic contexts. (4) Precision (Eq. 11) is critical in clinical settings to minimize false positives and avoid misdiagnosis. (5) Accuracy (Eq. 12) reflects overall correctness, while (6) the Kappa coefficient (Eq. 13) provides a normalized measure of agreement between the model prediction and the ground truth. (7) Area under the curve (AUC) is essential for understanding trade-offs at various thresholds. (8) Specificity (Eq. 14) measures correct identification of non-vessel areas, and finally, (9) average frames per second (FPS) (Eq. 15) indicates computational efficiency, which is essential for real-time applications. Therefore, these metrics provide a comprehensive evaluation framework for our approach.

$$\begin{aligned} J = \frac{|A \cap B|}{|A \cup B|} \end{aligned}$$

(8)

where $J$ is the Jaccard index, $A$ is the set of true positives, and $B$ is the set of predicted positives.

$$\begin{aligned} F1 = 2 \times \frac{\text {Precision} \times \text {Recall}}{\text {Precision} + \text {Recall}} \end{aligned}$$

(9)

where $F1$ is the F1 score, precision is the proportion of true positive predictions in all positive predictions, and recall is the proportion of true positive predictions in all actual positives.

$$\begin{aligned} \text {Recall} = \frac{TP}{TP + FN} \end{aligned}$$

(10)

where recall is the true positive rate, $TP$ is the number of true positives, and $FN$ is the number of false negatives.

$$\begin{aligned} \text {Precision} = \frac{TP}{TP + FP} \end{aligned}$$

(11)

where precision is the proportion of true positives in the predicted positive cases, $TP$ is the number of true positives, and $FP$ is the number of false positives.

$$\begin{aligned} \text {Acc.} = \frac{TP + TN}{TP + TN + FP + FN} \end{aligned}$$

(12)

where $Acc.$ is the accuracy of the model, $TP$ is the number of true positives, $TN$ is the number of true negatives, $FP$ is the number of false positives, and $FN$ is the number of false negatives.

$$\begin{aligned} \kappa = \frac{p_o - p_e}{1 - p_e} \end{aligned}$$

(13)

where $\kappa $ is the Kappa coefficient, $p_o$ is the relative observed agreement among raters, and $p_e$ is the hypothetical probability of chance agreement.

$$\begin{aligned} \text {Specificity} = \frac{TN}{TN + FP} \end{aligned}$$

(14)

where specificity is the true negative rate, $TN$ is the number of true negatives, and $FP$ is the number of false positives.

$$\begin{aligned} \text {FPS} = \frac{\text {Total Frames}}{\text {Total Time}} \end{aligned}$$

(15)

where $FPS$ is the average frames per second, Total Frames is the total number of frames or images processed, and Total Time is the total time taken for processing.

Table 2 CoDLRBVS model metric results over DRIVE, CHASE_DB, HRF, DRIVE retinal blood vessel, and STARE dataset.

Full size table

Table 2 reports different performance metrics used to evaluate the CoDLRBVS approach across various retinal image datasets, including DRIVE, CHASE, HRF, retinal blood vessel and STARE datasets. The results show that the proposed approach achieved a high accuracy of 0.9903% over the retinal blood vessel images dataset that contains the largest number of images, which is 100 images, and this is due to the fact that it was trained more than other datasets, and this indicates that training the model with more images results in more accurate results, further CoDLRBVS achieved an accuracy of 0.9629%, 0.9619%, 0.9581%, 0.9655% over DRIVE, CHASE, HRF, and STARE dataset respectively, underscore its effectiveness in accurately segmenting blood vessels. Such robustness across datasets suggests the CoDLRBVS’s generalizability and adaptability to different imaging conditions and vessel structures. The results also show that the model achieved very good Precision results over all datasets, where it achieved 0.9301% on the Retinal Blood Vessel and close to or exceeding 0.9% on the other datasets, including 0.9080% on STARE, highlight the approach proficiency in identifying true positive pixels across varied datasets, reducing false positives significantly. The Sensitivity of the model remains consistently high, with 0.9927%, 0.9961%, 0.9954%, 0.9947%, 0.9909% over the retinal blood vessel, DRIVE, CHASE, HRF, and STARE dataset respectively, indicating the CoDLRBVS’s ability to detect the majority of actual vessel pixels, a testament to the U-Net’s feature-learning capabilities and the effectiveness of the scale space representation. further the results in Table 2 show that the specificity has a wider variation with 0.7972% on the retinal blood vessel dataset, 0.8048% on DRIVE, lower values on CHASE_DB and HRF datasets, and a significantly higher 0.9708% on STARE, reflecting a challenge in distinguishing non-vessel regions across different datasets, which could be attributed to dataset-specific characteristics and noise factors. This metric’s variation underscores the need for dataset-specific adjustments to optimize the model’s performance. Moreover, CoDLRBVS achieves commendable F1 scores, indicating a balanced precision and recall, with the highest being 0.8473% on the DRIVE dataset and a close 0.8469% on STARE. The Jaccard index also demonstrates the model’s segmentation accuracy, with the highest value of 0.6820% on DRIVE and a comparable 0.7014% on STARE. The average frames per second (FPS) values, with the highest 0.966867% on DRIVE and 0.924556% on STARE, indicate the model’s efficiency, suggesting its potential for real-time applications. In addition, the integration of matched filter, multi-scale line detection, and U-Net, along with strategic post-processing, offers a comprehensive approach that caters to the nuances of retinal blood vessel segmentation. Figures 3, 4 , 5 and 6, shows the results of the vessel segmentation method from studied datasets. In all figures; (A) shows the original RGB image; (B) shows the ground truth images, which are manually segmented by experts; (C) shows the vessel segmentation results of the proposed model, highlighting the algorithm’s performance in accurately delineating blood vessels.

As it can be seen in Figs. 3 , 4, 5 and 6, that the resulted images (c) serves as a testament to the model’s proficiency in segmenting retinal blood vessels, exhibiting a high degree of detail with both central and peripheral vessels crisply outlined against the contrasting background. The continuity and uniformity of the vessel structures, mirroring the ground truth with notable precision, highlight the model’s capability to capture essential details necessary for accurate segmentation.

Table 3 Performance of multiple vessel segmentation methods using the DRIVE dataset.

Full size table

Table 4 Comparison with existing retinal vessel segmentation methods using STARE dataset.

Full size table

Table 5 Comparison with existing retinal blood vessel segmentation methods using CHASE database.

Full size table

Table 6 Blood vessel segmentation comparison on HRF database.

Full size table

As it can be seen in Table 3, which illustrates the comparison results of CoDLRBVS performance with different vessel segmentation methods using the DRIVE dataset. The proposed CoDLRBVS model demonstrates high sensitivity, with a score of 0.9961%, indicating its outperforming capability in correctly identifying retinal blood vessels. This metric is significantly higher than the other listed methods, which range between 0.64 and 0.74%, suggesting that CoDLRBVS is particularly effective at minimizing false negatives and reliably detecting even the most delicate vessels. However, while its accuracy is competitive at 0.9629%, indicating a high overall rate of correct predictions, its specificity is comparatively lower at 0.8048%. This lower specificity implies a higher rate of false positives, meaning the model might sometimes mistakenly identify non-vessel areas as vessels.

As it can be seen in Table 4, which displays the performance of the CoDLRBVS with other compared algorithms over the STARE dataset, CoDLRBVS also exhibits outstanding sensitivity at 0.9909%, significantly higher than other methods, which range from approximately 0.6751 to 0.7769%. Furthermore, in terms of accuracy, CoDLRBVS performs well with a score of 0.9655%, suggesting that it generally makes correct predictions. This indicates that the model has reliable performance in segmenting retinal blood vessels. The model’s specificity is also high at 0.9708%, denoting its ability to identify non-vessel areas over the STARE dataset correctly. This specificity is competitively placed within the range of other methods, which mostly fall between 0.9550 and 0.9819%. When comparing methods across the board, it’s evident that while many provide balanced performance, CoDLRBVS’s standout feature remains its exceptional sensitivity, making it a potentially valuable tool for medical imaging tasks where missing a small detail can lead to significant consequences.

The results in Tables 5 and 6 show that the CoDLRBVS model exhibits a very high sensitivity of 0.9954% and 0.9947% in both CHASE DB1 and HRF datasets respectively, whereas in term of accuracy, the CoDLRBVS model achieved 0.9619% and 0.9581% which is considered comparable with other models, where its values range from 0.94 to 0.97%. However, combining matched filters, multi-scale line detection, u-net architecture, scale space, and morphological techniques has contributed to robust feature recognition and precise vessel detection in CoDLRBVS, ensuring minimal false negatives. However, the CoDLRBVS model’s specificity is notably lower than other methods. While the model excels at detecting vessels (high sensitivity), it also tends to mark non-vessel elements as vessels (lower specificity). This could be due to the nature of the dataset.

The proposed model, however, maintains high performance, demonstrating a balanced and robust approach to retinal vessel segmentation. This balanced performance is due to the integration of deep learning U net with various techniques, each contributing to different aspects of the segmentation of retinal blood vessels. whereas, the matched filter and multi-scale line detection techniques are used in the pre-processing stages of the model. The matched filter is designed to respond maximally to typical blood vessel profiles, enhancing the image’s vessels. multi-scale line detection, on the other hand, identifies vessels of varying widths across the image. Combining these two methods benefits the model due to the fact that both methods facilitate model sensitivity to the presence of vessels and adaptivity to their varying widths, where the single-scale or non-adaptive methods might miss.

Further, applying the U-Net architecture, a deep learning model, plays a crucial role in segmenting the blood vessels. Its ability to learn and extract high-level features from the retinal images contributes to the high sensitivity of the CoDLRBVS hybrid model. The U-Net architecture, with its encoding and decoding pathways, is designed specifically for segmentation tasks, and it utilizes the spatial information in the image, which is crucial for precise segmentation. The U-Net’s ability to generalize and learn complex representations likely contributes to the CoDLRBVS hybrid model’s superior accuracy results.

Furthermore, the scale space representation technique helps handle the varying size of retinal blood vessels. It generates images representing the original image at different scales, which is particularly beneficial for capturing both larger vessels and smaller, more intricate vascular structures. This likely contributes to the model’s balanced performance in terms of sensitivity and specificity, as it helps ensure that vessels of all sizes are accounted for in the segmentation. Finally, the morphological techniques are applied in the post-processing stage to refine the segmentation. Morphological operations can remove noise (small false-positive detections) and fill in gaps in detected vessels (false negatives), leading to a cleaner and more accurate final segmentation. It is worth noting that it would be interesting to extend our research on a federated learning scale^85,86,87.

Conclusion

This paper presents CoDLRBVS, a pioneering cognitive-based deep learning model for medical image processing, namely retinal blood vessel segmentation. Our approach combines (1) a Matched Filter to detect segmentation in noisy data. It works by designing a filter that matches the shape of the signal being transmitted or received. (2) Multi-scale line detection, a technique to capture the vessels at different angles at that point. (3) U-Net architecture, a deep learning semantic segmentation technique. (4) Scale space for handling images at different scales, suppressing fine-scale structures. Moreover, (5) morphological techniques extract features based on an image’s topographic surface. Our model demonstrates a remarkable balance across performance metrics, achieving notable mean accuracy of 96.7%, precision of 96.9%, sensitivity of 99.3%, and specificity of 80.4% across all of the studied datasets, positioning it as a strong contender among existing state-of-the-art methods. The model’s high specificity significantly mitigates false positives, which is vital for precise segmentation. Hence, the novelty of our approach lies in its balanced performance, avoiding the common trade-offs among sensitivity, specificity, and accuracy observed in other methods. This balance reflects the model’s robustness and the efficacy of integrating various techniques following the cognitive computing principles. Hence, CoDLRBVS improves the accuracy and efficiency of retinal image analysis and enhances the ability to detect and diagnose retinal diseases. Future work will broaden dataset testing to enhance our model’s versatility and incorporate cutting-edge techniques and architectures to refine its performance.

Data availability

The data used in this study are available on the following websites: https://paperswithcode.com/dataset/drive https://www.kaggle.com/datasets/abdallahwagih/retina-blood-vessel https://www.kaggle.com/datasets/pradosh123/retinal-vessel-segmentation-combined https://www.kaggle.com/datasets/rashasarhanalharthi/chase-db1.

References

Awaysheh, F. M., Alazab, M., Garg, S., Niyato, D. & Verikoukis, C. Big data resource management and networks: Taxonomy, survey, and future directions. IEEE Commun. Surv. Tutor. 23, 2098–2130 (2021).
Article Google Scholar
Pradosh. Retinal Vessel Segmentation Combined. https://www.kaggle.com/datasets/pradosh123/retinal-vessel-segmentation-combined (2023). Accessed 15 Dec 2023.
Dash, S., Senapati, M. R., Sahu, P. K. & Chowdary, P. Illumination normalized based technique for retinal blood vessel segmentation. Int. J. Imaging Syst. Technol. 31, 351–363 (2021).
Article Google Scholar
Fraz, M. M. et al. An approach to localize the retinal blood vessels using bit planes and centerline detection. Comput. Methods Programs Biomed. 108, 600–616 (2012).
Article CAS PubMed Google Scholar
Abràmoff, M. D., Garvin, M. K. & Sonka, M. Retinal imaging and image analysis. IEEE Rev. Biomed. Eng. 3, 169–208 (2010).
Article PubMed PubMed Central Google Scholar
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Article ADS CAS PubMed Google Scholar
Martinez-Perez, M. E. et al. Retinal vascular tree morphology: A semi-automatic quantification. IEEE Trans. Biomed. Eng. 49, 912–917 (2002).
Article PubMed Google Scholar
Liskowski, P. & Krawiec, K. Segmenting retinal blood vessels with deep neural networks. IEEE Trans. Med. Imaging 35, 2369–2380 (2016).
Article PubMed Google Scholar
Krizhevsky, A., Sutskever, I. & Hinton, G. E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25, 25 (2012).
Google Scholar
Long, J., Shelhamer, E. & Darrell, T. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3431–3440 (2015).
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5–9, 2015, Proceedings, Part III 18, 234–241 (Springer, 2015).
Fraz, M. M. et al. Blood vessel segmentation methodologies in retinal images—a survey. Comput. Methods Programs Biomed. 108, 407–433 (2012).
Article CAS PubMed Google Scholar
Dash, S. et al. Curvelet transform based on edge preserving filter for retinal blood vessel segmentation. Comput. Mater. Contin. 71, 25 (2022).
Google Scholar
Bellemo, V. et al. Artificial intelligence screening for diabetic retinopathy: The real-world emerging application. Curr. Diab. Rep. 19, 1–12 (2019).
Article Google Scholar
Gudivada, V. N., Pankanti, S., Seetharaman, G. & Zhang, Y. Cognitive computing systems: Their potential and the future. Computer 52, 13–18 (2019).
Article Google Scholar
Filgueira, R., Awaysheh, F. M., Carter, A., White, D. J. & Rana, O. Sparkflow: Towards high-performance data analytics for spark-based genome analysis. In 2022 22nd IEEE International Symposium on Cluster, Cloud and Internet Computing (CCGrid), 1007–1016 (IEEE, 2022).
Caderno, P. V. et al. Opera-gsam: Big data processing framework for umi sequencing at high scalability and efficiency. In 2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing Workshops (CCGridW), 160–167 (IEEE, 2023).
Srivani, M., Murugappan, A. & Mala, T. Cognitive computing technological trends and future research directions in healthcare—a systematic literature review. Artif. Intell. Med. 10, 2513 (2023).
Google Scholar
Kumar, A. et al. A novel smart healthcare design, simulation, and implementation using healthcare 4.0 processes. IEEE Access 8, 118433–118471 (2020).
Article Google Scholar
Sathananthavathi, V. & Indumathi, G. Encoder enhanced atrous (EEA) UNET architecture for retinal blood vessel segmentation. Cogn. Syst. Res. 67, 84–95 (2021).
Article Google Scholar
Sathananthavathi, V. & Indumathi, G. Case studies of cognitive computing in healthcare systems: Disease prediction, genomics studies, medical image analysis, patient care, medical diagnostics, drug discovery. Cogn. Intell. Big Data Healthc. 20, 303–326 (2022).
Google Scholar
Chen, L.-C., Papandreou, G., Schroff, F. & Adam, H. Rethinking atrous convolution for semantic image segmentation. arXiv:1706.05587 (arXiv preprint) (2017).
Wang, W., Wu, W. & Yin, J. A retinal blood vessel segmentation approach based on top-hat transformation. In 2017 2nd International Conference on Mechatronics and Information Technology (ICMIT 2017), 391–394 (Francis Academic Press, 2017).
Nahiduzzaman, M. et al. A novel method for multivariant pneumonia classification based on hybrid CNN-PCA based feature extraction using extreme learning machine with cxr images. IEEE Access 9, 147512–147526 (2021).
Article Google Scholar
Sharma, H., Drukker, L., Papageorghiou, A. T. & Noble, J. A. Machine learning-based analysis of operator pupillary response to assess cognitive workload in clinical ultrasound imaging. Comput. Biol. Med. 135, 104589 (2021).
Article PubMed PubMed Central Google Scholar
Vasuki, P., Kanimozhi, J. & Devi, M. B. A survey on image preprocessing techniques for diverse fields of medical imagery. In 2017 IEEE International Conference on Electrical, Instrumentation and Communication Engineering (ICEICE), 1–6 (IEEE, 2017).
Hudaib, A. A., Fakhouri, H. N. & Ghnemat, R. New methodology for microarray spot segmentation and gene expression analysis. Sci. Res. Essays 11, 126–134 (2016).
Article CAS Google Scholar
Anwar, S. M. et al. Medical image analysis using convolutional neural networks: A review. J. Med. Syst. 42, 1–13 (2018).
Article Google Scholar
Bankman, I. Handbook of Medical Image Processing and Analysis (Elsevier, 2008).
Google Scholar
Lakhwani, K., Murarka, P. & Chauhan, N. Color space transformation for visual enhancement of noisy color image. Int. J. ICT Manage. 3, 9–13 (2015).
Google Scholar
Bhairannawar, S. S. Efficient medical image enhancement technique using transform hsv space and adaptive histogram equalization. In Soft Computing Based Medical Image Analysis 51–60 (Elsevier, 2018).
Chapter Google Scholar
Al-Rawi, M., Qutaishat, M. & Arrar, M. An improved matched filter for blood vessel detection of digital retinal images. Comput. Biol. Med. 37, 262–267 (2007).
Article PubMed Google Scholar
Sato, Y. et al. Three-dimensional multi-scale line filter for segmentation and visualization of curvilinear structures in medical images. Med. Image Anal. 2, 143–168 (1998).
Article CAS PubMed Google Scholar
Alhadidi, B. & Fakhouri, H. N. Automation of iron deficiency anemia blue and red cell number calculating by intictinal villi tissue slide images enhancing and processing. In 2008 International Conference on Computer Science and Information Technology, 407–410 (IEEE, 2008).
Tang, Q., Liu, Y. & Liu, H. Medical image classification via multiscale representation learning. Artif. Intell. Med. 79, 71–78 (2017).
Article PubMed Google Scholar
Fakhouri, H. N., Alawadi, S., Awaysheh, F. M. & Hamad, F. Novel hybrid success history intelligent optimizer with gaussian transformation: Application in CNN hyperparameter tuning. Cluster Comput. 20, 1–23 (2023).
Google Scholar
Badrinarayanan, V., Kendall, A. & Cipolla, R. Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39, 2481–2495 (2017).
Article PubMed Google Scholar
Milletari, F., Navab, N. & Ahmadi, S.-A. V-net: Fully convolutional neural networks for volumetric medical image segmentation. In 2016 Fourth International Conference on 3D Vision (3DV), 565–571 (Ieee, 2016).
Rawat, W. & Wang, Z. Deep convolutional neural networks for image classification: A comprehensive review. Neural Comput. 29, 2352–2449 (2017).
Article MathSciNet PubMed Google Scholar
Soares, J. V., Leandro, J. J., Cesar, R. M., Jelinek, H. F. & Cree, M. J. Retinal vessel segmentation using the 2-d gabor wavelet and supervised classification. IEEE Trans. Med. Imaging 25, 1214–1222 (2006).
Article PubMed Google Scholar
Staal, J., Abràmoff, M. D., Niemeijer, M., Viergever, M. A. & Van Ginneken, B. Ridge-based vessel segmentation in color images of the retina. IEEE Trans. Med. Imaging 23, 501–509 (2004).
Article PubMed Google Scholar
Dash, S. et al. A hybrid method to enhance thick and thin vessels for blood vessel segmentation. Diagnostics 11, 2017 (2021).
Article PubMed PubMed Central Google Scholar
Dash, S. & Senapati, M. R. Enhancing detection of retinal blood vessels by combined approach of dwt, tyler coye and gamma correction. Biomed. Signal Process. Control 57, 101740 (2020).
Article Google Scholar
Dash, S. et al. Guidance image-based enhanced matched filter with modified thresholding for blood vessel extraction. Symmetry 14, 194 (2022).
Article ADS Google Scholar
Ilesanmi, A. E., Ilesanmi, T. & Gbotoso, A. G. A systematic review of retinal fundus image segmentation and classification methods using convolutional neural networks. Healthc. Anal. 20, 100261 (2023).
Article Google Scholar
Fraz, M. M. et al. An ensemble classification-based approach applied to retinal blood vessel segmentation. IEEE Trans. Biomed. Eng. 59, 2538–2548 (2012).
Article PubMed Google Scholar
Orlando, J. I., Fracchia, M., Del Rio, V. & del Fresno, M. Retinal blood vessel segmentation in high resolution fundus photographs using automated feature parameter estimation. In 13th International Conference on Medical Information Processing and Analysis, Vol. 10572, 313–325 (SPIE, 2017).
Melinscak, M., Prentasic, P. & Loncaric, S. Retinal vessel segmentation using deep neural networks. VISAPP 1, 577–582 (2015).
Google Scholar
Moccia, S., De Momi, E., El Hadji, S. & Mattos, L. S. Blood vessel segmentation algorithms-review of methods, datasets and evaluation metrics. Comput. Methods Programs Biomed. 158, 71–91 (2018).
Article PubMed Google Scholar
Sekou, T. B., Hidane, M., Olivier, J. & Cardot, H. From patch to image segmentation using fully convolutional networks—application to retinal images. arXiv:1904.03892 (arXiv preprint) (2019).
Li, Q. et al. A cross-modality learning approach for vessel segmentation in retinal images. IEEE Trans. Med. Imaging 35, 109–118 (2015).
Article PubMed Google Scholar
Zhang, J. et al. Lcu-net: A novel low-cost u-net for environmental microorganism image segmentation. Pattern Recogn. 115, 107885 (2021).
Article Google Scholar
Zhang, J., Li, C., Yin, Y., Zhang, J. & Grzegorzek, M. Applications of artificial neural networks in microorganism image analysis: A comprehensive review from conventional multilayer perceptron to popular convolutional neural network and potential visual transformer. Artif. Intell. Rev. 56, 1013–1070 (2023).
Article PubMed Google Scholar
Chen, H. et al. Il-mcam: An interactive learning and multi-channel attention mechanism-based weakly supervised colorectal histopathology image classification approach. Comput. Biol. Med. 143, 105265 (2022).
Article CAS PubMed Google Scholar
Li, X. et al. A comprehensive review of computer-aided whole-slide image analysis: From datasets to feature extraction, segmentation, classification and detection approaches. Artif. Intell. Rev. 55, 4809–4878 (2022).
Article Google Scholar
Fakhouri, H. N., Hudaib, A. & Sleit, A. Multivector particle swarm optimization algorithm. Soft. Comput. 24, 11695–11713 (2020).
Article Google Scholar
Fakhouri, H. N., Hamad, F. & Alawamrah, A. Success history intelligent optimizer. J. Supercomput. 20, 1–42 (2022).
Google Scholar
Beam, A. L. DRIVE: Digital Retinal Images for Vessel Extraction. https://www.kaggle.com/datasets/andrewmvd/drive-digital-retinal-images-for-vessel-extraction (2023). Accessed 15 Dec 2023.
Alharthi, R. S. CHASE-DB1. https://www.kaggle.com/datasets/rashasarhanalharthi/chase-db1 (2023). Accessed 15 Dec 2023.
Hoover, A., Kouznetsova, V. & Goldbaum, M. Locating blood vessels in retinal images by piecewise threshold probing of a matched filter response. IEEE Trans. Med. Imaging 19, 203–210 (2000).
Article CAS PubMed Google Scholar
Wagih, A. Retina Blood Vessel. https://www.kaggle.com/datasets/abdallahwagih/retina-blood-vessel (2023). Accessed 15 Dec 2023.
Niemeijer, M., Staal, J., Van Ginneken, B., Loog, M. & Abramoff, M. D. Comparative study of retinal vessel segmentation methods on a new publicly available database. In Medical Imaging 2004: Image Processing Vol. 5370 648–656 (SPIE, 2004).
Chapter Google Scholar
Martinez-Perez, M. E., Hughes, A. D., Thom, S. A., Bharath, A. A. & Parker, K. H. Segmentation of blood vessels from red-free and fluorescein retinal images. Med. Image Anal. 11, 47–61 (2007).
Article PubMed Google Scholar
Ramlugun, G. S., Nagarajan, V. K. & Chakraborty, C. Small retinal vessels extraction towards proliferative diabetic retinopathy screening. Expert Syst. Appl. 39, 1141–1146 (2012).
Article Google Scholar
You, X., Peng, Q., Yuan, Y., Cheung, Y.-M. & Lei, J. Segmentation of retinal blood vessels using the radial projection and semi-supervised approach. Pattern Recogn. 44, 2314–2324 (2011).
Article ADS Google Scholar
Marín, D., Aquino, A., Gegúndez-Arias, M. E. & Bravo, J. M. A new supervised method for blood vessel segmentation in retinal images by using gray-level and moment invariants-based features. IEEE Trans. Med. Imaging 30, 146–158 (2010).
Article PubMed Google Scholar
Dai, P. et al. A new approach to segment both main and peripheral retinal vessels based on gray-voting and gaussian mixture model. PLoS One 10, e0127748 (2015).
Article PubMed PubMed Central Google Scholar
Mendonca, A. M. & Campilho, A. Segmentation of retinal blood vessels by combining the detection of centerlines and morphological reconstruction. IEEE Trans. Med. Imaging 25, 1200–1213 (2006).
Article PubMed Google Scholar
Zhang, B., Zhang, L., Zhang, L. & Karray, F. Retinal vessel extraction by matched filter with first-order derivative of gaussian. Comput. Biol. Med. 40, 438–445 (2010).
Article PubMed Google Scholar
Li, Q., You, J. & Zhang, D. Vessel segmentation and width estimation in retinal images using multiscale production of matched filter responses. Expert Syst. Appl. 39, 7600–7610 (2012).
Article Google Scholar
Ricci, E. & Perfetti, R. Retinal blood vessel segmentation using line operators and support vector classification. IEEE Trans. Med. Imaging 26, 1357–1365 (2007).
Article PubMed Google Scholar
Karn, P. K., Biswal, B. & Samantaray, S. R. Robust retinal blood vessel segmentation using hybrid active contour model. IET Image Proc. 13, 440–450 (2019).
Article Google Scholar
Zhang, B., Huang, S. & Hu, S. Multi-scale neural networks for retinal blood vessels segmentation. arXiv:1804.04206 (arXiv preprint) (2018).
Fraz, M. M., Rudnicka, A. R., Owen, C. G. & Barman, S. A. Delineation of blood vessels in pediatric retinal images using decision trees-based ensemble classification. Int. J. Comput. Assist. Radiol. Surg. 9, 795–811 (2014).
Article PubMed Google Scholar
Roychowdhury, S., Koozekanani, D. D. & Parhi, K. K. Blood vessel segmentation of fundus images by major vessel extraction and subimage classification. IEEE J. Biomed. Health Inform. 19, 1118–1128 (2014).
Google Scholar
Roychowdhury, S., Koozekanani, D. D. & Parhi, K. K. Iterative vessel segmentation of fundus images. IEEE Trans. Biomed. Eng. 62, 1738–1749 (2015).
Article PubMed Google Scholar
Azzopardi, G., Strisciuglio, N., Vento, M. & Petkov, N. Trainable cosfire filters for vessel delineation with application to retinal images. Med. Image Anal. 19, 46–57 (2015).
Article PubMed Google Scholar
Chakraborti, T., Jha, D. K., Chowdhury, A. S. & Jiang, X. A self-adaptive matched filter for retinal blood vessel detection. Mach. Vis. Appl. 26, 55–68 (2015).
Article Google Scholar
Fan, Z. et al. A hierarchical image matting model for blood vessel segmentation in fundus images. IEEE Trans. Image Process. 28, 2367–2377 (2018).
Article ADS MathSciNet Google Scholar
Biswal, B., Pooja, T. & Bala Subrahmanyam, N. Robust retinal blood vessel segmentation using line detectors with multiple masks. IET Image Process. 12, 389–399 (2018).
Article Google Scholar
Jiang, Z., Zhang, H., Wang, Y. & Ko, S.-B. Retinal blood vessel segmentation using fully convolutional network with transfer learning. Comput. Med. Imaging Graph. 68, 1–15 (2018).
Article PubMed Google Scholar
Joshua, A. O., Nelwamondo, F. V. & Mabuza-Hocquet, G. Blood vessel segmentation from fundus images using modified u-net convolutional neural network. J. Image Graph. 8, 21–25 (2020).
Article Google Scholar
Zhou, L., Yu, Q., Xu, X., Gu, Y. & Yang, J. Improving dense conditional random field for retinal vessel segmentation by discriminative feature learning and thin-vessel enhancement. Comput. Methods Programs Biomed. 148, 13–25 (2017).
Article PubMed Google Scholar
Odstrcilik, J. et al. Retinal vessel segmentation by improved matched filtering: Evaluation on a new high-resolution fundus image database. IET Image Proc. 7, 373–383 (2013).
Article MathSciNet Google Scholar
Haller, M., Lenz, C., Nachtigall, R., Awayshehl, F. M. & Alawadi, S. Handling non-iid data in federated learning: An experimental evaluation towards unified metrics. In 2023 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech), 0762–0770 (IEEE, 2023).
Alkhabbas, F., Alawadi, S., Ayyad, M., Spalazzese, R. & Davidsson, P. Art4fl: An agent-based architectural approach for trustworthy federated learning in the IOT. In 2023 Eighth International Conference on Fog and Mobile Edge Computing (FMEC), 270–275 (IEEE, 2023).
Awaysheh, F. M., Alawadi, S. & AlZubi, S. Fliodt: A federated learning architecture from privacy by design to privacy by default over iot. In 2022 Seventh International Conference on Fog and Mobile Edge Computing (FMEC), 1–6 (IEEE, 2022).

Download references

Funding

Open access funding provided by Blekinge Institute of Technology.

Author information

Authors and Affiliations

Department of Data Science and Artificial Intelligence, The University of Petra, Amman, Jordan
Hussam N. Fakhouri
Department of Computer Science, Blekinge Institute of Technology, Karlskrona, Sweden
Sadi Alawadi
Computer Graphics and Data Engineering (COGRADE) Research Group, University of Santiago de Compostela, Santiago de Compostela, Spain
Sadi Alawadi
Institute of Computer Science, Delta Research Centre, University of Tartu, Tartu, Estonia
Feras M. Awaysheh
Internet of Things and People Research Center, Malmö University, Malmö, Sweden
Fahed Alkhabbas
Department of Computer Science and Media Technology, Malmö University, Malmö, Sweden
Fahed Alkhabbas
Virtual and Augment Reality Department, Faculty of Information Technology, University of Petra, Amman, Jordan
Jamal Zraqou

Authors

Hussam N. Fakhouri
View author publications
You can also search for this author in PubMed Google Scholar
Sadi Alawadi
View author publications
You can also search for this author in PubMed Google Scholar
Feras M. Awaysheh
View author publications
You can also search for this author in PubMed Google Scholar
Fahed Alkhabbas
View author publications
You can also search for this author in PubMed Google Scholar
Jamal Zraqou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors have contributed to the manuscript equally.

Corresponding author

Correspondence to Sadi Alawadi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fakhouri, H.N., Alawadi, S., Awaysheh, F.M. et al. A cognitive deep learning approach for medical image processing. Sci Rep 14, 4539 (2024). https://doi.org/10.1038/s41598-024-55061-1

Download citation

Received: 05 October 2023
Accepted: 20 February 2024
Published: 24 February 2024
DOI: https://doi.org/10.1038/s41598-024-55061-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.