Introduction

Epidermal thickness (ET) changes are associated with several skin diseases, such as vitiligo1, diabetic foot2, psoriasis, atopic dermatitis, basal cell carcinoma, squamous cell carcinoma, melanoma, and many more3,4,5,6,7,8,9.

The epidermis is the skin’s outermost layer and serves as a defense mechanism against a hostile environment. ET can be measured non-invasively from skin layer boundaries identified in images acquired by optical coherence tomography (OCT). Traditional OCT analysis extracts the layer border positions and subsequent structural characteristics manually, which can be laborious, time-consuming, and prone to errors. Hence, there is great value in an automatic, efficient, and accurate segmentation method to reduce errors and minimize the need for training in the interpretation of OCT images. When OCT is used in the clinic, the handheld probe is placed on the skin and moved to different parts of lesions to detect potential abnormalities. A fast analysis of skin layers is essential to complete a clinical examination and enable the physician to discuss results with the patient in real time.

Neural network models are commonly adopted to assist in this type of image analysis. Such methods include convolutional neural networks (CNN)10 and recurrent neural networks (RNN)11. Both of these neural frameworks have been applied to diverse medical image analysis tasks, such as brain tumor segmentation12, retinal blood vessel segmentation13, retinal lesion detection14, and skin segmentation15,16,17. A further refinement, which has shown increased accuracy, is the use of a hybrid structure combining a CNN model with two or more approaches. Chen et al.18 worked on semantic image segmentation using a deep CNN model with fully connected conditional random fields. Gopinath et al.19 utilized a CNN model for extracting layers of interest and edges from the images while, simultaneously, a bidirectional long short-term memory (BLSTM) network traced the layer boundaries on segments of retinal layers. Fang et al.20 applied a patch-wise CNN model combined with a graph search algorithm (CNN-GS) to segment retinal layer boundaries in OCT images of non-exudative age-related macular degeneration (AMD). Likewise, Kugelman et al.21 attempted the same composition but used an RNN model (RNN-GS) instead to segment retinal layers from healthy children and adult patients with AMD, which showed results comparable to the CNN-GS model. While CNN-GS and RNN-GS have demonstrated excellent accuracy for analyzing retinal layer boundaries, we have found that they are somewhat computationally expensive to implement for OCT skin image analysis. Some of the results contained herein were presented previously in a brief conference paper22, but due to considerable positive feedback, we here present a more thorough elucidation of the method developed and additional results. Here we thoroughly explain the testing of speed-up algorithms, the optimization of hyperparameters, the selection of an appropriate loss function, and, most significantly, the use of our novel pixel-skipping algorithm.

Below we describe a comprehensive modification of CNN-GS, which we dub CNN-GS-skin, and how it can be used to detect the boundaries of the epidermal layer, namely the stratum corneum (SC) and dermal-epidermal junction (DEJ), and in this way measure ET. We aim to develop a computationally efficient method while preserving reasonable accuracy in ET measurement. We then compare CNN-GS-skin to CNN-GS in terms of accuracy, number of parameters to be computed, and overall execution time. We also describe an implementation of skin segmentation using a widely used deep learning image analysis method, UNet. Finally, we compare position accuracy and ET measurements for different body sites, relative to manual segmentation, for CNN-GS-skin, CNN-GS, and UNet.

Methodology

Imaging system

We used a multi-beam, swept-source OCT (SS-OCT) system (Vivosight, Michelson Diagnostic Inc.). The OCT system is an FDA-approved machine with a hand-held scanning probe for skin imaging. The lateral and axial resolutions are 7.5 \(\upmu\)m and 10 \(\upmu\)m, respectively. The scan area of the OCT system is 6 mm (width) \(\times\) 6 mm (length) \(\times\) 2 mm (depth). A tunable broadband laser source (Santec HSL-2000-11-MDL), with a central wavelength of 1305 nm, successively sweeps through the optical spectrum; the light is directed to four separate interferometers, forming four consecutive confocal gates.

Dataset description

All of the imaging procedures and experimental protocols were approved and carried out according to the guidelines of the University of Illinois at Chicago’s Institutional Review Board (IRB). To implement the proposed method comprehensively and to ensure a diverse set of features, we created a large dataset of OCT images composed of 315 stacks of OCT images taken from the forearm, back of hand, forehead, neck, and palm of 63 healthy individuals. Each stack has 50 images from adjacent transverse locations, creating volume data. We used 5 images from each stack for processing. Each image is 460 \(\times\) 1500 pixels (1.5 mm \(\times\) 6 mm). Our dataset included a total of 1575 images; 70% (selected randomly and divided evenly among the five body sites) were used for CNN model training, and the remaining images were used for testing and validation. To prepare the images for model training and testing, each OCT image was manually annotated with two layer boundaries, labeled Boundary 1 for the SC, the upper bound of the epidermis, and Boundary 2 for the DEJ, the boundary that separates the epidermis and dermis. For each OCT image, manual segmentation was performed blindly by 4 independent medical professionals (two medical students and two dermatology fellows), all of whom had been trained in OCT image segmentation and interpretation and have experience in OCT interpretation. We used the median of the boundary locations marked blindly by the 4 independent annotators as the ground truth to train and test the models. In doing this, we observed a maximum deviation of 4 pixels, a minimum deviation of zero pixels, and an average deviation of 2 pixels across all of the images. Segments were delineated using pixel-wise labeling, where each boundary pixel was assigned a label (pixels are 4 \(\upmu\)m (width) \(\times\) 3.26 \(\upmu\)m (depth)).
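As an illustration of how the consensus ground truth and the inter-annotator deviation can be computed from the four manual annotations, the following sketch (array shapes and the function name are ours, not from the paper) takes the per-column median of the annotated boundary rows:

```python
import numpy as np

def consensus_boundary(annotations):
    """Combine per-column boundary rows from several annotators.

    annotations: array of shape (n_annotators, image_width) holding the
    row index each annotator marked for one boundary in one B-scan.
    Returns the per-column median (used as ground truth) and the
    per-column spread between annotators (in pixels).
    """
    annotations = np.asarray(annotations)
    ground_truth = np.median(annotations, axis=0)
    spread = annotations.max(axis=0) - annotations.min(axis=0)
    return ground_truth, spread

# Toy example: four annotators marking a 5-column-wide image.
marks = [[101, 102, 103, 103, 104],
         [100, 102, 102, 104, 104],
         [101, 101, 103, 103, 105],
         [102, 102, 103, 104, 104]]
gt, spread = consensus_boundary(marks)
print(gt, spread.max(), spread.mean())
```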

Figure 1
figure 1

Overview of CNN-GS framework. (a) Training CNN model. (b) Evaluating CNN-GS segmentation. Yellow block: stage 1—CNN-based patch training and classification (interior red block is the area selected to look for boundaries); blue block: stage 2—probability maps and graph search. Image courtesy of22.

Proposed framework

We implemented three different frameworks for automated ET detection to analyze the OCT images and identify the boundaries. The first, following Fang20, was a combination of a patch-wise CNN model and a graph search algorithm (CNN-GS). The second framework, CNN-GS-skin, modifies the parameters from Fang and implements additional compression techniques to reduce computational time without significant loss of accuracy. The third framework (for comparison of computational time and accuracy) was an implementation of UNet, described below.

The rationale for the use of CNN-GS is as follows: CNNs are more effective when looking at smaller sections of an image to find specific features. The use of small sections also leverages the fact that nearby pixels are more strongly related than distant ones, in contrast to the uniform treatment of pixels in traditional neural networks23. Graph search is an algorithm that systematically explores the vertices and edges of a graph; in Fang, as in our method, Dijkstra’s algorithm is used to implement the graph search. In Chiu et al.24, for example, an OCT image is taken as the graph, where each pixel represents a vertex and neighboring pixels are connected by edges. Each connecting edge is assigned a weight, termed the edge-weight, a user-defined value chosen according to the application. The shortest path found by the algorithm is a set of connected edges with a minimum total edge-weight. The framework is constructed using the Keras API with TensorFlow25. We implemented CNN-GS in Python. CNN-GS contains two stages: (1) patch-based CNN training and classification; and (2) probability maps and graph search. Figure 1 illustrates the outline of the CNN-GS framework; the colored blocks represent the two stages.

Stage 1: Patch-based CNN training and classification

Prior to the training phase, we normalized the images in the range between 0 and 126. To train the CNN model as a patch-based classifier, we created a dictionary of patches and corresponding labels from the training images as the input to train the model. These patches are overlapping patches of size 55\(\times\)55 pixels extracted from the normalized training images. Each patch is assigned a class label based on its center pixel: patches centered on a layer boundary are labeled 1 or 2 according to the boundary, and patches that are not centered on a layer boundary are labeled 0. The patchset was balanced by randomly selecting the same number of boundary and non-boundary patches from all patches. Both the patches and the patch labels are used to train the CNN model. Once the training process is completed, the CNN model is optimized by fine-tuning hyperparameters.
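A minimal sketch of this patch extraction, labeling, and balancing step (the function name, padding mode, and random seed are our own choices; in practice the patches are extracted from the cropped target region described below rather than the full image):

```python
import numpy as np

def build_patchset(image, boundary1, boundary2, patch_size=55, rng=None):
    """Extract overlapping patches centered on each pixel of a normalized
    B-scan and label them by their center pixel: 1 = SC boundary,
    2 = DEJ boundary, 0 = non-boundary.  boundary1/boundary2 give the
    boundary row for every column."""
    rng = rng if rng is not None else np.random.default_rng(0)
    half = patch_size // 2
    padded = np.pad(image, half, mode="reflect")
    patches, labels = [], []
    for col in range(image.shape[1]):
        for row in range(image.shape[0]):
            label = 1 if row == boundary1[col] else 2 if row == boundary2[col] else 0
            patches.append(padded[row:row + patch_size, col:col + patch_size])
            labels.append(label)
    patches, labels = np.array(patches), np.array(labels)
    # Balance: keep every boundary patch and an equal number of randomly
    # chosen non-boundary patches.
    boundary_idx = np.flatnonzero(labels != 0)
    background_idx = rng.choice(np.flatnonzero(labels == 0),
                                size=boundary_idx.size, replace=False)
    keep = np.concatenate([boundary_idx, background_idx])
    return patches[keep], labels[keep]
```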

CNN-GS, as implemented by Fang et al.20, is not particularly efficient for fast OCT skin image analysis. For a more practical implementation of CNN-GS, we aimed to develop a model which spends less computation time on execution, regardless of the training time, and can be implemented on a multicore central processing unit (CPU) while still preserving satisfactory classification outcomes. With these conditions in mind, we first examined the impact of various hyperparameters to speed up execution times. Figure 2 shows CNN-GS, containing three blocks of convolutional and pooling layers, and a fully connected block consisting of hidden layers and an output layer.

Figure 2
figure 2

CNN model using method described in20 with convolution layers, pooling layers, and fully connected layers. FM feature maps, FCL fully connected layer. Image courtesy of22.

To segment OCT images for training the CNN model, that is, to create the training patchsets, we first deleted the region at the top of the OCT image, above the SC, which corresponds to air. We detected the edge pixels from the gradient of the image, exploiting the strong contrast between the pixel intensities of air and the SC, and used non-local means filtering to reduce speckle27. This pre-processing assists in finding the approximate outermost layer boundary to constrain the target prediction area. To reduce errors from the edge detection procedure, we extended the detected outermost boundary by a few rows of pixels above it and fixed the number of rows beneath it, creating a smaller target image and thereby decreasing the target patchset size and the computation time for classification. The training patchset is then created from the modified target image. The output of the CNN model is three probability maps, corresponding to the three defined labels, and, for each patch, the class label with the highest probability. These predicted probability maps are required for Stage 2.
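A possible implementation of this pre-processing is sketched below with OpenCV; the filter strength, window sizes, and the numbers of rows kept above and below the detected surface are illustrative values, not those used in the paper:

```python
import cv2
import numpy as np

def crop_to_surface(bscan, rows_above=10, rows_below=200):
    """Roughly locate the air/SC interface in an 8-bit grayscale B-scan
    and crop a band around it."""
    # Non-local means filtering to suppress speckle before edge detection.
    denoised = cv2.fastNlMeansDenoising(bscan, None, h=10,
                                        templateWindowSize=7,
                                        searchWindowSize=21)
    # Vertical intensity gradient: the air/SC interface gives the strongest
    # positive step in each column.
    grad = np.diff(denoised.astype(np.int16), axis=0)
    surface = grad.argmax(axis=0)            # row of the strongest edge per column
    top = max(int(surface.min()) - rows_above, 0)
    bottom = min(int(surface.max()) + rows_below, bscan.shape[0])
    return bscan[top:bottom, :], top
```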

Stage 2: probability maps and graph search

Since an OCT image may consist of several layer boundaries, when segmenting a specific boundary, users are normally required to select or estimate the corresponding boundary start and end nodes for each per-class probability map constructed from the output of the machine learning model. Therefore, before directly creating the graph from the probability map of the target image, we implemented automatic endpoint initialization24, a method that bypasses the need for manual endpoint selection. This initialization is based on two assumptions: (1) the layer segmentation extends across the entire width of the image; and (2) Dijkstra’s algorithm preferentially finds the minimum-weight path. The edge-weight of the connected edges, \(w_{cn}\), is given by the equation:

$$\begin{aligned} w_{cn} = 2-(P_{c}+P_{n})+w_{min} \end{aligned}$$
(1)

where \(P_{c}\) and \(P_{n}\) are the probabilities (0–1) of the current pixel and its neighboring pixel, and \(w_{min}\) is set to \(1\times 10^{-5}\), a small positive number to avoid errors while applying graph theory. Accordingly, we added an additional column of nodes on both sides of the image and assigned these nodes the maximum probability of 1. This fixes the start and end nodes at the top left and bottom right of the image within the newly added columns, and the algorithm can traverse these columns vertically with minimal resistance until moving toward the particular layer creates an even lower-cost path.
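A minimal sketch of Eq. (1) and the automatic endpoint initialization (the function names are ours):

```python
import numpy as np

W_MIN = 1e-5  # small positive constant from Eq. (1)

def pad_probability_map(prob):
    """Add one column of probability-1 nodes on each side of a per-class
    probability map so the start and end nodes can be fixed at the
    top-left and bottom-right corners."""
    ones = np.ones((prob.shape[0], 1), dtype=prob.dtype)
    return np.hstack([ones, prob, ones])

def edge_weight(p_current, p_neighbor):
    """Edge weight between two neighboring pixels, Eq. (1):
    w_cn = 2 - (P_c + P_n) + w_min."""
    return 2.0 - (p_current + p_neighbor) + W_MIN
```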

Figure 3
figure 3

Overview of the graph search application with automatic endpoint initialization on the output probability maps of a sample OCT image.

Moreover, to reduce intervention by users, we do not limit the segmented area between the top and bottom layer boundaries, nor do we limit the segmentation direction. Every neighboring pixel (at most eight) is taken into consideration. Once the directed weighted graph is constructed from the modified probability map, we create an adjacency list to represent this finite graph. An adjacency list is used instead of an adjacency matrix because a larger OCT image creates a larger graph with more nodes, and an adjacency matrix would then require more memory storage and computation time28. In the adjacency list, each list describes a vertex and the set of its neighboring vertices with the assigned edge weights in the graph. In this way, the connected vertex with the lowest edge cost among all neighboring pixels can be easily found from the previously visited vertex. To delineate the shortest path, Dijkstra’s shortest path algorithm29 is implemented to traverse the whole image, without human intervention, and find the minimum-cost path. Once the image is segmented, the two additional columns are removed, restoring the original image size for analysis and preventing errors introduced by endpoint initialization. The shortest-path results, after removing the additional columns, are presented as the final predicted layer boundaries. The visualization of the graph search process is shown in Fig. 3.
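The graph-search step can be sketched as follows: Dijkstra's algorithm with a priority queue over the padded probability map, using 8-connected neighbors and the Eq. (1) weights. For brevity this sketch keeps distances in dictionaries rather than an explicit adjacency list and repeats the column padding inline so that it is self-contained; all names are ours:

```python
import heapq
import numpy as np

def trace_boundary(prob, w_min=1e-5):
    """Delineate one layer boundary from its probability map with
    Dijkstra's algorithm.  Returns the boundary row for every original
    column after dropping the two added endpoint columns."""
    rows = prob.shape[0]
    # Extra probability-1 columns on both sides (automatic endpoint init).
    padded = np.hstack([np.ones((rows, 1)), prob, np.ones((rows, 1))])
    cols = padded.shape[1]
    start, goal = (0, 0), (rows - 1, cols - 1)
    dist, prev = {start: 0.0}, {}
    heap = [(0.0, start)]
    while heap:
        d, (r, c) = heapq.heappop(heap)
        if (r, c) == goal:
            break
        if d > dist.get((r, c), np.inf):
            continue
        for dr in (-1, 0, 1):                      # at most eight neighbors
            for dc in (-1, 0, 1):
                nr, nc = r + dr, c + dc
                if (dr or dc) and 0 <= nr < rows and 0 <= nc < cols:
                    nd = d + 2.0 - (padded[r, c] + padded[nr, nc]) + w_min  # Eq. (1)
                    if nd < dist.get((nr, nc), np.inf):
                        dist[(nr, nc)] = nd
                        prev[(nr, nc)] = (r, c)
                        heapq.heappush(heap, (nd, (nr, nc)))
    # Walk the shortest path back, keep one row per column, and drop the
    # two added columns before reporting the boundary.
    boundary, node = {}, goal
    while node in prev:
        boundary.setdefault(node[1], node[0])
        node = prev[node]
    return np.array([boundary[c] for c in range(1, cols - 1)])
```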

UNet (for comparative study)

To compare the performance of our proposed CNN-GS algorithm to a semantic image segmentation method, we employed a UNet architecture. UNet consists of an encoder and decoder linked together via skip connections. We utilized the UNet in30, but used padded 3\(\times\)3 convolutions instead of unpadded convolutions in order to preserve spatial dimensions. For implementing CNN-GS and CNN-GS-skin, approximately 1200 images are sufficient to train the model, but UNet requires a larger number of training images. To artificially increase the number of training examples, we augmented the data in the training set using random horizontal flips and elastic deformations at each iteration. Elastic deformations are achieved using a piecewise affine algorithm that places a 3\(\times\)3 grid of points on the image and randomly moves the neighborhood of these points via affine transformations. These modifications are reasonable given the physical properties of skin. The augmented images are then randomly cropped into 464\(\times\)256 pixel patches. This large patch size allows the network to capture context throughout the entire height of the image. A batch size of 4 is used.
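A sketch of this augmentation pipeline using scikit-image; the control-point jitter amplitude and interpolation settings are assumptions, and images smaller than the crop size would need padding first:

```python
import numpy as np
from skimage.transform import PiecewiseAffineTransform, warp

def augment(image, mask, rng, crop=(464, 256), jitter=8.0):
    """Random horizontal flip, piecewise-affine elastic deformation, and
    random crop, applied identically to a B-scan and its label mask."""
    if rng.random() < 0.5:                               # horizontal flip
        image, mask = image[:, ::-1], mask[:, ::-1]
    h, w = image.shape
    # 3x3 grid of control points, each randomly displaced.
    src = np.array([[x, y] for y in np.linspace(0, h - 1, 3)
                    for x in np.linspace(0, w - 1, 3)])
    dst = src + rng.normal(0.0, jitter, src.shape)
    tform = PiecewiseAffineTransform()
    tform.estimate(src, dst)
    image = warp(image, tform, order=1, preserve_range=True)
    mask = warp(mask, tform, order=0, preserve_range=True)  # nearest for labels
    # Random crop to the training patch size.
    top = rng.integers(0, max(h - crop[0], 0) + 1)
    left = rng.integers(0, max(w - crop[1], 0) + 1)
    return (image[top:top + crop[0], left:left + crop[1]],
            mask[top:top + crop[0], left:left + crop[1]])
```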

To segment the epidermis, we labeled all pixels between Boundary 1 and Boundary 2 as epidermis. Pixels above Boundary 1 are labeled as air, and pixels below Boundary 2 are labeled as tissue. To overcome the issue of heavily imbalanced classes and to force the model to learn boundaries, we used the weighted cross-entropy loss function30. The model was optimized using a stochastic gradient descent optimizer with a high momentum of 0.99. The initial learning rate was set automatically using a learning rate finder algorithm31. The model was trained until the validation loss started to plateau or degrade. The output of UNet is a pixel map containing raw, unnormalized scores for each class. We applied a softmax function to the scores to obtain the relative class membership probabilities. We then labeled pixels with over 50% probability of belonging to the epidermis class as epidermis. Finally, for each column, we labeled the top-most epidermis pixel as Boundary 1 and the bottom-most epidermis pixel as Boundary 2.
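A minimal sketch of this post-processing step (the class indices and function name are hypothetical):

```python
import numpy as np

AIR, EPIDERMIS, TISSUE = 0, 1, 2       # hypothetical class indices

def boundaries_from_scores(scores):
    """Turn UNet per-pixel scores into Boundary 1 and Boundary 2.

    scores: array of shape (n_classes, H, W) of raw, unnormalized scores.
    Returns two length-W arrays of row indices (-1 where a column has no
    epidermis pixel)."""
    # Softmax over the class axis -> relative class membership probabilities.
    e = np.exp(scores - scores.max(axis=0, keepdims=True))
    prob = e / e.sum(axis=0, keepdims=True)
    epi = prob[EPIDERMIS] > 0.5                  # pixels labeled epidermis
    h, w = epi.shape
    b1 = np.full(w, -1)
    b2 = np.full(w, -1)
    for col in range(w):
        rows = np.flatnonzero(epi[:, col])
        if rows.size:
            b1[col] = rows[0]                    # top-most epidermis pixel -> SC
            b2[col] = rows[-1]                   # bottom-most -> DEJ
    return b1, b2
```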

Human study

Informed consent was obtained from all subjects and/or their legal guardian(s).

Results and discussion

This section provides results from each stage of the CNN-GS methodology described in “Proposed framework”. The first part focuses on how the parameters of the proposed model structure were chosen and on the performance of patch classification; the second part presents the outcome of the segmentation after applying graph search to the probability maps generated by the trained model. Also included is a report on the accuracy of ET determinations using CNN-GS, CNN-GS-skin, and UNet for a validation set of OCT images (not used in training the models).

CNN configuration optimization

The computational time for predicting ET across a single image (B-scan) using CNN-GS was found to be 60 s. The motivation for developing CNN-GS-skin was to reduce the execution time to less than 1 s while preserving the accuracy as much as possible; we allowed for a 3% decrease in accuracy. To achieve this goal, we developed a revised model with fewer parameters and implemented the additional processes described below. The probability maps generated by our revised trained model are intermediate results, so some error was acceptable. Using this standard, we gradually adjusted the hyperparameters and implemented additional processes to obtain CNN-GS-skin (the final proposed model).

The model was tuned with a batch size of 512, and the optimum configuration was selected. We used categorical cross-entropy loss and the RMSprop optimizer32 for our model. All the models were trained identically, as described in Stage 1 of “Proposed framework”. The model was trained until the validation loss started to plateau or degrade; the degradation rate was measured empirically. Increasing epochs and batch sizes did not significantly reduce the training loss but increased the training computation and memory cost. In addition, to avoid overfitting the model to the patchset, dropout33 was applied, whereby a random subset of nodes in a layer is ignored in each epoch, and early stopping was used to determine when to stop training.
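A sketch of this training setup in Keras; the patience value and epoch limit are assumptions, and the labels are assumed to be one-hot encoded over the three patch classes:

```python
import tensorflow as tf

def train_patch_classifier(model, train_patches, train_labels,
                           val_patches, val_labels):
    """Train a patch classifier with categorical cross-entropy, RMSprop,
    a batch size of 512, and early stopping on the validation loss."""
    model.compile(optimizer=tf.keras.optimizers.RMSprop(),
                  loss="categorical_crossentropy",
                  metrics=["accuracy"])
    early_stop = tf.keras.callbacks.EarlyStopping(monitor="val_loss",
                                                  patience=5,
                                                  restore_best_weights=True)
    return model.fit(train_patches, train_labels,
                     batch_size=512,
                     epochs=100,           # upper bound; early stopping ends training
                     validation_data=(val_patches, val_labels),
                     callbacks=[early_stop])
```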

Parameter tuning

The best configuration of the model depends on several factors, including the chosen dataset, the model structure, and specific hyperparameters. The CNN model structure contains various tunable hyperparameters23, such as filter size and count, and pooling kernel size and type. To demonstrate the effects of different hyperparameters, in addition to comparing classification accuracy, we also compared the average execution time on target images from our model against the results of the baseline (CNN-GS) model. The execution time is calculated using the target patchset. The sections below show the results that significantly influenced the model structure. We followed a sequential optimization in which one parameter is modified at a time while the remaining model settings stay constant. Once a better setting is selected, we update the setup and tune the next parameter, repeating until the last parameter has been tuned.

Table 1 Effect of modifying parameters of the CNN-GS model for use in identifying epidermal boundaries.

Convolutional layer filter size: Each convolutional layer operates on the input data using filters. The filters are feature detectors; each filter convolves over the entire input and generates one feature map accordingly. The size of the output feature map depends on the filter size. The results of reducing the filter size from 5\(\times\)5 in the baseline model (CNN-GS) to 3\(\times\)3 in all three convolutional layers are shown in Table 1. This change reduced the number of parameters somewhat and slightly improved the execution time.

Filter count: After the selection of filter size, we considered the filter count. Intuitively, the more filters, the more explanatory factors are found and the more the network learns, but learning a particular system may not require a large number of features: the most suitable number generally must be learned from experience. The experimental results are shown in Table 1. Reducing the filter count sacrifices some features extracted from the input; however, it massively reduces the number of parameters and the amount of computation needed in the following convolutions. We selected a filter count of (8, 16, 16) because, with a relatively large training set, we did not want to lose too much information from the input. This setting already eliminates about 78% of the parameters and decreases execution time by a factor of 5, with < 2% loss of accuracy.

Pooling kernel size: The function of pooling is to continuously reduce the input dimensionality, leading to a smaller number of parameters and computations in the network. The maximum pooling layer downsamples the input by keeping the maximum activation in a given window. For feature extraction, we decided to utilize a larger filter size in the convolutional layer to capture sensible features and a small pooling kernel size to prevent useful information from being removed. Since we had already significantly decreased the number of parameters in the convolutional layers, to avoid losing valuable data, we decreased the pooling kernel size. As can be seen in Table 1, the smaller the pooling kernel, the larger the number of parameters the model must learn, leading to a longer training time in each epoch; however, it also leads to higher accuracy. Considering the trade-off between the additional training cost and the potential loss of valuable information, we opted for the smaller pooling kernel size and accepted a slight increase in execution time.

Units for the fully connected (FC) layer: Unlike convolutional layers, FC layers do not share parameters. Instead, each unit connects to every node of the previous layer, so FC layers account for the majority of parameters in the model among all layers. Therefore, applying dropout and controlling the number of units in the FC layers can greatly reduce the number of parameters in the model. While CNN-GS models utilize two FC layers, only the unit value of the first FC layer is tuned, since the unit value of the final FC layer, the output layer, is used to reduce the output to a single vector of probabilities (to identify whether the pixel is a boundary pixel or not). Table 1 shows that modifying the unit values can decrease parameters, as predicted by theory, and results in reduced training and execution time. The smaller unit values (16, 8) lower the classification accuracy and the number of parameters, but not the execution time. For further experiments, we selected a unit value of 32, as it has less than a 1% accuracy difference but reduces the number of parameters in the model by about 50%. In summary, we have demonstrated how simply tuning the parameters of the CNN-GS model enables a reduction in the number of parameters and in execution time. This improvement, while significant, is not sufficient to reach our goal of < 1 s, so we explore further modifications to the model below.
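A minimal Keras sketch of the resulting patch classifier with the hyperparameters selected above (3\(\times\)3 filters, filter counts of (8, 16, 16), and a 32-unit FC layer); the pooling kernel size and dropout rate shown here are assumptions, not values taken from Table 1:

```python
from tensorflow.keras import layers, models

def build_tuned_model(patch_size=55, n_classes=3):
    """Patch classifier with three convolution/pooling blocks and a small
    fully connected block, as selected by the parameter tuning."""
    return models.Sequential([
        layers.Input(shape=(patch_size, patch_size, 1)),
        layers.Conv2D(8, 3, activation="relu", padding="same"),
        layers.MaxPooling2D(2),                   # assumed 2x2 pooling
        layers.Conv2D(16, 3, activation="relu", padding="same"),
        layers.MaxPooling2D(2),
        layers.Conv2D(16, 3, activation="relu", padding="same"),
        layers.MaxPooling2D(2),
        layers.Flatten(),
        layers.Dense(32, activation="relu"),
        layers.Dropout(0.5),                      # assumed dropout rate
        layers.Dense(n_classes, activation="softmax"),
    ])
```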

Loss function optimization

While training machine learning models, optimization algorithms are used to change attributes of the model, such as the learning rate and weights, to reduce losses and get the best possible results. The current error of the model has to be estimated repeatedly so that the weights can be updated to improve model learning before moving to the next evaluation. This process requires loss functions. The choice of a suitable loss function depends on the predictive modeling problem. Our model performs a multiclass classification in which each patch is assigned to one of three classes; therefore, we compared the effect of applying two different loss functions commonly used for multiclass classification. To perform the optimizations up to this point, we used the well-known loss function ‘categorical_crossentropy’. To probe our choice of loss function, we reran the analysis using ‘kullback_leibler_divergence’ (KL divergence). Cross entropy and KL divergence both measure the difference between two probability distributions, but cross entropy evaluates the number of bits needed to encode events from one distribution using the optimal code for another distribution, while KL divergence measures the information loss when one distribution is used to approximate another. We compared the use of cross entropy and KL divergence with the previously identified best parameters from Table 1 and found that the two tests had the same training time per epoch (115 s) and execution time (9 s). The training performance of these two loss functions is very similar, but cross entropy had slightly higher training, validation, and test accuracy (0.10–0.24% improvement), so we retained ‘categorical_crossentropy’ for our model.

Pruning and quantization

When optimizing a classification problem, it is always valuable to test different compression methods to see whether they can reduce model complexity and computational cost without sacrificing test accuracy or execution speed. Network pruning removes redundant weights or parameters to which performance is not sensitive in a dense model. Pruning can lead to a net reduction in inference time, but the degree to which it reduces time can vary widely depending on the specific system. Network quantization compresses the original network by reducing the number of bits used to represent model weights. By doing so, the weights can be quantized to a small number of bits and the size of the model can be significantly reduced34. The results shown in Table 2 demonstrate that the original model, without pruning or quantization, shows slightly better results and shorter execution times. The fact that pruning did not make a significant difference suggests the model is already close to the minimal number of parameters needed to solve the task. Surprisingly, the quantized model has the longest execution time. This may be due to our hardware, which cannot natively operate on quantized data as required for deep learning inference.
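For reference, the two compression methods we tested can be sketched with the TensorFlow Model Optimization toolkit and the TFLite converter; the sparsity schedule values are illustrative, and the pruned model must still be recompiled and fine-tuned (with the UpdatePruningStep callback) before evaluation:

```python
import tensorflow as tf
import tensorflow_model_optimization as tfmot

def prune_model(model):
    """Wrap a trained Keras model for magnitude pruning: weights below a
    gradually increasing sparsity target are zeroed during fine-tuning."""
    schedule = tfmot.sparsity.keras.PolynomialDecay(initial_sparsity=0.2,
                                                    final_sparsity=0.8,
                                                    begin_step=0,
                                                    end_step=1000)
    return tfmot.sparsity.keras.prune_low_magnitude(model,
                                                    pruning_schedule=schedule)

def quantize_model(model):
    """Post-training quantization: convert the Keras model to a TFLite
    model with reduced-precision weights."""
    converter = tf.lite.TFLiteConverter.from_keras_model(model)
    converter.optimizations = [tf.lite.Optimize.DEFAULT]
    return converter.convert()
```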

Table 2 Comparison of loss function applied in the model.

Pixel skipping

Up to this point, we had simply optimized hyperparameters of the CNN-GS method and tested variations in the loss function, pruning, and quantization to reduce execution time. However, at 9 s, we were nowhere near our goal of CPU analysis in under 1 s. We reasoned that reducing the size of the input the model must predict on would save computation cost and reduce execution time. To preserve the predicted results, instead of directly decreasing the number of predicted patches, we implemented a simple technique called pixel skipping. Pixel skipping takes a pixel from the image and skips several pixels in between to get the next pixel, repeating until the last pixel in the image. Pixel skipping can be a very effective technique for analyzing the epidermis, where the main outputs of interest are the boundaries of the epidermis. The effect of pixel skipping is to create a smaller target patchset whose patches are centered on the selected pixels. After the model has predicted these selected pixels, we introduce a \(kernel\;mask = \left[ \begin{smallmatrix} 0 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & 0 \end{smallmatrix} \right]\) to calculate the mean of the sum of its neighboring pixels’ probabilities. If the probability is low in all of these pixels, the algorithm continues skipping. Figure 4 shows a simple example of applying pixel skipping to an OCT image.
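A minimal sketch of pixel skipping and the kernel-mask fill-in, shown here with a step size of 2 and a random stand-in for the CNN output; how pixels whose neighbors were all skipped are handled is our assumption:

```python
import numpy as np
from scipy.ndimage import convolve

# Cross-shaped kernel mask: a skipped pixel receives the mean probability
# of its classified 4-neighbors.
KERNEL = np.array([[0, 1, 0],
                   [1, 0, 1],
                   [0, 1, 0]], dtype=float)

def fill_skipped(prob, visited):
    """prob holds CNN probabilities at the visited pixels (zeros elsewhere);
    visited is a boolean mask of the pixels actually classified."""
    neighbor_sum = convolve(prob, KERNEL, mode="constant")
    neighbor_cnt = convolve(visited.astype(float), KERNEL, mode="constant")
    filled = np.divide(neighbor_sum, neighbor_cnt,
                       out=np.zeros_like(prob), where=neighbor_cnt > 0)
    return np.where(visited, prob, filled)

# Pixel skipping with step size 2: classify only every other pixel in each
# direction, then fill the gaps with the kernel mask.
h, w = 8, 8
visited = np.zeros((h, w), dtype=bool)
visited[::2, ::2] = True
prob = np.zeros((h, w))
prob[visited] = np.random.default_rng(0).random(int(visited.sum()))  # stand-in for CNN output
print(fill_skipped(prob, visited))
```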

Figure 4
figure 4

The scheme of pixel skipping. The red values in the output probability map are the predicted results of selected pixels from the CNN model. Next, the kernel is applied to calculate the mean of the sum of its neighboring pixels’ probability.

Figure 5
figure 5

Visualization of results (a) with pixel skipping (step size = 4), (b) results after applying kernel matrix, and (c) without pixel skipping.

Figure 5 illustrates the pixel skipping concept. In Fig. 5a, the results were predicted on the pixel-skip patchset only. In Fig. 5b, we display the results after putting the missing pixels back with the kernel mask, creating a more saturated image, and Fig. 5c shows the original results, without pixel skipping. With a step size of 2, which takes one pixel and skips the next, the full target patchset is reduced to half its size; with a step size of 4, it is reduced to a quarter. We then tested these three patchsets with our selected configuration and found no loss of accuracy for boundary selection. The results are presented in Table 3. The smaller patchset size improves the execution time significantly.

Table 3 Average prediction processing time versus patchset size.

Effect of parallel processing on execution time

To further reduce the execution time, we used parallel processing on a multicore CPU. We note that the Python language predates multicore CPUs and has a global interpreter lock (GIL), meaning only one thread can be executed at a time. Because Python does not use multiple cores natively, the GIL is a performance bottleneck in CPU-bound workloads. We designed a workaround using multiprocessing, which utilizes all available CPU cores on the user’s machine and handles several tasks in parallel. We created multiple processes, each loaded with a trained model and a chunk of the patchset, to achieve parallelism. At first, multiprocessing did not significantly speed up the process, because creating copies of the trained model and loading the deep learning model setup on every process increased inference time and slowed the system down. After some experimentation, we found that the installed TensorFlow-CPU version affects the inference time; therefore, TensorFlow-CPU versions greater than 2.3 must be used to achieve the speed-up. Our hardware and OS configuration was an Intel(R) Xeon(R) Gold 624R CPU @ 3.00 GHz (x86_64, DDR5 6000 MHz RAM) with TensorFlow-CPU version 2.9.4 and Keras version 2.4.3. With this setup, we reran the model using our three sizes of target patchsets. As shown in Table 3, a smaller input patchset decreased the execution time, and the patchsets processed with multiprocessing show further improved execution times.
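A sketch of the multiprocessing workaround: the target patchset is split into chunks and each worker process loads its own copy of the trained model (the chunking strategy, worker count, and model-loading call are illustrative):

```python
import numpy as np
from multiprocessing import Pool

_model = None

def _init_worker(model_path):
    """Each worker process loads its own copy of the trained model once."""
    global _model
    import tensorflow as tf            # imported inside the worker process
    _model = tf.keras.models.load_model(model_path)

def _predict_chunk(chunk):
    return _model.predict(chunk, verbose=0)

def parallel_predict(patches, model_path, n_workers=8):
    """Classify the target patchset in parallel on a multicore CPU."""
    chunks = np.array_split(patches, n_workers)
    with Pool(n_workers, initializer=_init_worker,
              initargs=(model_path,)) as pool:
        results = pool.map(_predict_chunk, chunks)
    return np.concatenate(results)
```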

Table 4 CNN structure comparison of CNN-GS and CNN-GS-skin.

In summary, based on the experimental findings presented in “Parameter tuning”, “Loss function optimization”, “Pruning and quantization” and “Pixel skipping”, a tuned CNN configuration was achieved. An overview of the baseline (CNN-GS) and optimized (CNN-GS-skin) configurations is given in Table 4. The size of the filters in the convolutional layers, as well as the pooling kernel size, was modified; fewer filters and fewer FC-layer units were also used. Besides refinement of the model, reduction of the patchset size via pixel skipping and multiprocessing were also implemented. In addition to the analytical information in Tables 1, 2 and 3, where the accuracy is analyzed patch by patch, we provide additional data below to support our model performance.

Figure 6
figure 6

ROC curve of CNN-GS and CNN-GS-skin for (a) non-boundary (epidermis), (b) boundary 1 (SC), and (c) boundary 2 (DEJ).

The receiver operating characteristic (ROC) curve demonstrates how well a model can distinguish between classes; the higher the area under the curve (AUC), the better the model. The ROC curves in Fig. 6 were plotted for the SC (Boundary 1), the epidermis (non-boundary), and the DEJ (Boundary 2). The numerical measurements are presented in Table 5. The numerical results of the CNN-GS-skin model show only minor differences from those of the CNN-GS model, and these differences cause no significant reduction in accuracy. CNN-GS-skin has promising results and high efficiency: it uses about 90% fewer parameters while keeping an average testing accuracy of 94.68%, only \(\sim\)2% less than CNN-GS. More importantly, the proposed model runs, in the test phase, in less than 0.5 s.

Table 5 Classification report of CNN-GS and CNN-GS-skin.

Segmentation performance analysis

Figure 7
figure 7

Segmentation results of the CNN-GS implementation on a skin OCT image. (a) Sample skin OCT image. Scale bar, 200 \(\upmu\)m. (b) Intermediate results generated by the proposed model. (c) Patch classification and (d) layer segmentation on Boundary 1. (e) Patch classification and (f) layer segmentation on Boundary 2. (g) Final segmentation of the target image. (h) Magnified view (\(\sim\)3\(\times\)) of the segmentation in (g).

Previously, accuracy was determined for each analyzed patch. In this section, the patch results are converted into segmentation performance, which is analyzed for the different frameworks and compared with the manual segmentation ground truth. The boundaries are delineated through the procedure described in Stage 2 of the CNN-GS framework. The visual results of the CNN-GS methodology overlaid on the target image are illustrated in Fig. 7. To analyze the predicted segmentation, we took manual segmentation as the gold standard and calculated the similarity between predicted and manual segmentation. Note that manual segmentation is a somewhat subjective process (results of manual segmentation varied by 0–4 pixels among the 4 independent analysts, and the ground truth was defined as the median result for each pixel). For each boundary in the OCT image, we recorded the positions of both the predicted and the manual segmentation and calculated the error in terms of pixels; from these differences, the mean difference and standard deviation were calculated. The position accuracy (Eq. (3)) was also measured as the ratio of the number of boundary positions predicted correctly to the total number of labeled pixels. Please note: the axial pixel size is \(\approx\) 3.25 \(\upmu\)m and the system axial resolution is 10 \(\upmu\)m. To account for human bias in manual segmentation, we calculated the visual error tolerance, indicating the acceptable error range for humans when analyzing the OCT boundaries. For this experiment, we shifted boundaries by 1–4 pixels and checked the results with seven experts familiar with OCT imaging, asking them to compare the modified image to their results. Their responses indicated that predicted positions are generally satisfactory if within 2 pixels (Eq. (2)).

$$\begin{aligned} Correct\;Position = |P_{pred} - P_{man}|\le 2 \end{aligned}$$
(2)
$$\begin{aligned} Position\;Accuracy = \frac{\#\;of\;Correct\;Position}{Total\;Labeled\;Pixels} \end{aligned}$$
(3)

where \(P_{pred}\) is the position index of the current predicted pixel and \(P_{man}\) is the corresponding manually segmented pixel position.
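The two metrics can be sketched as follows, with per-column boundary positions given as arrays (the function names are ours):

```python
import numpy as np

def position_accuracy(pred, manual, tol=2):
    """Eq. (2)/(3): a predicted boundary position counts as correct when it
    lies within `tol` pixels of the manual position; accuracy is the
    fraction of labeled positions predicted correctly."""
    pred, manual = np.asarray(pred), np.asarray(manual)
    correct = np.abs(pred - manual) <= tol
    return correct.mean()

def border_position_error(pred, manual):
    """Mean and standard deviation of the per-column position difference
    (in pixels) between predicted and manual segmentation."""
    diff = np.abs(np.asarray(pred) - np.asarray(manual))
    return diff.mean(), diff.std()
```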

Figure 8
figure 8

Visual comparisons between (a) (i) CNN-GS (ii) \(\sim\)3\(\times\) magnification, (b) (i) CNN-GS-skin (ii) \(\sim\)3\(\times\) magnification, and (c) (i) manual segmentation (ii) \(\sim\)3\(\times\) magnification. Contour of boundaries for (d) CNN-GS and (e) CNN-GS-skin versus manual segmentation.

Table 6 Border position error (in pixels) comparing CNN-GS and CNN-GS-skin with manual segmentation, for each layer boundary in skin OCT images.

To assess the ability of CNN-GS-skin, we compared it not only to manual segmentation but also to CNN-GS. As the results in Table 6 show, for Boundary 1 (the SC) the position accuracy is very high for both CNN-GS and CNN-GS-skin. For Boundary 2 (the DEJ), both models performed less well. In fact, Boundary 2 is not intuitively visible for people to delineate: even when we combined several annotations of the same image, the results still fluctuated. However, even though CNN-GS-skin performs worse than CNN-GS during patch classification, the results show a negligible difference in boundary delineation (overall less than 1% reduction in accuracy). This supports our intuition that the system can tolerate a 2–3% decrease in patch classification accuracy without significantly reducing ET boundary detection. To demonstrate this conclusion, the segmented images using both models and manual segmentation are shown in Fig. 8. Also plotted are the contours of the two boundaries to show how similar our predicted results (CNN-GS-skin) are to the predictions of CNN-GS and to the manual segmentation. As shown, the second boundary is more divergent than the first, which results in lower position accuracy, but it remains within the tolerance of manual annotation.

To demonstrate the computational strength of our CNN-GS approach, we compared our results to UNet, an encoder-decoder architecture that eliminates the pre-processing and patch-creation steps. UNet is a more complicated model that is typically trained on a GPU; however, using the trained UNet introduced in “UNet (for comparative study)”, we processed our testing images on a CPU and obtained an execution time of 27.57 s with a standard deviation of 0.1522 s; this execution time might be improved by incorporating some of the speed-up algorithms. In terms of execution time, CNN-GS-skin (< 0.5 s) far outperforms UNet. To evaluate the accuracy of the ET measurement, we defined the parameter “ET Accuracy” as shown in Eq. (4).

$$\begin{aligned} ET\;Accuracy = \frac{\#\;of\;Img_{correct}}{\#\;of\;Test\;Images} \end{aligned}$$
(4)

where \(Img_{correct}\) represents the predicted OCT images whose ET lies within the standard deviation of the manually segmented ET.
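A sketch of Eq. (4); the exact tolerance used here (within one standard deviation of the manual measurement) follows our reading of the definition above:

```python
import numpy as np

def et_accuracy(pred_et, manual_et, manual_std):
    """Eq. (4): fraction of test images whose predicted epidermal thickness
    lies within the standard deviation of the manually measured thickness."""
    pred_et, manual_et = np.asarray(pred_et), np.asarray(manual_et)
    correct = np.abs(pred_et - manual_et) <= manual_std
    return correct.sum() / pred_et.size
```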

Table 7 Comparison with manual segmentation of position accuracy (Acc.) at stratum corneum (SC) and dermal epidermal junction (DEJ) and epidermal thickness (ET) accuracy.
Figure 9
figure 9

Application of CNN-GS-skin to skin OCT images: five sample images from different body sites. Original OCT images with segmented results for (a) forearm, (b) back of hand, (c) forehead, (d) neck, and (e) palm.

The robustness and applicability of our algorithm on different body sites are visually demonstrated in Fig. 9. For a detailed analysis, we calculated the position accuracy and epidermal thickness by body site in Table 7, comparing CNN-GS-skin with CNN-GS and UNet to verify our methodology. Defining the performance of our algorithm as the absolute difference between the average value obtained from each model and that obtained from manual segmentation, divided by the manual segmentation result, we reached \(\approx\)95% (0.12) accuracy across all body sites (i.e., OCT images taken from the forearm, back of hand, forehead, neck, and palm of 63 healthy individuals). Additionally, the bar chart in Fig. 10 plots the epidermal thickness with its standard deviation to compare predicted and manual segmentation, further supporting the competitive performance of our proposed model. From these results, even though CNN-GS has slightly better performance, the average predicted epidermal thickness for each location is similar to the manually segmented thickness. We observed variability of ET measurement accuracy across different skin locations. We note that these values of epidermal thickness are well within published values for these body sites35,36,37.

Figure 10
figure 10

Comparison of epidermal thickness (in pixels) between the baseline model (CNN-GS), the proposed model (CNN-GS-skin), and manual segmentation (1 pixel \(\approx\) 3.25 \(\upmu\)m) for five sample images.

Conclusion

OCT is a three-dimensional, high-resolution imaging modality that has been used to assess a range of skin conditions, including psoriasis, contact or atopic dermatitis, lichen planus, acne lesions3,4, papules5, wound healing6, and skin cancer7. In most skin diseases, there is an alteration in the epidermis, which may result in a change in epidermal-dermal layer thickness9,38,39,40,41,42,43,44,45,46. Because of strong multiple scattering in skin, accurate annotation of the epidermal structure based on OCT images is a challenging task47. Moreover, manual segmentation is extremely time-consuming and suffers from variable interpretation48, repeatability, and interobserver agreement, making it unsuitable for clinical applications. Additionally, traditional OCT epidermal segmentation, which is based on detecting the local intensity minimum at the DEJ, depends strongly on image quality and skin pathology49. Therefore, automated segmentation using deep-learning methods has become increasingly popular in OCT imaging. Although many researchers have implemented deep convolutional neural networks and achieved great success in segmentation tasks5,47,50,51,52, the execution time of these methods is too long, which limits their practicality49,53,54.

We have developed a method that is capable of accurately segmenting layer boundaries from different body sites in near real time, although enhancements can be made to improve boundary delineation, such as increasing the number of OCT images in our dataset, expanding the number of manual segmentations to make the analysis more objective, optimizing the model to enhance classification ability, and reducing the complexity of the patch-based encoder network. The CNN-GS-skin algorithm has shown a convincing ability to delineate boundaries while achieving high computational efficiency. We could have selected a more complex CNN or used higher-efficiency hardware to improve performance, but we chose to reduce the cost of computation within specified limitations, requiring the implementation of additional techniques such as pixel skipping and multiprocessing. These add-ons significantly reduced the computational cost of our method. The reason we are interested in CPU implementation is that most OCT imaging systems and their reconstruction algorithms are already implemented on CPU rather than GPU; developing a methodology for CPU is therefore highly preferable. There are some pitfalls that need to be addressed. A major issue is that this methodology relies on an annotated dataset from manual segmentation, which has biases and variability. The annotated data have not been verified by histology, but have been verified to be accurate within 2 pixels (6.5 \(\upmu\)m); hence, the manual segmentation we took as the gold standard for analysis is also not perfect. In addition, our OCT image dataset may contain some imaging artifacts that cause segmentation errors. Despite the acknowledged shortcomings, this approach was validated on a variety of body regions, which is an important step toward making this methodology accessible and practical in the clinic and assisting with the early-stage diagnosis of skin diseases.

Nevertheless, our proposed system stands out because of its simplicity, efficiency, and versatility, which makes it a robust method for automatic layer segmentation.