Automatic Choroid Layer Segmentation from Optical Coherence Tomography Images Using Deep Learning

Masood, Saleha; Fang, Ruogu; Li, Ping; Li, Huating; Sheng, Bin; Mathavan, Akash; Wang, Xiangning; Yang, Po; Wu, Qiang; Qin, Jing; Jia, Weiping

doi:10.1038/s41598-019-39795-x

Download PDF

Article
Open access
Published: 28 February 2019

Automatic Choroid Layer Segmentation from Optical Coherence Tomography Images Using Deep Learning

Scientific Reports volume 9, Article number: 3058 (2019) Cite this article

7529 Accesses
67 Citations
1 Altmetric
Metrics details

Subjects

A Publisher Correction to this article was published on 13 December 2019

This article has been updated

Abstract

The choroid layer is a vascular layer in human retina and its main function is to provide oxygen and support to the retina. Various studies have shown that the thickness of the choroid layer is correlated with the diagnosis of several ophthalmic diseases. For example, diabetic macular edema (DME) is a leading cause of vision loss in patients with diabetes. Despite contemporary advances, automatic segmentation of the choroid layer remains a challenging task due to low contrast, inhomogeneous intensity, inconsistent texture and ambiguous boundaries between the choroid and sclera in Optical Coherence Tomography (OCT) images. The majority of currently implemented methods manually or semi-automatically segment out the region of interest. While many fully automatic methods exist in the context of choroid layer segmentation, more effective and accurate automatic methods are required in order to employ these methods in the clinical sector. This paper proposed and implemented an automatic method for choroid layer segmentation in OCT images using deep learning and a series of morphological operations. The aim of this research was to segment out Bruch’s Membrane (BM) and choroid layer to calculate the thickness map. BM was segmented using a series of morphological operations, whereas the choroid layer was segmented using a deep learning approach as more image statistics were required to segment accurately. Several evaluation metrics were used to test and compare the proposed method against other existing methodologies. Experimental results showed that the proposed method greatly reduced the error rate when compared with the other state-of-the-art methods.

Automatic choroidal segmentation in OCT images using supervised deep learning methods

Article Open access 16 September 2019

Jason Kugelman, David Alonso-Caneiro, … Michael J. Collins

Automated segmentation of macular edema for the diagnosis of ocular disease using deep learning method

Article Open access 28 June 2021

Zhenhua Wang, Yuanfu Zhong, … Biao Yan

Semantic Segmentation of the Choroid in Swept Source Optical Coherence Tomography Images for Volumetrics

Article Open access 23 January 2020

Shingo Tsuji, Tetsuju Sekiryu, … Satoshi Eifuku

Introduction

The choroid layer is vital for the oxygenation and metabolic activity of the Retinal Pigment Epithelium (RPE) and outer retina. It is a vascular interface between the retina and sclera and requires some of the highest amounts of blood flow for any tissue in the human body. The choroid layer also acts as a source of blood supply for the optic nerve¹. The structure of the layer is mainly divided into two parts based on anatomical structure: the vascular plexus, consisting of various capillaries contiguous to BM, and the choroidal stroma. Changes in the shape and anatomical structure of the choroid have been acknowledged in primary macular degeneration and in other advanced diseases. Quantitative and qualitative analysis of BM and the choroid layer can help in understanding the relationship between these various retinal diseases.

Optical coherence tomography (OCT) is an increasingly significant modality for the identification, monitoring, and measurement of many retinal and macular diseases as it aids in resolving cross-sectional details of the human retina. The growing need for OCT in retinal disease analysis necessitates investigation into a fully automated approach². Useful information can be extracted with the blend of OCT, image processing, and segmentation techniques. This can provide detailed information regarding different retinal layers and the associated diseases. DME, a chief source of vision damage in patients suffering from diabetes, can be diagnosed with the help of choroid thickness maps as changes in thickness, such as those resulting from choroidal macular degeneration and other advanced diseases, can provide a relative measure of health of the choroid layer. Aging may also cause choroidal thinning that leads to a condition referred to as choroidal atrophy^3,4. Increased choroidal thickness can result in diseases like serous retinopathy, polypoidal choroidal vasculopathy, autoimmune diseases, etc. Early diagnosis of these diseases can prove to be very beneficial in the treatment of these abnormalities. Thus, quantification of pathological changes can be effectively achieved by the analysis of retinal thickness and, for such diagnoses, choroidal thickness measurement is essential^5,6.

In OCT imaging, some approaches find the correlation among the measurable and morphological topographies of retinal thickness maps⁷. Such normal standards for the thickness maps can help physicians in comparing different patients’ choroidal thickness maps with normal sets. Consequently, the automatic segmentation of the choroidal boundary has garnered the attention of many researchers worldwide. A review of existing approaches in this domain shows that not much work has been carried out for automatic segmentation of the choroid layer, as most methods only manually or semi-automatically segment out the region of interest. Manually outlining the choroid boundary is a tedious, time consuming and sometimes impossible task because of possible indistinct structures and boundaries. Moreover, the measurement lacks objectivity, requires the trainer to be trained perfectly, and is vulnerable to inter-observer imprecision^8,9. Also, while some fully automatic segmentation methods do exist, there is still room for improvement. Major challenges in the accurate segmentation of the choroid layer are visualized in Fig. 1 and are detailed as follows:

Low contrast of OCT images makes the region between the sclera and choroid inseparable, resulting in a likewise inseparable histogram between the two layers
Methods based on thresholding and intensity are not effective because of the aforementioned low contrast in OCT images
Due to the presence of the vascular structure, the choroid layer has inconsistent texture and inhomogeneous intensity that makes extraction of the region of interest difficult
The anatomical structure of BM and the choroid layer interface is quite weak and is often invisible

In consideration of current challenges within the choroid segmentation domain, the major contributions of this work include:

A two-stage segmentation approach with emphasis on segmentation accuracy and consistency of the method
OCT image segmentation based on the combination of morphological and deep learning methodologies
Testing on the data of real patients, with experimental outcomes showing that the technique achieved high precision and reduced error rates to a significant extent

The remainder of the paper is structured as follows: Section 2 outlines literature review. Proposed methodology details are described in Section 3. Experimental results with a quantifiable examination are presented in Section 4. Finally, the conclusion and future directions are discussed in Section 5.

Related Work

Existing literature in the context of choroid layer segmentation includes manual, semi-automatic and few automatic segmentation methods in OCT images. As the prime objective of this research focuses on the use of deep learning methodologies, the literature review is divided into two subgroups: deep learning methods and non-deep learning methods.

Machine learning is a method of data analysis that employs statistical and analytical tools on training data. These tools are later used to learn from past examples and employ the learned information from the preceding training to categorize new data, envisage new inclinations and find new patterns. Image classification or segmentation is one of the fundamental methods in the domain of machine learning. Analysis of literature shows the use of several, non-deep learning, traditional methods such as graph-based, k-nearest neighbor, Bayesian network, support vector machine and decision tree approaches, all of which involve the use of hand-crafted features for the specific classification purpose. The hand-crafted features include shape, pixel density and texture of the image features.

In the context of these non-deep learning methods, a previous automatic choroid layer segmentation method was used to extract choroidal vessels for quantification of choroidal vasculature thickness¹⁰. This approach focused on the thickness of vessels rather than the choroid layer. Another automatic segmentation technique¹¹ applied a statistical model on choroid layers in OCT images. However, the processing time of the method is quite high and extensive training is required. Use of phase information for automatic segmentation of the interface between the choroid and sclera was proposed and implemented^12,13. While successful, these methods are not clinically practical as the used imaging modalities are not commercially obtainable. The use of dynamic programming (DP) in one study was used to find the shortest path of a graph for choroid layer segmentation, giving a segmentation accuracy of about 90 percent⁸. Additionally, a two-stage active contour model for choroidal boundary extraction with a segmentation accuracy of 92.7 percent was proposed¹⁴. Graph based approaches have also been used in this context but, due to the heterogeneous nature of OCT images, these methods are not generally helpful in choroid layer segmentation^15,16,17. Still, one study utilized a graph search algorithm from 3D OCT volumes to perform choroid layer segmentation¹⁸. This method was a semi-automatic approach. The interface between the choroid and sclera using Dijkstra’s algorithm was investigated⁹.

OCT B-scan choroidal segmentation based on a dual probability gradient has also been attempted¹⁹. Another study presented an automatic segmentation method based on a multi-resolution textural graph cut method¹⁶. The combination of Markov Random Field (MRF) and level set approach was also used to segment the choroid layer²⁰. Distance regularization and edge constraint terms were rooted into the level set technique to evade uneven and trivial areas and preserve information around the boundary between the choroid and sclera. MRF based methods^21,22,23 have been proposed and implemented to detect the intra-retinal layers from 2D or 3D OCT images. Another method focused on obtaining the spatial distribution of choroidal sub-layers with 3D 1060-nm OCT mapping using Haller’s and Sattler’s layer²⁴. A hybrid approach has made the use of level set, multi-region continuous max-flow method to segment out different retinal layers. The approach also used nonlinear anisotropic diffusion in order to eliminate the spackle noise present in the OCT images²⁵. Seven layers of the retina were also segmented in this context; the proposed approach involved a combination of graph cut and dynamic programming²⁶. Two sequential diffusion map based segmentations of intra-retinal layers from 3D SD-OCT scans have been developed²⁷. A similar approach for the segmentation of multiple retinal layers utilized spectral rounding for segmentation²⁸.

Review of Recent Deep Learning Techniques in Computer Vision

The conducted literature review showed that most of the existing choroid layer segmentation approaches made use of non-deep learning methodologies. An overview of existing deep learning methods revealed that, in this context, these methods are not used specifically for the choroid layer segmentation. They are generally applied in medical image segmentation, such as on the brain, retina, liver, knee, urinary bladder, chest, heart, etc. Deep learning is a branch of Artificial Intelligence (AI) with the ability to make use of optimization, probabilistic and statistical tools; these methods maintain an extensive contribution to the analysis of medical images.

With regard to biomedical image segmentation, one major study contributed an approach for the segmentation of biomedical image segmentation using Convolutional Neural networks (CNNs)²⁹. The model employed a network and training strategy that relied on the robust use of data augmentation. Low grade glioma assessment through a modified CNN architecture with 6 convolutional layers and a fully connected layer was used to classify these brain tumors³⁰. A similar approach was used in the diagnosis of white matter hyperintensities from brain MRI images through a series of CNN architectures that considered multi-scale patches to yield the obvious position of required features while training³¹. Another group proposed an automatic computer-aided diagnosis method for the classification of solid and non-solid nodules in pulmonary computerized tomography (CT) images through a CNN³². Brain image segmentation to produce high nonlinear mappings between inputs and outputs using deep learning was analyzed; the segmentation problem was solved using CNNs^33,34. The approach made use of local features in conjunction with more global contextual features to perform the segmentation. Other studies have investigated the use of automatic segmentation and assessment of rectal cancers from the multi-parametric MR images through the use of CNN architectures³⁵. The combination of CNN and total kidney volume calculation from Computed Tomography for the segmentation has been proposed in³⁶. The method was tested on the images of real patients demonstrating trivial to adequate function to severe renal inadequacy. The detection of bladder cancer has also incorporated use of CNN architectures in a combination with level set methods to acquire the region of interest³⁷.

In the domain of retinal image segmentation using deep learning approaches, segmentation of the optic disc, fovea and retinal vasculature has been carried out using a CNN model. The approach segmented three channels of input from the point’s neighborhood and propagated the response across the 7-layer network. The output layer was comprised of four neurons, denoting background, blood vessels, optic disc, and fovea³⁸. Another study showed segmentation of retinal blood vessels using a deep neural network with zero phase whitening, global contrast normalization, and gamma corrections³⁹. A similar approach for blood vessel segmentation using deep learning has been performed⁴⁰. Segmentation of retinal blood vessels has also been analyzed as a multi-label implication task through the use of implied benefits of the blend of convolutional neural networks and the structured estimate⁴¹. The main observation in the overview of retinal based segmentation using deep learning is that the imaging modality being used in these approaches is the fundus image. The use of OCT images for the segmentation of retinal layers has not observed in existing methods of deep learning. As OCT imaging technology allows capturing of the cross-section of the retina, it may be very helpful to analyze OCT images for the diagnosis of several retinal diseases. Because choroid layer segmentation and associated thickness measurements help diagnose retinal based diseases, several approaches have been proposed and implemented to diagnose them. For instance, retinitis pigmentosa⁴², central serous chorioretinopathy⁴³, age-related macular degeneration⁴⁴, and diabetic retinopathy^45,46 have been found to have associated changes in the thickness of the choroid.

One study showed the development of an automatic method for the segmentation of retinal layers based on deep learning methodologies, highlighting one of only few current implementations of an automatic technique. The method made use of CNN to perform the retinal layer segmentation in OCT images⁴⁷. The procedure was limited in accuracy of edge detection as it depended on at most one-pixel accuracy; however, the Bidirectional Long Short-term Memory (BLSTM) entails sub-pixel accuracy. This can lead to confusion in the accurate segmentation of the corresponding boundaries. A similar method⁴⁸ performed OCT image semantic segmentation through fully convolutional neural networks, but the results were tested on a data-set of normal individuals with fewer images of mild spectrum diabetic retinopathy. A combination of CNNs and graph search methods has also been employed in the segmentation of 9 retinal boundaries⁴⁹. The method was computationally expensive and the black-boxed architecture of the CNN made customization and performance examination of every step less controllable. Segmentation of retinal layers with emphasis on fluid masses has also been performed using deep learning methods⁵⁰. While results were promising, evaluation was conducted on a limited number of B-scans. The tested and training data sets contained a total of 110 B-scans.

According to analysis of the related works, it can be observed that higher segmentation accuracy was achieved through deep learning methodologies. The key restraint of non-deep leaning approaches is that these methods mainly rely on the feature extraction phase for the accurate segmentation of the region of interest. It is tough to extract appropriate image features for a definite medical image recognition problem. As a result, the classifier cannot provide effective segmentation accuracy because the segmented features are not effective enough. In order to tackle problems faced by non-deep learning approaches, significant segmentation accuracy is achieved by the use of deep learning methods through adaptive learning of image features. Considering this, it is likely that the application of deep learning methods on the categorization of OCT images for automated disease diagnosis would be more successful than counterpart non-deep learning procedures. Thus, the focus of this research is to overcome existing challenges in choroid layer segmentation and provide a fully realized automatic segmentation approach. The proposed method makes use of deep learning to achieve the desired task, with a combination of morphological operations and CNNs. In order to calculate the thickness map of choroid layer, segmentation was considered for two layers. The desired layers included BM and the choroid layer. BM was segmented out using a series of morphological operations followed by the use of CNNs for choroid segmentation. Then, a thickness map was generated based on the extracted layers.

Material and Methods

Pipeline Overview

The proposed methodology is a two-stage segmentation process. Quantification of choroidal thickness relies on the extraction of two layers from OCT images. The segmentation of BM was carried out followed by the extraction of choroid layer from the OCT images. Figure 2 represents a standard OCT image with the required layers segmented in different colors. The regions of interest are labeled by the experts manually; BM is labeled in green whereas the choroid layer is labeled in red.

The proposed method takes a set of OCT images focused on the retina around the macula as input and depicts the position of two required boundaries. As the data of every individual contained 25 B-scans of the macula, the OCT images were comprised of a series of B-scans that the proposed method segmented individually. The result of segmentation of all B-scans was examined, combined and, lastly, incorporated to shape graphical depictions as 2D maps. Initially, we regulated tomography volume data by representing it as a sorted series of adjacent B-scans. Later the images were labeled by image statistics from adjacent B-scans and the region of interest was segmented in every B-scan individually. Then, the 2D retinal and choroidal thickness maps were generated. Figure 3 depicts the procedure carried out to generate the thickness map.

The proposed methodology is comprised of 3 main modules including BM segmentation, choroid layer segmentation, and thickness map generation. Figure 4 depicts the overall approach of the proposed method. For the segmentation of the regions of interest, OCT images were taken as inputs to the system. The BM was segmented using a series of morphological operations whereas the segmentation of choroid layer, which required more image statistics, was extracted using a CNN. Later, thickness maps were generated using the extracted layers.

Bruch’s Membrane Segmentation

A series of steps were performed to carry out BM segmentation. If we analyze the OCT image, we can observe that BM segmentation is relatively easier than choroid layer segmentation. The reason is that the choroid layer has inhomogenous intensity and inconsistent texture, so it requires more image statistics to accurately segment. We first segmented the BM followed by segmentation of choroid layer. In this case, we can consider the segmented BM as a constraint while carrying out choroid layer segmentation. The segmentation of BM was performed using a sequence of morphological operations. Morphological operations mainly represent a collection of non-linear operations associated with the morphology or shape of the features in the OCT image. Morphological image processing follows the goal of eliminating all the defects and maintaining structure of the image. These operations are confident on the associated ordering of pixel values, rather than their numerical values, so they are utilized more on binary images. BM boundary represents a clear white boundary in the OCT image as compared to the other layers/structures in the image. Additionally, as it is evident from the OCT images, the intensities of the BM boundary are homogeneously distributed. The histogram peaks corresponding to BM are distinct from each other as well as from the background. The distinctive feature of the proposed method is that it ensures the homogeneous intensity distribution for the BM boundary. This helped to improve the visibility of delicate mass lesions and helped to reduce unwanted background information. A summary of the performed steps are shown in Fig. 5. The steps involved in this process are detailed as follows:

Thresholding

The first step was to convert the image into binary and extract the region of interest. The reason behind this step was to extract the pixels containing the region belonging to the BM boundary within the image. In order to obtain an optimal threshold value, OCT images were tested using a range of threshold values. After the analysis, the optimal threshold value was set to 190. This step helped determine the foreground and background pixels of the image. The output of the step was binarized data containing a range of intensity values. The threshold was defined as:

$$(\begin{array}{cc}{p}_{(ij)}\in {S}_{1} & {\rm{if}}\,0\le {P}_{(i,j)} < {t}_{b}\\ {p}_{(ij)}\in {S}_{2} & {\rm{if}}\,{t}_{b}\le {P}_{(i,j)} < L-1\end{array}$$

(1)

where p_i,j, i = 1, 2, …, r, j = 1, 2, …, c represents the pixel intensity of the image (with size r × c and n = L gray levels from zero to L-1), t_b is the set threshold, S₁ and S₂ are sets in which pixels with intensities between thresholds k and k-1 are located. Image was divided into two sets S₁ and S₂ (background and foreground pixels respectively) using a threshold at the level t_b. Here S₁ = [0, …, t_b−1] and S₂ = [t_b, …, L − 1].

Reconstruction

After acquiring the binary image, a reconstruction approach based on the morphological opening was applied to remove the noise and preserve the shape of BM layer that was not removed by the morphological erosion operator in the OCT image. The block analysis method can be assumed to be similar to this approach based on the fact the image is manipulated as a whole rather than dividing it into blocks. Firstly, a structuring element B of size 3 × 3 was formulated by considering minimum I_min(x) and maximum intensity max I_max(x). Later, the background benchmarks R_i were defined based on the obtained values. The background criterion was formulated as:

$$R(x)=\frac{{I}_{min}(x)+{I}_{max}(x)}{2}$$

(2)

where I_min(x) and I_max(x) represents morphological erosion and dilation respectively. Therefore,

$$R(x)=\frac{{\epsilon }_{\mu }(f)(x)+{\delta }_{\mu }(f)(x)}{2}$$

(3)

where ϵ represents the erosion operator followed by δ operation and μ represents the structuring element. The employed reconstruction approach yielded better results when compared to the traditional block cutting method as the used method provided better local analysis of the OCT image. Eight neighboring pixels at every point within the OCT images were considered by the structuring element μ.

Thinning

After image reconstruction, the number of rows spanned by the objects were taken into account. In order to select the object-spanned rows, a range was defined, given that the rows contained most of the points. Any point outside the selected range was discarded. Then, foreground pixels not part of the BM boundary were eliminated. The morphological thinning operation was applied on the extracted rows containing the BM area in the OCT image. The purpose of this step was to iteratively remove pixels exclusive to the shape and to shrink it without shortening or breaking BM boundary apart. To achieve the desired goal, we observed whether an edge pixel P₁ should be removed by considering its 8 neighbors in the 3 × 3 neighborhood. To delete or mark the pixels, the approach of connectivity numbers was employed. The approach helped us to find the number of objects being connected to a specific pixel in the OCT image. The connectivity number was calculated using the following equation:

$${C}_{n}=\sum _{k\in s}{N}_{k}-({N}_{k}\,\ast \,{N}_{k+1}\,\ast \,{N}_{k+2})$$

(4)

where N_o represents the central pixel and the color of eight neighbors is represented by N_k. The pixel to the right of the central pixel is represented by N₁. All remaining neighboring pixels were numbered in counter-clockwise order around the central pixel.

Curve Fitting

The next step was to mark the boundary of segmented BM. This step was performed using spline curve fitting. Curve fitting was applied to find a function f(x) based on the data (x_i, y_i) where i = 1, 2, …, n. The residual was minimized by the function (x). The residual is the distance between the data samples and f(x). A better curve fitting can be obtained through a smaller residual. Spline fitting being applied for the curve fitting was calculated as:

$$f(x)={a}_{i}{(x-{x}_{i)})}^{3}+{b}_{i}{(x-{x}_{i})}^{2}+{c}_{i}(x-{x}_{i})+{d}_{i}$$

(5)

where a_i, b_i, c_i an d_i represents set of polynomial coefficients and x_i = 1, 2, …, n represents the data points to be mapped in the curve. The output of this step is the final segmented BM boundary. Figure 5 illustrates the output of each step being performed for the BM boundary segmentation; it has 7 subparts, each representing a result of the different morphological operations being applied on the input OCT image.

Choroid Layer Segmentation Using Deep Learning

As discussed above, BM segmentation is easier when compared to the segmentation of the choroid layer. Based on the the aforementioned challenges and provisions, a deep learning approach was taken to carry out the desired segmentation.

Figure 6 illustrates the steps that constituted the segmentation process. These steps include pre-processing, data sampling, data conversion, CNN training and final choroid layer segmentation.

Pre-Processing

In the pre-processing phase, the segmented BM was kept as a point. All points in the area above the reference point were set to black. This was to reduce the area to be processed for the segmentation of the choroid layer. The output of BM segmentation (the segmented layer) was fed as an input to this stage. Analysis of the OCT image showed that the interface between the sclera and choroid layer is inseparable in the majority of cases and, as a result, accurate segmentation of the choroid layer is a challenging task. Based on this analysis, the purpose of the pre-processing stage was to obtain the definite region of the OCT image containing the choroid layer. Consequently, the pre-processing step simply eliminated all of the area above the segmented BM.

Data Sampling

The data provided by the doctors contained manually segmented BM and choroid layers. The segmentation of the required boundaries on OCT images was performed by the experts manually. The data sampling step made use of the provided data to sample the patches to be used for training of the CNN. The manually segmented images were sampled patch-wise from top to bottom and left to right. The purpose of sampling was to classify pixels as on-line or off-line patch. The patch size used in our research was 32 × 32. This patch size was used because:

Very small patch sizes make it difficult to extract useful information from the region of interest
Very large patch sizes may contain surplus information and increase the complexity of processing

As patched images were manually segmented by the ophthalmologist, the choroid layer was marked by a red curve. The classification of the patches was performed in order to pick the patches containing the choroid layer. The illustration of the patch labeling process is described in Fig. 7.

The stride between each patch was taken as 5 pixels. A threshold was then set to discard patches containing too many black pixels. If more than 1/3 of the patch contained black pixels, it was discarded. The data set contained images of 21 patients. Data of 11 individuals was used to train the model whereas data of 10 individuals was used to test the model. In order to balance labels 0 and 1, the patches of label 0 were randomly chosen to have the same number as label 1. There were almost 3000 patches for one figure, so the total number of patches being sampled from the data was about 1,575,000.

Data Conversion

Next, we converted the data to binary format. The binary format for CIFAR-10 is shown in Equation 6. Here, the first image label (i.e. 0 or 1) was represented by the first byte. Image pixel values were denoted by the next 3072 bytes. Red channel values were indicated through the first 1024 bytes, the following 1024 bytes represented green channel values and the last 1024 bytes corresponded to blue channel values. Row major order was used to store these values, where an initial 32 bytes represented the red channel in the initial row of the image.

$$\begin{array}{ccc}\mathrm{ < 1}xlabel\mathrm{ > < 3072}xpixel > & \ldots \ldots \ldots & \mathrm{ < 1}xlabel\mathrm{ > < 3072}xpixel > \end{array}$$

(6)

The next step was to convert the binary file to leveldb format because caffe only supports leveldb or lmdb⁵¹. We used the program provided by Caffe to do this transformation. After the transformations, the CNN model was trained using the extracted information. After training the model it was used for the segmentation of the choroid layer.

CNN Training

The CNN layering structure used in this work was the Cifar-10 model⁵². We used a pre-trained CNN Cifar-10 architecture to generate features and trained the network using sampled data. The CNN architecture was composed of a layering structure. The layers of the CNN included layers of convolution, pooling, and linear unit nonlinearities. There was also a linear classifier for contrast normalization on top of all these layers. The modification was carried out in the last layer of the CIFAR-10 model. The architecture of the CNN used in our approach is shown in Fig. 8.

Figure 8 shows the overall structure of the CNN model being used in this work. Because contrast of OCT images was low and the texture between the choroid layer and sclera was inhomogeneous, we preferred to carry out the segmentation through slices being sampled from the OCT images. Thus, our model processed each 2D OCT image sequentially in slices. The proposed approach predicted the class of every patch based on the associated label of the patch. The CNN processed the M × N patch centered on that pixel. Therefore the input to the CNN model was an M × N 2D patch. The CNN architecture’s main structuring element was the convolutional layer. In this case, numerous layers can be piled on top of each other to make a pyramid of image features. Each layer in the pyramid harvests features from the prior layer. A stack of input planes was fed to the convolutional layer of our model as its input. The convolutional layer processed the procedures that were being fed and produced feature maps as output. The feature maps were produced by applying spatially local non-linear feature extractors to all spatial neighbors of the input planes. As a result, we obtained an organized topological map of the responses. For the first convolutional layer, different OCT 2D M × N patches were represented through the individual input planes. In successive layers, the input planes were characteristically comprised of the feature maps of the preceding layer. The implemented model contained 5 layers starting with three convolutional layers followed by two fully connected layers. The final layer of the network was a softmax layer used for the final classification. The convolutional layers were followed by max pooling and Rectified Linear Units (ReLU). Kernels in the convolutional layer were of size 5 × 5 with a step size of 1. The kernels of max pooling layer were of size 3 × 3 with a step size of 2. The convolutional layer mainly performed the feature extraction phase - it helped translate the input to its equivalent characteristics.

The pyramid of complex features in the CNN model was considered by giving a convolutional layer’s output feature maps as input channels to the successive layer of convolution. In precise denotation, if focusing on a feature map, it represents a layer of neurons. Each neuron in the layer corresponds to a coordinate within the feature map. The size of the neuron’s separate field correlates to the size of the kernel. The connection among the neurons of the same layer and the previous layer is represented by the value (weight) of the kernel. In the learned kernel’s weights, each kernel is adjusted to a diverse orientation, spatial frequency and scale suitable for the training data. Finally, in order to obtain segmentation labels, we connected the convolutional hidden layer to a fully connected layer followed by a fully connected and softmax layer. As the layers were fully connected, there were 64 output channels because input channels were 32. Each kernel in the fully connected layer acted as the ultimate detector of the choroid from one of the segmentation labels. The purpose of this layer was to map activation volume from the blend of preceding different layers into a class probability distribution. The CNN was trained based on the extracted patches from the OCT images. Test image patches were used to get the overall test accuracy of the proposed model and develop an overall understanding of the image classification as a segmentation procedure for the choroidal boundary.

As discussed in the data sampling and data conversion sections of the methodology, the input OCT image was analyzed with a patch window being traversed on the OCT image. The window size was set to 32 × 32 and the patches were analyzed and given labels according to the contents contained within that window patch. The whole image was traversed and the image was cut into 32 × 32 image slices. As a result, we obtained many patches from a single OCT image. The OCT image was classified into two classes: part of the choroid layer or not part of the choroid layer. Therefore, all patches were classified into the two classes. The total sampled patches amounted to about 1,575,000 patches, extracted from 525 OCT images of 21 individuals. Data of 11 individuals was used to train the network, meaning 825,000 patches (275 images) were used to train the network. Data of 10 individuals was used to test the network, meaning 750,000 patches (250 images) were used.

After sampling patches from all the images, the sampled patches were used to train the model. Figure 9 shows the testing and training phase of the network. The distribution of the segmentation labels was obtained by analyzing the output of the convolutional network. The objective of the training condition was to lessen the negative log-probability from every OCT image by exploiting the likelihood of all labels in the training set being used. This was formulated as:

$$-Lo{g}_{max}\frac{Y}{X}=su{m}_{ij}-Lo{g}_{max}\frac{{Y}_{ij}}{X}$$

(7)

The concept of weight decay was applied for regularization of the learned variables. The sum of weight decay and cross-entropy was calculated to formulate the objective function of the network.

Based on the proposed methodology, the database that was used contained data as D = X, Y = xⁱ, yⁱ, i ∈ 1, …, N, where xⁱ represents the input patch and yⁱ is its corresponding label. As the patch can be one of the two categories, either part of the choroid layer or not, the considered problem was multi-categorical. In this case, every label yⁱ belonged to one of C categories, yⁱ ∈ 1, …, C. Likelihood over the training data D was maximized to find the parameters of the model with respect to the parameters θ:

$$\begin{array}{rcl}{\theta }_{\ast } & = & \mathop{{\rm{argmax}}}\limits_{\theta }=p(Y|X,\theta )\\ & = & \mathop{{\rm{argmax}}}\limits_{\theta }=p(Y|X,\theta )({y}^{1},{y}^{2},\ldots ,{y}^{N}|{x}^{1},{x}^{2},\ldots ,{x}^{N},\theta )\\ & = & \mathop{{\rm{argmax}}}\limits_{\theta }\mathop{\prod }\limits_{i=1}^{N}p({y}^{i}|{x}^{i},\theta )\end{array}$$

(8)

where p(θ) is the objective function being updated by the standard gradient descent, and D = X, Y = xⁱ, yⁱ, i ∈ 1, …, N, where xⁱ is an input image and yⁱ is its corresponding label. This meant that an anonymous independent and identical distribution (i.i.d.) was used to sample the training set. The basic purpose was to minimize the negative log-likelihood of the training data. This was formulated as:

$${\theta }_{\ast }=\mathop{{\rm{argmin}}}\limits_{\theta }-\sum _{i}\sum _{c}||{c}^{yi}[{f}_{c}({x}^{i},\theta )-log\sum _{j}{e}^{fj({x}^{i},\theta )}]$$

(9)

where f represents the monotonic function defined at the label, yⁱ belongs to one of categories c and e denotes the energy function over the monotonic function f and input data x_i. The cross entropy of the real targets can be minimized based on these assumptions. Target distribution over the data set was carried out through one-hot encoding. Considering these assumptions the problem of minimization was formulated as:

$${\theta }_{\ast }=\mathop{{\rm{argmin}}}\limits_{\theta }-\sum _{j}[f{c}_{i}^{\theta }({x}^{i},\theta )-log\sum _{j}{e}^{fj({x}^{i},\theta )}]$$

(10)

Here one-hot encoding of the training data is represented by ${c}_{i}^{\theta }$. An iterative approach was used to find the minimum, i.e. the loss function was minimized in order to estimate the parameters θ of the network. The criterion was defined as:

$$L(\theta ,X,Y)=-\sum _{i}[f{c}_{i}^{\theta }({x}^{i},\theta )-log\sum _{j}{e}^{fj({x}^{i},\theta )}]$$

(11)

where L represents the loss function computed over objective function θ, the input patch is X_i and its associated label is Y_i. Stochastic approximation of the gradient was applied for optimization of the network. In this case, a random training sample xⁱ, yⁱ was used to approximate the gradient. The parameters of the network were updated as:

$${\theta }_{t+1}\leftarrow {\theta }_{t}-\eta {\nabla }_{\theta }L(\theta ,X,Y)$$

(12)

where η represents the learning rate. Each different learning rate for every parameter θ_t is represented as θ_{t + 1} and ▽ is the gradient of the cost function. Mini-batch gradient descent was then used to optimize the loss function. The optimization was achieved through the midterm among the batch method and stochastic. The approach made use of a subset of n < |D| training data to update the parameter of the network. This was formulated as:

$${\theta }_{t+1}\leftarrow {\theta }_{t}-\eta {\nabla }_{\theta }L(\theta ,{X}^{i,\ldots ,i+n}),{Y}^{i,\ldots ,i+n})$$

(13)

After training the model, it was used for the segmentation of the choroid layer. In order to segment out the choroid layer, a counter matrix was proposed. The matrix was created with the same size as the image and initialized as a zero matrix. After each prediction of a patch in terms of its label, if the label was 1, all the pixels in the counter matrix corresponding to the patch were increased by 1. After finishing the prediction, the counter with the highest number in each column was chosen to fit in the cubic curve for the choroid layer segmentation.

Figure 10 shows the concept of the counter matrix, where every patch was analyzed based on its label and, if the label was 1, all pixels corresponding to that patch in the counter matrix were set to the value 1. Later, in order to mark the boundary of the choroid layer, the counter having highest number in every column was selected to be fit in the curve for the marking of the boundary. Figure 11 shows the comparison between the OCT image labeled by specialists and OCT image segmented by the proposed method.

Thickness Map

As the focus of this research was to measure the choroidal thickness for the analysis of the health of the retina, thickness maps were generated following segmentation of BM and choroid layers. Thickness maps corresponding to each individual was generated based on the segmentation being performed. As each individual had 25 OCT scans, representing a different depth of choroid, all 25 images were considered to generate the thickness map. The thickness map can be defined as the Euclidean distance between the two surfaces: BM and choroid layers. In order to measure the required distance, the boundary of BM was taken as a reference boundary for the entire choroid region including the choriocapillaris. The distance between the BM and the upper surface of the choroid layer represents the choriocapillaris distance. Finally, in order to calculate the choroidal thickness, the distance between the BM and the lower surface of the choroidal vasculature was calculated. Choroidal vasculature and BM-equivalent thickness maps were created for all subjects. The error rate between the thickness map generated by doctors and map generated by the proposed method was also calculated. Figure 12 illustrates how the thickness of choroid layer was calculated, with specific steps described below:

For each figure, the width was 760 pixels and the height was 456 pixels
Next, a scaling operation was applied. We imposed 200 um maps, with 25 pixels in width and 76 pixels in height, so we could get the true thickness of each point
There were 25 figures for each patient and the interval between two figures was 240 um
According to steps 1–3, we could map a 25 × 760 matrix (refers to the pixel value on the figure) to 5760 um × 6080 um (refers to the real value of patient). The thickness value would then be mapped in different colors in the generated map.

Experimental Analysis

Dataset

The data set of the ocular images used in this work was collected from Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, China by using the volume scan mode with swept source OCT. The task of collecting data was undertaken in agreement with the organization’s research ethics conventions. The study was performed in accordance with the Declaration of Helsinki(DoH) and approved by the Ethics Committee of Shanghai Sixth People’s Hospital (reference: 2014-11-01). Written consent has been obtained from all subjects, and informed consent for study participation has been obtained. The data set contained 525 OCT scans from 21 healthy subjects. The images were centered at the macula region. All 21 subjects included 25 OCT scans showing different depths of the macula region. 275 images were used to train the network whereas 250 images were used to test the methodology. The OCT scans of each subject were uniformly selected and manually labeled along the outermost edges of the choroidal and BM boundaries. 25 OCT scans of every individual were examined using Heidelberg Eye Explorer Software (Heidelberg Engineering, Heidelberg, Germany).

The examined data provided (i) the automated retinal boundary for the choroidal boundary, (ii) the dots on the internal limiting membrane to the RPE boundary, and (iii) the marks on the RPE boundary to the choroidal scleral junction. Choroidal Volume (CV) was also calculated at the 6-mm circle. Out of 21 individuals, data of 11 people had normal OCT B-Scans, 4 OCT B-Scans were from individuals affected by short sightedness, 3 individual’s data were affected by glaucoma and another 3 patients were suffering from mild DME. For each image, an expert ophthalmologist from Shanghai Hospital 6 manually annotated 2 boundaries (BM and the choroid layer). To overcome and reduce the manual segmentation error, two expert ophthalmologists provided the segmentation measurements of the required boundaries. Results between the two experts did not differ significantly (p = 0.359). The average of the two results was used in the conducted study. The manual segmentation by the experts was used as ground truth. Variation between slices (different depth of the choroid) was relatively small around the macula area; this interpolated ground truth was confirmed as appropriate by the experts. Table 1 can be analyzed to analyze the data division for the testing and training of the method.

Table 1 Data Distribution for testing and training, Total Patients: 21, data of 11 people was used for training and data of 10 individuals was used as testing.

Full size table

Ground Truth Labeling

Images being used in this research were from real patients. Ground truth was specified by the ophthalmologist of Shanghai Jiao Tong University Affiliated Sixth People’s Hospital. The specialist manually segmented BM and choroid layer on the OCT images. Thus, these manually segmented images were used as ground truth in the proposed method.

Experimental settings

Our algorithm was implemented in MATLAB R2016b and run on a server with a GPU of 12 with 2 GB memory, with Intel(R) Xeon(R) CPU E5-2630 v4 @ 2.20 GHz (10 cores) processor, 64 GB of RAM space and an operating system of 64 bits. Cross entropy loss function was used following a pixel-wise soft-max over the network’s final output. The learning rate of the CNN was selected to be 0.001 for the first 20 epochs and 0.0001 for the last 20 epochs (total of 40 epochs). The system was trained with a weight decay of 0.0005 and a momentum of 0.9. In our trials, exploring further epochs did not significantly reduce the training error, but instead improved computational time. Because the system was convergent after 40 epochs, the model was not further trained. The Stochastic approximation of the gradient for the optimization was performed in mini batches of a size of 50 slices spliced from the training B-Scans with augmentation. The segmentation time for each OCT volume (25 slices) was 5 s−0.2 seconds per OCT B-scan.

Evaluation Metrics

In order to test the results, some error calculation matrices were used to compute the error rate on the test data set. The metric was proposed in order to calculate the average error rate in terms of pixel values i.e. the average difference between the doctor’s segmented image and the result of our proposed method. The error rate was calculated as:

$$er{r}_{1}=\frac{\Vert \overrightarrow{A}\overrightarrow{B}\Vert }{h\ast w}$$

(14)

where $\Vert \overrightarrow{A}\Vert ={A}_{1}^{2}+{A}_{2}^{2}+\cdots +{A}_{2}^{n}$, $\overrightarrow{A}$ is the computed thickness vector, and $\overrightarrow{B}$ is the thickness vector provided by the ophthalmologists. h and w represent the height and width of the image, respectively. Another metric was proposed to calculate the average error between the thickness map generated by doctor and the thickness map generated by the proposed methodology. The metric can be defined as:

$$\begin{array}{cc}er{r}_{2}=\frac{|\bar{A}-\bar{B}|}{h} & where\,\bar{A}=\frac{{A}_{1}+{A}_{2}+\cdots +{A}_{w}}{w}\end{array}$$

(15)

This metric was used to calculate the error rate of the proposed algorithm. Table 2 represents the average mean, variance and standard deviation calculated by the proposed methodology on the test data set. The term err1 represents the error rate between the doctor’s segmented image and segmentation performed by the proposed algorithm and the term err2 represents the error rate between the thickness map generated by the doctors and thickness map generated by the proposed method. Figure 13 shows the error rate computed on the test data set. Computed results are presented in Table 2.

Table 2 Mean, Variance and Standard deviation of the proposed method.

Full size table

After calculating the error rate, dice coefficient was also created to measure the similarity between the segmentation result of the proposed method and the ground truth. The dice coefficient was computed on the test data set. The coefficient was calculated as:

$$D=\frac{\mathrm{2|}A\cap B|}{|A|+|B|}$$

(16)

where A and B are the segmented choroidal region and the manually labeled choroidal region, respectively. Figure 14 shows the similarity measures observed between the manual segmentation on the test data set and segmentation performed by the proposed method. It was found that the average of the dice coefficients over 250 tested images was 97.35% with a standard deviation of 2.3%, showing good consistency between manual labeling and segmentation results of our algorithm.

Results of the proposed algorithm shows consistency with the ground truth. Observation of the manually segmented images illustrates the fact that the input points are limited and thus represents a smoother boundary whereas the proposed algorithm monitors the valley pixels more narrowly. The error rates were observed to be higher when the choroidal region was thinner. Thus, the similarity measurement through dice’s coefficient resulted in smaller similarity for the thinner choroid. It is notable that the manually segmented choroidal-scleral boundary is paralleled to automated segmentation. As the choroidal-scleral contains many small curvature deviations, specialists are required to mark surplus points as well as the corresponding boundaries. These curvature deviations are not clearly visible unless the images are zoomed closely while performing the manual labeling. A more precise approach is adopted by the automatic segmentation through considering the small bumps and gaps present in the choroidal-scleral boundary. Thus the inconsistency between the automatic and manual segmentation is minimal.

The visual results of thickness map generated by our method and the doctors’ segmented images can be seen in Fig. 15. The results were then compared with other existing methods for choroid segmentation.

It is important to note that choroidal thickness measurements are highly variable in practice owing to the lack of a standardized definition of the choroidal-scleral junction and the often obscure imaging appearance in this region⁸. In our experiments, the outermost edges of the choroidal vessels are labeled as the posterior choroidal boundary and sample results of the choroid layer and BM are shown in Fig. 16.

The proposed method has been compared with other state-of-the-art methods as well. Results were computed on the test data set. The outcome of the comparison of the proposed method with a few of those state-of-the-art methods is shown in Table 3.

Table 3 Mean Error Comparison of choroid layer segmentation and thickness map with other state-of-the-art methods.

Full size table

For further validation purposes, for each boundary segmentation, the mean signed and unsigned boundary positioning errors were compared with other algorithms such as graph cut, k-means and methods of^11,19. Results are presented in Table 4 for each boundary. We implemented each of these algorithms and tested them on the test data set. In the k-means algorithm, we selected k = 3 and applied k-means algorithm on the image directly. In graph cut method, we used the result of k-means algorithm to create an initial prototype for each class and the distance between each image pixel to initial class was also calculated by k-means algorithm. The evaluation was performed on the same database and with the manually labeled ground truth. The experimental results show that the proposed method performed better than other states of art methods^53,54,55,56.

Table 4 Signed and Unsigned Mean Error Rate Comparison (mean ± std).

Full size table

According to Table 4, observed signed border positioning errors were 0.43 ± 1.01 pixels for BM extraction and 2.80 ± 1.50 pixels for choroid segmentation, and the unsigned border positioning errors were 1.39 ± 0.25 pixels for BM extraction and 2.89 ± 1.05 pixels for choroid segmentation. Figures 17 and 18 show the mean signed and unsigned error observed on the tested data set, respectively, the signed and unsigned error were calculated with respect to the ground truth. The magnitude of signed error is much smaller than for unsigned error; hence, using signed error would misrepresent the level of deviation between the output and ground truth. Therefore, the unsigned error is reported in studies as a comprehensive measure of localization errors. The errors between the proposed algorithm and the reference standard were similar to those computed between the ground truth. The border positioning errors of the proposed method showed significant improvement over other algorithms, compared in both of the tables above.

Dice coefficient similarity measures and signed and unsigned border positioning errors of the proposed technique exhibited substantial improvement over other state-of-the-art methods. Results confirmed significant improvement through the proposed method. For instance, k-means and graph cut algorithm error rates are comparatively high when compared to the proposed methodology. Our method is also highly robust, overcoming many segmentation challenges posed by tissue anomalies such as drusen and RPE discontinuities to a large extent, and by low-quality images. This is reflected in the high number of correctly segmented images and the error rate that was observed. Validation against manual tracing demonstrated that the segmentation accuracy performed at least as well as specialists.

The role of deep retinal layers in the analysis of the progression of several diseases is becoming the focus of a notable number of studies. Besides the choroidal thickness measurement, the analysis of these layers opens the door to an array of global and local retinal feature descriptors that might be associated with ocular anatomy and functioning, which are still available to be explored. These functional and anatomical features can be highlighted with the help of precise tools for description of the choroid.

Conclusion

In this paper, we have proposed and implemented a deep learning approach to automatically segment out the choroid layer. OCT image segmentation was carried out using morphological operations and convolutional neural networks. Analysis of OCT images showed that the BM boundary is easy to extract when compared to the choroid layer, so BM was segmented first using a series of morphological operations. The appearance of choroid layer is inhomogeneous, so more image statistics were required for accurate segmentation, and a CNN model was used. As the focus of this research was to analyze the health of retina based on the segmented boundaries, the thickness of segmented layers was computed to analyze the presence of retinal abnormalities. Data for real patients was used to test the results of the proposed method. The results showed an accuracy of about 97 percent. The acquired segmentation results were perceived as quite comparable to the segmentation performed by specialists. Results showed that the proposed method maintained reduced error rates to a great extent as compared to some other existing state-of-the-art methods. To further improve the precision of the proposed method, we will work on the following related topics: improving present techniques on 2D images, improving compatibility with 3D images, optimizing parameters by considering different patch sizes, optimizing stride between the extracted patches, and hopefully widening the scope of the work observe other retinal abnormalities including macular edema, hypertensive retinopathy, and glaucoma.

Data Availability

The data sets generated during the study are available from the corresponding author on reasonable request.

Change history

13 December 2019
An amendment to this paper has been published and can be accessed via a link at the top of the paper.

References

Drexler, W. et al. Enhanced visualization of macular pathology with the use of ultrahigh-resolution optical coherence tomography. Archives of Ophthalmology 121. 5, 695–706 (2003).
Article Google Scholar
Yonetsu, T. et al. Optical coherence tomography. Circulation Journal 77(8), 1933–1940 (2013).
Article Google Scholar
Abadia, B. et al. Choroidal thickness measured using swept-source optical coherence tomography is reduced in patients with type 2 diabetes. PloS one 13(2) (2018).
Article Google Scholar
Melancia, D. et al. Diabetic choroidopathy: a review of the current literature. Graefe’s Archive for Clinical and Experimental Ophthalmology 254(8), 1453–1461 (2016).
Article Google Scholar
Schmitt, J. M. et al. Optical coherence tomography (OCT): a review. IEEE Journal of Selected Topics in Quantum Electronics 5(4), 1205–1215 (1999).
Article ADS CAS Google Scholar
Puliafito, C. A. et al. Imaging of macular diseases with optical coherence tomography. Ophthalmology 102(2), 217–229 (1995).
Article CAS Google Scholar
Fernández, D. C., Salinas, H. M. & Puliafito, C. A. Automated detection of retinal layer structures on optical coherence tomography images. Optics Express 13(25), 10200–10216 (2005).
Article Google Scholar
Tian, J., Marziliano, P., Baskaran, M., Tun, T. A. & Aung, T. Automatic measurements of choroidal thickness in EDI-OCT images. In Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 12, 5360–5363 (2012).
Tian, J., Marziliano, P., Baskaran, M., Tun, T. A. & Aung, T. Automatic segmentation of the choroid in enhanced depth imaging optical coherence tomography images. Biomedical Optics Express. 4(3), 397–411 (2013).
Article Google Scholar
Zhang, L. et al. Automated segmentation of the choroid from clinical SD-OCT. Investigative Ophthalmology & Visual Science. 53, 7510–7519 (2012).
Article ADS Google Scholar
Kaji, V. et al. Automated choroidal segmentation of 1060 nm OCT in healthy and pathologic eyes using a statistical mode. Biomedical Optics Express. 3, 86–103 (2012).
Article Google Scholar
Torzicky, T. et al. Automated measurement of choroidal thickness in the human eye by polarization sensitive optical coherence tomography. Optics Express. 20, 7564–7574 (2012).
Article ADS Google Scholar
Duan, L., Yamanari, M. & Yasuno, Y. Automated phase retardation oriented segmentation of chorio-scleral interface by polarization sensitive optical coherence tomography. Optics Express. 20, 3353–3366 (2012).
Article ADS Google Scholar
Lu, H., Boonarpha, N., Kwong, M. T. & Zheng, Y. Automated segmentation of the choroid in retinal optical coherence tomography images. In Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society 13, 5869872 (2013).
Garvin, M. K. et al. Intraretinal layer segmentation of macular optical coherence tomography images using optimal 3-D graph search. IEEE Transactions on Medical Imaging. 27, 1495–1505 (2008).
Article Google Scholar
Danesh, H., Kafieh, R., Rabbani, H. & Hajizadeh, F. Segmentation of choroidal boundary in enhanced depth imaging OCTs using a multiresolution texture based modeling in graph cuts. Computational and Mathematical Methods in Medicine (2014).
Haeker, M., Wu, X., Abramoff, M., Kardon, R. & Sonka, M. Incorporation of regional information in optimal 3-D graph search with application for intraretinal layer segmentation of optical coherence tomography images. Biennial International Conference on Information Processing in Medical Imaging, 607–618(2007).
Hu, Z., Wu, X., Ouyang, Y., Ouyang, Y. & Sadda, S. R. Semiautomated segmentation of the choroid in spectral-domain optical coherence tomography volume scans. Investigative Ophthalmology and Visual Science. 54(3), 1722–1729 (2013).
Article Google Scholar
Alonso-Caneiro, D., Read, S. A. & Collins, M. J. Automatic segmentation of choroidal thickness in optical coherence tomography. Biomedical Optics Express. 4(12), 2795–2812 (2013).
Article Google Scholar
Wang, C., Li, Y. & Wang, Y. X. Automatic choroidal layer segmentation using markov random field and level set method. IEEE journal of Biomedical and Health Informatics (2017).
Rossant, F., Ghorbel, I., Bloch, I., Paques, M. & Tick, S. Automated segmentation of retinal layers in OCT imaging and derived ophthalmic measures. IEEE International Symposium on Biomedical Imaging 1370–1373 (2009).
Ghorbel, I., Rossant, F., Bloch, I., Tick, S. & Paques, M. Automated segmentation of macular layers in OCT images and quantitative evaluation of performances. Pattern Recognition. 44(8), 1590–1603 (2011).
Article Google Scholar
Wang, C. et al. Segmentation of intra-retinal layers in 3d optic nerve head images. International Conference on Image and Graphics 321–332 (2015).
Esmaeelpour, M. et al. Choroidal haller’s and sattler’s layer thickness measurement using 3-dimensional 1060-nm optical coherence tomography. PloS One. 9(6), e99690 (2014).
Article ADS Google Scholar
Wang, C. et al. Automated layer segmentation of 3d macular images using hybrid methods. International Conference on Image and Graphics. 6(14–28) (2015).
CChiu, S. J. et al. Automatic segmentation of seven retinal layers in SDOCT images congruent with expert manual segmentation. Optics Express 18. 8(18), 19413–19428 (2010).
Article ADS Google Scholar
Kafieh, R. et al. Intra-retinal layer segmentation of 3D optical coherence tomography using coarse grained diffusion map. Medical image analysis. 17(8), 907–928 (2013).
Article Google Scholar
Tolliver, D. A., Koutis, I., Ishikawa, H., Schuman, J. S. & Miller, G. L. Automatic multiple retinal layer segmentation in spectral domain OCT scans via spectral rounding. Investigative Ophthalmology & Visual Science 49(13), 1878–1878 (2008).
Google Scholar
Ronneberger, O., Fischer, P. & Brox, T. U-net: convolutional networks for biomedical image segmentation. Medical Image Computing and Computer-Assisted Intervention 234–241 (2015).
Li, Z., Wang, Y., Yu, J., Guo, Y. & Cao, W. Deep Learning based Radiomics (DLR) and its usage in noninvasive IDH1 prediction for low grade glioma. Scientific reports. 7(1), 5467 (2017).
Article ADS Google Scholar
Ghafoorian, M. et al. Location sensitive deep convolutional neural networks for segmentation of white matter hyperintensities. Scientific Reports. 7(1), 5110 (2017).
Article ADS Google Scholar
Tu, X. et al. Automatic Categorization and Scoring of Solid, Part-Solid and Non-Solid Pulmonary Nodules in CT Images with Convolutional Neural Network. Scientific Reports. 7(1), 8533 (2017).
Article ADS Google Scholar
Zhang. et al. Deep convolutional neural networks for multi-modality isointense infant brain image segmentation. NeuroImage. 108, 214–224 (2015).
Article Google Scholar
Havaei, M. et al. Brain tumor segmentation with deep neural networks. Medical Image Analysis. 35, 18–31 (2017).
Article Google Scholar
Trebeschi, S. et al. Deep learning for fully-automated localization and segmentation of rectal cancer on multiparametric MR. Scientific reports. 7(1), 5301 (2017).
Article ADS Google Scholar
Sharma, K. et al. Automatic segmentation of kidneys using deep learning for total kidney volume quantification in autosomal dominant polycystic kidney disease. Scientific reports. 7(1), 2049 (2017).
Article ADS Google Scholar
Kenny, C. H. et al. Urinary bladder segmentation in CT urography using deep learning convolutional neural network and level sets. Medical Physics 43. 4, 1882–1896 (2016).
Google Scholar
Hong, T. J. et al. Segmentation of optic disc, fovea and retinal vasculature using a single convolutional neural network. Journal of Computational Science. 20, 70–79 (2017).
Article Google Scholar
Liskowski, P. & Krawiec, K. Segmenting retinal blood vessels with deep neural networks. IEEE Transactions on Medical Imaging. 35 (2016).
Ngo, L., Han, J. H. Advanced deep learning for blood vessel segmentation in retinal fundus images. Brain-Computer Interface, 5th International Winter Conference 91–92 (2017).
Dasgupta, A., Singh, S. A. Fully convolutional neural network based structured prediction approach towards the retinal vessel segmentation. Biomedical Imaging, IEEE 14th International Symposium 248–251 (2017).
Dhoot, D. S. et al. Evaluation of choroidal thickness in retinitis pigmentosa using enhanced depth imaging optical coherence tomography. British Journal of Ophthalmology. 97, 66–69 (2013).
Article Google Scholar
Imamura, Y., Fujiwara, T., Margolis, R. & Spaide, R. F. Enhanced depth imaging optical coherence tomography of the choroid in central serous chorioretinopathy. Retina. 29, 1469–1473 (2009).
Article Google Scholar
Manjunath, V., Goren, T., Fujimoto, T. G. & Duker, J. S. Analysis of choroidal thickness in age-related macular degeneration using spectral-domain optical coherence tomography. RAm. Journal of Ophthalmology. 4, 663–668 (2011).
Google Scholar
Esmaeelpour, M. et al. Mapping choroidal and retinal thickness variation in type 2 diabetes using three-dimensional 1060-nm optical coherence tomography. Investigative Ophthalmology and Visual Science. 8, 5311–5316 (2011).
Article Google Scholar
Sim, D. A. et al. Repeatability and reproducibility of choroidal vessel layer measurements in diabetic retinopathy using enhanced depth optical coherence tomography. Investigative Ophthalmology and Visual Science. 4, 2893–2901 (2013).
Article Google Scholar
Gopinath, K., Rangrej, S. B. & Sivaswamy, J. A deep learning framework for segmentation of retinal layers from OCT images.
Pekala, M. et al. Deep Learning based Retinal OCT Segmentation. arXiv preprint arXiv, 801.09749 (2018).
Fang, L. et al. Automatic segmentation of nine retinal layer boundaries in OCT images of non-exudative AMD patients using deep learning and graph search. Biomedical optics express 8(5), 2732–2744 (2017).
Article Google Scholar
Roy, A. G. et al. ReLayNet: retinal layer and fluid segmentation of macular optical coherence tomography using fully convolutional networks. Biomedical optics express 8(8), 3627–3642 (2017).
Article Google Scholar
Jia, Y. et al. Caffe: Convolutional Architecture for Fast Feature Embedding. arXiv preprint arXiv, 1408.5093 (2014).
Krizhevsky, A. & Hinton, G. Convolutional deep belief networks on cifar-10. Unpublished manuscript 40, 7 (2010).
Google Scholar
Giani, A. et al. Artifacts in automatic retinal segmentation using different optical coherence tomography instruments. Retina. 30(4), 607–6164 (2010).
Article MathSciNet Google Scholar
Zhang, L. Automated segmentation and analysis of layers and structures of human posterior eye. he University of Iowa (2015).
Vuong, V. S. et al. Repeatability of choroidal thickness measurements on enhanced depth imaging optical coherence tomography using different posterior boundaries. American Journal of Ophthalmology 169, 104–112 (2016).
Article Google Scholar
Ben-Cohen, A. et al. ReLayNet: etinal layers segmentation using Fully Convolutional Network in OCT images. RSIP Vision. RSIP Vision (2017).

Download references

Acknowledgements

This work was supported by the Macau Science and Technology Development Fund (0027/2018/A1) to P.L., the National Natural Science Foundation of China (61872241, 61572316), the National Key Research and Development Program of China (2017YFE0104000, 2016YFC1300302), and the Science and Technology Commission of Shanghai Municipality (18410750700, 17411952600, 16DZ0501100) to B.S.

Author information

Saleha Masood, Ruogu Fang, Ping Li, and Huating Li contributed equally.

Authors and Affiliations

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, 200240, China
Saleha Masood & Bin Sheng
J. Crayton Pruitt Family Department of Biomedical Engineering, University of Florida, Gainesville, FL 32611, USA
Ruogu Fang & Akash Mathavan
Faculty of Information Technology, Macau University of Science and Technology, Macau, 999078, China
Ping Li
Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, Shanghai, 200233, China
Huating Li, Xiangning Wang, Qiang Wu & Weiping Jia
Department of Computer Science, Liverpool John Moores University, Liverpool, L3 3AF, UK
Po Yang
Centre for Smart Health, School of Nursing, The Hong Kong Polytechnic University, Hong Kong, 999077, China
Jing Qin

Authors

Saleha Masood
View author publications
You can also search for this author in PubMed Google Scholar
Ruogu Fang
View author publications
You can also search for this author in PubMed Google Scholar
Ping Li
View author publications
You can also search for this author in PubMed Google Scholar
Huating Li
View author publications
You can also search for this author in PubMed Google Scholar
Bin Sheng
View author publications
You can also search for this author in PubMed Google Scholar
Akash Mathavan
View author publications
You can also search for this author in PubMed Google Scholar
Xiangning Wang
View author publications
You can also search for this author in PubMed Google Scholar
Po Yang
View author publications
You can also search for this author in PubMed Google Scholar
Qiang Wu
View author publications
You can also search for this author in PubMed Google Scholar
Jing Qin
View author publications
You can also search for this author in PubMed Google Scholar
Weiping Jia
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.M. designed the study, performed experiments and wrote the manuscript. P.L. and B.S. helped to design the study and the experiments and supervised the project. H.L., X.W., P.Y., and Q.W. provided the dataset and the manual annotations. R.F., A.M., J.Q., and W.J. provided comments and feedback on the study and the results. All the authors reviewed the manuscript.

Corresponding authors

Correspondence to Bin Sheng or Qiang Wu.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Sample Results

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Masood, S., Fang, R., Li, P. et al. Automatic Choroid Layer Segmentation from Optical Coherence Tomography Images Using Deep Learning. Sci Rep 9, 3058 (2019). https://doi.org/10.1038/s41598-019-39795-x

Download citation

Received: 27 March 2018
Accepted: 21 January 2019
Published: 28 February 2019
DOI: https://doi.org/10.1038/s41598-019-39795-x

This article is cited by

Automatic segmentation of layers in chorio-retinal complex using Graph-based method for ultra-speed 1.7 MHz wide field swept source FDML optical coherence tomography
- Raju Poddar
- Vinita Shukla
- Muktesh Mohan
Medical & Biological Engineering & Computing (2024)
Correlation of choroidal thickness with age in healthy subjects: automatic detection and segmentation using a deep learning model
- Chen Yu Lin
- Yu Len Huang
- Chia Jen Chang
International Ophthalmology (2022)
Synthetic OCT data in challenging conditions: three-dimensional OCT and presence of abnormalities
- Hajar Danesh
- Keivan Maghooli
- Rahele Kafieh
Medical & Biological Engineering & Computing (2022)
Choroidal imaging using optical coherence tomography: techniques and interpretations
- Tetsuju Sekiryu
Japanese Journal of Ophthalmology (2022)
Automatic choroid layer segmentation in OCT images via context efficient adaptive network
- Qifeng Yan
- Yuanyuan Gu
- Yitian Zhao
Applied Intelligence (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Related Work

Review of Recent Deep Learning Techniques in Computer Vision

Material and Methods

Pipeline Overview

Bruch’s Membrane Segmentation

Thresholding

Reconstruction

Thinning

Curve Fitting

Choroid Layer Segmentation Using Deep Learning

Pre-Processing

Data Sampling

Data Conversion

CNN Training

Thickness Map

Experimental Analysis

Dataset

Ground Truth Labeling

Experimental settings

Evaluation Metrics

Conclusion

Data Availability

Change history

13 December 2019

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links