Comparative analysis of image classification methods for automatic diagnosis of ophthalmic images

Wang, Liming; Zhang, Kai; Liu, Xiyang; Long, Erping; Jiang, Jiewei; An, Yingying; Zhang, Jia; Liu, Zhenzhen; Lin, Zhuoling; Li, Xiaoyan; Chen, Jingjing; Cao, Qianzhong; Li, Jing; Wu, Xiaohang; Wang, Dongni; Li, Wangting; Lin, Haotian

doi:10.1038/srep41545

Published: 31 January 2017


Scientific Reports volume 7, Article number: 41545 (2017)


Abstract

Many image classification methods exist, but it remains unclear which are most helpful for analyzing and intelligently identifying ophthalmic images. We select representative slit-lamp images, which reflect the complexity of ocular images, as research material for comparing image classification algorithms in the diagnosis of ophthalmic diseases. Several feature extraction algorithms and classifiers are combined to automatically diagnose pediatric cataract on the same dataset, and their performance is compared using multiple criteria. This comparative study reveals the general characteristics of existing methods for the automatic identification of ophthalmic images and provides new insights into their strengths and shortcomings. The relevant methods (local binary pattern + SVMs, wavelet transformation + SVMs) achieve an average accuracy of 87% and can be adopted in specific situations to aid doctors in preliminary disease screening. Furthermore, some methods that require fewer computational resources and less time could be applied in remote places or on mobile devices to help individuals understand their own condition. In addition, this study may help accelerate the development of innovative approaches and the application of these methods to assist doctors in diagnosing ophthalmic disease.


Introduction

The diagnosis of ocular diseases depends mainly on the observation of various ophthalmic images from patients. Numerous studies on computer-aided diagnosis of ophthalmic diseases have been reported^{1,2,3,4,5,6,7,8,9,10,11,12,13}. However, not all image classification methods are suitable for the automatic diagnosis of ocular disease. It is therefore important to compare the feasibility and performance of image classification methods in diagnosing ophthalmic diseases, which could facilitate follow-up studies on automatic diagnosis and shed light on its application.

Slit-lamp images are an important kind of ophthalmic image, and some diseases, such as cataracts, can be diagnosed from them. Slit-lamp images from cataract patients show considerable heterogeneity among patients and thus represent the complexity of ophthalmic images. Furthermore, although some progress has been made in the automatic diagnosis of ophthalmic diseases, studies based on slit-lamp images are still relatively few¹⁴. To fill this gap, common slit-lamp images from patients suffering from pediatric cataract were chosen as the research material in our study to explore which methods are effective and efficient in diagnosing ophthalmic disease.

We selected several methods that have already been applied to the diagnosis of ocular diseases^4,9,10,12. Liye Guo et al. proposed a complete framework for the automatic diagnosis of cataracts using fundus images: after image features are extracted with wavelet transformation and sketch-based methods, a multiple-Fisher classifier divides all samples according to the severity of the disease⁴. Muthu Rama Krishnan Mookiah et al. proposed an automated dry age-related macular degeneration (AMD) detection system using various features extracted from grayscale fundus images, including entropies, HOS (higher order spectra) bispectral features, fractional dimension (FD) and Gabor wavelet features. The features are selected with a series of statistical tests and then classified with a group of classifiers, such as k-nearest neighbor (k-NN), probabilistic neural network (PNN), decision tree (DT) and SVMs; the average accuracy was greater than 90%⁹. Mookiah et al. also used HOS and discrete wavelet transform (DWT) features as image descriptors of eye images and SVMs (support vector machines) as the classifier to diagnose glaucoma, obtaining higher accuracy than previous work¹⁰. Anushikha Singh et al. used genetic feature selection to choose useful features from wavelet features extracted from segmented optic disc images in which the blood vessels had been removed, in order to detect glaucoma automatically from fundus images; their experiments demonstrated that this approach is better than classifying whole or sub-fundus images directly¹². In addition, several methods¹⁵ from the image classification field that have not yet been employed in this area were also selected to identify ophthalmic diseases.

The experimental results reveal that some methods [e.g. LBP (local binary pattern) + SVMs (support vector machines), wavelet transformation + SVMs] perform well (recognition accuracy over 87%). However, even with the same features, some methods [e.g. extreme learning machine (ELM), sparse representation] are not suitable for diagnosing ocular disease. Methods with better performance can be adopted to aid doctors in the preliminary screening of diseases. Furthermore, some methods [e.g. color and texture features combined with SVMs or kNN (k-nearest neighbor)] that require less memory or less time can be applied in practice in specific situations, such as on mobile devices. In addition, the present study can help people in remote places, or hospitals without advanced equipment, choose suitable methods for screening ophthalmic diseases.

Results and Discussion

Dataset

All of the slit-lamp images in the dataset were obtained from the Zhongshan Ophthalmic Center, Sun Yat-sen University, which is the most advanced ophthalmic hospital in China and has established a state-of-the-art ophthalmology laboratory¹⁶. There are 476 slit-lamp images (positive samples) from patients who suffer from pediatric cataracts and 410 slit-lamp images (negative samples) from control individuals. Because cataracts occur in the lens and the other parts of the eye are not useful for diagnosing this disease, all of the images were manually segmented so that a rectangle covers the lens while including as little as possible of the rest of the image, which is useless for classification. This process is illustrated in Fig. 1, which provides some examples of slit-lamp images and the ROI (region of interest) taken from them; Fig. 1(a) and (b) are from normal people, and Fig. 1(c) and (d) are from patients.

Experimental settings and performance evaluation indicator

The eight classification schemas and their parameters are shown in Table 1. Four-fold cross validation was adopted to permit a fair comparison of these methods. Eight combinations of feature extraction algorithms and image classification methods were assembled to automatically diagnose pediatric cataracts, and their performance was compared in various respects. The schemas are divided into three groups. In the first group [schemas (1), (2), (3) and (4)], the features are color features combined with texture features and the classification methods differ, so that the best classifier among these four schemas can be identified. The second group [schemas (5) and (6)] combines better features with the best classifier obtained in the first group, so that the best schema can be obtained. In the third group [schemas (7) and (8)], sparse representation, which has rarely been used to recognize medical images, is used to identify ocular images. The eight schemas were implemented in MATLAB R2014a on a personal computer with a 3.60 GHz Intel i7 processor and 16 GB of RAM. The objective of our study is to apply these methods to the automatic identification of ophthalmic disease, validate their feasibility and compare their performance. The indicators used to evaluate performance are as follows:

$$\text{Accuracy} = \frac{TP + TN}{P + N}, \qquad \text{Sensitivity} = \frac{TP}{P}, \qquad \text{Specificity} = \frac{TN}{N}$$

$$\text{FNR} = \frac{FN}{P}, \qquad \text{FPR} = \frac{FP}{N}$$

Table 1 The relevant parameters of eight schemas.


where P and N are the numbers of positive and negative samples, respectively; TP is the number of positive samples classified into the positive class; FN is the number of positive samples classified as negative; TN is the number of negative samples recognized as negative; and FP is the number of negative samples identified as positive. In addition, the ROC (receiver operating characteristic) curve, which indicates how many positive samples are recognized at a given false positive rate, and the AUC (area under the curve), which is the area under the ROC curve, are also adopted to assess performance.
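The indicators described above can be sketched in a few lines of Python. This is a minimal illustration rather than the MATLAB code used in the study: the function names are hypothetical, the definitions are the standard ones for these quantities, and the AUC is computed through its rank interpretation (the probability that a positive sample receives a higher score than a negative one) instead of by integrating a plotted ROC curve.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Accuracy, sensitivity, specificity, FNR and FPR from confusion counts."""
    p = tp + fn  # number of positive samples (P)
    n = tn + fp  # number of negative samples (N)
    if p == 0 or n == 0:
        raise ValueError("need at least one positive and one negative sample")
    return {
        "accuracy": (tp + tn) / (p + n),
        "sensitivity": tp / p,
        "specificity": tn / n,
        "fnr": fn / p,  # equals 1 - sensitivity
        "fpr": fp / n,  # equals 1 - specificity
    }


def auc_from_scores(pos_scores, neg_scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    wins = 0.0
    for ps in pos_scores:
        for ns in neg_scores:
            if ps > ns:
                wins += 1.0
            elif ps == ns:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    print(diagnostic_metrics(tp=40, fn=10, tn=45, fp=5))
    print(auc_from_scores([0.9, 0.8, 0.4], [0.7, 0.3, 0.2]))
```

An AUC near 0.5 under this definition means that positives and negatives are ranked essentially at random, which is how the AUC values of the weakest schemas are interpreted later in the text.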

Comparison of the performance of schemas (1), (2), (3) and (4)

First, the performance of schemas (1), (2), (3) and (4) is compared. The results are shown in Table 2 in the format μ ± δ (μ is the mean value, δ is the standard deviation). Because FNR and FPR can be computed from sensitivity and specificity, they are not shown. The linear kernel was chosen as the kernel function in schemas (2) and (3) because SVMs with other kernel functions did not converge within an acceptable time. Schema (3) achieved the highest recognition accuracy among these four schemas; compared with schema (2), its classification accuracy improved significantly because it selects the subset of features that are helpful for diagnosing ocular images. In contrast, the performance of schema (1) is unsatisfactory, whereas schema (4) performs relatively well regardless of the value of the parameter k. The overall performance of these four schemas was further assessed using ROC curves and AUC values, which are shown in Fig. 2(a), where k = 10 for schema (4). Because the GA (genetic algorithm) is a probabilistic algorithm, the results obtained by schema (3) may differ from run to run; schema (3) was therefore repeated ten times, and the ROC curve shown in Fig. 2(a) comes from the run with the highest accuracy, in which three color features and seventeen texture features were found to be helpful for classification. All of the above results indicate that schema (3) achieves the highest accuracy among these four schemas.

Table 2 Performance comparison of schemas (1), (2), (3) and (4) with different k.


**Figure 2: ROC curves for eight schemas.**

Comparison of the performance of schemas (5), and (6)

Because the performance of different classifiers with the same image features was compared in the previous stage, SVMs were selected as the classifier in this stage. In schema (5), all wavelet coefficients were obtained using a two-level discrete wavelet transformation with three different types of wavelet. The performance of schema (5) with the linear kernel is almost as good as that with the polynomial kernel, while its performance with other kernel functions was not desirable. A performance comparison of schemas (5) and (6) with two types of kernel functions is shown in Table 3, which indicates that both schemas are good at identifying ophthalmic images. The ROC curves and AUC values for schemas (5) and (6) with different kernel functions are shown in Fig. 2(b) and (c). Compared with the other schemas, schemas (5) and (6) demonstrate the best robustness and performance. This result also indicates that the feature extraction procedures of schemas (5) and (6) capture the characteristics that distinguish samples of the two classes, and that the resulting features facilitate classification. Because the ROC curves of schemas (5) and (6) are too similar to separate their performance, the AUC values were used for a more objective and precise comparison. As shown in Fig. 2(b) and (c), LBP is more suitable than wavelet transformation for depicting image features in terms of the AUC values.

Table 3 Performance comparison of schemas (5) and (6).


Comparison of the performance of schemas (7), and (8)

In schema (7), the sample size (image size) differs between applications, so the sample size was varied in the experiments to determine how it affects classification accuracy. Table 4 shows the performance of schemas (7) and (8). Schema (7) achieved better performance with a smaller sample size, while schema (8) performed relatively poorly on this problem. We therefore attempted to establish a set of suitable parameters for schemas (7) and (8).

Table 4 Performance of schemas (7) and (8).


We used an over-complete dictionary formed from fewer images, and all of the remaining images not included in the dictionary were used as testing samples. The number of positive samples in the dictionary is larger than that of negative samples, which is expected because negative samples are quite similar to one another, whereas the conditions of positive samples are very complicated. In other words, only a few representative negative samples are needed for the samples to be classified. The performance of schema (7), in which all images are resized to a uniform size (15 × 20 or 5 × 10), and of schema (8) can be improved by using an over-complete dictionary of appropriate size. Schema (7) can also obtain better performance with a proper sample size. The performance of schemas (7) and (8) with different parameters is shown in Table 5, which illustrates that both schemas improve on their original versions when the parameters are adjusted. Furthermore, Table 5 confirms that the performance of schema (7) is slightly better with a smaller sample size than with a larger one. Compared with schema (4), which uses the same features as schema (8), the performance of schema (8) is unacceptable, which indicates that sparse representation is unsuited to this task.

Table 5 Performance of schema (7) using an over-complete dictionary with different sizes and different sample sizes and schema (8) for different over-complete dictionary sizes.


The ROC curves for schema (7) with one negative sample and 80 positive samples and for schema (8) with five negative samples and 70 positive samples (all of the samples are resized to a uniform size of 5 × 10) are shown in Fig. 2(d). Although the classification accuracy of schemas (7) and (8) can surpass that of schema (1) when proper parameters are used, the AUC values of schemas (7) and (8) are the smallest among the eight schemas. Furthermore, these AUC values are close to 0.5, which indicates that the classification is nearly equivalent to random guessing.

Comparison of computational and memory complexity of all schemas

Finally, memory usage, shown in Fig. 3(a), and running time, shown in Fig. 3(b), were used to evaluate the memory and computational complexity of the schemas. Because the four folds of the cross validation are classified together in schema (3), its memory usage and running time were divided into four equal parts. The running time and memory usage of schemas (5) and (6) are almost the same across kernel functions and wavelets, so their mean values are shown in Fig. 3. For schemas (7) and (8), Fig. 3 shows the values obtained with the modified parameters (5 negative samples, 75 positive samples and a 5 × 10 sample size for schema (7); 1 negative sample and 80 positive samples for schema (8)). For schema (4), the values for k = 10 are shown. The eight schemas differ significantly in memory usage. Schemas (5) and (6) occupy more memory than the other schemas, but their performance is excellent, so it is worth trading some memory for high performance and accuracy. Schema (2) achieves satisfactory performance with little memory and in little time. Compared with schemas (1), (2) and (4), schema (7) occupies more memory and takes longer without providing a comparably good result. Although schema (8) does not consume as many resources as schema (7), its results are not satisfactory either, so schemas (7) and (8) cannot provide meaningful diagnostic advice. In terms of running time, schema (3) requires the longest time, and its cost outweighs its gain to some extent. Apart from schema (3), the computational time of the remaining schemas is reasonable.

**Figure 3: Memory usage and Computational time of the 8 schemas.**

Conclusions and Future Work

Eight groups of representative methods for image feature extraction and image classification have been applied to automatically identify ophthalmic images. In the present study, the performance of the eight schemas is compared with regard to multiple aspects. No matter which feature extraction method is adopted, the SVMs (support vector machines) classifier yielded desirable performance, as verified by four schemas [schemas (2), (3), (5) and (6)]. Some traditional methods, such as the kNN (k-nearest neighbor) approach, were also able to provide relatively satisfactory results. Although the ELM (extreme learning machine) is quicker than other neural network architectures when applied to classification, its results were undesirable. Like the ELM, sparse representation is not good at solving this problem, and its accuracy improved only insignificantly with modified parameters. This problem may be solved by using an advanced or improved optimization algorithm¹⁷ and a better dictionary learning algorithm¹⁸, which is promising in the field of image classification. The feature extraction methods, such as wavelet transformation, the LBP (local binary pattern) algorithm, color features and texture features, were able to express the image characteristics to varying degrees and thereby facilitate classification. In the future, efforts can be made to explore other image feature extraction methods and image classification algorithms, such as deep learning, which performs feature extraction and classification at the same time and may therefore be suited to this problem and achieve better performance. Finally, the algorithms that perform well and do not need many computational resources could be used to help doctors screen ophthalmic diseases or to help individuals eliminate these diseases. Some methods could be deployed in remote places without advanced equipment or in hospitals without experienced doctors. Furthermore, methods with better performance could be deployed on mobile phones without large-capacity memory or high-speed processors to help individuals take precautions against ophthalmic diseases.

Methods

Color features extraction

An image is formed of pixels that express color features¹⁹, which provide valuable statistical information about the color distribution of the image; color features are therefore an important category of image features. They have been applied to image segmentation¹⁹, image classification²⁰ and other problems. Color feature extraction must be based on a specific color space, such as HSV (hue, saturation, value), HSI (hue, saturation, intensity) or RGB (red, green, blue). The RGB color space is chosen in the present study, and the first-, second- and third-order moments of its 3 channels are computed as shown in equations (1, 2, 3).

where N is the total number of pixels in an image and p_i,j is the level of the R, G or B component in one pixel. Equations (1, 2, 3) yield nine color features in total (three moments for each of the three channels).
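A minimal Python sketch of the nine color features follows. It assumes the commonly used color-moment definitions: the mean, the square root of the second central moment, and the signed cube root of the third central moment of each channel. Whether the study takes these roots is an assumption here, and the function name and the list-of-tuples pixel format are hypothetical.

```python
import math


def color_moments(pixels):
    """Return 9 features: (mean, spread, skew) for the R, G and B channels.

    pixels: a non-empty list of (R, G, B) tuples, one tuple per pixel.
    """
    n = len(pixels)
    features = []
    for channel in range(3):
        values = [px[channel] for px in pixels]
        mean = sum(values) / n                            # first-order moment
        second = sum((v - mean) ** 2 for v in values) / n
        third = sum((v - mean) ** 3 for v in values) / n
        spread = math.sqrt(second)                        # second-order moment
        # Signed cube root, so that negative skew stays negative (assumed form).
        skew = math.copysign(abs(third) ** (1.0 / 3.0), third)
        features.extend([mean, spread, skew])
    return features


if __name__ == "__main__":
    toy_image = [(0, 10, 5), (2, 10, 5), (4, 10, 8)]  # three made-up pixels
    print(color_moments(toy_image))
```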

Texture features extraction

Texture features are used to depict the variation relations among different pixels in an image. Gray tone spatial dependence matrices and gray gradient co-occurrence matrices are employed to extract the texture features of an image and have been applied in many fields, such as automatic medical diagnosis^21,22, defect detection²³ and pattern classification²⁴.

As a typical texture feature extraction method, statistic-based gray tone spatial dependence matrices²⁵ present comprehensive information about the gray distribution in several respects, including direction, variation range and local domain. Each element of a gray tone spatial dependence matrix is the frequency with which two pixels separated by d pixels in direction θ have gray values i and j, respectively. θ is generally set to 0°, 45°, 90° or 135°; therefore, 4 matrices, whose elements are shown in equations (4,5,6,7), can be obtained with this method. A series of features can be computed from the gray tone spatial dependence matrices and used to form part of the whole feature vector in the present study.

where (k, l) and (m, n) are pixel coordinates in the images; I(k, l) and I(m, n) are the gray values of the corresponding pixels; L_x and L_y are the numbers of rows and columns of the gray tone spatial dependence matrices respectively, which are related to the level of gray. The summation of the four gray tone spatial dependence matrices is then normalized to compute 14 texture features. The # symbol denotes the number of elements in a set.
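The construction just described can be sketched in Python as follows. The sketch counts pixel pairs at distance d in the four directions, sums the four matrices and normalizes the result, as stated above. Counting each pair in both orders (a symmetric matrix) is an assumption borrowed from the usual Haralick convention, and only two of the 14 texture features (contrast and energy) are shown. All function names are hypothetical.

```python
def gray_tone_matrix(image, levels, d=1):
    """Normalized sum of the 0, 45, 90 and 135 degree co-occurrence matrices.

    image: 2-D list of integer gray levels in range(levels).
    """
    rows, cols = len(image), len(image[0])
    # (row offset, column offset) for 0, 45, 90 and 135 degrees
    offsets = [(0, d), (-d, d), (-d, 0), (-d, -d)]
    counts = [[0] * levels for _ in range(levels)]
    for dr, dc in offsets:
        for r in range(rows):
            for c in range(cols):
                r2, c2 = r + dr, c + dc
                if 0 <= r2 < rows and 0 <= c2 < cols:
                    i, j = image[r][c], image[r2][c2]
                    counts[i][j] += 1
                    counts[j][i] += 1  # symmetric counting (assumed)
    total = sum(map(sum, counts))
    if total == 0:
        raise ValueError("image too small for the chosen distance d")
    return [[v / total for v in row] for row in counts]


def contrast(p):
    """Sum of (i - j)^2 * p[i][j] over the normalized matrix."""
    return sum((i - j) ** 2 * v for i, row in enumerate(p) for j, v in enumerate(row))


def energy(p):
    """Angular second moment: sum of squared matrix entries."""
    return sum(v * v for row in p for v in row)


if __name__ == "__main__":
    p = gray_tone_matrix([[0, 1], [1, 0]], levels=2)
    print(p, contrast(p), energy(p))
```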

In addition to the gray tone spatial dependence matrices, the gray gradient co-occurrence matrix²⁶, which captures both the variation of gray and the gradient, is also commonly employed to represent the texture features of an image. The 15 features computed from the gray gradient co-occurrence matrix are also chosen as part of the feature vector of the images. The gradient image G is calculated, and its elements are quantized into L_g gray levels. Subsequently, the gray gradient co-occurrence matrix can be built from G and the original image I using equation (8).

where (i, j) is the pixel coordinate in the image; I(i, j) and G(i, j) are the gray values of corresponding pixels in I and G respectively. The # symbol denotes the number of elements in a set.
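As an illustration of the counting step, the sketch below builds a gray gradient co-occurrence matrix from an already computed gradient image. The gradient operator is not specified in the text, so the gradient image is taken as an input; the linear quantization rule and both function names are assumptions made for this example.

```python
def quantize(values, levels):
    """Linearly rescale a 2-D list of non-negative numbers to 0..levels-1."""
    peak = max(max(row) for row in values)
    if peak == 0:
        return [[0] * len(row) for row in values]
    return [[int(v * (levels - 1) / peak) for v in row] for row in values]


def gray_gradient_matrix(gray, gradient, gray_levels, grad_levels):
    """Entry [x][y] counts the pixels with gray level x and gradient level y.

    gray and gradient are 2-D lists of equal shape holding quantized levels.
    """
    h = [[0] * grad_levels for _ in range(gray_levels)]
    for gray_row, grad_row in zip(gray, gradient):
        for x, y in zip(gray_row, grad_row):
            h[x][y] += 1
    return h


if __name__ == "__main__":
    gray = [[0, 1], [1, 1]]                       # made-up quantized gray image
    grad = quantize([[0, 5], [10, 10]], levels=3)  # made-up gradient magnitudes
    print(gray_gradient_matrix(gray, grad, gray_levels=2, grad_levels=3))
```

Dividing each entry by the number of pixels gives a normalized matrix from which statistical features can then be computed.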

Wavelet transformation

The wavelet transformation²⁷, which can effectively analyze the features of an image at different scales, has been frequently applied in image processing and image classification^27,28. It provides time-frequency information about the images and is therefore a preferred replacement for some shallow, time-invariant image features.

Given f(x), a continuous, square-integrable function, the continuous wavelet transformation of f(x) is

where ψ_s,t(x) is constructed from a real-valued mother wavelet with scale factor s and translation factor t.

To discretize the transformation, the scale factor is sampled as s = s₀^j, where s₀ is the expansion step, and the translation factor as t = k s₀^j t₀, so that at scale j = 0 each translation is kt₀. With k, j ∈ Z and constants s₀ > 1, t₀ > 1, the discrete wavelet is derived as equation (10):

Thus, the corresponding wavelet transformation is given as equation (11):

The wavelet coefficients extracted from images using equation (11) can be regarded as feature vectors and forwarded into the classifier to identify pediatric cataracts.
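A two-level decomposition that turns an image into a coefficient vector can be sketched as below. The Haar wavelet is used only because it is the simplest to write out; the three wavelet types used in schema (5) are not named in the text, so this choice, the ordering of the coefficients and the function names are assumptions of the sketch.

```python
def haar_step(signal):
    """One level of the orthonormal Haar transform of an even-length sequence."""
    r = 2 ** 0.5
    approx = [(signal[i] + signal[i + 1]) / r for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / r for i in range(0, len(signal), 2)]
    return approx, detail


def _transform_columns(block):
    """Apply haar_step to every column; return (approximation, detail) blocks."""
    cols = [haar_step(list(col)) for col in zip(*block)]
    approx = [list(row) for row in zip(*(c[0] for c in cols))]
    detail = [list(row) for row in zip(*(c[1] for c in cols))]
    return approx, detail


def haar_dwt2(image):
    """One 2-D level (rows first, then columns). Returns LL, LH, HL, HH."""
    low_rows, high_rows = [], []
    for row in image:
        a, d = haar_step(row)
        low_rows.append(a)
        high_rows.append(d)
    ll, lh = _transform_columns(low_rows)
    hl, hh = _transform_columns(high_rows)
    return ll, lh, hl, hh


def wavelet_features(image, levels=2):
    """Flatten all detail bands and the final approximation into one vector.

    Height and width must be divisible by 2 ** levels.
    """
    features, current = [], image
    for _ in range(levels):
        current, lh, hl, hh = haar_dwt2(current)
        for band in (lh, hl, hh):
            features.extend(v for row in band for v in row)
    features.extend(v for row in current for v in row)
    return features


if __name__ == "__main__":
    flat = [[1.0] * 4 for _ in range(4)]
    print(wavelet_features(flat))  # a constant image has no detail energy
```

Because the transform is orthonormal, the sum of squared coefficients equals the sum of squared pixel values, which is a convenient sanity check on any implementation.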

LBP (local binary pattern)

The LBP^29,30 is a simple yet effective operator for depicting the local texture of images, with properties such as rotation invariance and gray-scale invariance. The LBP operator summarizes the relationship between the gray value of the central pixel and the gray values of its neighbors: the gray intensity of each surrounding pixel is compared with that of the central pixel. If the gray value of the surrounding pixel is larger than or equal to that of the central pixel, the corresponding bit of the LBP code is set to 1; otherwise, it is set to 0. The LBP code can be expressed as equation (12), where g_c and g_i denote the gray value of the central pixel and the gray value of a neighboring pixel, respectively. Here P = 8 and s(·) is defined as in equation (13):

If equation (12) is used to calculate the LBP code, 256 different LBP codes can arise in the LBP descriptor of an image. Some special patterns, known as uniform patterns, occur far more frequently than the remaining patterns, which makes it possible to reduce the number of LBP codes drastically. If the total number of transitions from 1 to 0 or from 0 to 1 in an LBP code is no more than 2, the code is classified as a uniform pattern. The LBP value of a pixel can be computed with this extended LBP operator; see equation (14). The LBP feature is created by computing the histogram of LBP values in each region of the partitioned image and then concatenating these histograms.

where U(LBP_P,r) is the total number of transitions from 1 to 0 or from 0 to 1, and s(·) is given in equation (13).
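The thresholding rule and the transition count can be sketched for a single 3 × 3 patch as follows. The order in which the eight neighbors are visited and the bit weights are assumptions, since any fixed circular order gives an equivalent descriptor, and the uniform-pattern test follows the usual convention of at most two transitions. The function names are hypothetical.

```python
def lbp_bits(patch):
    """Threshold the 8 neighbors of a 3x3 patch against its center pixel."""
    center = patch[1][1]
    # Neighbors in circular order, starting at the top-left corner (assumed).
    ring = [patch[0][0], patch[0][1], patch[0][2], patch[1][2],
            patch[2][2], patch[2][1], patch[2][0], patch[1][0]]
    return [1 if g >= center else 0 for g in ring]


def lbp_code(patch):
    """Basic LBP code: bit i contributes 2**i, giving a value in 0..255."""
    return sum(bit << i for i, bit in enumerate(lbp_bits(patch)))


def transitions(bits):
    """Number of 0/1 changes around the circular pattern (the U value)."""
    return sum(bits[i] != bits[(i + 1) % len(bits)] for i in range(len(bits)))


def uniform_lbp(patch):
    """Count of 1 bits for uniform patterns; P + 1 for all other patterns."""
    bits = lbp_bits(patch)
    return sum(bits) if transitions(bits) <= 2 else len(bits) + 1


if __name__ == "__main__":
    edge = [[9, 9, 9], [1, 5, 9], [1, 1, 1]]  # made-up patch on an edge
    print(lbp_code(edge), transitions(lbp_bits(edge)), uniform_lbp(edge))
```

With P = 8 this mapping leaves ten possible values per pixel (0 to 8 for uniform patterns and 9 for everything else), which is why the per-region histograms become much shorter than with the 256 basic codes.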

SVMs (support vector machines) classifier

Statistical learning-based SVMs³¹, which apply structural risk minimization theory, have emerged as a pattern classification method in recent years and exhibit strong generalization. SVMs demonstrate high performance, have reliable theoretical support and have been applied in many fields^32,33,34. They were originally designed to solve binary classification problems. Given a dataset in which each sample belongs to one of two classes, an SVM attempts to find a hyperplane that separates the samples of the two categories while maximizing the distance between the 2 classes. Given training pairs (x_i, y_i) from two classes, where x_i and y_i are the feature vector and label of the ith sample respectively, the decision surface for linearly separable data can be expressed as equation (15):

In this case, a_i is a Lagrange multiplier, the vectors corresponding to a_i > 0 are the support vectors, and f(x) is not affected by the dimension of the feature space. The decision surface for data that are not linearly separable can be expressed as equation (16):

For problems that are not linearly separable, a kernel function maps the low-dimensional data into a high-dimensional space, so that each original data point corresponds to a new data point in the new feature space. Because many kernel functions are available, the choice of kernel function should depend on the specific problem. LIBSVM³⁵ is adopted to carry out the SVM experiments in the present study.
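To make the role of the multipliers and the kernel concrete, the sketch below evaluates a kernel decision function of the standard form sign(Σ a_i y_i K(x_i, x) + b). It is not LIBSVM and performs no training: the support vectors, multipliers and bias in the usage example are made-up values, and the function names and the RBF parameter gamma are assumptions of this illustration.

```python
import math


def linear_kernel(u, v):
    """Inner product, which gives the linear decision surface."""
    return sum(a * b for a, b in zip(u, v))


def rbf_kernel(u, v, gamma=0.5):
    """Gaussian kernel, one common choice for non-linear problems."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(u, v)))


def svm_decide(x, support_vectors, labels, alphas, bias, kernel=linear_kernel):
    """Return +1 or -1 from the weighted kernel sum over the support vectors."""
    score = bias
    for sv, y, a in zip(support_vectors, labels, alphas):
        score += a * y * kernel(sv, x)
    return 1 if score >= 0 else -1


if __name__ == "__main__":
    # Hypothetical trained model: two support vectors with equal multipliers.
    svs = [(1.0, 1.0), (-1.0, -1.0)]
    ys = [1, -1]
    alphas = [0.25, 0.25]
    print(svm_decide((2.0, 0.0), svs, ys, alphas, bias=0.0))
    print(svm_decide((-3.0, 1.0), svs, ys, alphas, bias=0.0, kernel=rbf_kernel))
```

Only the support vectors enter the sum, which is why the cost of a prediction depends on their number rather than on the size of the training set.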

ELM (extreme learning machine) classifier

The ELM³⁶ is an improved neural network architecture that does not require gradient information of the error function and only needs to solve a matrix equation during training, which makes training fast. Suppose there are N samples and a three-layer feedforward neural network with a single hidden layer. If g(x) is the activation function of the neurons in the hidden layer, the output matrix of the hidden layer is H, the weight matrix between the hidden layer and the output layer is β, and the output matrix of the output layer is T. The classification problem can then be solved through the matrix equation Hβ = T, namely, β = H⁺T, where H⁺ is the generalized inverse of H.
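For reference, the training step can be written out as below. The first expression is the usual ELM construction of the hidden-layer output matrix, in which the input weights w_k and biases b_k are drawn at random and kept fixed; these two symbols and the index convention (j over samples, k over hidden neurons) are notation assumed here. The closed form on the right is a standard identity for the generalized inverse that holds when H has full column rank.

```latex
H_{jk} = g(w_k \cdot x_j + b_k), \qquad
\beta = H^{+} T = (H^{\top} H)^{-1} H^{\top} T
```

Since only β is learned and it is obtained from a single linear solve, no iterative gradient descent is needed, which is consistent with the short training time noted above.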

Sparse representation

Sparse representation^37,38 is an effective method that converts a classification problem into an optimization problem. First, an over-complete dictionary is constructed from a portion of the images, and the remaining images, which are not included in the dictionary, are represented by the images in the dictionary. Suppose that the samples belong to k classes and that the training samples of the ith class constitute A_i; the training samples of all k classes can then be combined into the over-complete dictionary A = [A₁, A₂,…,A_k], a matrix of size m × n, and a sample y to be classified can be expressed as equation (17):

where x is the coefficient vector, which can be understood as the projection of y on A. Ideally, the coefficients corresponding to the ith class, which is the true class of y, are non-zero, and the remaining coefficients are zero. If n is large enough, then x is sparse. If m < n, this problem becomes a minimization problem of the L₀ norm, namely,

Because this problem is NP-hard, it can be converted, according to compressed sensing theory, as follows:

To solve equation (19), the differential evolution (DE) algorithm³⁹, whose key procedures include differential mutation, crossover and greedy choice, is introduced. The differential mutation and crossover are implemented as equations (20) and (21), respectively.

where v_i is the differential vector created by equation (20) from the aim vector x, two randomly selected vectors and a scaling factor, and N is the size of the population.

where x^j is the jth component of the cross vector corresponding to the selected aim vector; equation (21) builds it from the jth component of the differential vector according to the cross factor.

Because this problem is a constrained optimization problem, the Deb criterion is introduced into the greedy choice of DE as follows. If both solutions are in the feasible region, choose the individual with the higher fitness; if neither solution is in the feasible region, choose the individual that violates the constraints less; if one solution is in the feasible region and the other is not, choose the individual in the feasible region.
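The three DE procedures can be sketched in Python as follows. The mutation adds the scaled difference of two randomly selected vectors to the aim vector, as described above. The binomial form of the crossover, the forced index that guarantees at least one component from the differential vector, and the representation of an individual as a (fitness, violation) pair are assumptions of this sketch, and all names are hypothetical.

```python
import random


def de_mutation(target, r1, r2, scale):
    """Differential vector: aim vector plus scaled difference of two others."""
    return [t + scale * (a - b) for t, a, b in zip(target, r1, r2)]


def de_crossover(target, mutant, cross_factor, rng):
    """Binomial crossover; one forced index always comes from the mutant."""
    forced = rng.randrange(len(target))
    return [m if (rng.random() < cross_factor or j == forced) else t
            for j, (t, m) in enumerate(zip(target, mutant))]


def deb_select(a, b):
    """Greedy choice under the Deb criterion.

    a and b are (fitness, violation) pairs; violation 0 means feasible and
    a higher fitness is better.
    """
    fit_a, vio_a = a
    fit_b, vio_b = b
    if vio_a == 0 and vio_b == 0:
        return a if fit_a >= fit_b else b   # both feasible: higher fitness
    if vio_a > 0 and vio_b > 0:
        return a if vio_a <= vio_b else b   # both infeasible: smaller violation
    return a if vio_a == 0 else b           # exactly one is feasible


if __name__ == "__main__":
    rng = random.Random(0)
    mutant = de_mutation([1.0, 2.0], [3.0, 5.0], [1.0, 1.0], scale=0.5)
    print(mutant, de_crossover([1.0, 2.0], mutant, 0.9, rng))
    print(deb_select((0.1, 0.0), (9.0, 0.2)))
```

For the L₁ problem above, one natural (assumed) choice would be to use the negative L₁ norm of x as the fitness and the size of the residual between Ax and y as the violation.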

Feature selection

Because the high dimension of the image feature vector makes it difficult to tell which features are helpful for classification, the genetic algorithm (GA)^40,41 and SVMs (support vector machines) are combined to solve the image classification problem, performing feature selection and classification at the same time. The accuracy of the SVMs is adopted as the fitness evaluation function of the GA. Chromosomes use binary coding, and the length of a chromosome equals the dimension of the feature vector: a 0 bit means that the corresponding feature is not used in classification, and a 1 bit means that it is used.
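The binary coding and the fitness evaluation can be sketched as below. The callback accuracy_fn is hypothetical and stands in for the SVM accuracy used as fitness; the one-point crossover and bit-flip mutation operators are common GA choices assumed here, since the operators actually used are not described in the text.

```python
import random


def select_features(sample, chromosome):
    """Keep only the features whose chromosome bit is 1."""
    return [value for value, bit in zip(sample, chromosome) if bit == 1]


def fitness(chromosome, samples, labels, accuracy_fn):
    """Fitness of a chromosome: classifier accuracy on the selected features.

    accuracy_fn(reduced_samples, labels) is a hypothetical callback, for
    example a cross-validated SVM accuracy.
    """
    if not any(chromosome):
        return 0.0  # an empty feature set cannot classify anything
    reduced = [select_features(s, chromosome) for s in samples]
    return accuracy_fn(reduced, labels)


def one_point_crossover(a, b, rng):
    """Swap the tails of two parent chromosomes at a random cut point."""
    cut = rng.randrange(1, len(a))
    return a[:cut] + b[cut:], b[:cut] + a[cut:]


def bit_flip(chromosome, rate, rng):
    """Flip each bit independently with probability rate."""
    return [1 - bit if rng.random() < rate else bit for bit in chromosome]


if __name__ == "__main__":
    rng = random.Random(1)
    parent_a, parent_b = [1, 0, 0, 1], [0, 1, 1, 0]
    print(one_point_crossover(parent_a, parent_b, rng))
    print(select_features([0.1, 0.2, 0.3, 0.4], parent_a))
```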

kNN (k-nearest neighbor) classifier

kNN^42,43 is a simple yet well-known classification approach that uses information from the k closest data points for classification. The algorithm uses a distance measure, such as the Euclidean distance, Manhattan distance or Canberra distance, to compare the distance or similarity between two data points. The data point to be classified is assigned the label held by the majority of its k nearest neighbors. The kNN approach has been applied in many fields, such as big data⁴² and biology⁴³.
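A minimal Python sketch of this voting rule, with the three distance measures mentioned above, is given below. The function names and the toy data are hypothetical; the default k = 10 simply mirrors the value reported for schema (4).

```python
from collections import Counter


def euclidean(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v)) ** 0.5


def manhattan(u, v):
    return sum(abs(a - b) for a, b in zip(u, v))


def canberra(u, v):
    """Sum of |a - b| / (|a| + |b|), skipping terms where both values are 0."""
    return sum(abs(a - b) / (abs(a) + abs(b))
               for a, b in zip(u, v) if abs(a) + abs(b) > 0)


def knn_predict(query, samples, labels, k=10, distance=euclidean):
    """Label of the majority among the k training samples nearest to query."""
    ranked = sorted(zip(samples, labels), key=lambda pair: distance(query, pair[0]))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    # Made-up 2-D feature vectors: class 0 near the origin, class 1 near (5, 5).
    samples = [(0, 0), (0, 1), (1, 0), (5, 5), (6, 5), (5, 6)]
    labels = [0, 0, 0, 1, 1, 1]
    print(knn_predict((0.5, 0.5), samples, labels, k=3))
    print(knn_predict((5.0, 5.5), samples, labels, k=3, distance=manhattan))
```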

Additional Information

How to cite this article: Wang, L. et al. Comparative analysis of image classification methods for automatic diagnosis of ophthalmic images. Sci. Rep. 7, 41545; doi: 10.1038/srep41545 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Acharya, U. R., Dua, S., Du, X., Vinitha Sree, S. & Chua, C. K. Automated diagnosis of glaucoma using texture and higher order spectra features. Information Technology in Biomedicine, IEEE Transactions on 15, 449–455 (2011).
Gao, X., Lin, S. & Wong, T. Y. Automatic feature learning to grade nuclear cataracts based on deep learning. Biomedical Engineering, IEEE Transactions on 62, 2693–2701 (2015).
Gao, X. et al. Automatic grading of cortical and PSC cataracts using retroillumination lens images in Computer Vision–ACCV 2012, 256–267 (Springer, 2012).
Guo, L., Yang, J.-J., Peng, L., Li, J. & Liang, Q. A computer-aided healthcare system for cataract classification and grading based on fundus image analysis. Computers in Industry 69, 72–80 (2015).
Hijazi, M. H. A., Coenen, F. & Zheng, Y. Data mining techniques for the screening of age-related macular degeneration. Knowledge-Based Systems 29, 83–92 (2012).
Huang, W. et al. A computer assisted method for nuclear cataract grading from slit-lamp images using ranking. Medical Imaging, IEEE Transactions on 30, 94–107 (2011).
Mookiah, M. R. K. et al. Local configuration pattern features for age-related macular degeneration characterization and classification. Computers in biology and medicine 63, 208–218 (2015).
Mookiah, M. R. K. et al. Automated detection of age-related macular degeneration using empirical mode decomposition. Knowledge-Based Systems 89, 654–668 (2015).
Mookiah, M. R. K. et al. Automated diagnosis of age-related macular degeneration using greyscale features from digital fundus images. Computers in biology and medicine 53, 55–64 (2014).
Mookiah, M. R. K., Acharya, U. R., Lim, C. M., Petznick, A. & Suri, J. S. Data mining technique for automated diagnosis of glaucoma using higher order spectra and wavelet energy features. Knowledge-Based Systems 33, 73–82 (2012).
Raja, C. & Gangatharan, N. A Hybrid Swarm Algorithm for optimizing glaucoma diagnosis. Computers in biology and medicine 63, 196–207 (2015).
Singh, A., Dutta, M. K., ParthaSarathi, M., Uher, V. & Burget, R. Image processing based automatic diagnosis of glaucoma using wavelet features of segmented optic disc from fundus image. Computer methods and programs in biomedicine 124, 108–120 (2015).
Xu, Y. et al. Automatic grading of nuclear cataracts from slit-lamp lens images using group sparsity regression in Medical Image Computing and Computer-Assisted Intervention–MICCAI 2013, 468–475 (Springer, 2013).
Zhang, Z. et al. A survey on computer aided diagnosis for ocular diseases. Bmc Medical Informatics & Decision Making 14, 169–176 (2014).
Cao, J., Zhang, K., Luo, M., Yin, C. & Lai, X. Extreme learning machine and adaptive sparse representation for image classification. Neural Networks 81, 91–102 (2016).
Lin, H., Long, E., Chen, W. & Liu, Y. Documenting rare disease data in China. Science 349 6252 (2015).
Yuan, Y.-F., Liang, X.-M., Chen, Y.-X. & Chen, L.-F. Image sparse decomposition algorithm based on multi-population discrete differential evolution. Pattern Recognition and Artificial Intelligence 27, 900–906 (2014).
Chang, H. Learning a Structure Adaptive Dictionary for Sparse Representation based Classification. Neurocomputing 190, 124–131 (2015).
Shih, H.-C. & Liu, E.-R. New quartile-based region merging algorithm for unsupervised image segmentation using color-alone feature. Information Sciences 342, 24–36 (2016).
Shrivastava, V. K., Londhe, N. D., Sonawane, R. S. & Suri, J. S. Exploring the color feature power for psoriasis risk stratification and classification: A data mining paradigm. Computers in biology and medicine 65, 54–68 (2015).
Rastghalam, R. & Pourghassem, H. Breast cancer detection using MRF-based probable texture feature and decision-level fusion-based classification using HMM on thermography images. Pattern Recognition 51, 176–186 (2015).
Shrivastava, V. K., Londhe, N. D., Sonawane, R. S. & Suri, J. S. Computer-aided diagnosis of psoriasis skin images with HOS, texture and color features: A first comparative study of its kind. Computer Methods & Programs in Biomedicine 25, 161–183 (2016).
Yonghua, X. & Jincong, W. Study on the identification of the wood surface defects based on texture features. Journal of the American Chemical Society 97, 1218–1224 (2015).
Nanni, L. & Melucci, M. Combination of projectors, standard texture descriptors and bag of features for classifying images. Neurocomputing 173, 1602–1614 (2015).
Haralick, R. M., Shanmugam, K. & Dinstein, I. H. Textural features for image classification. Systems, Man and Cybernetics, IEEE Transactions on 3, 610–621 (1973).
Hong, J.-G. Texture analysis with gray gradient co-occurrence matrix. Acta Automatica Sinica 10, 22–25 (1984).
Nayak, D. R., Dash, R. & Majhi, B. Brain MR image classification using two-dimensional discrete wavelet transform and AdaBoost with random forests. Neurocomputing 177, 188–197 (2015).
Zhang, L., Chen, J. & Qiu, B. Region of interest extraction in remote sensing images by saliency analysis with the normal directional lifting wavelet transform. Neurocomputing 179, 186–201 (2016).
Hadid, A., Ylioinas, J., Bengherabi, M., Ghahramani, M. & Taleb-Ahmed, A. Gender and texture classification: A comparative analysis using 13 variants of local binary patterns. Pattern Recognition Letters 68, 231–238 (2015).
Tang, Z. et al. A local binary pattern based texture descriptors for classification of tea leaves. Neurocomputing 168, 1011–1023 (2015).
Yang, F., Xia, G.-S., Liu, G., Zhang, L. & Huang, X. Dynamic texture recognition by aggregating spatial and temporal features via ensemble SVMs. Neurocomputing 173, 1310–1321, doi: 10.1016/j.neucom.2015.09.004 (2016).
Uřičář, M., Franc, V., Thomas, D., Sugimoto, A. & Hlaváč, V. Multi-view facial landmark detector learned by the Structured Output SVM. Image and Vision Computing, doi: 10.1016/j.imavis.2016.02.004 (2016).
Tahir, M. & Khan, A. Protein subcellular localization of fluorescence microscopy images: Employing new statistical and Texton based image features and SVM based ensemble classification. Information Sciences 345, 65–80, doi: 10.1016/j.ins.2016.01.064 (2016).
Leng, Y. et al. Employing unlabeled data to improve the classification performance of SVM, and its application in audio event classification. Knowledge-Based Systems 98, 117–129, doi: 10.1016/j.knosys.2016.01.029 (2016).
Chang, C.-C. & Lin, C.-J. LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2, 389–396 (2011).
Liu, Y., Yu, Z., Zeng, M. & Zhang, Y. LLE for submersible plunger pump fault diagnosis via joint wavelet and SVD approach. Neurocomputing 185, 202–211 (2016).
Wright, J., Yang, A. Y., Ganesh, A., Sastry, S. S. & Ma, Y. Robust face recognition via sparse representation. Pattern Analysis and Machine Intelligence, IEEE Transactions on 31, 210–227 (2009).
Liu, C., Chang, F., Chen, Z. & Liu, D. Fast Traffic Sign Recognition via High-Contrast Region Extraction and Extended Sparse Representation. IEEE Transactions on Intelligent Transportation Systems 17, 79–92 (2016).
Tian, G., Ren, Y. & Zhou, M. Dual-Objective Scheduling of Rescue Vehicles to Distinguish Forest Fires via Differential Evolution and Particle Swarm Optimization Combined Algorithm. IEEE Transactions on Intelligent Transportation Systems 82, 1–13 (2016).
Choi, K. et al. Hybrid Algorithm Combing Genetic Algorithm With Evolution Strategy for Antenna Design. IEEE Transactions on Magnetics 52, 1–4 (2016).
Park, Y.-B., Yoo, J.-S. & Park, H.-S. A genetic algorithm for the vendor-managed inventory routing problem with lost sales. Expert Systems With Applications 53, 149–159 (2016).
Deng, Z., Zhu, X., Cheng, D., Zong, M. & Zhang, S. Efficient kNN classification algorithm for big data. Neurocomputing 195, 143–148 (2016).
Shen, L. et al. A novel local manifold-ranking based K-NN for modeling the regression between bioactivity and molecular descriptors. Chemometrics and Intelligent Laboratory Systems 151, 71–77 (2016).

Acknowledgements

This study was funded by the NSFC (91546101 and 11401454); the Guangdong Provincial Natural Science Foundation (YQ2015006, 2014A030306030, 2014TQ01R573, 2013B020400003); the New Star of Pearl River Science and Technology of Guangzhou City (2014J2200060); the State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University (2015ykzd11, 2015QN01); the Special Program for Applied Research on Super Computation of the NSFC-Guangdong Joint Fund (the second phase); the Clinical Research and Translational Medical Center for Pediatric Cataract in Guangzhou City; the Fundamental Research Funds for the Central Universities (BDZ011401 and JB151005); and the Novel Technology Research of Universities Cooperation Project, the State Key Laboratory of Satellite Navigation System and Equipment Technology (KX152600027). The principal investigator of this study (Haotian Lin) is currently supported by the Pearl River Scholar Program of Guangdong Province.

Author information

Liming Wang, Kai Zhang and Xiyang Liu: These authors contributed equally to this work.

Authors and Affiliations

Institute of Software Engineering, Xidian University, Xi’an, 710071, China
Liming Wang
School of Computer Science and Technology, Xidian University, Xi’an, 710071, China
Kai Zhang, Xiyang Liu, Jiewei Jiang, Yingying An & Jia Zhang
School of Software, Xidian University, Xi’an, 710071, China
Xiyang Liu
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, 510060, China
Erping Long, Zhenzhen Liu, Zhuoling Lin, Xiaoyan Li, Jingjing Chen, Qianzhong Cao, Jing Li, Xiaohang Wu, Dongni Wang, Wangting Li & Haotian Lin

Contributions

H.T.L. and X.Y.L. designed the research; L.M.W. and K.Z. conducted the study; L.E.P., Y.Y.A. and Z.Z.L. collected the dataset; K.Z., J.W.J. and J.Z. were responsible for coding; Z.L.L., X.H.W. and D.N.W. analyzed and finalized the experimental results; K.Z., J.W.J., X.Y.L., J.J.C. and Q.Z.C. co-wrote the manuscript; all authors discussed the results and commented on the manuscript.

Corresponding authors

Correspondence to Xiyang Liu or Haotian Lin.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

About this article

Received: 12 July 2016
Accepted: 22 December 2016
Published: 31 January 2017
DOI: https://doi.org/10.1038/srep41545
