Introduction

Glaucoma is an irreversible optic neuropathy resulting from retinal ganglion cell death that leads to progressive visual field (VF) loss. Glaucomatous damage is characterized by structural changes of the optic nerve head (ONH) and retinal nerve fiber layer (RNFL), functional changes of the VF, and ultimate loss of central visual acuity. Structural changes may precede the development of the functional VF defects.1, 2 Early detection of these changes around the ONH is therefore essential in diagnosing and monitoring glaucoma. In glaucoma clinics, optical coherence tomography (OCT) and ONH photography (with red-free RNFL photography) have been commonly used to detect structural deterioration associated with glaucoma.

New and more sensitive imaging devices to detect structural progression have been recently developed. Guided progression analysis (GPA) using Cirrus OCT (Carl Zeiss Meditec) is an example of such a device that detects structural changes of RNFL using statistical methods.3, 4 As technology advances, more advanced OCT devices have been developed and are rapidly replacing previous models. For glaucoma progression analysis, it is difficult to compare OCT data directly if we compare the data from different OCT machines.5, 6 Therefore, careful analyses of sequential optic disc photographs are still useful to determine long-term changes of ONH and RNFL defects in glaucoma. To improve the detection ability of early ONH structural changes, a novel method using alternation flicker chronoscopy has been introduced. Alternating flickers can be achieved by manually aligning and alternating images of fundus photographs taken sequentially with overlapping slide projectors or by manual registration of digital images.7 This technique has been reported to be effective and comparable to photographic review in several studies.8, 9, 10 Automated flicker has recently become available, rendering it possible to make automated flicker images using Matched-Flicker 8 software (ver. 1.2; EyeIC, Narberth, PA, USA). This approach has an excellent ability to detect ONH contour changes, parapapillary atrophy, and disc hemorrhages.7, 11, 12

We have developed a new algorithm, by collaborating with the engineering department, to make automated flickering images of ONH and RNFL photography for clinical purposes. To make matched alternation flicker images using two fundus photographs taken at different time points using different camera angles, each image needs to be processed to adjust the size and rotation angle. In addition, detection of changes in RNFL defects can be a problem if two fundus images have different brightnesses. An ideal algorithm should detect changes that are not associated with the image-capturing procedure, but with glaucoma progression itself. In the following report, we investigated the reproducibility of our algorithm for automatically matched alternation flicker (AMAF), and compared its ability to detect glaucoma progression with the conventional method using serial ONH and RNFL photographs.

Materials and methods

Selection of patients

In this longitudinal and retrospective study, the medical records of patients who visited the glaucoma clinic from January 2010 to December 2012 at the Department of Ophthalmology, Ajou University Hospital, for glaucoma evaluation, were reviewed. The study adhered to the Declaration of Helsinki and was approved by the Institutional Review Board of Ajou University Hospital (AJIRB-MED-MDB-13-137).

Primary open-angle glaucoma (POAG) patients with baseline mean deviation (MD) values better than −6 dB were identified from the medical records. Baseline and last fundus photography examinations had an interval of ≥1 year. All patients received at least five VF tests for GPA using the Humphrey Field Analyzer (HFA; Carl Zeiss Meditec, Dublin, CA, USA). All patients underwent a comprehensive ophthalmic examination, including a medical history review, best-corrected visual acuity, spherical equivalent of the refractive error, slit lamp biomicroscopy, central corneal thickness (DGH-500; DGH Technology, Exton, PA, USA), intraocular pressure (IOP) measurement using Goldman applanation tonometry, gonioscopy, dilated fundus examination, and automated perimetry (HFA) using the Swedish interactive thresholding algorithm with standard 24–2 strategies. Stereo optic-disc photography (ODP; AFC-210; NIDEK, Aichi, Japan) and time domain OCT (Stratus OCT; Carl Zeiss Meditec) were performed on the same day. VF tests, and ODP and OCT examinations, were performed within 3 months of other ocular examinations. All the patients in this study had an IOP<22 mm Hg at every visit with or without medication, and had gonioscopic open-angle glaucoma without secondary causes. The baseline VF tests showed at least two reproducible VF defects compatible with glaucoma according to Anderson’s criteria for minimal abnormality in glaucoma.13 RNFL defects, corresponding with VF defects, were noted in red-free RNFL photography and/or OCT.

Creating the application

This application was designed to help ophthalmologists detect differences between two fundus images taken at two different time points, by alternately exposing the images. Owing to the time gap between the two images, there are usually changes in scale, rotation, translation, brightness, and noise. A well-known computer vision technique, Speeded Up Robust Features (SURF),14 was used to better match corresponding features of the two images. SURF facilitated feature correspondence that was scale- and rotation-invariant, and also accounted for noise and photometric deformations. Using the feature correspondence, a transformation matrix was used to deform one image to another by solving a linear least squares optimization problem. The feature points in one image therefore had almost the same positions relative to their corresponding feature points as the other deformed image. Attempts to match all the parts in an image sometimes caused unwanted distortion of the image, especially around the ONH. To increase the degree of match around the ONH, we therefore added a function that allowed the examiners to specify an area as a reference for image fusion (Figure 1). After this feature matching, the brightness levels of the two matched images were adjusted as closely as possible. This application provided users with controls for the scale, rotation invariants, and exposure frequency. The application was implemented using the C++ programming language, OpenCV, and Fast Light Toolkit libraries using the Windows 7 operating system (Microsoft Corp., Redmond, WA, USA).

Figure 1
figure 1

(a) The appearance of the application. (b) When examiner check ‘User mask’ to ‘Yes’ on flicker applications, examiner can mark area that examiners want to make it a reference area for image fusion. (c) Result of matching point after examiner marks area that examiners want to make it a reference area.

We conducted a separate study in a different set of subjects (66 eyes of 66 patients), prior to this study to evaluate agreements among three examiners who identified glaucomatous structural changes using AMAF method. In that study, intraclass correlation (ICC) coefficient were from 0.777 to 0.850 for inter-reader agreement and from 0.853 to 0.988 for intra-reader agreement.15

Making flicker images and the determination of progression

Two original fundus photographs taken at two different time points for each patient were extracted from the fundus photography database. Using our flicker application for AMAF, the first and second images were selected and three hertz flicker images were automatically created with contrast adjustments. If the examiner determined that the generated flicker images were sufficient to determine fundus changes, the glaucoma progression was evaluated by a single glaucoma specialist (ML) using a 3-point grading scale. Grade 3 was defined as definite glaucomatous change in the fundus images, and grade 2 was defined as a fundus change with suspected glaucoma. We only used the 3-point grading scale to evaluate glaucoma progression. If there was no detectable change in the flicker image, it was designated as grade 1. To compare the progression-detecting ability of AMAF with the parallel comparisons, glaucoma progression was evaluated in terms of four different ONH and RNFL changes, involving changes in the neuroretinal rim of the optic disc, in the path of the vessel around the optic disc, and in the widening and deepening of the RNFL defect (defined as a loss or decrease in RNFL striation leading to darkening of the previous defects).

Conventional side-by-side comparisons of fundus photographs were performed by the same examiner (ML) in a blinded manner. Another operator prepared two fundus photographs to make an automated flicker image with randomly assigned serial numbers. The inspector (ML) evaluated it in the same manner using the 3-point grading scale.

VF progression was determined independently after evaluating glaucoma progression using AMAF and side-by-side comparisons in a blinded manner. VF progression was defined by GPA (integrated in the HFA) and by the collaborative initial glaucoma treatment study (CIGTS) criteria. A ‘likely progression’ using the GPA software was defined as progression of GPA criterion. An increase in CIGTS score of three or more compared with an average of two baseline VF tests was defined as progression of CIGTS criterion.16 A VF test conducted on the same day or within 3 months from the first fundus photography was used as a baseline VF. A VF test conducted at the closest time point (within 3 months) of the final fundus photography was used to determine glaucoma progression.

For progressing and stable eyes assessed by both AMAF and conventional ODP analyses, changes of average RNFL thickness using Stratus OCT were compared between the two methods.

Statistical analysis

Descriptive results were expressed as means±SD, and P-values<0.05 were considered statistically significant. The kappa statistic was used to quantify and evaluate the agreement between evaluations of glaucoma progression determined by different methods. A kappa value of 0.1–0.2 was considered as slight agreement, 0.21–0.40 as moderate agreement, 0.41–0.60 as fair agreement, 0.61–0.80 as substantial agreement, and 0.81~0.99 as almost perfect agreement.17 SPSS (ver. 20.0; SPSS Inc., Chicago, IL, USA) and MedCalc software (ver. 12.0; Mariakerke, Belgium) were used for analyses.

Results

In total, 103 POAG patients were enrolled after screening 210 patients and excluding patients who did not meet the inclusion criteria. The mean age of the patients was 52.8±9.9 years (range: 19–79 years), and 54 patients (52.4%) were male. The mean interval between the first baseline fundus photography and the final examination was 28.7±9.0 months. The MD values decreased significantly from −0.93±2.14 dB to −0.62±2.93 dB, and both pattern SD values and CIGTS scores significantly increased. RNFL thicknesses also decreased significantly in all quadrants, including the average value (Table 1).

Table 1 Descriptive statistics of enrolled subjects (means±SD)

Kappa coefficients showed almost perfect intraobserver agreement (κ=0.863, 95% CI=0.806–0.919, P<0.001) using AMAF analysis and substantial agreement (κ=0.797, 95% CI=0.716–0.878, P<0.001) using conventional side-by-side comparisons of ODP.

We compared two methods for assessing structural progression based on the agreement with two progression criteria of functional tests using kappa statistics. The AMAF showed moderate agreement with both GPA (κ=0.337) and CIGTS (κ=0.241) progression criteria, whereas the conventional comparison method showed no significant agreement with two criteria for functional progression (Table 2). The AMAF and conventional comparison showed a fair agreement with each other (κ=0.468, P<0.001).

Table 2 Agreement between image analysis methods and VF criteria to detect glaucoma progression

The AMAF designated 47 of 103 eyes with glaucoma progression, whereas 21 eyes were designated with glaucoma progression using the conventional comparison method. The two methods showed a substantial agreement in detecting widening of RNFL defects (κ=0.785; P<0.001). RNFL defect depth changes (κ=0.354; P<0.001) and changes of the neuroretinal rim (κ=0.241; P=0.001) showed moderate agreement. Detection of glaucoma progression using a change in path of the vessel around the ONH showed slight agreement (κ=0.109; P=0.015). AMAF detected changes of the vessel around the optic disc in 15 eyes, but only a single eye was detected with a vessel change using the conventional comparison method (Table 3).

Table 3 Agreement between matched flicker and conventional comparison method to detect changes of fundus photography

The rate of average RNFL thickness decrease of progressing eyes (−3.49±2.86 μm per year) was significantly different from that of stable eyes (−1.83±3.22 μm per year; P=0.007) when progression was determined using AMAF. However, there was no significant difference between progressing and stable eyes using the conventional method (−3.24±2.42 vs −2.42±3.33 μm per year; P=0.290). The average RNFL thickness of progressing eyes decreased significantly more than that of stable eyes when the GPA criterion was used (−3.81±3.20 vs −2.87±3.86 μm per year; P=0.008), whereas the rate of RNFL decrease between progressing and stable eyes was not significantly different when the CIGTS criterion was used (−3.63±2.24 vs −2.34±3.31 μm per year; P=0.101). When progression was determined by GPA combined with AMAF or the conventional method, the average RNFL thickness of progressing eyes decreased significantly more than that of stable eyes using both the AMAF+GPA criteria (−3.52±3.02 vs −1.47±2.99 μm per year; P=0.001) and the conventional comparison + GPA criteria. (−3.56±2.98 vs −1.86±3.12 μm per year; P=0.006; Table 4).

Table 4 Comparison of retinal nerve fiber layer decay between progressing and stable eyes according to the progression criteria

Discussion

Previous studies have reported that alteration flicker chronoscopy is more advantageous in detecting changes of the optic disc and peripapillary structures than conventional side-by-side comparison of fundus photographs.7, 9, 18, 19 Automated matched flicker was developed using evolving imaging technology. It can better match the flicker images using simple operations, and has been reported to have good inter-reader and intra-reader agreements.10 It has also been reported that flicker chronoscopy was better than the conventional side-by-side comparison method for evaluating the progression of glaucoma, and was better at detecting changes in the contour of the ONH, detecting increases in the area of the peripapillary atrophy, and detecting the appearance and disappearance of disc hemorrhages.7, 11, 12, 20

Using AMAF to detect structural changes during glaucoma progression, we focused on changes in the shape of the neuroretinal rim of the ONH, changes of the vessel contour around the optic disc, widening of the RNFL defects, and deepening (defined as loss or decrease in RNFL striation leading to darkening of the previous defects even after adjusting brightness and contrast of images compared) of RNFL defects. Development of disc hemorrhages and peripapillary atrophy changes were not included in the progression criteria if there were no other associated changes. To enhance the ability to detect glaucomatous changes, we have made an application to adjust one image to another based on the optic disc margin and major vessels and each points around the optic nerve head were automatically matched to minimize image distortion around optic nerve even though it might increase the distortion of far periphery of the fundus image. Although the matching process is automated using AMAF, an examiner can directly specify the desired matching area using stable structural markers, including the optic disc margin and major blood vessels. The AMAF application can automatically adjust the brightness of the two fundus photographs to increase the brightness of the darker image before performing the automated matched flicker. And with our application examiners can increase or decrease the sensitivity of the founded intersection points easily and can adjust matched flicker images such that the desired degree of fusion. In addition, when we look at the flicker pictures continuously, it is difficult to know whether any photos are firstly recorded. To overcome this disadvantage AMAF automatically marks sequence of recorded images. And to improve the convenience of reading, frequency of flickering of images also can be adjusted easily before and after making automated flicker. These new features of AMAF improved the agreement between structural and functional progression criteria in our study (Table 2; Supplementary Video).

The agreement between structural and functional progression criteria and the ability to detect glaucoma progression are known to be influenced by both detection methods and the severity of glaucoma.21, 22 In a recent observational cohort study, progression of glaucoma was detected by stereo photographs, GPA of the HFA, and GPA using the Cirrus spectral domain OCT in 6.9%, 15%, and 25.6% of eyes, respectively. In this study, normal, suspected glaucoma, and glaucoma patients were included in 246 eyes of 148 patients.23 The detection rate of stereo photography was lower than that of the VF test in those patients with early stages of glaucoma. A higher detection rate of glaucoma progression was reported using flicker chronoscopy in another study, and 50 (48.5%) of 103 eyes showed at least one sign of structural progression, although the severity of glaucoma was not reported.24 In our study, the conventional comparison method, AMAF, and GPA of the VF test could detect glaucoma progression in 20.4%, 45.6%, and 31.1% of the eyes with a mild baseline VF defect (MD>−6 dB), respectively. Consistent with a previous study,23 we also found that stereo photographs detected fewer eyes with glaucoma progression than GPA of the VF test in the early stages of glaucoma, but AMAF could detect progression in more eyes than GPA. These reports are comparable with previous studies using flicker chronoscopy.24 In a previous study,23 the agreement between stereo photographs and GPA of the VF test was moderate (κ=0.21), as was the agreement between the AMAF (κ=0.337) and the conventional method (κ=0.124) in our study (Table 2). Compared with the conventional comparison method, AMAF improved agreement with the functional test and performance. It identified structural changes caused by glaucoma progression because it probably detected subtle changes of the RNFL, neuroretinal rim, and blood vessel contours (Table 3), and produced more reproducible results.

Structural progression was determined by a single grader in our study, although each image for grading was prepared by another examiner in a masked manner. To overcome any discrepancies, we compared the RNFL thickness decay of progressing eyes with that of stable eyes in each progression criteria, because the rate of structural progression should be faster in progressing eyes than in stable ones if each criterion identified the true progression caused by glaucoma (Table 4). On the basis of the limited number of OCT examinations, the RNFL thickness decay was not calculated from linear regression analysis, but from the difference between the first and the last examinations during the follow-up periods. However, the rate of average RNFL thickness decrease in our study was comparable to a previous report using the same OCT (between −2.22 and −7.60 μm per year for the Stratus OCT; Leung et al25). In our series, AMAF and GPA showed greater RNFL thickness decreases in progressing eyes than stable eyes, whereas the conventional comparison method and the CIGTS did not. It is possible that the flicker technology and GPA criteria outperformed the conventional comparison and CIGTS methods in detecting glaucoma progression in patients with mild baseline VF defects. Both progressing and stable eyes, by all criteria, showed significant RNFL thickness decreases from the baseline results in our study. Liu et al26 reported that RNFL thicknesses decreased significantly even in eyes showing no progression, when estimated by conventional methods using both GPA of the VF test and stereo disc photography. Although the rate of RNFL decay of the study of Liu et al26 (between −0.71±0.09 and −1.0±0.20 μm per year) was less than that of our results (−1.83±3.22 and −2.87±3.86 μm per year in stable eyes using AMAF and GPA criteria, respectively), the use of different devices (Stratus vs Cirrus) and decay-calculating algorithms may have influenced the results. The rate of RNFL thickness decrease as reported using Cirrus OCT may be slower than that using Stratus OCT in the same patients.25 When GPA criterion was combined with the conventional comparison method, RNFL thickness decreases between progressing and stable eyes were significantly different (Table 4). On the basis of these results, conventional stereo photographs combined with GPA can improve the ability to assess glaucoma progression when flicker technology is not available.

Our study had some limitations. First, it used a retrospective design and a limited number of OCT examinations. The number of examinations was therefore insufficient for regression analysis to detect changes of RNFL thicknesses. Futhermorer, the optic disc photographs were interpreted by a single grader, although a masked method was used.

In conclusion, optic disc photography using a flicker chronoscopic method, AMAF, can detect more glaucoma progression than GPA of the VF in patients with a mild baseline VF defect. AMAF was better than the side-by-side comparison method in discriminating glaucomatous changes, and showed moderate agreement with functional progression using the VF test.