Introduction

Glaucoma is a major cause of vision impairment characterized by structural changes of the optic nerve head (ONH) and retinal nerve fiber layer (RNFL). These changes lead to visual field loss and eventually blindness.1, 2

The primary goal of glaucoma treatment is to prevent further damage to these structures and to minimize the decline in eye function. Assessment of glaucomatous changes is dependent of the examination method and demands a high level of examiner experience. The introduction of imaging devices may decrease the reliance on the subjective interpretation of the observer. The performance of these tests, however, should be compared with stereophotographic evaluation of ONH and RNFL by trained observers, as recommended by recent clinical trials, such as the Advanced Glaucoma Intervention Study.3

Major scientific societies have presented recommendations for glaucoma management, for example, the European Glaucoma Society Guidelines, the American Academy of Ophthalmology Preferred Practice Patterns for Open-Angle Glaucoma, and the Finnish Evidence-Based Guideline for Open-Angle Glaucoma.4, 5, 6 The latter guideline is on the basis of evaluation of the ONH and RNFL and the recommendations for action are on the basis of very high quality studies. These guidelines are aimed at providing a uniform diagnosis and treatment of glaucoma by applying evidence-based knowledge to central issues. Therefore, the use of the Finnish Guideline may help to produce effective individual treatment plans for glaucoma patients (Table 1).

Table 1 Extracted from the Finnish Evidence-Based Guideline for Open-Angle Glaucoma (Tuulonen et al6)

Application of the Finnish Evidence-Based Guideline requires evaluation of the ONH and RNFL, as well as evaluation of the function. Patient classification depends on the tests used for these evaluations. In clinical practice, ONH and RNFL evaluations are carried out using funduscopy or ONH stereophotography and red-free fundus photography. In both cases, subjective evaluation of the images may induce an error in application of the Finnish guideline. The Finnish Evidence-Based Guideline may be applied to findings of the ONH and RNFL obtained using imaging devices; the use of these devices decreases the need for subjective input and may therefore improve the accuracy of the Finnish guideline.

In this study, we compared the performance of Heidelberg retina tomograph (HRT) and scanning laser polarimetry (GDx) with photography for evaluating the ONH and RNFL for application of the Finnish Evidence-Based Guideline for Open-Angle Glaucoma.

Materials and methods

Participants

All participants were prospectively enrolled in this study at the Department of Ophthalmology of the Miguel Servet University Hospital (Zaragoza, Spain). Only one eye from each patient was randomly selected for the study.

All participants met the following criteria: best corrected visual acuity of at least 20/30; refractive errors of <5 dioptres spherical and 3.00 dioptres of astigmatism; open anterior chamber angle; and transparent ocular media. Written informed consent was obtained from all the participants and the study followed the tenets of the Declaration of Helsinki principles. Exclusion criteria included ocular trauma, surgery, or disease (other than glaucoma); systemic disease with ophthalmic involvement; or inability to perform any procedure of the study.

Procedures

All participants underwent a full ophthalmic examination, including evaluation by slit-lamp biomicroscopy, measurement of intraocular pressure (IOP), central corneal ultrasonic pachymetry (DGH Technology, model DGH 500), ONH stereophotography, standard automated perimetry (SAP), HRT3, laser polarimetry with variable corneal compensation (GDx-VCC), and optical coherence tomography (OCT). Short wavelength-automated perimetry (SWAP) was also evaluated in patients classified into certain groups, following the recommendations of the Finnish guideline.

Study groups

Application of the Finnish Evidence-Based Guideline classifies the participants into several groups, such as participants with normal findings, and several groups with suspected glaucoma or glaucoma.

Similar to other studies,7 the participants were not selected by applying the gold-standard glaucoma test. The gold-standard glaucoma test for diagnosis is widely accepted as providing an accurate diagnosis and can therefore be used as the standard against which other diagnostic strategies are judged. As glaucoma cannot be defined outside the context of structure and function, and both structure and function were being evaluated in this study, the use of a structural or functional gold standard could induce a bias. Nevertheless, sufficient representation of normal participants and glaucoma patients and suspects was ensured by selecting patients from two different sources:

Source 1 (Control group): Hospital staff or participants with refractive problems (eg, near-sightedness or low refractive problems) or patients with problems other than glaucoma that would not interfere with the study (eg, patients seen in the emergency room with irrelevant ocular symptoms). All these participants met the inclusion criteria and an additional inclusion criterion of IOP <21 mm Hg.

Source 2 (Study group): Participants referred to us by other ophthalmologists for glaucoma evaluation. Our department is the referral hospital for several ophthalmic outpatient facilities and hospitals that do not have ancillary tests available for glaucoma other than SAP.

Application of the Finnish Evidence-Based Guidelines for Open-Angle Glaucoma in participants from the Control group (Source 1) and the Study group (Source 2)

The Finnish Evidence-Based Guideline classifies a given participants into one of the eight diagnostic categories (Table 1). In this study, we applied the Finnish guideline on the basis of standard photography (see Application A) or findings from imaging devices (see Application B).

A. Clinical application of the Finnish guideline on the basis of stereophotography

A1. SAP. Perimetries were carried out with a Humphrey Field analyzer 750, using the 24-2 Swedish interactive threshold algorithm standard strategy. Distance correction was added to participant refraction. If fixation losses were >20%, or false-positive or false-negative rates were >33% (rates of reliability fixed by the perimeter software), the test was repeated. The second reliable perimetry was used for this study to minimize the learning effect.8 Consistency of the defects was required; otherwise the perimetries were repeated.

Visual field defects were defined by the deficits that produced a cluster of three or more points that were lower than a 5% probability level or a cluster of two or more points that were lower than a 1% probability level,9 and/or pattern standard deviation (PSD) with a 5% probability level or lower, and/or glaucoma hemifield test outside normal limits.

A2. Expert evaluation of ONH stereophotographic sets. Glaucomatous defects were defined by the presence of diffuse or focal defects in the neuroretinal rim or optic disc hemorrhages. The stereophotographic sets were evaluated by two independent, blinded observers (LP, JML). In case of disagreement, both observers re-evaluated the diagnosis together and reached a consensus.

A3. Expert evaluation of RNFL red-free fundus photography. RNFL photographs were evaluated by two experts on a different day than the optic disc photographs. RNFL defects were defined by the presence of focal or diffuse loss of retinal nerve fiber bundles.

B. Application of the Finnish guideline on the basis of imaging devices:

B1. SAP. Evaluation of the visual field was not a study variable. The same criteria as mentioned above were applied.

B2. ONH evaluation by HRT (parameter glaucoma probability score (GPS)). The HRT3 was carried out in a standardized manner by one examiner. To avoid operator variability in delineating the optic disc rim, GPS was the parameter used in this study. The abnormality criterion was GPS >0.51, as proposed in earlier reports.10 The optic disc contour was also delineated, however, to obtain data about rim and cup parameters for comparison among groups.

B3. RNFL evaluation by GDx (parameter NFI). The evaluation was carried out with a laser polarimeter GDx-VCC. The diagnostic criterion used for the study was NFI >30, as suggested by the manufacturer and earlier studies.7

Statistical analysis

All statistical analyses were calculated using SPSS (version 15.0; SPSS Inc., Chicago, IL, USA) statistical software.

The Kolmogorov Smirnov test was used to check for a normal distribution of the data. Then, differences between both groups were tested using the Student́s t-test.

Results

A total of 41 control participants (Source 1) and 312 patients (Source 2) were included in the study. The descriptive data of the participants are shown in Table 2.

Table 2 Clinical data of participants included in the study and differences between both sources of participants

Diagnostic performance of criteria in the control group (Source 1)

Although we did not use a gold standard for glaucoma in our study, we expected that in control participants (Source 1) the number of abnormal findings would fall below the expected prevalence of the disease in the general population, particularly in a group of participants with a normal ophthalmic evaluation. As expected, most of the control participants showed no abnormal findings in the tests or after the application of the Finnish Evidence-Based Guideline the percentage of control participants defined as normal by the guideline (specificity) was high (Table 3).

Table 3 Comparative classification of control participants (Source 1) after the application of the guideline with photographs vs imaging devices (HRT and GDx)

Diagnostic performance of the Finnish Evidence-Based Guideline for open-angle glaucoma in the study group (Source 2)

A comparative distribution of the participants of the study group (Source 2) for the stereophotographic criteria vs the imaging criteria is shown in Table 4. The Finnish guideline suggests possible sources of error in the different diagnostic groups, for example, large ONH size. In other cases, a diagnosis of preperimetric glaucoma was confirmed with the SWAP findings, as suggested by the Finnish guideline. In participants that required SWAP evaluation, defects were defined by deficits that produced a cluster of four or more points that were lower than a 5% probability level or a cluster of three or more points that were lower than a 1% probability level.11

Table 4 Comparative classification of participants (from Source 2) on the basis of the findings of photographic evaluation of optic nerve head and nerve fiber layer vs classification on the basis of GPS (HRT) and NFI (GDx-VCC)

Quantitative data of the ONH and RNFL measured by HRT and GDx are shown in the corresponding tables (photographic evaluation and imaging devices, tables 5 and 6). Also, mean retinal thickness, measured by OCT (Stratus OCT), is shown to allow comparison of groups tested with an independent device.

Table 5 Finnish Evidence-Based Guideline for Open-Angle Glaucoma implemented by photographic evaluation of optic nerve head and RNFL
Table 6 Finnish Evidence-Based Guideline for Open Angle Glaucoma implemented by imaging devices: HRT (parameter GPS) and GDx (parameter NFI)

Discussion

This study was designed to test the performance of the Finnish Evidence-Based Guideline for Open-Angle Glaucoma in control participants with no obvious signs of ocular disease (Controls; Source 1) and patients sent to our Glaucoma unit without further discrimination (Subjects; Source 2). Therefore, the difference between groups is on the basis of the origin of the group.

As explained in the Materials and methods, a gold standard for the diagnosis of glaucoma was not used so as to avoid biasing our results by the use of any structural or functional definition of glaucoma. Control participants (Source 1) are a sample extracted from general population, with similar demographic data (Caucasians, age, visual acuity) as the study group without glaucoma findings on routine ophthalmic evaluation. As in the general population, this group may contain glaucoma patients, but the prevalence of such cases is expected to be lower than the prevalence of glaucoma in the general adult population because the routine examination excluded participants with any evident damage in the fundus evaluation. Despite the fact that it is unlikely that there were any glaucoma patients in the control group, these participants should not be considered ‘conventional normal participants’, but rather as a sample that allows for comparison with a general adult population.

Application of the guideline classified most of control participants as normal individuals (without abnormal findings). Therefore, the diagnostic criteria defined for stereophotographic evaluation of the ONH and RNFL and the imaging devices showed a high specificity, although the strict sense of specificity should be questioned because a gold standard was not used to verify the diagnosis (inherent to the study design). The prevalence of glaucoma in these participants (without obvious signs of ocular disease) is expected to be lower than that of glaucoma in the general population, thus the specificity of the Finnish guideline was at least 78% (stereophotographic) and 83% (imaging devices). There were few differences between the two methods. The Finnish guideline on the basis of stereophotographic evaluation did not classify any participant into a category for which treatment is advised, whereas the Finnish guideline on the basis of imaging devices classified 1 of the 41 participants in the category of preperimetric glaucoma. One participant showed abnormal ONH and NFL and normal VF, and was advised to initiate, or consider initiating, treatment. One of the strong points of the use of these guidelines is that it minimizes diagnostic errors on the basis of a single criterion (or gold standard) that may jeopardize the quality of life of a participant because these guidelines require that two of the three criteria are met to initiate treatment. Otherwise, participants may have suspected glaucoma, but no treatment is advised. Use of this liberal definition of specificity (on the basis of procedure recommendations, see Table 1) increases the specificity values to 100 and 97.6%, respectively. These results are in general agreement with the specificity reported by the earlier authors for photographic and imaging criteria.12

In contrast, the diagnostic performance of the Finnish guideline in the participants from Source 2 produced a high number of positives. Group 2 is not a conventional ‘case or glaucoma’ group and cannot be considered as the glaucoma group. This group includes both normal participants and glaucoma cases. Normal findings in Source 2 are expected and do not indicate a lack of sensitivity because this group is not a glaucoma group, although the proportion of glaucoma participants in this group can be compared with a clinical setting of a Glaucoma Unit.

The distribution of patients according to application of the Finnish guideline by either method showed certain agreement. The concordance of diagnostic groups was 52.9% (patients on the diagonal of Table 5), whereas the concordance within a group with similar treatment advice was 56.4%. In every classification group, however, there were several findings that warrant special comment.

The participants (Source 2) with normal findings tended to have thicker corneas than controls (Source 1, P=0.05). In our opinion, the most interesting finding is that the differences from the control group were greater among participants referred for a glaucoma workup with normal performance in the tests than among study participants with evidence of glaucoma. Although the differences were not significant at the P<0.05 level (pachymetry in the control group vs normal findings with imaging devices; P=0.17), the pachymetry findings of the glaucoma patients and controls were similar (P=0.85–1.00), whereas participants biased by increased IOP may have thicker corneas (P=0.15). If pachymetry findings were a pathogenic factor, elevated pachymetries should be equally distributed in all the categories.

A considerable number of participants was classified as abnormal on ONH (and normal NFL and VF) by imaging, but normal on stereophotographs (47 subjects). In fact, imaging devices classified 84 participants in this category, whereas the photographic evaluation classified only four patients in this category. This fact may be explained by interference because of optic disc size. Participants with abnormal ONH findings, but without abnormal visual field and RNFL findings, had greater optic disc sizes than controls, although the differences were significant only when the Finnish guideline was applied to the results obtained by imaging devices. Thus, optic disc size may interfere with the classification on the basis of GPS, whereas stereophotographic evaluation compensates for this factor.

In contrast, the preperimetric glaucoma group defined by abnormal findings in RNFL and ONH included a high number of participants with abnormal SWAP findings, confirming the existence of functional deficits not detected by SAP (41 and 61% of participants). This preperimetric glaucoma was also confirmed by significant differences in the mean retinal thickness compared with control participants.

A high number of participants matched the preperimetric glaucoma category and therefore it was the most frequent overlapping category between normalcy and glaucoma, with structural deficits detected without functional deficits found in SAP. It was more common to find alterations in both the RNFL and the optic disc than in just the RNFL (in imaging and photographs). This fact might be partially explained by the exclusion of participants without good quality images, as 18 participants were excluded from the study because of low-quality RNFL photographs, and 6 of these 18 subjects also had low-quality ONH stereophotographs.

Abnormal performance in SAP with normal findings in the RNFL and ONH was not common and is most likely an artefact, as there were no significant differences from normalcy in any parameters other than PSD. Also, few patients were classified in this category. The guideline considers these findings to be suspicious of artefacts or other visual field anomalies.

In the case of abnormal findings in the visual field and ONH, suspicion of glaucoma would theoretically increase, but on the basis of the photographic evaluation, none of the participants matched this category. On the other hand, GPS-based criteria included some participants in this category, but GPS tends to overestimate the size of large optic discs, as mentioned earlier. This fact may support the use of RNFL red-free photography because evaluation of the entire bundle of fibers could be more accurate than evaluation with imaging devices of the peripapillary retina. In fact, there are no reports on this type of glaucoma in the literature (6), and the Finnish Guideline for Open-Angle Glaucoma recommends that a diagnosis other than glaucoma be considered (eg, neurologic disease) in such cases.

In conclusion, the Finnish Evidence-Based Guideline for Open-Angle Glaucoma proved to be a useful tool for classifying normal participants and patients with glaucoma or suspected glaucoma through photographic or imaging evaluation. The GPS criterion tended to overestimate the size of large optic discs. In general, structural changes were more frequently detected than functional changes. Although clinical guidelines are useful for glaucoma diagnosis and treatment, clinical decisions should always be made on an individual basis to achieve optimum patient care.