Introduction

Glaucoma is an optic neuropathy characterized by ganglion cell death that manifests clinically as characteristic optic nerve head (ONH) and retinal nerve fiber layer (RNFL) change with correlating visual field changes. Optical coherence tomography, a well-accepted tool for glaucoma diagnosis, enables to objectively measure the ONH, RNFL and macular thickness parameters.1, 2, 3, 4, 5 Spectral domain OCT (SD–OCT) is a recent technique that enables the imaging of ocular structures with higher resolution and faster scan rate compared with the previous version of this technology (Stratus OCT, Carl Zeiss Meditec Inc., Dublin, CA, USA).6, 7

In addition to the changes that occur in ONH and RNFL, another region that has been proposed to manifest changes in glaucoma is the macula, because more than 50% of the ganglion cells in the retina are located at macula and the ganglion cell layer is more than one-cell-layer thick at the macula.8, 9 However, previous studies with Stratus OCT reported that the macular thickness measurements were inferior to RNFL measurements to detect glaucoma.1, 2 One possible reason for this was that the limited resolution of Stratus OCT permitted measurements only of the entire retinal thickness, whereas the changes in glaucoma preferentially occurred in the inner retinal layers. SD–OCT by its ability to measure individual layers of the retina may capture these changes in the macula, and a few recent studies with SD–OCT, in fact, have shown that the macular inner retinal (MIR) thickness measurements, which consists of the innermost 3 layers of the retina, namely the nerve fiber layer, ganglion cell layer, and the inner plexiform layer, were as good as the RNFL measurements in detecting glaucoma.10, 11, 12, 13, 14 These studies, however, included glaucoma patients with all severities of visual field damage that might have biased the results, as the estimates of diagnostic accuracy are known to improve with the severity of disease.15, 16, 17 In clinical practice, the challenge is to diagnose glaucoma in early stages and there is limited literature on the diagnostic ability of SD–OCT in early glaucoma.17, 18

The purpose of the present study was to compare the diagnostic accuracies of the RNFL and MIR measurements obtained with SD–OCT in early glaucoma.

Methods

This was an observational, cross-sectional, study conducted at a tertiary eye care facility in South India between August 2008 and June 2009. Informed consent was obtained from each participant, and the Ethics Committee of LV Prasad Eye Institute approved all methodology. All methods adhered to the tenets of the Declaration of Helsinki for research involving human subjects.

Consecutive glaucoma patients, categorized as early stage by Hodapp–Anderson–Parrish classification19 formed the study group, whereas the normal subjects were recruited from among those who came for a routine eye examination, patients’ relatives and hospital staff. Inclusion criteria were age ≥18 years, best corrected visual acuity of 20/40 or better, refractive error within ±5.0 D sphere and ±3 D cylinder, and willingness to participate in the study. Exclusion criteria were presence of any media opacities that prevented good imaging, intraocular surgery within the previous 6 months, any retinal (including macular) or neurological diseases other than glaucoma that could confound the results of visual field examination and structural measurements with SD–OCT.

All participants underwent a comprehensive ocular examination that included a detailed medical history, best corrected visual acuity measurement, slit-lamp biomicroscopy, Goldmann applanation tonometry, gonioscopy, dilated fundus examination, standard automated perimetry (SAP), and SD–OCT imaging with RTVue (Optovue Inc, Fremont, CA, USA).

Control subjects had a normal ocular examination, intraocular pressure (IOP) less than 22 mm Hg in both eyes, no past history of increased IOP, no family history of glaucoma, no optic disc morphology suspicious of glaucoma, and normal visual fields. Optic disc examinations were done by two glaucoma specialists, and glaucomatous damage was suspected on the basis of focal or diffuse neuroretinal rim thinning, localized notching or nerve fiber layer defects. Glaucoma patients had characteristic optic disc changes, as defined above, with correlating visual field changes. All glaucoma patients had previous experience of undergoing visual field examination.

SAP was performed using a Humphrey Field analyzer, model 750 (Zeiss Humphrey Systems, Dublin, CA, USA), with the Swedish interactive threshold algorithm standard 24-2 algorithm. Reliability criteria were xation losses and false positive or false negative response rates of less than 20%. Abnormal SAP results were considered to be related to glaucoma, if they correlated with the optic disc findings and there were no other abnormalities to explain the defect. Glaucomatous visual eld defects were dened by two of the following three criteria on SAP: the presence of a cluster of three points on pattern deviation probability plot with a P-value of less than 5%, one of which had a P-value less than 1%, or a pattern standard deviation with a P-value less than 5%, or a glaucoma hemield test result outside normal limits.

Instrumentation

SD–OCT examination was performed with the RTVue (software version 4.0.5.39; Optovue Inc., Fremont, CA, USA). RTVue uses a scanning laser diode with a wavelength of 840±10 nm to provide images of ocular microstructures. The protocols used for imaging with RTVue in this study were ONH and GCC (ganglion cell complex). These protocols have been explained earlier.11 All patients had both protocols performed on the same day.

Optic nerve head scan

The ONH protocol consists of 12 radial scans, 3.4 mm in length and 6 concentric ring scans ranging from 2.5 to 4.0 mm in diameter all centered on the optic disc. Retinal pigment epithelium (RPE) tips were automatically detected by the software, which were refined manually by the operator. The RTVue software, then, delineates the optic disc margin by joining the RPE tips. The optic cup is automatically defined by the software by fitting a plane 150 μm parallel to and above a plane that fits the coordinates of the RPE tips by the least squared error method. ONH protocol calculates various parameters that describe the ONH and also generates a polar RNFL thickness map that is the RNFL thickness measured along a circle 3.45 mm in diameter centered on the optic disc. The parameters generated are the average RNFL thickness in the temporal, superior, nasal, and inferior quadrant as well as the overall average along the entire measurement circle. In addition, the superior, inferior, and the overall average measurements are compared with the normative database within the software and a diagnostic categorization is provided as outside normal limits, if the value falls lower than the 99.9% confidence interval (CI) of the healthy, age-matched population. A borderline result indicates that the value is between the 95% and 99.9% CI, and a within-normal-limits result indicates that the value is within the 95% CI. The normative database of RTVue has data on 1081 subjects of different ethnicities, which are used to derive the normal, borderline, and outside normal limits cutoffs for these parameters.

GCC scan

The GCC protocol was used to obtain MIR measurements. The parameters generated by the GCC analysis are the average, superior, and inferior MIR thickness, average superior-minus inferior-MIR thickness and superior-minus-inferior thickness standard deviation. In addition to the above parameters, the GCC protocol provides three other parameters called global loss volume (GLV), focal loss volume (FLV) and root mean square (RMS). These have been explained previously.10, 11 GLV measures the average amount of GCC loss over the entire GCC map. It is calculated from the fractional deviation map, which is the map showing the percentage of GCC thickness decreases at each pixel location compared with the expected or normal value at each pixel determined by the instrument's built-in normative database. GLV is calculated by dividing the sum of the values at all pixel locations where the fractional deviation map value is <0, by the total area. FLV measures the average amount of focal loss over the entire GCC map. FLV detects focal loss using a pattern deviation map to correct for the overall absolute changes, much like the corrected pattern standard deviation in the visual fields. For calculation of FLV, a pattern map, which is a normalized map calculated by dividing the GCC thickness values at each location by the average GCC thickness value from the entire map (for an individual), is calculated. The difference between this pattern map for an individual and the average pattern map of the normative database gives the pattern deviation map. A probability value is assigned at each pixel depending on the statistical significance of this difference. FLV is calculated by dividing the sum of the pattern deviation map values at pixels where the fractional deviation map value is <0 and the pattern deviation map value is statistically significant at P<5%, by the total area. RMS or the pattern coefficient of variation provides a summary of how well the fractional and pattern deviation maps of an individual fit the normal pattern. The worse the fit, the higher the value. In addition, the average, superior, and inferior MIR thickness, FLV and GLV are compared with the normative database within the software and a diagnostic categorization is provided as outside normal limits, borderline, and within normal limits similar to the RNFL measurements. All GCC scans were inspected for segmentation algorithm failures and one eye in glaucoma group where the algorithm failed was excluded.

Only good quality images, as defined by a signal strength index of ≥30 were used for analysis. RTVue imaging was done on the same day as visual field testing.

Statistical analysis

Descriptive statistics included mean and standard deviation for normally distributed variables, and median, first quartile, and third quartile values for non-normally distributed variables. Receiver operating characteristic (ROC) curves were used to describe the ability to discriminate glaucomatous from healthy eyes for each RTVue software-provided parameter. Sensitivities at fixed specificities of 80% and 95% were determined for all the parameters. To obtain CIs for areas under the receiver operating characteristic curves (AUC), a bootstrap re-sampling procedure was used (n=1000 re-samples). As measurements from both eyes of the same subject are likely to be correlated, the cluster of data for the study subject was considered as the unit of re-sampling when calculating standard errors. This procedure has been used in literature to adjust for the presence of multiple correlated measurements from the same unit.20, 21

Likelihood ratios (LRs) were reported for diagnostic categorization (outside normal limits, borderline, or within normal limits) provided by the instrument after comparison with the respective normative database. LR is the probability of a given test result in those with disease divided by the probability of the same test result in those without the disease.22, 23, 24 The LR for a given test result indicates how much that result will raise or lower the probability of disease. A LR of 1 or close to 1 would mean that the test provides no further information about the post-test probability of the disease. LRs higher than 10 or lower than 0.1 would be associated with large effects on post-test probability; LRs from 5 to 10 or from 0.1 to 0.2 would be associated with moderate effects; LRs from 2 to 5 or from 0.2 to 0.5 would be associated with small effects.22 The 95% CIs for LRs were calculated according to the method proposed by Simel et al.25

Statistical analyses were performed using commercial software (Stata ver. 10.0; StataCorp, College Station, TX, USA). A P-value of <0.05 was considered statically significant.

Results

In all, 125 eyes of 64 normal subjects and 91 eyes of 59 early glaucoma patients were included for the analysis. Age and SAP parameters of the two groups of participants are shown in Table 1. Patients in the glaucoma group had worse mean deviation and pattern standard deviation on SAP compared with the normal subjects.

Table 1 Age and visual field parameters of the study participants

The mean values of the RNFL parameters in the two groups of participants are shown in Table 2. Significant differences between the two groups were found for all the parameters except the temporal quadrant RNFL thickness. AUC and sensitivities at fixed specificities for the parameters are also shown in Table 2. The AUCs for the RNFL parameters ranged from 0.537 for the temporal quadrant thickness to 0.821 for the inferior quadrant RNFL thickness. Inferior quadrant thickness had the highest sensitivity of 58.2% at a specificity of 95%.

Table 2 RNFL thickness parameters in glaucoma and healthy eyes with AUC and sensitivities at fixed specificities

The mean values of the MIR measurements in the two groups of participants are shown in Table 3. Significant differences between the two groups were found for all the parameters. AUC and sensitivities at fixed specificities for the parameters are also shown in Table 3. The AUCs for the MIR parameters ranged from 0.603 for the superior-minus-inferior thickness averaged to 0.908 for FLV. FLV also had the largest sensitivity of 52.7% at a specificity of 95%.

Table 3 MIR parameters in glaucoma and healthy eyes with AUC and sensitivities at fixed specificities

Table 4 shows the LRs with their 95% CIs for the parameters that are compared with the instrument's normative database. For most of the parameters except FLV and GLV, outside normal limits results were associated with large effects on the post-test probability of disease. Borderline results were associated with small effects, whereas, within normal limits, results were associated with no effects on the post-test probability of disease.

Table 4 Likelihood ratios (with 95% confidence interval)a of the normative database classification of RNFL and MIR measurements with RTVue

Figure 1 shows the ROC curves of the best RNFL and the best MIR parameter. The AUC of the MIR parameter with the largest AUC, FLV, was significantly higher than that of the RNFL parameter with the largest AUC, inferior quadrant RNFL thickness (0.908 vs 0.821; P<0.001). The sensitivities of these parameters at high specificity of 95%, however, appeared comparable (52.7 vs. 58.2%). Figure 2 shows the ROC curves of the average MIR and average RNFL thickness parameters that were also comparable (0.735 vs. 0.799; P=0.07).

Figure 1
figure 1

Receiver operating characteristic curves of the best retinal nerve fiber layer (inferior quadrant RNFL thickness) and macular inner retinal (focal loss volume, FLV) parameters.

Figure 2
figure 2

Receiver operating characteristic curves of the macular inner retinal average thickness and average retinal nerve fiber layer thickness parameters.

Discussion

In this study, we demonstrated that the MIR parameters were as good as the RNFL parameters to detect early glaucoma in Indian eyes. We also reported the LRs of the diagnostic classification of the parameters.

Diagnostic tests are usually summarized in terms of sensitivity, specificity, AUC and LR. The advantages and limitations of these measures have been described earlier.26, 27, 28 LR appears to be higher than the previous measures in hierarchy; it expresses the magnitude by which the probability of a diagnosis in a given patient is modified by the results of the test. In other words, the LR indicates how much a given diagnostic test result will raise or lower the pretest probability of the disease in question. We found that the outside normal limits results of most of the RNFL and MIR parameters had a large effect on the post-test probability of glaucoma. Borderline results were associated with small effects that would mean that borderline results were of little use in either ruling in or ruling out the diagnosis. Within normal limits, results were associated with no effect on the post-test probability of glaucoma, which would mean that, within normal limits, results were of little use in ruling out the diagnosis. However, it should be noted that even small effects on post-test probability may be relevant and useful, depending on other clinical information and the pretest probability of disease.

Our results are in agreement with earlier studies comparing the diagnostic accuracies of RNFL and macular parameters with SD–OCT in early glaucoma.17, 18 Seong et al,18 in normal tension glaucoma eyes with early visual field damage, found that the diagnostic ability of MIR parameters were comparable to that of RNFL parameters; however, their analysis was limited to average, superior and inferior quadrant thicknesses of both RNFL and MIR. Using a regression model, Rao et al17 also found comparable diagnostic abilities between average RNFL and GCC RMS parameters at different severities of visual field damage. Mori et al12 also found that the MIR volume had diagnostic abilities comparable to that of RNFL measurements. In contrast, Huang et al,29 in a similar study found that the MIR parameters were inferior compared with RNFL parameters to detect glaucoma. However, Huang et al used the earlier software version of RTVue and evaluated only the superior, inferior, and average MIR parameters and not FLV, GLV, or RMS that were in fact, the MIR parameters associated with higher AUCs both in our study and in previous studies by Seong et al and Rao et al.

These results of improved diagnostic abilities of macular evaluation algorithms with SD–OCT compared with Stratus OCT may be attributable to the better resolution and greater speed of image acquisition of SD–OCT. The limited axial resolution of the Stratus OCT allowed for only full macula thickness measurements, whereas the higher resolution of SD–OCT allows for more specific segmentation of the retinal layers with which layers associated with glaucomatous damage, namely nerve fiber layer, ganglion cell layer and inner plexiform layer can be studied and analyzed. The greater scanning speed of SD–OCT enables acquisition of a large number of scans in a single imaging session, reducing the need for data interpolation.

One of the limitations of the study is the case-control design with a clear distinction between glaucoma patients (cases) defined, based on the presence of glaucomatous visual field defects, and normal subjects (controls) recruited from the general population and having no suspicious findings of glaucoma. Such a design has been shown to inflate the actual diagnostic ability of a test.30, 31 Another limitation, in general to all studies in glaucoma is the lack of reference standard for diagnosing the disease. Our glaucoma cohort included patients with evidence of both structural changes in optic disc and correlating visual field changes. Although the strict inclusion criteria would have reduced the false positives to a great extent, this would have excluded a few cases of glaucoma with damage detectable only on structural or functional tests. However, as the aim of our study was to compare RNFL and MIR parameters of SD–OCT in the same population, this is unlikely to have influenced the comparison.

In conclusion, MIR parameters with SD–OCT were as good as the RNFL parameters to detect early glaucoma in Indian eyes. Further studies should evaluate if our study results are reproducible in other ethnic groups.