Introduction

Age-related macular degeneration (AMD) is the most common cause of legal blindness among patients older than 50 years of age in the developed countries [1,2,3]. A number of studies and research groups have evaluated risk factors for progression from early to late AMD, defined as the development of macular neovascularization (MNV) or geographic atrophy (GA) [4,5,6,7,8,9,10]. Initial studies identified risk factors from colour fundus photos, and these biomarkers have included features such as drusen area, presence of large drusen, hyperpigmentation and depigmentation [8, 9]. Optical coherence tomography (OCT), however, has largely supplanted colour photography as the dominant imaging technology in clinical practice and clinical trials. A number of groups, including ours, have now identified risk factors for AMD progression on OCT scans. In a longitudinal study of individual drusenoid lesions followed over time, intraretinal hyperreflective foci (IRHF) and hyporeflective drusen cores (hDC) were found to predict which drusen went on to develop atrophy or MNV at that location. Reticular pseudodrusen (RPD), which have now been shown to correspond to subretinal drusenoid deposits (SDD), have also been shown to increase the risk for development of atrophy as well as MNV (especially Type 3). Using drusen volumes automatically computed by Advanced RPE analysis software on the Cirrus OCT, Abdelfattah and colleagues were able to show that a drusen volume ≥0.03 mm3 within the central 3 mm of the macula was associated with a higher risk of progression to late AMD, and in particular GA [11]. The importance of high central drusen volume (based on this cutpoint of 0.03 mm3) was confirmed in subsequent larger studies [12,13,14].

Lei and colleagues incorporated high central drusen volume and these other factors (SDD, IRHF, hDC) into a simple OCT-based staging system that was patterned after the AREDS simple scale, and was intended to be used for prognostication in clinical practice or identifying high-risk patients for enrolment into early intervention clinical trials [12]. Drusen volume measurements, however, are not automatically computed by many OCT devices, and even if 3rd party software were to be used for drusen quantification, this would generally require that clinicians had obtained high-density volume scans to allow volumetric analysis. These limitations hinder the broad applicability of these OCT-based risk stratification systems.

To address this limitation, this study evaluated whether subjective inspection of OCT B-scans alone could be used to determine if a patient’s eye contained a drusen load above or below the high-risk threshold of 0.03 mm3.

Materials and methods

This study was approved by the Institutional Review Boards of the University of California Los Angeles, the University of Miami, and the University of Pennsylvania. All procedures were conducted in accordance with the Declaration of Helsinki and informed consent was obtained by all subjects before being enroled.

We randomly selected 100 subjects (200 eyes) enroled in the Amish Eye Study who were diagnosed with early or intermediate AMD in both eyes based on review of colour fundus photographs and OCT. The Amish Eye Study is an ongoing NEI-supported clinical research investigation aimed at identifying novel phenotypic and genetic biomarkers for evaluating AMD in an elderly population (individuals over the age of 50). The baseline features, including the OCT characteristics, were described in detail in a previous report [13].

Exclusion criteria for the study were evidence on OCT or colour photos of late AMD defined by the presence of MNV or RPE atrophy, the presence of any other retinal disease other than AMD, poor OCT image quality due to significant media opacity and signal strength <6 which is the minimum level specified by the manufacturer in order to produce reliable drusen analysis. All subjects were imaged on the same day with two SD-OCT devices: (A) Cirrus OCT (Carl Zeiss Meditec, Inc., Dublin, CA) with a 6 × 6 mm macular cube protocol of 128 horizontal B-scans and 512 A-scans per B-scan and (B) Heidelberg Spectralis HRA + OCT (Heidelberg Engineering, Heidelberg, Germany) with a 6 × 6 mm macular cube protocol of 97 horizontal B-scans and 1024 A-scans per B-scan.

Grading procedure

Optical coherence tomography data for each subject were exported in a coded de-identified fashion for subsequent analysis at the Doheny Image Reading Center (DIRC). After exportation and transmission to DIRC, OCT images were reviewed for qualitative features and analysed for quantitative measures by certified OCT graders using the Cirrus Advanced RPE Analysis software (Version 6.0). The method used by the automated Advanced RPE Analysis software to quantify drusen volume has been described in previous reports [15, 16]. Briefly, the algorithm segments the RPE band following the RPE contour and also estimates the original RPE position (termed the RPE fit) in the absence of an RPE elevation. The region between the RPE band and the RPE fit is then quantified as the drusen volume. Automatic drusen volumes within the 3-mm circle (centred on the fovea) of the Cirrus grid was recorded by a senior independent grader (MN). As drusen volumes were calculated by automated FDA-cleared instrument software, and the accuracy and reliability of this software have been assessed in prior reports [15, 16], the reproducibility of this “gold-standard” metric was not re-assessed in this study.

A drusen volume training set (five cases) was chosen by the senior grader (MN) from the cases and had both Spectralis and Cirrus OCT data with known drusen volume values from the Cirrus. The training set was reviewed by two other OCT graders (FC and SS) to establish subjective recognition for the appearance on Spectralis OCT B-scans of a case which met the threshold drusen volume on the Cirrus OCT of “high central drusen volume” (Fig. 1). After this training, the two graders reviewed the cases collected for this study in an independent, masked fashion. The estimation of drusen volume was performed evaluating whether the central drusen volume (within the central 3 mm) was either <0.03 or ≥0.03 mm3. Thus, to assess central drusen volume, the graders only considered those B-scans (or portions of B-scans) which they deemed to fall within the central 3 mm (based on the middle ring of the Early Treatment of Diabetic Retinopathy Study (ETDRS) grid) of the macula.

Fig. 1: Five cases of intermediate AMD used for training graders for drusen volume assessment.
figure 1

The first row shows a structural OCT B-scan from the Cirrus OCT volume showing the inner and outer segmentation boundaries at the level of RPE and RPE fit (estimated Bruch’s membrane position), respectively. The drusen volume measurement (in mm3) as calculated by the Cirrus Advanced RPE Analysis is shown (ranging from 1.63 to 0.01 mm3). The second row shows the corresponding drusen volume maps produced by the Cirrus Advanced RPE Analysis software, highlighting the drusen which contributed to the drusen volume. The third through seventh row shows representative B-scans from the Spectralis OCT within the central 3 mm of the ETDRS grid. The bottom row shows the Spectralis infrared reflectance images with locations of the selected B-scans shown as dashed lines. AMD age-related macular degeneration, OCT optical coherence tomography, RPE retinal pigment epithelium, ETDRS Early Treatment Diabetic Retinopathy Study.

Statistical analysis

Statistical analysis was performed with IBM SPSS Statistics for Windows, Version 26.0 (IBM Corporation, Armonk, NY, USA). Drusen volume was measured in millimetre cubed, and the results were expressed as mean ± SD. Cohen’s kappa (κ) was calculated to measure the agreement between two graders and the agreement between graders and automatic drusen volumes.

Results

A total of 200 eyes from 100 subjects with early or intermediate AMD were included in this study. Fifty-eight (58%) were women with a mean age of the population of 77.1 years (SD, 9.5), with a median age of 79 years (range, 53–99 years). Among these 200 eyes, 11 were excluded due to poor image quality, 5 were used for initial training of the graders, and thus ultimately 184 eyes were included in the analysis.

The mean drusen volume was 0.05 ± 0.15 mm3, and 67 eyes (36%) were measured to have a drusen volume ≥0.03 mm3 and 117 (64%) had a volume lower than 0.03 mm3. The distribution of drusen volumes among all cases is shown in the histogram in Fig. 2. The agreement between the graders’ subjective inspection of Spectralis B-scans and the automated Cirrus Advanced RPE Analysis for the presence of high central drusen volume (≥0.03 mm3) was excellent with a K = 0.92 (95% confidence intervals (CI) = 0.86–0.98) for grader 1 and K = 0.84 (95% CI = 0.76–0.93) for grader 2 (Table 1). The agreement between graders for determination of high central drusen volume was excellent with a K = 0.88 (exact 95% confidence intervals, 0.81–0.95). Most cases where there was either a disagreement between graders or a disagreement between both graders and the automated drusen volume measurement were close to the threshold drusen volume (0.03 mm3; Fig. 2).

Fig. 2: Distribution of drusen volumes in study cohort.
figure 2

The frequency of drusen volumes (binned in 0.001 mm3 intervals) is plotted in this histogram. The numbers of cases where the graders disagreed with the automated drusen volume analysis or the graders disagreed between themselves is shown. Note that most cases of disagreements were near the 0.03 mm3 threshold.

Table 1 Agreement between graders.

Discussion

In this study, we evaluated the ability of two independent graders to qualitatively determine whether central drusen volume was ≥0.03 mm3 based on rapid inspection of the Spectralis B-scans alone. We found an excellent agreement between the two graders and the drusen volume automatically computed by the Cirrus OCT using the Cirrus Advanced RPE Analysis software. These findings suggest that a qualitative determination of high central drusen volume may be sufficient and could potentially be used in the estimation of the risk for progression to advanced AMD.

The importance of assessing drusen volume and in particular determining the presence of high central drusen volume has been established by several studies. Baseline drusen volume appears to be an important predictor for the development of late AMD within 2 years, suggesting that OCT-derived drusen volume measurements may be a useful biomarker to identify eyes at the highest risk for progression to late AMD. In particular, Abdelfattah et al. found in patients with AMD, that a drusen volume >0.03 mm3 was associated with 4-fold increased risk for developing late AMD compared with those with lower values [11]. Nassisi et al. investigated the relationship between several OCT features, including drusen volume, and progression to late AMD in the fellow eyes of patients enroled in the Study of Ranibizumab Administered Monthly or on an As-needed Basis in Patients with Subfoveal Neovascular AMD (HARBOR trial), finding a hazard ratio of 1.46 for high central drusen volume [14].

Drusen volume in these previous studies was based on an automatic analysis performed by the Cirrus OCT using the Advanced RPE Analysis software which has been cleared by the FDA. The automated software algorithm relies on the detection of the retinal pigment epithelium (RPE) surface and an estimated fit of the original or expected RPE assuming no elevation within the 3- and 5-mm circles centred on the fovea. Previous studies have demonstrated that OCT-based automated drusen quantification is reproducible [16,17,18]. A challenge for relying on automated drusen volume measurements for use in multi-centre clinical trials, clinical research or clinical practice, however, is that it is limited to the few OCT devices that perform this measurement. Most OCT devices do not automatically generate volume and even off-line image processing software tools will generally require a dense volume scan. Our study, however, shows that after a very brief training period (review of only 5 cases), human graders are able to reliably determine the presence of high central drusen volume. This is important as it may also allow this risk factor to be assessed in many retrospective/historical OCT datasets where automated drusen volume computation may not be possible.

A post hoc review of cases in which there were disagreements in drusen volume assessment was conducted to determine if there were opportunities to further improve the subjective evaluation. There was only one case where both graders agreed that a high central drusen volume was present but was not confirmed by the automated analysis, which measured a volume of only 0.02 mm3 (Fig. 3). In four cases, both graders agreed the drusen volume was below the threshold but the automated software measured a value ≥0.03 mm3—the mean automated drusen volume in these cases was 0.037 (range: 0.03–0.05). In 10 cases, the graders disagreed on their assessment of the presence of high central drusen volume. Overall, among these 15 cases in which there was either disagreement with the automated measurement or between the two graders, the mean automated drusen volume in these cases was 0.029. As 12 of these 15 cases had a drusen volume close to the 0.03 mm3 threshold value (range between 0.02 and 0.04 mm), we suspect that this reflects the limits of the precision of the human grader to subjectively discriminate drusen volumes. On the other hand, the exact 0.03 mm3 cutpoint is somewhat arbitrary, so missing a few cases close to threshold may be acceptable for general clinical use. Overall, in the cohort, there were a total of 54 cases (30%) with drusen volume between 0.02 and 0.04 mm3.

Fig. 3: Two representative cases of disagreement between graders and automated software.
figure 3

On the left, a case where both graders agreed that a high central the drusen volume was present but was not confirmed by the automated analysis, as the value was 0.02 mm3. On the right, a case where both graders agreed that the drusen volume was below the 0.03 mm3 threshold, but the automated analysis computed a value of 0.04 mm3. The top right and left images show the Cirrus infrared reflectance image with superimposed drusen volume maps from the Cirrus OCT. The middle right and left images show the superimposed drusen volume maps on the Spectralis infrared reflectance image with the red and white dashed lines indicating the locations of the Spctralis B-scans for the two cases which are shown in the central two columns.

Our study is not without limitations, which should be considered when assessing our results. First, graders were trained before starting the drusen volume assessments. On the other hand, the training was brief and based on only five representative cases, which suggests that most clinicians should be able to learn to do this quickly. Second, although this approach appears to be useful for determining whether a particular case meets the threshold for a pre-defined high central drusen volume, it does not provide a precise quantitative measurement. While the categorical determination is sufficient for use of the current OCT-based simple scale for AMD severity, future more precise AMD severity scales may utilize drusen volume as continuous measure.

In summary, we observed that the presence of high central drusen volume could be determined reliably by a qualitative inspection of OCT B-scans. This approach may be feasible for use in clinical practice and in clinical trials to assess the risk for progression to late AMD.

Summary

What was known before

  • Although an optical coherence tomography (OCT)-derived central drusen volume ≥0.03 mm3 has been found to be a risk factor for progression to late age-related macular degeneration (AMD), this parameter is not currently available on most OCT devices or acquisition protocols.

What this study adds

  • The purpose of this study was to evaluate the ability of human graders to qualitatively assess drusen volume by inspection of OCT B-scans. The presence of a high central drusen volume can be assessed reliably by qualitative inspection of OCT B-scans. This approach may be useful in the assessment of risk for progression to late AMD.