Prediction of visual function from automatically quantified optical coherence tomography biomarkers in patients with geographic atrophy using machine learning

Balaskas, Konstantinos; Glinton, S.; Keenan, T. D. L.; Faes, L.; Liefers, B.; Zhang, G.; Pontikos, N.; Struyven, R.; Wagner, S. K.; McKeown, A.; Patel, P. J.; Keane, P. A.; Fu, D. J.

doi:10.1038/s41598-022-19413-z

Download PDF

Article
Open access
Published: 16 September 2022

Prediction of visual function from automatically quantified optical coherence tomography biomarkers in patients with geographic atrophy using machine learning

Konstantinos Balaskas¹,
S. Glinton¹,
T. D. L. Keenan²,
L. Faes¹,
B. Liefers^1,4,
G. Zhang¹,
N. Pontikos¹,
R. Struyven¹,
S. K. Wagner¹,
A. McKeown³,
P. J. Patel¹,
P. A. Keane¹ &
…
D. J. Fu¹

Scientific Reports volume 12, Article number: 15565 (2022) Cite this article

2996 Accesses
10 Citations
18 Altmetric
Metrics details

Subjects

Abstract

Geographic atrophy (GA) is a vision-threatening manifestation of age-related macular degeneration (AMD), one of the leading causes of blindness globally. Objective, rapid, reliable, and scalable quantification of GA from optical coherence tomography (OCT) retinal scans is necessary for disease monitoring, prognostic research, and clinical endpoints for therapy development. Such automatically quantified biomarkers on OCT are likely to further elucidate structure–function correlation in GA and thus the pathophysiological mechanisms of disease development and progression. In this work, we aimed to predict visual function with machine-learning applied to automatically acquired quantitative imaging biomarkers in GA. A post-hoc analysis of data from a clinical trial and routine clinical care was conducted. A deep-learning automated segmentation model was applied on OCT scans from 476 eyes (325 patients) with GA. A separate machine learning prediction model (Random Forest) used the resultant quantitative OCT (qOCT) biomarkers to predict cross-sectional visual acuity under standard (VA) and low luminance (LLVA). The primary outcome was regression coefficient (r²) and mean absolute error (MAE) for cross-sectional VA and LLVA in Early Treatment Diabetic Retinopathy Study (ETDRS) letters. OCT parameters were predictive of VA (r² 0.40 MAE 11.7 ETDRS letters) and LLVA (r² 0.25 MAE 12.1). Normalised random forest feature importance, as a measure of the predictive value of the three constituent features of GA; retinal pigment epithelium (RPE)-loss, photoreceptor degeneration (PDR), hypertransmission and their locations, was reported both on voxel-level heatmaps and ETDRS-grid subfields. The foveal region (46.5%) and RPE-loss (31.1%) had greatest predictive importance for VA. For LLVA, however, non-foveal regions (74.5%) and PDR (38.9%) were most important. In conclusion, automated qOCT biomarkers demonstrate predictive significance for VA and LLVA in GA. LLVA is itself predictive of GA progression, implying that the predictive qOCT biomarkers provided by our model are also prognostic.

Clinical validation for automated geographic atrophy monitoring on OCT under complement inhibitory treatment

Article Open access 29 April 2023

Fundus autofluorescence and optical coherence tomography biomarkers associated with the progression of geographic atrophy secondary to age-related macular degeneration

Article 16 August 2021

Automated deep learning-based AMD detection and staging in real-world OCT datasets (PINNACLE study report 5)

Article Open access 09 November 2023

Introduction

Geographic atrophy (GA) is a chronic progressive degeneration of the macula, the central 24 mm² (20°) region of the retina required for central vision. It is the defining lesion of late age-related macular degeneration (AMD)^1,2. GA is associated with significant, irreversible vision loss. To assess visual function in AMD, visual acuity (VA) measured using an eye chart under standardised lighting is an established key outcome measure as defined by the International Consortium for Health Outcomes Measurement³. Interestingly, in early stages of AMD, VA measured under low luminance conditions (low luminance visual acuity [LLVA]) can be reduced whilst standard VA remains unaffected^4,5. Most patients initially present with non-central GA (affecting the parafoveal region of the macula) and gradually proceed to foveal involvement (central GA)^6,7,8. In this context, LLVA correlates with future deterioration of VA and its reduction indicates that the patient will head towards the beginning of end-stage disease, and loss of foveal function may be impending. Yet the physiological mechanism underlying this outcome measure is unknown^9,10.

With no current treatments for GA and promising therapies on the horizon^11,12,13, it is increasingly important to establish structure–function relationships within GA, as they: (i) provide further understanding of the pathophysiological mechanisms underlying GA; (ii) refine monitoring of disease activity and thereby diagnostic and prognostic counselling at the individual patient level; (iii) serve to define clinical endpoints in clinical trials and enable identification of earlier stages, i.e. opportunities for interventions that can prevent vision loss.

The current reference standard for diagnosing, characterising, and monitoring progression in GA is spectral domain optical coherence tomography (SD-OCT), as it captures the cross-sectional morphology of retinal structures¹⁴. Indeed, an international consortium of experts in AMD and retinal imaging—the Consensus of Atrophy Meetings (CAM) group—chose to define disease progression in GA based on SD-OCT structural markers^1,15). Their proposed terms for macular atrophy in the context of AMD each describe the affected anatomical layers and represent distinct disease stages. Herein, complete RPE (retinal pigment epithelium) and outer retinal atrophy (cRORA) represents the endpoint of atrophy—encompassing GA—and is defined by regions of: choroidal hypertransmission with diameter ≥ 250 µm; RPE attenuation or disruption with diameter ≥ 250 µm; overlying photoreceptor degeneration; and absence of RPE tear¹⁵. Regions in which these features overlap but are less than 250 µm are termed incomplete RPE and outer retinal atrophy (iRORA)—a precursor stage to cRORA¹⁶. Recently, we developed a deep learning model to automatically segment RPE-loss, photoreceptor degeneration, and hypertransmission from OCT scans and was shown through external validation to be comparable with human specialist efforts¹⁷. Regions in which these features overlap represent RORA (RPE and outer retinal atrophy) and can be considered as a continuous variable that encompasses both cRORA (GA) and iRORA.

To date, the relationship between OCT structural features and visual function in GA has relied on manual segmentation and does not consider the biomarkers defined in the CAM consensus statement^18,19. This study makes use of an externally-validated algorithm that automatically segments these quantitative OCT (qOCT) biomarkers, applying it to non-neovascular AMD datasets with GA (n = 476) from both a clinical trial and routine clinical care. Applying machine learning modelling, both standard and low luminance VA could be predicted to a level that has yet to be achieved. Furthermore, spatial localisation and severity of anatomical disruption of qOCT biomarkers were mapped onto the macula providing further insight into structure–function relationship in GA and the (otherwise unknown) underlying physiological mechanism of early LLVA impairment.

Methods

Study design

This is a non-interventional, post hoc analysis of patients with GA secondary to non-neovascular AMD. Reporting adhered to guidelines for observational studies put forth by the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement²⁰.

Study cohort

This study considered data from two sources: participants enrolled in the FILLY trial (NCT02503332)^11,21 and real-world data collected as part of routine clinical care for patients with GA at Moorfields Eye Hospital NHS Foundation Trust, London, United Kingdom.

The FILLY trial was a phase II, international, multicenter clinical trial assessing safety, tolerability, and evidence of activity of intravitreal pegcetacoplan in eyes with GA secondary to non-neovascular AMD with best-corrected visual acuity (BCVA) greater than 24 Early Treatment Diabetic Retinopathy Study (ETDRS) letters (Supplementary Methods 1 and Supplementary Fig. 1). Here, only baseline trial data were considered, i.e. the timepoint prior to initiation of the intervention. The VA values represent BCVA testing using ETDRS charts by certified examiners after refraction. BCVA under low luminance conditions (low luminance visual acuity; LLVA) was measured as for BCVA but with a 2.0 log neutral density filter covering the eye. The study eye was assessed first, after allowing for adequate time for adaptation to low luminance conditions, empirically determined. LLVA was measured prior to BCVA to avoid memorisation of letters. Low luminance deficit (LLD) was defined as the difference between BCVA and LLVA (i.e. BCVA–LLVA).

Patients from Moorfields Eye Hospital were included if all of the following criteria were met (Supplementary Fig. 1): they attended a medical retina clinic between 01-January-2016 and 31-January-2019; GA was included as a term in the clinic correspondence letter; Heidelberg OCT scans (greater than 25 b-scans per volume) were obtained from both eyes within 15 days of the appointment; absence of prior anti-VEGF therapy; and the OCT scans were confirmed to contain GA secondary to non-neovascular AMD (according to manual validation by a Reading Centre expert grader at the Moorfields Reading Centre). Here, VA (with habitual correction or pinhole) was measured as part of a clinical examination with an ETDRS chart. Pinhole VA was used if better than VA with habitual correction. For any eyes with multiple timepoints meeting the eligibility criteria, the earliest timepoint was selected.

For all patients, only a single macular OCT scan and a corresponding VA measurement were considered per eye. The luminance range of ETDRS charts across the two patient cohorts was 85–120 cd/m². All OCT volumes were acquired using Heidelberg Spectralis OCT (Heidelberg Engineering, Heidelberg, Germany) having equal to or greater than 25 b-scans covering 6 × 6 × 2 mm³.

The study was conducted in compliance with the tenets of the Declaration of Helsinki and has approval from the Moorfields Eye Hospital Institutional Review Board (research reference: ROAD17/031, clinical audit reference: CA17/MR/28). The requirement for informed consent was waived by the national health research and ethics review board of England (Health Research Authority of England, Project ID: 281957, Protocol number: KEAP1006, Research Ethics Committee Reference: 20/HRA/2158) as this is the standard for use of retrospective, de-identified data for research within the UK National Health Service (NHS).

Image analysis workflow

All OCT volumes were processed using validated deep learning models¹⁷. The deep-learning models were developed as a variation of the U-Net architecture. Briefly, for every pixel of an input image, the model outputs a likelihood estimate for a given feature. Models were trained for each of the morphological features that define geographic atrophy: RPE loss, overlying photoreceptor degeneration, and hypertransmission. A fourth model that segments RORA was also trained. RORA was defined as overlapping regions of RPE-loss, photoreceptor degeneration, and hypertransmission, i.e. any retinal areas with all three features present and therefore encompassing both iRORA and cRORA^15,22. The models were applied on every OCT volume scan and performed automated segmentation of each of the 49 b-scans per volume.

For each two-dimensional b-scan, the outputs of the models on a pixel level were converted into a one-dimensional binary label, representing the presence or absence of the feature per vertical column (A-scan). (Fig. 1a)¹⁷.

The automatic segmentation process assigns a probability for each of the features to each voxel within an OCT volume. Voxel spatial localisation was interpolated in relation to the central fovea, thereby standardising locations and enabling comparison between patients (Fig. 1b). Central foveal points were manually annotated by a reading centre expert grader at the Moorfields Reading Centre. The spatial localisation of feature probabilities was also considered by dividing the macula into each of the ETDRS regions, i.e. mean probability within each of the nine ETDRS regions (foveal, 4 parafoveal, and 4 perifoveal areas for the nasal, temporal, superior, and inferior regions) (Fig. 1c). The area of each segmented feature in square millimeters (mm²) was considered by applying optimised probability thresholds identified from the original model development and validation (Fig. 1d)¹⁷.

Predicting visual acuity

A random forest regression model was trained using the segmentation output (i.e. the raw probabilities at the voxel level for each feature (RPE-loss, photoreceptor degeneration, hypertransmission, and RORA) as input variables to predict cross-sectional VA in ETDRS letters. Three individual instances of the same model were trained: one with OCTs from the clinical trial FILLY cohort, one with OCTs from the MEH cohort, and one with OCTs from a combined Overall cohort. Regarding prediction of VA from qOCT biomarkers under standard-luminance conditions, the three models were evaluated for: (i) VA under RCT conditions i.e., FILLY; (ii) VA from real-world routine care, i.e., MEH; and (iii) VA from the Overall cohort.

In separate analyses, VA under low luminance conditions and Low Luminance Deficit (LLD), only available in the FILLY study cohort, were each considered as the dependent variable for separate instances of the random forest regression model.

The goodness of model fit was evaluated by comparing the regression coefficient (r²) and mean absolute error (MAE) computed from 100-fold bootstrapped random forest models trained on 80% of the bootstrap sample and evaluated on the other 20% split at the patient level. Feature importance was calculated by the method incorporated in the scikit-learn implementation of random forest regression, whereby features’ rankings for their variance reduction capability are averaged over the ensemble. Output was multiplied by 100 to give percentage contribution towards model performance. Analyses were carried out with Python (version 3.6.9) and summarised metrics in the text are expressed as median with ± interquartile range (IQR), unless otherwise specified, as the measurements were not normally distributed.

Meeting presentation

EURETINA 2021 Prize Paper Presentation—Artificial Intelligence session.

Results

Cohort demographics

The study cohort comprised 476 eyes with GA secondary to non-neovascular AMD from 325 patients that were undergoing a clinical trial of pegcetacoplan in GA secondary to AMD (n = 195) or routine clinical care at a large UK tertiary centre (n = 130) (Supplementary Fig. 1). Mean (SD) age was similar across groups, with 80.1 (7.6) and 79.1 (9.1) years in the FILLY and MEH cohorts, respectively. Overall median age was 80.5 ± IQR 11.1 years. In both cohorts, the majority were female (FILLY: 63.1%; MEH: 56.2%) (Table 1). No patient had a history of treatment with hydroxychloroquine or chloroquine.

Table 1 Cohort demographics. Mean, median, minimum value, maximum value, standard deviation (SD), and proportional distributions are shown for gender, ethnicity, and age of cohorts including patients with geographic atrophy secondary to non-neovascular AMD. Patients were recruited as part of an international, multicentre trial (FILLY) or retrospectively through real-world clinical care at Moorfields Eye Hospital (MEH). Direct comparison of ethnicity was infeasible as each cohort represented ethnicities with distinct categorical variables that could not be intuitively merged.

Full size table

Clinical features and GA segmentation

A wide distribution in standard VA was observed in both study cohorts: median standard BCVA was 57.5 ± IQR 30.0 in the FILLY cohort and median best recorded VA was 50.9 ± 35.0 ETDRS letters in the MEH cohort, giving an overall median standard VA 60.5 ± IQR 32.0 ETDRS letters. Similarly, the other visual function metrics available for the FILLY cohort were widely distributed. LLVA was 32.0 ± 28.0, and LLD 21.0 ± 21.0 (Table 2 and Supplementary Fig. 2a).

Table 2 Visual function and corresponding GA feature segmentation. Mean, standard deviation (SD), and distribution are shown for (a) visual function metrics (standard visual acuity [VA] in ETDRS [early treatment diabetic retinopathy study] letters], low-luminance visual acuity [LLVA], low-luminance deficit [LLD, difference between VA and LLVA]); (b) feature segmentations (total areas in square millimeters [mm²] of RPE-loss, photoreceptor degeneration, hypertransmission, and geographic atrophy; (c) proportion of each feature overlapping with central foveal region (0.79 mm²). Cohort data was collectively summarised (Overall) and sub-stratified by recruitment (FILLY trial or MEH [Moorfields Eye Hospital]). LLVA and LLD were not measured as part of routine clinical care at MEH and these metrics were not available for 2 persons of the FILLY sub-cohort—these were collectively referred to as missing.

Full size table

Automatic segmentation of the Overall cohort’s OCTs revealed that across all parameters analysed, the MEH cohort featured on average larger areas affected by GA (Table 2 and Supplementary Fig. 2b). Median (IQR) RORA was 6.03 mm2 (5.59) in the FILLY cohort versus 6.6 (9.32) in the MEH cohort. Similarly, mean areas affected by RPE loss, photoreceptor degeneration and hypertransmission were nominally larger in the MEH cohort compared to the FILLY cohort. The total area occupied by each feature was highly variable in the Overall cohort, resulting in a distribution with median values of: 7.82 ± 7.58 mm² RPE-loss, 14.4 ± 9.57 mm² photoreceptor degeneration, 9.23 ± 7.58 mm² hypertransmission, and 6.22 ± 6.49 mm² RORA (Table 2 and Supplementary Fig. 2b). Presence of RORA and its constituent features was also highly variable within each region, ranging from completely absent (0%) to confluent (> 99%). Indeed, RORA was wholly absent from the circular ETDRS foveal region (diameter 1 mm; 0.79 mm²) in 14.7% of all OCT volumes (70/476; 41/299 FILLY and 29/177 MEH) (Table 2).

Prediction of standard visual acuity using machine learning

For standard VA, the accuracy of using automatically segmented feature probabilities at the voxel level to predict VA was higher in the FILLY cohort (r² 0.46 MAE 10.2) than the real-life MEH cohort (r² 0.30 MAE 15.3). The random forest regression model trained on the two combined cohorts (Overall cohort) had r² 0.40, with mean absolute error (MAE) of 11.7 ETDRS letters (Table 3a). Normalised random forest feature importance, as a measure of the predictive value of the three constituent features of GA; RPE-loss, PDR, hypertransmission and their locations, was reported both on voxel-level heatmaps (Fig. 2a) and ETDRS-grid subfields (Table 3b). Predictive importance for each feature and its position was ranked, wherein RORA contributed the most (30.5% in Overall cohort; 30.9% in FILLY and 28% in MEH), followed by hypertransmission (25.1% Overall; 25.7% FILLY 26.8% MEH), RPE-loss (23.7% Overall; 24.8% FILLY 22.4% MEH), and photoreceptor degeneration (20.7% Overall; 18.8% FILLY 22.9% MEH) (Fig. 2a and Table 3b). Feature ranking was further considered by summing feature importance across ETDRS regions. Critically, features within the foveal region contributed the most (46.5% Overall; 47.1% FILLY; 46.5% MEH) despite being the smallest ETDRS region—0.78 mm² (Table 3b).

Table 3 Structure–function correlation between qOCT biomarkers of GA area and VA. (a) A random forest regression model was trained using the deep-learning segmentation output (i.e. the raw probabilities at the voxel level for each feature (RPE-loss, photoreceptor degeneration, hypertransmission, and RORA) as input variables to predict cross-sectional VA under standard luminance conditions, low-luminance VA, and low-luminance deficit in ETDRS letters. For VA under standard-luminance conditions, separate models were evaluated for: (i) BCVA under RCT conditions i.e., FILLY; (ii) VA from real-world routine care, i.e., MEH; and (iii) a third that combines the two. Model bootstrapped 100-fold with resultant regression coefficients (r²) and mean absolute error (MAE) shown. Importance of qOCT biomarker features in predicting (b) standard visual acuity and (c) low luminance visual acuity was queried using machine learning. Random forests modelling was used to evaluate value of the qOCT biomarkers RPE-loss, photoreceptor degeneration, hypertransmission, and RORA in predicting cross-sectional visual acuity under standard lighting conditions (Overall model). The resultant adjusted feature importance values were summed according to location within ETDRS region and multiplied by 100 to give the percentage contribution towards the model's performance. For example, RORA within the foveal region accounted for 16.8% of the model’s performance of r² 0.40 MAE 11.7 ETDRS letters for standard visual acuity.

Full size table

Prediction of low luminance visual acuity using machine learning

Additional visual parameters were available in the FILLY cohort. Here, random forest cross-sectional predictions of LLVA and LLD from Deep-Learning segmentation model output feature probabilities. Regression coefficients (r²) of 0.25 (MAE 12.1) and 0.25 (MAE 10.1) were observed for LLVA and LLD, respectively (Table 3a). Ranking of feature importance for LLVA as visualised in corresponding heatmaps was led by photoreceptor degeneration (38.9%), followed by hypertransmission (26.0%), RORA (21.4%), and RPE-loss (13.8%) (Fig. 2b and Table 3c). In contrast to standard VA, where feature importance was greatest for the foveal region, features at non-foveal regions were most important in predicting LLVA. When the VA and LLVA random forest regression models were repeated with one eye per patient, a similar pattern of feature importance was observed (Supplementary Table 1). The correlation between actual and predicted values for VA and LLVA and the corresponding Bland-Altman plots are presented in Supplementary Figure 3.

Discussion

Main findings

Our data demonstrate that both standard and low luminance VA in patients with GA secondary to non-neovascular AMD can be predicted using qOCT features that have been automatically segmented and quantified. This is the first time a structure–function relationship has been described for LLVA, creating a tool to assess its underlying physiological mechanisms. The cross-sectional predictive performance demonstrated for standard VA in the Overall cohort (r² = 0.40; Table 3a) is superior to that from similar efforts in neovascular AMD also using machine-learning based segmentation algorithms—r² = 0.11–0.21^23,24. An MAE of 11.7 ETDRS letters is a step towards the limit of predicting VA, as the test–retest repeatability for standard VA is 5–6 ETDRS letters in eyes without disease and thought to be even greater in eyes with GA^25,26.

Mapping structure and location using probability heatmaps

The algorithm described here can produce a “GA feature probability heatmap”, wherein the algorithm output of segmentation probabilities is interpolated in relation to the fovea and projected onto an en face fundus image. Probability maps allow us to instantly consider quantitative segmentation data across an entire image volume simultaneously through an intuitive graphic. Consideration of segmentation features as continuous variables (i.e. probability of feature presence) rather than binary variables (feature present or not present) obviates difficulties of assigning fixed thresholds. Landmarking each voxel to a reference point enables comparison of features over time and even between samples and patients (an obligate step for modelling).

Structure–function correlation of standard visual acuity

A stratified feature-region analysis based on the Overall cohort suggested that foveal RORA, followed by foveal RPE-loss, is the strongest predictor of standard VA in GA. The strong foveal contribution to predicting standard VA is largely unsurprising, reflecting the clinical observation that central GA is accompanied by poor VA⁷. Indeed, topographical analyses of fundus autofluorescence as well as SD-OCT have demonstrated that foveal sparing is an independent covariate of GA²⁷ and that VA is likely to be worse in eyes with definitive foveal involvement²⁸. To date, a correlation between total GA area and VA has not been readily apparent using fundus photography^{27,28,29,30,31}. Here, we present evidence to show SD-OCT imaging can also detect correlations between non-foveal features and VA. This may be because fundus photography only provides two-dimensional, en face representations of the retina, limiting in-depth assessment of the retinal layers and discrimination between histological subtypes of GA and their respective contribution to functional deficit³². Macular segmentation with SD-OCT thus presents a potentially more sensitive modality for GA and its sequelae on visual function.

Currently, GA is usually considered present when the lateral spread of RORA affects an area of atrophy ≥ 250 μm in diameter¹⁵. Using the algorithm described here allows evaluation of RORA as a continuous variable, and thresholding based on the extent of the lateral spread may be applied as a secondary step. This may facilitate the continuous monitoring and evaluation of GA in a research context, with a view to eventually develop clinical monitoring and preventative strategies.

Predicting standard visual acuity

Separate instances of the ML prediction model for standard VA were trained on two datasets: one originating from a Randomised Clinical Trial (the FILLY study) and one from real-life clinical practice at Moorfields Eye Hospital. The resulting regression coefficients showed a higher predictive performance in the RCT than the real-life cohort, which wasn’t unexpected. There are differences in the protocol for VA measurement between the two cohorts and VA data from real-life clinical practice is likely to contain higher levels of noise. These contributing factors play certainly a role in explaining the observed discrepancy. It is worth noting, however, that VA measurements at Moorfields Eye Hospital follow a standardised protocol delivered by trained staff with high levels of adherence and the inter-session repeatability for standard VA from MEH cohorts previously reported was comparable to that encountered in clinical trial settings²⁶. The smaller sample size of the MEH cohort may also have contributed to this discrepancy as increased diversity and heterogeneity within larger training datasets improve predictive performance of Random Forest prediction models.

Despite the difference in strength of correlation, the fact that Feature Importance ranking in the two cohorts with ML methods led to the identification of very similar anatomical features (RPE loss, RORA) and geographical location (foveal area) on OCT scans as more predictive of standard visual acuity in both cohorts, is novel and provides new insight into the pathophysiology of GA, especially in conjunction with the Feature Importance findings for LLVA reported below.

A third instance of the ML prediction model for standard VA was trained on combined datasets of the two cohorts and showed overall predictive performance higher than previously reported in relevant literature. It also confirmed Feature Importance ranking of individual cohorts, as depicted in the corresponding probability heatmap (Fig. 2a). The Overall prediction model is likely to produce more accurate predictions than the two separately trained models given the properties of the Random Forest ML methodology used in this study.

Random Forests are an ensemble learning method for both classification and regression. Ensemble learning models in ML yield better results when there is diversity among the models they combine. Random Forests apply bootstrapping to decrease variance of resultant models (thus preventing overfitting), without increasing bias (thus preventing underfitting)³³. This means that increasing data diversity leads to decreased sensitivity to noise and improved prediction accuracy. The Random Forest model developed on the combined datasets from an RCT and a real-life clinical practice cohort is thus more likely to generalise to other patient cohorts predicting VA values that are closer to the ‘true’ VA in each case. A further property of Random Forest modeling, known as ‘feature bagging’, involves random subset feature selection at each split, thus achieving the de-correlation of features in the training set³⁴. This ensures that Feature Importance ranking is a true representation of each feature’s independent contribution to the prediction, while preventing the inflation of importance of genuinely strong predictors, which could be caused by strong correlation among features. In training our models, we performed 100-fold bootstrap resampling.

Predicting low luminance visual acuity

LLVA is a simple, inexpensive, quick assessment with standard ophthalmic equipment and has a test–retest repeatability (between 1.6 and 1.9 logMAR [5–6.5 ETDRS letters]) comparable to standard luminance VA^35,36. It has been shown to correlate with microperimetry retinal sensitivities³⁷ and patient-reported night vision symptoms³⁸. LLVA is also an earlier clinical marker of change in central retinal function than standard VA. LLVA deterioration precedes deterioration in standard VA and thus predicts impending loss of foveal function. This is consistent with the structure–function correlation analyses presented here, as the most predictive features differed between the two measures of visual function: photoreceptor degeneration for LLVA and RPE-loss and RORA for standard VA. This aligns with previous observations that photoreceptor degeneration can precede RPE-loss and eventual RORA in GA, as well as with current understanding that RPE dysfunction is common to all, or at least most, early AMD^39,40. That is, photoreceptor cells are metabolically dependent on RPE and therefore degeneration arises secondarily to RPE dysfunction, which itself eventually atrophies. Furthermore, features within non-foveal areas were more predictive of LLVA than for standard VA. This may reflect the very high density of cones within the foveola vs. the para-fovea. Hence redundancy in the foveola may ostensibly mask early standard luminance vision loss, yet low luminance visual acuity is affected earlier coinciding with photoreceptor degeneration in the para-foveal area due to lack of redundancy.

This might also support the hypothesis that low light sensitivity is mediated by a circuit function of horizontal and amacrine cells within the plexiform layers and thus a larger area of preserved central macula is required for LLVA⁴¹.

Limitations

VA is the most commonly used functional measure to evaluate the visual system. It is widely accepted in clinics and by regulatory authorities as a key measure of visual function and represents the gold standard by which the efficacy of treatment is judged. It correlates with quality of life and defines key functional thresholds, such as eligibility for driving and for sight impairment registration. However, VA change over time is non-linear, can improve from one timepoint to the next, and does not wholly capture the sequelae of GA on visual function, as it largely represents central acuity of the fovea⁴². Other functional manifestations include parafoveal functions such as dark adaptation, reading speed, face recognition, and perimetry. Foveal-sparing disease (i.e. not affecting VA) can impact other visual functions (including reading speed, contrast sensitivity, fixation, and VFQ-25)⁴³. Changes in visual function can even occur prior to VA deterioration. Thus, other markers of vision-related performance in everyday life should be considered to complement VA in patients with GA. This study only considers cross-sectional VA values, wherein future enquiry would benefit from consideration of future VA and whether that can be predicted.

Conclusion

Our results demonstrate the utility of automatically segmented imaging biomarkers in predicting visual function. This is an important step towards standardising care by reliably predicting ‘true’ visual function from refined imaging biomarkers enabled by AI and may contribute to the development of point-of-care decision-aid systems for personalised ophthalmology. Here we have used this tool to further our insight into the (otherwise unknown) underlying physiological mechanism of LLVA and thereby progression from Intermediate AMD to GA and its subtypes.

Data availability

FILLY Study Data: The FILLY study dataset, which was used under license for the current study, is not publicly available due to licensing restrictions. Moorfields Data: Mooorfields data analysed during the current study is not publicly available at present due to information governance policies pertaining to real-life clinical data. The de-identified Moorfields dataset will become available to the scientific community within the Data Governance framework of the National HDR UK INSIGHT Hub (https://www.insight.hdrhub.org/). Please address inquiries for updates and data access requests to the corresponding author (k.balaskas@nhs.net).

References

Schmitz-Valckenberg, S. et al. GEOGRAPHIC ATROPHY: Semantic considerations and literature review. Retina 36, 2250–2264 (2016).
Article PubMed PubMed Central Google Scholar
Gass, J. D. M. Drusen and disciform macular detachment and degeneration. Arch. Ophthalmol. https://doi.org/10.1001/archopht.1973.01000050208006 (1973).
Article PubMed Google Scholar
Rodrigues, I. A. et al. Defining a minimum set of standardized patient-centered outcome measures for macular degeneration. Am. J. Ophthalmol. 168, 1–12 (2016).
Article PubMed Google Scholar
Sunness, J. S. et al. Low luminance visual dysfunction as a predictor of subsequent visual acuity loss from geographic atrophy in age-related macular degeneration. Ophthalmology 115(1480–8), 1488.e1–2 (2008).
Google Scholar
Wood, L. J., Jolly, J. K., Buckley, T. M., Josan, A. S. & MacLaren, R. E. Low luminance visual acuity as a clinical measure and clinical trial outcome measure: A scoping review. Ophthalmic Physiol. Opt. 41, 213–223 (2021).
Article PubMed Google Scholar
Danis, R. P., Lavine, J. A. & Domalpally, A. Geographic atrophy in patients with advanced dry age-related macular degeneration: Current challenges and future prospects. Clin. Ophthalmol. 9, 2159–2174 (2015).
Article CAS PubMed PubMed Central Google Scholar
Keenan, T. D. et al. Progression of geographic atrophy in age-related macular degeneration: AREDS2 Report number 16. Ophthalmology 125, 1913–1928 (2018).
Article PubMed Google Scholar
Sunness, J. S. Face fields and microperimetry for estimating the location of fixation in eyes with macular disease. J. Vis. Impair. Blind. 102, 679–689 (2008).
Article PubMed PubMed Central Google Scholar
Stockman, A. & Sharpe, L. T. Into the twilight zone: The complexities of mesopic vision and luminous efficiency. Ophthalmic Physiol. Opt. 26, 225–239 (2006).
Article PubMed Google Scholar
Zele, A. J. & Cao, D. Vision under mesopic and scotopic illumination. Front. Psychol. 5, 1594 (2014).
PubMed Google Scholar
Liao, D. S. et al. Complement C3 inhibitor pegcetacoplan for geographic atrophy secondary to age-related macular degeneration: A randomized phase 2 trial. Ophthalmology 127, 186–195 (2020).
Article PubMed Google Scholar
Kuppermann, B. D. et al. Phase 2 study of the safety and efficacy of brimonidine drug delivery system (BRIMO DDS) generation 1 in patients with geographic atrophy secondary to age-related macular degeneration. Retina 41, 144–155 (2021).
Article CAS PubMed Google Scholar
Allingham, M. J., Mettu, P. S. & Cousins, S. W. Elamipretide, a mitochondrial-targeted drug, for the treatment of vision loss in dry AMD with high risk drusen: Results of the Phase 1 ReCLAIM Study. Ethnicity. 24, 60–60 (2019).
Google Scholar
Holz, F. G. et al. Imaging protocols in clinical studies in advanced age-related macular degeneration: Recommendations from classification of atrophy consensus meetings. Ophthalmology 124, 464–478 (2017).
Article PubMed Google Scholar
Sadda, S. R. et al. Consensus definition for atrophy associated with age-related macular degeneration on OCT: Classification of atrophy report 3. Ophthalmology 125, 537–548 (2018).
Article PubMed Google Scholar
Guymer, R. H. et al. Incomplete retinal pigment epithelial and outer retinal atrophy in age-related macular degeneration: Classification of atrophy meeting report 4. Ophthalmology 127, 394–409 (2020).
Article PubMed Google Scholar
Zhang, G. et al. Clinically relevant deep learning for detection and quantification of geographic atrophy from optical coherence tomography: A model development and external validation study. Lancet Digit. Health. 3, e665–e675 (2021).
Article PubMed Google Scholar
Pfau, M. et al. Determinants of cone and rod functions in geographic atrophy: AI-based structure-function correlation. Am. J. Ophthalmol. 217, 162–173 (2020).
Article PubMed Google Scholar
Sayegh, R. G. et al. Geographic atrophy and foveal-sparing changes related to visual acuity in patients with dry age-related macular degeneration over time. Am. J. Ophthalmol. 179, 118–128 (2017).
Article PubMed Google Scholar
Gallo, V. et al. STrengthening the Reporting of OBservational studies in epidemiology—Molecular epidemiology (STROBE-ME): An extension of the STROBE statement. Mutagenesis 27, 17–29 (2011).
Article PubMed Google Scholar
Steinle, N. & Hamdani, M. Evaluation of baseline factors on progression in a large phase-2 clinical trial for geographic atrophy (FILLY Study). Investig. Ophthalmol. Vis. Sci. 60, 973–973 (2019).
Google Scholar
Göbel, A. P., Fleckenstein, M., Schmitz-Valckenberg, S., Brinkmann, C. K. & Holz, F. G. Imaging geographic atrophy in age-related macular degeneration. Ophthalmologica 226, 182–190 (2011).
Article PubMed Google Scholar
Fu, D. J. et al. Predicting incremental and future visual change in neovascular age-related macular degeneration using deep learning. Ophthalmol. Retina. https://doi.org/10.1016/j.oret.2021.01.009 (2021).
Article PubMed Google Scholar
Schmidt-Erfurth, U. et al. Machine learning to analyze the prognostic value of current imaging biomarkers in neovascular age-related macular degeneration. Ophthalmol. Retina. 2, 24–30 (2018).
Article PubMed Google Scholar
Siderov, J. & Tiu, A. L. Variability of measurements of visual acuity in a large eye clinic. Acta Ophthalmol. Scand. 77, 673–676 (1999).
Article CAS PubMed Google Scholar
Patel, P. J., Chen, F. K., Rubin, G. S. & Tufail, A. Intersession repeatability of visual acuity scores in age-related macular degeneration. Investig. Ophthalmol. Vis. Sci. 49, 4347–4352 (2008).
Article Google Scholar
Bagheri, S. et al. Percentage of foveal vs total macular geographic atrophy as a predictor of visual acuity in age-related macular degeneration. J. Vitreoretin. Dis. 3, 278–282 (2019).
Article PubMed PubMed Central Google Scholar
Lindner, M. et al. Combined fundus autofluorescence and near infrared reflectance as prognostic biomarkers for visual acuity in foveal-sparing geographic atrophy. Investig. Ophthalmol. Vis. Sci. 58, 61–67 (2017).
Article Google Scholar
You, J. I., Kim, E. S., Yu, S.-Y., Kim, K. et al. Correlation between topographic progression of geographic atrophy and visual acuity changes. 2020. https://www.researchsquare.com/article/rs-68760/latest.pdf. Accessed 24 February 2022.
Heier, J. S. et al. Visual function decline resulting from geographic atrophy: Results from the chroma and spectri phase 3 trials. Ophthalmol. Retina. 4, 673–688 (2020).
Article PubMed Google Scholar
Liefers, B. et al. A deep learning model for segmentation of geographic atrophy to study its long-term natural history. Ophthalmology 127, 1086–1096 (2020).
Article PubMed Google Scholar
Liefers, B. et al. Quantification of key retinal features in early and late age-related macular degeneration using deep learning. Am. J. Ophthalmol. 226, 1–12 (2021).
Article PubMed Google Scholar
Ho, T. K. Random decision forests. In Proceedings of the 3rd International Conference on Document Analysis and Recognition, Montreal, QC, 14–16 August 1995. 278–282 (1995).
Ho, T. K. A data complexity analysis of the comparative advantages of decision forest constructors. Pattern Anal. Appl. 5(2), 102–112 (2002).
Article MathSciNet MATH Google Scholar
Lovie-Kitchin, J. E. & Brown, B. Repeatability and intercorrelations of standard vision tests as a function of age. Optom. Vis. Sci. https://doi.org/10.1097/00006324-200008000-00008 (2000).
Article PubMed Google Scholar
Pluháček, F. & Siderov, J. Mesopic visual acuity is less crowded. Graefes Arch. Clin. Exp. Ophthalmol. 256, 1739–1746 (2018).
Article PubMed Google Scholar
Wu, Z., Ayton, L. N., Luu, C. D. & Guymer, R. H. Longitudinal changes in microperimetry and low luminance visual acuity in age-related macular degeneration. JAMA Ophthalmol. 133, 442–448 (2015).
Article PubMed Google Scholar
Wu, Z., Guymer, R. H. & Finger, R. P. Low luminance deficit and night vision symptoms in intermediate age-related macular degeneration. Br. J. Ophthalmol. 100, 395–398 (2016).
Article PubMed Google Scholar
Bird, A. Role of retinal pigment epithelium in age-related macular disease: A systematic review. Br. J. Ophthalmol. https://doi.org/10.1136/bjophthalmol-2020-317447 (2020).
Article PubMed Google Scholar
Bird, A. C., Phillips, R. L. & Hageman, G. S. Geographic atrophy: A histopathological assessment. JAMA Ophthalmol. 132, 338–345 (2014).
Article PubMed PubMed Central Google Scholar
Owsley, C., Clark, M. E., Huisingh, C. E., Curcio, C. A. & McGwin, G. Visual function in older eyes in normal macular health: Association with incident early age-related macular degeneration 3 years later. Investig. Opthalmol. Vis. Sci. https://doi.org/10.1167/iovs.15-18962 (2016).
Article Google Scholar
Csaky, K. et al. Report from the NEI/FDA endpoints workshop on age-related macular degeneration and inherited retinal diseases. Investig. Ophthalmol. Vis. Sci. 58, 3456–3463 (2017).
Article Google Scholar
Burguera-Giménez, N. et al. Multimodal evaluation of visual function in geographic atrophy versus normal eyes. Clin. Ophthalmol. 14, 1533–1545 (2020).
Article PubMed PubMed Central Google Scholar

Download references

Funding

Apellis Pharmaceuticals (Waltham, Massachusetts, United States) provided financial support and critical review of manuscript for publication.

Author information

Authors and Affiliations

NIHR Biomedical Research Centre at Moorfields Eye Hospital NHS Foundation Trust, UCL Institute of Ophthalmology, Moorfields Reading Centre and Clinical AI Hub, 162 City Rd, London, EC1V 2PD, UK
Konstantinos Balaskas, S. Glinton, L. Faes, B. Liefers, G. Zhang, N. Pontikos, R. Struyven, S. K. Wagner, P. J. Patel, P. A. Keane & D. J. Fu
Division of Epidemiology and Clinical Applications, National Eye Institute, National Institutes of Health, Bethesda, MD, USA
T. D. L. Keenan
Apellis Pharmaceuticals, Inc, Waltham, MA, USA
A. McKeown
Department of Ophthalmology, Erasmus University Medical Center, Rotterdam, The Netherlands
B. Liefers

Authors

Konstantinos Balaskas
View author publications
You can also search for this author in PubMed Google Scholar
S. Glinton
View author publications
You can also search for this author in PubMed Google Scholar
T. D. L. Keenan
View author publications
You can also search for this author in PubMed Google Scholar
L. Faes
View author publications
You can also search for this author in PubMed Google Scholar
B. Liefers
View author publications
You can also search for this author in PubMed Google Scholar
G. Zhang
View author publications
You can also search for this author in PubMed Google Scholar
N. Pontikos
View author publications
You can also search for this author in PubMed Google Scholar
R. Struyven
View author publications
You can also search for this author in PubMed Google Scholar
S. K. Wagner
View author publications
You can also search for this author in PubMed Google Scholar
A. McKeown
View author publications
You can also search for this author in PubMed Google Scholar
P. J. Patel
View author publications
You can also search for this author in PubMed Google Scholar
P. A. Keane
View author publications
You can also search for this author in PubMed Google Scholar
D. J. Fu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.B., P.A.K., D.J.F.: research design. K.B., S.G., T.K., L.F., B.L., G.Z., S.W., P.K., D.J.F.: Data analysis, interpretation, research execution. All authors contributed towards the preparation of the manuscript and approved the final submitted version. Corresponding author is solely responsible for managing communication between co-authors; that all authors are included in the author list; order has been agreed by all authors; and that all authors are aware that the paper was submitted.

Corresponding author

Correspondence to Konstantinos Balaskas.

Ethics declarations

Competing interests

NP: Moorfields Eye Charity Career Development Award (R190031A), equity owner, Phenopolis Ltd. DJF: Consulting for Abbvie, Allergan, DeepMind. LF: Nothing to declare. GZ: No conflicts of interest. BL: No conflicts of interest. SG: Moorfields Eye Charity Grant (GR001003), Wellcome Trust Grant (206619_Z_17_Z). SKW: MRC Clinical Research Training Fellowship (MR/T000953/1). RS: No conflicts of interest. PAK: Moorfields Eye Charity Career Development Award (R190028A), UK Research & Innovation Future Leaders Fellowship (MR/T019050/1); Consulting for DeepMind, Roche, Novartis, Apellis and BitFount; equity owner in Big Picture Medical; speaker fees from Heidelberg Engineering, Topcon, Allergan, and Bayer. KB: Speaker fees from Novartis, Bayer, Alimera, Allergan and Heidelberg, consulting Novartis and Roche and research support from Apellis, Novartis and Bayer. PJP: Speaker fees from Bayer, Heidelberg, Roche and Topcon, consulting Bayer, Novartis, Oxford Bioelectronics and Roche and research support from Bayer. AM: Employee of Apellis. TK: No conflicts of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Supplementary Table 1.

Supplementary Figure 1.

Supplementary Figure 2.

Supplementary Figure 3.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Balaskas, K., Glinton, S., Keenan, T.D.L. et al. Prediction of visual function from automatically quantified optical coherence tomography biomarkers in patients with geographic atrophy using machine learning. Sci Rep 12, 15565 (2022). https://doi.org/10.1038/s41598-022-19413-z

Download citation

Received: 22 June 2022
Accepted: 29 August 2022
Published: 16 September 2022
DOI: https://doi.org/10.1038/s41598-022-19413-z

This article is cited by

Artificial intelligence in age-related macular degeneration: state of the art and recent updates
- Emanuele Crincoli
- Riccardo Sacconi
- Giuseppe Querques
BMC Ophthalmology (2024)
Defining the structure–function relationship of specific lesions in early and advanced age-related macular degeneration
- Ting Fang Tan
- Chun Lin Yap
- Anna Cheng Sim Tan
Scientific Reports (2024)
Artificial intelligence in retinal disease: clinical application, challenges, and future directions
- Malena Daich Varela
- Sagnik Sen
- Michel Michaelides
Graefe's Archive for Clinical and Experimental Ophthalmology (2023)
An AI model to estimate visual acuity based solely on cross-sectional OCT imaging of various diseases
- Satoru Inoda
- Hidenori Takahashi
- Yasuo Yanagi
Graefe's Archive for Clinical and Experimental Ophthalmology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Methods

Study design

Study cohort

Image analysis workflow

Predicting visual acuity

Meeting presentation

Results

Cohort demographics

Clinical features and GA segmentation

Prediction of standard visual acuity using machine learning

Prediction of low luminance visual acuity using machine learning

Discussion

Main findings

Mapping structure and location using probability heatmaps

Structure–function correlation of standard visual acuity

Predicting standard visual acuity

Predicting low luminance visual acuity

Limitations

Conclusion

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links