Deep learning visual field global index prediction with optical coherence tomography parameters in glaucoma patients

Kim, Dongbock; Seo, Sat Byul; Park, Seong Joon; Cho, Hyun-kyung

doi:10.1038/s41598-023-43104-y

Download PDF

Article
Open access
Published: 25 October 2023

Deep learning visual field global index prediction with optical coherence tomography parameters in glaucoma patients

Dongbock Kim¹^na1,
Sat Byul Seo¹^na1,
Seong Joon Park¹ &
…
Hyun-kyung Cho^2,3

Scientific Reports volume 13, Article number: 18304 (2023) Cite this article

735 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

The aim of this study was to predict three visual filed (VF) global indexes, mean deviation (MD), pattern standard deviation (PSD), and visual field index (VFI), from optical coherence tomography (OCT) parameters including Bruch's Membrane Opening-Minimum Rim Width (BMO-MRW) and retinal nerve fiber layer (RNFL) based on a deep-learning model. Subjects consisted of 224 eyes with Glaucoma suspects (GS), 245 eyes with early NTG, 58 eyes with moderate stage of NTG, 36 eyes with PACG, 57 eyes with PEXG, and 99 eyes with POAG. A deep neural network (DNN) algorithm was developed to predict values of VF global indexes such as MD, VFI, and PSD. To evaluate performance of the model, mean absolute error (MAE) was determined. The MAE range of the DNN model on cross validation was 1.9–2.9 (dB) for MD, 1.6–2.0 (dB) for PSD, and 5.0 to 7.0 (%) for VFI. Ranges of Pearson’s correlation coefficients were 0.76–0.85, 0.74–0.82, and 0.70–0.81 for MD, PSD, and VFI, respectively. Our deep-learning model might be useful in the management of glaucoma for diagnosis and follow-up, especially in situations when immediate VF results are not available because VF test requires time and space with a subjective nature.

Estimating visual field loss from monoscopic optic disc photography using deep learning model

Article Open access 03 December 2020

Predicting the central 10 degrees visual field in glaucoma by applying a deep learning algorithm to optical coherence tomography images

Article Open access 26 January 2021

Deep learning approaches to predict 10-2 visual field from wide-field swept-source optical coherence tomography en face images in glaucoma

Article Open access 05 December 2022

Introduction

Glaucoma is caused by injuries to retinal ganglion cells (RGC) and their axons, leading to retinal nerve fiber layer (RNFL) deficit and neuroretinal rim (NRR) thinning that can result in visual field (VF) defects¹. Measurement of peripapillary RNFL using optical coherence tomography (OCT) scan is a broadly accepted method for the quantitative assessment of structural damage in glaucoma². Standard automated perimetry (SAP) is the standard method to detect and monitor functional VF defect in the management of glaucoma^3,4. However, there are some intrinsic limitations of a VF test. First of all, this test has a subjective nature. Moreover, it has a high intra-subject variability (high test-to-test variability), a lengthy test time, and a necessity for a designated place to perform SAP^5,6. Structure–function relationship is important in the understanding and management of glaucoma^7,8,9,10. Detectable structural changes usually precede VF functional loss at each individual degree^{10,11,12,13,14}.

Recently, spectral-domain OCT provides Bruch’s membrane opening-minimum rim width (BMO-MRW) as a new parameter in addition to conventional peripapillary RNFL. BMO-MRW measures the shortest length from the inner opening of BMO to the internal limiting membrane (Fig. 1A), which has been introduced for assessing optic nerve head^{15,16,17,18,19}. BMO-MRW provides more accurate evaluation of the NRR than conventional optic disc inspection^{15,16,17,18,19,20}. Previous studies have demonstrated that BMO-MRW showed superior diagnostic ability in glaucoma to previously used parameters of NRR^21,22,23. BMO-MRW has also been reported to show a better structure–function relationship than other NRR parameters using conventional confocal scanning laser ophthalmoscopy or peripapillary RNFL^23,24.

We have previously reported a high diagnostic performance in distinguishing early normal-tension glaucoma (NTG) from glaucoma suspect (GS) (AUC, 0.966) based on a deep learning model using OCT parameters of BMO-MRW, peripapillary RNFL, and color classification of RNFL²⁵. Interestingly, BMO-MRW, as a single parameter, provided a higher diagnostic performance (AUC: 0.959) than RNFL alone (AUC: 0.914) and RNFL with its color code classification (AUC: 0.934)²⁵. Moreover, BMO-MRW alone showed similar diagnostic performance to that of all three OCT parameters combined. These results suggest that BMO-based optic disc assessment might be a better evaluation for different aspects of the optic disc than conventional disc assessments in the diagnosis of glaucoma.

Previous structure–function studies have used deep learning models to predict global VF indexes including mean deviation (MD) from OCT-derived images such as RNFL thickness maps^26,27. Other previous studies have predicted pointwise threshold of VF from OCT-derived image scans like peripapillary RNFL or macular ganglion cell complex thickness maps^28,29,30,31. However, none of these previous studies included any information regarding BMO-MRW. Moreover, none predicted all three VF global indexes of MD, pattern standard deviation (PSD), and visual field index (VFI) from OCT-derived images or maps. Each global index of VF test has its own advantage, and therefore, only one index cannot tell all the aspects of VF test results. Actual figures of global indexes of VF could provide an outline of VF summary, which might be clinically useful in the management of glaucoma including diagnosis and detection of progression.

Thus, the aim of the present retrospective cross-sectional study was to predict three VF global indexes using deep-learning model from OCT-derived parameters of BMO-MRW and RNFL. We intended to assess the usefulness of this deep-learning model as a reference in glaucoma clinic. It might be beneficial in situations when immediate VF results are not available since VF test takes time and cooperation of the patient. We applied a deep-learning model to integrate all data available from spectral-domain OCT images to predict VF global indexes, which might be challenging for general physicians.

Results

Baseline characteristics of subjects

A total of 720 eyes (720 patients) with glaucoma and glaucoma suspect (GS) were included in the final analysis. Glaucoma diagnosis included early normal-tension glaucoma (NTG), moderate stage of NTG, pseudo exfoliation glaucoma (PEXG), primary angle closure glaucoma (PACG), and primary open angle glaucoma (POAG). The mean age of glaucoma patients was 53.7 ± 13.3 (mean ± standard deviation) years. Females accounted for 46% (328/720). Of all patients, 8.3% (60/720) had a family history of glaucoma. Baseline spherical equivalent (SE) was − 1.8 ± 2.9 diopters. Baseline intraocular pressure (IOP) was 15.6 ± 4.1 mmHg with central corneal thickness (CCT) of 542.0 ± 42.7 um. Baseline MD was − 4.5 ± 5.8 dB, PSD was 5.3 ± 4.2 dB, and VFI was 88.6 ± 17.0 dB. Baseline characteristics including VF global indexes for the training set and test set, respectively, are summarized in Table 1. Baseline OCT parameters of BMO-MRW and RNFL are demonstrated in Table 2.

Table 1 Baseline characteristics of included glaucoma patients.

Full size table

Table 2 Baseline OCT parameters in glaucoma patients.

Full size table

Workflow of deep learning model for predicting visual field global indexes

We aimed to estimate three VF global indexes, MD, PSD, and VFI among parameters of BMO-MRW and RNFL based on deep learning. The main workflow of our deep learning model for predicting visual field indexes is as follows. First, we extracted numerical parameters of BMO-MRW and RNFL from OCT scan images using Heidelberg licensed software and included the age of patients in the dataset to train and test the deep neural network (DNN) model. A total of 720 eyes from 720 patients were used. Sixteen sub-parameters were used as input parameters in the dataset. Three DNN models were built and trained independently to predict the value of each VF global index: MD, PSD, and VFI. These models had three hidden layers and a single output layer. Exponential linear unit (ELU) was used as activation function. Batch normalization was applied after each hidden layer. The three models were constructed with the same structure. The model for each VF global index (MD, PSD, and VFI) had minor differences in the number of nodes and the degree of regulation in detail. To improve model performance, we applied fivefold cross validation and tuned model hyper-parameters such as learning rate, the degree of regulation, the number of layers, and the number of nodes in each layer. In each fold, the validation set consisted of 137 eyes (137 patients) and the training set consisted of 547 eyes (537 patients). We calculated the MAE in the validation set for each VF global index. To evaluate predicting performance, mean absolute error (MAE), Pearson’s correlation coefficient, and ${R}^{2}$ of each model were calculated, and the results showed in Table 4. The overview of the workflow of each model is illustrated in Fig. 1A. Figure 1B shows the detailed structure of the DNN model.

Predictive performances of DNN and ML models

To evaluate performance of prediction for our DNN model, we calculated MAE for each VF global index with the validation set. The loss curves of the DNN model for predicting VF global indexes with increasing number of epochs was plotted in Fig. 2A–C. With these loss functions of each index, it was verified that the performance of the DNN model was stable and robust. We also trained other machine learning (ML) models: Random Forest, extreme gradient boosting (XGBoost), and support vector machine (SVM) using Radial Basis Function (RBF) kernel to compare their performances with the DNN model. Results of MAE comparison of DNN and ML models for each VF global index on a fivefold cross validation are demonstrated in Fig. 2D–F. The DNN model showed the lowest MAE VF global indexes. First, the MAE of MD in each model was as follows. The MAE of MD ranged from 1.9 to 2.9 dB for our DNN model, 2.2–2.9 dB for SVM using RBF kernel, 2.3–3.0 dB for Random Forest, and 2.4–3.2 dB for XGBoost. The MAE range of PSD was 1.6–2.0 dB for DNN, 1.8–2.3 dB for XGBoost, 1.8–2.2 dB for Random Forest, and 1.7–2.3 dB SVM using RBF kernel. The MAE of VFI in each model was as follows. The MAE of VFI ranged from 5.0 to 7.0% (6.3–6.9% for Random Forest, 6.5–7.4% for XGBoost, and 6.5–8.0% for SVM using RBF kernel). These results are summarized in Table 3.

Table 3 The MAE for DNN model with other machine learning algorithms.

Full size table

Comparison of actual and DNN predicted values of VF global indexes

Statistical analysis was proceeded to compare actual data of each VF global index with data predicted by the DNN model. Figure 2G–I show scatter plots of predicted and actual values of three indexes (MD, PSD, and VFI) in the dataset. Pearson's correlation coefficient and ${R}^{2}$ were also measured. Between predicted values and actual values of MD in the fivefold cross validation, Pearson's correlation coefficient was in the range of 0.76 to 0.85 ($p<0.001)$. In the PSD estimation, Pearson's correlation coefficient ranged from 0.74 to 0.82 ($p<0.001)$. In VFI prediction, the Pearson's correlation coefficient ranged from 0.70 to 0.81 ($p<0.001)$. In addition, ${R}^{2}$ ranges were 0.59–0.65, 0.58–0.66, and 0.58–0.65 for MD, VFI, and PSD, respectively. Statistical results of the DNN on five-fold cross validation are summarized in Table 4.

Table 4 Statistical results of the DNN model on five-fold cross validation.

Full size table

Predictive performances of DNN model according to OCT- derived parameters

We evaluated performances of DNN model for predicting VF index (MD) according to the OCT-based parameters respectively: BMO-MRW alone, RNFL alone, and both BMO-MRW and RNFL combined. The mean absolute error (MAE) of the DNN model based on the parameters of BMO-MRW alone and RNFL alone were 2.72 dB and 2.87 dB, respectively. The performance of the DNN model based on both BMO-MRW and RNFL combined was 2.28 dB of MAE, which showed the smallest value.

Deep learning predictive performance analysis according to glaucoma severity

To evaluate the predictive performances of the DNN model according to glaucoma severity, we measured absolute errors of the actual value and predicted value of MD for each eye. Figure 3A shows a scatter plot of absolute error showing the prediction performance according to the actual MD values of each eye. The mean absolute error (MAE) of the DNN model was $2.19\pm 1.84$ dB in the test set as shown in Fig. 3B. The prediction performance for each glaucoma severity in the test set is as follows. The MAE for unaffected control (NN; MD ≥ 0.0) was $1.76\pm 1.31$ dB, and the mild glaucoma grade (G1; − 6.0 < MD < 0.0) showed its MAE was $2.05\pm 1.98$ dB. The MAE for moderate glaucoma grade (G2; − 12.0 < MD ≤ − 6.0) class was $2.17\pm 0.87$ dB, and the severe glaucoma grade (G3; MD ≤ − 12.0) was it MAE was $3.58\pm 2.75$ dB. It is noticeable that the MAE of the early stage of glaucoma is the smallest among all the stages of glaucoma.

Discussion

To our knowledge, the present study was the first to predict all of VF global indexes including MD, PSD, and VFI from OCT-derived parameters of BMO-MRW, a new parameter, and RNFL using a deep learning model. We found that the performance of our DNN model was outstanding along with other machine-learning models in predicting VF global indexes. For all three indexes, the DNN model showed the best performance. We also found that there was a strong correlation between each predicted value and the actual value.

The availability of BMO-MRW obtained from spectral-domain OCT has grown for clinicians. It provides some advantages when compared to the previous standard morphometric optic nerve head analysis confocal scanning laser tomographic measurements^21,22,23. Compared to existing ophthalmic examinations, BMO-MRW allows for a more precise geometric assessment of the neuroretinal rim (NRR)^15,16,17,20. It has been shown that BMO-MRW is advantageous in providing an accurate reflection of the amount of neural tissue present in the optic nerve³². Our previous study reported a high diagnostic performance in discriminating early normal-tension glaucoma (NTG) from glaucoma suspect (GS) (AUC, 0.966) based on a deep learning model using OCT parameters of BMO-MRW, peripapillary RNFL, and color classification of RNFL²⁵. Interestingly, BMO-MRW, as a single parameter, provided a higher diagnostic performance (AUC: 0.959) than RNFL alone (AUC: 0.914) and RNFL with its color code classification (AUC: 0.934)²⁵. Moreover, BMO-MRW alone showed similar diagnostic performance to that of all three OCT parameters combined. These results suggest that BMO-based optic disc assessment might be a better evaluation for different aspects of the optic disc than conventional disc assessments in the diagnosis of glaucoma. These findings suggest that BMO-MRW is clinically useful in the diagnosis of glaucoma. It might be even better than conventional RNFL. Integrating assessment of BMO-MRW and RNFL is beneficial for better diagnosis of glaucoma based on these findings. However, the integration of these two different parameters is a complex and challenging for human beings, including general physicians other than glaucoma specialists. This is where the latest technology of artificial intelligence can be useful. Recent reports indicate that machine-learning classifiers can aid in clinical practice and efficiently enhance glaucoma diagnosis for general ophthalmologists in the primary eye care setting when there is a lack of glaucoma specialists³³. The deep learning model can provide rapid diagnostic results in the clinics after inputting ophthalmic examination data without the need for a multi-day analysis. Ultimately, the decision to treat glaucoma is up to the physician, but the deep learning model can suggest a preliminary diagnosis for reference³⁴. Moreover, the DNN diagnostic model is more cost-effective clinically easy to access compared to other imaging-based CNN diagnostic programs that require costly equipment, such as workstations with GPUs and take several days to produce results.

A previous study by Park et al.²⁹ has predicted VF regional thresholds with deep learning based on inception V3 using combined OCT images of macular ganglion cell-inner plexiform layer (mGCIPL) and peripapillary pRNFL thicknesses maps. They conducted pointwise estimation of VF for a regional analysis. With the deep learning method, the root mean squared error (RMSE) of the entire VF area for all patients was 4.79 $\pm$ 2.56 dB (mean $\pm$ standard deviation). In our study, we estimated global VF. The MAE of MD was found to be 2.57 $\pm$ 0.33 dB. Our results showed lower MAE, suggesting better results in predicting the entire VF threshold. Hemelings et al.³¹ have conducted a study to predict VF MD and 52 threshold values based on a customized CNN model with Xception using peripapillary RNFL map and scanning laser ophthalmoscopy en face images. The MAE for MD estimation the deep learning model was 2.89 dB (range, 2.50–3.30 dB).

In our study, the MAE for MD prediction was 2.57 dB (range, 1.95–2.87 dB). Therefore, the present study showed lower MAE, indicating better results for predicting the entire VF threshold. Christopher et al.²⁶ have developed a deep learning system based on ResNet50 to predict MD, PSD, and mean VF sectoral pattern deviation (PD) using image data of RNFL thickness map, RNFL enface image, and confocal scanning laser ophthalmoscopy image. In MD estimation, the deep learning model with RNFL enface image achieved the highest performance with ${R}^{2}$ of 0.70 (range, 0.64–0.74) and MAE of 2.5 dB (range, 2.3–2.7 dB). In PSD estimation, ${R}^{2}$ was 0.61(range, 0.55–0.66) and MAE was 1.5 dB (range, 1.4–1.6 dB). Our deep learning model, which utilized combined parameters of RNFL and BMO-MRW, demonstrated similar performance to other previous studies. It could also predict additional VF global indexes such as VFI. Results of our study were highly comparable to those of previous research, thus having a significant meaning. Yu et al. have used 3D CNN model to estimate VF global indexes of MD and VFI, but not all three indexes from combining macula and optic disc OCT scans in healthy, glaucoma suspect, and glaucoma patients²⁷. Each global index of VF test has its own advantage, and thus, only one index cannot tell all the aspects of the entire VF results. For example, MD is useful to estimate the overall stage of glaucoma. On the other hand, PSD reflects the focal VF defect in an early stage of glaucoma, which is beneficial in the diagnosis of early glaucoma.

Using the deep learning model based on macular and optic nerve head scans, the MAE was 1.57 dB for MD and 2.7% for VFI. Yu et al. have shown great results with a larger number of images. However, their study included multiple visit data from one patient to have a larger number of images. We used single visit data from each subject, which might be more independent and reliable. Moreover, we used data extracted from OCT using lighter and cost-effective model to predict VF global indexes. Our results were quite comparable to results of the study by Yu et al. using images from OCT with a more complicated model. Results of VFI seemed to be better in the study by Yu et al. (2.7 dB for VFI). However, considering VFI percentage in our study, results were substantially good. The VFI reflects RGC loss and function as a percentage, with central points having more weights³⁵. It is expressed as a percentage of remaining proportion of visual function. It is a reliable index on which glaucomatous visual field severity staging can be based. VFI can also be used to calculate the rate of progression which is shown in trend-based glaucoma progression analysis of Humphrey Field Analyzer software³⁶. While VFI is important in the management of glaucoma, previous studies have predicted that this global index (VFI) is rare to be found in the field of AI (artificial intelligence) using deep learning methods. Most of previous studies have mainly focused on predicting MD as a global index from different images of OCT or HRT device^{26,27,28,29,30,31}. Our study also had a significant meaning in that we predicted VFI as a global index from extracted OCT data. This has not been reported before in the field of AI using deep learning method.

The result of the current study has a significant clinical meaning in that it provides summary outline numbers of functional VF test from structural OCT test. OCT test is objective. It offers quantitative values of optic nerve head parameters. However, VF requires patient cooperation, a relatively long time, and designated space to be performed. Sometimes and quite frequently, VF test results are not available at the time of clinical practice. Since VF test also requires cognitive ability and motor reaction, for old patients and those with dementia or stroke and/or those with motor disability, VF test cannot be performed correctly. Moreover, in some clinics, VF tests need appointment. They cannot be done at the first visit because all appointed VF tests are being performed at that time. If that patient cannot come back in a short time, VF test can be delayed for a very long time. Thus, correct diagnosis of glaucoma or decision for the disease progression is difficult to be made. In such situations, if summary results of VF test could be predicted from OCT test without actually performing the VF test, it could be clinically very helpful in the management of glaucoma. Especially, in our deep learning VF global indexes prediction model, the performance of the prediction was the best in early stage of glaucoma based on the MAE as shown in Fig. 3A. Early stage of glaucoma or glaucoma suspects usually visit glaucoma clinic to be diagnosed of glaucoma for the first time and in these cases VF test results are necessary. Our relatively quick DNN model may be also useful in these situations, which frequently occur in clinics.

NTG comprises the majority (76.3%) among patients with POAG in Asian populations as reported by previous population-based studies³⁷. Thus, information regarding NTG is clinically important for Asians. It applies to Asian countries and also other countries elsewhere with a substantial proportion of Asian population. However, previous deep-learning studies rarely included NTG. It is difficult to find studies including data of NTG or those even classified NTG. As previous deep-learning studies including data of NTG are scarce, the current study might have a significant meaning to be added in the literature for providing additive information and future deep-learning studies in the field of glaucoma.

The current study had several limitations. First of all, there are potential limitations owing to its retrospective design. We included only those who had taken both RNFL and BMO-MRW tests with an acceptable images quality. In addition, only those who had reliable VF tests were included. The impact of the subject selection on our results remains unclear. Second, the study was conducted at a referral university hospital within the province using a hospital-based design, rather than a population-based approach.

The individuals included in the study may not be fully representative sample of the general population. Additionally, this study included only Korean patients. Thus, results of our study, including NTG, might not be applicable to other ethnic groups. Third, it should be considered that the sample size of this study is relatively small. Although 720 subjects with either glaucoma or GS were included in this study, this number might not be insufficient to train or test the performance to predict a single test result from single device data. Other studies with large number used both eyes from multiple visits. However, we used only one randomly selected eye from one person from a single visit. Our data might be more independent and more reliable/correct than previous studies. If we have included both eyes from multiple visits, the number of data could be much larger, for example, six times. Finally, the analysis of OCT images utilizing deep neural network (DNN) in this study was based on the extraction of numerical data from the images rather than using direct images. However, it is still meaningful in that clinicians can use deep-learning models with free open-sources to obtain prompt results and get aid in the management of glaucoma. This approach is more economically feasible than using convolutional neural networks (ConvNets) for image analysis, which can be costly to achieve high accuracy. We might consider developing our own program to be used in clinical practice to aid preliminary diagnosis from direct OCT-image analysis employing ConvNets in future studies achieving accurate performance.

In conclusion, our DNN model showed high performance in predicting VF global indexes of MD, PSD, and VFI based on OCT-derived parameters of BMO-MRW, a new parameter, and RNFL. Prediction based on VFI was the highest, followed by that based on MD and PSD using our DNN model in GS and glaucoma patients. Our DNN model might be beneficial in clinical practice in the management of glaucoma including diagnosis and monitoring progression. Given that our DNN model provides prompt outputs, it has the potential to the particularly valuable in settings where there are no glaucoma specialists available, such as primary eye care. Nonetheless, a more conclusive determination would require a larger, multi-center study with a substantial patient cohort.

Material & methods

Ethics statement

This retrospective observational, cross-sectional study was conducted in accordance with the tenets of the Declaration of Helsinki. It was approved by the Institutional Review Board (IRB) of Gyeongsang National University Changwon Hospital, Gyeongsang National University School of Medicine. The requirement for informed consent was waived by the IRB of Gyeongsang National University Changwon Hospital due to its retrospective nature.

Subjects

Among 1487 patients with glaucoma and glaucoma suspects who were evaluated between February 2016 and December 2021 in a glaucoma clinic at Gyeongsang National University Changwon Hospital, a total of 720 eyes (720 subjects) were included. Glaucoma diagnosis included early NTG, PACG, PEXG, POAG, and GS. Subjects consisted of 224 eyes of those with GS, 245 eyes of those with early NTG, 59 eyes of those with moderate stage of NTG, 36 eyes of those with PACG, 57 eyes of those with PEXG, and 99 eyes of those with POAG. The study included only those participants who met the diagnostic criteria below and demonstrated reliable results for both BMO-MRW and RNFL.

Diagnosis of glaucoma was assessed by a single glaucoma specialist (H-k Cho) applying consistent criteria. To diagnose NTG, patients needed to meet specific criteria, including having an IOP ≤ 21 mmHg without treatment who demonstrated glaucomatous optic disc injury and corresponding VF loss, an open-angle assessed by gonioscopic inspection, and no other underlying cause of optic disc injury other than glaucoma³⁸. Early NTG was defined as the VF test results of MD > − 6.0 dB. PACG was determined as eyes with shallow anterior chamber (appositional contact between the peripheral iris and the trabecular meshwork (TM) > 270 degrees on gonioscopy and showed glaucomatous optic disc damage (decline of NRR with a vertical cup-to-disc ratio of 0.7 or an asymmetry between eyes of 0.2, or notching ascribe to glaucoma) and showing corresponding visual field defects³⁹. To diagnose PEX glaucoma, the criteria included the observation of PEX material at the margin of the pupil and on the anterior lens capsule after maximal pupil dilatation, along with the presence of baseline IOP of at least 22 mmHg, glaucomatous optic nerve head damage, visual field loss consistent with optic disc injury, and the absence of other conditions causing secondary glaucoma⁴⁰. POAG was defined as a patient with a baseline IOP of more than 21 mmHg prior to treatment who showed findings of glaucomatous optic nerve head injury and corresponding VF loss, an open-angle assessed by gonioscopic inspection, and no other underlying cause for optic nerve head injury besides glaucoma¹.

The exclusion criteria were as follows: low-quality image scans resulting from eyelid blinking or poor fixation, history of optic neuropathies aside from glaucoma or an acute angle-closure crisis that could affect the thickness of the RNFL or BMO-MRW (e.g., optic neuritis, acute ischemic optic neuritis), history of any intraocular surgery except for uneventful phacoemulsification, and retinal disease associated with retinal swelling or edema and subsequent RNFL or BMO-MRW swelling. Preperimetric glaucoma was excluded from the current study. Subjects were not excluded by axial length or refractive error, or the size of optic disc for the present study.

Optical coherence tomography

Imaging of spectral-domain OCT was accomplished using the Glaucoma Module Premium Edition. Radial B-scans of 24 in number were acquired to analyze BMO-MRW. Among three scan circle diameters (3.5, 4.1, and 4.7 mm), a scan circle diameter of 3.5 mm was chosen for peripapillary RNFL thickness measurement. Only those images that were correctly centered and accurately segmented and quality scores ≥ 20 were selected for this study. Images taken with OCT were aligned in FoBMO axis, that is an individual specific axis that measures between the center of BMO and the fovea of macula. Employing this FoBMO axis could enable more correct analysis of Garway-Heath sector considering cyclotorsion of each individual and more precise analysis compared with normative database than the existing way of using only simple clock-hour locations.

Perimetry

We used a Humphrey Field Analyzer (HFA model 840; Humphrey Instruments Inc.) for perimetry with a central 30-2 program of Swedish Interactive Threshold Algorithm standard strategy. A reliable VF test had to qualify the following criteria: false-positive rate < 15%; false-negative rate < 15%; and fixation loss less than 20%.

Data preprocessing

The dataset consisted of OCT parameters and age of 720 eyes. Parameters included the following: age, BMO Area, BMO-MRW Global, BMO-MRW Temporal, BMO-MRW superotemporal (TS), BMO-MRW inferotemporal (TI), BMO-MRW Nasal, BMO-MRW superonasal (NS), BMO-MRW inferonasal (NI), RNFL Mean Global, RNFL Mean Temporal, RNFL Mean TS, RNFL Mean TI, RNFL Mean Nasal, RNFL Mean NS, and RNFL Mean NI. Each feature was standardized by its mean and standard deviation to make learning process more efficiently. Stratified sampling was used to compensate for the relatively small size of dataset to be divided randomly. Out of 720 eyes, 684 eyes were used to construct a train set (95%) and 36 eyes were used to form a test set (5%). Since test data were used for comparing prediction performances of each model, they contained five percent of the dataset. K-fold cross validation (k = 5) was applied. The train set was re-slitted to a ratio of 8:2 for train set (n = 541) and validation set (n = 137). Programming language Python version 3.9.7 (https://www.python.org/) and the package Scikit-learn 0.24.2 (https://scikit-learn.org/) were used to preprocess all data.

Machine learning algorithm

Machine learning means the use of an algorithm to make prediction not based on logics but based on data. Models rarely had any explicit rule or strict logic. Instead, they generate results by using the data⁴¹. The process of getting results can vary depending on the method of the ML algorithm. In our study, several ML models, Random Forest, XGBoost, SVM, and SVM with Radial Basis Function, were used and compared with a DNN model. Random Forest algorithm is one of the mainly used ML algorithms for tasks of classification and regression. It combines several decision trees and makes predictions by using voting system which averages all decision trees’ results⁴². XGBoost is also based on decision tree like Random Forest. However, it implements a boosting process which is the ensemble learning technique of building several models sequentially⁴³. SVM is an ML algorithm that maps data from the feature space into the kernel space⁴⁴. We also used SVM with RBF kernel⁴⁵.

Deep neural network architecture

A DNN is an artificial neural network with more than two hidden layers and a non-linear activation function. DNN proceeds learning process by repeating feedforward and backpropagation⁴⁶. We built our model using open-source neural network APIs, Keras (https://keras.io/), and TensorFlow (https://www.tensorflow.org/). Each model was built slightly differently because each VF global index had different meaning, values, and distributions. According to the index, we made three DNN models in this study: MD prediction model (MD model), PSD prediction model (PSD model), and VFI prediction model (VFI model). These models had the same number of layers: a single input layer, three hidden layers, and an output layer as shown in Fig. 1B. Each model received input data with 16 parameters which consisted of age and other ocular parameters extracted from OCT scans and related to BMO-MRW and RNFL. Batch Normalization was used after each hidden layer⁴⁷. An ELU function was used as an activation function⁴⁸. To prevent overfitting, l2-regularizer was used. An adaptive moment estimation optimizer (Adam) (learning rate = 0.05) was used for each model⁴⁹. Learning rate decay method was applied. MSE was used for its loss function. Architectures of these models used in this study are shown in Fig. 1.

Statistical analysis

To evaluate the performance of the deep-learning model, MAE was utilized. MAE was evaluated to determine the performance of a regression model interpretably. It is generally known as more intuitive and easier to interpret than root mean squared error. MAE is the average of the absolute value of the deviation. The formula to calculate MAE for each indicator is shown as follows:

$$MAE= \frac{1}{n}\sum_{i=1}^{n}\left|\text{Actual value}-\text{Predicted value}\right|$$

We also calculated Pearson's correlation coefficient ($\rho$) and ${R}^{2}$ to evaluate how our models were trained and whether they showed convincing prediction⁵⁰. All statistical analyses were performed using programming language Python version 3.9.7 (https://www.python.org/) and the package Scikit-learn 0.24.2 (https://scikit-learn.org/).

Data availability

Dataset used in this study might be obtained from Hyun-kyung Cho (MD, PhD) upon reasonable request.

References

Weinreb, R. N. & Khaw, P. T. Primary open-angle glaucoma. Lancet 363, 1711–1720 (2004).
PubMed Google Scholar
Banegas, S. A. et al. Evaluation of the retinal nerve fiber layer thickness, the mean deviation, and the visual field index in progressive glaucoma. J. Glaucoma 25, e229-235 (2016).
PubMed Google Scholar
Prum, B. E. et al. Primary open-angle glaucoma preferred practice pattern(®) guidelines. Ophthalmology 123, P41–P111 (2016).
PubMed Google Scholar
BMJ Publishing Group Ltd. BMA House, Square, T., London & 9jr, W. European glaucoma society terminology and guidelines for glaucoma, 4th Edition—Part 1 supported by the EGS foundation. Br. J. Ophthalmol. 101, 1–72 (2017).
Artes, P. H., Iwase, A., Ohno, Y., Kitazawa, Y. & Chauhan, B. C. Properties of perimetric threshold estimates from Full Threshold, SITA Standard, and SITA Fast strategies. Invest. Ophthalmol. Vis. Sci. 43, 2654–2659 (2002).
PubMed Google Scholar
Gardiner, S. K., Swanson, W. H., Goren, D., Mansberger, S. L. & Demirel, S. Assessment of the reliability of standard automated perimetry in regions of glaucomatous damage. Ophthalmology 121, 1359–1369 (2014).
PubMed Google Scholar
Gardiner, S. K., Johnson, C. A. & Cioffi, G. A. Evaluation of the structure-function relationship in glaucoma. Invest. Ophthalmol. Vis. Sci. 46, 3712–3717 (2005).
PubMed Google Scholar
Ferreras, A., Pablo, L. E., Garway-Heath, D. F., Fogagnolo, P. & García-Feijoo, J. Mapping standard automated perimetry to the peripapillary retinal nerve fiber layer in glaucoma. Invest. Ophthalmol. Vis. Sci. 49, 3018–3025 (2008).
PubMed Google Scholar
Leite, M. T. et al. Structure-function relationships using the Cirrus spectral domain optical coherence tomograph and standard automated perimetry. J. Glaucoma 21, 49–54 (2012).
PubMed PubMed Central Google Scholar
Malik, R., Swanson, W. H. & Garway-Heath, D. F. ‘Structure-function relationship’ in glaucoma: Past thinking and current concepts. Clin. Exp. Ophthalmol. 40, 369–380 (2012).
PubMed PubMed Central Google Scholar
The AGIS Investigators. The Advanced Glaucoma Intervention Study (AGIS): 7. The relationship between control of intraocular pressure and visual field deterioration. Am. J. Ophthalmol. 130, 429–440 (2000).
Google Scholar
Kass, M. A. et al. The Ocular Hypertension Treatment Study: A randomized trial determines that topical ocular hypotensive medication delays or prevents the onset of primary open-angle glaucoma. Arch. Ophthalmol. 120, 701–713 (2002).
PubMed Google Scholar
Keltner, J. L. et al. The association between glaucomatous visual fields and optic nerve head features in the Ocular Hypertension Treatment Study. Ophthalmology 113, 1603–1612 (2006).
PubMed Google Scholar
Hood, D. C. & Kardon, R. H. A framework for comparing structural and functional measures of glaucomatous damage. Prog. Retin. Eye Res. 26, 688–710 (2007).
PubMed PubMed Central Google Scholar
Chauhan, B. C. & Burgoyne, C. F. From clinical examination of the optic disc to clinical assessment of the optic nerve head: A paradigm change. Am. J. Ophthalmol. 156, 218-227.e2 (2013).
PubMed PubMed Central Google Scholar
Chen, T. C. Spectral domain optical coherence tomography in glaucoma: Qualitative and quantitative analysis of the optic nerve head and retinal nerve fiber layer (an AOS thesis). Trans. Am. Ophthalmol. Soc. 107, 254–281 (2009).
PubMed PubMed Central Google Scholar
Povazay, B. et al. Minimum distance mapping using three-dimensional optical coherence tomography for glaucoma diagnosis. J. Biomed. Opt. 12, 041204 (2007).
PubMed ADS Google Scholar
Reis, A. S. C. et al. Influence of clinically invisible, but optical coherence tomography detected, optic disc margin anatomy on neuroretinal rim evaluation. Invest. Ophthalmol. Vis. Sci. 53, 1852–1860 (2012).
PubMed PubMed Central Google Scholar
Strouthidis, N. G., Fortune, B., Yang, H., Sigal, I. A. & Burgoyne, C. F. Longitudinal change detected by spectral domain optical coherence tomography in the optic nerve head and peripapillary retina in experimental glaucoma. Invest. Ophthalmol. Vis. Sci. 52, 1206–1219 (2011).
PubMed PubMed Central Google Scholar
Chauhan, B. C. et al. Bruch’s membrane opening minimum rim width and retinal nerve fiber layer thickness in a normal white population: A multicenter study. Ophthalmology 122, 1786–1794 (2015).
PubMed Google Scholar
Chauhan, B. C. et al. Enhanced detection of open-angle glaucoma with an anatomically accurate optical coherence tomography-derived neuroretinal rim parameter. Ophthalmology 120, 535–543 (2013).
PubMed Google Scholar
Mizumoto, K., Gosho, M. & Zako, M. Correlation between optic nerve head structural parameters and glaucomatous visual field indices. Clin. Ophthalmol. 8, 1203–1208 (2014).
PubMed PubMed Central Google Scholar
Pollet-Villard, F., Chiquet, C., Romanet, J.-P., Noel, C. & Aptel, F. Structure-function relationships with spectral-domain optical coherence tomography retinal nerve fiber layer and optic nerve head measurements. Invest. Ophthalmol. Vis. Sci. 55, 2953–2962 (2014).
PubMed Google Scholar
Gardiner, S. K. et al. A method to estimate the amount of neuroretinal rim tissue in glaucoma: Comparison with current methods for measuring rim area. Am. J. Ophthalmol. 157, 540–549 (2014).
PubMed Google Scholar
Seo, S. B. & Cho, H.-K. Deep learning classification of early normal-tension glaucoma and glaucoma suspects using Bruch’s membrane opening-minimum rim width and RNFL. Sci. Rep. 10, 19042 (2020).
CAS PubMed PubMed Central Google Scholar
Christopher, M. et al. Deep learning approaches predict glaucomatous visual field damage from OCT optic nerve head en face images and retinal nerve fiber layer thickness maps. Ophthalmology 127, 346–356 (2020).
PubMed Google Scholar
Yu, H.-H. et al. Estimating global visual field indices in glaucoma by combining macula and optic disc OCT scans using 3-dimensional convolutional neural networks. Ophthalmol. Glaucoma 4, 102–112 (2021).
PubMed Google Scholar
Hashimoto, Y. et al. Deep learning model to predict visual field in central 10° from optical coherence tomography measurement in glaucoma. Br. J. Ophthalmol. 105, 507–513 (2021).
PubMed Google Scholar
Park, K., Kim, J. & Lee, J. A deep learning approach to predict visual field using optical coherence tomography. PLoS ONE 15, e0234902 (2020).
CAS PubMed PubMed Central Google Scholar
Mariottoni, E. B. et al. Artificial intelligence mapping of structure to function in glaucoma. Transl. Vis. Sci. Technol. 9, 19 (2020).
PubMed PubMed Central Google Scholar
Hemelings, R. et al. Pointwise visual field estimation from optical coherence tomography in glaucoma using deep learning. Transl. Vis. Sci. Technol. 11, 22 (2022).
PubMed PubMed Central Google Scholar
Toshev, A. P., Lamparter, J., Pfeiffer, N. & Hoffmann, E. M. Bruch’s membrane opening-minimum rim width assessment with spectral-domain optical coherence tomography performs better than confocal scanning laser ophthalmoscopy in discriminating early glaucoma patients from control subjects. J. Glaucoma 26, 27–33 (2017).
PubMed Google Scholar
Phu, J., Khuu, S. K., Agar, A. & Kalloniatis, M. Clinical evaluation of Swedish interactive thresholding algorithm–faster compared with Swedish interactive thresholding algorithm–standard in normal subjects, glaucoma suspects, and patients with glaucoma. Am. J. Ophthalmol. 208, 251–264 (2019).
PubMed Google Scholar
Sengupta, S., Singh, A., Leopold, H. A., Gulati, T. & Lakshminarayanan, V. Ophthalmic diagnosis using deep learning with fundus images—A critical review. Artif. Intell. Med. 102, 101758 (2020).
PubMed Google Scholar
Bengtsson, B. & Heijl, A. A visual field index for calculation of glaucoma rate of progression. Arch. Ophthalmol. 145, 343–353 (2008).
Google Scholar
Casas-Llera, P. et al. Visual field index rate and event-based glaucoma progression analysis: Comparison in a glaucoma population. Br. J. Ophthalmol. 93(12), 1576–1579 (2009).
CAS PubMed Google Scholar
Cho, H.-K. & Kee, C. Population-based glaucoma prevalence studies in Asians. Surv. Ophthalmol. 59, 434–447 (2014).
PubMed Google Scholar
Cho, H.-K., Lee, J., Lee, M. & Kee, C. Initial central scotomas vs peripheral scotomas in normal-tension glaucoma: Clinical characteristics and progression rates. Eye 28, 303–311 (2014).
PubMed Google Scholar
Foster, P. J., Buhrmann, R., Quigley, H. A. & Johnson, G. J. The definition and classification of glaucoma in prevalence surveys. Br. J. Ophthalmol. 86, 238–242 (2002).
PubMed PubMed Central Google Scholar
Park, D. Y., Won, H.-H., Cho, H.-K. & Kee, C. Evaluation of lysyl oxidase-like 1 gene polymorphisms in pseudoexfoliation syndrome in a Korean population. Mol. Vis. 19, 448–453 (2013).
CAS PubMed PubMed Central Google Scholar
Mahesh, B. Machine learning algorithms-a review. Int. J. Sci. Res. (IJSR). 9, 381–386 (2020).
Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
MATH Google Scholar
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 (2016).
Hearst, M. A., Dumais, S. T., Osuna, E., Platt, J. & Scholkopf, B. Support vector machines. IEEE Intell. Syst. Appl. 13, 1828 (1998).
Google Scholar
Amari, S. & Wu, S. Improving support vector machine classifiers by modifying kernel functions. Neural Netw. 12, 783–789 (1999).
CAS PubMed Google Scholar
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
CAS PubMed ADS Google Scholar
Ioffe, S. & Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. in Proceedings of the 32nd International Conference on Machine Learning (eds. Bach, F. & Blei, D.) 37, 448–456 (PMLR, 07–09 Jul 2015).
Clevert, D.-A., Unterthiner, T. & Hochreiter, S. Fast and accurate deep network learning by exponential linear units (ELUs). arXiv preprint arXiv:1511.07289 (2015).
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
Benesty, J., Chen, J., Huang, Y. & Cohen, I. Pearson correlation coefficient. in Noise Reduction in Speech Processing (eds. Cohen, I., Huang, Y., Chen, J. & Benesty, J.) 1–4 (Springer Berlin Heidelberg, 2009).

Download references

Acknowledgements

This work was supported by a grant of the National Research Foundation (NRF) funded by the Ministry of Science, ICT & Future Planning (MSIP), Republic of Korea (No.2020R1G1A1A01007469).

Author information

These authors contributed equally: Dongbock Kim and Sat Byul Seo.

Authors and Affiliations

Department of Mathematics Education, School of Education, Kyungnam University, 7 Kyugnamdaehak‑ro, Masanhappo‑gu, Changwon, Gyeongsangnam-do, 51767, Republic of Korea
Dongbock Kim, Sat Byul Seo & Seong Joon Park
Department of Ophthalmology, Gyeongsang National University Changwon Hospital, School of Medicine, Gyeongsang National University, 11 Samjeongja-ro, Seongsan-gu, Changwon, Gyeongsangnam-do, 51472, Republic of Korea
Hyun-kyung Cho
Institute of Health Sciences, School of Medicine, Gyeongsang National University, Jinju, Republic of Korea
Hyun-kyung Cho

Authors

Dongbock Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sat Byul Seo
View author publications
You can also search for this author in PubMed Google Scholar
Seong Joon Park
View author publications
You can also search for this author in PubMed Google Scholar
Hyun-kyung Cho
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.-k.C. and S.B.S. conceived and designed this study; H.-k.C, contributed to data collection and data management; D.K., S.B.S, and S.J.P. developed neural network architectures and performed computational experiments; H.-k.C, S.B.S., and D.K. discussed experimental results and wrote the manuscript. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Hyun-kyung Cho.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kim, D., Seo, S., Park, S. et al. Deep learning visual field global index prediction with optical coherence tomography parameters in glaucoma patients. Sci Rep 13, 18304 (2023). https://doi.org/10.1038/s41598-023-43104-y

Download citation

Received: 01 March 2023
Accepted: 20 September 2023
Published: 25 October 2023
DOI: https://doi.org/10.1038/s41598-023-43104-y

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Estimating visual field loss from monoscopic optic disc photography using deep learning model

Predicting the central 10 degrees visual field in glaucoma by applying a deep learning algorithm to optical coherence tomography images

Deep learning approaches to predict 10-2 visual field from wide-field swept-source optical coherence tomography en face images in glaucoma

Introduction

Results

Baseline characteristics of subjects

Workflow of deep learning model for predicting visual field global indexes

Predictive performances of DNN and ML models

Comparison of actual and DNN predicted values of VF global indexes

Predictive performances of DNN model according to OCT- derived parameters

Deep learning predictive performance analysis according to glaucoma severity

Discussion

Material & methods

Ethics statement

Subjects

Optical coherence tomography

Perimetry

Data preprocessing

Machine learning algorithm

Deep neural network architecture

Statistical analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links