Deep learning system to predict the 5-year risk of high myopia using fundus imaging in children

Foo, Li Lian; Lim, Gilbert Yong San; Lanca, Carla; Wong, Chee Wai; Hoang, Quan V.; Zhang, Xiu Juan; Yam, Jason C.; Schmetterer, Leopold; Chia, Audrey; Wong, Tien Yin; Ting, Daniel S. W.; Saw, Seang-Mei; Ang, Marcus

doi:10.1038/s41746-023-00752-8

Article
Open access
Published: 26 January 2023

Li Lian Foo ORCID: orcid.org/0000-0002-7785-9556^1,2^na1,
Gilbert Yong San Lim¹^na1,
Carla Lanca^3,4,
Chee Wai Wong^1,2,5,
Quan V. Hoang^1,2,6,7,
Xiu Juan Zhang⁸,
Jason C. Yam^8,9,10,11,12,
Leopold Schmetterer^1,2,
Audrey Chia^1,2,
Tien Yin Wong^1,2,
Daniel S. W. Ting ORCID: orcid.org/0000-0003-2264-7174^1,2^na2,
Seang-Mei Saw^1,2^na2 &
…
Marcus Ang^1,2^na2

npj Digital Medicine volume 6, Article number: 10 (2023)

Abstract

Our study aims to identify children at risk of developing high myopia for timely assessment and intervention, preventing myopia progression and complications in adulthood through the development of a deep learning system (DLS). Using a school-based cohort in Singapore comprising 998 children (aged 6–12 years old), we train and perform primary validation of the DLS using 7456 baseline fundus images of 1878 eyes, with external validation using an independent test dataset of 821 baseline fundus images of 189 eyes together with clinical data (age, gender, race, parental myopia, and baseline spherical equivalent (SE)). We derive three distinct types of algorithm, namely image, clinical and mixed (image + clinical) models, to predict high myopia development (SE ≤ −6.00 diopter) during teenage years (5 years later, age 11–17). Model performance is evaluated using the area under the receiver operating characteristic curve (AUC). Our image models (Primary dataset AUC 0.93–0.95; Test dataset 0.91–0.93), clinical models (Primary dataset AUC 0.90–0.97; Test dataset 0.93–0.94) and mixed (image + clinical) models (Primary dataset AUC 0.97; Test dataset 0.97–0.98) achieve clinically acceptable performance. The addition of 1 year SE progression variable has minimal impact on the DLS performance (clinical model AUC 0.98 versus 0.97 in primary dataset, 0.97 versus 0.94 in test dataset; mixed model AUC 0.99 versus 0.97 in primary dataset, 0.95 versus 0.98 in test dataset). Thus, our DLS allows prediction of the development of high myopia by teenage years amongst school-going children. This has potential utility as a clinical-decision support tool to identify “at-risk” children for early intervention.

Introduction

Myopia is one of the leading causes of uncorrected, reversible visual impairment in the world^1,2,3. It has been projected that myopia could affect 50% (4.7 billion) of the world’s population by 2050, with 10% (1 billion) suffering from high myopia^4,5. High myopia carries an increased risk of sight-threatening complications such as retinal detachment, open-angle glaucoma, myopic macular degeneration, and choroidal neovascularization^6,7. Occurrence of these complications would inevitably increase public health burden⁸, and lead to loss of productivity⁹.

Currently, known potential risk factors for the development of high myopia in children include a younger age of myopia onset^10,11,12, higher myopia diagnosed at presentation¹³, rapid myopia progression^14,15, reduced outdoor and increased near work time¹⁴, parental myopia¹⁴, education years¹⁶, and polygenic risk scores^17,18. However, translating these factors into clinical practice can be challenging. Many of these factors are based on subjective recall, or require the use of cycloplegic eyedrops which may not be widely available due to the need for staff with higher level of training in administering the eyedrops^19,20. Studies have attempted to develop risk prediction based on statistical^19,20 approaches that rely on serial visual and refraction assessments, which is resource intensive and may delay necessary intervention¹⁹. Moreover, there is also increased demand for skilled eye care professionals, compounding challenges brought about by the growing magnitude of vision and eye health related problems²¹.

The trend of an increasing number of children and teenagers with high myopia has become a global concern²². This is particularly evident in East Asia where prevalence rates of high myopia have risen up to 21.6%^23,24,25,26. While there are interventions such as atropine eyedrops and optical devices (e.g., myopic defocus spectacles and multifocal contact lenses) that may reduce myopia progression in children²⁷, these interventions may not be suitable for all children and have potential side effects^28,29.

A key area of research is to identify children who are “at risk” of developing high myopia with greater precision, so that these interventions may be appropriately introduced to these children. Thus, there is an unmet need to identify children at risk of developing high myopia, based on simple, accessible and objective measures, ideally at a single baseline visit. In this study, we aim to develop a deep learning system (DLS) utilizing objective fundus imaging and/or clinical data to identify children who are at risk of developing high myopia (5-year prediction) later in their teenage years.

Results

From a total of 965 children (1878 eyes) with 7456 retina images from schools 2 and 3, the deep learning model was trained with fivefold cross validation for detection of high myopia using 769 children (1502 eyes) with 5945 retina images and tested on 196 children (376 eyes) with 1511 retina images. The models were externally validated on the test dataset using 99 children (189 eyes) with 821 retina images from school 1. The overall subjects’ demographics, myopic status at baseline and in teenage years are listed in Supplementary Table 1. Comparison of baseline characteristics of subjects from the different schools using the Chi-squared test showed that school 1 is generally statistically different from schools 2 and 3, with the exception of the proportion of males (P = 0.056) (Supplementary Table 2).

In the primary dataset for internal validation, among the 1502 eyes used in training and fivefold cross validation, 60.7% had no myopia, 29.8% had low myopia and 9.5% had moderate myopia at baseline. In teenage years, 5 years after the initial visit, 31.8% had no myopia, 36.9% had low myopia, 26.8% had moderate myopia and 4.6% had high myopia. In the 376 eyes used in testing, 62.2%, 28.7% and 9.0% had no, low, moderate myopia at baseline, respectively. After 5 years, 39.6%, 31.1%, 25.8% and 3.5% had no, low, moderate and high myopia, respectively. In the test dataset for external validation (189 eyes), 33.9%, 37.0% and 29.1% had no, low, moderate myopia at baseline, respectively. After 5 years, 23.3%, 33.9%, 34.4% and 8.5% had no, low, moderate and high myopia, respectively.

Using only variables at baseline, all our models achieved clinically acceptable performance. In the fundus image-only models, fundus image alone achieved an AUC of 0.93 in both the primary as well as the test dataset while in the clinical data-only models, baseline SE alone achieved an AUC of 0.90 and 0.93 in the primary and the test dataset, respectively. In the mixed models (image + clinical), the combination of fundus image and baseline SE achieved an AUC of 0.97 in both the primary as well as the test dataset. The addition of 1 year SE progression to the clinical and mixed models provided only marginal improvement or decline in model performance. The performance of the algorithm for image models, clinical models and mixed models is presented in Table 1.

Table 1 Internal and external validation of the DLS in image models, clinical models and mixed models for 5-year high myopia prediction.

Fundus image-only models

The algorithm with baseline childhood fundus imaging input alone achieved clinically acceptable prediction of high myopia in teenage years. The AUC and classification accuracy were 0.93 and 0.84 for internal validation, and 0.93 and 0.86 for external validation. The addition of age, race and gender resulted in marginal improvement or decline in performance (AUC 0.95 in primary dataset, 0.91 in test dataset). The AUC curves and confusion matrices for internal and external validation are shown in Fig. 1.

**Fig. 1: Performance of image models.**

Clinical data-only models

Two baseline clinical models were developed: (i) Baseline SE and (ii) Age + Race + Gender + Baseline SE. The AUC and accuracy achieved were 0.90–0.97 and 0.80–0.87 for internal validation, and 0.93–0.94 and 0.65–0.74 for external validation. Similar to the fundus image-only model, additional input with 1 year progression data led to marginal improvement in performance (AUC 0.98 in primary dataset, 0.97 in test dataset). The addition of parental myopia did not further improve the performance. The AUC curves and confusion matrices for internal and external validation are shown in Fig. 2, while the random forest feature importance for clinical models can be found in Supplementary Fig. 1.

**Fig. 2: Performance of clinical models.**

Mixed Models

The algorithm with baseline childhood fundus imaging together with clinical data input achieved the highest performance in prediction of high myopia in teenagers. Two baseline mixed models were developed: (i) Fundus photo + Baseline SE and (ii) Fundus photo + Age + Race + Gender + Baseline SE, with AUC and classification accuracy of 0.97 and 0.93 for internal validation, and 0.97–0.98 and 0.82 for external validation. The addition of 1 year progression input data delivered marginal improvement or decline in performance (AUC 0.99 in primary dataset, 0.95 in test dataset). Similar to the clinical model, the addition of parental myopia did not further improve the performance. The AUC curves and confusion matrices are shown in Fig. 3. The random forest feature importance for mixed models can be found in Supplementary Fig. 1.

Performance of models for AL prediction

We also attempted to perform long axial length (AL ≥ 26.5 mm) prediction in image models, clinical models and mixed models. In the primary dataset for internal validation, among the 1502 eyes used in training and fivefold cross validation, 100% had AL < 26.5 mm at baseline. In teenage years, 5 years after the initial visit, 98.1% had AL < 26.5 mm while 1.86% (28 eyes) had AL ≥ 26.5 mm. In the 376 eyes used in testing, 100% had AL < 26.5 mm at baseline and 2.13% (8 eyes) had AL ≥ 26.5 mm after 5 years. In the test dataset for external validation (189 eyes), 100% had AL < 26.5 mm at baseline and 8.47% (16 eyes) had AL ≥ 26.5 mm after 5 years (Supplementary Table 1).

In the fundus image-only models, fundus image alone achieved an AUC of 0.67 and 0.57 in the primary and the test dataset, respectively, while in the clinical data-only models, baseline AL alone achieved an AUC of 0.83 and 0.91 in the primary and the test dataset, respectively. In the mixed models (image + clinical), the combination of fundus image and baseline AL achieved an AUC of 0.98 and 0.88 in the primary and the test dataset, respectively. The addition of 1 year SE progression to the clinical and mixed models provided only marginal or no improvement in model performance. The performance of the algorithm for image models, clinical models and mixed models is presented in Table 2.

Table 2 Internal and external validation of the DLS in image models, clinical models and mixed models for 5-year axial length ≥26.5 mm prediction.

Discussion

In this study, we developed a modular DLS based on single-time point objective data and fundus imaging to predict the 5-year development of high myopia in a multi-ethnic group of children aged between 6 and 12 years old at baseline. Our models demonstrated clinically acceptable predictive performance with AUCs ranging from 0.90 to 0.98. Importantly, the fundus image-only model demonstrated comparable performance (AUC = 0.94) against clinical models (AUC 0.90–0.97). Marginal benefit or decline in performance was noticed with additional 1 year follow-up progression data of SE. The performance of our fundus image-only model has the potential to be translated and implemented into community or school-based programs to identify at-risk children for further assessment and intervention if required.

In the recent report Impact of Myopia by the International Myopia Institute (IMI), the global cost of myopia care is expected to increase from an estimated USD 358.7 billion in 2019 to USD 870 billion by 2050³⁰. In particular, these reports indicate that high and pathologic myopia place significant financial burden on both the individual and society, with annual costs increasing substantially with age³⁰. Thus, the IMI Clinical Management Guidelines Report advocates for myopia control, which includes risk assessment, followed by clinical evaluation, and treatment selection³¹. In addition, identifying children “at risk” of high myopia and those with “premyopia” is important for early intervention³². Though such broad definitions could support clinical decision making, in reality, accurately identifying a child at risk of developing high myopia in clinical practice is challenging. High-risk features such as family history^33,34,35,36 and environmental factors (near work and outdoor exposure)^37,38,39,40,41 are helpful but not deterministic, hence current childhood myopia management is heavily reliant on eyecare professionals’ judgment and experience. Based on the current available clinical evidence^42,43,44,45,46,47,48, children who are already myopic may be started on myopia control therapies to prevent further progression, while environmental and behavioral modifications should be encouraged in ‘pre-myopic’ high-risk children. However, current clinical management requires a more precise approach that is individualized, to reliably identify and initiate timely treatment for high-risk children.

Our predictive DLS is designed to address these specific challenges, and with the most clinical impact. First, we have a target age group of children aged 6–12 years old who are most vulnerable to myopia progression and also amenable to myopia control therapies^31,49. Second, we used only objective inputs to avoid biases related to subjective recall—and these are obtained at one single time point (baseline), thereby eliminating the need for repeated, longitudinal follow-up before a clinical decision can be made and avoiding unnecessary delay of treatment for high-risk individuals. Third, our various image-based and mixed-clinical models produced clinically acceptable performances, allowing for implementation in various clinical settings with the availability of imaging systems. As a further enhancement, our DLS was trained and tested using a multi-ethnic population which improves the overall generalizability of the results. Fourth, our DLS delivered robust predictive performance against an external validation dataset which was dissimilar to the training datasets. This suggests that our DLS could have the capacity to achieve good performance against unique external datasets, which would require substantiation through further validations.

Comparing our fundus image-only model with other models in this study that require clinical predictors input, using baseline fundus image alone appeared to be comparable in performance and adequate in predicting 5-year high myopia. We also compared our fundus image-only model to previous studies using regression methods¹⁹ and machine learning (ML)²⁰ to predict high myopia in childhood (Supplementary Table 3). Our approach provided several distinct advantages. Firstly, our DLS, based on a baseline fundus image as a single input variable, exceeded or was at least on par with the performance of the 5-year high myopia prediction ML algorithms proposed by Lin et al., which used big data (age and refraction) from electronic medical records. However, the ML algorithms required cycloplegic refraction and a minimum of three repeated annual visits before a prediction could be made²⁰. Secondly, the statistical models in Chen et al. were only able to predict the development of high myopia at 18 years old (5 or 6-year prediction) in 12- to 13-year-old children, using age, gender and cycloplegic SE from 1–3 visits¹⁹. However, by 18 years of age, myopia progression and axial elongation would have occurred, missing the window period for myopia control treatment. In comparison, our models target children aged 6–12 years old in order for potential myopia interventions to remain effective. Moreover, our DLS was also able to achieve comparable performance utilizing the same variables used in models proposed by Chen et al. (Supplementary Table 3).

Logistically, utilizing baseline fundus image alone can eliminate the need for cycloplegic refraction¹⁹ without significant degradation in predictive performance. Cycloplegic refraction is a time consuming process with a waiting time ranging between 1 and 2 h. Additionally, the children routinely experience side effects of pupil dilation and glare lasting up to 72 h. Hence, this procedure is not routinely performed in clinical assessments or myopia screening programs⁵⁰. On the other hand, predictive utility of non-cycloplegic SE alternatives such as manifest refraction or basic autorefraction could be confounded by the spurious effects of pseudomyopia. In comparison, our approach utilizing fundal imaging coincides with the maturation of non-mydriatic imaging technology. It is now feasible to obtain high quality images with minimal latency. This provides comparative advantages, including significant time savings versus cycloplegic SE and better accuracy versus non-cycloplegic SE modalities.

The implementation of AI into myopia screening or evaluation programs will depend on the availability of skilled manpower, imaging systems and infrastructure support, and therefore the selection (or design) of a candidate system needs to consider these issues^51,52,53,54. The World Report on Vision highlighted the increasing demand for trained eye health human resources to address the substantial burden of vision and eye health issues globally⁵⁵. With 19 million children with visual impairment, including blindness in 1.4 million and low vision in 17.5 million⁵⁶, the shortage of trained eye care professionals, particularly optometrists and ophthalmologists, represents a major barrier to accessing quality childhood myopia management²¹. The majority of these children reside in developing countries⁵⁶, which are chronically underserved due to a paucity of optometrists and unequal optometrist workforce distribution⁵⁷. This gap would likely widen with time, given the increasing trend in economic inequality.

For example, our image-only model based on a single baseline fundus photograph may be implemented easily without the need for other sophisticated, costly equipment to measure AL and autorefraction. It also eliminates the need to perform cycloplegic refraction, which requires skilled manpower and is difficult to conduct in primary healthcare settings⁵⁸. In the future, such DLS image analysis could be integrated into the fundus camera or performed using cloud computing⁵⁹. Separately, our DLS may be integrated into national myopia screening programs^60,61, or even in less developed healthcare systems that have access to portable fundal cameras⁶². However, this would need to be balanced against the higher false-negative rate in comparison with models solely based on clinical data or a mixture of both. This may result in misclassification of at-risk children, who may then go on to develop high myopia without intervention. It is therefore important for public health experts and policymakers to weigh the risk of screening inaccuracies against the benefit of scalable screening based on a single fundus photograph. In addition, incidences of misclassification may also be minimized through regular annual screenings. On the other hand, the mixed model would be better suited to a tertiary healthcare setting where imaging equipment and cycloplegic refraction are readily available. This would then function as a clinical-decision support tool by identifying suitable candidates for treatment.

In order to identify at-risk individuals, we postulate that our DLS could have indirectly detected the early phases of accelerated myopia development through higher presenting SE as well as subtle morphological changes on fundus imaging. This could include differences in macular choroidal thickness or topographical differences at the macula and disc. Moreover, post-processing techniques applied to saliency heatmaps, generated using Integrated Gradient techniques, identified the disc and macula as areas of interest (Fig. 4), consistent with areas of future myopic disc changes and myopic macular degeneration⁶³.

Our study has several limitations. Firstly, the distribution of children with and without high myopia in our current cohort after 5 years exhibited significant imbalance. While this could affect the utility of our results, it is an unavoidable challenge as our dataset is reflective of the naturally skewed distribution of the disease. Furthermore, with Singapore ranking amongst the highest in the world in terms of high myopia prevalence, our current data distribution is likely to reflect the upper end of high myopia disease frequency in the real world. Hence, augmenting with population data from other studies would represent an impractical effort to address this constraint.

While our algorithm was developed using a longitudinal school-based dataset, the algorithm would require further testing in other cohorts with different study populations. Ideally, external validation involving children of different ethnicities and geographical locations could be performed, but similar longitudinal cohorts that capture fundus images from childhood with 5-year longitudinal follow up are not widely available. Furthermore, it is challenging to locate treatment naïve longitudinal cohorts in children for external validation, due to ethical considerations in withholding treatment. To address this, we are actively engaging with international partners to create a consortium to collate the required data prospectively in the future.

Next, fundal images were obtained by a single mydriatic fundus camera platform (CR6-NM45, EOS-D60, Canon) under cycloplegic conditions. However, with the introduction of non-mydriatic fundus cameras, fundus image capture is now more widely available without the need for application of eyedrops for pupil dilation. Thus, further studies are required to assess the performance of our DLS across various fundus image systems including non-mydriatic fundus cameras. However, it should be noted that the performance of automated AI analysis on non-mydriatic and mydriatic camera platforms for screening of undiagnosed diabetic retinopathy has been shown to be highly comparable⁶⁴.

In addition, we were unable to achieve clinically acceptable performance in AI prediction for AL, particularly in the fundus image-only model, due to the small dataset available for training and testing with limited numbers of subjects with AL ≥ 26.5 mm at 5 years (internal validation = 36 eyes, external validation = 16 eyes). Further studies with larger datasets would be required to assess the performance of AL prediction.

In summary, we have developed a DLS using a baseline fundus image and objective clinical data (age, race, gender, baseline SE) to identify schoolchildren at risk of developing high myopia later on in their teenage years. Through early identification, targeted and timely myopia control therapies may be instituted to reduce the risk of developing high myopia in these children. The fundus image-only model may be implemented via integration into fundus camera systems or cloud-based computing. However, further external validation in various treatment naïve populations and using new non-mydriatic cameras will further strengthen the potential application of this AI system. Nonetheless, we present promising results from a DLS that addresses several clinical challenges faced by myopia evaluation and prevention programs, by reducing reliance on cycloplegic refraction, axial length measurements or repeated reviews. With further development of this AI system, it may be used as a clinical assistive tool to identify children “at risk” of developing high myopia with greater precision and introduce myopia control therapies if needed.

Methods

Study design

In this retrospective population-based study, high myopia was defined as spherical equivalent (SE) ≤ −6 D or axial length (AL) ≥ 26.5 mm^15,32. We predicted the development of each class using fundus images and/or clinical data. Three types of models were developed—image-only, clinical data-only and mixed (clinical + image). The models were trained, validated and tested using the Singapore Cohort of Risk factors for Myopia (SCORM) dataset^65,66,67.

The image-only models were developed using pre-processed fundus images and pre-trained DenseNet-121 deep neural network models. These image models were then used to generate image-based risk scores for each eye. The clinical data-only models utilized random forest to extract the relevant clinical features and generate a clinical data-based risk score for each eye. For the mixed model, the image-based scores, which represented clinical features extracted from image data, were combined with the clinical data-based scores to derive an overall mixed-model risk score.

The study was approved by the Ethics Committee at the Singapore Eye Research Institute and the Centralized Institutional Review Boards of the Singapore Health Services (2016/2215) and conducted in accordance with the tenets of the Declaration of Helsinki. Written informed consent was obtained from the parents after the nature of the study was explained.

Clinical training, validation, and testing datasets

In the SCORM study cohort, children from grades 1 to 3 were recruited from three Singapore schools (n = 1979) based on methodology previously described^65,66,67. The exclusion criteria included children with serious medical conditions or syndromes associated with myopia or any eye disorders at baseline. Questionnaires in the three most common languages (English, Chinese, and Malay) were administered to parents by a trained interviewer during the baseline visit. This was performed to obtain demographic data, including the number of parents with myopia⁶⁵. Parents were considered myopic if they required corrective lenses for distance vision. Data for this study were derived from 1979 children (aged 6–12 years) who attended the 2001 (baseline), 2002 (1-year follow up) and 2006 (5-year follow up) visits. Fundus imaging was only performed in 2001 (baseline). None of these children had myopia control treatment during the follow-up period. The primary dataset comprised 1666 subjects from schools 2 and 3, of which 701 subjects were excluded due to baseline high myopia and/or missing data (clinical/fundus image). For the primary validation, 965 subjects (1878 eyes) with 7456 retina images were included in training/validation and testing of the AI algorithm. The training/validation and testing datasets were split randomly using a 4:1 ratio, with 769 subjects (1502 eyes) in the training set and 196 subjects (376 eyes) in the test set (Fig. 5). The independent test dataset comprised 313 subjects from school 1, of which 214 subjects were excluded due to baseline high myopia and/or missing data (clinical/fundus image). For the external validation, 99 subjects (189 eyes) with 821 retina images were included.

Eye measurements and imaging

Annual cycloplegic refraction was performed for the participants. One drop of topical proparacaine 0.5% was first instilled, followed by three drops of 1% cyclopentolate instilled at 5 min intervals to achieve sufficient cycloplegic response. Cycloplegic autorefraction was then performed after an interval of at least 30 min after the last eye drop. This was performed using a table-mounted autorefractor (model RK5; Canon, Japan). In total, five measurements were performed per eye (ensuring that the maximum difference between readings was <0.25 D) and the mean was used for analysis. AL measurements were obtained using contact ultrasound biometry (Echoscan model US-800, probe frequency 10 MHz; Nidek Co., Ltd., Tokyo, Japan) after instillation of 1 drop of 0.5% proparacaine. The average of six measurements was taken and accepted only if the SD of these readings was less than 0.12 mm. Subsequently, the SE for each eye was calculated based on the formula sphere power plus half cylinder power. After pupil dilatation, digital retinal photographs centered on the optic disc were taken for both eyes using standardized settings (6.3 mega-pixel, resolution 3072 × 2048; CR6-NM45, EOS-D60; Canon USA, Lake Success, NY). Myopia was defined as an SE ≤ −0.5 D with low myopia (−3.0 D < SE ≤ −0.5 D), moderate myopia (−6.0 D < SE ≤ −3.0 D) and high myopia (SE ≤ −6 D). One-year mean SE and AL progression were calculated as SE at year 2002 visit minus SE at year 2001 (baseline) and AL at year 2002 visit minus AL at year 2001 (baseline), respectively.
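The SE formula, the myopia severity bands and the 1-year progression calculation described above can be illustrated with a minimal Python sketch. The function and parameter names are hypothetical and chosen for illustration only; they are not taken from the study's code.

```python
def spherical_equivalent(sphere_d: float, cylinder_d: float) -> float:
    """SE (in diopters) = sphere power + half the cylinder power."""
    return sphere_d + cylinder_d / 2.0


def myopia_category(se_d: float) -> str:
    """Map an SE value (in diopters) to the severity bands defined in the text."""
    if se_d <= -6.0:
        return "high"       # SE <= -6 D
    if se_d <= -3.0:
        return "moderate"   # -6.0 D < SE <= -3.0 D
    if se_d <= -0.5:
        return "low"        # -3.0 D < SE <= -0.5 D
    return "none"           # SE > -0.5 D


def one_year_progression(value_2002: float, value_2001: float) -> float:
    """Progression = measurement at the 2002 visit minus the 2001 baseline."""
    return value_2002 - value_2001
```

For instance, an eye with a sphere of −2.0 D and a cylinder of −1.0 D has an SE of −2.5 D and falls in the low myopia band under these definitions.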

Architecture of deep learning system

We utilized baseline childhood fundus photographs as the input for the development of the image-only models. All fundus photographs were first pre-processed into a square template to extract the central circular region, which then underwent contrast normalization. Pre-trained DenseNet-121 deep neural network models were employed, with a batch size of 16, an initial learning rate of 0.001, Nesterov momentum of 0.90 and categorical cross-entropy loss. In addition, image augmentation procedures were used, as follows, in order to increase the dataset size and to strengthen model generalizability: (1) rotation (clockwise by 0–180°, selected randomly), (2) horizontal flip, and (3) vertical flip. Brightness and scale adjustment were also performed for training inputs. Local contrast normalization of the retinal fundus photos took place after the central disc of the photo had been extracted to a template image of 512 × 512 pixels. The Contrast Limited Adaptive Histogram Equalization (CLAHE) method was then applied with a kernel size of 51 pixels. These networks were trained with two output nodes corresponding to the two target classes for SE (SE ≤ −6 D or > −6 D) and AL (AL < 26.5 mm or ≥ 26.5 mm). Training was performed until convergence on accuracy was demonstrated with the internal validation data. An early stop procedure was applied to avoid overfitting: the training was stopped if the loss on the validation set no longer decreased for 5 epochs.
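The early stop rule described above can be sketched in plain Python as follows. This is an illustration under stated assumptions rather than the training code used in the study: the function name is hypothetical, and "no longer decreased" is interpreted here as no improvement on the best validation loss seen so far.

```python
from typing import Iterable, Optional


def early_stop_epoch(val_losses: Iterable[float], patience: int = 5) -> Optional[int]:
    """Return the 1-based epoch at which training would stop, or None.

    Training stops once the validation loss has failed to improve on its
    best value for `patience` consecutive epochs (5 in the text).
    """
    best = float("inf")
    epochs_without_improvement = 0
    for epoch, loss in enumerate(val_losses, start=1):
        if loss < best:
            # New best validation loss: reset the patience counter.
            best = loss
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return epoch
    return None  # The loss kept improving often enough; no early stop.
```

Deep learning frameworks typically provide an equivalent callback; the standalone version here only makes the stopping criterion explicit.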

Saliency heatmaps were generated on the template images by employing the Integrated Gradients technique⁶⁸. The Integrated Gradients technique has the advantage of fulfilling theoretical sensitivity and implementation invariance axioms by design, and being able to generate pixel-level saliency estimates, unlike other popular visualization methods. The heatmaps were subsequently post-processed by applying thresholding techniques to the Integrated Gradients pixel-level outputs, thereby emphasizing the pixels with the highest saliency. This is followed by the application of an image morphology operation to collate these high-saliency pixels into larger representative regions.

Development of the clinical data-only models involved extraction of the relevant clinical feature scores for each eye. Subsequently, grid search with five-fold cross-validation was used to determine the optimal random forest hyperparameter values for each model. The hyperparameters were optimized for the number of estimators, the maximum number of features, the maximum depth and the split quality criterion. Once the optimal hyperparameter values were determined, the actual random forest model was trained on the entire training data.
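As a rough sketch of this tuning procedure, the example below runs a five-fold cross-validated grid search over the four named hyperparameters using scikit-learn. The synthetic data, the candidate values in `param_grid` and the `roc_auc` selection metric are all assumptions made for illustration; the source does not state the grid, the selection metric or the library used.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Synthetic stand-in for the per-eye clinical features (assumption: the real
# features are age, gender, race, parental myopia and baseline SE).
X, y = make_classification(
    n_samples=200, n_features=5, n_informative=3, n_redundant=0, random_state=0
)

# Hypothetical candidate values for the four hyperparameters named in the text.
param_grid = {
    "n_estimators": [25, 50],        # number of estimators
    "max_features": ["sqrt", None],  # maximum number of features per split
    "max_depth": [3, None],          # maximum tree depth
    "criterion": ["gini", "entropy"],  # split quality criterion
}

search = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid,
    cv=5,               # five-fold cross-validation
    scoring="roc_auc",  # assumption: the selection metric is not stated
)
search.fit(X, y)

# With the default refit=True, the best configuration is retrained on the
# entire training data, matching the last step described above.
final_model = search.best_estimator_
```

The default `refit=True` is what makes `best_estimator_` a model fitted on all of the training data rather than on a single fold.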

Preliminary experiments were also carried out using feature vectors of 10 and 100 nodes, respectively, extracted just before the output node layer. The final random forest model using only the output node values generally outperformed models using the larger numbers of features extracted from these earlier intermediate layers. The output node layer values were therefore used to develop the experimental models described below.

From the trained image-only models, image-based scores were generated for each eye. All images for an eye were first processed by the model to produce image-level scores. Each image-level score was calculated from the individual output node values by multiplying the value of each node by the index of the node (which corresponds to the severity of the condition) and then summing these products into a single representative score for that condition. In predicting myopia severity, the corresponding image model would predict four separate probabilities at its four output nodes: no myopia (node index 0), low myopia (node index 1), moderate myopia (node index 2) and high myopia (node index 3). These four probabilities were constrained to sum to 1 by the softmax function. To convert them into a single value for the final predicted severity of myopia, each individual probability was multiplied by its node index and the products were added together. For example, if an image was predicted to have no myopia with a probability of 1, its image-level score would be 0 (1 × 0). If an image was predicted to have moderate myopia with a probability of 0.5 and high myopia with a probability of 0.5, the resulting image-level score would be 2.5 (0.5 × 2 + 0.5 × 3). Next, the eye-level score was computed as the average of all image-level scores from the images of that particular eye. The resulting eye-level scores can then be considered as a clinical feature that is automatically obtained from image data. These scores were subsequently integrated with the clinical data-only models, in the same manner as other clinical features, during the development of mixed models.
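The scoring scheme above amounts to a probability-weighted sum of node indices followed by an average over images. A minimal sketch, assuming the softmax outputs are available as plain lists ordered by node index (the function names are illustrative and not taken from the study code):

```python
def image_level_score(probabilities):
    """Probability-weighted sum of node indices for one image.

    `probabilities` holds the softmax outputs ordered by node index:
    0 = no myopia, 1 = low, 2 = moderate, 3 = high myopia.
    """
    if abs(sum(probabilities) - 1.0) > 1e-6:
        raise ValueError("softmax outputs are expected to sum to 1")
    return sum(index * p for index, p in enumerate(probabilities))


def eye_level_score(per_image_probabilities):
    """Average of the image-level scores over all images of one eye."""
    scores = [image_level_score(p) for p in per_image_probabilities]
    return sum(scores) / len(scores)


# The two worked examples from the text.
no_myopia = [1.0, 0.0, 0.0, 0.0]          # score 0
moderate_or_high = [0.0, 0.0, 0.5, 0.5]   # score 0.5 * 2 + 0.5 * 3 = 2.5
eye_score = eye_level_score([no_myopia, moderate_or_high])
```

Treating these two hypothetical images as belonging to the same eye gives an eye-level score of 1.25, the mean of 0 and 2.5.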

Statistical analysis

For each of the AI algorithms for detection of high myopia, we compared predictions against the ground truth and calculated the area under the receiver operating characteristic (ROC) curve (AUC), accuracy, sensitivity and specificity for each classification threshold applied to the validation datasets. A classification threshold was set to achieve a pre-determined sensitivity and specificity of at least 75%. The three AI models, (1) fundus image-only, (2) clinical data-only and (3) mixed, were then tested on the independent testing datasets, with the previously determined output thresholds applied as operating points. Bootstrapping was used only to estimate the 95% confidence intervals (CIs) for the performance metrics of our classification results (i.e., AUC, sensitivity, specificity and accuracy). We applied an n-out-of-n bootstrap with replacement at eye level. For each bootstrap sample, we calculated and recorded the performance metrics. The bootstrap sampling was repeated 1000 times. We then estimated the 95% CI using the 2.5th and 97.5th percentiles of the empirical distribution of the corresponding metric. Confusion matrices were used to assess the differences in classification performance for each model, with rows and columns representing predicted and true results, respectively. Comparison of baseline characteristics amongst the three schools was performed using analysis of variance and the Chi-squared test for continuous and categorical variables, respectively. P values < 0.05 were considered statistically significant. All statistical analyses were performed using Python version 3.6.8 and SciPy version 1.5.4.
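The percentile bootstrap described above can be sketched for the AUC as follows. This is an illustration under stated assumptions rather than the study's implementation: the AUC is computed by pairwise comparison (ties counted as one half), resamples containing only one class are redrawn, and percentiles use linear interpolation. All function names are hypothetical.

```python
import random


def auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def percentile(sorted_values, q):
    """q-th percentile (0-100) of a sorted list, with linear interpolation."""
    position = (len(sorted_values) - 1) * q / 100.0
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] * (1 - fraction) + sorted_values[upper] * fraction


def bootstrap_auc_ci(labels, scores, n_boot=1000, seed=0):
    """95% CI from an n-out-of-n bootstrap with replacement over eyes."""
    rng = random.Random(seed)
    n = len(labels)
    estimates = []
    while len(estimates) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]  # resample eyes
        sample_labels = [labels[i] for i in idx]
        if len(set(sample_labels)) < 2:
            continue  # assumption: redraw single-class resamples
        estimates.append(auc(sample_labels, [scores[i] for i in idx]))
    estimates.sort()
    return percentile(estimates, 2.5), percentile(estimates, 97.5)
```

The same resampling loop applies to sensitivity, specificity and accuracy by swapping the metric function; each element of `labels` and `scores` is assumed to correspond to one eye, so that resampling happens at eye level.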

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data that support the findings of this study are available from the corresponding author, [MA], upon reasonable request.

Code availability

The algorithms of this study are available for download at: https://www.dropbox.com/s/n6nsmia4gwjghka/SCORM_code.zip?dl=0.

References

Pararajasegaram, R. VISION 2020-the right to sight: from strategies to action. Am. J. Ophthalmol. 128, 359–360 (1999).
Holden, B. A. et al. Myopia: a growing global problem with sight-threatening complications. Community Eye Health 28, 35 (2015).
GBD 2019 Blindness and Vision Impairment Collaborators, on behalf of the Vision Loss Expert Group of the Global Burden of Disease Study. Trends in prevalence of blindness and distance and near vision impairment over 30 years: an analysis for the Global Burden of Disease Study. Lancet Glob. Health 9, e130–e143 (2021).
Modjtahedi, B. S., Ferris, F. L. 3rd, Hunter, D. G. & Fong, D. S. Public health burden and potential interventions for myopia. Ophthalmology 125, 628–630 (2018).
Holden, B. A. et al. Global prevalence of myopia and high myopia and temporal trends from 2000 through 2050. Ophthalmology 123, 1036–1042 (2016).
Wong, T. Y., Ferreira, A., Hughes, R., Carter, G. & Mitchell, P. Epidemiology and disease burden of pathologic myopia and myopic choroidal neovascularization: an evidence-based systematic review. Am. J. Ophthalmol. 157, 9–25 e12 (2014).
Ikuno, Y. Overview of the complications of high myopia. Retina 37, 2347–2351 (2017).
Morgan, I. G., Ohno-Matsui, K. & Saw, S. M. Myopia. Lancet 379, 1739–1748 (2012).
Naidoo, K. S. et al. Potential lost productivity resulting from the global burden of myopia: systematic review, meta-analysis, and modeling. Ophthalmology 126, 338–346 (2019).
Chua, S. Y. et al. Age of onset of myopia predicts risk of high myopia in later childhood in myopic Singapore children. Ophthalmic Physiol. Opt. J. Br. Coll. Ophthalmic Opticians 36, 388–394 (2016).
Hu, Y. et al. Association of age at myopia onset with risk of high myopia in adulthood in a 12-year follow-up of a Chinese cohort. JAMA Ophthalmol. 138, 1129–1134 (2020).
Jensen, H. Myopia in teenagers. An eight-year follow-up study on myopia progression and risk factors. Acta Ophthalmol. Scand. 73, 389–393 (1995).
Gwiazda, J. et al. Factors associated with high myopia after 7 years of follow-up in the Correction of Myopia Evaluation Trial (COMET) cohort. Ophthalmic Epidemiol. 14, 230–237 (2007).
Parssinen, O. & Kauppinen, M. Risk factors for high myopia: a 22-year follow-up study from childhood to adulthood. Acta Ophthalmol. 97, 510–518 (2019).
Lanca, C. et al. Rapid myopic progression in childhood is associated with teenage high myopia. Investig. Ophthalmol. Vis. Sci. 62, 17 (2021).
Liu, L. et al. Relationship between myopia progression and school entrance age: a 2.5-year longitudinal study. J. Ophthalmol. 2021, 7430576 (2021).
Lanca, C. et al. New polygenic risk score to predict high myopia in Singapore Chinese children. Transl. Vis. Sci. Technol. 10, 26 (2021).
Ghorbani Mojarrad, N. et al. Association between polygenic risk score and risk of myopia. JAMA Ophthalmol. 138, 7–13 (2020).
Chen, Y. et al. Contribution of genome-wide significant single nucleotide polymorphisms in myopia prediction: findings from a 10-year cohort of Chinese twin children. Ophthalmology 126, 1607–1614 (2019).
Lin, H. et al. Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study. PLoS Med. 15, e1002674 (2018).
Resnikoff, S. et al. Estimated number of ophthalmologists worldwide (International Council of Ophthalmology update): will we meet the needs? Br. J. Ophthalmol. 104, 588–592 (2020).
WHO. The impact of myopia and high myopia, <https://www.who.int/blindness/causes/MyopiaReportforWeb.pdf> (2015).
Chen, M. et al. The increasing prevalence of myopia and high myopia among high school students in Fenghua city, eastern China: a 15-year population-based survey. BMC Ophthalmol. 18, 159 (2018).
Guo, Y. et al. High myopia in Greater Beijing School Children in 2016. PloS One 12, e0187396 (2017).
Jung, S. K., Lee, J. H., Kakizaki, H. & Jee, D. Prevalence of myopia and its association with body stature and educational level in 19-year-old male conscripts in Seoul, South Korea. Investig. Ophthalmol. Vis. Sci. 53, 5579–5583 (2012).
Yotsukura, E. et al. Current prevalence of myopia and association of myopia with environmental factors among schoolchildren in Japan. JAMA Ophthalmol. 137, 1233–1239 (2019).
Ang, M. et al. Review: Myopia control strategies recommendations from the 2018 WHO/IAPB/BHVI meeting on myopia. Br. J. Ophthalmol. 104, 1482–1487 (2020).
Liu, Y. M. & Xie, P. The safety of orthokeratology—a systematic review. Eye Contact Lens 42, 35–42 (2016).
Gong, Q. et al. Efficacy and adverse effects of atropine in childhood myopia: a meta-analysis. JAMA Ophthalmol. 135, 624–630 (2017).
Sankaridurg, P. et al. IMI impact of myopia. Investig. Ophthalmol. Vis. Sci. 62, 2 (2021).
Gifford, K. L. et al. IMI—clinical management guidelines report. Investig. Ophthalmol. Vis. Sci. 60, M184–M203 (2019).
Flitcroft, D. I. et al. IMI—defining and classifying myopia: a proposed set of standards for clinical and epidemiologic studies. Investig. Ophthalmol. Vis. Sci. 60, M20–M30 (2019).
Chen, Y., Wang, W., Han, X., Yan, W. & He, M. What twin studies have taught us about myopia. Asia-Pac. J. Ophthalmol. 5, 411–414 (2016).
Lim, D. H. et al. The high prevalence of myopia in Korean children with influence of parental refractive errors: the 2008-2012 Korean National Health and Nutrition Examination Survey. PloS One 13, e0207690 (2018).
Low, W. et al. Family history, near work, outdoor activity, and myopia in Singapore Chinese preschool children. Br. J. Ophthalmol. 94, 1012–1016 (2010).
Yam, J. C. et al. High prevalence of myopia in children and their parents in Hong Kong Chinese Population: the Hong Kong Children Eye Study. Acta Ophthalmol. https://doi.org/10.1111/aos.14350 (2020).
He, M. et al. Effect of time spent outdoors at school on the development of myopia among children in China: a randomized clinical trial. JAMA 314, 1142–1148 (2015).
Huang, H. M., Chang, D. S. & Wu, P. C. The association between near work activities and myopia in children-a systematic review and meta-analysis. PloS One 10, e0140419 (2015).
Jin, J. X. et al. Effect of outdoor activity on myopia onset and progression in school-aged children in northeast China: the Sujiatun Eye Care Study. BMC Ophthalmol. 15, 73 (2015).
Mutti, D. O., Mitchell, G. L., Moeschberger, M. L., Jones, L. A. & Zadnik, K. Parental myopia, near work, school achievement, and children’s refractive error. Investig. Ophthalmol. Vis. Sci. 43, 3633–3640 (2002).
Wu, P. C. et al. Myopia prevention and outdoor light intensity in a school-based cluster randomized trial. Ophthalmology 125, 1239–1250 (2018).
Huang, J. et al. Efficacy comparison of 16 interventions for myopia control in children: a network meta-analysis. Ophthalmology 123, 697–708 (2016).
Weiss, R. S. & Park, S. Recent updates on myopia control: preventing progression 1 diopter at a time. Curr. Opin. Ophthalmol. 30, 215–219 (2019).
Chia, A. et al. Atropine for the treatment of childhood myopia: safety and efficacy of 0.5%, 0.1%, and 0.01% doses (Atropine for the Treatment of Myopia 2). Ophthalmology 119, 347–354 (2012).
Li, F. F. & Yam, J. C. Low-concentration atropine eye drops for myopia progression. Asia-Pac. J. Ophthalmol. 8, 360–365 (2019).
Sacchi, M. et al. Efficacy of atropine 0.01% for the treatment of childhood myopia in European patients. Acta Ophthalmol. https://doi.org/10.1111/aos.14166 (2019).
Joachimsen, L. et al. A pilot study on the efficacy and safety of 0.01% atropine in German schoolchildren with progressive myopia. Ophthalmol. Ther. 8, 427–433 (2019).
Pineles, S. L. et al. Atropine for the prevention of myopia progression in children: a report by the American Academy of Ophthalmology. Ophthalmology 124, 1857–1866 (2017).
COMET Group. Myopia stabilization and associated factors among participants in the Correction of Myopia Evaluation Trial (COMET). Investig. Ophthalmol. Vis. Sci. 54, 7871–7884 (2013).
Zhu, D. et al. Pre- and postcycloplegic refractions in children and adolescents. PloS One 11, e0167628 (2016).
Azizoglu, S. et al. Evidence for the need for vision screening of school children in Turkey. BMC Ophthalmol. 17, 230 (2017).
Seet, B. et al. Myopia in Singapore: taking a public health approach. Br. J. Ophthalmol. 85, 521–526 (2001).
Preedy, V. R. & Hammond, B. An investigation into the effects of aminoguanidine treatment on the plasma and blood of free-fed and dietary-restricted rats. J. Pharm. Pharmacol. 43, 337–341 (1991).
The impact of myopia and high myopia. Report of the Joint World Health Organization - Brien Holden Vision Institute Global Scientific Meeting on Myopia, <https://www.who.int/blindness/causes/MyopiaReportforWeb.pdf> (2015).
World Health Organization, World report on vision. (2019).
Pascolini, D. & Mariotti, S. P. Global estimates of visual impairment: 2010. Br. J. Ophthalmol. 96, 614–618 (2012).
Kovin Naidoo, P. G.-P., Priya Morjaria et al. Global Mapping of Optometry Workforce. PREPRINT (Version 1) available at Research Square [https://doi.org/10.21203/rs.3.rs-1286073/v1] (2022).
Dabasia, P. L., Edgar, D. F., Garway-Heath, D. F. & Lawrenson, J. G. A survey of current and anticipated use of standard and specialist equipment by UK optometrists. Ophthalmic Physiol. Opt. 34, 592–613 (2014).
Keane, P. A. & Topol, E. J. With an eye to AI and autonomous diagnosis. NPJ Digit. Med. 1, 40 (2018).
Bellemo, V. et al. Artificial intelligence screening for diabetic retinopathy: the real-world emerging application. Curr. Diabetes Rep. 19, 72 (2019).
Nguyen, H. V. et al. Cost-effectiveness of a national telemedicine diabetic retinopathy screening program in Singapore. Ophthalmology 123, 2571–2580 (2016).
Khou, V. et al. Evaluation of the initial implementation of a nationwide diabetic retinopathy screening programme in primary care: a multimethod study. BMJ Open 11, e044805 (2021).
Varadarajan, A. V. et al. Deep learning for predicting refractive error from retinal fundus images. Investig. Ophthalmol. Vis. Sci. 59, 2861–2868 (2018).
Bawankar, P. et al. Sensitivity and specificity of automated analysis of single-field non-mydriatic fundus photographs by Bosch DR Algorithm-Comparison with mydriatic fundus photography (ETDRS) for screening in undiagnosed diabetic retinopathy. PloS One 12, e0189854 (2017).
Saw, S. M. et al. Incidence and progression of myopia in Singaporean school children. Investig. Ophthalmol. Vis. Sci. 46, 51–57 (2005).
Saw, S. M. et al. Nearwork in early-onset myopia. Investig. Ophthalmol. Vis. Sci. 43, 332–339 (2002).
Saw, S. M. et al. A cohort study of incident myopia in Singaporean children. Investig. Ophthalmol. Vis. Sci. 47, 1839–1844 (2006).
Sundararajan, M., Taly, A. & Yan, Q. Axiomatic attribution for deep networks. In: Proceedings of the International Conference on Machine Learning. pp 3319–3328 (PMLR, 2017).

Acknowledgements

This work is supported by National Medical Research Council Individual Research Grant (NMRC/0975/2005), National Medical Research Council Center Grant (NMRC/CG/C010A/2017_SERI) and Nurturing Clinician Researcher Scheme Program Grant Award (05/FY2021/P2/11-A92).

Author information

These authors contributed equally: Li Lian Foo, Gilbert Yong San Lim.
These authors jointly supervised this work: Daniel S. W. Ting, Seang-Mei Saw, Marcus Ang.

Authors and Affiliations

Singapore National Eye Centre, Singapore Eye Research Institute, Singapore, Singapore
Li Lian Foo, Gilbert Yong San Lim, Chee Wai Wong, Quan V. Hoang, Leopold Schmetterer, Audrey Chia, Tien Yin Wong, Daniel S. W. Ting, Seang-Mei Saw & Marcus Ang
Duke-NUS Medical School, National University of Singapore, Singapore, Singapore
Li Lian Foo, Chee Wai Wong, Quan V. Hoang, Leopold Schmetterer, Audrey Chia, Tien Yin Wong, Daniel S. W. Ting, Seang-Mei Saw & Marcus Ang
Escola Superior de Tecnologia da Saúde de Lisboa (ESTeSL), Instituto Politécnico de Lisboa, Lisboa, Portugal
Carla Lanca
Comprehensive Health Research Center (CHRC), Escola Nacional de Saúde Pública, Universidade Nova de Lisboa, Lisboa, Portugal
Carla Lanca
Asia Pacific Eye Centre, Gleneagles Hospital, Singapore, Singapore
Chee Wai Wong
Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
Quan V. Hoang
Dept. of Ophthalmology, Columbia University, Columbia, SC, USA
Quan V. Hoang
Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong, China
Xiu Juan Zhang & Jason C. Yam
Hong Kong Eye Hospital, Hong Kong, China
Jason C. Yam
Department of Ophthalmology and Visual Sciences, Prince of Wales Hospital, Hong Kong, China
Jason C. Yam
Hong Kong Hub of Paediatric Excellence, The Chinese University of Hong Kong, Hong Kong, China
Jason C. Yam
Department of Ophthalmology, Hong Kong Children’s Hospital, Hong Kong, China
Jason C. Yam

Contributions

L.L.F., G.Y.S.L., D.S.W.T., C.L., C.W.W., Q.V.H., X.J.Z., J.C.Y., L.S., A.C., T.Y.W., S.M.S., and M.A. conceptualized the manuscript, researched its contents, wrote the manuscript and edited all revisions. L.L.F., G.Y.S.L., D.S.W.T. and M.A. were involved in the development of the DL model. All authors had full access to all the data in the study and accept responsibility to submit for publication.

Corresponding authors

Correspondence to Seang-Mei Saw or Marcus Ang.

Ethics declarations

Competing interests

D.T. is an Associate Editor of the journal and played no role in the peer review of this manuscript. D.S.W.T. and T.Y.W. hold a patent for a method of modifying a retina fundus image for a deep learning model issued to EyRiS (WO2019022663). The solution, Singapore Eye Lesion Analyser (SELENA), is associated with the start-up company EyRis and the patent issued to EyRis (WO2019022663). The remaining authors do not have competing interests to declare.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Reporting Summary

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Foo, L.L., Lim, G.Y.S., Lanca, C. et al. Deep learning system to predict the 5-year risk of high myopia using fundus imaging in children. npj Digit. Med. 6, 10 (2023). https://doi.org/10.1038/s41746-023-00752-8

Received: 20 September 2022
Accepted: 10 January 2023
Published: 26 January 2023
DOI: https://doi.org/10.1038/s41746-023-00752-8
