Training data size and predication errors in the use of machine-learning assisted intraocular lens power calculation

Tabuchi, Hitoshi; Yamauchi, Tomofusa; Shojo, Tomohiro; Takase, Kosuke; Tanabe, Mao

doi:10.1038/s41598-023-38616-6

Download PDF

Article
Open access
Published: 13 July 2023

Training data size and predication errors in the use of machine-learning assisted intraocular lens power calculation

Hitoshi Tabuchi^1,2,
Tomofusa Yamauchi²,
Tomohiro Shojo²,
Kosuke Takase² &
…
Mao Tanabe²

Scientific Reports volume 13, Article number: 11348 (2023) Cite this article

736 Accesses
Metrics details

Subjects

Abstract

This retrospective study examined the effect of the size of training data on the accuracy of machine learning-assisted SRK/T power calculation. Clinical records of 4800 eyes of 4800 Japanese patients with intraocular lenses (IOLs) were reviewed. A support vector regressor (SVR) was used for refining the SRK/T formula, and dataset sizes for training and evaluation were reduced from full to 1/64. The prediction errors from the postoperative refractions were calculated, and the proportion within ± 0.25 D, ± 0.50 D, and ± 1.00 D of errors were compared with those using full data. The influence of the difference in A-constant was also evaluated. Prediction errors within ± 0.50 D in the use of full data were obtained with the dataset of ≥ 150 eyes (P = 0.016), whereas the datasets of ≥ 300 eyes were required for the error within ± 0.25 D (P < 0.030). The prediction errors did not alter with the A-constant values among IOLs with open-loop haptics, except for IOLs with plated haptics. In conclusion, the accuracy of SVR-assisted SRK/T could be achieved with the training dataset of ≥ 150 eyes for the Japanese population, and the calculation was versatile for any open-looped IOLs.

Accuracy of a new intraocular lens power calculation method based on artificial intelligence

Article 28 April 2020

An ensemble-based approach for estimating personalized intraocular lens power

Article Open access 25 November 2021

Development of a new intraocular lens power calculation method based on lens position estimated with optical coherence tomography

Article Open access 16 April 2020

Introduction

In the use of premium intraocular lenses (IOLs) for astigmatism and presbyopia corrections, accurate IOL power calculation for postoperative emmetropia is necessary for IOL functions. Although postoperative refractive errors within ± 1.0 D could be obtained in 93% of eyes using third- to fourth-generation calculation formulas such as the SRK/T and Haigis formula¹, accuracy of > 90% within the absolute errors of 0.5 D is desired for patients undergoing premium IOL implantation. Thus, advanced calculation methods, such as the Barrett Universal II (BUII)², Hill-radial basis function (Hill-RBF)³, and Kane formula⁴, have been used, and several publications have reported their superiority^5,6,7. New-generation formulas enable higher accuracy by adding more biometric measurements such as lens thickness and corneal diameter, utilizing a complex model of ocular geometry, and utilizing machine learning with a large dataset.

As most of the advanced calculations are based on the biometry of Caucasian eyes, performances could be inherently altered by patients’ ethnicity, race, and region. The alternations for patient groups of a site have been adjusted with the constants of third- to fourth-generation formulas, such as the A-constant. However, such optimization is not available for advanced calculations⁸. Recently, we demonstrated that the use of machine learning with the SRK/T formula effectively improved the power calculation accuracy for a patient group⁹. Predicted refractions derived from the SRK/T formula were adjusted with support vector regression (SVR) machine learning. The SVR nonlinearly provided a regression equation in which the total errors of the training data outside a certain margin from the equation were minimized¹⁰ and were suitable for IOL power calculation¹¹. With training data of 1211 eyes, the prediction errors were less than that with BUII for patients in the Kyushu Island of Japan⁹. Adaptation was achieved using a small size of training data by incorporating SRK/T; however, how much training data are required for a specific accuracy is not certain. Thus, this retrospective study aimed to assess the effect of training data size on the accuracy of IOL power calculation and evaluate the influence of the difference in A-constants.

Methods

Participates

This retrospective study was approved by the Institutional Ethics Committee of Tsukazaki Hospital (Approval No. 181011) and adhered to the tenets of the Declaration of Helsinki. For all participants, the use of clinical records related to cataract surgery was approved as stated in the informed consent obtained before surgery. Clinical records of consecutive patients who underwent cataract surgery with IOL implantation between September 2017 and April 2021 were reviewed. The inclusion criteria were as follows: no history of refractive surgery, postoperative corrected distance visual acuity (CDVA) of 16/20 in Snellen or better, and optimized constants of implanted IOLs. For bilateral implantation, an eye with regular and mild astigmatism was selected for the analysis.

Preoperative axial length (AXL), corneal radius (CR), anterior chamber depth (ACD), lens thickness (LT), and white-to-white distance (WTW) were measured using a swept-source biometer IOLMaster 700 (Carl Zeiss, Oberkochen, Germany). IOL power was determined using the SRK/T formula, and all IOLs were implanted in the capsule without complications. Three months after surgery, the manifest refraction spherical equivalent (MRSE) was measured during the examination for CDVA.

Machine learning-assisted power calculation

SVR was used to enhance the accuracy of the SRK/T formula⁹. Initially, predicted postoperative refractions were obtained using the SRK/T with biometry measurements of AXL and CR and an optimized A-constant. The predicted postoperative refractions were refined for the patient group with additional inputs of AXL, CR, ACD, LT, and WTW. The SVR with an RBF kernel was trained by using the “scikit-learn” library (https://scikit-learn.org/stable/modules/svm.html#svm-regression) in Python 3. Hyperparameters such as the C-constant and shape parameter γ of the kernel function were tuned using a grid search for avoiding overfitting¹¹.

To evaluate the effect of training data sizes on calculation accuracy, a dataset of the participants was randomly divided into five groups. Initially, four groups were used for SVR training to refine the accuracy of the predicted postoperative refractions, and the remaining group was used to evaluate the trained calculator. As shown in Fig. 1, the groups used for training were rearranged four times to obtain evaluation results for all data. Then, size of the dataset was reduced by half and divided into five groups, and training and evaluation were conducted similarly. The dataset had been divided by two until the size was 1/64 of the original size. When the original data size was 4800 eyes, training and evaluation were conducted with datasets of 4800, 2400, 1200, 600, 300, 150, and 75 eyes.

Analysis

To assess the accuracy for each training data size from the SVR, prediction errors of the predicted postoperative refractions from MRSE were obtained, and its means and standard deviations (SDs) were calculated. The median of the absolute prediction error (MedAE) was also obtained. Changes in the mean prediction errors with the dataset sizes were examined using the analysis of variance (ANOVA), followed by Holm’s multiple comparisons in the presence of a significant change. The proportion of eyes within prediction errors ± 0.25 D, ± 0.50 D, and ± 1.00 D was calculated, and differences from the use of full data were examined using the Chi-squared test.

The influence of eyes with long AXL (> 26.0 mm) was also compared with those of eyes with normal AXL (between 22.0 and 26.0 mm). Owing to the limited sample size (178 eyes), eyes with short AXL (< 22.0 mm) were not analyzed. The prediction errors were compared using a t-test, and proportions within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors were compared using the Chi-squared test.

To investigate whether the calculator would accommodate various IOL models, the influence of A-constants on prediction errors was also evaluated. Prediction errors were compared between four groups according to the ranges of A-constants, such as ≤ 119.0, 119.0–119.2, 119.2–119.4, and 119.4–119.6, using ANOVA following the Tukey multiple comparison. P < 0.05 was considered a statistically significant difference.

Ethics approval and consent to participate

This retrospective study was approved by the Institutional Ethics Committee of Tsukazaki Hospital (Approval No. 181011) and adhered to the tenets of the Declaration of Helsinki. For all participants, the use of clinical records related to cataract surgery was approved as stated in the informed consent obtained before surgery.

Consent to publish

Name and other personally identifiable information were removed from all text/figures/tables/images.

Results

Clinical records of 4800 eyes from 4800 eligible patients were available. The mean age of the patients was 71.5 (SD 8.4) years, and there were 2195 men and 2605 women. The preoperative mean AXL, CR, and ACD were 24.0 (SD 1.5; range 20.5–30.5) mm, 7.63 (SD 0.26; range 6.72–8.54) mm, and 3.11 (SD 0.40; range 1.75–4.62) mm, respectively. The LT and WTW were 4.53 (SD 0.46) mm and 11.7 (SD 0.4) mm, respectively. The implanted IOLs and A-constants used are listed in Table 1. The power of the implanted IOLs ranged from 5.0 to 30.0 D, and the mean power was 19.4 (SD 4.0) D for targeting refractions between − 7.42 D and 1.13 D (mean − 0.20 D). The mean postoperative MRSE was − 0.18 (SD 0.90) D, and the CDVA was − 0.11 (SD 0.08) logMAR.

Table 1 Demographic data of subjects.

Full size table

Table 2 shows the mean prediction errors, MedAE, and proportions within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors in the use of SVR-assisted calculation for seven dataset sizes. The mean prediction errors did not change with the data size (P = 1.00, ANOVA), whereas the SD values increase compared with the overall data of 4800 when the data size were ≤ 300 (P < 0.027, F-test). Figure 2 shows the change in proportions within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors with the dataset size. Compared with the results using overall data, the proportions within ± 0.50 D error for the dataset of 75 eyes were significantly low (P = 0.016, Chi-squared test). For errors within ± 0.25 D, the use of datasets of 75 and 150 eyes resulted in a lower proportion (P = 0.014 and 0.030, respectively). In comparison with the results of SRK/T only (N = 4800 eyes), the proportion within ± 0.50 D error was higher when the dataset size was ≥ 150, whereas it was lower for the size of 75 (P < 0.001).

Table 2 Refractive errors of SVR-assisted calculations with data set sizes from 4800 to 75 eyes.

Full size table

The influence of long eyes (AXL > 26.0 mm) was evaluated in comparison with normal eyes (AXL of 22.0–26.0 mm). Table 3 lists refractive errors for long and normal eyes. In the mean prediction errors, no differences were found for all dataset sizes (P > 0.19, t-test). Within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors, the proportions in long eyes were significantly less for a dataset size of 75 eyes (P < 0.003).

Table 3 Refractive errors of SVR-assisted calculations for long and normal eyes.

Full size table

Changes in the prediction error with the A-constant used were examined. In this study, 603, 1109, 1442, and 1646 eyes had A-constants of implanted IOLs of ≤ 119.0, 119.0–119.2, 119.2–119.4, and 119.4–119.6, respectively. As seen in Table 1, the A-constants of ≤ 119.0 consisted of only a single type of IOLs (LS-313 MF15 and LS-313 MF15T) of hydrophilic acrylic material with plated haptics, whereas other IOLs had open-loop haptics with various materials. Table 4 shows the mean prediction errors, MedAE, and proportions within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors. The mean prediction error for A-constants of ≤ 119.0 significantly shifted to hyperopia compared with A-constants of 119.2–119.4 (P = 0.0041, Tukey multiple comparisons), whereas no change was observed among IOLs of A-constants of 119.0–119.6.

Table 4 Refractive errors in the ranges of A-constants with the SVR-assisted calculations (Data set size of 4800 eyes).

Full size table

Discussion

The use of SVR with the SRK/T formula improved the accuracy of IOL power calculations, and the accuracy did not degrade when the dataset size for SVR training was ≥ 150 within ± 0.50 D errors. The calculator was versatile for any IOLs with an open loop. In the analysis by Aristodemou et al. using data from 8180 eyes and conventional statistical techniques, data from 243 eyes would be required to optimize each A-constant, and the accuracy increases with the sample size¹². In the current results, the accuracy remained until the dataset size of 300. This superior performance with a small sample size would result from the use of nonlinear SVR. In addition, the refining of SRK/T outputs accommodated the IOL with A-constants of 119.0–119.6. Previous assessments of machine-learning power calculations used multiple types of IOLs for trainings^11,13; however, the difference in IOLs were not examined. Our results indicated that the calculator accommodated most of the one-piece hydrophobic acrylic IOLs with open haptics.

While the mean prediction errors and MedAE did not change with the dataset size, the variance increases when the size was ≤ 150. As a result, the accuracy within ± 0.25 D errors was lower when the dataset size was ≤ 150. For attaining high accuracy, data of ≥ 300 eyes would be preferred. Thanks to the accommodation of multiple IOL types for training, such dataset size would be acceptable for optimization for a patient group at each site or surgeon.

In the comparison between long and normal eyes, significant differences were found in the use of 75-eye dataset. Similarly, the use of small datasets results in lower performance in the proportion within ± 0.5 D and ± 0.25 D errors. One of the factors would be limited coverage of datasets; thus, accommodating eyes with long or short AXL and minor IOL design would be difficult. In the current analysis, ≥ 150 eyes were the least recommended for Japanese patients in the territory of the site. To provide favorable postoperative outcomes, collecting data from patients within each territory would be better.

In the comparison of A-constants, only a particular IOL type showed lower outcomes. This IOL was extended depth-of-focus, made of hydrophilic acrylic material, and equipped with plated haptics. Compared with other IOL types of one-piece and open-loop haptics, the mean prediction errors were significantly and slightly shifted to hyperopic. As the shifting of the IOLs posteriorly resulted in hyperopic errors¹⁴, bending of plated haptics due to capsule contraction would induce this prediction error. Further investigation is required. Except for a particular IOL model, the current machine learning-assisted power calculation improved the accuracy for the A-constants in the range of 119.0–119.6, whereas the training dataset insisted on data with multiple IOL models. This finding would be attributed to the use of SRK/T outputs and optimized A-constants; thus, optimized IOL power calculation for our patient group with a limited training dataset would be beneficial. Further investigation is necessary to verify this speculation.

This study has some limitations. First, owing to the retrospective design, the topographic data of the cornea was not measured. Refractive powers of the cornea were obtained from the powers of the anterior (keratometric) and posterior surfaces and corneal thickness. Thus, the influence of the posterior cornea could not be evaluated. Further evaluation with the use of a rotational Scheimpflug camera or optical coherence tomography¹⁵ is necessary for more accurate power calculation. In addition, the influence of the asphericity of the cornea¹⁶ should be examined. Moreover, multiple IOLs are available for training and evaluation. As per the guideline presented⁸, an IOL power calculation was evaluated for a single IOL model. In the previous assessment of the same calculation with 1611 eyes with SN60WF alone, the mean prediction error was 0.01 (SD, 0.38) D, and the proportions within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors were 54.4%, 83.5%, and 98.5%, respectively⁹, which were slightly better than the current results. As expected, a higher accuracy would be obtained by selecting the IOL type routinely used. In other cases, the range of the dataset was determined by the biometry of limited patients within the territory. Hence, there would be patients who would be out of the range of the dataset used for the training. Ideally, a dataset includes a heterogeneous cohort of patients as much as possible; however, this is not practical. However, indicating the minimum requirement for a clinical situation would be important. Finally, implementing the proposed calculation in clinical practice is not easy, since the calculator works in Python 3. To examine the effectiveness of the proposed calculator in other sites, an environment in which a user-friendly calculator can be used through the web is warranted.

Conclusions

This study using data from 4800 eyes revealed that the accuracy of SVR-assisted SRK/T power calculation could be achieved with the training dataset of ≥ 150 within ± 0.50 D errors for the Japanese population. The calculation was versatile for any open-looped IOL models.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.

References

Behndig, A. et al. Aiming for emmetropia after cataract surgery: Swedish National Cataract Register study. J. Cataract Refract. Surg. 38, 1181–1186 (2012).
Article PubMed Google Scholar
Barrett, G. D. An improved universal theoretical formula for intraocular lens power prediction. J. Cataract Refract. Surg. 19, 713–720 (1993).
Article CAS PubMed Google Scholar
Hill, W. E. Hill-RBF (Radial Basis Function) calculator version 3.0. https://rbfcalculator.com (2016).
Connell, B. J. & Kane, J. X. Comparison of the Kane formula with existing formulas for intraocular lens power selection. BMJ Open Ophthalmol. 4, e000251 (2019).
Article PubMed PubMed Central Google Scholar
Roberts, T. V., Hodge, C., Sutton, G. & Lawless, M. Contributors to the Vision Eye Institute IOL outcomes registry. Comparison of Hill-radial basis function, Barrett Universal and current third generation formulas for the calculation of intraocular lens power during cataract surgery. Clin. Exp. Ophthalmol. 46, 240–246 (2018).
Article PubMed Google Scholar
Melles, R. B., Holladay, J. T. & Chang, W. J. Accuracy of intraocular lens calculation formulas. Ophthalmology 125, 169–178 (2018).
Article PubMed Google Scholar
Darcy, K., Gunn, D., Tavassoli, S., Sparrow, J. & Kane, J. X. Assessment of the accuracy of new and updated intraocular lens power calculation formulas in 10930 eyes from the UK National Health Service. J. Cataract Refract. Surg. 46, 2–7 (2020).
PubMed Google Scholar
Hoffer, K. J. & Savini, G. Update on intraocular lens power calculation study protocols: The better way to design and report clinical trials. Ophthalmology 128, e115–e120 (2021).
Article PubMed Google Scholar
Mori, Y. et al. Machine learning adaptation of intraocular lens power calculation for a patient group. Eye Vis. (Lond.) 8, 42. https://doi.org/10.1186/s40662-021-00265-z (2021).
Article PubMed Google Scholar
Drucker, H., Burges, C., Kaufman, L., Smola, A. & Vapnik, V. Support vector regression machines. Adv. Neural Inf. Process. Syst. 9, 155–161 (1996).
Google Scholar
Yamauchi, T., Tabuchi, H., Takase, K. & Masumoto, H. Use of a machine learning method in predicting refraction after cataract surgery. J. Clin. Med. 10, 1103. https://doi.org/10.3390/jcm10051103 (2021).
Article PubMed PubMed Central Google Scholar
Aristodemou, P., Knox Cartwright, N. E., Sparrow, J. M. & Johnston, R. L. Intraocular lens formula constant optimization and partial coherence interferometry biometry: Refractive outcomes in 8108 eyes after cataract surgery. J. Cataract Refract. Surg. 37, 50–62 (2011).
Article PubMed Google Scholar
Carmona González, D. & Palomino Bautista, C. Accuracy of a new intraocular lens power calculation method based on artificial intelligence. Eye (London) 35, 517–522 (2021).
Article Google Scholar
Miyata, K., Kataoka, Y., Matsunaga, J., Honbo, M. & Minami, K. Prospective comparison of one-piece and three-piece tecnis aspheric intraocular lenses: 1-year stability and its effect on visual function. Curr. Eye Res. 40, 930–935 (2015).
Article PubMed Google Scholar
Swartz, T., Marten, L. & Wang, M. Measuring the cornea: The latest developments in corneal topography. Curr. Opin. Ophthalmol. 18, 325–333 (2007).
Article PubMed Google Scholar
Savini, G., Hoffer, K. J., Barboni, P., Schiano Lomoriello, D. & Ducoli, P. Corneal asphericity and IOL power calculation in eyes with aspherical IOLs. J. Refract. Surg. 33, 476–481 (2017).
Article PubMed Google Scholar

Download references

Acknowledgements

Dr. Keiichiro Minami, Ph.D. (KK. Evidence Slyme) helps in medical writing and preparation of this article.

Author information

Authors and Affiliations

Department of Technology and Design Thinking for Medicine, Hiroshima University, 1-2-3 Kasumi, Minami-ku, Hiroshima, 734-8553, Japan
Hitoshi Tabuchi
Department of Ophthalmology, Tsukazaki Hospital, 68-1 Waku, Aboshi-ku, Himeji, Hyogo, 671-1227, Japan
Hitoshi Tabuchi, Tomofusa Yamauchi, Tomohiro Shojo, Kosuke Takase & Mao Tanabe

Authors

Hitoshi Tabuchi
View author publications
You can also search for this author in PubMed Google Scholar
Tomofusa Yamauchi
View author publications
You can also search for this author in PubMed Google Scholar
Tomohiro Shojo
View author publications
You can also search for this author in PubMed Google Scholar
Kosuke Takase
View author publications
You can also search for this author in PubMed Google Scholar
Mao Tanabe
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.T. and T.Y. wrote the main manuscript text. H.T and T.Y. designed the study. H.T. created the database. T.Y. and M.T. programmed the model. T.S. and K.T. collected data.

Corresponding author

Correspondence to Tomofusa Yamauchi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tabuchi, H., Yamauchi, T., Shojo, T. et al. Training data size and predication errors in the use of machine-learning assisted intraocular lens power calculation. Sci Rep 13, 11348 (2023). https://doi.org/10.1038/s41598-023-38616-6

Download citation

Received: 26 December 2022
Accepted: 11 July 2023
Published: 13 July 2023
DOI: https://doi.org/10.1038/s41598-023-38616-6

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.