An international comparison of retinopathy of prematurity grading performance within the Benefits of Oxygen Saturation Targeting II trials

Fleck, B W; Williams, C; Juszczak, E; Cocker, K; Stenson, B J; Darlow, B A; Dai, S; Gole, G A; Quinn, G E; Wallace, D K; Ells, A; Carden, S; Butler, L; Clark, D; Elder, J; Wilson, C; Biswas, S; Shafiq, A; King, A; Brocklehurst, P; Fielder, A R

doi:10.1038/eye.2017.150

Download PDF

Clinical Study
Open access
Published: 28 July 2017

An international comparison of retinopathy of prematurity grading performance within the Benefits of Oxygen Saturation Targeting II trials

B W Fleck¹,
C Williams²,
E Juszczak³,
K Cocker⁴,
B J Stenson⁵,
B A Darlow⁶,
S Dai⁷,
G A Gole⁸,
G E Quinn⁹,
D K Wallace¹⁰,
A Ells¹¹,
S Carden¹²,
L Butler¹³,
D Clark¹⁴,
J Elder^15,16,
C Wilson¹⁷,
S Biswas¹⁸,
A Shafiq¹⁹,
A King³,
P Brocklehurst³,
A R Fielder²⁰ &
BOOST II Retinal Image Digital Analysis (RIDA) Group

Eye volume 32, pages 74–80 (2018)Cite this article

2114 Accesses
42 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Purpose

To investigate whether the observed international differences in retinopathy of prematurity (ROP) treatment rates within the Benefits of Oxygen Saturation Targeting (BOOST) II trials might have been caused by international variation in ROP disease grading.

Methods

Groups of BOOST II trial ophthalmologists in UK, Australia, and New Zealand (ANZ), and an international reference group (INT) used a web based system to grade a selection of RetCam images of ROP acquired during the BOOST II UK trial. Rates of decisions to treat, plus disease grading, ROP stage grading, ROP zone grading, inter-observer variation within groups and intra-observer variation within groups were measured.

Results

Forty-two eye examinations were graded. UK ophthalmologists diagnosed treat-requiring ROP more frequently than ANZ ophthalmologists, 13.9 (3.49) compared to 9.4 (4.46) eye examinations, P=0.038. UK ophthalmologists diagnosed plus disease more frequently than ANZ ophthalmologists, 14.1 (6.23) compared to 8.5 (3.24) eye examinations, P=0.021. ANZ ophthalmologists diagnosed stage 2 ROP more frequently than UK ophthalmologists, 20.2 (5.8) compared to 12.7 (7.1) eye examinations, P=0.026. There were no other significant differences in the grading of ROP stage or zone. Inter-observer variation was higher within the UK group than within the ANZ group. Intra-observer variation was low in both groups.

Conclusions

We have found evidence of international variation in the diagnosis of treatment-requiring ROP. Improved standardisation of the diagnosis of treatment-requiring ROP is required. Measures might include improved training in the grading of ROP, using an international approach, and further development of ROP image analysis software.

Telemedicine retinopathy of prematurity severity score (TeleROP-SS) versus modified activity score (mROP-ActS) retrospective comparison in SUNDROP cohort

Article Open access 14 September 2023

Christine L. Xu, Joel Adu-Brimpong, … Darius M. Moshfeghi

Retinopathy of prematurity in Rwanda: a prospective multi-centre study following introduction of screening and treatment services

Article 30 July 2019

Francis Mutangana, Charles Muhizi, … for the Rwanda Retinopathy of Prematurity Study Group

DIGIROP efficacy for detecting treatment-requiring retinopathy of prematurity in a Portuguese cohort

Article 12 March 2021

Ana C. Almeida, Luís Miguel Borrego, … Susana Teixeira

Introduction

Retinopathy of prematurity (ROP) is graded using the International Classification of Retinopathy of Prematurity (ICROP).¹ While standard images are provided in ICROP, examiners must use subjective judgement when describing ROP in an infant. Variation in the rates of severe ROP between clinical centres have been attributed in part to observer bias.² A number of studies have demonstrated inter-observer variation when grading ROP using retinal images.^{3, 4, 5, 6, 7, 8, 9, 10}

Five international, multicentre randomised controlled trials of oxygen saturation targeting in very premature infants have recently been reported. The trial protocols were prospectively aligned to facilitate meta-analysis, the NeOProM collaboration.¹¹ The trials were performed in UK,^{12, 13} Australia,^{12, 13} New Zealand,^{12, 14} Canada,¹⁵ and USA.^{16, 17}

The Benefits of Oxygen Saturation Targeting (BOOST) II trials performed in UK, Australia, and New Zealand reported outcomes at the time of hospital discharge in 2013.¹² While the participants (premature infants) were broadly similar across countries, a large difference in ROP treatment rates was noted.¹² 153/798 (19.2%) of enroled infants were treated in the UK, compared to 75/975 (7.7%) in Australia and 23/306 (7.5%) in New Zealand.¹² All ophthalmologists in the BOOST II trials were instructed to base their decision to treat on the ETROP¹⁸ definition of Type 1 ROP, however subjective interpretation of ROP disease signs may have varied between countries.

Within the BOOST II UK trial, ophthalmologists in 12 of the 34 trial centres used RetCam imaging (Natus Medical, Pleasanton, CA, USA) for ROP screening.¹⁹ Imaging was not performed in the other UK centres. These images gave us the opportunity to compare ROP grading decisions made by ophthalmologists in the UK, Australia, and New Zealand who participated in the BOOST II trials. An international reference group was used as the gold standard. We aimed to determine whether international variation in the interpretation of images and subsequent treatment decisions was present, evidenced in our opportunistic cohort.

Materials and methods

Ophthalmologists participating in the BOOST II trials

Within the BOOST II trials, local ophthalmology services for routine ROP screening and treatment were used. In the UK, BOOST II UK trial ophthalmologists were asked to attend a training session on ROP classification, and were provided with printed training materials. In Australia and New Zealand, all BOOST II trial ophthalmologists were asked to self-certify prior to the trials, using a training and assessment website http://www.boostnz.info/ROP/.

Readers

Nine readers from Australia, two from New Zealand, and seven from UK who participated in the BOOST II trials were used (Supplementary Information). The groups from Australia and New Zealand were combined (ANZ) because the number of readers from New Zealand was low, and because ophthalmologists in Australia and New Zealand have a close working relationship for training and clinical practice. An international reference group of six experienced ophthalmologists with an interest in ROP who had not participated in the trials was used as the ‘gold standard’ (INT) (Supplementary Information). The international reference readers were from UK (2), USA (2), Canada (1), and Australia (1). The median (range) number of year’s experience of the readers in performing clinical ROP screening examinations was 25 (14–26) for the UK group, 15 (3.5–40) for ANZ, and 21 (10–38) for the international reference group.

Reading experiments

Each reader logged on to the study website and was given detailed instructions on how to classify the study images. Readers were referred to ICROP,¹ but standard comparison images were not given. To protect patient anonymity, no clinical data were provided. For each eye examination, drop down menus were used to grade ROP, and a decision to ‘treat’ or ‘not treat’. The order of eye examinations was randomised each time a reader logged on. On completion of grading, data were downloaded to an Excel spreadsheet for analysis.

Eye examination images

Images were selected by the lead study ophthalmologist (BWF) for high image quality and readability. An eye ‘examination’ was a set of one to five images obtained when examining one eye of one infant on one occasion. All selected eye examinations were performed prior to treatment. Forty-two eye examinations obtained from six centres were used (Supplementary Information). In some infants more than one eye examination was used, to ensure a range of ROP disease severity was available for review. When more than one examination was used from the same infant, each examination was performed on a different date. In 31 infants one eye examination was used, in 3 infants two examinations were used, and in one infant five examinations were used. Six of the 42 eye examinations, illustrating a range of ROP severity, were duplicated to allow measurement of intra-observer variation. Each reader assessed 48 eye examinations. Seventeen of the 42 (40.5%) image sets were obtained at the time when a decision to treat was made, or immediately prior to treatment. Thirteen of the 42 (31.0%) image sets were from infants who did not require treatment at the time of imaging, but who were subsequently treated. Twelve of the 42 (28.6%) image sets were from infants who were not treated for ROP at any time.

Infants

RetCam images from 35 infants were used, linked to clinical data from the BOOST II UK trial. Thirty-four infants were white, one was British Pakistani. Seventeen (48.6%) infants were female and 18 were male. The mean gestational age was 25⁺² weeks, range 22⁺⁶–27⁺⁶ weeks. The mean (SD) birth weight was 785 (170) g, range 366–1115 g. Twenty-three of the 35 (65.7%) infants were treated for ROP at some time in their clinical course.

Statistical analysis

Descriptive statistics were used to summarise data according to type and distribution using counts/percentages for categorical data, means (standard deviations [SD]) for normally distributed continuous variables, and medians (ranges) for other continuous variables. Data obtained from duplicate eye examinations were only used for the calculation of intra-observer variation and were excluded from all other analyses. Inter-observer variation (Fleiss kappa) and intra-observer (Cohen kappa) values were calculated using the online tool www.statstodo.com. Conventionally, a kappa of 0.2 or less is considered poor agreement, 0.21–0.4 fair, 0.41–0.6 moderate, 0.61–0.8 strong and more than 0.8 near complete agreement.²⁰ These terms were used when reporting our results.

Results

Treatment decisions

Of the 42 eye examinations reviewed the mean (SD) number of examinations per reader judged to require treatment was 13.9 (3.49) for UK readers, 9.4 (4.46) for ANZ readers, and 12.8 (5.49) for the international readers. The difference between UK and ANZ readers was significant (t-test P=0.038, mean difference=4.49, 95% CI=0.27–8.72).

Plus disease

Of the 42 eye examinations reviewed the mean (SD) number of examinations per reader judged as ‘plus’ disease was 14.1 (6.23) for UK readers, 8.5 (3.24) for ANZ readers, and 13.2 (6.31) for the international readers (Table 1). The difference between UK and ANZ readers was significant (t-test P=0.021, mean difference=5.69, 95% CI=0.98–10.40).

Table 1 The mean (SD) number of eye examinations per reader classified as plus disease by reader group (N=42)

Full size table

Stage of retinopathy of prematurity

For each reader, the number of examinations read for each ROP stage was calculated. The mean (SD) for each reader group is given in Table 2. The mean number of eye examinations per reader classified as stage 2 was higher in the ANZ group than in the UK group (t-test, P=0.026, mean difference=7.47, 95% CI=1.00–13.94). For stage 3 there were no significant differences between the groups.

Table 2 The mean (SD) number of eye examinations per reader classified as each stage of ROP by reader group (N=42)

Full size table

Zone

For each reader, the number of examinations read for each ROP zone was calculated. The mean (SD) for each reader group is given in Table 3. The proportion of eye examinations read as each zone was not significantly different between any pair of groups.

Table 3 The mean (SD) number of eye examinations per reader assessed for each ROP zone by reader group (N=42)

Full size table

Inter-observer variation

Inter-observer variation Fleiss kappa measures for each classification variable are given in Table 4. Inter-observer agreement for the whole group of readers was ‘fair’ or ‘moderate’ for all measures. Agreement was highest within the ANZ group for all measures, with ‘moderate’ agreement for treatment decisions and for plus disease categories. Agreement was ‘fair’ for treatment decisions within the UK group. Agreement was poor for most measures within the INT group.

Table 4 Inter-observer variation kappa statistics

Full size table

Intra-observer variation

We measured intra-observer variation by including six duplicate examinations within the 48 eye examinations shown to each reader. The results are shown in Table 5. All kappa values were within the ‘strong’ or ‘near perfect’ agreement categories.

Table 5 Intra-observer variation weighted Cohen kappa statistics

Full size table

Discussion

We have compared the ROP grading decisions of BOOST II trial ophthalmologists in UK with those in Australia and New Zealand. UK ophthalmologists demonstrated a lower threshold to treat than Australian and New Zealand ophthalmologists. UK ophthalmologists graded more images as plus disease, and more images as treatment-requiring. There were no significant differences in grading stage 3 disease or ROP zone. The UK ophthalmologists had more inter-observer variation than the Australian and New Zealand ophthalmologists. Intra-observer consistency appeared to be good among all ophthalmologists. The international reference ophthalmologists graded in a similar way to the UK ophthalmologists.

There were a number of limitations in our study. While the data were obtained within the context of a clinical trial, RetCam images and ROP clinical data were obtained from routine clinical screening examinations. RetCam imaging was used in a limited number of centres during the BOOST II UK trial, and in some centres was only used immediately prior to treatment. The quality of images obtained was variable. The completeness of accompanying clinical data from the treating ophthalmologists was variable. The set of RetCam images used for the study was selected, not random. The groups of readers from each country were biased towards experienced, research-active ophthalmologists. The international reference group was limited in number, and may not have been truly representative of broad-based international expertise. The sample size of both RetCam images and of readers was small and therefore of insufficient power to detect all but the largest differences.

The context of this study was a group of five oxygen trials in premature infants—the NeOProM collaboration.¹¹ Significant differences in ROP treatment rates between countries were evident. Within the BOOST II trials performed in UK, Australia, and New Zealand, 153/798 (19.2%) of enroled infants were treated in the UK, compared to 75/975 (7.7%) in Australia and 23/306 (7.5%) in New Zealand.¹² Thus, in the UK 153 infants were treated, and 645 were not treated. In Australia and New Zealand combined (ANZ), 98 were treated and 1183 were not treated. The difference in treatment rates was significant (Chi squared test P<0.0001, odds ratio=2.51, 95% CI=1.98–3.18). In the Canadian COT trial, 130/1003 (13%) of trial survivors at 36 weeks postmenstrual age had undergone ROP treatment or had Stage 4 or 5 ROP.¹⁵ In the American SUPPORT trial, 120/913 (13.1%) of trial survivors at 36 weeks postmenstrual age had undergone ROP treatment or had been diagnosed as having Type 1 ETROP.¹⁸ If the Canadian and USA trials are combined,^{15, 16} 250 of 1916 (13.0%) were treated. The difference in treatment rates between the North American trials and the ANZ trial was significant (Chi squared test P<0.0001, odds ratio=1.71, 95% CI=1.37–2.13), and the difference in treatment rates between the UK trial and the North American trials was also significant (Chi squared test P<0.0001, odds ratio=1.47, 95% CI=1.22–1.77). These differences are unlikely to be due to chance.

The baseline clinical characteristics of infants enroled in the BOOST II UK, BOOST II Australia and the BOOST New Zealand trials were very similar.¹² In addition, the measured oxygen treatments given to the infants in the trials were very similar, as were morbidity measures (other than treatment for ROP), and mortality.¹² The cohorts enroled in the Canadian and USA trials^{15, 16} were also similar to those in the BOOST II trials. It is therefore unlikely that the difference in treatment rates between the individual studies was due to differences in the patient populations.

Different treatment rates could potentially result in different visual outcomes. The 2 year outcome data from the UK and Australian trials¹³ and from the New Zealand trial¹⁴ gave visual outcome data. In the UK 23 of 718 infants (3.2%), in Australia 5 of 911 infants (0.55%), and in New Zealand 1 of 340 infants (0.29%) had severe visual impairment. Additional information was available for the subgroup of UK infants treated with the revised oxygen algorithm.¹³ Eighteen of 551 (3.3%) had severe visual impairment. Four of these had retinal detachment, 12 had cerebral visual impairment and in two the cause was not recorded. Thus 4 of 551 (0.73%) had severe visual impairment due to retinal detachment. The lower treatment rate in ANZ did not result in a higher rate of severe visual impairment.

Differences in ROP treatment rates have been documented between centres,^{2, 21} between countries,^{22, 23} and over time.^{9, 24, 25, 26} Some variation may be due to differences in the clinical characteristics of the populations under study, and to neonatal care practices. This is likely to be the case when comparing countries with differing health service characteristics and over periods of time.^{9, 22, 24, 25} The clinical characteristics of the infants in the BOOST II trials were very similar.^{12, 13} In this study, we have explored the possible contribution to the observed different rates of ROP treatment of international variation in disease grading.^{2, 9} Our results suggest such variation was present.

While inter-observer agreement for plus disease grading was ‘moderate’ within the ANZ group, it was ‘poor’ for the UK group. Previous studies have also found limited agreement between experts in the diagnosis of treatment-requiring ROP,^{10, 27} and of plus disease.¹⁰ Gschliesser found moderate inter-observer agreement (kappa 0.41) for the necessity for treatment, and ‘fair’ agreement (kappa 0.32) for plus disease.¹⁰ Chiang found ‘fair’ and ‘moderate’ weighted kappa agreement for the diagnosis of plus disease when each of a group of experts was compared to all the other experts in the group.⁵

While standardisation of ROP diagnostic grading may be approached by improved training of screening ophthalmologists,^{28, 29} an international approach is needed. Tools such as online training and assessment websites may be used. In Australia and New Zealand all BOOST II study ophthalmologists were asked to self-certify prior to the trials, using http://www.boostnz.info/ROP/.

The key component in ROP treatment decisions is the detection of plus disease, as defined by ICROP.¹ Our study, and a number of other studies,⁵ show the limitations of clinical judgement based on reference photographs. As has occurred in diabetic retinopathy screening, a move towards the use of retinal images rather than clinical examinations is a prerequisite for the standardisation of diagnostic decisions.^{30, 31} Computerised image analysis techniques, trained by clinical experts, are needed to improve the objectivity of treatment decisions.^{30, 31, 32, 33, 34, 35, 36, 37, 38, 39}

The planning of international ROP treatment trials requires improved training and standardisation of observers. In the Cryotherapy for ROP study, a second examiner was required to examine each infant within 3 days of the primary examiner, to confirm the presence of treatment-requiring ‘threshold’ disease.⁴⁰ In 12% of cases, the two examiners disagreed on the presence of plus disease.^{27, 40} Ideally, retinal images should be used in trials, with central reading centres.⁴¹ Both clinical trials and clinical practice will benefit from the use of image analysis software that quantifies plus disease.

We found international variation in the diagnosis of treatment-requiring ROP. While excessively low rates of ROP treatments risk blindness, excessively high rates of ROP treatments should also be avoided. Treatment is invasive, and carries risks of ocular and systemic morbidity. Improved standardisation of treatment decisions is an important goal. Approaches might include the use of internationally standardised online training tools, and the development of image analysis software to quantify ROP plus disease.

References

International Committee for the Classification of Retinopathy of Prematurity. The international classification of retinopathy of prematurity revisited. Arch Ophthalmol 2005; 123 (7): 991–999.
Article Google Scholar
Darlow BA, Elder MJ, Horwood LJ, Donoghue DA, Henderson-Smart DJ Australian and New Zealand Neonatal Network. Does observer bias contribute to variations in the rate of retinopathy of prematurity between centres? Clin Exp Ophthalmol 2008; 36 (1): 43–46.
Article Google Scholar
Chiang MF, Keenan JD, Starren J, Du YE, Schiff WM, Barile GR et al. Accuracy and reliability of remote retinopathy of prematurity diagnosis. Arch Ophthalmol 2006; 124 (3): 322–327.
Article Google Scholar
Chiang MF, Starren J, Du YE, Keenan JD, Schiff WM, Barile GR et al. Remote image based retinopathy of prematurity diagnosis: a receiver operating characteristic analysis of accuracy. Br J Ophthalmol 2006; 90 (10): 1292–1296.
Article CAS Google Scholar
Chiang MF, Jiang L, Gelman R, Du YE, Flynn JT . Interexpert agreement of plus disease diagnosis in retinopathy of prematurity. Arch Ophthalmol 2007; 125 (7): 875–880.
Article Google Scholar
Scott KE, Kim DY, Wang L, Kane SA, Coki O, Starren J et al. Telemedical diagnosis of retinopathy of prematurity intraphysician agreement between ophthalmoscopic examination and image-based interpretation. Ophthalmology 2008; 115 (7): 1222–1228 e3.
Article Google Scholar
Wallace DK, Quinn GE, Freedman SF, Chiang MF . Agreement among pediatric ophthalmologists in diagnosing plus and pre-plus disease in retinopathy of prematurity. J AAPOS 2008; 12 (4): 352–356.
Article Google Scholar
Williams SL, Wang L, Kane SA, Lee TC, Weissgold DJ, Berrocal AM et al. Telemedical diagnosis of retinopathy of prematurity: accuracy of expert versus non-expert graders. Br J Ophthalmol 2010; 94 (3): 351–356.
Article Google Scholar
Slidsborg C, Forman JL, Fielder AR, Crafoord S, Baggesen K, Bangsgaard R et al. Experts do not agree when to treat retinopathy of prematurity based on plus disease. Br J Ophthalmol 2012; 96 (4): 549–553.
Article Google Scholar
Gschliesser A, Stifter E, Neumayer T, Moser E, Papp A, Pircher N et al. Inter-expert and intra-expert agreement on the diagnosis and treatment of retinopathy of prematurity. Am J Ophthalmol 2015; 160 (3): 553–560 e3.
Article Google Scholar
Askie LM, Brocklehurst P, Darlow BA, Finer N, Schmidt B, Tarnow-Mordi W et al. NeOProM: Neonatal Oxygenation Prospective Meta-analysis Collaboration study protocol. BMC Pediatr 2011; 11: 6.
Article Google Scholar
Group BIUKC, Group BIAC, Group BINZC Stenson BJ, Tarnow-Mordi WO, Darlow BA, Simes J, Juszczak E, Askie L et alGroup BIUKC, Group BIAC, Group BINZC. Oxygen saturation and outcomes in preterm infants. N Engl J Med 2013; 368 (22): 2094–2104.
Article Google Scholar
Australia B-I, United Kingdom Collaborative G, Tarnow-Mordi W, Stenson B, Kirby A, Juszczak E et al. Outcomes of two trials of oxygen-saturation targets in preterm infants. N Engl J Med 2016; 374 (8): 749–760.
Article Google Scholar
Darlow BA, Marschner SL, Donoghoe M, Battin MR, Broadbent RS, Elder MJ et al. Randomized controlled trial of oxygen saturation targets in very preterm infants: two year outcomes. J Pediatr 2014; 165 (1): 30–35 e2.
Article Google Scholar
Schmidt B, Whyte RK, Asztalos EV, Moddemann D, Poets C, Rabi Y et al. Effects of targeting higher vs lower arterial oxygen saturations on death or disability in extremely preterm infants: a randomized clinical trial. JAMA 2013; 309 (20): 2111–2120.
Article CAS Google Scholar
Vaucher YE, Peralta-Carcelen M, Finer NN, Carlo WA, Gantz MG, Walsh MC et al. Neurodevelopmental outcomes in the early CPAP and pulse oximetry trial. N Engl J Med 2012; 367 (26): 2495–2504.
Article CAS Google Scholar
Network SSGotEKSNNR Carlo WA, Finer NN, Walsh MC, Rich W, Gantz MG, Laptook AR et alNetwork SSGotEKSNNR. Target ranges of oxygen saturation in extremely preterm infants. N Engl J Med 2010; 362 (21): 1959–1969.
Article Google Scholar
Early Treatment For Retinopathy Of Prematurity Cooperative G. Revised indications for the treatment of retinopathy of prematurity: results of the early treatment for retinopathy of prematurity randomized trial. Arch Ophthalmol 2003; 121 (12): 1684–1694.
Article Google Scholar
Moreton RB, Fleck BW, Fielder AR, Williams CA, Butler L, Wilson C et al. The effect of oxygen saturation targeting on retinal blood vessel growth using retinal image data from the BOOST-II UK Trial. Eye 2016; 30 (4): 577–581.
Article CAS Google Scholar
Landis JR, Koch GG . The measurement of observer agreement for categorical data. Biometrics 1977; 33: 159–174.
Article CAS Google Scholar
Thomas K, Shah PS, Canning R, Harrison A, Lee SK, Dow KE . Retinopathy of prematurity: risk factors and variability in Canadian neonatal intensive care units. J Neonatal Perinatal Med 2015; 8 (3): 207–214.
Article CAS Google Scholar
Gilbert C, Fielder A, Gordillo L, Quinn G, Semiglia R, Visintin P et al. Characteristics of infants with severe retinopathy of prematurity in countries with low, moderate, and high levels of development: implications for screening programs. Pediatrics 2005; 115 (5): e518–e525.
Article Google Scholar
Blencowe H, Lawn JE, Vazquez T, Fielder A, Gilbert C . Preterm-associated visual impairment and estimates of retinopathy of prematurity at regional and global levels for 2010. Pediatr Res 2013; 74 (Suppl 1): 35–49.
Article Google Scholar
Tan Z, Chong C, Darlow B, Dai S . Visual impairment due to retinopathy of prematurity (ROP) in New Zealand: a 22-year review. Br J Ophthalmol 2015; 99 (6): 801–806.
Article Google Scholar
Stoll BJ, Hansen NI, Bell EF, Walsh MC, Carlo WA, Shankaran S et al. Trends in care practices, morbidity, and mortality of extremely preterm neonates, 1993–2012. JAMA 2015; 314 (10): 1039–1051.
Article CAS Google Scholar
Dhaliwal C, Fleck B, Wright E, Graham C, McIntosh N . Incidence of retinopathy of prematurity in Lothian, Scotland, from 1990 to 2004. Arch Dis Child Fetal Neonatal ed 2008; 93 (6): F422–F426.
Article CAS Google Scholar
Phelps DL . It’s plus disease, isn’t it? Arch Ophthalmol 2007; 125 (7): 963–964.
Article Google Scholar
Paul Chan RV, Williams SL, Yonekawa Y, Weissgold DJ, Lee TC, Chiang MF . Accuracy of retinopathy of prematurity diagnosis by retinal fellows. Retina 2010; 30 (6): 958–965.
Article CAS Google Scholar
Myung JS, Paul Chan RV, Espiritu MJ, Williams SL, Granet DB, Lee TC et al. Accuracy of retinopathy of prematurity image-based diagnosis by pediatric ophthalmology fellows: implications for training. J AAPOS 2011; 15 (6): 573–578.
Article Google Scholar
Walton OBt, Garoon RB, Weng CY, Gross J, Young AK, Camero KA et al. Evaluation of automated teleretinal screening program for diabetic retinopathy. JAMA Ophthalmol 2016; 134 (2): 204–209.
Article Google Scholar
Abramoff MD, Folk JC, Han DP, Walker JD, Williams DF, Russell SR et al. Automated analysis of retinal images for detection of referable diabetic retinopathy. JAMA Ophthalmol 2013; 131 (3): 351–357.
Article Google Scholar
Koreen S, Gelman R, Martinez-Perez ME, Jiang L, Berrocal AM, Hess DJ et al. Evaluation of a computer-based system for plus disease diagnosis in retinopathy of prematurity. Ophthalmology 2007; 114 (12): e59–e67.
Article Google Scholar
Hewing NJ, Kaufman DR, Chan RV, Chiang MF . Plus disease in retinopathy of prematurity: qualitative analysis of diagnostic process by experts. JAMA Ophthalmol 2013; 131 (8): 1026–1032.
Article Google Scholar
Ryan MC, Ostmo S, Jonas K, Berrocal A, Drenser K, Horowitz J et al. Development and evaluation of reference standards for image-based telemedicine diagnosis and clinical research studies in ophthalmology. AMIA Annu Symp Proc 2014; 2014: 1902–1910.
PubMed PubMed Central Google Scholar
Ataer-Cansizoglu E, Kalpathy-Cramer J, You S, Keck K, Erdogmus D, Chiang MF . Analysis of underlying causes of inter-expert disagreement in retinopathy of prematurity diagnosis. Application of machine learning principles. Methods Inf Med 2015; 54 (1): 93–102.
Article CAS Google Scholar
Kemper AR, Prosser LA, Wade KC, Repka MX, Ying GS, Baumritter A et al. A comparison of strategies for retinopathy of prematurity detection. Pediatrics 2016; 137 (1): 1–10.
Article Google Scholar
Campbell JP, Ataer-Cansizoglu E, Bolon-Canedo V, Bozkurt A, Erdogmus D, Kalpathy-Cramer J et al. Expert diagnosis of plus disease in retinopathy of prematurity from computer-based image analysis. JAMA Ophthalmol 2016; 134 (6): 651–657.
Article Google Scholar
Abbey AM, Besirli CG, Musch DC, Andrews CA, Capone Jr A, Drenser KA et al. Evaluation of screening for retinopathy of prematurity by ROPtool or a lay reader. Ophthalmology 2016; 123 (2): 385–390.
Article Google Scholar
Wilson CM, Cocker KD, Moseley MJ, Paterson C, Clay ST, Schulenburg WE et al. Computerized analysis of retinal vessel width and tortuosity in premature infants. Investig Ophthalmol Vis Sci 2008; 49 (8): 3577–3585.
Article Google Scholar
Reynolds JD, Dobson V, Quinn GE, Fielder AR, Palmer EA, Saunders RA et al. Evidence-based screening criteria for retinopathy of prematurity: natural history data from the CRYO-ROP and LIGHT-ROP studies. Arch Ophthalmol 2002; 120 (11): 1470–1476.
Article Google Scholar
Daniel E, Quinn GE, Hildebrand PL, Ells A, Hubbard GB 3rd, Capone A Jr et al. Validated system for centralized grading of retinopathy of prematurity: telemedicine approaches to evaluating acute-phase retinopathy of prematurity (e-ROP) study. JAMA Ophthalmol 2015; 133 (6): 675–682.
Article Google Scholar

Download references

Acknowledgements

The BOOST II trials were funded by the United Kingdom Medical Research Council, National Health and Medical Research Council of Australia, Health Research Council of New Zealand. The current study was funded by the Royal College of Surgeons Edinburgh, Royal Blind, and Edinburgh and Lothian Health Foundation.

Author information

David G Cottrell, Rasha Altaie, Rohan W Essex, Geoffrey C Lam, Michael Forrest, Shaheen Shah, James Smith, Jeremy Smith, Deepa Taranath and Michael O'Keefe: Members of the BOOST II Retinal Image Digital Analysis (RIDA) Group are listed above References.

Authors and Affiliations

Department of Child Life and Health, University of Edinburgh, Edinburgh, UK
B W Fleck
Department of Paediatric Ophthalmology, School of Social and Community Medicine, University of Bristol, Bristol, UK
C Williams
Clinical Trials Unit, National Perinatal Epidemiology Unit, University of Oxford, Oxford, UK
E Juszczak, A King & P Brocklehurst
Department of Ophthalmology, Princess Alexandra Eye Pavilion, Edinburgh, UK
K Cocker
Neonatal Unit, Royal Infirmary of Edinburgh, Edinburgh, UK
B J Stenson
Department of Paediatrics, Cure Kids Professor of Paediatric Research, University of Otago, Christchurch, New Zealand
B A Darlow
Department of Paediatric Ophthalmology, Starship Children’s Hospital, University of Auckland, Auckland, New Zealand
S Dai
Department of Ophthalmology, University of Queensland, Brisbane, Australia
G A Gole
Department of Pediatric Ophthalmology, Children’s Hospital of Philadelphia, Philadelphia, USA
G E Quinn
Department of Pediatric Ophthalmology, Duke University, Durham, USA
D K Wallace
Department of Ophthalmology, University of Calgary, Alberta, Canada
A Ells
Department of Ophthalmology, Royal Children’s Hospital, Victoria, Australia
S Carden
Department of Paediatric Ophthalmology, Birmingham and Midlands Eye Centre, Birmingham, UK
L Butler
Department of Ophthalmology, Aintree University Hospital, Liverpool, UK
D Clark
Department of Ophthalmology, Royal Children’s Hospital, Melbourne, Australia
J Elder
Department of Paediatrics, University of Melbourne, Melbourne, Australia
J Elder
Department of Ophthalmology, Chelsea and Westminster Hospital, London, UK
C Wilson
Department of Paediatric Ophthalmology, Manchester Royal Eye Hospital, Central Manchester Foundation Trust and Manchester Academic Health Sciences Centre, Manchester, UK
S Biswas
Department of Ophthalmology, Newcastle Eye Centre, Newcastle, UK
A Shafiq
Department of Ophthalmology, Division of Optometry and Visual Sciences, City University, London, UK
A R Fielder

Authors

B W Fleck
View author publications
You can also search for this author in PubMed Google Scholar
C Williams
View author publications
You can also search for this author in PubMed Google Scholar
E Juszczak
View author publications
You can also search for this author in PubMed Google Scholar
K Cocker
View author publications
You can also search for this author in PubMed Google Scholar
B J Stenson
View author publications
You can also search for this author in PubMed Google Scholar
B A Darlow
View author publications
You can also search for this author in PubMed Google Scholar
S Dai
View author publications
You can also search for this author in PubMed Google Scholar
G A Gole
View author publications
You can also search for this author in PubMed Google Scholar
G E Quinn
View author publications
You can also search for this author in PubMed Google Scholar
D K Wallace
View author publications
You can also search for this author in PubMed Google Scholar
A Ells
View author publications
You can also search for this author in PubMed Google Scholar
S Carden
View author publications
You can also search for this author in PubMed Google Scholar
L Butler
View author publications
You can also search for this author in PubMed Google Scholar
D Clark
View author publications
You can also search for this author in PubMed Google Scholar
J Elder
View author publications
You can also search for this author in PubMed Google Scholar
C Wilson
View author publications
You can also search for this author in PubMed Google Scholar
S Biswas
View author publications
You can also search for this author in PubMed Google Scholar
A Shafiq
View author publications
You can also search for this author in PubMed Google Scholar
A King
View author publications
You can also search for this author in PubMed Google Scholar
P Brocklehurst
View author publications
You can also search for this author in PubMed Google Scholar
A R Fielder
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

BOOST II Retinal Image Digital Analysis (RIDA) Group

David G Cottrell
, Rasha Altaie
, Rohan W Essex
, Geoffrey C Lam
, Michael Forrest
, Shaheen Shah
, James Smith
, Jeremy Smith
, Deepa Taranath
& Michael O'Keefe

Corresponding author

Correspondence to B W Fleck.

Ethics declarations

Competing interests

Dr Anna Ells was a member of the Scientific Advisory Board of Clarity Medical Systems, the manufacturer of RetCam technology, until 2011 and Professor Alistair R Fielder has received remuneration for presentations on behalf of Clarity Medical Systems.

Additional information

David G Cottrell MBBS FRCOphth. Department of Ophthalmology, Royal Victoria Infirmary, Newcastle, UK Rasha Altaie MBChB FRANZCO. Department of Ophthalmology, University of Auckland, New Zealand Rohan W Essex MBBS FRANZCO. Department of Ophthalmology, Australian National University, Canberra, Australia Geoffrey C Lam MBBS FRANZCO. Department of Ophthalmology, Princess Margaret Hospital for Children, Subiaco, WA, Australia Michael Forrest MBBS FRANZCO. Department of Ophthalmology, University of Queensland, Brisbane, Australia Shaheen Shah MD FRANZCO. Department of Ophthalmology, Lady Cilento Children’s Hospital, Brisbane, Australia James Smith MBBS FRANZCO. Department of Ophthalmology, Children’s Hospital at Westmead, NSW, Australia Jeremy Smith MBBS FRANZCO. Department of Ophthalmology, Children’s Hospital at Westmead, NSW, Australia Deepa Taranath MBBS FRANZCO. Department of Ophthalmology, Women’s and Children’s Hospital, Adelaide, Australia Supplied images: Michael O’Keefe MD, FRCOphth. Department of Ophthalmology, Temple Street Children’s University Hospital, Dublin, Ireland

Supplementary Information accompanies this paper on Eye website

Supplementary information

Supplementary Information (DOCX 12 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Fleck, B., Williams, C., Juszczak, E. et al. An international comparison of retinopathy of prematurity grading performance within the Benefits of Oxygen Saturation Targeting II trials. Eye 32, 74–80 (2018). https://doi.org/10.1038/eye.2017.150

Download citation

Received: 24 February 2017
Accepted: 21 June 2017
Published: 28 July 2017
Issue Date: January 2018
DOI: https://doi.org/10.1038/eye.2017.150

This article is cited by

Estimate of incidence of ROP requiring treatment in extreme preterms and impact on service—7 year review in tertiary unit
- Shokufeh Tavassoli
- Richard Wach
- Cathy Williams
Eye (2019)
Comment on: ‘An international comparison of retinopathy of prematurity grading performance within the Benefits of Oxygen Saturation Targeting II trials’
- F. R. Burgess
- R. Davies
- C. L. L. Schmoll
Eye (2018)

Subjects

Abstract

Purpose

Methods

Results

Conclusions

Similar content being viewed by others

Telemedicine retinopathy of prematurity severity score (TeleROP-SS) versus modified activity score (mROP-ActS) retrospective comparison in SUNDROP cohort

Retinopathy of prematurity in Rwanda: a prospective multi-centre study following introduction of screening and treatment services

DIGIROP efficacy for detecting treatment-requiring retinopathy of prematurity in a Portuguese cohort

Introduction

Materials and methods

Ophthalmologists participating in the BOOST II trials

Readers

Reading experiments

Eye examination images

Infants

Statistical analysis

Results

Treatment decisions

Plus disease

Stage of retinopathy of prematurity

Zone

Inter-observer variation

Intra-observer variation

Discussion

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

BOOST II Retinal Image Digital Analysis (RIDA) Group

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information (DOCX 12 kb)

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Estimate of incidence of ROP requiring treatment in extreme preterms and impact on service—7 year review in tertiary unit

Comment on: ‘An international comparison of retinopathy of prematurity grading performance within the Benefits of Oxygen Saturation Targeting II trials’

Search

Quick links