Introduction

Allogeneic hematopoietic stem cell transplantation (HSCT) is currently the treatment of choice for a variety of malignant and non-malignant hematological disorders. Graft-versus-host disease (GVHD) represents one of the major complications of HSCT, involving different tissues including the eye. Dry eye (DE) disease is the most frequent manifestation of ocular GVHD, and is reported to occur, to a various degree, in 40–76% of patients [1, 2]. Clinical features consist of conjunctival fibrosis, punctate and/or filamentous keratopathy, chronic blepharitis, atrophy and irregularity of the eyelid margin, which may eventually induce keratinization of the tarsal conjunctiva and symblepharon [3]. Symptoms include irritation, burning, foreign body sensation, pain, photophobia and blurred vision, with deeply impaired quality of life and daily activities [4].

Recently, the International TFOS Dry Eye Workshop (DEWS) II proposed a new comprehensive classification of overall DE disease, incorporating ocular discomfort symptoms, tear instability and osmolarity, and ocular surface staining [5].

Although there are no subjective symptoms or clinical signs pathognomonic of ocular GVHD, the National Institutes of Health (NIH) introduced, and recently updated, specific criteria for its diagnosis, identifying the new onset of dry, gritty, or painful eyes, cicatricial conjunctivitis, keratoconjunctivitis sicca and confluent areas of punctate keratopathy as distinctive manifestations of chronic ocular GVHD [6, 7]. Later in 2013, the International Consensus Criteria on ocular GVHD (ICCGVHD) proposed a more detailed and specialized diagnosing and scoring schema, with the aim of replacing the NIH criteria [8].

However, the evidence of an ocular surface impairment in patients with hematological disorders already before HSCT is recently growing. In fact, the impairment of clinical tests along with morphological changes of meibomian glands with reduced gland areas detected by infrared meibography have been demonstrated in a large percentage of patients already prior to HSCT [9,10,11,12]. This evidence opens a new scenario, with the need for distinguishing between “conventional pre-existing dry eye” and “dry eye due to active ocular chronic GVHD.” As a consequence, the ICCGVHD, the German-Austrian-Swiss Consensus Conference and the 2014 updated NIH Consensus Conference warranted a prospective study evaluating a comprehensive baseline pre-HSCT ophthalmological evaluation [7, 8, 13]. This new protocol could be relevant not only to reach early diagnosis and prompt treatment, but also to identify the real proportion of patients diagnosed as ocular GVHD after HSCT.

Therefore, the aim of the present study was to compare the proportion and the agreement rate in the diagnosis of chronic ocular GVHD among three different criteria, applied with and without pre-HSCT ophthalmological examination.

Materials and methods

Study

Patients who had undergone HSCT for various hematological disorders over the period March 2007–September 2015 at the Hematological Department of the S.Orsola-Malpighi University Hospital were screened for inclusion in this study. Ophthalmological and hematological data prospectively collected over the period March 2007 to March 2016 were reviewed. The study followed the tenets of the Declaration of Helsinki, and was approved by the local ethics committee of S.Orsola-Malpighi University Hospital. Each subject signed an informed consent before any study procedure. Inclusion criteria were: age above 18 years and the completeness of medical charts. Exclusion criteria were: history of previous ocular diseases, and the use of eye drops for the treatment of glaucoma. Conditioning regimen and HSCT procedure were performed as previously described [11]. Two scheduled ophthalmological visits had been performed, respectively, 7–9 days before HSCT and conditioning regimen (baseline, V0), and in a time window ranging from 3 to 6 months after HSCT (V1).

Protocols

Traditionally, the diagnosis of ocular GVHD has been conducted evaluating patients only after HSCT, thus reaching the diagnosis in the presence of post-HSCT ocular surface impairment, regardless the baseline pre-HSCT ocular conditions. For methodological purposes, in the present study, the protocol based only on post-HSCT ophthalmological examination was called “Protocol A”. Recently, our and other groups proposed a new protocol including both pre- and post-HSCT ophthalmological assessment to proper diagnose and treat pre-existing DE before HSCT, as well as ocular GVHD after. In the present study, this protocol was called “Protocol B”.

Furthermore, we compared the proportion of diagnosis of ocular GVHD and the agreement rate among three different classification criteria (NIH, ICCGVHD and TFOS DEWS II) by applying both Protocol A and Protocol B. This latter protocol, taking into account the baseline pre-HSCT ophthalmological evaluation, allows the identification of incident cases: those patients not suffering from DE before HSCT and developing it only after HSCT. The clarification of this issue, strongly suggested by NIH and ICCGVHD criteria, is crucial for a proper diagnosis of “new onset” of DE after HSCT, and thus for properly reaching the diagnosis of ocular GVHD. In this study, we identified as GVHD patients with new onset DE not only incident cases, but also those patients suffering from DE already before HSCT and worsening after (see below).

GVHD diagnosis

Chronic ocular GVHD diagnosis was reached according to three different classification criteria. Two of them, namely NIH [7] and ICCGVHD criteria [8], are already in use in this field. The third one is represented by our proposal to extend TFOS DEWS II criteria [5], currently used for conventional DE disease, to estimate also the diagnosis of ocular GVHD in hematological patients undergoing HSCT.

1. The NIH 2014 criteria identified as distinctive signs new ocular sicca with Schirmer’s test values ≤ 5 mm/5 min (preferably with confirmation of normal values at an established baseline), or a new onset of keratoconjunctivitis sicca by slit lamp examination with mean Schirmer’s test values between 6 and 10 mm/5 min (preferably with confirmation of normal values at an established baseline). These ocular signs were defined as distinctive, and an additional distinctive feature of at least another organ is considered necessary to reach the diagnosis for chronic ocular GVHD. The severity was assessed by a 4-point grade scale based on symptoms, requirement of lubricant eye drops and activities of daily living impairment (Table 1) [7]. The increase of at least 1 point of the severity scale after HSCT was considered as a worsening.

Table 1 Grading scale according to National Institute of Health (NIH) criteria [7]

2. The ICCGVHD identified the following parameters for reaching the diagnosis of chronic ocular GVHD: ocular surface disease index (OSDI); Schirmer’s test without anesthesia; corneal fluorescein staining; conjunctival injection. Severity scores from 0 to 3 were assigned to OSDI, corneal staining and Schirmer’s values. Conjunctival injection was scored from 0 to 2. Based on the presence or absence of systemic GVHD, definite ocular GVHD was reached, respectively, with at least 6 or 8 points (Table 2) [8]. The increase of at least 2 points of the score system after HSCT was considered as a worsening.

Table 2 Grading scale according to the International Consensus Criteria on chronic ocular graft-versus-host disease (ICCGVHD) [8]

3. The TFOS DEWS II criteria required for the DE diagnosis, the presence of ocular discomfort symptoms, with an OSDI score ≥ 13, and in addition, at least one eye had to exceed thresholds on one of the three subset signs, chosen among tear break-up time (TBUT) < 10, tear osmolarity ≥ 308, ocular surface staining ( > 5 corneal spots, > 9 conjunctival spots, or lid wiper epitheliopathy of ≥ 2 mm in length and/or ≥ 25% sagittal width) (Table 3) [5]. The aggravation of at least one parameter among those studied after HSCT was considered as a worsening. According to our proposal, the diagnosis of ocular GVHD after HSCT can be reached in case of new positivity or worsening of TFOS DEWS II criteria, regardless of the involvement of other organs.

Table 3 Diagnostic criteria according to the TFOS DEWS II Diagnostic Methodology report [5]

Statistical analysis

Descriptive statistics, including the median and range for continuous variables and the percentage and frequencies of categorical variables, were calculated for assessing the demographic and clinical characteristics of ocular GVHD patients. All data were analyzed with SPSS version 19.0 (SPSS Inc., Chicago, IL, USA).

The proportion of patients diagnosed with ocular GVHD and the relative 95% confidence interval (95% CI) were calculated. Agreement of binary diagnoses between pairs of the three grading scales (NIH, ICCGVHD and TFOS DEWS II criteria) and of the two protocols (Protocols A and B) was evaluated with the Kappa coefficient, and its 95% CIs were estimated.

Results

One hundred nine Caucasian patients fulfilled the study criteria, and were included in the analysis. The characteristics of patients and HSCTs are summarized in Table 4. Sixty out of 109 patients (55% of the total) were found to be affected by DE already before HSCT using TFOS DEWS II criteria.

Table 4 Demographic, clinical and HSCT-related characteristics of subjects included in the study

The proportion of patients diagnosed with ocular GVHD according to the three criteria was analyzed as shown in Table 5. In particular, the NIH, the ICCGVHD and the DEWS II criteria diagnosed ocular GVHD in 14.7%, 17.4% and 59.6% of the HSCT patients, respectively, by applying Protocol A. Conversely, ocular GVHD was diagnosed in 11.9%, 15.6% and 33.0% of the HSCT patients, respectively, by applying Protocol B.

Table 5 Proportion of patients diagnosed with ocular graft-versus-host disease (GVHD) according to different criteria and protocols used

The inter-rate agreement coefficient Kappa (K) for the proportion of patients diagnosed with ocular GVHD by NIH and ICCGVHD criteria was K = 0.626 (95% CI: 0.425–0.827) when applying Protocol A, and K = 0.615 (95% CI: 0.398–0.831) when applying Protocol B. The K coefficient by NIH and TFOS DEWS II criteria was K = 0.144 (95% CI: 0.040–0.248) when applying Protocol A, and K = 0.233 (95% CI: 0.059–0.407) when applying Protocol B. The K coefficient by ICCGVHD and TFOS DEWS II criteria was K = 0.250 (95% CI: 0.140–0.360) when applying Protocol A, and K = 0.499 (95% CI: 0.325–0.673) when applying Protocol B.

The K coefficient for the proportion of patients diagnosed with ocular GVHD applying Protocol A and B was K = 0.881 (95% CI: 0.750–1.012) if NIH criteria were used, K = 0.933 (95% CI: 0.843–1.023) if ICCGVHD criteria were used, and K = 0.501 (95% CI: 0.366–0.636) if DEWS II criteria were used.

Discussion

GVHD is a potentially severe complication that may develop after allogeneic HSCT, representing the most common cause of non-relapse mortality in hematological patients. The ocular surface is one of the target tissues of chronic GVHD, and about half of patients who undergo HSCT later develop DE, or experience its worsening. However, there are no specific symptoms, clinical signs or biomarkers available for a definite diagnosis of ocular GVHD. In fact, it mimics other diseases such as Sjögren or non-Sjögren-associated DE disease and blepharitis, making it difficult to definitively diagnose ocular GVHD, particularly in milder cases. Besides slit lamp examination and clinical tests, conjunctival biopsy is the only option for substantiating the diagnosis of ocular chronic GVHD [14]. However, biopsy is potentially harmful for these patients due to its invasiveness, and cannot be repeated indefinitely if disease monitoring is required.

Different diagnostic criteria for the assessment of the diagnosis and the severity of ocular GVHD have been introduced in the recent years [6,7,8, 13]. Among these, NIH and ICCGVHD criteria are the most commonly used in the field. In particular, the NIH eye score represents a part of a global assessment of chronic systemic GVHD for clinical trials, is easy to use and can be performed by the hematologists themselves. The severity grading has scientific limitations, being based on subjective and behavior-related criteria, and specific ophthalmologic examinations such as slit lamp examination and ocular tests other than Schirmer one, are not included in the recommendation. As a consequence, this classification appears far from satisfactory, and more recently the ICCGVHD proposed more detailed and specialized diagnostic criteria for chronic ocular GVHD, which need to be performed by an ophthalmologists. Both these two diagnostic criteria include Schirmer test, which represents a sole defining criterion in NIH criteria, or one of the four considered parameters in ICCGVHD criteria. However, Schirmer test is not specific, does not reflect the whole spectrum of the disease, and its reliability and sensitivity in diagnosing and monitoring DE have been shown to be very poor, particularly in milder cases [15, 16]. Thus, it has been removed from the recently updated TFOS DEWS II criteria.

In addition, NIH and ICCGVHD criteria do not include TBUT, which is a diagnostic method for determining tear instability, or tear osmolarity, two well-recognized core mechanisms for DE, included in both definition and diagnostic criteria of TFOS DEWS II [17]. These criteria allow a more complete assessment of the tear film, which may provide deeper insight into the condition of the patient.

Recently, the evidence of ocular surface impairment in hematological patients already prior to HSCT is growing, with the need for a new protocol based on pre-HSCT ophthalmological examination, able to distinguish between “conventional pre-existing dry eye” and “dry eye due to active ocular chronic GVHD”.

To overcome these inconsistencies, we applied TFOS DEWS II criteria, traditionally used in the overall DE disease and including also tear instability and osmolarity, to reach the diagnosis of ocular GVHD in our hematological population, and compared their diagnostic performance with NIH and ICCGVHD criteria, already in use in this field. As ocular GVHD seems to be an “organ-specific entity”, we considered the positivity of TFOS DEWS II criteria sufficient to reach the diagnosis of ocular GVHD, irrespective of systemic GVHD status, as already proposed also by the current Chronic Ocular GVHD Workshop and the ICCGVHD criteria [8].

Two different protocols with (Protocol B) and without (Protocol A) the baseline pre-HSCT ophthalmological examination were applied to all the above mentioned criteria. The proportion of patients diagnosed as affected by ocular GVHD varied among the three criteria by applying both Protocols A and B. In particular, when Protocol A was used, over half of patients presented ocular GVHD according to TFOS DEWS II criteria, whereas the other two criteria reached the diagnosis in much less cases (about one-sixth). Instead, when Protocol B was applied, the proportion of ocular GVHD patients diagnosed by NIH and ICCGVHD criteria was similar (slightly lower), whereas that one obtained by TFOS DEWS II criteria was much lower (about half). The explanation for this difference is that TFOS DEWS II criteria diagnosed patients with hyperevaporative DE with low TBUT and pathological Meibomian gland dysfunction (MGD) scores but normal Schirmer test, whereas the other two criteria did not include this kind of patients because they did not investigate tear stability but only (NIH), or mainly (ICCGVHD), tear production. Therefore, the diagnostic performance of NIH and ICCGVHD criteria was not influenced by the type of protocol used, as they did not include among their parameters tear instability, which was demonstrated to be the main parameter impaired already before HSCT.

As a consequence, a high rate of disagreement among TFOS DEWS II and both ICCGVHD and NIH criteria was found by applying both protocols. However, this disagreement was higher between TFOS DEWS II and NIH, whereas was slightly lower between TFOS DEWS II and ICCGVHD because both these two criteria do not require the involvement of another organ to reach the diagnosis of ocular GVHD.

Our results appear consistent with a recent study, which reported substantial agreement between diagnosis reached by NIH and ICCGVHD criteria [18]. Another study compared the diagnostic accuracy of NIH eye score, the Japanese DE score and the DEWS 2007 score, and showed statistically good agreement among these criteria in the diagnostic rate and the severity of ocular GVHD [19]. However, it is difficult to compare our results with this work, because it was performed at a later stage of GVHD, and included a third scale (Japanese DE score), which was not analyzed in our study.

A recent study validated for the first time the ICCGVHD criteria, comparing them with best clinical practice (BCP), defined as ocular GVHD evaluation by highly trained single expert in ophthalmology on the basis of comprehensive clinical observation of the tear film and meibomian glands, among others [20]. There was slight overall agreement between BCP assessment and the ICCGVHD criteria for the diagnosis of chronic ocular GVHD. However, stratification by severity showed that BCP was more sensitive in identifying patients with milder forms of ocular GVHD compared with the ICCGVHD criteria. The underestimation of the milder forms of GVHD by using ICCGVHD criteria is concordant with our results, and seems to be related to the exclusion of TBUT from the diagnostic criteria. On the contrary, the higher was the severity of ocular GVHD, the stronger was the association between BCP and ICCGVHD, reaching a full agreement among patients with severe ocular GVHD, in which also tear production is impaired and Schirmer test values are pathological.

Currently, tear instability is not included in the criteria in use in hematological patients, despite the increasing evidence on the impairment of the lids and meibomian glands even before and in the early stage following HSCT [12, 21,22,23]. In addition, the recent recommendations from the German-Austrian-Swiss Consensus Conference [13] and the GVHD Consortium for clinical trials [24] specifically targeting ocular GVHD agreed that other objective ophthalmological criteria with more detailed measures might be helpful to document ocular chronic GVHD activity. In fact almost the totality of the studies showed the poor performance of Schirmer test in the diagnosis of DE, and this test was recently removed in the NIH guidelines from the markers of severity and from the response criteria. In addition, as the Schirmer test score does not reflect changes in ocular GVHD activity, it was not recommended for the measurement of the changes in ocular GVHD studies by the Chronic GVHD Consortium.

In the present study, we did not measure tear osmolarity in all patients, and therefore we did not use this parameter to reach the diagnosis of DE disease according to TFOS DEWS II criteria. This issue represents the major weakness of the study.

In conclusion, the diagnosis of ocular GVHD varied significantly in our cohort of hematological patients not only according to the diagnostic criteria used, but also to the different visit protocols applied. In particular, the use of the Protocol B, including the baseline pre-transplant ophthalmological examination, may allow to identify DE already prior to HSCT in the majority of patients, and thus to diagnose ocular GVHD after HSCT only in the presence of incident cases, i.e., patients not suffering from DE before HSCT and developing or worsening it after.

Summary

What was known before

  • Ocular GVHD represents a major complication occurring after HSCT. Currently, different diagnostic criteria are in use in this setting for reaching ocular GVHD diagnosis. In addition, despite the recent evidence of ocular surface impairment already prior to HSCT, current criteria do not include baseline pre-HSCT ophthalmological examination.

What this study adds

  • The diagnosis of ocular GVHD varied significantly in our cohort of hematological patients, not only according to the diagnostic criteria used, but also to the different visit protocols applied. Baseline ophthalmological examination may allow to identify DE already prior to HSCT, and thus to diagnose ocular GVHD after HSCT only in the presence of incident cases.