Introduction

Thyroid eye disease (TED) is the most common extrathyroidal manifestation of autoimmune hyperthyroidism, occurring in 30–50% of cases e1,2. TED is not limited to patients affected by Graves’ disease, but may also be observed in Hashimoto's thyroiditis, and rarely also without thyroid autoimmunity (euthyroid orbitopathy)3,4,5,6.

In TED, autoimmune-induced stimulation of orbital fibroblasts leads to increased production of hydrophilic mucopolysaccharides i.e., hyaluronic acid, secretion of pro-inflammatory cytokines, and enhanced adipogenesis hence causing edema and expansion of orbital connective tissue7,8.

The aforementioned changes increased volume and pressure within the bony orbital cavity are responsible for most mechanical problems in TED9. Inflammation, edema and swelling of the extraocular muscles impair ocular motility with subsequent diplopia.

A serious complication of TED is the occurrence of dysthyroid optic neuropathy (DON) in approximately 5% of cases. DON is characterized by reduced visual function/acuity, color perception and visual field defects which may progress to permanent loss of vision if untreated10,11. The most widely accepted cause of DON is the compression of the optic nerve by expansion of the enlarged extraocular muscles and—to a lesser extent—orbital fat, causing ischemia and inhibition of axonal flow7.

Earlier studies analyzed orbital bone dimension and size of extraocular muscles using computed tomography (CT). However, uncertainty remained regarding the association of bowing of the medial wall in DON, as some studies reported such an association12,13, while others did not14,15.

As volumetric analyses seem superior15,16 over plain measurement of muscle diameter to predict DON, we decided to combine volumetric measurements with angular measurements of the orbit as previously described al.13,14, in order to improve risk stratification for patients with TED to develop DON.

Material and methods

Study population

Orbital CT scans of 92 patients with clinically diagnosed TED (184 orbits) acquired 2014–2017 were retrospectively analyzed (Fig. 1). All patients were clinically diagnosed with TED according to the defined clinical criteria of the European Group on Graves’ Orbitopathy (EUGOGO)2.

Figure 1
figure 1

Flowchart of patient inclusion and exclusion in this study.

Of these 92 patients with clinically diagnosed TED, 49 patients (98 orbits, 26 male 72 female, mean age 49 ± 13.8 years) were allocated to the TED only group (without DON). DON was diagnosed in 43/92 patients (86 orbits). Of these, 76 orbits (59 female, 17 male, mean age 60 ± 12.5 years) were allocated to the TED + DON group. Optic nerve dysfunction was considered present if at least two of the following tests were impaired: visual acuity, pupillary light response, color vision/saturation and perimetry; patients with other causes for the impairment of visual function were not included in this population10. Patients with other causes for the impairment of visual function were not included in this population10. Visual acuity in logMAR was 0.51 ± 0.51 (0.00–3.00) in patients with DON, with two patients being able to only count fingers or to see hand movement. 30 eyes of 76 patients with DON (39.5%) had a relative afferent pupillary defect. Mean defect was 10.86 ± 7.22 dB using automated static perimetry.

In 10 patients DON was unilateral. This unaffected (ua) eye was measured as well and formed its own group (TED + DON (ua)) (10/86 orbits, 3 male, 7 female, mean age 61 ± 12.6 years).

None of the assessed patients had any prior ophthalmic or sinusal surgery.

We furthermore analyzed the orbits of 20 subjects (40 orbits, 28 female, 12 male, mean age 57 ± 8.9 years), who served as control. These CT scans were performed to e.g. rule out intracranial hemorrhage or ischemia. None of these subjects had skull or face injuries, a history of thyroid or orbital disease or any other condition that could interfere with proper measuring.

The study protocol was approved and consent was waived by the local ethics committee (Landesärztekammer Rhineland-Palatine, #2021-15977). The study was conducted according to state laws. All research was performed in accordance with relevant local guidelines and/or regulations and in accordance with the declaration of Helsinki.

Radiologic analyses

All orbital CT scans were part of a routine clinical protocol performed on a Toshiba Aquilion 32 slice CT (Toshiba Medical Systems Group Co, Zoetermeer, The Netherlands). The acquisition parameters used were as follows: 120 kV, 200 mA, 210 mm display FOV, pitch of 21 and a slice thickness of 0.5 mm.

To ensure consistency throughout all measurements, the measurements were taken on reconstructed 2 mm slices to be either orthogonal to the optic nerve (muscle volume measurements) (Fig. 2) or parallel to the infraorbitomeatal line (bony orbit measurements) (Fig. 3).

Figure 2
figure 2

Coronal reconstructed CT images (orthogonal to the optic nerve) (A) patient with TED + DON and massive enlargement of the EOM (measured muscles exemplary outlined in green); (B) patient with TED only and substantial increase in muscle volume; (C) healthy control group orbit with normal muscle dimensions.

Figure 3
figure 3

Angle and protrusion measurements: (A) apex point, (B) orbital rim angle, (C) lateral orbital wall angle, (D) medial orbital wall angle, (E) length of the lateral orbital wall, (F) anterior medial wall angle, (G) AMW = anterior medial wall angle (see chapter “Angular measurements” for further details of parameter assesment).

Angular measurements

Orbital angles and length of the lateral orbital wall, as well as medial wall measurements were performed as described in recent studies13,14.

The apex point was defined as the anterolateral border of the groove in the sphenoid body formed by the intracavernous portion of the internal carotid artery (red cross in Fig. 3A). From this point, the orbital rim angle was measured on the height of the medial palpebral ligament (angle between green lines in Fig. 3B). The medial and lateral orbital wall angles were assessed on the image where the medial and lateral rectus muscle had the widest diameter as angular change between the orbital rim angle and the widest point of the orbital wall (Fig. 3C,D). The length of the lateral orbital wall was measured just below the anterior clinoid process (Fig. 3E).

To assess the anterior medial wall angle, a perpendicular line was drawn from the beginning of the medial orbit wall on the nasal bone (Fig. 3F, point 1) perpendicular to the interzygomatic line. This was considered line A (Fig. 3F, A). Line B (Fig. 3F, B) was drawn, connecting point 1 with the point of maximum excursion of the medial wall (Fig. 3F, point 2). The resulting angle formed between line A and line B was considered the anterior medial wall angle (Fig. 3F, AMW). This angle was considered positive if the point 2 was lateral to line A or negative, if medial to it. Exophthalmos/Protrusion was measured as the distance between the interzygomatic line and posterior sclera just lateral of the optic nerve head (blue lines, Fig. 3F).

Volumetric analyses of the extraocular muscles

Volume of the extraocular muscles was measured using the closed polygon tool of OsiriX version 9.0 (Aycan Digitalsysteme GmbH, Würzburg, Germany) by outlining the profile of the muscles on coronal images. Due to difficulties to accurately discern the levator palpebrae from the superior rectus muscle, these muscles were evaluated as the superior muscle group.

Statistical tests

Multiple statistics were conducted with IBM SPSS Statistics for Windows, version 25 (IBM Corp., Armonk, N.Y., USA) using linear mixed models to compare muscle volumes and angles between patient groups and a generalized linear mixed model to predict DON from these measurements. All available variables that were distinctly different (p < 0.05) on univariate analysis were then encompassed in a multivariate model. No adjustment for multiple testing was done.

Results

Disease duration was significantly shorter and clinical activity score was significantly higher in patients with TED + DON compared to patients with TED only (disease duration: 2.1 ± 3.1 vs. 4.6 ± 5.2 years, p = 0.006; CAS: 1.9 ± 1.1 vs. 3.9 ± 1.2, p < 0.001). Likewise, ocular motility was restricted significantly stronger in patients with TED-DON compared to TED only (Table 1). Compared to patients with TED only, the medial, inferior, lateral, and the superior muscle group volume were significantly higher in the TED + DON group using the univariate mixed model analysis (all p < 0.001). Muscle volume of the TED group was significantly larger (p < 0.005) compared to the control group in all measured muscles, except for the lateral rectus muscle (p = 0.055, Table 1, Fig. 4).

Table 1 Muscle and angle measurements of the different groups and ophthalmological patients’ characteristics.
Figure 4
figure 4

Boxplots showing the differences between muscle volumes of each group. MRV (medial rectus muscle volume) IRV (inferior rectus muscle volume) SMGV (superior muscle group volume) LRV (lateral rectus muscle volume). *Indicates p < .05, **indicates p < .001.

The muscle volume of each muscle of the unaffected orbits of DON patients (TED + DON (ua) group) was markedly decreased compared to those affected by DON (TED + DON group; e.g. medial rectus volume (MRV) M = 1.56 cm3 vs 2.16 cm3). However, compared with the TED group or the control group, the TED + DON (ua) group showed a distinctly increased muscle volume for all muscles (e.g. MRV: M = 1.56 cm3 vs. M = 1.06 cm3; p = 0.015; vs. M = 0.73 cm3; p < 0.001) apart from that of the lateral rectus muscle (M = 1.03 cm3 vs M = 0.81 cm3; p = 0.603 vs M = 0.63 cm3; p = 0.062).

With regard to the bony boundaries of the orbit, differences in the medial orbital wall angle were noticeable. Patients with TED + DON showed a more concave medial wall than patients of the TED group (M =  + 0.98° vs. − 2.3°; p < 0.001) or the TED + DON (ua) group (M = − 1.95°, p < 0.001). The medial wall angle also differed significantly between the control group and patients of the TED group (M = − 4.22° vs. − 2.3°; p = 0.03).

The lateral wall angle, orbital rim angle, as well as the length of the lateral orbit wall did not differ markedly between the TED + DON, TED or TED + DON (ua) groups. Neither was there a difference when comparing the control group with the TED + DON group and the TED group.

Both types of angle measurement of the medial orbital wall showed the correlation between increased muscle volume and bowing of the medial wall. The medial rectus muscle had a slightly higher correlation (r = 0.564) compared to the other muscles or to the total muscle volume (Table 2).

Table 2 Pearson correlation between the curvature of the medial orbital wall as well as the extent of proptosis and muscle volume.

In the generalized linear mixed model, which contained all muscle measurements with distinct differences (i.e. p < 0.05) between the TED + DON and TED groups, the medial rectus muscle was the most influential variable regarding diagnostic accuracy (specificity: 86.7%, sensitivity: 73.7%). Adding all other muscles resulted in a specificity of 87.8% and a sensitivity of 77.6%. Further addition of the medial wall angle did not confer any diagnostic benefit.

A few subjects (n = 5, Fig. 5, marked red) stood out in that they had a higher MRV than that of the TED + DON group without, however, being affected by DON. There are also patients with DON (n = 6, Fig. 5, marked yellow) having a similar MRV and a smaller medial wall angle compared to patients with TED only.

Figure 5
figure 5

Scatterplot showing medial rectus volume (MRV) as well as the medial angle change (MAC). Upper horizontal line shows the average MRV of the TED + DON group (2.16 cm3). Lower horizontal line is set at the lowest MRV at which DON occurred (0.92 cm3). Right vertical line is showing the average MAC of DON patients, the left vertical line shows the average MAC of the TED group. Patients with higher than average MRV but not affected by DON are marked red. Yellow squares are marking DON-patients with a similar MRV and smaller MAC than the average of the TED group.

Discussion

Radiological characteristics in patients with DON have been a major subject of research. Changes such as enlargement of the superior ophthalmic vein, anterior displacement of the lacrimal gland or the presence of an intracranial fat prolapse have been investigated17,18,19. The most thoroughly investigated radiological parameter aiding the diagnosis of DON is the increase of extraocular muscle size, ultimately leading to apical crowding and DON.

In the underlying study we have been able to confirm an increase of muscle volume especially that of the medial rectus muscle, to be the strongest predictor for the development of DON in patients with TED. Although dimensions of the bony orbit significantly differed between the examined groups, there was no difference predisposing to the development of DON in patients with TED prior to disease-related bony orbital changes.

In other studies apical crowding was graded on a categorical scale or by forming the ratio of the diameters of the horizontal or vertical extraocular muscles and the diameter of the bony orbit20,21. Other authors calculated the ratio between soft- and fat tissue around the apex area to form a volumetric apical crowding index16.

To our knowledge, the underlying study is the only one including both, measurements of individual muscle volumes as well as of the bony orbit. Furthermore, our collective of DON patients seems to be the largest hitherto analyzed. Finally, previously published research ignored the unaffected orbits of patients with DON, which in our study were analyzed as a separate subgroup, as it cannot be excluded that relevant anatomical differences prior to disease manifestation in orbits of patients with DON and those without exist.

With the medial orbital wall being much thinner than the lateral orbital wall and the air-filled ethmoidal sinuses providing less resistance, the medial wall is more prone to bony remodeling. This thesis is supported by the fact that the change of the medial orbital angle correlates with an increase in muscle volume, especially that of the medial rectus muscle as shown in the TED + DON-group. Our findings regarding muscle volume are consistent with previously published research13,15, showing the volume or thickness of the medial rectus muscle to be the most significant predictor for the development of DON, which is most likely due to its close anatomic relation to the optic nerve in the optic canal15. At some point during the process of muscle enlargement, compensatory mechanisms of the orbit (mainly the expansion into the ethmoidal sinuses) seemed to exceed their limits leading to a critical increase of pressure within the orbit and ultimately optic nerve damage.

Subjects in the TED + DON group showed a distinctly increased medial orbital angle compared to the TED group. The angles of both groups were also significantly increased compared to the control group. This is in accordance with other studies12,13. Adding medial orbital angle change to our multivariate analysis, however, did not improve the diagnostic accuracy. This suggests that the increased bowing of the medial wall serves as a surrogate parameter for the increase in muscle volume. Interestingly, this finding is contrary to previous studies14,15, in which there was no significant difference in medial orbital wall curvature between these groups. Reasons for this might be the use of subjective evaluation methods to assess the bowing of the medial orbital wall15, the higher number of subjects in our DON group, or the extent of muscle enlargement inherited by them.

Previous findings describing an increase of the lateral wall angle in patients suffering from DON could not be replicated in our study. While subjects allocated to the TED + DON group had a lateral wall angle larger than subjects of the TED group, the difference was not significant (p = 0.6). Reasons for this could be the larger sample size of our study, the different patient ethnicity, group composition or even more increased bowing of the medial wall resulting in a reduction of pressure on the lateral wall. The fact that lateral orbital wall remodeling occurs later during disease progression could also explain this finding.

Unaffected (ua) orbits of DON patients did not differ markedly from orbits of the TED group regarding the dimension of the bony orbit. Despite the small sample size and the assumption, that both orbits developed more or less equally, this data might suggest that there are no characteristic bony differences or that these are too minuscule to measure, which would affect the development of DON patients prior to thickening of the muscles.

Besides bony orbital changes, protrusion of the eyeball is another effect of increased intra-orbital volume. Proptosis was remarkably higher in DON patients and correlated well with muscle volume (Table 1). The scatterplot (Fig. 5) of medial rectus volume (MRV) and medial angle change (MAC) of DON and TED + DON patients demonstrates, that a sharp delineation between these two groups is not possible and that some cases (red and yellow points) develop atypically. Thus, these two groups therefore cannot be easily distinguished from the non-affected, respectively affected population on the basis of the measurements obtained and illustrate the difficulty of diagnosing DON using anatomical radiological data. Thus, alternative approaches like radiomics or diffusion tensor imaging might offer interesting approaches and are currently being investigated. However, currently, the clinical ophthalmological examination is indispensable and increases sensitivity and specificity22.

Our data confirms the heterogeneity of the disease with several existing subtypes23. While most of the subjects exhibiting DON are showing a distinct increase in muscle volume, a subset shows no or barely any increase. On the other hand, our data shows a trend towards discrimination of both groups (TED and TED + DON) and in our opinion it may be advisable, that patients with known TED and a medial rectus volume higher than 0.9cm3 should be monitored more closely disregarding the dimensions of the bony orbit, as this was the lowest value in our population associated with the development of DON (Fig. 5, lower horizontal line). The heterogeneity of TED has also been emphasized by Uddin et al., who stressed TED to be heterogeneous in its underlying pathogenesis, clinical manifestations, and response to medical and surgical treatment modalities24. In their manuscript they criticized that several previous categorizations of the clinical appearance of TED were dichotomous and underrepresented the heterogeneity of the disease. Consequently, Uddin et al. suggested a new classification system dividing TED into six different classes based on the phenotypic features (clinical and radiologic) and their response to different treatments.

Heterogeneity has not only been reported for clinical and radiological findings in EO, but also different imaging modalities including MRI and PET (alone or in combination with MRI or CT) have been used to study various morphological and functional parameters. In a recently published work, Weber et al. reported PET-MRI to be suited for the assessment of EO inflammation and to be a good discriminator for severe vs. mild-to-moderate EO25. A variety of other parameters have been analyzed in EO in numerous other studies. These were mainly morphological parameters like the volume or width of fat or muscles26 or the optic nerve27, herniation of the lacrimal gland28, contrast agent uptake and T2-hyperintensity (as surrogate parameters for inflammation)29,30, quantitative MRI and T1-mapping of eye muscles31,32, just to name a few. Therefore imaging has a significant role in the investigation of TED. Siakallis et al., however, correctly pointed out that a consensus on the use of the different imaging modalities in the course of disease has not yet been reached33.

As it is uncommon to routinely perform CT or MRI of the orbit prior to the occurrence of TED, a definitive conclusion about risk factors regarding the dimensions of muscles and the bony orbit aggravating the progression from mere TED to TED + DON cannot be made based on this retrospective study. Thus, a follow up study closely assessing changes in muscle volume in our set of TED patients might be of interest to identify novel parameters (such as an increase in medial rectus volume in a certain amount of time) which are better suited to stratify the risk to progress from TED to TED + DON.

It should be noted that the gender ratio in our study differs from other publications. In our population the proportion of males in the DON group was noticeably lower than in comparable studies13,16,20. It might be noteworthy that inclusion of patients with mild and inactive disease in the TED group (compared to the moderate to severe and sight-threatening cases) can be a confounding factor. As our center primarily sees patients who have a more serious course of disease and for whom a CT scan is justified, the measured values might not necessarily be comparable to those of the entire TED population. DON, in the majority of cases, is a clinical diagnosis being based on abnormalities in assessment of visual acuity, color vision/color saturation, perimetry, fundoscopy and/or by testing of the pupillary light reaction/relative afferent pupillary defect in patients with TED. Radiological imaging while not mandatory, should aid in making a diagnosis in borderline cases or help to identify patients with an increased risk for the development of a DON.

To conclude, the underlying study shows the complexity of DON and the inherent problem of identifying patients with TED to develop DON. From our experience volumetric measurements of the medial rectus muscle seem to provide the best sensitivity and specificity for the development of DON and patients with a volume of the medial rectus muscle of more than 0.9cm3 should be monitored closely.