Introduction

Disuse osteoporosis is one of the most common complications of spinal cord injury (SCI) which causes significant morbidity in patients and increased toll for health-care systems [1, 2]. The process of bone loss is rapid and severe and starts as early as in the first year after SCI [3,4,5,6]. Different mechanisms contribute to this process such as physical unloading and loss of innervation [2,3,4]. Bone loss after SCI causes increased risk of fractures due to little or no trauma. It has been estimated that as many as 50% of the patients with SCI experience fractures at some point in their lifetime and the duration of injury is the most important predictor for the occurence of fractures [5,6,7,8]. Fractures may cause various complications such as non-union fractures, delayed healing, and pressure ulcers resulting from bracing, and it has been shown that such events plague more than 50% of fracture-related hospitalizations [9,10,11,12,13]. Dual energy X-ray absorptiometry (DXA) is the gold-standard method widely used to assess areal bone mineral density (aBMD); however, some believe that quantitative computed tomography (qCT) may provide better insight into the boney structure specifically in patients with SCI [2, 3]. New developments in the computed tomography (CT) technology, such as automatic exposure control and thin sectioning, subject the spinal column to a more homogenous X-ray beam. As a result, Hounsfield unit (HU) measurements are closely dependent on the composition of the targeted tissue which theoretically eliminates the necessity of using calibrating phantoms in assessing BMD. [9] It is noteworthy that many patients with SCI undergo CT studies for various reasons throughout the years after injury. The reasons include investigational purposes to determine sources of chronic pain or to evaluate skeletal abnormalities like mal-alignments. In this context, it seems beneficial to incorporate HU-based BMD assessment in routine radiologic reports as it is a measure readily available. The goal of the present study is to determine whether the results obtained using DXA are comparable with HU-based results of the whole spine or regionally in the ability to detect low BMD.

Methods

A database containing imaging studies of war veterans with chronic SCI was retrospectively reviewed. The data were later compared to the medical records of patients such as their ambulatory status, their neurological-deficit status based on American Spinal Injury Association (ASIA) Impairment Scale, and the duration of injury.

Participants who had both DXA and spinal CT investigations at the site of their injury within the same screening visit were included. Those who had instrumental fixations, previous spinal interventions such as vertebroplasty or fusion, were excluded. Using these criteria, 44 male war veterans were included in this study. The measurement of HU was done according to the method described by Schreiber et al. [9].

All DXA scans were obtained using a DMS Stratos device (France, 2015). Measurements were carried out from the first to the fourth lumbar vertebrae and were reported in g/cm2, and the respective average Z- and T-scores were calculated. qCT was performed using 16 channel helical CT scanner SOMATOM Emotion 6 (Siemens, Erlangen, Germany, 1994) in all patients. CT parameters included: slice thickness of 1.25 mm, with 1.0 s interval, tube voltage 130 kV, effective tube current 150 mAs (obtained using CARE dose 4D), and boney reconstruction in three planes and no phantoms were used. Picture Archiving and Communications System (PACS) used was INFINITT software (version 3.0.11.3, South Korea). To measure the average HU in axial planes, an elliptical area was drawn in the vertebral bodies in frames immediately inferior to superior end plate, mid vertebral, and immediately superior to inferior end plate below the site of SCI. This elliptical region did not contain cortical boney tissue (as shown in Fig. 1). HU measurements were done by two different observers (HH and NM) according to the method described by Schreiber et al., and the observers were blinded to DXA result.

Fig. 1
figure 1

Procedure of measuring HU in sublesional vertebrae. Each circle was drawn in a way to include largest possible area in each axial a, coronal b, and sagittal c plane

Categorizing HU into different groups of BMD status was done according to the suggested values by Schreiber et al. for measurements done in lumbar vertebrae and Marinova et al. [14] for measurements done in the thoracic region. For the cervical region, limited amount of evidence existed; however, we referenced work by Weishauptet al. [1] for 25 male participants and an average of 250 was chosen which was in compliance with other work done by Yoganandan et al. [15] in female participants. DXA reported T-scores were categorized into groups of T-scores higher than −2.5 standard deviations (SD) and −2.5 SD or below. The suggestions of the International Society for Clinical Densitometry, the American College of Radiology, and the American College of Physicians for diagnosing osteoporosis in men who are at least 50-years old and have additional risk factors, and other studies [1,2,3,4,5,6,7, 9,10,11,12,13,14,15,16], were taken into consideration for choosing this cutoff.

Statistical analysis

Normal distribution of quantitative variables was assessed using Shapiro–Wilk test and visual inspection of histograms. Inter-observer and intra-observer reliability were assessed using interclass correlation coefficient reported as a score between 0 representing no agreement and 1 representing perfect agreement. Scores higher than 0.8 are considered as an excellent agreement. Further analysis to compare subgroups in aspects of age and time since injury was done using one-way ANOVA, and using Fisher’s exact test for ambulation status and partial or complete motor injury. Measures of diagnostic accuracy were calculated using receiver operating characteristic (ROC) analysis. Comparison between the two methods was carried out using Fisher’s exact test.

Results

Reliability

Measuring HU according to the method proposed by Schreiber et al. was reliable with inter-observer reliability of 0.988 and intra-observer reliability of 0.992, both showing excellent levels of agreement.

Sample demographics

The average age for 44 participants was 50 ± 7 and there was no significant age difference between the subgroups of different BMD status (p-value = 0.069). Distribution of DXA Z-scores vs. age was random, which is depicted in figure S1. Out of 44 participants, 8 participants (18%) had cervical spine injury, 24 (55%) thoracic spine injury, and 12 (27%) lumbar spine injury. The mean duration of SCI was 28 ± 4 years, which did not differ significantly across the subgroups of different BMD status (p-value = 0.349).

Out of 44 participants, 4 (9.0%) walked independently, 8 (18%) walked using unilateral or bilateral canes, 28 (64%) moved independently by utilizing wheelchairs, and 4 (9.0%) were dependent for movement completely on others. Ambulatory status did not differ significantly across the subgroups with different BMD (p-value = 0.698).

Overall, 15 participants (34%) had motor complete SCI whereas 29 (66%) had motor incomplete SCI. Status of motor complete or incomplete injury did not differ significantly across the different BMD subgroups (p-value = 0.452) (Tables 1 and 2).

Table 1 Sample demographics and characteristics
Table 2 Comparison of DXA-based results vs. qCT-based results

BMD comparability

We categorized qCT and DXA results based on the previous studies and the suggestions which were specified in methods section. Performing Fisher’s exact test to assess whether the two methods differed in general assessment of BMD revealed no significant difference with a p-value of 0.237 and R = 0.188. Further measures of diagnostic comparability are presented in Table 3.

Table 3 Decision matrix comparing result from HU-based qCT with DXA results (reference test) at each region of the spine

ROC analysis was performed for each region of the spine separately as each region differs from the other in the range of normal HU values.

Due to small numbers of participants in each subset, coordinates of ROC curves are given in the supplements (Table S1) since these coordinates provide more information in comparison with the curve itself when the sample size is small. The curves are presented in Fig. 2. Areas under curves (AUC) for cervical, thoracic, and lumbar spine were 0.27, 0.53, and 0.83 with p-values of 0.297, 0.876, and 0.062, respectively. As it can be seen in Table S1, a cutoff point of 375 HU in the cervical spine would provide a sensitivity of 0.67 and a specificity of 0.40. In the thoracic spine, the respective cutoff of 230 HU would provide a sensitivity of 0.5 and a specificity of 0.23, while in the lumbar spine a cutoff of 150 HU would give a sensitivity of 0.80 and a specificity of 0.86. p-Values for all AUC measures were non-significant (p > 0.05).

Fig. 2
figure 2

ROC curves demonstrating HU comparability with DXA-based results at each region of the spinal column. For information on the coordinates of the curve please refer to Table S1

Discussion

Osteoporosis is a serious complication of SCI which imposes increased costs on health-care system and endangers patients with various consequences [1,2,3,4,5,6]. Patients with chronic SCI may be investigated with CT scans throughout the years for complications associated with the condition, such as vertebral fractures, facet joints degenerations, and preoperative studies for mal-alignment correction or other surgeries. It has recently been suggested that due to thin sectioning and automatic exposure adjustments implemented in new CT scanning modalities, HU measurements provide a method readily available to assess aBMD with comparable accuracy to DXA [9, 14]. In this study, we aimed to investigate how well the proposed HU normal ranges performed in comparison with DXA in patients with chronic SCI. Results of this study demonstrate that considering qCT to be superior or inferior in assessing BMD in comparison with DXA is rather a simplistic view on this matter. As we can see from Table 3 and Fig. 2, the comparability of the two methods increases from the cervical spine to lumbar spine. This may be due to BMD heterogeneity in SCI setting. It has been previously suggested that while supralesional BMD is fairly preserved in patients with SCI, severe sublesional osteoporosis can occur [17]. This pattern of bone loss is the reason we chose to measure HU in sublesional vertebral bodies. However, while DXA and qCT may not show desirable comparability, we believe in the future HU measurement may prove to be a valuable tool to assess regional BMD, which in turn is an important prognostic predictor of device implantation success [18,19,20,21,22]. Coordinates of the ROC curve provide a better insight into this matter by showing sensitivity and specificity of a single HU measurement as is mentioned in the results section and can be further explored in Table S1.

Measuring HU in the planned trajectory of screw fixations may provide important insight for surgeons about strength of bone–screw interface before operation. This correlation has been tested in dentistry to see if bone strength assessed by using qCT with automatic exposure control correlates with insertion torque and stability of metal implants both in vivo and in vitro [18,19,20,21,22]. For this purpose, further correlations between HU measurements and screw fixation outcomes may prove beneficial. Another advantage of this method is that qCT measures mineral density of trabecular bone which is more metabolically dynamic and responds to treatments faster [23,24,25,26]. Present study however has following limitations: first limitation was lack of normative data for the cervical spine. In this study, we decided to choose an average of 250 HU for measurements done in cervical region which was referenced on work by Weishaupt et al. and similar study by Yoganandan et al. [1, 15]. However, because our sample was averagely older and all-male, this value might not be a proper representative of average HU in the cervical spine. For the thoracic and lumbar spine previously normative data have been suggested; however, establishing conclusive worldwide normative values represented following STARD guidelines is yet to be done. Another important limitation of this study was the small sample size. This is due to physicians’ preference of requesting spinal MRI, wherever applicable, mainly because MRI does not subject patients to X-ray beam and also because it visualizes the cord itself in which multiple abnormalities can occur in chronic SCI such as cord tethering, cyst formation, and cord atrophy [27, 28].

In conclusion, this study suggests using qCT, done throughout the spine, can become a useful tool in assessing BMD. Preferably, large prospective cohort studies which use fracture rates as the main outcome will assist in determining normal HU values for the spinal column and provide a better insight into the comparability of DXA and qCT. However, qCT can be done on any vertebral body, and with the growing pool of normal values, it can be used for pre-operative assessment about screw implantations stability and to serve as a regional measure of spinal strength.