Introduction

Since Galibert and Deramond first used bone cement to treat aggressive cervical hemangiomas in 19871, this procedure has been verified by numerous studies to be an effective minimally invasive surgery for the treatment of osteoporotic vertebral compression fractures (OVCFs)2,3,4,5. As the population continues to age, the incidence of vertebral compression fractures caused by osteoporosis has also increased6,7,8. For the foreseeable future, vertebral augmentation will continue to play an important role as a minimally invasive treatment of OVCFs.

Vertebral augmentation can quickly alleviate the pain caused by OVCFs only when the bone cement rapidly halts crack propagation of the fracture and restores the height and stiffness of the fractured vertebra to meet weight-bearing demand. In addition, cytotoxicity and heat generation of bone cement also play a role in pain relief of patients received vertebral augmentation9,10,11. The amount of bone cement used can influence risk of various complications after vertebral augmentation12,13,14,15,16,17,18,19,20,21. Injection of excessive bone cement tends to induce new fractures of adjacent vertebral bodies and may increase the leakage rate12,13,14,15, while too little bone cement can result in insufficient filling, which can fail to relieve the pain rapidly15,16 and easily induce re-collapse of the treated vertebrae17,18,19. However, the amount of bone cement per se is unlikely to be the most reliable predictor of whether the cement will achieve a satisfactory distribution, given differences in vertebral body size as well as in location and degree of compression14,15,16.

Many studies have shown the distribution of bone cement to be a more suitable index12,13,18,19,22,23,24. Calculating the volume fraction of bone cement based on computed tomography may be more accurate, while computer simulation combined with hydrodynamics can also model distribution of bone cement25,26,27. However, these methods are quite complex and cannot be conducted in real time to guide intraoperative procedures14,15,16. Methods based on X-ray images have been reported to evaluate distribution of bone cement, but they differ widely and lack consistency or prognostic relevance12,15,22,23,24. Consequently, there is no recognized, practical way to evaluate the distribution of bone cement in vertebral augmentation and therefore predict risk of subsequent re-collapse or new fracture.

To address this gap, we assessed different methods for analyzing the distribution of bone cement on X-ray images after vertebral augmentation, and then predicting new symptomatic fracture or re-collapse of augmented vertebrae. Our goal was to determine the best evaluation method and the corresponding optimal distribution of bone cement.

Materials and methods

Patient characteristics

This study was approved by the Ethics Committee of West China Hospital of Sichuan University. All methods and procedures were carried out in accordance with relevant guidelines and regulations, and informed consent was obtained from all subjects. This retrospective study included data from patients who received vertebral augmentation in our two institutions from April 2014 to March 2019. The specific inclusion criteria were: (1) the patient was aged ≥ 70 years, or the bone mineral density (BMD) measured by dual-energy X-ray absorptiometry was T ≤ -2.5; (2) the patient had a known history of hypochondriac pain and/or back pain, with or without limited mobility; (3) acute, single-segment OVCF was confirmed by magnetic resonance imaging (MRI); and (4) the patient was followed up for at least 6 months after surgery, with the most recent X-ray examination at least 6 months postoperatively. Patients were excluded if they underwent vertebral augmentation because of pathological fractures caused by spinal neoplasms, or if they suffered new fractures caused by high-energy trauma during the follow-up period.

Surgical procedure

All patients were given local anesthesia, and surgery was performed under a C-arm X-ray machine while the patient was lying in the prone position. A 3.5-mm puncture needle was used to puncture the pedicle of the vertebra. During the needle insertion, the angles of introversion, extroversion, and cephalocaudal inclination were adjusted based on C-arm fluoroscopy, until the puncture needle reached the junction of the middle and posterior thirds of the vertebral body. Then, if the patient chose percutaneous kyphoplasty (PKP), the balloon was inserted into the vertebra with height reduction, followed by removal of the balloon and rapid injection of polymethylmethacrylate. If the patient chose percutaneous vertebroplasty (PVP), direct polymethylmethacrylate injection was performed. The same kind of bone cement (Osteopal V, Germany) was used in all patients regardless of procedure method.

Post-operative treatment and follow-up

After surgery, patients rested in the supine position for 3 h and gradually resumed off-bed activities with waist support. Routine examinations of anteroposterior and lateral X-ray films were performed to assess the distribution and leakage of bone cement. Computed tomography and MRI were performed on patients whose pain was exacerbated or not significantly alleviated, and on patients who had emerging nerve root pain; based on the findings, further treatment was undertaken as necessary. Patients were routinely administered calcium and active vitamin D after surgery, and 31 patients elected additional treatment with zoledronic acid.

After discharge, patients were followed-up by telephone every 3 months to enquire about their pain and daily activities. If a patient mentioned back or hypochondriac pain that had lasted longer than 3 days or was not relieved significantly after taking nonsteroidal anti-inflammatory drugs (NSAIDs), they were instructed to return to hospital for an X-ray examination. If new fractures were strongly suspected, confirmatory MRI was performed.

Description of variables

Data were collected on each patient’s sex, age, body mass index (BMI), and puncture method (unilateral or bilateral), treatment method (PVP or PKP), cause of fracture, and other imaging data, including X-ray films before surgery and during follow-up. A spine surgeon (11 years of experience in spine surgery) and a radiologist (7 years of experience in musculoskeletal system images) jointly evaluated the images to determine the number of augmented vertebral bodies, whether there was a cleft in the fractured vertebra, location of the fractured vertebra, whether the degree of compression of the fracture exceeded 50%, re-collapse of the augmented vertebra, new fracture, and bone cement leakage. Five different evaluation methods were used to assess the bone cement distribution. When the evaluation results differed, the two physicians discussed and made a final decision.

Re-collapse of augmented vertebrae was diagnosed when the angle between the upper and lower endplates of the augmented vertebra (based on images from the last follow-up) had changed by more than 10° from the angle immediately after surgery, or when the height of the augmented vertebra was compressed by more than 15% from the height immediately after surgery19. New fracture was diagnosed based on hyperintensity on T2-weighted MRI with fat suppression. A typical series of images is shown in Fig. 1.

Figure 1
figure 1

X-ray images of a 79-year-old woman. (A,B) Images taken at admission, showing vertebral compression fracture at L1. (C,D) Anteroposterior and lateral images taken after percutaneous kyphoplasty, showing restoration of the fracture. Bone cement distribution was scored as 7 using method 1, 8 using method 2, 6 using method 3, and 9 using methods 4 and 5. (E,F) Images taken at 6 months after surgery, showing re-collapse of the operated vertebra.

Based on previous reports15,23, we developed and compared three new methods to evaluate anteroposterior and lateral X-ray images. Our goal was to identify a method showing good inter-rater consistency that could reliably predict new fracture and re-collapse. The methods differed in how vertebral quadrants were defined and in how affected quadrants were evaluated. We named the five methods based on the number of scores that each generated: method 1 was termed the 8-score method; method 2, 10-score; method 3, new 8-score; method 4, 9-score; and method 5, 12-score. Further details of the five methods are described in Table 1.

Table 1 Methods to evaluate cement distribution from X-ray images in this study.

Statistical analyses

Measurements were expressed as mean ± standard deviation. Inter-group differences in continuous variables were assessed for significance using one-way ANOVA in the case of normally distributed data, or the Kruskal–Wallis rank-sum test in the case of skewed data. Differences in categorical variables were assessed using the χ2 test. The kappa statistic was used to assess agreement between the two clinicians in assessing bone cement distribution, while receiver operating characteristic (ROC) curves were calculated to assess ability to predict new fracture or re-collapse of augmented vertebrae. The bone cement evaluation method that had the largest area under the ROC curve (AUC) was used to plot the cumulative curves of new fracture and re-collapse at each score, and linear regression was performed in each case. Statistical analyses were performed using SPSS 23.0 (IBM, Armonk, New York, USA) at a significance level of α = 0.05.

Results

Patient characteristics and rates of new fracture and re-collapse

A total of 263 patients with single-segment OVCFs were enrolled in this study, including 43 males and 220 females, with an average age of 73.43 ± 8.34 years. Among them, 62 patients had a history of vertebral augmentation. A total of 137 new fractures were observed in 119 (45.25%) of the 263 patients, of which 54 (45.38%) were adjacent or included adjacent vertebrae. There were significant differences between patients with or without new fractures in terms of distribution of bone cement, thoracolumbar location, age, number of augmented vertebrae, and whether clefts were present or the cause of the fracture was known (Table 2). During follow-up, 49 of 263 (18.63%) patients suffered re-collapse of augmented vertebrae. There were significant differences between patients with or without vertebral re-collapse in terms of puncture method, thoracolumbar location, number of augmented vertebrae, and bone cement distribution (Table 2). One patient died during the follow-up period.

Table 2 Baseline characteristics of patients, stratified by whether they experienced new fractures or re-collapse of the augmented vertebra.

Inter-rater consistency and ROC curves for different evaluation methods

Two physicians used five methods to assess bone cement distribution based on X-ray images. The kappa values were as follows: 8-score, 0.716; 10-score, 0.695; new 8-score, 0.673; 9-score, 0.714; and 12-score, 0.652 (Table 3). The corresponding AUCs for predicting new fractures were 0.568, 0.579, 0.576, 0.582, and 0.591 (Fig. 2A). Except for the 8-score method (P = 0.059), all other methods showed statistical significance (P < 0.05, Table 3). Respective AUCs for predicting incidence of augmented vertebra collapse were 0.818, 0.874, 0.893, 0.807, and 0.933 (Fig. 2B), which were all statistically significant (P < 0.05, Table 3). The 12-score method showed higher AUCs for predicting new fractures and re-collapse of augmented vertebrae (Table 3).

Table 3 Inter-rater consistency and ability of each bone cement evaluation method to predict new fracture or re-collapse of augmented vertebra.
Figure 2
figure 2

Receiver operating characteristic curves to assess the ability of the various methods of bone cement evaluation to predict (A) new fracture or (B) re-collapse of the operated vertebra.

Predictive value of 12-score method based on linear regression

Scores for the 12-score method concentrated in the range of 6–12, with a mean of 10.33 ± 1.62. The cumulative proportions of new fractures and re-collapse corresponding to each score are shown in Table 4, and the line chart based on these data is shown in Fig. 3. The score corresponding to the intersection point of the two lines was 10. Linear regression showed that the distribution of bone cement was a significant predictor of re-collapse of the operated vertebra (F = 237.753, P < 0.001; Table 5), with a regression coefficient of − 0.144 [95% confidence interval (CI) − 0.168 to − 0.120], and a significant predictor of new fracture (F = 13.091, P = 0.015; Table 5), with a regression coefficient of 0.033 (95% CI 0.010–0.057).

Table 4 Cumulative numbers of new fractures and re-collapses of augmented vertebrae for each score observed using the 12-score method.
Figure 3
figure 3

Cumulative proportions of new fracture and re-collapse of treated vertebra associated with each score observed in our patient sample based on the 12-score method.

Table 5 Linear regression analysis of the 12-score method’s predictions of new fracture or re-collapse of the treated vertebra.

Discussion

The distribution of bone cement is known to affect the outcome and complications of vertebral augmentation12,13,14,15,16,17,18,19,20,21,22, but there is no recognized, convenient, effective method to assess such distribution or predict prognosis for patients undergoing vertebral augmentation. It would be particularly helpful if the method could assess distribution and predict outcome in real time to guide surgery. In the present study, we compared the ability of five methods to assess bone cement distribution from X-ray images and predict risk of new fracture or re-collapse of treated vertebrae. The methods here were taken directly from the literature (8-score method23 and 9-score method15) or newly developed (new 8-score method, 9-score method and 12-score method). Among the five methods, the newly designed 12-score method showed high inter-rater consistency and strongest ability to predict new fracture and re-collapse of augmented vertebrae. Our results suggest that the distribution of bone cement correlates negatively with re-collapse of augmented vertebrae and positively but weakly with new fracture. In this scoring system, a score of 10 appears to indicate the best balance between preventing new fracture and re-collapse of augmented vertebrae.

Bone cement injected during vertebral augmentation is commonly measured in terms of amount and morphological distribution12,13,14,15,16,17,18,19,22. However, the amount of bone cement stored in the fractured vertebrae alone is not an ideal index because of variations in vertebral body size and OVCF severity14,15,16. Although computed tomography is theoretically the most accurate method to calculate the ratio of bone cement volume to vertebral volume14,15,16, it is complex and inconvenient for intraoperative use, and it does not take into account morphological features13,19,24. In contrast, methods based on X-ray images can show whether the bone cement touches the upper and lower endplates22, whether it crosses the midline and is continuous12,24, and the amount of bone cement deposited12,13,15,16; all these factors are related to clinical outcomes. So far, however, methods based on X-ray images have not been assessed in terms of inter-rater consistency, sensitivity, or specificity.

All five evaluation methods for bone cement distribution based on X-ray imaging showed good consistency in this study. Although the inter-rater consistency of the 12-score method in this study was not as good as that of other methods, as indicated by kappa values 0.652, it had the largest AUC in predicting re-collapse of treated vertebrae and new fractures. This method takes into account the morphological characteristics of bone cement in all directions as well as the degree of filling, which may help explain its high prediction ability. Linear regression based on the 12-point method showed that bone cement distribution had a nearly negative correlation with re-collapse of augmented vertebra, but a weak positive correlation with new fracture. The two curves intersected at a score of 10, indicating that patients with this score are at lowest risk of re-collapse and new fracture. This score may be a clinically useful target to achieve, as a complement or alternative to the target of 19.78–25% bone cement volume fraction suggested by some studies14,15,16. The score could be determined intraoperatively using C-arm radiography.

Our study further shows that the distribution of bone cement affects risk of re-collapse of augmented vertebra and new fracture. Linear regression also showed that the probability of re-collapse of augmented vertebrae decreased by 14.4% for each quadrant increase in cement distribution. This provides strong evidence to support the repeated needle insertion technique20 in order to prevent re-collapse; the rationale of this technique is that repeated puncture into the unfilled area can distribute bone cement more uniformly and adequately. For new fractures, none of the evaluation methods in the present study showed a high predictive effect. The AUC of the 12-score method was the largest, but it was only 0.591, and linear regression showed a regression coefficient of only 0.033. These results suggest that bone cement distribution influences risk of new fracture much less than risk of re-collapse.

According to the literature, new fracture and re-collapse are related to a number of factors, including low BMD and BMI, presence of a cleft sign, reduction degree, vertebral augmentation, old fracture, thoracolumbar location, advanced age, and less bone cement17,18,19,21,28,29,30,31,32,33. However, which of these factors play stronger roles is unclear. Our study found patients with or without new fracture to differ in terms of age, number of augmented vertebrae, fracture cause, cleft sign and thoracolumbar location, while patients with or without re-collapse differed in number of augmented vertebrae, puncture method and thoracolumbar location. In this way, our analysis on which it is based may help clarify differential risk factors—besides bone cement distribution—for the two outcomes of re-collapse and new fracture.

There are several limitations of this study to consider. First, we applied methods based on calculation of scores rather than precise quantitative metrics in order to ensure ease of implementation. Second, since most of our patients experienced different degrees of pain during follow-up and therefore underwent X-ray examination, the proportions of patients with new fracture or re-collapse in our study may be relatively high. Our findings should be confirmed and extended in prospective studies. Third, we cannot rule out that quadrants in different positions of the vertebral body may differently influence risk of re-collapse and new fracture, although such differential influence should be minimal when bone cement has covered most of the quadrants of the vertebral body. Future studies should examine the differential influence of individual quadrants, depending on their location.

Conclusion

Evaluating bone cement distribution based on X-ray images can be convenient and practical, and it can reliably predict new fracture and re-collapse of augmented vertebrae. The 12-score evaluation method showed good inter-rater consistency and strong ability to predict new fractures and re-collapse. A score of 10 with this method appears to be associated with the lowest risk of re-collapse and new fracture.