Introduction

Femoral neck fractures are among the most common orthopedic injuries in the elderly. In 1990, the estimated number of femoral neck fractures was 1.66 million worldwide, per year. Further, the incidence is increasing, with the number of femoral neck fractures projected to reach up to 6.26 million by the year 20501. Compared with other fractures, femoral neck fractures exhibit specific consequences in blood supply, displacement and shear forces, often resulting in fracture nonunion2 and femoral head necrosis3. Even though the treatment history of femoral neck fracture is over 400 years old and has made considerable progress, numerous problems remain to be resolved because of above characteristics of the injury. Thus far, treatments for displaced intracapsular femoral neck fracture have included closed or open reduction and internal fixation (IF), hemiarthroplasty (HA), and total hip replacement (THR). Notably, the indications for particular treatment modalities are very heterogeneous among orthopaedic surgeons, although establishing algorithms and hospital care pathways have recently been focused on4. Great attention should be paid to treatment these fractures.

Numerous published randomized controlled trials (RCTs) concerning this topic have compared the efficacy of treatment among IF, THR, and HA5,6,7,8,9,10,11,12,13,14. In addition, many meta-analyses and systematic reviews have assessed comparisons of IF versus HA15, IF versus THR16, cemented HA versus uncemented HA17, unipolar HA versus bipolar HA18, and THR versus HA19. When considering uncemented and cemented HA, Parker et al. reported that unipolar uncemented HA offers less reoperation, similar mortality and function compared with IF20, while Hedbeck et al. demonstrated that unipolar cemented HA has less reoperation, similar complications and mortality, better health-related quality of life, compared with IF11. Deangelis et al. reported that the use of cemented or uncemented femoral components was associated with similar functional outcomes during 1 year12. When considering unipolar or bipolar, Hedbeck et al. described that unipolar and bipolar HA appeared to produce equivalent clinical outcomes after 1 year21. However, in contrast, bipolar HA resulted in better health-related quality of life beyond the first 2 years following surgery, compared to unipolar HA, in a study performed by Inngul et al.10. When considering uncemented or cemented THR, no direct evidence exists to evaluate relative performances. RCTs comparing uncemented and cemented THR have focused on whether either of these treatments provide better outcomes than IF in hip function14,22.

Therefore, all the 7 procedures of a direct or indirect comparison under displaced intracapsular femoral neck fractures lacked. Thus, to comprehensively evaluate the efficacy and complications of the 7 surgical procedures, we performed a Bayesian network meta-analysis via a global search of published RCTs on this topic, providing evidence for clinical decision-making.

Methods

Inclusion Criteria

(1) Trials: RCTs. (2) Participants: Elderly patients (≥65 years) suffering from displaced intracapsular femoral neck fractures. (3) Interventions: Interventions studied in the meta-analysis were IF, unipolar uncemented HA, bipolar uncemented HA, unipolar cemented HA, bipolar cemented HA, uncemented THR, and cemented THR. (4) Outcomes: reoperation, mortality, dislocation, infection. Specifically, the definition of reoperation was taken to be secondary surgery caused by any reason. Infection included both superficial and deep wound infection, but not respiratory or urinary tract or other systemic infection.

Exclusion criteria were as follows: patients being unfit for arthroplasty or IF, previous hip pathology (osteoarthritis, malignant disease, infectious disease), and the age was <65 year old.

Literature Search

We searched PubMed (1966–2015.8), EMBASE (1974–2015.8), and the Cochrane library (Issue 8 of 12, August 2015) using a searching strategy that combined MeSH/Emtree terms and free text words: “Femoral Neck Fractures,” Femoral Neck Fractures, “Randomized Controlled Trial,” randomized controlled trial*, single-blind method, single blind*, double-blind method, double blind*, triple blind*, random allocation, random allocation*, randomly allocation*. Retrieval dates were from the time of database creation to 31 August 2015. The language should be English. The search strategy for Medline (Pubmed) was in Supplementary Table S1.

Data Extraction and Quality Evaluation

Two investigators (Peng-Fei Wang and Hai Huang) were responsible for independently reading all titles, abstracts, and full texts using the following steps: (1) examining titles and abstracts to remove obviously irrelevant studies, (2) retrieving the full texts of potentially relevant trials, (3) examining the full texts for compliance with eligibility criteria, and (4) making final decisions on study inclusion and proceeding to data collection. From the studies included, the investigators extracted baseline information on subjects (e.g., treatment strategy, approach, and outcomes) and detailed methods used in the study design (e.g., publication year, study settings, designs, method of randomization, allocation concealment, and blinding). Disagreements were resolved via discussion with a third investigator (Hu Wang). When continuous variables were described as median, these values were translated into mean or aggregated depending on researcher recommendation23.

Each study was independently assessed for its methodological quality by the previous investigator. The criteria for methodological quality were based on those described in the Cochrane Reviewers’ Handbook 5.1.024, including selection bias, performance bias, detection bias, attrition bias, reporting bias domains, random sequence generation, allocation concealment, blinding, incomplete outcome data, selective reporting, and other biases.

Statistical Methods

The analysis was primarily based on Multiple Treatments Meta-analysis, as described by Salanti et al.25. We used the Bayesian method based on the Chaimani model for binary variables (utilizing random effects models). Statistical analysis of these variables was based on binomial likelihoods, with vague priors for the trial baselines, basic parameters (normal distribution with mean 0 and standard deviation 0.0001), and random effects standard deviation (uniformly distributed i in the interval 0 to 2). We used the Bayesian method based on the random effects of the Dias model for continuous variables. Statistical analysis was based on deviance contribution, with vague priors for the trial baselines, basic parameters (normal distribution with mean 0 and standard deviation 0.0001), and random effects standard deviation (uniformly distributed i in the interval 0 to 5). Comparative effectiveness of the treatments was expressed as the odds ratio (OR) or weighted mean difference (MD), with 95% credibility intervals (CrIs). The CrI is the Bayesian analog to confidence intervals used in traditional frequentist statistical approaches. We considered a result “significant” if the CrI did not include OR = 1. We also ranked meaningfully different procedures in terms of their likelihood of leading to the best results for each outcome. The meta-analysis was performed using Winbugs version 1.4.3 (Imperial College and MRC, UK). Further, we checked the consistency in closed loops using inconsistency factors and calculated the contribution of direct estimates effect of each pair in the entire network with STATA version 12.0 (STATA Corporation, College Station, TX, USA), under the traditional frequentist statistical approaches.

Results

Process for Selecting Trials

In total, 3497 potentially relevant studies were identified and screened for retrieval (Fig. 1). From this initial study group, 698 studies were excluded because of duplications, and 2681 studies were excluded after reading their titles and abstracts. Among the remaining 118 studies, there were 3 conference abstracts, 55 reviews, and 12 RCTs comparing the treatment with others. Subsequently, 48 RCTs were assessed for eligibility. After reading, 5 RCTs were excluded due to more than 1 type of intervention being implemented in the study group. Finally, 40 studies (85 arms)5,6,7,8,9,10,11,12,13,14,20,21,22,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55 were included in this meta-analysis.

Figure 1
figure 1

Flowchart of studies included in the meta-analysis.

Characteristics of Included Trials and Quality Evaluation

The main characteristics of the included trials are listed in Table 1. The number of patients participating in studied RCTs varied from 20 to 455. A total of 6141 patients with femoral neck fractures were included in the meta-analysis; of these, the treatment breakdown included 1432, 1194, 821, 1511, 252, 229, and 702 in the IF, unipolar cemented HA, unipolar uncemented HA, bipolar cemented HA, bipolar uncemented HA, uncemented THR, and cemented THR groups, respectively. Five studies32,36,43,50,51 contained 3 arms; however, the remainder were 2-arm studies. In the IF group, cancellous lag screws or sliding hip screw (with or without antirotational screws) were used. In the other 6 groups, most patients underwent a modified Hardinge or a posterior approach. The follow-up period varied from 6 to 204 months. Eighteen studies did not report the number lost during follow-up6,11,22,27,32,33,34,35,36,37,41,42,43,44,50,51,52,53.

Table 1 Main characteristics of the trials included in the meta-analysis.

Most of the 40 studies reported random sequence generation from computerized randomization or random numbers (Table 2), there were 24 low risk, 1 high risk, and 15 unclear in random sequence generation. Allocation concealment was detailed in 23 studies, using the sealed-envelope technique, there were 23 low risk, and 17 unclear in allocation concealment. In blinding, there were 15 low risk, 16 high risk, and 9 unclear. In incomplete outcome data and selective reporting, all of studies were unclear.

Table 2 Quality evaluation of included trials.

Evidence Network

The various procedures and number of studies and patients per direct comparison included in our network-analysis are shown in Fig. 2. The thickness of lines represents the number of studies; and the blue spots, the number of patients.

Figure 2
figure 2

Direct comparisons in the network model.

Primary End Point

Reoperation

The contribution of the direct estimates effect of each pair in the entire network is shown in Fig. 3. Mixed estimates included direct and indirect estimates; the indirect estimate was constructed via intermediary. For example, in the mixed estimates of reoperation, the direct comparison result of bipolar cemented HA versus bipolar uncemented HA provided 79.4% weights for the mixed estimates, and this pair contributed a proportion of 39.5% to the result of bipolar cemented HA versus IF, a proportion of 32.8% to the result of bipolar cemented HA versus unipolar cemented HA, a proportion of 27.3% to the result of bipolar uncemented HA versus uncemented THR, and so on. In the mixed estimates reoperation of bipolar cemented HA versus cemented THR, the proportion from direct comparison was 61.1%. This pair contributed 28.3% weights to the result of cemented THR versus unipolar cemented HA and 23.6% to the result of cemented THR versus uncemented THR. In the network, other pairs were similar to the above comparisons.

Figure 3
figure 3

Contribution of the direct estimates effect of each pair in the entire network for reoperation (IF, internal fixation; unipolar-cemented-HA, U-C-HA; unipolar-uncemented-HA, U-U-HA; bipolar-cemented-HA, B-C-HA; bipolar-uncemented-HA, B-U-HA; uncemented-THR, U-THR; cemented-THR, C-THR).

Inconsistency in closed loops was assessed using inconsistency factors (Fig. 4). Any 3 treatments forming a direct triangular connection were assessed for inconsistency. If the lower limit of the 95% confidence interval in any 1 of the closed loops did not reach 0, this insinuates that a statistical difference may exist in the inconsistency hypothetical test. No loop indicated the possibility of inconsistency (Fig. 4).

Figure 4
figure 4

Inconsistency in the closed loops of reoperation. (IF, internal fixation; unipolar-cemented-HA, U-C-HA; unipolar-uncemented-HA, U-U-HA; bipolar-cemented-HA, B-C-HA; bipolar-uncemented-HA, B-U-HA; uncemented-THR, U-THR; cemented-THR, C-THR).

Table 3 displays the effect estimates of reoperation (lower left, italic). Overall, the results showed that the reoperation incidence was higher in the IF group than in the other groups (compared to unipolar cemented HA [OR = 13.39; 95% CrI 6.21 to 26.06], unipolar uncemented HA [OR = 3.80; 95% CrI 1.72 to 7.43], bipolar cemented HA [OR = 8.10; 95% CrI 4.20 to 14.16], bipolar uncemented HA [OR = 6.87; 95% CrI 1.16 to 23.30], uncemented THR [OR = 9.93; 95% CrI 3.13 to 25.23], and cemented THR [OR = 11.45; 95% CrI 5.48 to 21.88]). The reoperation incidence was lower in the unipolar cemented HA group than in the unipolar uncemented HA group (OR = 0.31; 95% CrI 0.12 to 0.66). In the remainder of comparisons, the results did not indicate any significant differences. Based on the above outcomes, we drew the rank of these 7 procedures under the reoperation incidence using surface under the cumulative ranking curve (SUCRA). The rank of potential reoperation from low incidence to high incidence was as follows: 1 for unipolar cemented HA; 2, cemented THR; 3, uncemented THR; 4, bipolar cemented HA; 5, bipolar uncemented HA; 6, unipolar uncemented HA; and 7, IF.

Table 3 Main findings of reoperation and mortality.

Mortality

The contribution each pair in the entire network is shown in Fig. 5. The inconsistency in closed loops was assessed using inconsistency factors (Fig. 6). Notably, the bipolar cemented HA-cemented THR-IF loop indicated the possibility of inconsistency, as the 95% confidence interval was 0.11 to 1.39, P = 0.021. Table 3 also displays the effect estimates of mortality (top right, bold). Mortality during the follow-up did not differ significantly. Overall, the results showed that the mortality incidence in these 7 groups was similar. Thus, no drawing or ranking of order under mortality was necessary.

Figure 5
figure 5

Contribution of the direct estimates effect of each pair in the entire network for mortality (IF, internal fixation; unipolar-cemented-HA, U-C-HA; unipolar-uncemented-HA, U-U-HA; bipolar-cemented-HA, B-C-HA; bipolar-uncemented-HA, B-U-HA; uncemented-THR, U-THR; cemented-THR, C-THR).

Figure 6
figure 6

The inconsistency in closed loops of mortality (IF, internal fixation; unipolar-cemented-HA, U-C-HA; unipolar-uncemented-HA, U-U-HA; bipolar-cemented-HA, B-C-HA; bipolar-uncemented-HA, B-U-HA; uncemented-THR, U-THR; cemented-THR, C-THR).

Infection

The contribution of each pair in the entire network is shown in Fig. 7. The inconsistency in closed loops was assessed using inconsistency factors (Fig. 8). The bipolar cemented HA-IF-unipolar cemented HA loop indicated potential inconsistency (P = 0.019). Table 4 displays the effect estimates for infection and dislocation. Regarding infection, a few significant differences were observed. Overall, the results showed that the infection incidence in IF group was lower than that in the unipolar uncemented HA group (OR = 0.49; 95% CrI 0.20 to 0.99). In the bipolar uncemented HA group, the infection incidence was lower than that in the IF (OR = 39.48; 95% CrI 1.00 to 203.4), unipolar cemented HA (OR = 70.60; 95% CrI 1.88 to 391.30), unipolar uncemented HA (OR = 95.06; 95% CrI 2.16 to 443.90), bipolar cemented HA (OR = 70.57; 95% CrI 1.94 to 362.20), uncemented THR (OR = 0.07; 95% CrI 0.00 to 0.44), and cemented THR groups (OR = 0.18; 95% CrI 0.00 to 0.86). In the remainder of comparisons, the results did not indicate significant difference. Based on the above outcomes, we drew the rank of these 7 procedures by “SUCRA”. The rank of potential infection, from low to high incidence, was as follows: 1 for bipolar uncemented HA; 2, IF; 3, cemented THR; 4, bipolar cemented HA; 5, unipolar cemented HA; 6, unipolar uncemented HA; and 7, uncemented THR.

Figure 7
figure 7

Contribution of the direct estimates effect of each pair in the entire network for infection (IF, internal fixation; unipolar-cemented-HA, U-C-HA; unipolar-uncemented-HA, U-U-HA; bipolar-cemented-HA, B-C-HA; bipolar-uncemented-HA, B-U-HA; uncemented-THR, U-THR; cemented-THR, C-THR).

Figure 8
figure 8

Inconsistency in closed loops of infection (IF, internal fixation; unipolar-cemented-HA, U-C-HA; unipolar-uncemented-HA, U-U-HA; bipolar-cemented-HA, B-C-HA; bipolar-uncemented-HA, B-U-HA; uncemented-THR, U-THR; cemented-THR, C-THR).

Table 4 Main findings of infection and dislocation.

Dislocation

The contribution of each pair in the entire network is shown in Fig. 9. The inconsistency in closed loops was assessed using inconsistency factors (Fig. 10). Table 4 also displays the effect estimates of dislocation. For dislocation during follow-up, a few significant differences were observed. Overall, the results showed that the dislocation rate in the IF group was lower than that in the other groups (compared to unipolar cemented HA [OR = 0.23; 95% CrI 0.04 to 0.68] and unipolar uncemented HA [OR = 0.20; 95% CrI 0.05 to 0.53]). The dislocation incidence was higher in the uncemented THR group than in the IF (OR = 0.003; 95% CrI 0.00 to 0.03), unipolar cemented HA (OR = 0.02; 95% CrI 0.00 to 0.15), unipolar uncemented HA (OR = 0.02; 95% CrI 0.00 to 0.18), bipolar cemented HA (OR = 0.01; 95% CrI 0.00 to 0.08), bipolar uncemented HA (OR = 0.01; 95% CrI 0.00 to 0.07), and cemented THR groups (OR = 6.44E + 32; 95% CrI 2.52 to 1.07E + 30). The dislocation incidence was higher in the cemented THR group than in the IF (OR = 0.08; 95% CrI 0.02 to 0.19), bipolar cemented HA (OR = 0.21; 95% CrI 0.06 to 0.51), and bipolar uncemented HA groups (OR = 0.24; 95% CrI 0.03 to 0.83). In the remainder of the comparisons, the results did not indicate significant difference. Based on the above outcomes, we drew the rank of these 7 procedures regarding infection incidence by “SUCRA”. The rank of potential dislocation, from low incidence to high incidence, was as follows: 1 for IF; 2, bipolar uncemented HA; 3, bipolar cemented HA; 4, unipolar cemented HA; 5, unipolar uncemented HA; 6, cemented THR; and 7, uncemented THR.

Figure 9
figure 9

Contribution of the direct estimates effect of each pair in the entire network for dislocation (IF, internal fixation; unipolar-cemented-HA, U-C-HA; unipolar-uncemented-HA, U-U-HA; bipolar-cemented-HA, B-C-HA; bipolar-uncemented-HA, B-U-HA; uncemented-THR, U-THR; cemented-THR, C-THR).

Figure 10
figure 10

Inconsistency in closed loops of dislocation (IF, internal fixation; unipolar-cemented-HA, U-C-HA; unipolar-uncemented-HA, U-U-HA; bipolar-cemented-HA, B-C-HA; bipolar-uncemented-HA, B-U-HA; uncemented-THR, U-THR; cemented-THR, C-THR).

Discussion

Traditionally, the selection of the appropriate treatment for patients should depend on the age, location of fracture, orientation, comminution, type, stability, and requirements of postoperative functional recovery and so on. In this study, we used Bayesian network meta-analysis to compare efficacy and complications between 7 procedures.

Reoperation is an important end point of efficacy. The results demonstrate that patients in the IF group had the highest incidence of arthroplasty. Simultaneously, the unipolar cemented HA group exhibited a lower incidence than did the unipolar uncemented HA group and may provide the lowest reoperation incidence in all of managements, the next order is cemented-THR. In addition, procedures with bone cement tend to have lower reoperation incidence than those without cement: unipolar cemented HA < unipolar uncemented HA, bipolar cemented HA < bipolar uncemented HA, and cemented THR < uncemented THR. This suggests a role for cement in fixing stems and reducing the requirement for reoperation, which may be protective for patients. In additional, Inngul et al. reported that bipolar HA could display a later onset of acetabular erosion compared to unipolar HA10,56. Thus, the reoperation incidence in the bipolar HA group is expected to be at least the equivalent to unipolar HA. Two meta-analyses comparing unipolar and biopolar HA support this conjecture57,58. But in our analysis, unipolar cemented HA exhibited lower reoperation incidence than other managements, and notably lower than did bipolar HA. We think that cementing has such a large influence on unipolar HA that unipolar cemented HA exhibits a lower reoperation incidence than do other managements. Typically, the main reason for reoperation in IF is loss of fixation, whilst in arthroplasty, it is the revision typically required following acetabular erosion, dislocation, periprosthetic fracture, pain, and loosening10,21,59.

The rank of dislocation incidence in our meta-analysis shows another tendency: IF < bipolar HA < unipolar HA < THR. The IF group exhibited the lowest dislocation rate because of a tendency to fail in fixation rather than dislocation. THR had a higher dislocation incidence because the existence of an acetabular cup may destroy bone mass. Uncemented THR contributed the highest dislocation incidence, when compared to other groups, in this meta-analysis. Further, low dislocation incidence in bipolar HA corresponds with the theory that bipolar HA may reduce the amount of acetabular erosion, compared to unipolar HA60.

Mortality constitutes another important clinical factor, and we found that there was no difference between interventions, which indicates that the type of operation does not obviously influence mortality. Complications illustrate that the infection incidence in bipolar uncemented HA is the lowest of all managements, a significant difference compared to other groups. Unfortunately, no immediate explanation is apparent for these findings.

Our meta-analysis has several potential limitations that should be considered. First, Bayesian network meta-analysis is an emerging and encouraging research method, which is not very perfect and convincing, but we could not completely refuse the evidence from it because it provided direct and indirect estimates, and the possible order of managements. In our meta-analysis, the 95% CrI varied greatly, and we suggest that this may be due to differences in RCTs. Indicators were analyzed and discussed as the results of ranking, which provides a tendency, and the contribution from direct comparison is only a little part of all. Therefore, systematic bias is possible in this study. Second, confounding factors involving surgical approach and variation from IF may contribute to differences in the findings. A direct anterior (Smith-Peterson), anterolateral (Watson-Jones), lateral (Hardinge), posterior (Moore), or posterolateral approach could be used to perform hip arthroplasty38,61,62,63,64. Although meta-analyses have reported similar outcomes and complication incidences between operative approaches so far65,66, a higher dislocation incidence may exist when using the posterior approach67, contributing heterogeneity to the overall outcome. In addition, IF is divided into closed and open reduction depending on whether cancellous lag screws or dynamic hip screws are used, which is another source of heterogeneity. Finally, according to the methodological quality items in Cochrane Reviewers’ Handbook 5.1.024, different bias might be introduced in the study, especially selective reporting bias, because all of studies were unclear. We should be cautious to the conclusion.

In conclusion, our network meta-analysis rank the orders of 7 procedures in reoperation, mortality, dislocation and infection, which indicates that IF may provide the highest reoperation incidence, unipolar cemented HA may provide the lowest reoperation incidence; uncemented THR contributes the highest dislocation incidence; and bipolar uncemented HA provides the lowest infection incidence. No differences in mortality were observed among the treatments. This conclusion is indirect; higher-quality direct comparisons are required.