Introduction

Total hip arthroplasty (THA) is a well-established and successful procedure to treat patients with osteoarthritis or injuries of the hip or other forms of joint degeneration. With 233,537 cases, THA was one of the ten most commonly performed surgeries in Germany in 20211. With life expectancy on the rise, the likelihood of a reoperation and even multiple reoperations after THA increases. In around 42% of THAs, the estimated survival time of the implant is lower than 25 years, and revision surgery is required2. Although hip implants vary in design, conventional hip implants consist of a stem that is fixed to the femur, an acetabular component fixed to the pelvis, a femoral head which is connected with the stem, and an insert for the acetabular component. The interaction between the femoral head and the insert is referred to as the bearing surface: this is where the joint actually moves and as such where friction and wear take place. In addition to other factors such as patient expectation, BMI, age, sex, comorbidities, length of hospital stay, and the type of surgical approach, the choice of the prosthesis design plays a crucial role for patient satisfaction3,4,5,6. Low wear rates and high survival rates are important factors for selecting prosthetic design and material.

The most common bearing surfaces utilized in THA include metal-on-polyethylene (MoP), metal-on-metal (MoM), ceramic-on-ceramic (CoC), and ceramic-on-polyethylene (CoP)7. The most commonly used metals are cobalt-chromium alloys (CoCr), stainless steel or oxidized zirconium (OxZr). Ceramic materials, such as aluminium oxide (Al2O3), zircon oxide (ZrO2) and alumina toughed zirconia (AMC/ZTA), are used for head and liner. Polyethylene liners can be made of conventional ultrahigh-molecular-weight polyethylene (UHMWPE), or by further crosslinking of UHMWPE to crosslinked polyethylene (XLPE), moderately cross-linked polyethylene (MXLPE), highly cross-linked polyethylene (HXLPE) and Vitamin E-infused HXLPE (HXLPE-VEPE).

Wear between the bearing surfaces is a complex phenomenon involving material characteristics, lubrication and friction8. Different wear phenomena, such as abrasion, adhesion, and tribocorrosion, can occur between the bearing surfaces and lead to material loss and debris production8. The production of wear particles can lead to pseudotumor formation and aseptic loosening, with bone loss as a consequence of biological reaction from abrasive particles9, 10. Apart from wear between the bearing surfaces, wear in hip implants can occur at modular junctions such as the taper and neck11,12,13 or between the acetabular component and liner14 from micromotion.

Wear in the bearing surface of the patient’s hip implant can be estimated on radiolographic images in different ways. One way is to determine the one-dimensional linear femoral head penetration. This can be accomplished measuring the centre difference of femoral head and liner, manually or -mostly used- by computer-assisted techniques15. Another method is to consider volumetric wear as the material loss in all three dimensions. During their lifetime, the wear rate of implants varies. Higher wear rates are observed in the running-in phase during the first one million (walking-)cycles, usually during the first 12 months from THA. Later, the wear coefficient decreases in the steady state phase8.

Relevant factors for implant survival are the surgeon, the patient, his/her activity, and the choice of implant. A number of meta-analyses have investigated wear and revision rates of different bearing surface materials, yet mostly focussing on one material or the comparison of two different material combinations16,17,18,19,20,21,22.

To support the choice of the material for the bearing surface of a hip replacement, we performed a Bayesian network meta-analysis where we looked into different material combinations of head and liner with respect to revision interval, total wear penetration and wear penetration per year. The following material combinations were studied: Al2O3–Al2O3, AMC/ZTA–AMC/ZTA, CoCr–AMC/ZTA, CoCr–CoCr, CPE/UHMPE–ZrO2, CPE/UHMPE–CoCr, CPE/UHMPE–Al2O3, CPE/UHMPE–OxZr, HXLPE–CoCr, HXLPE–Al2O3, HXLPE–ZrO2, HXLPE–AMC/ZTA, HXLPE–Stainless-Steel, HXLPE–VEPE–CoCr, HXLPE-VEPE–AMC/ZTA, MXLPE–CoCr, MXLPE–AMC/ZTA, XLPE–CoCr, XLPE–Al2O3, XLPE–OxZr, XLPE-VEPE–CoCr.

Methods

Eligibility criteria

All clinical investigations which compared two or more material combinations for head and inlay in THA were accessed. Only studies published in peer-reviewed journals were considered. According to the authors´ language capabilities, articles in English and German were eligible. Only studies with level I to IV of evidence, according to Oxford Centre of Evidence-Based Medicine1, were considered. Reviews, opinions, letters, and editorials were not considered. Animal studies, in vitro, biomechanics, computational, and cadaveric studies were not eligible. Missing quantitative data under the outcomes of interests warranted the exclusion of the study.

Search strategy

This study was conducted according to the PRISMA extension statement for reporting of systematic reviews incorporating network meta-analyses of health care interventions: checklist and explanations23. The PICOT algorithm was preliminary pointed out:

  • P (Problem): End stage hip OA;

  • I (Intervention): THA;

  • C (Comparison): Different material combinations of head and inlay;

  • O (Outcomes): Rate of revision surgery, total wear penetration, wear penetration per year

  • T (Timing): Minimum 12 months follow-up.

In September 2023, the following databases were accessed: PubMed, Scopus, Embase, Google Scholar, Cochrane. A time constraint was set from January 2000 to September 2023. The following matrix of keywords were used in each database to accomplish the search using the Boolean operator AND/OR: THA AND (hip OR arthroplasty OR replacement OR prosthesis) AND (metal OR ceramic OR alumina OR zirconia OR polyethylene OR steel) AND (wear OR revision). No additional filters were used in the databases search.

Selection and data collection

Two authors (F.M. and R.M.) independently performed the database search. All the resulting titles were screened by hand and, if suitable, the abstract was accessed. The full-text of the abstracts which matched the topic of interest were accessed. If the full-text was not accessible or available, the article was not considered for inclusion. A cross reference of the bibliography of the full-text articles was also performed for inclusion. Disagreements were debated and mutually solved by the authors. In case of further disagreements, a third senior author (J.E.) took the final decision.

Data items

Two authors (R. M. and F. M.) independently performed data extraction. The following data at baseline were extracted: author, year of publication and journal, length of the follow-up, number of patients with related mean age and BMI (Kg/m2). The following data were collected at last follow-up: inlay wear penetration (mm), inlay wear penetration per year (mm/year), rate of revision.

Assessment of the risk of bias and quality of the recommendations

Two reviewers (U.K.H. and F.M.) evaluated the risk of bias of the extracted studies independently. The included studies were evaluated using the risk of bias of the software Review Manager 5.3 (The Nordic Cochrane Collaboration, Copenhagen)24. The following endpoints were evaluated: selection, detection, performance, attrition, reporting, and other bias.

Synthesis methods

The statistical analyses were performed by one author (F.M.) following the recommendations of the Cochrane Handbook for Systematic Reviews of Interventions25. For descriptive statistics, mean and standard deviation were used. For baseline comparability, the IBM SPSS software was used. Comparability was assessed through the Analysis of Variance (ANOVA), with P > 0.1 considered satisfactory. The network analyses were made through the STATA/MP software (Stata Corporation, College Station, Texas, USA). Only studies which stated clearly the nature of the material of the component (head and/ or liner) were included in the analyses. An overview of the material combinations of head and liner included in the present Bayesian network meta-analysis is shown in Table 1.

Table 1 Material combinations of head and liner included in the present Bayesian network meta-analysis.

The analyses were performed through the Stata routine for Bayesian hierarchical random-effects model analysis. Continuous variables were analysed through the inverse variance method, with the standardized mean difference (SMD) effect measure. Binary data were analysed through the Mantel–Haenszel method, with the Log Odd Ratio (LOR) effect measure. Edge, interval, and funnel plots were performed and analysed. The overall transitivity, consistency, and heterogeneity, as well as the size of the treatment effect of interest within-study variance, were evaluated. The overall inconsistency was evaluated through the equation for global linearity via the Wald test. In PWald values > 0.05, the null hypothesis could not be rejected, and the consistency assumption could be accepted at the overall level of each treatment. Confidence and percentile intervals (CI a d PrI, respectively) were each set at 95%.

Ethical approval

This study complies with ethical standards.

Results

Study selection

The initial databases search resulted in 22,423 articles. Of these, 5567 duplicates were excluded. After screening titles and abstracts 16,443 articles were excluded because they did not match the following eligibility criteria: not comparing two or more bearing material combinations, not mentioning rate of revision surgery or wear related values, no matching study design, not focusing on THA. Of the remaining 413 articles, another 274 were excluded because they did not report quantitative data for wear penetration, or rate of revision surgery, or the follow up time was shorter than 12 months. Finally, 139 studies were included in this review. The results of the literature search are shown in Fig. 1.

Figure 1
figure 1

PRISMA flow chart of the literature search.

Risk of bias assessment

The Cochrane risk of bias tool was performed to investigate the risk of bias of the included studies. Given the number of retrospective studies included in the present investigation, the risk of selection bias was moderate. Few authors performed assessor blinding, leading to a moderate risk of detection bias. The risk of attrition and reporting biases was moderate, as was the risk of other bias. Concluding, the risk of bias graph evidenced a moderate quality of the methodological assessment (Fig. 2).

Figure 2
figure 2

Cochrane risk of bias tool.

Study characteristics

Data from 663,038 THAs were collected. 55% of patients were women. The mean patient age was 59.0 ± 8.1 years, the mean BMI was 27.6 ± 2.6 kg/m2. The mean length of follow-up was 87.9 ± 46.3 months. At baseline, no statistically significant difference was found in mean age, BMI, and mean length of follow-up (P > 0.5). The generalities and demographic and further basic data of the included studies are shown in Table 2.

Table 2 Generalities and patient baseline data of the included studies.

Synthesis of results

The combination of Al2O3 head and Al2O3 liner demonstrated the lowest wear penetration at last follow-up (Fig. 3) and the lowest rate of wear penetration per year (Fig. 4).

Figure 3
figure 3

From left to the right: edge, funnel and interval plots of the comparison: overall wear penetration.

Figure 4
figure 4

From left to the right: edge, funnel and interval plots of the comparison: wear penetration per year.

The combination of HXLPE head and ZrO2 demonstrated the lower rate of revision at last follow-up (Fig. 5). The equation of global linearity found no statistically significant inconsistency in all comparisons.

Figure 5
figure 5

From left to the right: edge, funnel and interval plots of the comparison: rate of revision at last follow-up.

Discussion

The choice of the best hip implant design and material of the bearing surface is crucial for patient satisfaction and longevity of the prosthesis. Different factors must be taken into account when choosing the best material combination for each patient. An important factor for the choice of the bearing surface biomaterial is wear, which remains a major problem in the long run leading to potentially aseptic loosening, pseudotumor formation, and pain. This network meta-analysis was conducted to compare the currently used material combinations for hip implant bearing surface regarding wear penetration, yearly penetration rate and revision surgeries.

In this Bayesian network meta-analysis, the combination of Al2O3 head and Al2O3 liner demonstrated the lowest wear penetration at last follow-up as well as the lowest rate of wear penetration per year. On the other hand, the combination of a HXLPE liner and ZrO2 head demonstrated the lowest rate of revision at last follow-up. Mean age, mean BMI, and mean length of the follow-up had no significant influence on wear behaviour and revision rate.

In general, bearing surfaces in hip implants can be distinguished in hard on soft bearings (with a polymeric material used for the liner and the hard femoral head) and in hard on hard (MoM or CoC) bearings. Given the hardness difference of the articulating partners, hard on hard bearings show lesser and smaller wear particles than hard on soft bearings7, 26, 27.

Wear modes in a tribological system depend on its structure, kinematic interactions, and the combination of wear phenomena. Wear modes are dynamic, and can change over time. Wear modes can be distinguished in normal wear (mode 1), wear occurring due to bearing surfaces articulating against non-bearing surfaces (mode 2), three-body wear (mode 3), and two non-bearing surfaces wearing against each other (mode 4). In the presence of hard wear particles, particularly, polyethylene wear increases. Harder materials result in a reduced contribution of third-body wear to overall wear28, 29.

CoC bearings have been used in THA for a long time given their biocompatibility, high wear resistance and chemical durability7. Additionally, CoC bearing combinations have the advantage to produce smaller and inert debris compared to other bearing types, leading to harmless wear to the human body. For this reason, they are generally considered a good choice for young patients30. The first generations of alumina ceramics had a high risk of fracture, which was later reduced by improving their manufacturing process31. Despite further improvements, ceramics as bearing surfaces still have weaknesses such as bearing noise and reduced toughness, which led to the development of advanced material combinations, such as AMC/ZTA, for use as bearing materials. Biomechanical studies have shown that AMZ/ZTA ceramics exhibit lower wear rates under extreme conditions compared to Al2O332, 33. Nonetheless, our network meta-analysis found that Al2O3 ceramics had the lowest wear penetration rate per year and the least amount of wear at last follow up. This could possibly be explained by the fact that the latest material such as AMZ/ZTA is newer on the market and the average study duration is thus potentially shorter. We only included studies with a minimum duration of 12 months in our analysis; nevertheless, shorter study durations may overestimate debris and wear given the influence of running-in effects17.

Despite its good wear resistance, the Al2O3–Al2O3 combination did not exhibit the lowest revision rate in this meta-analysis. One major disadvantage of an Al2O3 combination are the disturbing noises which are associated with vibrations of the femoral implant system34, 35. Compared to MoP or MoM bearings, fracture of ceramic heads and liner still remains a major disadvantage for CoC bearings34. A study based on the Norwegian Arthroplasty Register found a 3.6 times higher occurrence of ceramic fracture in COC bearings compared to COP bearings. Furthermore, there was an elevated risk of fractures observed in Alumina ceramics compared to AMC heads36. Revision for ceramic fracture is of particular concern, as it can lead to catastrophic failures and severe complications because of third body wear caused by ceramic fragments37, 38. Additionally, the use of CoC bearings is expensive and requires an exquisite surgical insertion technique to avoid chipping off from contact surfaces39.

In this study, the combination of HXLPE liner and ZrO2 head demonstrated the lowest rate of revision at last follow-up. National registries are an important tool to compare revision rates of different material combinations. In the Australian Orthopaedic Association National Joint Replacement Registry (AOA) in 2022 Ceramised Metal head on XLPE liner exibit the lowest 10-year revision rate followed by ceramic head on XLPE liner, which, however, has the lowest 20-year revision rate with 6.8%. 20 year data for ceramised metal head on XLPE liner are not available yet40. The National Joint Registry (NJR) of England and Wales in 2022 reports ceramic on polyethylene to have the lowest 15 year revision rates for all fixation types41. The German Arthroplasty Registry (EPRD) registered the lowest 6-year-revision rate for CoC bearings for elective THA. Nevertheless, ceramic on HXLPE bearings were, with 49.2%, the most frequently used bearing type in Germany in 202142. Regarding NJR data in England and Wales, MOP is still the most commonly used bearing with decreasing tendency, while the use of CoP bearings increases41. Crosslinked polyethylene is listed as the most commonly used polyethylene type, with 97.2% in 2021 in Australia40. In general, low revision rates for CoP and CoHXLPE are mentioned across all the registries.

The German registry classifies polyethylene into different degrees of crosslinking such as UHMWPE, MXLPE, and HXLPE, whereas the NJR only considers polyethylene as a single category. Similarly, the materials of the heads are divided only into broad categories of metal and ceramic or partly ceramised metals by the NJR. As a result, it is not possible to conduct a detailed analysis of the material properties in registry studies. Additionally, in registries, implant combinations are selected for patients based on individual characteristics, making comparisons between implant combinations highly susceptible to bias. Systematic reviews and meta-analyses have been conducted to overcome these limitations. A few exceptions aside20, most review studies only offer analyses of two or three material combinations19, 43.

We performed a comprehensive Bayesian network meta-analysis investigating more than 600,000 THA with 23 different material combinations. As mentioned, in registry studies, CoP bearings exhibit low revision rates. Biomechanical studies found improved wear behaviour for HXLPE compared to PE, which should also entail a longer lifetime15, 44. Zirconia as material for hip implants head has promising properties. In 2001, however, the largest manufacturer of zirconia femoral heads recalled their products for problems with thermal processing associated with some batches producing higher fracture rates, leading to a loss of confidence in zirconia as a reliable orthopaedic biomaterial45, 46. ZrO2 hip implant heads are also mentioned to be prone to aging47. Nevertheless, ZrO2 is widely used in dental applications48. A registry study in 2012 stated that ZrO2 heads are inferior to metal heads regarding revision rate at 12 years49. Of note, most studies evaluating ZrO2 on HXLPE bearing surfaces included in this network meta-analysis were performed in Japan50,51,52,53. Demographic characteristics could thus influence the results of this study. Nevertheless, the positive results for ZrO2 heads observed in the present network meta-analyses may prompt surgeons to rethink their attitude towards this material. However, only few studies investigated the survival rate of zirconia in the last few years.

The present study has several limitations that should be considered when interpreting the results. First, the influence of the head diameter, the fixation technique of stem and cup as well as the orientation of the cup and liner were not analysed. A high inclination angle can cause an increase in liner wear54. The head diameter of the prosthesis is an important factor that can affect the performance of the prosthesis, especially regarding the risk of dislocation55, 56. A larger head diameter can lead to increased volumetric wear in polyethylene cups, while linear wear remains consistent57,58,59. From our analyses, we cannot tell whether certain materials were preferably used in specific sizes. Future studies should consider the influence of head diameter in their analyses. Second, other types of head designs such as dual mobility bearings or hip resurfacing were not explicitly described. Although we subdivided polyethylene into different categories based on the descriptions used in the studies (CPE/UHMWPE, XLPE, HXLPE, MXLPE, HXLPE-VEPE), there could be differences arising from different manufacturing techniques such as annealing and remelting of the polyethylene or amount of crosslinking60. Currently, different treatments, including irradiation and melting, irradiation and annealing, sequential irradiation with annealing, irradiation followed by mechanical deformation, and irradiation and stabilization with vitamin E are available61. Irradiating UHMWPE results in cross-linking between the molecular chains, which improves the mechanical and tribological properties of this cross-linked PE62. The offset of that is that crosslinking affects the mechanical properties of UHMWPE, usually resulting in a decrease in toughness, stiffness, and hardness of the polymer63. Despite that effect, cross-linked UHMWPE is presently the standard of care.

In addition to randomized controlled studies, prospective and retrospective studies were included in this meta-analysis to provide additional data, leading to a moderate risk of bias. Prospective and retrospective studies have a higher risk of bias than randomized controlled trials because they may not use random allocation to balance potential confounding variables between treatment groups. In addition, the quality of the included studies varied, with some studies having a high risk of bias or unclear methodological quality. Nevertheless, a patient and case specific implant choice has to consider patient factors such as age, activity level, and weight, surgical technique, and cost in addition to wear rate and revision rate. Additionally, further design criteria are mandatory to be taken into account, including the fixation technique of the cup and stem within the bone.

Conclusion

The combination of an Al2O3 head and an Al2O3 liner showed the lowest wear penetration at last follow-up, as well as the lowest rate of wear penetration per year. On the other hand, the combination of ZrO2 head ad HXLPE liner exhibited the lowest rate of revision at last follow-up.