Introduction

Male circumcision (MC) is the most commonly performed surgical procedure in the world1 and has demonstrated its effectiveness on reducing various sexually transmitted infections, such as human immunodeficiency virus, human papillomavirus, and herpes simplex virus 22,3,4,5. Moreover, it is associated with the prevention of urinary tract infections, penile cancer, prostate cancer, and cervical cancer of female partners6,7,8,9,10.

Conventional circumcision (CC) is recommended by World Health Organization (WHO) as the standard procedure to remove the redundant prepuce11. However, CC has some limitations: it is time consuming, painful, requires stitches, and its operative incision visibility lacks controllability12,13. In addition, adequate training (an average of 100 circumcisions) is required for practitioners to perform CC safely and effectively14. To facilitate the surgical procedures, promote the recovery process, and satisfy the penile appearance, several special disposable devices with mature surgical methods are being routinely used in many medical centers11.

Pediatric and neonatal MCs are often performed using one of the several different clamping or crimping devices, such as Plastibell, Gomco, Mogen, or AccuCirc15,16,17. For adult MC, Shang Ring and PrePex, two in situ devices (ISD), have been proven to have better clinical outcomes than CC. They are associated with a shorter operative time (OT), a lower intraoperative pain score, a higher satisfaction with penile appearance, less intraoperative blood loss (IB), a lower rate of adverse event, and a lower incidence of wound bleeding (WB)13,18,19. Circular disposable devices (CDD), such as the Circular Stapler20 and Unicirc12, are based on a principle different from that of ISD and may be superior to CC21. However, credible evidence is insufficient to confirm whether ISD or CDD can optimize MC, and comprehensive comparisons of their advantages and disadvantages are lacking. Therefore, we conducted this systematic review and network meta-analysis to assess the safety and efficacy of ISDs and CDDs. Furthermore, we evaluated the characteristics of these devices for optimizing MC.

Results

Selected trials

Pubmed and Ovid databases were searched and 491 and 838 references were evaluated, respectively. After 775 duplicates were removed, the titles and abstracts of 554 records were read, and 539 records that did not meet the inclusion criteria were excluded. Fifteen trials were retrieved for a detailed evaluation and five were excluded: two non-randomized controlled trials, one trial that compared two application methods of one device, and two trials that used immature15 and harmful devices22. Ten randomized controlled trials (RCTs) were identified (4649 men)12,18,20,21,23,24,25,26,27,28 and included in our meta-analysis (Fig. 1); moderate methodological quality is shown in Fig. 2. Seven trials12,18,21,23,26,27,28 had a low risk of bias of a random sequence generation; only three trials18,26,28 had a low risk of allocation concealment; and none of the trials had a low risk of performance bias or detection bias. However, there was a low risk of attrition bias, reporting bias, and other biases in all trials except one26 that had an incomplete outcome data.

Figure 1
figure 1

Flow diagram of systematic review.

Figure 2
figure 2

Risk of bias summary of included studies. (+: Low risk of bias; ?: Unclear risk of bias; -: High risk of bias)

Of all the RCTs, six were conducted in China20,21,24,25,27,28, and the remaining four were conducted in South Africa12, Uganda23, Rwanda18, and Kenya and Zambia26. Nine RCTs were two-arm trials: circular stapler versus CC (three studies), Shang Ring versus CC (four studies), Unicirc versus CC (one study), and PrePex versus CC (one study). One RCT was a three-arm trial: circular stapler versus Shang Ring versus CC. More RCT characteristics are shown in Table 1. The general network situation of eligible comparisons in this meta-analysis is shown in Fig. 3. Five studies compared CDD versus CC, six compared ISD versus CC, and one compared CDD versus ISD directly. However, the numbers of comparisons were variable and less than ten in each of the analyzed outcomes (Tables 2 and 3). Thus, we considered the publication bias in each comparison.

Table 1 Characters of Included Studies.
Figure 3
figure 3

Network plot of the circumcision. Both nodes and edges are weighted according to the number of studies involved in each direct comparison. Green line: CDD vs CC; Red line: ISD vs CC; Yellow line: CDD vs ISD. (CDD: circular disposable devices, ISD: in situ devices, CC: conventional circumcision)

Table 2 Outcomes of Pairwise Meta-analysis
Table 3 The Outcomes of Network Meta-analysis.

Credibility of network meta-analysis

We had adequate iterations in each model to make sure every Markov chain was similar and that all potential scale reduction factors (PSRF) were close to 1 (the details are not shown). This ensured that every model in this network meta-analysis converged. Consistency and inconsistency analyses were applied in each outcome and no great difference between random effects standard deviation (RESD) and inconsistency standard deviation (ICSD) was found (Table 4). As comparison involved both direct and indirect evidence of CDD versus ISD, node-splitting models were estimated on the outcomes of this comparison (Table 4). We found that most of evidence was exchangeable, except the WB.

Table 4 The Credibility of Network Meta-analysis.

Comparisons between CDD and CC

Five studies involving 2026 men were included in pairwise meta-analyses (Table 2). The statistically significant outcomes were IB, OT, mean pain score on the operation day (PO), mean pain score on postoperation days (PP), and wound healing time (WHT). CDD showed less IB [standard mean difference (SMD): −3.12 (−4.32, −1.92)], less OT [SMD: −4.33 (−6.43, −2.23)], less WHT [SMD: −0.88 (−1.18, −0.58)], less PO [SMD: −1.51 (−2.55, −0.46)], and less PP [SMD: −1.38 (−2.28, −0.48)] compared with CC.

In network meta-analyses (Table 3), CDD showed less IB [SMD: −8.24 (−11.6, −4.86)], less OT [SMD: −17.3 (−22.6, −11.4)] but a higher overall expenditure (Cost) [SMD: 245 (84.1, 404)] than CC. The stable outcomes with statistical significance in both pairwise and network meta-analyses were for IB and OT.

Comparisons between ISD and CC

Five studies involving 2937 men were included in pairwise meta-analyses (Table 2). The statistically significant outcomes were IB, OT, satisfaction rate (SR), WB, and incidence of wound edema (WE). ISD showed less IB [SMD: −3.25 (−3.65, −2.85)], less OT [SMD: −5.72 (−7.11, −4.33)], a higher SR [risk ratios (RR): 1.17 (1.02, 1.35)], less WB [RR: 0.16 (0.03, 0.76)], and WE [RR: 0.69(0.53, 0.88)] compared with CC.

In network meta-analyses (Table 3), ISD showed less IB [SMD: −9.26 (−13.5, −4.93)], less OT [SMD: −18.2 (−23.7, −12.6)], and less WB [odds ratio (OR): 0.07 (0, 0.78)] than CC. The stable outcomes with statistical significance in both pairwise and network meta-analyses were for IB, OT, and WB.

Comparisons between CDD and ISD

Only one study involving 628 men was included in pairwise meta-analyses (Table 2). The statistically significant outcomes were observed for IB, OT, PO, PP, SR, incidence of wound adverse event (WAE), WB, WE, WHT, and incidence of wound infection (WI). CDD showed more IB [SMD: 0.33 (0.17, 0.48)], more OT [SMD: 0.48 (0.32, 0.63)], less PO [SMD: −2.23 (−2.43, −2.03)], less PP [SMD: −2.39 (−2.59, −2.18)], a higher SR [RR: 1.57 (1.39, 1.78)], less WAE [RR: 0.30 (0.20, 0.45)], more WB [RR: 21.0 (1.24, 357)], less WE [RR: 0.10 (0.05, 0.24)], less WHT [SMD: −0.74 (−0.90, −0.58)], and less WI [RR: 0.04 (0.002, 0.62)] compared with ISD.

In network meta-analyses (Table 3), CDD had a higher Cost [SMD: 235 (4.91, 473)] and more WB [SMD: 28.6 (1.17, 1320)] but less WHT [SMD: −6.58 (−12.6, −0.58)] than ISD. The stable outcomes with statistical significance in both pairwise and network meta-analyses were for WB and WHT.

The surface under the cumulative ranking curve and treatment ranks (Table 5)

Table 5 the SUCRA and ranks of treatments.

The outcomes of CDD whose surface under the cumulative ranking curve (SUCRA) was ≥ 80% were PO, PP, WE, WHT, and WI and whose SUCRA was ≤ 20% were Cost and WB. CDD had a 72%, 73%, 90%, 91%, and 70% possibility for the least PO, PP, WE, WHT, and WI, respectively; CDD had a 97% and 74% possibility to have the highest Cost and WB, respectively.

The outcomes of ISD whose SUCRA were ≥ 80% were IB, OT, SR, and WB and whose SUCRA were ≤ 20% was WHT. ISD had a 70%, 62%, and 77% possibility to have the least IB, OT, and WB, respectively, and a 97% possibility to have the highest SR; ISD had a 95% possibility to have the longest WHT.

No CC outcome had SUCRA ≥ 80%. The outcomes whose SUCRA were ≤ 20% were IB, OT, PO, PP, WI, and SR. CC had a 100%, 85%, 100%, 79%, and 71% possibility to have the largest IB, OT, PO, PP, and WI and a 73% possibility to have the lowest SR.

Although direct and indirect evidence were unexchangeable when comparing WB of CDD and ISD, a sensitivity analysis was applied to explore the reliability of the ranks’ order, and we found that the conclusion was stable (the details are not shown).

Discussion

The WHO and Joint United Nations Programme on HIV/AIDS (UNAIDS) recommend a voluntary medical MC to be considered as a part of a comprehensive HIV prevention package in countries with generalized epidemics. MC has been proven to have the greatest public health impact and provide the largest cost efficiency if the services are rapidly scaled up. One efficacious way includes using disposable devices in MC; this may lower the surgical skill required and accelerate the pace of delivery of voluntary medical MC while maintaining the safety of the procedure11,19.

To our knowledge, this is the first network meta-analysis concerning disposable devices in MC. Although our analysis was based on ten studies, it included 4649 individuals who were randomly assigned to three different kinds of MC methods. These trials were conducted in China and some countries in Africa. The methodological quality of RCTs was moderate due to inadequacies in allocation concealment and blinding because of ethical issues and properties of the surgical studies.

In our meta-analysis, devices were classified into one of two categories: ISD (PrePex and Shang Ring) and CDD (circular stapler and Unicirc), according to their operation principles. An ISD consists of an inner and an outer ring. The inner ring is a frame that the outer ring can lock onto to clamp the foreskin26,29. Excess foreskin is removed immediately after the rings are locked firmly23 or at the time when the rings are removed29. Rings are removed according to surgeon’s assessment of whether ischemic foreskin has necrosed and the wound has healed. Without a ring in situ, CDD has a circular glans pedestal in which the excess foreskin can be incised smoothly and a fastening part to fix the reserved foreskin to prevent shifting. The wound is stapled simultaneously21 when incising the foreskin or glued with some type of biogel12. The staples will theoretically fall off when the wound has healed.

Some trials involving ISD in Africa were reviewed by the Technical Advisory Group on Innovations in Male Circumcision of the WHO. The outcomes based on a total of 1983 Shang Ring and 2417 PrePex procedures found that ISD was easier to perform, had higher surgical success rates, lower total procedure times, eliminated the need for suturing, possibly had fewer complications, caused less bleeding, gave better cosmetic results, may potentially reduce the time taken for recovery after surgery, and eliminated the need for routine injectable anesthesia (only PrePex) compared with other methods. However, wound healing took about 1–2 weeks longer on average than CC and the pain varied by the method type19. We had previously conducted a meta-analysis of RCTs that compared the safety and efficacy of Shang Ring with CC and drew similar conclusions13.

The superiority of disposable devices was also confirmed in our meta-analysis. According to the stable outcomes, ISD was found to have less intraoperative blood loss, a lower operative time, and a lower incidence of wound bleeding than CC. In addition, ISD showed a lower incidence of wound bleeding but needed more wound healing time than CDD. CDD was found to have lesser intraoperative blood loss and a lower operative time than CC. After assessing the results of SUCRA and treatment ranks, we found that CDD tended to be the treatment with the best wound healing condition (least incidence of wound edema/infection and wound healing time) and the least pain experience (lowest pain score). However, it may be the most expensive device with the highest incidence of wound bleeding after surgery; among all the techniques, the ISD circumcision tended to have the lowest operative time and bleeding volume intraoperatively and the lowest incidence of bleeding postoperatively. Furthermore, ISD had the highest satisfaction rate despite requiring the longest wound healing time relative to the other techniques. CC showed no advantages other than a minor trend to be the cheapest MC method (78.5% SUCRA, 57% possibility).

Lv et al.21 conducted a survey on 508 men who had recently undergone circumcision. They found that safety and pain were their main concerns before MC, while pain and penile appearance were their main postoperative concerns. Pain levels varied based on different levels of individual tolerance, methods of anesthesia and analgesia, wound condition (e.g., infection or edema would result in more pain) and so on. In our analysis, both CDD and ISD demonstrated shorter operative times than CC, which was a result of a fast excision of excess foreskin (or no excision at all) and avoiding suturing. It could also be the reason for their lower intraoperative pain score. CDD was also likely to have a better wound condition and that could be the reason for its lower postoperative pain and fast wound healing. Men who chose ISD circumcision were required to wear the ring for 1–2 weeks longer, and pain during an erection was reported as being somewhat higher than at comparable times following a CC (Shang Ring, in particular). However, ISD had a 97% possibility to be the device that best satisfied the men in our analysis. ISD left a neat circumferential wound with no suture marks six weeks postoperatively19. However, staples and stitches are used to anastomose foreskin in circular stapler circumcision and CC, respectively, and the unaesthetic appearance may be derived from the imprints and pinholes.

Without a doubt, the most basic and important requirement of safety of CDD and ISD have been proven in many studies and in this meta-analysis. Nevertheless, devices still have their own specific adverse events, such as a dislocation of the frenulum and wound disruption associated with all devices, inconvenience associated with ISD, unpleasant odor associated with PrePex, and potential intraoperative suturing requirement associated with CDD because of bleeding or incomplete wound closure11,12,19,25,29. We could not explore these issues in our analysis due to lack of adequate numbers of included RCTs.

There are some limitations in our study: We classified devices into two categories but differences still existed within the same category. For example, Shang Ring requires a sterile field and injection of local anesthetic at placement, whereas PrePex requires neither a sterile field nor an injection of local anesthetic at placement. We failed to contact missing data and had a limited number of studies (ten RCTs), and only one comparison between CDD and ISD implies the possibility of a publication bias. Some of the outcomes measured in this meta-analysis were only based on two studies. Studies often report their outcomes in different ways, such as follow-up periods, pain score, and definition of complications. Therefore, all of the results from our meta-analysis should be considered with caution.

In conclusion, the clinical performance of disposable devices used in adult MC exceeded that of CC. CDD circumcision tends to have the best wound healing condition and the least pain experience. ISD circumcision tends to have the lowest operative time, least intraoperative blood loss, least incidence of wound bleeding, and highest satisfaction rate. Each device has its own advantages and these should be discussed with men prior to their circumcision.

Material and Methods

Search strategy

A systematic bibliographic search of PubMed, Embase, and the Cochrane Library databases (the Cochrane Central Register of Controlled Trials and the Cochrane database of Systematic Reviews) of Ovid was done from inception to 12 January 2016 for RCTs that reported using disposable devices to complete adult MCs. The following were the keywords used for searching: “circumcision,” “randomized controlled trial” and their other form of expressions. Original papers were scanned in the reference section to look for missing trials.

Study eligibility criteria

There was no language restriction, and published RCTs that compared efficacy and safety of one kind of disposable device with CC or another device in adult MC were included regardless of whether concealment of allocation or blinding method was carried out or not. The studies including devices that had been confirmed as immature or harmful and thereby not currently used were excluded. Duplicate publications of two or more studies investigating the same sample were excluded.

Interventions and comparisons

Men were divided into different groups according to the principle of the operation, irrespective of the brand names of devices used. Shang Ring and PrePex were classified as ISD; circular stapler and Unicirc were classified as CDD; and all non-device MCs were classified as CC (e.g., sleeve and dorsal slit). The comparisons were between at least two of ISD, CDD, and CC.

Outcome measures

The outcomes measured in this review included the following: IB-intraoperative blood loss (ml), OT-operative time (min), PO-mean pain score on the operation day, PP-mean pain score of postoperative days (after 24 h), Com-overall incidence of complication, WAE-incidence of wound adverse event, WB-incidence of wound bleeding, WD-incidence of wound dehiscence, WE-incidence of wound edema, WHT-wound healing time (day), WI-incidence of wound infection, Cost-overall expenditure, SR-satisfaction rate.

Literature screening and data extraction

Two reviewers (Yu Fan and Dehong Cao) read the titles and abstracts of all potential trials and independently selected eligible RCTs according to the predetermined inclusion and exclusion criteria. If any discrepancy existed, it was resolved by discussion between the two reviewers or by the assistance of a third reviewer (Qiang Wei). Data were extracted and typed in by the first two reviewers through elaborative reading and utilization of a specific formula. The third reviewer (Qiang Wei) checked the information and resolved any differences by discussions with other team members (other authors). If the study provided medians and interquartile ranges instead of means and standard deviations (SDs), we imputed the means and SDs, as described by Hozo et al.30 In cases of multiple pain scores at different time points, we calculated the mean ± SD pain score to obtain PO and PP. In the presence of missing data, efforts were made to contact the authors for information. Extracted data mainly included general trial information, selected outcomes, and methodological characteristics. The methodological quality of each selected trial was assessed by the same two reviewers according to the Cochrane Collaboration Risk of Bias Tool in Review Manager 5.3. A funnel plot was used to test the possibility of publication bias only when the number of each comparison was no less than ten. Otherwise, the existence of potential publication bias was considered.

Data processing and statistical analysis

A network plot of devices was drawn in Stata 13.0 to visually display the number of studies involved in each direct comparison. A pairwise meta-analysis was then processed by synthesizing the studies that compared the same interventions with a random-effects model, which incorporated the assumption that different studies assessed different yet related treatment effects31. Dichotomous and continuous data were expressed as RR and SMD, respectively, both with 95% confidence intervals (CI).

A network meta-analysis was analyzed in AIDDS v. 1.16.6. Its network meta-analysis models were implemented in the Bayesian framework and estimated by Markov chain Monte Carlo methods. All data processing procedures in AIDDS followed the methods described by Zhao et al.32 Using the Brooks–Gelman–Rubin diagnostic, we considered that the model had converged when all Markov chains were similar and PSRF was close to 1. Otherwise, additional iterations were applied until the model had converged. In a network meta-analysis, we chose OR and the 95% credibility interval (CrI) to express dichotomous data. This was because logOR was more suitable than logRR as the former had better mathematical properties and often reflected the underlying mechanisms more effectively. Consistency and inconsistency analyses were applied to check whether the trials in the network were consistent. An ICSD greater than RESD indicated an inconsistency problem. Next, we investigated the source for inconsistency from the studies, excluded them, and ran the model until there was no significant inconsistency. A node-splitting model was estimated for each comparison involving both direct and indirect evidence. Whether the evidence was exchangeable was according to the corresponding inconsistency parameter. It should be zero when evidence was completely exchangeable (p > 0.05). When there was no relevant inconsistency and evidence was exchangeable, a consistency model was used to draw conclusions on the relative effects (OR, SMD, with CrI) of the included treatments and their rank effects with possibilities. Outcomes were considered to be stable if they were statistically significant in both pairwise and network meta-analysis.

The aforementioned treatment ranks were processed in Stata 13.0 to calculate SUCRA. The larger the SUCRA value, the better recommendation of its best rank, and vice versa. We defined a SUCRA ≥ 80% as a significant recommendation of the best rank and a SUCRA ≤ 20% as a significant consideration of the worst rank.

Additional Information

How to cite this article: Fan, Y. et al. The characteristics of circular disposable devices and in situ devices for optimizing male circumcision: a network meta-analysis. Sci. Rep. 6, 25514; doi: 10.1038/srep25514 (2016).