Introduction

Pancreatic surgery has made tremendous progress over the last several decades. With the introduction of minimally invasive techniques, adoption of laparoscopy and robotic platforms for performance of complex pancreatic surgery has evolved as well. Pancreaticoduodenectomy (PD) remains one of the most technically challenging surgeries in the current era1,2,3. Prior to the introduction of the robotic platform, attempts to improve surgical outcome with the use of laparoscopy did not gain widespread adoption4,5. Introduction of robotic surgery has overcome some of the limitations from laparoscopic approach6. Difficulties related to laparoscopic surgery such as the fulcrum effect, fine instrument manipulation, and reversal of instrument tip from surgical hand-motion created challenges for many surgeons4,5.

Following the first case of robotic PD reported in 2003 by Giulianotti et al., multiple studies have reported feasibility and safety of robotic PD in the management of pancreatic head malignancies5,7,8,9,10,11. Robotic platform allows performance of complex surgical cases by overcoming the limitations associated with laparoscopic surgery. The advantages of robotic surgery includes 7 degrees of freedom, 3D visualization, and fine tremor reduction3. Previous reviews and meta-analyses have found perioperative outcomes of robotic PD to be at worst equivalent to those of open PD12,13. The studies to date have focused on clinical short-term outcomes in patients undergoing robotic PD. However, long-term outcomes of robotic surgery for pancreatic cancer is still scarce in the literature due to the limited longitudinal data available.

Based on longitudinal studies, we know that margin negative (R0) resection for pancreatic cancer translates into improved survival14,15,16. In addition, increased number of lymph nodes retrieved during surgery frequently allows accurate staging and is synonymous with the adequacy of surgical resection17. There are currently several studies that also reports improved resection margin with the use of robotic platforms12. Based on the available literature, we attempted to evaluate the role of robotic PD in comparison to open approach with a focus on short-term clinical outcomes as well as histopathological outcomes such as margin status and nodal harvesting14.

Materials and methods

This systematic review was performed according to the Cochrane Handbook for Systematic Reviews of Interventions18 and follows the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) and Meta-analysis Of Observational Studies in Epidemiology (MOOSE) guidelines19,20. The protocol of this systematic review was developed prospectively and registered in the International prospective register of systematic reviews PROSPERO: CRD42018112039. Given the summary design nature of this study, Institutional Review Board approval and written consents were not required. The literature search, screening of the records, study selection, extraction and analysis of the data, followed by critical appraisal, were performed by two independent researchers (MG and XDD). The research question was formulated within the PICOTS framework as following:

(P) Population: Adults older than 18 years old undergoing pancreaticoduodenectomy.

(I) Intervention: robotic pancreaticoduodenectomy.

(C) Comparator intervention: open pancreaticoduodenectomy.

(O) Outcomes: operating time, estimated blood lost, postoperative complication rate, postoperative pancreatic fistula (POPF) rate, delayed gastric emptying rate, incisional surgical site infection rate, reoperation rate, length of hospital stay, margin involvement rate, and number of lymph nodes harvested.

(T) Time: Short-term.

(S) Setting: Inpatient.

Eligibility criteria, definitions and endpoints

All experimental or observational clinical studies comparing robotic to open PD for benign and/or malignant disease were eligible for inclusion. Non-comparative descriptive studies, studies comparing any of the interventions of interest to a non-relevant intervention such as laparoscopic pancreaticoduodenectomy or robotic tumor resection followed by mini-laparotomy for reconstruction, and review articles were excluded.

Postoperative complications were classified according to Clavien-Dindo classification21. Surgical site infections (SSI) were defined according to the Center for Disease Control National Nosocomial Infections Surveillance System22.

The primary endpoints of this systematic review were overall postoperative complication and resection margin involvement rates. Secondary endpoints included operating time, estimated blood loss, postoperative complication rate, postoperative pancreatic fistula rate, rate of delayed gastric emptying, surgical site infection rate, reoperation rate, length of hospital stay, and number of lymph nodes harvested.

Search strategy and study selection

The Pubmed, EMBASE, and Cochrane Library were systematically searched using the following MeSH terms: ‘pancreatoduodenectomy’, ‘pancreaticoduodenectomy’, ‘whipple’, and ‘robotic’ combined with the Boolean operator ‘AND’ and all synonyms combined with the Boolean operator ‘OR’. In addition, clinicaltrials.gov was searched for any ongoing studies. The details of Pubmed search strategy are presented in Supplement 1. Relevant articles were identified, and the results of the search were screened through the title, abstract and/or full text article. The sensitivity of the search strategy was tested by screening the references of included articles for additional publications.

Data extraction and quality assessment

The data from the included articles were collected to predefined Microsoft Excel tables and studies were assessed for validity by three researchers independently (MG, XDD, and DMF). Extracted data items included publication-specific variables (authors and affiliations, journal and year of publication), study-specific variables (study design, study span, sample size, definitions of interventions and endpoints, conclusions, potential biases), and patient-specific variables (baseline characteristics, intra- and postoperative outcomes, pathologic outcomes). Quality assessment of each individual study was performed according to Cochrane Handbook for Systematic Reviews of Interventions on the following items: selection, performance, detection, attrition, selective reporting, and other bias risks18. In addition, Risk Of Bias In Non-randomized Studies (ROBINS-I) tool was utilized to evaluate the quality of observational studies on the following biases: confounding, selection, classification of interventions, deviations of intended comparability, and outcomes23.

Statistical analysis

Inverse variance method with mean difference (MD) and standard error as the measure of an effect estimate was used for continuous variables, whereas Mantel–Haenszel method with odds ratios and 95% confidence intervals (OR (95%CI)) was employed for dichotomous variables. In cases when continuous variables were reported in median and interquartile range in the included studies, mean and standard deviation were estimated using Hozo’s formula24. Statistical heterogeneity among effect estimates was assessed using Cochran Chi2 and I2, and between-study variance was assessed using Tau2 statistic when the I2 was 50% or greater25. Random-effects model was utilized for meta-analysis. The results of the meta-analysis were illustrated on forest plots. Ad-hoc meta-regression analysis with Omnibus test was performed to evaluate the impact of potential confounding factors on outcomes. To assess clinical significance of the statistical findings for dichotomous endpoints, relative risk reduction (RRR), absolute risk reduction (ARR) and number needed to treat/harm (NNT) with 95%CI were calculated. Clinical significance of the MD was assessed for numeric endpoints. The variability of the effect of intervention over different settings was assessed using 95% prediction intervals26. Visual assessment of funnel plots and Egger’s test were utilized to assess for publication bias. A leave-one-out meta-analysis was utilized for sensitivity analysis. A p-value < 0.05 was considered statistically significant. Certainty of evidence was evaluated using GRADE approach. Statistical analysis was performed using RevMan (version 5.3; Nordic Cochrane Center, Cochrane Collaboration, Copenhagen, Denmark) and CMA Software (Version 3; Biostat, NJ, USA).

Results

Literature search and study selection

Details of the search strategy and study selection are presented in the PRISMA flowchart (Fig. 1). Four databases were searched and revealed 237 records. Additionally, two articles were found at clinicaltrials.gov and through the references of eligible studies. Twenty-nine studies (including published abstracts of conference proceedings) were included in the qualitative synthesis after excluding duplicates, non-relevant articles, and articles not reporting the outcome of interest.

Figure 1
figure 1

PRISMA flow diagram.

Quality assessment

The risk of bias summary and graph of the included studies are presented in Fig. 2A,B. The risk of selection, performance and detection bias was high in all included studies given their observational nature. Attrition, reporting, and other bias risks were moderate or low in included studies. The results of quality assessment using the ROBINS-I tool are presented in Supplement 2. Overall risk of bias was assessed as serious in most studies.

Figure 2
figure 2

Quality assessment: (A) Risk of bias summary. (B) Risk of bias graph.

Description of included studies

Figure 3 highlights the time span of included studies published from the same institutions, which may increase the risk of duplicate data synthesis. Due to an overlap of the studies by Napoli et al.27 and Boogi et al.28 from the University of Pisa, only the study by Boogi et al.28 was included as it covers a longer time span. An abstract published by Walsh et al.29 from Cleveland Clinic was excluded as there was an overlap with the study by Chalikonda et al.30 There were five studies from the University of Pittsburgh that overlap to a certain extent. After excluding three (McMillan et al.31, Varley et al.32 and Wilson et al.33), studies by Boone et al.34 and Cai et al.35 with a maximal time span covered and minimal overlap were included.

Figure 3
figure 3

Arrow plot of study spans of included studies with the risk of duplicate data synthesis.

Twenty-four studies were included in the final quantitative data synthesis, totaling 12,579 patients (2,175 robotic PD and 10,404 open PD)8,28,30,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54. Seven studies30,34,36,38,41,50,52 were prospective cohort studies and 17 retrospective cohort studies8,28,35,37,39,40,42,43,44,45,46,47,48,49,51,53,54. In four of these studies34,40,43,50, data were extracted from the abstracts of conference proceedings published in indexed journals. Five studies8,36,38,41,52 had the Oxford CEBM level of evidence of 2b and 19 studies-2c28,30,35,37,39,40,42,43,44,45,46,47,48,49,50,51,53,54,55. Three studies8,42,47 were multicenter studies with (5, 8 and 14 included centers), 20 studies28,30,35,36,37,38,39,40,41,43,44,45,46,48,49,50,51,52,53,54 were single center studies and one study34 did not provide this information. Various primary endpoints were reported in seven studies35,38,45,47,50,52,54 and are described in Table 1, whereas 17 studies8,28,30,34,36,37,39,40,41,42,43,44,46,48,49,51,53 did not report/specify this variable.

Table 1 Characteristics of included studies.

Description of study populations and interventions

Adult patients from multiple countries (China, Korea, Italy, USA, Russia, Netherlands, Belgium, France, Turkey, Germany, UK and Spain) were involved in the 24 included studies. 17 studies8,28,30,35,36,37,38,39,41,43,44,46,47,48,51,52,54 included patients with benign and malignant diseases, four studies42,45,49,53 only malignant diseases and three studies34,40,50 did not describe the indications. The primarily involved organs were the pancreas, biliary tract and duodenum in 16 studies8,28,36,37,38,39,41,42,46,47,48,49,51,52,53,54; one study43 only involved only pancreatic and biliary tract diseases and one study45 only pancreatic cancer. Six studies30,34,35,40,44,50 did not provide this information (Table 1).

Patients’ baseline characteristics are summarized in Table 2. The definitions of the interventions are summarized in Table 3. Robotic PD was a totally robotic procedure in 17 studies8,28,34,35,36,38,39,41,42,44,45,47,49,51,52,53,54. A hybrid procedure was performed in four studies30,37,46,48 and three studies40,43,50 did not specify the type of the procedure. Six studies reported DaVinci console type (both Si/Xi in three studies38,44,52; Si49,50 in two and S in one41).

Table 2 Comparison of patients’ baseline characteristics in robotic vs. open pancreaticoduodenectomy.
Table 3 Definition of interventions in included studies.

Intervention categories that were described for both open and robotic surgery included type of procedure (Whipple procedure, pylorus preserving PD, or multivisceral resection) and type of anastomosis (pancreaticojejunostomy, pancreaticogastrostomy and/or duct-to-mucosa). Vein resection was reported in 10 studies8,28,30,35,36,38,41,42,46,47 and was up to 25% in the robotic and up to 38% in the open group. Further categories described in Table 3 were routine or selective placement of pancreaticojejunal anastomotic stent, antecolic or retrocolic location of the jejunal loop, and routine or selective use of abdominal drainage.

Meta-analysis

All 24 studies, regardless of the evidence level and risk of bias, were included in this meta-analysis. Primary outcomes were overall postoperative morbidity and margin involvement rate. Secondary clinical outcomes were operating time, estimated blood loss, postoperative pancreatic fistula (POPF), delayed gastric emptying (DGE), surgical site infection (SSI), reoperation rate, and length of hospital stay. An additional secondary outcome was the number of lymph nodes harvested.

Primary endpoints

Overall postoperative morbidity

Overall postoperative morbidity as a clinical primary outcome was reported in 18 studies (1052 robotic PD vs. 8206 open PD). The statistical among-study heterogeneity was low (I2 = 0%). The overall postoperative morbidity rate was 42.6% (448/1,052) in robotic PD vs. 54.4% (4,464/8,206) in open PD. This difference was not statistically significant [OR (95%CI) = 0.86 (0.74, 1.01); p = 0.06] (Fig. 4A). The RRR was 22% and the NNT was 9 (7, 12) (Table 4). 95% prediction interval was 0.72, 1.02 with moderate GRADE certainty of evidence (Table 4).

Figure 4
figure 4

Meta-analysis of primary endpoints: (A) Overall postoperative morbidity. (B) Resection margin involvement rate.

Table 4 Clinical relevance and certainty of evidence provided by statistical difference in clinical and pathologic endpoints (dichotomous and numeric) between robotic vs. open pancreaticoduodenectomy.
Resection margin involvement rate

Resection margin involvement rate was reported in 17 studies (784 robotic PD vs. 1886 open PD). The statistical among-study heterogeneity was moderate (I2 = 56%; Tau2 = 0.40). The margin involvement rate was 15.6% (122/784) in robotic PD vs. 19.9% (376/1886) in open PD. This difference was statistically and clinically significant [OR (95%CI) = 0.64 (0.41, 1.00); p = 0.05; NNT = 23 (13, 79)] (Fig. 4B) (Table 4). 95% prediction interval was 0.15, 2.68 with moderate GRADE certainty of evidence (Table 4).

Secondary endpoints

Operating time

Operation time was reported in 23 studies (2,086 robotic PD vs. 10,131 open PD) and was significantly longer in robotic PD [MD (95%CI) = 75.17 (48.05, 102.28); p < 0.00001] with high among-study statistical heterogeneity (I2 = 99%; Tau2 = 3956.67) (Fig. 5A). Although the clinical importance of the MD was assessed to be moderate, 95% prediction interval was − 58.77, 209.11 and GRADE certainty of evidence was low (Table 4).

Figure 5
figure 5figure 5figure 5figure 5

Meta-analysis of secondary endpoints: (A) Operating time. (B) Estimated blood loss. (C) Postoperative pancreatic fistula rate. (D) Delayed gastric emptying rate. (E) Incisional SSI rate. (F) Reoperation rate. (G) Length of hospital stay. (H) Number of lymph nodes harvested.

Estimated blood loss

Estimated blood loss was reported in 18 studies (1,549 robotic PD vs. 2,935 open PD) and was significantly lower in robotic PD [MD (95%CI) = − 191.35 (− 238.12, − 144.59); p < 0.00001] with high among-study statistical heterogeneity (I2 = 96%; Tau2 = 7,522.30) (Fig. 5B). Clinical importance of the MD was assessed to be low and 95% prediction interval was − 382.04, − 0.66 with moderate GRADE certainty of evidence (Table 4).

Postoperative pancreatic fistula rate

POPF rate was reported in 20 studies and did not differ significantly between the two approaches (18.9% (265/1,909) in robotic PD vs. 16.0% (1,589/9,921) in open PD) [OR (95%CI) = 0.89 (0.65, 1.22); p = 0.48; NNT = 47 (26, 267)] with high among-study heterogeneity (I2 = 64%; Tau2 = 0.26) (Fig. 5C) (Table 4). 95% prediction interval was 0.29, 2.74 with low GRADE certainty of evidence (Table 4).

Delayed gastric emptying rate

Sixteen studies reported the rate of delayed gastric emptying, which did not differ significantly between the two approaches (14.7% (177/1,202) in robotic PD vs. 15.2% (1,317/8,663) in open PD) [OR (95%CI) = 0.98 (0.74, 1.30); p = 0.87], with low among-study heterogeneity (I2 = 24%; Tau2 = 0.07) (Fig. 5D). RRR was 3%, NNT was 210, 95% prediction interval was 0.51, 1.87 with very low GRADE certainty of evidence (Table 4).

Incisional surgical site infection rate

Incisional SSI rate was reported in 13 studies and favored robotic PD (7.9% (80/1,010) in robotic PD vs. 10.1% (820/8,100) in open PD) [OR (95%CI) = 0.45 (0.27, 74); p = 0.002; NNT = 46 (25, 243)] with moderate among-study heterogeneity (I2 = 52%; Tau2 = 0.30) (Fig. 5E) (Table 4). 95% prediction interval was 0.12, 1.70 with low GRADE certainty of evidence (Table 4).

Reoperation rate

Sixteen studies reported the rate of reoperations (1,521 robotic PD vs. 8,732 open PD), which did not differ significantly between the two approaches (6.6% (100/1,521) in robotic PD vs. 6.6% (577/8,732) in open PD) [OR (95%CI) = 0.80 (0.62, 102); p = 0.72], with low among-study heterogeneity (I2 = 0%; Tau2 = 0.00) (Fig. 5F). RRR was 1%, NNT was 3,007, 95% prediction interval was 0.61, 1.04 with very low GRADE certainty of evidence (Table 4).

Length of hospital stay

Length of hospital stay was reported in 20 studies (1,893 robotic PD vs. 9,903 open PD) and did not differ significantly between the two approaches [MD (95%CI) = − 1.00 (− 1.88, − 0.12); (p = 0.06)] with high among-study heterogeneity (I2 = 97%; Tau2 = 2.29) (Fig. 5G). Although the clinical importance of the MD was assessed to be moderate, 95% prediction interval was − 4.32, 2.32 and GRADE certainty of evidence was very low (Table 4).

Number of lymph nodes harvested

Number of lymph nodes harvested was reported in 15 studies (908 robotic PD vs. 1,953 open PD). Statistical among-study heterogeneity was high (I2 = 83%; Tau2 = 9.24). The difference was statistically significant [MD (95%CI) = 2.88 (1.12, 4.65); p = 0.001] (Fig. 5H). Although the clinical importance of the MD was be moderate, 95% prediction interval was − 3.97, 9.73 and GRADE certainty of evidence was low (Table 4).

Meta-regression analysis

Ad-hoc meta-regression analysis was performed to assess the impact of potential covariates on the statistical findings. Covariates utilized for meta-regression analysis included the central tendency values for age and BMI, proportion of males, proportion patients with ASA > 2, and study design. A statistically significant correlation was found between overall postoperative mortality and average age in robotic PD (Omnibus p = 0.040) (Fig. 6A). However, only a statistical trend in correlation was found between overall postoperative mortality and open PD (Omnibus p = 0.075) (Fig. 6B). No statistically significant impact of the above-mentioned covariates on margin involvement rate and secondary endpoints was found.

Figure 6
figure 6

Ad-hoc meta-regression analysis: (A) Regression plot of overall postoperative morbidity and average age in robotic PD (Omnibus p = 0.040). (B) Regression plot of overall postoperative morbidity and average age in open PD (Omnibus p = 0.075).

Publication bias and sensitivity analysis

Publication bias was evaluated by visual assessment of symmetry on the funnel plot (Fig. 7) and using Egger’s test (Overall postoperative morbidity: t = 0.534, p = 0.522; Margin involvement rate: t = 0.478, p = 0.641). No significant risk of publication bias was found. A sensitivity analysis of the included observational studies was performed using leave-one-out forest plots. Consecutive exclusion of studies did not significantly impact the findings (Fig. 8). The results of the evaluation of the certainty of evidence are summarized in Table 4.

Figure 7
figure 7

Funnel plot of reporting bias: (A) Overall postoperative morbidity. (B) Resection margin involvement rate.

Figure 8
figure 8

Sensitivity analysis: (A) Leave-one-out meta-analysis of overall postoperative morbidity. (B) Leave-one-out meta-analysis of margin involvement rate.

Discussion

Currently, general surgery is the fastest growing specialty for the DaVinci robotic platform in the US. Specifically, robotic PD is experiencing widespread growth since its introduction7. Similar to laparoscopic distal pancreatectomy has seen its use expanded three-fold between 1998 and 2009, robotic PD is also subject to significant growth nationwide56.

Pancreaticoduodenectomy remains a technically demanding operation with significant risks of morbidity57. Historically, minimally invasive surgery has often been compared to open approach in non-inferiority studies. With improved visualization through magnification of target anatomy and ergonomics allowing more precise excision along critical resection margins, robotic approach may allow superior dissection and skeletonization of critical borders. Margins near the uncinate process along the superior mesenteric artery requiring dissection down to the adventitia is facilitated with the robotic instruments. Although resection margins are important for overall survival and locoregional recurrence, the ability to achieve R0 resection can be as low as 60% in some open cases14,58. One of the important findings of this meta-analysis is the improvement in resection margin in robotic PD. Patients with non-involved resection margins have improved overall survival as well as decreased locoregional recurrence risk in comparison to R1 resection14. The benefits of R0 resection is especially pronounced in patients with N0 disease14.

Previously, the benefits of robotic surgery for pancreatic cancer in terms of margin status have been reported12,45. With the high rate of locoregional failure, assessment of the circumferential margin of the Whipple specimen was re-defined in 200658. Verbeke et al. advocated a standardized protocol for margin assessment since circumferential margin positivity can be underestimated by as much as 60%59. Unfortunately, the method of margin quantification in the majority of these studies were not clearly defined based on the papers reviewed58. Furthermore, only two studies stated their adherence to the standardized Leeds Pathology Protocol (LEEPP) for margin assessment. Only two papers45,49 specified that the LEEPP protocol were followed. Nonetheless, Peng et al. performed a meta-analysis previously and showed improved margin status favoring robotic surgery over open surgery12,60. Within their findings, only 8 studies were included which discussed oncologic outcomes12. Kauffman et al. performed a propensity score matched analysis of robotic versus open PD and found equivalent rates of R1 resection45. The authors did comment, as speculated by many robotic surgeons, that following the peri-adventitial dissection plane close to the right side of the SMA, following early ligation of the inferior pancreaticoduodenal artery makes the retroperitoneal dissection easier. The retroperitoneal dissection plan is particularly efficient using the minimally invasive robotic approach45. In this meta-analysis, we confirmed the significant difference in resection margin involvement rates favoring robotic approach.

Similar to margin status, an increase in the number of lymph nodes harvested is frequently associated with improved staging and optimal resection margins17. In this study, we also identified that robotic PD has an increased number of lymph node harvested as compared to open. Previously, studies have shown both that total number of lymph nodes evaluated and a higher positive lymph node ratio to be superior in terms of oncological outcome. This meta-analysis is one of the first to present superior nodal sampling with the use of robotic surgery.

As expected, clinical outcomes favoring robotic surgery included significantly lower estimated blood loss, decreased incisional SSI rate, and lower length of hospital stay at the cost of longer operating time. These findings confirmed the results of previous meta-analyses. No significant difference in POPF, DGE, and reoperation rates was found.

One of the strengths of this meta-analysis is the number of studies and thereby number of patients included. Other strengths were prospective development and registration of the protocol, and rigorous literature search. This meta-analysis has several limitations. Given the observational nature, all included studies were subject to high risk of selection, performance, and detection biases. Moreover, all studies reported only short-term outcomes. The differences in surgical approaches and perioperative management across the globe may have contributed further to the heterogeneity and variance across the included studies. The lack of other histopathological details including and not limited to lymphovascular and perineural invasion adds additional heterogeneity. Another limitation was a lack of standardization in the definitions of interventions, a fact that may have contributed to the risk of performance bias.

Conclusion

This meta-analysis found that robotic PD was associated with improved resection margins and number of lymph node harvested as compared to open PD. Moreover, robotic PD allowed surgery with less blood loss and was associated with decreased wound infection rates and shorter length of hospital stay, at the expense of increased operating time and surgical cost. The current application of robotic PD needs further experimental and observational prospective studies given the possible benefits over open PD.