Introduction

Colorectal cancer (CRC) is one of the commonest cancers, associated with >1.2 million new cases and 600,000 deaths per year globally1. In the United States, CRC is the most frequent of gastrointestinal (including liver and pancreatic) cancers and leading cause of death in that group2. The liver is the primary site of CRC metastasis, with approximately 14.5% of all patients having developed liver metastases by 5 years following resection of primary tumor with curative intent3.

Liver metastases are also often present at first diagnosis of CRC, however they are no longer a contra-indication for surgery. Recent advances in surgical techniques and experience, together with chemotherapeutic regimens have improved median survival for selected patients with colorectal cancer liver metastases (CCLM) to between 35 and 58% at 5-year following liver resection4,5,6,7, which remains the only prospect of long-term survival4,8,9,10. The established operative approach to CRC metastases is open liver resection (OLR). However, since the introduction of laparoscopic liver resection(LLR) by Gagnerand colleagues in 199211, minimally invasive techniques have found increasing use in the operative management of benign and malignant liver lesions, including CCLM12. Several large randomized trials13,14,15 have demonstrated oncological equivalence of laparoscopic versus open resection for the primary tumor in CRC, with reduced postoperative length of hospital stay (PLOS). However, doubts remain about LLR for CCLM due to its technical complexity, the risk of uncontrollable bleeding and gas embolism16. Questions also remain in relation to adequacy of surgical margins, port-site metastases, and peritoneal spread17,18.

Advances in laparoscopic equipment, techniques, and increasing surgeon experience have meant that outcomes after LLR for benign and malignant primary liver lesions are now equivalent or better than those for OLR19,20,21. As a result, the number of LLRs performed worldwide has increased exponentially in recent years. With regard to CCLM, several retrospective studies22,23,24,25,26 have been published demonstrating similar oncological outcomes between LLR and OLR, but LLR is consistently associated with less blood loss, reduced postoperative complications, shortened PLOS, fewer cases of disease recurrence and lower mortality. These studies were all conducted in specialist centers, however, and questions remain regarding selection bias27. Nevertheless an international panel of expert surgeons recently stated that LLR was appropriate in the treatment of CCLM28,29.

To date, no randomized controlled trials have compared oncological outcomes of LLR and OLR. There have been several meta-analyses30,31,32,33,34 consistently showing that LLR improved intra- and post-operative outcomes, but which were inconclusive on oncological outcomes. To address this issue, we conducted the most comprehensive systematic review and meta-analysis using the GRADE system to assess the quality of individual studies and produce the most rigorous analysis of LLR versus OLR in CCLM thus far.

Results

Description of included trials in the meta-analysis

The search strategy initially generated 421 clinical trials; no randomized clinical trials were identified. Figure 1 details the process of selecting comparative studies using the PRISMA statement for meta-analyses. Initial screening of abstracts led to the exclusion of 383 articles. Besides, six were further excluded by a close scrutiny of remaining articles: 135 only included OLR, 3 duplicates24,36,37, 138 did not report outcomes of interest and 139 used radiofrequency ablation. Finally, 32 non-randomized comparative studies were included in our meta-analysis20,26,27,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68.

Figure 1
figure 1

Study selection flow chart according to PRISMA statement.

Study and patient characteristics

The characteristics of included studies are shown in Table 1. A total of 4697 patients were included: 1809 patients in the LLR and 2888 patients in the OLR group, respectively. The quality assessments of the included studies are given in Supplementary Table S1. The level of GRADE was made according to the GRADE working group recommendations69, are summarized in Supplementary Table S2. The sample size of the included studies varied from 14 to 1152 patients. The rate of open conversion ranged from 0% to 15.8%.

Table 1 Characteristics of the included studies.

Meta-analysis results

Results of individual analyses are shown in Fig. 2 and Supplementary Fig. S1.

Figure 2
figure 2

(A) Forest plot of negative surgical margin between two groups. (B) Forest plot of recurrence between two groups. (C). Forest plot of 3-year OS between two groups. (D) Forest plot of 5-year OS between two groups. (E) Forest plot of 3-year DFS between two groups. (F) Forest plot of 5-year DFS between two groups.

Primary outcomes: LLR had higher rates of clear resection margins (R0) compared to OLR in the 24 studies reporting this outcome (n = 4152; OR: 1.64 [1.32 to 2.05], p < 0.00001; I2 = 30%; GRADE: moderate). However, we did not find any statistically significant differences in tumor recurrence (n = 1850; OR: 0.86 [0.70 to 1.06], p = 0.16; I2 = 50%;moderate), 3-year OS (HR: 0.90 [0.78 to 1.04], p = 0.15; moderate) and 5-year OS (HR: 0.94 [0.82 to 1.07], p = 0.32; moderate), or 3-year DFS (HR: 0.89 [0.78 to 1.00], p = 0.05;moderate) and 5-year DFS(HR: 0.97 [0.85 to 1.12], p = 0.69; moderate). There was no heterogeneity among studies reporting OS and DFS (all I2 = 0).

Secondary outcomes: 24 studies reported mean operation time, demonstrating longer operative times on average in the LLR group (n = 2441; WMD: 14.44 min [1.01 to 27.88]; p < 0.00001; I2 = 71%; very low); 26 studies estimated overall less blood was lost intra-operatively in LLR compared to OLR (n = 2700; WMD: −147.46 mL [−195.78 to −99.15]; p < 0.00001; I2 = 91%; very low); furthermore, the rate of blood transfusion was lower in the LLR group in 15 reporting studies (n = 1807; OR: 0.41 [0.30 to 0.58], p < 0.00001; I2 = 0%; moderate); PLOS, reported in 26 studies, was shorter in LLR patients(n = 3735; WMD: −2.36d [−3.06 to −1.66]; p < 0.00001;I2 = 78%; very low); similarly, patients undergoing LLR experienced less overall morbidity (30 trials; n = 4197; OR: 0.64 [0.55 to 0.75]; p < 0.00001; moderate), but no statistically significant difference in mortality (28 trials; n = 4277; OR: 0.98 [0.58 to 1.70], p = 0.98; moderate).There was no heterogeneity among studies reporting overall morbidity and mortality (both I2 = 0).

Sensitivity and subgroup analysis

Results from sensitivity and subgroup analyses are summarized in Table 2.

Table 2 The result of subgroup and sensitivity analysis.

All subgroup and sensitivity analyses did not reveal significant changes of 5-year OS or 3- and 5-year DFS, apart from that tumor recurrence was reduced in studies with sample size >50 or performed in Eastern countries and 3-OS was improved in studies using LLR assisted with other modalities. Analyses for studies with propensity score matching, cases >50 and LLR assisted with other modalities did not change primary meta-analysis results or statistical heterogeneity for negative surgical margin. However, this outcome measure was not significant anymore when in subgroup analyses of studies with higher quality or studies with simultaneous colorectal and liver resection or studies in Eastern countries.

Meta-regression analysis

None of the included covariates had any significant impact on heterogeneity (Supplementary Table S3).

Publication bias

The funnel plots were based on the 3-, 5-year OS and overall morbidity, which is shown in Fig. 3. As no study lies outside the limits of the 95%CI, there was no evidence of publication bias.

Figure 3
figure 3

Funnel plots for publication bias. The funnel plot revealed no publication bias. (A) 3-OS; (B) 5-OS; (C) overall morbidity.

Discussion

This study provides the most robust evidence to date that LLR is a viable alternative to OLR in the treatment of CCLM in select patients. Using the GRADE system of qualifying evidence, we can report with relative confidence that post-operative mortality was equivalent in both groups, but overall morbidity was almost halved by LLR. Importantly, there were also no statistically significant differences in recurrence rates, 3- and 5- year OS or DFS. If at all, there was a trend towards a benefit from LLR in our analysis. This is entirely supported by our finding that there were a greater proportion of clear resection margins (R0) in the LLR group. While it is theoretically possible that operative factors such as magnification or extra caution when using LLR techniques indeed led to an improved resection rate, it is more likely that this represents selection bias. Therefore, it is probable that larger, more aggressive tumors were either not even attempted or converted to OLRs and, thus, artificially enhancing the pool of OLRs with high-risk malignancies.

Indeed, there is currently no consensus in the literature as to which patients with CCLM are suitable for LLR, although LLR has been performed for all liver segments in this context. Small tumors in the left lateral segments of the liver70,71 and patients with limited tumor burden (2 metastases or fewer)33 have been reported as advantageous in LLR. Similarly, patients with tumors involving the inferior vena cava, left or right portal veins, roots of any of the hepatic veins, or patients with multifocal or bilobar tumors are not good candidates for a minimally invasive liver resection. As all studies included in our analysis were non-randomised comparison studies, and the LLR groups had fewer numbers of tumor resected, this is further indicative of selection bias.

Our analysis agrees with previous meta-analyses that LLR is associated with longer surgery (by around 15 minutes), less blood loss (about 150 mL) and shorter PLOS (around 2 days). Interestingly, GRADE analysis rates the quality of this conclusion as ‘very low’, based on very high heterogeneity between studies, higher than expected variances in the sample populations and unmeasured confounding factors. One of the likely sources of significant heterogeneity between studies is the constantly and rapidly evolving nature of LLR. Modifications of equipment and technique, such as the use of intraoperative ultrasonography, ultrasonic dissection, microwave coagulators, endoscopic linear staplers, laparoscopic CUSA and vascular staplers, have simplified LLR and addressed concerns about major intra-operative hemorrhage72,73,74,75. Due to these methological difficulties as well as the modest improvements in the outcome parameters it would be unreasonable to offer one type of surgery over the other on this basis alone.

Based on our GRADE analysis, one secondary outcome stands out as being both statistically significant, consistent amongst studies and potentially clinically significant. People in the LLR were less than half as likely as those in the OLR group to receive a blood transfusion in the perioperative period. This could be a combination of a lower intra-operative blood loss as well as reduced risk of abdominal wall/muscular bleeding in the post-operative period. This is potentially of clinical significance due to the ongoing debate whether blood transfusions (through immunogenicity or otherwise) contribute to worse long-term survival in colorectal cancer76,77,78,79.

Our subgroup and sensitivity analysis supports our conclusions. Propensity score matching allowed us to take known confounders into account80, considered to approach the accuracy of a randomized controlled trial, however reliant on knowing the confounders81. Nevertheless, the importance of all disease- and patient- relevant confounders on the choice of surgery in CCLM can only be determined in a well-designed randomized controlled trial, insisting on homogeneity of tumor characteristics, operative technique and use of adjuvant therapy.

Based on current evidence, LLR is at least as safe as OLR in the treatment of CCLM in specialist centers and has the potential to significantly reduce morbidity in this population.

Methods

Literature search and study selection

PubMed (Medline), EMBASE and Science Citation Index Expanded and Cochrane Central Register of Controlled Trials (CENTRAL) in the Cochrane Library were searched systematically for all articles published as full papers in the English language from January 1992 to October 2016 comparing LLR and OLR for CCLM. The following medical search headings (MeSH) and keywords were used: “laparoscopy” or “laparoscopic” or “robotic” or “robot-assisted” or “minimally invasive surgery” and “hepatectomy” or “liver resection” or “hepatic resection” and “colorectal cancer” or “colorectal neoplasm” or “colorectal liver metastases”. Reference lists of selected articles were also examined to find relevant studies which were not identified during the initial database searches. The database searching was supplemented with manual searching for reference lists of obtained articles, unpublished studies, and conference abstracts. We contacted the authors for full-text or original data of their investigations where required. Final inclusion of articles was determined by consensus from two authors (S.M.X. and J.J.X.); when this failed, a third author adjudicated (X.T.L.).The whole process of this study followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement82.

Inclusion and exclusion criteria

Three authors (S.M.X., J.J.X. and X.T.L.) identified and screened the aforementioned databases for potentially eligible studies.

Inclusion criteria

(1) adult patients with CCLM; (2) clear documentation of the operative techniques as OLR or LLR with either hand-assisted, robot-assisted, or hybrid; (3) studies with at least one of the outcomes of interest mentioned; (4) where multiple studies came from the same institute and/or authors, either the one of higher quality or the most recent publication was included in the analysis.

Exclusion criteria

(1) abstracts, letters, editorials, expert opinions, case reports, reviews and studies lacking control groups; (2) studies with no clearly reported outcomes of interest; (3) studies including patients with other types of malignant liver tumors, (4) non-comparative studies, using only OLR or LLR; (5) animal studies (6) study with radiofrequency energy was used.

Outcomes of interest

Primary outcomes: negative surgical resection margin, recurrence, 3-and 5-year overall survival (OS), 3- and 5-year disease-free survival (DFS). In some cases, 5-year OS or DFS included these patients with a follow-up between 3–5 years. Secondary outcomes: operative time, intraoperative blood loss and need for blood transfusion; overall morbidity, mortality and PLOS.

Data extraction and quality assessment

Data were extracted by three independent observers (S.M.X., J.J.X. and X.T.L.) using standardized forms. The recorded data included patient and study characteristics and surgical details. For non-randomized controlled studies, a modification of the Newcastle-Ottawa Scale (NOS)83,84 was used for selection, comparability and outcome assessment. Studies valued more than four (of six) stars were recognized as being moderate to high quality.

Statistical analysis

Meta-analysis was performed using Review Manager Version 5.0 software (The Cochrane Collaboration, Oxford, UK). For continuous and categorical variables, treatment effects were expressed as weighted mean differences (WMD) and odds ratios (OR) with corresponding 95% confidence intervals (CI), respectively. For survival analysis, we extracted data from survival curve referring to method reported in previous study, and hazard ratios (HR) were used for quantitative analysis85. An HR of < 1 represented a survival benefit favoring the LLR group and p values < 0.05 indicated statistical significance. Medians were converted to means using a previously described methodology86. Heterogeneity was assessed by I2 with p < 0.1 taken as significant87. An I2 value of < 25% was defined to denote low heterogeneity, a value between 25 and 50% was defined as moderate heterogeneity and a value of >50% was considered to be of high heterogeneity. The fixed-or random-effects model was used as appropriate88. Subgroup and sensitivity analyses were undertaken by only including studies with high quality, propensity score matching, case matching, sample size >50, conducted in Eastern countries, simultaneous colorectal and liver resection, and LLR assisted with other modalities. Meta-regression analyses assessed impact of publication year, sex, age, study design and tumor size on summary estimates using Stata SE Version 13 Software (StataCorp LP, Texas, USA); P < 0.05 was considered significant. Funnel plots were constructed to evaluate potential publication bias based on the 3-, 5-year OS and overall morbidity89.