Introduction

Colorectal cancer is the third most common type of malignancy and ~43,030 rectal cancer cases are newly diagnosed annually in the United States1. The widespread use of total mesorectal excision combined with multimodality treatments has improved oncologic outcomes, but the rates of distant metastasis after long-term follow-up remains high, at 20–35%2,3.

Following a landmark German trial4, recent National Comprehensive Cancer Network (NCCN) and European guidelines have recommended preoperative chemoradiotherapy (CRT) plus surgical resection as the standard treatment for locally advanced rectal cancer5,6. After completion of transabdominal surgery, adjuvant chemotherapy (ACT) has generally been recommended5. However, the clinical efficacy of the postoperative strategy has been questioned in the era of neoadjuvant CRT. The rationale for ACT has been extrapolated from the results of colon cancer cases, as well as an earlier meta-analysis that reported improved prognosis with ACT mainly in the context of upfront surgery followed by postoperative treatment7. According to the 5-year results of the European Organization for Research and Treatment of Cancer (EORTC) 22921 trial, the use of ACT improved disease-free survival (DFS) and overall survival (OS) in cases with intermediate down-staging due to CRT8. However, a long-term analysis of 10-year data failed to show similar survival differences3. Given the lack of randomized evidence, the indications for ACT in rectal cancer clinics are controversial9,10,11,12,13.

A pathological complete response (pCR), coded ypT0N0, is importantly associated with favorable prognosis in rectal cancer. Since a pCR is seen in only a small percentage (10–20%) of rectal cancer patients14,15, no prospective study designed to assess the survival benefits of ACT in these individuals has been conducted. Here, we performed a systematic review and meta-analysis to evaluate the impact of ACT on the survival of rectal cancer patients who achieved ypT0N0 status. The OS outcomes of two groups, with and without the postoperative use of chemotherapy, were compared. Given the paucity of randomized trials appertaining to pCR status, the present pooled analysis provides clinical insights into the role of ACT in patients who achieved remarkable tumor eradication following CRT.

Methods and Materials

Literature search strategy

A systematic search of electronic databases was conducted to identify studies that analyzed OS in locally advanced rectal cancer patients treated with ACT. The search process followed the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines. The outcomes of the present meta-analysis were reported according to the Meta-analysis of Observational Studies in Epidemiology (MOOSE) criteria (Supplementary Materials Appendix 1)16. Published articles that compared ACT and non-ACT groups of rectal cancer patients treated with neoadjuvant CRT plus surgery from January 1990 to September 2018 were identified by searching the EMBASE, PubMed, Ovid, Web of Science, and Cochrane Library databases. Hand searches were also performed to identify other potentially eligible studies, but no additional studies were added. The following keywords, synonyms, and combinations thereof were used as search terms: [rectum] OR [rectal]; [cancer] OR [carcinoma]; [postoperative] OR [adjuvant]; [chemotherapy]; [preoperative] OR [neoadjuvant]; [chemoradiotherapy] OR [chemoradiation] (Supplementary Materials Appendix 2). No restriction on study design was considered.

Study selection

Two independent authors (YJL and MK) performed the searches and assessed study eligibility. Studies comparing OS between postoperative ACT (intervention group) and observation alone (comparator group) in pCR patients were selected. Exclusion criteria applied during the selection process were as follows: (1) conference abstracts; (2) unstructured papers, such as editorials, comments, and letters; (3) case reports and review articles; (4) studies not reporting the survival outcome of pCR patients; (5) lack of information regarding a comparator group; and (6) insufficient OS data to extract hazard ratios (HRs) and 95% confidence intervals (CIs). The name of the institution or database included in the final set of eligible studies was reviewed. When multiple studies were based on the same data, the one with a longest-duration study period and the largest number of patients was selected. The study selection process was verified independently by a third investigator (YK).

Data extraction

The data were extracted independently by two authors (YJL and MK). When discrepancies occurred, the authors discussed to reach a consensus. Some authors of potentially eligible studies were contacted via e-mail to request required data, and one study replied17.

Risk of bias assessment

A risk of bias assessment was conducted independently by two authors (YJL and MK). Since all of the included studies were non-randomized observational studies, the Risk of Bias Assessment tool for Nonrandomized Studies (RoBANS) was used to assess the following six domains: the selection of participants; confounding variables; intervention measurement; blinding of the outcome assessment; incomplete outcome data; and selective outcome reporting18. Regarding potential discrepancies between the two authors, a consensus was obtained after further review and discussion. In addition, a third investigator (YK) verified the results.

Statistical analysis

The primary outcome of interest was OS. The HRs and 95% CIs from each study were either extracted directly from original papers or calculated using Kaplan-Meier OS curves based on the method of Tierney et al.19. HRs were calculated using a random-effects model with the inverse variance method. Cochrane Q tests and the I2 index were used to evaluate heterogeneity. Funnel plots with Egger’s regression tests were used to examine publication bias. An additional stratified analysis comparing results from single institution and multicenter studies was performed. RevMan software (ver. 5.3; Cochrane Collaboration, Copenhagen, Denmark) was used for all pooled analyses.

Results

Study characteristics

A flowchart of the study selection process is shown in Fig. 1. Among the 3,021 studies identified initially, the titles and abstracts of 1,838 studies were screened. Irrelevant articles, in terms of their structure or content, were excluded, and 95 manuscripts were reviewed. Further screening then identified 16 studies with sufficient OS data on a potentially eligible pCR population. Of these, three studies that had duplicate data (i.e., from the same institution) but shorter follow-ups, were excluded. Since four different National Cancer Database (NCDB)-based studies overlapped in terms of study period, we included all studies and separately conducted pooled analyses on each of them. A total of 13 studies were eligible17,20,21,22,23,24,25,26,27,28,29,30,31.

Figure 1
figure 1

Flowchart of the study selection process.

Table 1 lists the characteristics of included studies. We investigated five single-institution17,20,21,22,23 and four multicenter studies24,25,26,27, and four NCDB analyses (n = 3,182, 3,330, 3,575, and 4,739 for pooled analysis sets I, II, III, and IV defined as datasets including Dossa et al., Polanco et al., Xu et al., and Shahab et al., respectively)28,29,30,31. The time interval between CRT and surgery was typically 1–3 months, and the total radiation dose was 45–50 Gy. Although the details of the chemotherapy regimens used were not included in the NCDB-based studies, most other studies reported fluoropyrimidine-based regimens. Most patients underwent total mesorectal excision. The summarized results regarding risk of bias revealed that the overall quality of the included studies was relatively high (Table 2).

Table 1 Characteristics of 13 studies for the present pooled analysis.
Table 2 A summary of risk of bias assessment using the Risk of Bias Assessment Tool for Non-randomized Studies (RoBANS).

Comparison of OS with and without ACT

Individual forest plots of OS data were generated for the four different NCDB-based studies (pooled analysis sets I, II, III, and IV, Fig. 2A–D). Although a trend toward better OS with ACT was observed, statistical significance was not consistent in the different sets of analyses (HR 0.72, 95% CI 0.49–1.05, P = 0.09; HR 0.71, 95% CI 0.51–0.97, P = 0.03; HR 0.72, 95% CI 0.48–1.09, P = 0.12; and HR 0.76, 95% CI 0.55–1.05, P = 0.10 for pooled analysis sets I, II, III, and IV, respectively). No significant heterogeneity was observed (I2 = 19%, P = 0.27; I2 = 3%, P = 0.41; I2 = 21%, P = 0.25; and I2 = 0%, P = 0.57 for pooled analysis sets I, II, III, and IV, respectively).

Figure 2
figure 2

Overall survival comparing adjuvant chemotherapy and observation alone in patients with a pathologic complete response. Forest plots of pooled analysis sets (A) I, (B) II, (C) III, and (D) IV, including four different NCDB-based studies of Dossa et al., Polanco et al., Xu et al., and Shahab et al., respectively. HR: hazard ratio; CI: confidence interval; ACT: adjuvant chemotherapy. *Four NCDB-based studies.

Stratified analysis

To compare the results from single institution and multicenter studies, a stratified analysis was performed. Using the pooled analysis set III, which included the study of Xu et al., ACT did not result in significant survival benefits in either multicenter (HR 0.68, 95% CI 0.42–1.11, P = 0.12) or single-institution (HR 0.90, 95% CI 0.38–2.14, P = 0.81) studies (Fig. 3). Other subgroup results based on pooled analysis sets I, II, and IV also showed comparable results (data not shown).

Figure 3
figure 3

Representative results of stratified analysis comparing results from single institution and multicenter studies using the pooled analysis set III, including the NCDB-based study of Xu et al. HR: hazard ratio; CI: confidence interval; ACT: adjuvant chemotherapy.

Publication bias

There was no publication bias in the overall pooled results (P = 0.167, P = 0.210, P = 0.225, and P = 0.365 for pooled analysis sets I, II, III, and IV, respectively) (Supplementary Fig. 1). No significant asymmetry was observed between the subgroup results of the multicenter and single-institution studies (data not shown).

Discussion

The present meta-analysis reviewed multicenter analyses and single institutional series to evaluate whether rectal cancer patients who achieved a pCR after preoperative CRT could benefit from ACT. Despite a trend toward better survival with ACT, statistical significance was not consistent in the different sets of overall pooled analyses. The stratified analysis comparing the results from single institution and multicenter studies failed to show a significant prognostic benefit. This study provides an updated perspective on the optimal postoperative strategy in cases with a marked treatment response after neoadjuvant CRT.

Based on ambiguous guidelines, patients with more advanced disease and a poorer response to CRT are potential candidates for ACT. A higher ypT and/or N stage are indicative of unfavorable tumor biology, which highlights the need for aggressive postoperative treatment to eradicate any residual tumor burden. Fietkau et al. failed to demonstrate any benefit of ACT in ypN0 patients (P = 0.993 for 3-year DFS), suggesting the need for more intensive systemic management of ypN2 patients with a poorer prognosis12. Another pooled analysis of the EORTC 22921 trial and an Italian trial also showed no favorable effect of ACT in patients showing a pathological response (HR of death and recurrence or death [95% CI]: 0.96 [0.75–1.23] and 0.95 [0.73–1.23], respectively)13. In contrast, ACT has often been recommended in down-staged patients after CRT. Based on the responsiveness to preoperative cytotoxic treatment, eradicating micrometastatic disease and further beneficial effects arising from ACT can be expected. A meta-analysis of relevant trials showed that the 5-year OS was significantly higher following ACT in patients with ypT0-2N0M0 (i.e., down-staged) (odds ratio 0.57, 95% CI 0.38–0.85)10. In a pooled analysis including individual patient data from the I-CNR-RT, PROCTOR-SCRIPT, EORTC 22921, and CHRONICLE trials, unlike the ypII subgroup (HR [95% CI] of OS, 0.87 [0.65–1.18]), poor responders in the ypIII stage did not show outcomes that favored the use of ACT (HR [95% CI] of OS, 1.09 [0.86–1.38])32.

A pCR, characterized as maximal down-staging that can be achieved by CRT, yields a high 5-year survival rate of ~85–90%33,34,35. The lack of prospective studies to evaluate the effect of ACT is because a pCR is only achieved in ~10–20% of all rectal cancer patients14,15. Additionally, treatment adherence to ACT in this subpopulation is mostly poor given the possibility of treatment-related toxicity, financial burden, and a patient or clinician’s preference for less aggressive treatment36. In this clinical setting, a meta-analytic approach with a larger number of patients can help detect a small or absent treatment effect.

The results from this meta-analysis suggest there is no clear evidence to support survival benefits of ACT for pCR patients. The trend toward a favorable prognosis with ACT was mainly based on NCDB studies, and other exploratory analyses excluding the population-based data failed to obtain any reproducible results. Although a recent meta-analysis from a Chinese group suggested the potential survival benefit of ACT in pCR patients37, only one NCDB study of Polanco et al.29. was included without considering the other three NCDB studies28,30,31. Specifically, the propensity score matching method used by Polanco et al. may have produced additional selection bias due to limited patient information, such as underlying morbidity or immediate mortality, within a few months after completion of treatment. Chang also noted that the survival benefit of a certain treatment can be overestimated in the statistical setting of NCDB data38. Therefore, we believe that the trend toward better survival in the ACT group, with the underlying predominance of one NCDB-based study, should be interpreted with caution.

The study of Maas et al. analyzing individual-level data of pCR patients from 13 institutions could not prove any benefits of ACT in terms of survival or tumor recurrence (HR 0.94, 95% CI 0.53–1.69 for DFS)27. Multicenter studies from Taiwan, Italy/Spain, and Australia24,25,26 failed to identify a survival difference, as did a recent multi-institutional study from Korea (n = 118 for pCR, from personal communication)39. The I-CNR-RT trial could not analyze outcomes of pCR patients due to the low incidence of mortality events2. To date, there is little randomized evidence to demonstrate the survival benefit of ACT in this patient population.

To assess the clinical value of ACT, potential adverse effects also need to be considered. Among the 13 studies, only two reported ACT-related complication events20,26. In the study of Tay et al., 13% of study population stopped ACT due to significant toxicities, such as diarrhea, nausea, and vomiting26. Time to closure of ileostomy was longer in patients with ACT, suggesting inferior quality of life and related medical morbidities20. Although life-threatening complication is not common (~1–2%), grade 3–4 acute toxicities were reported in more than one third of patients after the use of multi-agent ACT regimens40,41. Regarding the inconsistent survival outcomes in the present pooled analyses including four different NCDB-based studies, it is questionable whether the therapeutic benefit of ACT is sufficiently expected to risk treatment-related toxicities and quality-of-life problems in patients with a marked response to CRT.

This study had several limitations. First, all of the included studies were observational and inevitably suffer from confounders, such as selection bias and heterogeneity in sample characteristics and treatments42. Of the included studies, the method of Tierney et al.19 to extract HR and 95% CI data was used in 5 studies20,22,23,25,26. The differences between the data indirectly extracted from survival curves and the original data can induce additional bias. Due to insufficient data, the effect of ACT on DFS or treatment-related toxicities was not evaluated in the present meta-analysis. Most of the studies included herein did not account for the presence of underlying comorbid illnesses or postoperative complications. Use of 5-fluorouracil, 5-fluorouracil/leucovorin (FL), and capecitabine chemotherapy was reported in most studies, but it remains unclear whether the current results are also applicable to other regimens, such as multi-agent folinic acid/fluorouracil/oxaliplatin (FOLFOX) and folinic acid/fluorouracil/irinotecan (FOLFIRI). Nevertheless, this large-scale meta-analysis provides clinically useful insights into the prognostic role of ACT in this small-sized population who achieved a pCR.

This meta-analysis could not warrant the survival benefits of ACT in patients who achieved a pCR, suggesting that routine use of ACT should not be recommended in this subset of patients with rectal cancer. Further pooled analysis of individual patient data from existing randomized trials is needed to establish guidelines for ACT in conjunction with contemporary neoadjuvant treatments.