Background

Large-scale population-based studies in developed countries, such as EUROCARE,1 indicate that ‘major advances in cancer management seem to have resulted in improved survival’ in many cancer types. These data are informative for policy-makers seeking information on net survival improvements but generally lag behind contemporary management strategies (for example, EUROCARE reports only to 20071); focus mainly on common cancers; and generally fail to capture detailed treatment and stage information, necessary to interpret whether survival improvements reflect introductions of new treatments or stage migrations.

In contrast, patients and oncologists generally seek to understand prognosis, namely the chance of surviving from a specific cancer, in the context of contemporary treatment options.2 For trialists, there is an additional need to forecast expected number of events based on current standard of care. But a new problem is emerging in trials—namely that outcomes from contemporary standard arm management exceed expectations (compared with historical literature). Thus, trials reach target recruitment but findings appear to lack power.3 This issue is exemplified in recent non-oncology (ARRIVE4) and oncology (COLOFOL5 and ROLAAR6) trials.

Here, we address the above problem in the setting of an uncommon cancer, namely squamous cell carcinoma cancer of the anus (SCCA), treated with chemo-radiotherapy (CRT). The latter is standard of care in many countries as reflected by guidelines, for example, from NCCN,7 ESMO-ESSO-ESTRO,8 and ACPGBI.9 Approximately three-quarters of patients with SCCA receive CRT as initial treatment.10 Through systematic review,11 we recently reported on 45 studies of patients with SCCA who received either radiotherapy alone (RT) or CRT and noted that 5-year overall survival increased from a mean estimate of 64% in 1980 to 75% in 2010 (p = 0.046). It is conceivable that this temporal improvement might be driven by improvements in loco-regional control, but might also be due to unmeasured factors, such as general improvement in healthcare, centralisation, improved imaging and radiotherapy delivery, and more effective management of toxicity. It might also reflect early tumour stage at presentation or younger mean age at diagnosis.

In this study, we confirmed the observation of significant temporal improvement in survivals and aimed to use these striking temporal changes to derive models to estimate contemporary outcomes.

Methods

Patients

We performed a treatment-cohort analysis, using a prospectively maintained clinical database of patients with SCCA treated at the Christie NHS Foundation Trust, Manchester, United Kingdom, seen between 1 January 1990 and 31 December 2014, and followed to 30 April 2018. The Christie anal cancer multi-disciplinary team (MDT) meeting was centralised for the Greater Manchester and North Cheshire geographical areas (approximate 1.8 million) in 2007. From 2004, pre-treatment HIV testing was performed selectively (for example, untested male homosexual men).

Patients were included if they had histologically confirmed squamous cell carcinoma arising from the anal canal or margin treated with CRT with curative intent. For sensitivity analyses, patients treated curatively with RT alone were added. Standard clinical, pathological and treatment-related variables were collected, as previously published.12 We recognised a change in pre-treatment staging assessment through the study period and categorised this as follows: 1990 to 2003 assessment was physical examination and CT imaging; 2004 to 2010 assessment added MR imaging;13 and from 2011 to 2014, assessment additionally added Fluoro-Deoxy-Glucose Positron Emission Tomography/Computed Tomography. TNM staging was in accordance with the American Joint Committee on Cancer (AJCC) staging 7th Edition.14

Treatment

From 1990 to 2001, a split ACT I15 radiotherapy regimen was prescribed and described elsewhere.12 After 2001, the treatment protocol followed that used in the ACT II trial16—namely, radiotherapy of 50.4 Gy was delivered over 5.5 weeks with a two phase technique, without a mandatory break. Phase 1 included 30.6 Gy in 17 daily fractions with non-conformal rectangular parallel-opposed fields. Phase 2 required conformal planning and delivered 19.8 Gy in 11 daily fractions over 15 days to the primary tumour with a 3 cm margin and any involved lymph nodes. From 2005, we reported median duration of radiotherapy treatment.

Chemotherapy regimens were administered concurrently with radiotherapy as either: mitomycin-C (MMC) 12 mg/m2 on day 1, and continuous infusion of 5-fluorouracil (5-FU) 1000 mg/m2 on days 1–4 and days 29–32 or cisplatin (60 mg/m2 on days 1 and 29) with 5-FU (as above), the latter regimen as part of the ACT II trial16 (2001–2008). The selection to RT or CRT was randomized as part of the ACT I trial15 until 1994. Thereafter, selection for RT was the exception, and based on contra-indications to the use of CRT, typically co-morbidities or increasing age.

Follow-up and outcomes

Since 2004, post-treatment follow-up was typically clinical assessment at 6 weeks after completion of CRT and again at clinical visits paralleling the 3- and 6-month MR scans.13 From 6 to 60 months, patients were assessed clinically on a six-monthly basis and imaging follow-up based on risk of local relapse—in patients deemed at high-risk for local relapse (T size > 5 cm; AJCC 7th Edition N2 and N3 disease; incomplete RT or CRT), MR scans were generally performed at 12, 18, 24 and 36 months; in the remainder (low-risk), MR scans were performed at 36 months. Prior to 2004, surveillance was by clinical examination.

For this analysis, the primary outcomes were 3-year loco-regional failure (LRF); 5-year overall survival (OS); and 5-year cancer-specific survival (CSS). These are CORMAC17 core outcome measures. Time-to-events were from the date of start of first treatment. LRF was defined as the presence of either residual or recurrent disease within the inguinal/pelvic anatomic sites. OS was defined as the period of time until death from any cause; CSS was defined as the period of time until death from anal cancer.

Statistical analysis

All statistical analyses were performed using Stata software, Version 14 (Stata Corp, College Station, TX, USA). The main analysis was based on patients receiving curative CRT; sensitivity analyses included all patients treated with curative intent—namely CRT and RT, over the time period. In order to test for a period effect, we divided the cohort into five groups of five-year intervals, spanning the 25-year study period. Differences in baseline characteristics across the five periods were explored using the Cuzick’s non-parametric test and the Cochran-Armitage test for trends (2 × n tables) as appropriate. For cT stage, we used ordinal regression to account for the multinomial stage proportions and examine whether overall stage distribution and stage-specific proportions changed significantly.

To derive predicted contemporary (2020) estimates, we used a two-stage approach. First, we assessed for key confounders in this cohort and evaluated the associations between patient and tumour factors with the three outcomes of 3-year LRF; 5-year OS; and 5-year CSS. We derived Kaplan–Meier (K–M) estimates and then performed univariable and multivariable analyses using Cox models, adjusted for year of treatment. Proportionality assumptions were tested using Schoenfeld residuals.

Second, we sought to relate changes in key outcomes with study periods. For this analysis, we estimated the three outcomes using K–M methods, in two-year bands (except the first 3 years, due to small sample size), and related these over time using regression models, weighted for period sample size. Initial exploration revealed that linear models might predict implausible outcomes (for example, greater than 100% survivals). Therefore, non-linear splines were used. A range of cut-off points from years 2000 to 2010 were tested as pivots for each scenario. The optimal cut-point was determined based on three criteria: (i) visual inspection of plots; (ii) lowest AIC (Akaike Information Criteria) value per model; and (iii) clinically plausible coefficients. For example, if LRF rates were declining (negative regression coefficient), we rejected models where the regression coefficient ‘right’ of the cut-point was positive. Once the optimum regression spline model was determined, we used it to predict options to extrapolate estimates with 95% confidence intervals (95% CIs) for 2020. We additionally tested for the presence of competing risk of death bias by visually comparing the predictions for 5-year OS vs. 5-year CSS over time.

Finally, once we established the optimal regression models, we superimposed the equivalent estimates for the three primary outcomes from the six reported trials15,16,18,19,20,21 of CRT in patients with SCCA, and visually inspected for model fit.

Results

Baseline characteristics

Between 1990 and 2014, there were 1040 referrals to the anal cancer MDT of which 930 (89%) were primary SCCA. Numbers of patients with SCCA doubled from 134 in 1990–94 to 266 in 2010–14. Of the 930 patients, there were 701 patients treated with curative intent, with either RT (N: 141) or CRT (N: 560) (Fig. 1). The proportions treated by curative intent remained steady (at approximately 80%) across the five time intervals (lower panel in Fig. 1). Median radiotherapy duration was 37 (IQR: 37–38) days. The proportion of patients with incomplete radiotherapy (<32 days) was 2.7%; the proportion with clinically-relevant delayed delivery of radiotherapy (≥42 days) was 8.0%. There were no differences across time periods from 2005.

Fig. 1
figure 1

Flow diagram of The Christie squamous cell carcinoma of the anus (SCCA) series, 1990–2014.

Table 1 details the baseline characteristics by time periods for the 560 patients undergoing CRT. Women accounted for two-thirds of the cohort. Median age was 60 years and was stable across the study periods. The proportions of cT1 to cT4 stages remained remarkably stable across the study periods (all Ps > 0.05). By contrast, nodal positivity increased from 17% in the first study period to 41% in the last study period (P < 0.001). The baseline characteristics for the 701 patients undergoing either RT or CRT with curative intent are detailed in Table S1. The proportions and trends with time are very similar to those for CRT alone.

Table 1 Baseline characteristics in 560 patients with SCCA treated by chemo-radiotherapy at The Christie 1990–2014, by study period.

Locoregional failure

With a median follow-up of 61 (IQR: 36–79) months, there were 119 LRFs among the 560 patients who underwent CRT. The 3-year LRF rate was 33% in the first study period (1990–1994), declining to 16% in the last study period (2010–2014) (Ptrends < 0.001) (Table 2). LFR rates were higher among men compared with women, even after excluding HIV positivity patients (who were all men except one) (p = 0.003), and higher with increasing cT stage (cT3 v cT2, p = 0.004; cT4 v cT2, p < 0.001). Anatomic site, tumour differentiation and nodal positivity were not associated with LRF.

Table 2 Cox models for loco-regional failure (LRF) in 560 patients with SCCA treated with chemoradiotherapy, The Christie 1990–2014.

Survival

There were 230 deaths among the 560 patients who underwent CRT. The 5-year OS rate was 60% in the first study period (1990–1994), increasing to 76% in the last study period (2010–2014) (Ptrends = 0.001) (Table 3). OS rates declined with increasing cT stage (cT4 v cT2, p < 0.001), node positivity (p = 0.012), and poorer performance status (WHO PS2 v PS0, p = 0.049 in analyses limited from 2005 to 2014). Gender, anatomic site, tumour differentiation and HIV positivity were not associated with OS.

Table 3 Cox models for overall and cancer-specific survivals in 560 patients with SCCA treated with chemoradiotherapy, The Christie 1990–2014.

There were 143 deaths from anal cancer. The 5-year CCS rate was 62% in the first study period (1990–1994), increasing to 80% in the last study period (2010–2014) (Ptrends = 0.001) (Table 3). CSS rates declined with increasing cT stage (cT4 v cT2, p < 0.001), node positivity (p = 0.024), and poorer performance status (WHO PS2 v PS0, P = 0.038; analysis limited to post-2005). Gender, anatomic site, tumour differentiation and HIV positivity were not associated with CSS.

Modelled estimates

The spline regression models predicted estimates for 3-year LRF, 5-year OS and 5-year CSS similar to those in the observed data but added a non-linear dimension (Fig. 2a–c). For 2020, the predicted 3-year LRF was 14.7% (95% CIs: 0–31.3); 5-year OS was 74.7% (95% CIs: 54.6–94.9); and 5-year CSS was 85.7% (95% CIs: 75.3–96.0). We compared modelled changes with time for 5-year OS and 5-year CSS and found no evidence of a competing risk bias for death (Fig. 2d).

Fig. 2: Regression models, with predictions to 2020, of 3-year LRF, 5-year OS and 5-year CSS, and comparative evaluation of OS vs. CSS trends.
figure 2

a For 3-year LRF, rates declined from a mean 37.3% in 1991 to 15.6% at the 2007 pivot, and then the decline slowed down to a mean 15.2% in 2014. b For 5-year OS, rates increased from a mean 55.9% in 1991 to 72.7% at the 2007 pivot, and then the improvement slowed down to a mean 73.7% in 2014; and c for 5-year CSS, rates increased from a mean 61.6% in 1991 to 73.0% at the 2000 pivot, and then the improvement slowed down to a mean 81.3% in 2014. d Similar trends for OS and CSS, not supporting a competitive risk bias.

Predicted models and literature trials

We superimposed the equivalent estimates for the three primary outcomes from six published trials of CRT in patients with SCCA. The plots (Fig. 3 and Table S2) illustrate that the current and predicted rates for 3-year LRF and 5-year OS and CSS are considerably improved compared with most of the estimates from historic trials.

Fig. 3: Regression models, with predictions to 2020, of 3-year LRF (loco-regional failure), 5-year OS (overall survival) and 5-year CSS (cancer-specific survival) using splines, as in Fig. 1.
figure 3

The equivalent estimates (either reported or derived indirectly) from each of the published six trials of chemo-radiotherapy in patients with SCCA in superimposed.

Sensitivity analysis

We repeated the univariable and multivariable models to include all patients treated with curative intent—namely CRT and RT, over the time period, and found similar results (Table S3 and S4).

Discussion

Summary of main findings

Over 25 years, we observed the following. First, there were increased numbers of referrals with time and changing treatment selection to predominantly CRT. Second, in the absence of clear evidence of earlier clinical presentation or changing demographics, we illustrated striking improvements in LRF, and OS and CSS with time. Third, we derived models to estimate contemporary oncological outcomes.

Context of other literature

The increase in number of referrals received by our institute over the 25 years is in keeping with the epidemiological literature, which demonstrates an overall increase in the incidence of anal cancer in many Western populations.22

A small number of institute-level studies have described the presentation and outcomes of anal cancer over time. Myerson et al. (Washington University),23 Kim et al. (Hwasun Hospital, Korea),24 and Tomaszewski et al. (Peter MacCallum Cancer Centre, Australia)25 present institutional series over 25–30 years. However, the treatments within the cohorts from Myerson23 and Kim24 were heterogeneous, and although Tomaszewski et al.25 concentrated on patients treated with CRT, this study did not account for the potential effect of time on presentation and outcome. Whitford et al. (Oregon Health Sciences University)26 explored for a time period effect on outcome but the study was too small (76 cases over 30 years) to take account of varying presentations with time. Recently, Guren et al.27 reported that 5-year net (or relative) survival in 1548 patients from the Cancer Registry of Norway increased from 63 to 73% (1987–2016). However, while the registry reported that 82% were treated with curative intent, detailed treatment details were lacking. Furthermore, relative survival represents a modelled survival estimate taking account underlying period changes, and does not equal observed patient survival estimates, which is required by patients and trialists.

In the 1990s, two randomised trials15,19 demonstrated the use of CRT improved local control compared with RT alone. Further trials,16,20,21 reported between 2008 and 2013, established the combination of radiotherapy with 5-fluorouracil and mitomycin-C as the optimal therapy. While the use of CRT is associated with improved loco-regional disease control (compared with RT alone), it is unclear whether this translates into improvements in overall survival (argument expanded in Supplemental Material p13). To the best of our knowledge, our study is the first to illustrate parallel temporal improvement for LRF and survivals.

In our analysis, there were striking increases in the proportions of pre-treatment nodal positivity from 17% in 1990–94 to 41% in 2010–14. We believe that most of this is driven by the introduction into clinical practice of modern imaging modalities, a type of the Will-Rogers phenomenon. We have written extensively about this and described the added phenomenon of ‘reduced prognostic discrimination’.11 For example, this might explain why nodal positivity was not a predictor of loco-regional relapse. We caution against the interpretation that the increased proportion of nodal positivity reflects a ‘true’ shift to more advanced stage disease, as the proportions of T stages remained constant over the study period.

Limitations and strengths

Our study has limitations. First, there may be selection bias. Over the study period, improvements might reflect stricter criteria for curative intent. This seems unlikely as the proportions treated by curative intent were broadly 80% throughout. Similarly, improvements might reflect proportionately increased use of CRT (rather than RT). This is true—though our sensitivity analyses demonstrate that the same patterns of oncological outcomes were seen for the combined RT and CRT cohort. Second, there may be unmeasured confounding. For example, we did not routinely capture performance status data before 2005. It is likely that our patients’ general health status improved with time, though it seems less plausible that this alone accounts for the observed 16% absolute improvement in overall survival. Third, there was a lack of treatment-related toxicity data. It is conceivable that grade 3 and 4 toxicities lessened, and their management improved. Again, it seems unlikely that this alone accounted for the magnitude of observed improvements in overall survival. Fourth, we did not capture technical refinements in salvage surgery over time, which might account for some increases in long-term disease-free states. However, as primary locoregional failure rates have reduced substantially, salvage surgery is now less often required. Furthermore, among patients with local relapses, the proportion that proceed to salvage surgery has decreased from more than 70% in historic series12,28 in the 1990s to only 23% in the ACT II trial from the mid-2000s.29 This is an area of ongoing research in this cohort.

There are several study strengths. First, we used a prospectively maintained database, where for example, key prognostic factors such as pre-treatment stage were consistently recorded. Second, this is the largest temporal clinical institute-level dataset of its type. Other datasets (106 patients;23 50 patients;24 284 patients;25 76 patients26)—were smaller. Third, we concentrated our analysis on a homogenous treatment—namely CRT. The details of this treatment varied with time, but the backbone was 50–55 Gy radiotherapy and a 5-FU-mitomycin based concurrent chemotherapy. Fourth, there was appropriate length follow-up. Fifth, our definition of 3-year LRF is equivalent to that currently used in the UK PLATO trial,30 and many of the patient population in the primary analysis of this study are equivalent to those eligible for modern trials, like PLATO.

Clinical implications

The improved oncological outcomes are likely to have multifactorial drivers. The use of advanced imaging may facilitate more accurate treatment with CRT. Advances in RT technologies over time, better awareness of toxicity and improved supportive care and the abandonment of the inter-phase RT break (after ACT I) are likely contributors. Centralisation of anal cancer management is likely to have contributed to improvements through use of defined patient protocols. The culmination of these changes is the probable driver of improved oncological outcomes, although near-impossible to quantify. Human Papilloma Virus (HPV) is the aetiological agent in most SCCA tumours, but is also a marker of radio-sensitivity.31 It is conceivable that the proportion of HPV-driven tumours have increased with time, in turn, increasing the overall radio-sensitivity of these cancers.

The current and predicted rates for 3-year LRF and 5-year survivals are more optimistic than those in the historic trials. It is important that ongoing and future trials are appropriately powered to reflect event rates for current standard of care (the control arm). We illustrate this as follows. Consider a hypothetical trial based on clinical practice 25 years ago. We assume that the LRF rate was 30% and the new intervention aimed to improve LRF by (relative) 25% i.e. to 24%. Assuming an alpha = 0.05 and power = 0.80, a 1:1 head-to-head trial would require 675 in each arm (total: 1350) with 365 events. Now consider a similar trial today. We assume that the LRF rate is 20% and the new intervention aimed to improve LRF by (relative) 25% i.e. to 15%. Assuming an alpha = 0.05 and power = 0.80, a 1:1 head-to-head trial would require 715 in each arm (total: 1430) with 251 events.

Unanswered questions and future research

First, while the Will Rogers phenomenon11 may partly explained the increase in proportions of SCCA patients with node positivity, there might be other factors. Nodal positivity is clinically important as this is used as treatment stratification in clinical practice and in trials. The relevance of this is still not clear.

Second, we are now in an era of accurate radiotherapy delivery with VMAT and IMRT, which has become standard RT for anal cancer.32 RTOG 052933 was a phase 2 evaluation of dose-painted intensity modulated radiation therapy in combination with 5-FU and MMC, which not only showed a reduction of acute morbidity but also improved LRF. If there is a causal relationship between LRF and OS, these new treatment modalities might further improve LRF, reduce treatment-related toxicity, and ultimately, further reduce the death burden from this cancer.