Temporal improvements in loco-regional failure and survival in patients with anal cancer treated with chemo-radiotherapy: treatment cohort study (1990–2014)

We evaluated oncological changes in patients with squamous cell carcinoma of the anus (SCCA) treated by chemoradiotherapy (CRT) from a large UK institute, to derive estimates of contemporary outcomes. We performed a treatment-cohort analysis in 560 patients with non-metastatic SCCA treated with CRT over 25 years. The primary outcomes were 3-year loco-regional failure (LRF), 5-year overall survival (OS), and 5-year cancer-specific survival (CSS). We developed prediction models; and overlaid estimates on published results from historic trials. Age distributions, proportions by gender and cT stage remained stable over time. The median follow-up was 61 (IQR: 36–79) months. Comparing the first period (1990–1994) with the last period (2010–2014), 3-year LRF declined from 33 to 16% (Ptrends < 0.001); 5-year OS increased from 60% to 76% (Ptrends = 0.001); and 5-year CCS increased from 62% in to 80% (Ptrends = 0.001). For 2020, the models predicted a 3-year LRF of 14.7% (95% CIs: 0–31.3); 5-year OS of 74.7% (95% CIs: 54.6–94.9); and 5-year CSS of 85.7% (95% CIs: 75.3–96.0). Reported oncological outcomes from historic trials generally underestimated contemporary outcomes. Current and predicted rates for 3-year LRF and 5-year survivals are considerably improved compared with those in historic trials.


BACKGROUND
Large-scale population-based studies in developed countries, such as EUROCARE, 1 indicate that 'major advances in cancer management seem to have resulted in improved survival' in many cancer types. These data are informative for policy-makers seeking information on net survival improvements but generally lag behind contemporary management strategies (for example, EUROCARE reports only to 2007 1 ); focus mainly on common cancers; and generally fail to capture detailed treatment and stage information, necessary to interpret whether survival improvements reflect introductions of new treatments or stage migrations.
In contrast, patients and oncologists generally seek to understand prognosis, namely the chance of surviving from a specific cancer, in the context of contemporary treatment options. 2 For trialists, there is an additional need to forecast expected number of events based on current standard of care. But a new problem is emerging in trials -namely that outcomes from contemporary standard arm management exceed expectations (compared with historical literature). Thus, trials reach target recruitment but findings appear to lack power. 3 This issue is exemplified in recent non-oncology (ARRIVE 4 ) and oncology (COLOFOL 5 and ROLAAR 6 ) trials.
Here, we address the above problem in the setting of an uncommon cancer, namely squamous cell carcinoma cancer of the anus (SCCA), treated with chemo-radiotherapy (CRT). The latter is standard of care in many countries as reflected by guidelines, for example, from NCCN, 7 ESMO-ESSO-ESTRO, 8 and ACPGBI. 9 Approximately three-quarters of patients with SCCA receive CRT as initial treatment. 10 Through systematic review, 11 we recently reported on 45 studies of patients with SCCA who received either radiotherapy alone (RT) or CRT and noted that 5-year overall survival increased from a mean estimate of 64% in 1980 to 75% in 2010 (p = 0.046). It is conceivable that this temporal improvement might be driven by improvements in loco-regional control, but might also be due to unmeasured factors, such as general improvement in healthcare, centralisation, improved imaging and radiotherapy delivery, and more effective management of toxicity. It might also reflect early tumour stage at presentation or younger mean age at diagnosis.
In this study, we confirmed the observation of significant temporal improvement in survivals and aimed to use these striking temporal changes to derive models to estimate contemporary outcomes. www.nature.com/bjc

Patients
We performed a treatment-cohort analysis, using a prospectively maintained clinical database of patients with SCCA treated at the Christie NHS Foundation Trust, Manchester, United Kingdom, seen between 1 January 1990 and 31 December 2014, and followed to 30 April 2018. The Christie anal cancer multi-disciplinary team (MDT) meeting was centralised for the Greater Manchester and North Cheshire geographical areas (approximate 1.8 million) in 2007. From 2004, pre-treatment HIV testing was performed selectively (for example, untested male homosexual men).
Patients were included if they had histologically confirmed squamous cell carcinoma arising from the anal canal or margin treated with CRT with curative intent. For sensitivity analyses, patients treated curatively with RT alone were added. Standard clinical, pathological and treatment-related variables were collected, as previously published. 12 We recognised a change in pre-treatment staging assessment through the study period and categorised this as follows: 1990 to 2003 assessment was physical examination and CT imaging; 2004 to 2010 assessment added MR imaging; 13 and from 2011 to 2014, assessment additionally added Fluoro-Deoxy-Glucose Positron Emission Tomography/Computed Tomography. TNM staging was in accordance with the American Joint Committee on Cancer (AJCC) staging 7th Edition. 14 Treatment From 1990 to 2001, a split ACT I 15 radiotherapy regimen was prescribed and described elsewhere. 12 After 2001, the treatment protocol followed that used in the ACT II trial 16 -namely, radiotherapy of 50.4 Gy was delivered over 5.5 weeks with a two phase technique, without a mandatory break. Phase 1 included 30.6 Gy in 17 daily fractions with non-conformal rectangular parallel-opposed fields. Phase 2 required conformal planning and delivered 19.8 Gy in 11 daily fractions over 15 days to the primary tumour with a 3 cm margin and any involved lymph nodes. From 2005, we reported median duration of radiotherapy treatment.
Chemotherapy regimens were administered concurrently with radiotherapy as either: mitomycin-C (MMC) 12 mg/m 2 on day 1, and continuous infusion of 5-fluorouracil (5-FU) 1000 mg/m 2 on days 1-4 and days 29-32 or cisplatin (60 mg/m 2 on days 1 and 29) with 5-FU (as above), the latter regimen as part of the ACT II trial 16 (2001-2008). The selection to RT or CRT was randomized as part of the ACT I trial 15 until 1994. Thereafter, selection for RT was the exception, and based on contra-indications to the use of CRT, typically co-morbidities or increasing age.

Follow-up and outcomes
Since 2004, post-treatment follow-up was typically clinical assessment at 6 weeks after completion of CRT and again at clinical visits paralleling the 3-and 6-month MR scans. 13 From 6 to 60 months, patients were assessed clinically on a six-monthly basis and imaging follow-up based on risk of local relapse-in patients deemed at high-risk for local relapse (T size > 5 cm; AJCC 7th Edition N2 and N3 disease; incomplete RT or CRT), MR scans were generally performed at 12, 18, 24 and 36 months; in the remainder (low-risk), MR scans were performed at 36 months. Prior to 2004, surveillance was by clinical examination.
For this analysis, the primary outcomes were 3-year locoregional failure (LRF); 5-year overall survival (OS); and 5-year cancer-specific survival (CSS). These are CORMAC 17 core outcome measures. Time-to-events were from the date of start of first treatment. LRF was defined as the presence of either residual or recurrent disease within the inguinal/pelvic anatomic sites. OS was defined as the period of time until death from any cause; CSS was defined as the period of time until death from anal cancer.
Statistical analysis All statistical analyses were performed using Stata software, Version 14 (Stata Corp, College Station, TX, USA). The main analysis was based on patients receiving curative CRT; sensitivity analyses included all patients treated with curative intent-namely CRT and RT, over the time period. In order to test for a period effect, we divided the cohort into five groups of five-year intervals, spanning the 25-year study period. Differences in baseline characteristics across the five periods were explored using the Cuzick's non-parametric test and the Cochran-Armitage test for trends (2 × n tables) as appropriate. For cT stage, we used ordinal regression to account for the multinomial stage proportions and examine whether overall stage distribution and stage-specific proportions changed significantly.
To derive predicted contemporary (2020) estimates, we used a two-stage approach. First, we assessed for key confounders in this cohort and evaluated the associations between patient and tumour factors with the three outcomes of 3-year LRF; 5-year OS; and 5-year CSS. We derived Kaplan-Meier (K-M) estimates and then performed univariable and multivariable analyses using Cox models, adjusted for year of treatment. Proportionality assumptions were tested using Schoenfeld residuals.
Second, we sought to relate changes in key outcomes with study periods. For this analysis, we estimated the three outcomes using K-M methods, in two-year bands (except the first 3 years, due to small sample size), and related these over time using regression models, weighted for period sample size. Initial exploration revealed that linear models might predict implausible outcomes (for example, greater than 100% survivals). Therefore, non-linear splines were used. A range of cut-off points from years 2000 to 2010 were tested as pivots for each scenario. The optimal cut-point was determined based on three criteria: (i) visual inspection of plots; (ii) lowest AIC (Akaike Information Criteria) value per model; and (iii) clinically plausible coefficients. For example, if LRF rates were declining (negative regression coefficient), we rejected models where the regression coefficient 'right' of the cut-point was positive. Once the optimum regression spline model was determined, we used it to predict options to extrapolate estimates with 95% confidence intervals (95% CIs) for 2020. We additionally tested for the presence of competing risk of death bias by visually comparing the predictions for 5-year OS vs. 5-year CSS over time.
Finally, once we established the optimal regression models, we superimposed the equivalent estimates for the three primary outcomes from the six reported trials 15,16,[18][19][20][21] of CRT in patients with SCCA, and visually inspected for model fit.  (Fig. 1). The proportions treated by curative intent remained steady (at approximately 80%) across the five time intervals (lower panel in Fig. 1). Median radiotherapy duration was 37 (IQR: 37-38) days. The proportion of patients with incomplete radiotherapy (<32 days) was 2.7%; the proportion with clinically-relevant delayed delivery of radiotherapy (≥42 days) was 8.0%. There were no differences across time periods from 2005. Table 1 details the baseline characteristics by time periods for the 560 patients undergoing CRT. Women accounted for two-thirds of the cohort. Median age was 60 years and was stable across the study periods. The proportions of cT1 to cT4 stages remained remarkably stable across the study periods (all Ps > 0.05). By contrast, nodal positivity increased from 17% in the first study period to 41% in the last study period (P < 0.001). The baseline characteristics for the 701 patients undergoing either RT or CRT with curative intent are detailed in Table S1. The proportions and trends with time are very similar to those for CRT alone.
Predicted models and literature trials We superimposed the equivalent estimates for the three primary outcomes from six published trials of CRT in patients with SCCA. The plots (Fig. 3 and Table S2) illustrate that the current and predicted rates for 3-year LRF and 5-year OS and CSS are considerably improved compared with most of the estimates from historic trials.

Sensitivity analysis
We repeated the univariable and multivariable models to include all patients treated with curative intent-namely CRT and RT, over the time period, and found similar results (Table S3 and S4).

Summary of main findings
Over 25 years, we observed the following. First, there were increased numbers of referrals with time and changing treatment selection to predominantly CRT. Second, in the absence of clear evidence of earlier clinical presentation or changing demographics, we illustrated striking improvements in LRF, and OS and CSS with time. Third, we derived models to estimate contemporary oncological outcomes.

Context of other literature
The increase in number of referrals received by our institute over the 25 years is in keeping with the epidemiological literature, which demonstrates an overall increase in the incidence of anal cancer in many Western populations. 22 A small number of institute-level studies have described the presentation and outcomes of anal cancer over time. Recently, Guren et al. 27 reported that 5-year net (or relative) survival in 1548 patients from the Cancer Registry of Norway increased from 63 to 73% (1987-2016). However, while the registry reported that 82% were treated with curative intent, detailed treatment details were lacking. Furthermore, relative survival represents a modelled survival estimate taking account underlying period changes, and does not equal observed patient survival estimates, which is required by patients and trialists.
In the 1990s, two randomised trials 15     trials, 16,20,21 reported between 2008 and 2013, established the combination of radiotherapy with 5-fluorouracil and mitomycin-C as the optimal therapy. While the use of CRT is associated with improved loco-regional disease control (compared with RT alone), it is unclear whether this translates into improvements in overall survival (argument expanded in Supplemental Material p13). To the best of our knowledge, our study is the first to illustrate parallel temporal improvement for LRF and survivals. In our analysis, there were striking increases in the proportions of pre-treatment nodal positivity from 17% in 1990-94 to 41% in 2010-14. We believe that most of this is driven by the introduction into clinical practice of modern imaging modalities, a type of the Will-Rogers phenomenon. We have written extensively about this and described the added phenomenon of 'reduced prognostic discrimination'. 11 For example, this might explain why nodal positivity was not a predictor of loco-regional relapse. We caution against the interpretation that the increased proportion of nodal positivity reflects a 'true' shift to more advanced stage disease, as the proportions of T stages remained constant over the study period.
Limitations and strengths Our study has limitations. First, there may be selection bias. Over the study period, improvements might reflect stricter criteria for curative intent. This seems unlikely as the proportions treated by curative intent were broadly 80% throughout. Similarly, improvements might reflect proportionately increased use of CRT (rather than RT). This is true-though our sensitivity analyses demonstrate that the same patterns of oncological outcomes were seen for the combined RT and CRT cohort. Second, there may be unmeasured confounding. For example, we did not routinely capture performance status data before 2005. It is likely that our patients' general health status improved with time, though it seems less plausible that this alone accounts for the observed 16% absolute improvement in overall survival. Third, there was a lack of treatment-related toxicity data. It is conceivable that grade 3 and 4 toxicities lessened, and their management improved. Again, it seems unlikely that this alone accounted for the magnitude of observed improvements in overall survival. Fourth, we did not capture technical refinements in salvage surgery over time, which might account for some increases in long-term disease-free states. However, as primary locoregional failure rates have reduced substantially, salvage surgery is now less often required. Furthermore, among patients with local relapses, the proportion that proceed to salvage surgery has decreased from more than 70% in historic series 12,28 in the 1990s to only 23% in the ACT II trial from the mid-2000s. 29 This is an area of ongoing research in this cohort.
There are several study strengths. First, we used a prospectively maintained database, where for example, key prognostic factors such as pre-treatment stage were consistently recorded. Second, this is the largest temporal clinical institute-level dataset of its type. Other datasets (106 patients; 23 50 patients; 24 284 patients; 25 76 patients 26 )-were smaller. Third, we concentrated our analysis  30 and many of the patient population in the primary analysis of this study are equivalent to those eligible for modern trials, like PLATO.

Clinical implications
The improved oncological outcomes are likely to have multifactorial drivers. The use of advanced imaging may facilitate more accurate treatment with CRT. Advances in RT technologies over time, better awareness of toxicity and improved supportive care and the abandonment of the inter-phase RT break (after ACT I) are likely contributors. Centralisation of anal cancer management is likely to have contributed to improvements through use of defined patient protocols. The culmination of these changes is the probable driver of improved oncological outcomes, although near-impossible to quantify. Human Papilloma Virus (HPV) is the aetiological agent in most SCCA tumours, but is also a marker of radio-sensitivity. 31 It is conceivable that the proportion of HPVdriven tumours have increased with time, in turn, increasing the overall radio-sensitivity of these cancers. The current and predicted rates for 3-year LRF and 5-year survivals are more optimistic than those in the historic trials. It is important that ongoing and future trials are appropriately powered to reflect event rates for current standard of care (the control arm). We illustrate this as follows. Consider a hypothetical trial based on clinical practice 25 years ago. We assume that the LRF rate was 30% and the new intervention aimed to improve LRF by (relative) 25% i.e. to 24%. Assuming an alpha = 0.05 and power = 0.80, a 1:1 head-to-head trial would require 675 in each arm (total: 1350) with 365 events. Now consider a similar trial today. We assume that the LRF rate is 20% and the new intervention aimed to improve LRF by (relative) 25% i.e. to 15%. Assuming an alpha = 0.05 and power = 0.80, a 1:1 head-to-head trial would require 715 in each arm (total: 1430) with 251 events.
Unanswered questions and future research First, while the Will Rogers phenomenon 11 may partly explained the increase in proportions of SCCA patients with node positivity, there might be other factors. Nodal positivity is clinically important as this is used as treatment stratification in clinical practice and in trials. The relevance of this is still not clear. Second, we are now in an era of accurate radiotherapy delivery with VMAT and IMRT, which has become standard RT for anal cancer. 32 RTOG 0529 33 was a phase 2 evaluation of dose-painted intensity modulated radiation therapy in combination with 5-FU and MMC, which not only showed a reduction of acute morbidity but also improved LRF. If there is a causal relationship between LRF and OS, these new treatment modalities might further improve LRF, reduce treatment-related toxicity, and ultimately, further reduce the death burden from this cancer.   Fig. 3 Regression models, with predictions to 2020, of 3-year LRF (loco-regional failure), 5-year OS (overall survival) and 5-year CSS (cancer-specific survival) using splines, as in Fig. 1. The equivalent estimates (either reported or derived indirectly) from each of the published six trials of chemo-radiotherapy in patients with SCCA in superimposed.
Temporal improvements in loco-regional failure and survival in patients. . . H Sekhar et al.