Introduction

Despite advances in surgical and oncological management, the prognosis of patients with oesophageal adenocarcinoma (OAC) remains poor1. The incidence of OAC is rising in the Western world2, but the majority of patients still present with advanced disease and palliation is the only treatment available to them.

Neo-adjuvant chemotherapy is currently standard of care in the UK for the minority deemed potentially curable with major surgery3. However, only 15% of OAC patients exhibit a meaningful pathological response4, precipitating unnecessary surgical delays in the majority, in which time disease can progress, and patients can become physiologically deconditioned before their operation. The majority of these patients then relapse and eventually succumb to their disease5. Improving the selection of patients who are likely to respond to neo-adjuvant chemotherapy is critical. There is a pressing need to identify biomarkers capable of predicting response to neo-adjuvant chemotherapy to facilitate optimal patient selection prior to surgical resection.

New discoveries have identified genomic sub-types of OAC, one of which demonstrates deficiency in DNA damage repair in 20%6. Recently, the accuracy of a novel DNA-damage immune response (DDIR) assay to predict pathological responders to DNA-damaging platinum-based chemotherapy was evaluated7. Patients with a positive DDIR assay had a significantly higher pathological response rate (p = 0.033) and were associated with improved recurrence-free survival (RFS, p = 0.042) and overall survival (OS, p = 0.015) in multi-variable analysis. These early results are promising, but the prediction of patients likely to respond to neo-adjuvant chemotherapy could, and must, be improved further.

Positron emission tomography (PET) combined with computed tomography (PET-CT) is now routinely used in the OAC staging pathway8. PET-CT improves the sensitivity for the detection of distant metastases compared to contrast-enhanced CT from 52 to 71%9 and changes management in 20–40% of patients10. Metabolic response on PET-CT has also been shown to predict pathological response11, therefore imaging biomarkers, in combination with the DDIR assay may further characterise the tumour microenvironment (TME) and predict which patients will respond.

In this current study, we hypothesised that the addition of PET imaging parameters to clinical parameters and the DDIR assay would further improve the prediction of pathological response to neo-adjuvant chemotherapy. We aimed to assess the diagnostic accuracy of PET response in this cohort and calculate the prognostic significance of PET metrics for RFS and OS.

Materials and methods

Patient cohort

This retrospective, single-centre study included patients with oesophageal adenocarcinoma who were treated with neo-adjuvant platinum-based chemotherapy and resection between 2003 and 2014. This current study builds on the investigation of a DDIR assay in this patient cohort7. The study was performed according to the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD) guidance12. Ethical approval was obtained for biological sample collection and analysis in conjunction with detailed clinical annotation from the Northern Ireland Biobank (NIB12-0032) and the Office for Research Ethics Committees Northern Ireland (ORECNI: 13/NI/0149). All research was performed in accordance with relevant guidelines, regulations and the Declaration of Helsinki, and informed consent was obtained from all participants.

Clinical variables

All patients had biopsy-proven adenocarcinoma of the oesophagus or gastro-oesophageal junction following upper gastrointestinal (GI) endoscopy. Clinical variables including age at diagnosis, gender, radiological staging, pathological staging, neo-adjuvant chemotherapy regimen and outcome were recorded.

PET-CT protocol

Two PET-CT scanners and protocols were used during the study period. All patients had pre- and post-treatment PET-CT imaging using the same scanner. Before December 2011, PET-CT was performed using a GE Discovery LS 2-dimensional (2D) scanner, without a time-of-flight (TOF) algorithm. Patients received a dose of 375 MBq of 18F-FDG. After December 2011, a GE Discovery 690 3D scanner with TOF algorithm was used. Patients received a 18F-FDG dose of 3.5 MBq/Kg. SUVmax was recorded by the reporting PET-CT radiologist and documented in the corresponding report.

Radiological staging

Radiological staging followed international guidelines and consisted of initial contrast-enhanced CT, following by PET-CT + /− EUS for more detailed staging8. All patients were staged according to the contemporaneous International Union for Cancer Control (UICC) Tumour Node Metastasis (TNM) classification (7th edition). Three PET metrics were retrospectively recorded from the PET-CT examination reports. Baseline SUVmax represented the highest SUV in the primary tumour prior to treatment initiation. Post neo-adjuvant treatment SUVmax was recorded after chemotherapy completion and prior to surgery. The second PET-CT was performed to re-stage patients prior to oesophagectomy to detect interval metastases13. The change in SUVmax between examinations was calculated by subtracting the post-treatment SUVmax from the baseline SUVmax. A PET response was defined as a reduction in SUVmax of 35% between baseline and post-treatment PET-CT11 (Fig. 1).

Figure 1
figure 1

Selected images from a patient with a distal oesophageal adenocarcinoma. Maximum intensity projection (MIP) (a) and fused axial PET-CT images (b) demonstrated a large FDG-avid tumour. After neo-adjuvant chemotherapy was completed, a repeat PET-CT (c, d) showed there had been an excellent metabolic response. This patient had a positive DDIR assay and final pathological examination indicated a good response (tumour regression grade 2).

Treatment pathway

Patients had 12 weeks of neo-adjuvant platinum-based chemotherapy equalling three cycles of either epirubicin, cisplatin and capecitabine (ECX), or epirubicin, cisplatin and 5-fluorouracil (ECF). The time between the last cycle of chemotherapy finishing and the second PET-CT was 21 days. Surgical resection was then performed 4–6 weeks after completing neo-adjuvant chemotherapy.

Pathological staging

Pathological resection specimens were reported following the minimum recommended dataset14. The TNM classification was assigned, and the degree of adenocarcinoma differentiation, local vascular invasion (LVI) and circumferential resection margin (CRM) involvement were categorised. Pathological response was assigned using the Mandard classification15, with responders classified as tumour regression grade (TRG) 1–2, and non-responders as TRG 3–54. The details of the DDIR assay are found in Turkington et al.7.

Outcome data

Data on three clinically important outcomes were collected; pathological response, RFS and OS. RFS was defined as the time from surgical resection to recurrence or death, or date of last follow-up. OS was defined as the time from diagnosis to death from any cause, or date of last follow-up. Dates of each outcome were captured, and the time interval between diagnosis and outcome recorded in days.

Statistical analysis

Statistical analysis was performed using R software (version 3.6.1)16. Cases with missing data were excluded. Correction for multiple comparisons was performed using the Bonferroni method17. Differences between continuous and categorical variables were tested with Mann–Whitney U tests and chi-square tests, respectively. Diagnostic performance of PET metrics was calculated with sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV). Receiver operator characteristic (ROC) curves and Youden method were used to calculate optimum thresholds of continuous variables for diagnostic performance. Univariable analysis tested each variable for association with RFS and OS. A series of multi-variable regression models were constructed to analyse the additional value of the DDIR assay and PET metrics to predict pathological response, RFS and OS.

Results

Clinicopathological and radiological characteristics of the cohort

In total, 163 patients were considered for inclusion. After selection criteria were applied, 113 patients were analysed after cases with missing PET (n = 40) and TRG data (n = 10) were excluded. Table 1 details the characteristics of the cohort. A CONSORT study flow diagram is shown in Fig. 2.

Table 1 Baseline characteristics of patient cohort.
Figure 2
figure 2

A CONSORT study flow diagram detailing the inclusion of patients in the study. The total patients excluded for missing PET-CT data were 40.

Post-treatment PET-CT was performed after completion of chemotherapy approximately 15 weeks after baseline PET/CT. Surgery was performed up to 6 weeks from post-treatment PET-CT. Summary statistics of SUVmax distribution between DDIR negative and positive tumours are detailed in Supplementary Table S1. No significant differences in SUVmax (p = 0.47), post-NACT SUVmax (p = 0.52), or change in SUVmax (p = 0.21) were found between the two PET-CT scanners. Histograms and waterfall plots of SUVmax distributions are available in Supplementary Figs. S1S3.

Association of clinicopathological and radiological variables with DDIR status, pathological response and recurrence

There were several significant associations between clinical variables and clinical outcomes. Notably, pT-stage was associated with pathological response (p < 0.001), and pN-stage and CRM were associated with recurrence (p < 0.001, respectively) (Supplementary Table S2).

Boxplots showing the differences in PET metrics between DDIR status and pathological response status are shown in Fig. 3. There were significant associations between post-NACT SUVmax (p = 0.03) and change in SUVmax (p = 0.002) with pathological response after adjusting for multiple comparisons (Table 2).

Figure 3
figure 3

Boxplots of differences between baseline, post neo-adjuvant chemotherapy and change in SUVmax for DDIR status and pathological response (unadjusted p-values shown within boxplots).

Table 2 Association of PET metrics with DDIR status, pathological response and recurrence.

Sensitivity and specificity of PET response to predict pathological response

In the entire cohort, the sensitivity and specificity of the 35% SUVmax reduction threshold to predict pathological response were 1.00 and 0.44, respectively. The optimal SUVmax reduction threshold for the best sensitivity and specificity values to predict pathological response were calculated with the Youden method. Cases of progression (defined as increase in SUVmax between PET-CT examinations) were excluded for this purpose. Using a threshold of 46.5%, the optimum sensitivity and specificity was 0.69 and 0.66, respectively. This threshold produced a c-statistic of 0.78. (Supplementary Fig. S4) Table 3 shows the diagnostic accuracy results for the DDIR status, DDIR status combined with a PET threshold of 35%, and DDIR status when combined with the optimised 46.5% threshold. Combining PET metrics with DDIR status improved the specificity for predicting pathological response up to 0.90 without loss of sensitivity.

Table 3 Sensitivity, specificity, positive predictive value and negative predictive value for pathological response of DDIR Status, DDIR combined with a PET Threshold of 35% and with the optimised SUVmax reduction threshold.

Optimum SUVmax reduction thresholds for clinical outcomes (pathological response and recurrence) and individual ROC curves for each PET metric are included in Supplementary Table S3 and Supplementary Fig. S5.

Combined DDIR status and PET metrics predict pathological response better than either metric alone

Having demonstrated the association of PET metrics with pathological response, we sought to integrate this information with DDIR status to assess the performance of a combined assay. Two significant associations were demonstrated on uni-variable analysis after Bonferroni correction. Change in SUVmax (p = 0.029) and pT-stage (< 0.001) were both significantly associated with pathological response. DDIR status (unadjusted p = 0.428) was not associated with pathological response.

In terms of PET response, logistic regression was not reliable (unadjusted p = 0.991) because the sensitivity of PET response at the 35% threshold was 100%, therefore no false negative results were recorded. Therefore, this variable was omitted from further analysis predicting pathological response. Full details of the univariable analysis are included in Supplementary Table S4.

On multi-variable analysis, post-NACT SUVmax (p = 0.020), change in SUVmax (p = 0.003), and PET response at optimised 46.5% threshold (p = 0.021) were independently and significantly associated with pathological response. (Table 4) In contrast to Turkington et al.7, DDIR status was not significantly associated with pathological response. This result may be explained by the analysis of a sub-group from the original cohort which included relatively few pathological responses. Furthermore, the AIC values of the logistic regression models indicate that the models combining DDIR and PET metrics performed better than DDIR status alone.

Table 4 Results for each multi-variable model to predict pathological response.

Survival analysis

The median RFS of all patients was 36.7 months (95% CI 21.1–63.2). The median OS of the cohort was 43.9 months (95% CI 33.2–not reached).

RFS rates at 1, 2, and 5 years were 73.4% (95% CI 65.6–82.0%), 55.4% (95% CI 46.9–65.4%), and 39.6% (95% CI 30.8–51.1%), respectively. OS rates at 1, 2, and 5 years were 78.7% (95% CI 71.5–86.6%), 67.9% (59.7–77.1%), and 46.7% (37.1–58.7%), respectively.

After adjusting for multiple comparisons, seven variables were associated with RFS in uni-variable analysis; pT-stage, pN-stage, LVI, CRM, total positive nodes after resection, positive nodal ratio, and TRG. Similarly, seven variables were associated with OS; pT-stage, pN-stage, LVI, CRM, total positive nodes after resection, positive nodal ratio, and recurrence.

Full details of the uni-variable analysis results for RFS and OS are included in the Supplementary Tables S5 and S6. There were significant differences in RFS (p = 0.030) and OS (p = 0.028) in PET responders versus non-responders at the 35% reduction threshold. (Fig. 4) Median RFS of PET responders was 54.2 months (95% CI 28.1–not reached) and PET non-responders was 19.0 months (95% CI 13.2–41.2). Median OS of PET responders was not reached, and median OS of PET non-responders was 27.3 months (95% CI 17.3–not reached).

Figure 4
figure 4

Cumulative (a) recurrence-free and (b) overall survival depending on PET response status using a 35% SUVmax reduction threshold level.

Three groups were constructed to compare the survival differences between combined DDIR status and PET response; no PET response and DDIR negative, PET response and DDIR negative, and DDIR positive. The latter group comprised both PET responders and non-responders because only nine patients had no PET response and were DDIR positive, so these nine patients were combined with the other DDIR positive patients for this specific analysis. There were significant differences in RFS (p = 0.01) and OS (p = 0.03) (Fig. 5) between groups.

Figure 5
figure 5

Cumulative overall survival differences between three groups; no PET response and DDIR negative, PET response and DDIR negative, and DDIR positive. The latter group comprised both PET responders and non-responders because only nine patients had no PET response and were DDIR positive.

The prognostic significance for RFS and OS was also tested using the optimised 46.5% PET response threshold to predict pathological response. There was a significant difference (p = 0.047) in RFS between PET responders (63.2 months; 95% CI 29.2–not reached) and PET non-responders (23.8 months; 95% CI 15.2–54.2) using the 46.5% threshold optimised for pathological response, although the survival difference between the two groups was larger using the 35% SUVmax reduction threshold. (Fig. 6) The optimised 46.5% threshold did not meet statistical significance for OS (p = 0.099; Supplementary Fig. S6). Additional cumulative survival plots for DDIR status and pathological response with RFS and OS are included in Supplementary Figs. S7–S10.

Figure 6
figure 6

Recurrence-free survival difference between pathological responders and non-responders using the optimised threshold of 46.5% reduction in SUVmax.

In similar methodology to the prediction of pathological response, a series of Cox regression models were constructed for RFS and OS. The variables cN-stage, differentiation of adenocarcinoma and DDIR status were included in all models because these three variables are available at the time of diagnosis, unlike pathological variables7. DDIR status was significant for RFS but not OS in the first model containing just three variables (cN-stage, differentiation of adenocarcinoma and DDIR status). (Table 5) The best performing model for both RFS and OS was model six, which contained cN-stage, differentiation of adenocarcinoma, DDIR status, and optimised 46.5% PET response threshold, which was independently significant for both RFS and OS (p = 0.012 and p = 0.047, respectively).

Table 5 Summary of results for each RFS and OS multi-variable Cox regression model.

Discussion

This study has demonstrated that PET metrics are predictive of pathological response and have additional value when combined with a novel pre-treatment DDIR assay. A model using both PET metrics and a DDIR assay improves the prediction of which patients are likely to have a pathological response. Post-treatment SUVmax and change in SUVmax were both independently predictive of pathological response and, when combined with a DDIR assay, demonstrated a greater predictive performance than DDIR alone. These findings add evidence to incorporating PET metrics into pre-treatment clinical decision tools and to re-stage OAC patients with PET-CT prior to surgery.

Pathological response is an important endpoint in oesophageal adenocarcinoma. Only 15% of patients have a meaningful benefit from neo-adjuvant chemotherapy, defined as Mandard TRG 1–215. In this cohort, only 13 of 113 patients (11.5%) were classified as TRG 1 or 2. It is imperative that patient selection for radical curative treatment improves because despite aggressive treatment, the 2-year overall survival after neo-adjuvant chemotherapy and oesophagectomy ranges from 40 to 70%7,18 with a 20% recurrence rate5.

PET metrics were also associated with RFS and OS in patients with oesophageal adenocarcinoma treated with neo-adjuvant chemotherapy. Using a 35% reduction threshold, PET response was prognostically significant for both RFS and OS in univariable analysis. A series of multi-variable models were constructed using clinicopathological factors (cN-stage and degree of differentiation), PET metrics and DDIR status. Baseline SUVmax, post-treatment SUVmax and change in SUVmax were all included separately in multivariable models to assess the added value of PET metrics to predict pathological response. In addition, the optimised 46.5% SUVmax reduction threshold used to predict pathological response was independently significant for both RFS and OS in this cohort. However, these results must be validated in a new independent cohort prior to clinical adoption.

This work expanded on a study by Turkington et al.7 investigating a novel DDIR assay to predict pathological response prior to treatment initiation. The DDIR assay was originally developed in breast cancer and constitutes a 44-gene assay19 indicating constitutive activation of the cyclic GMP-AMP synthase (cGAS)/stimulator of interferon genes (STING) pathway in response to DNA damaging chemotherapy20. The DDIR assay includes immune checkpoint targets, such as programmed death ligand 1 (PD-L1)21, which are up-regulated by immune activation caused by infiltration of the tumour by T-lymphocytes. Up-regulation creates an inflammatory microenvironment associated with sensitivity to chemotherapy.

Whole genome sequencing in pre-treatment oesophageal adenocarcinoma biopsy samples has identified three distinct subtypes6. One sub-type, representing 20% of cases, had impaired DNA-damage repair status. These patients are likely to respond to DNA-damaging chemotherapy. Therefore, identification of this sub-group of patients prior to treatment initiation would be beneficial, because it may allow refinement of treatment approaches. The pre-treatment DDIR assay was strongly predictive of pathological response and can be performed using formalin-fixed material7. However, the diagnostic performance of the assay for pathological response could be improved further. We hypothesised that the addition of PET metrics to the DDIR assay could achieve this necessary improvement.

PET response is often defined as a 35% interval reduction in SUVmax. This threshold is based on work assessing early PET response in oesophageal adenocarcinoma to predict pathological response11,22,23. After 14 days, an early repeat PET-CT evaluated the change in SUVmax from baseline PET-CT and attempted to predict pathological response. In original work, Weber et al.22 demonstrated that the mean SUVmax of PET responding OAC (− 54% ± 17%) was significantly different from that of non-responding OAC (− 15% ± 21%). The optimal threshold value to differentiate these groups after one cycle was 35% SUVmax reduction, which resulted in a sensitivity of 93% and a specificity of 95%. This threshold was validated in subsequent work by this group, although the compelling diagnostic performance was not replicated. Ott et al.23 found the sensitivity and specificity of using these criteria were 80% and 78%, respectively. Lordick et al.11 found the sensitivity and specificity to be 100% and 72%, respectively. These diagnostic accuracy values were calculated against the reference standard pathological response criteria by Becker et al.24, defined as Becker classification 1 (< 10% tumour cells remaining).

Subsequently, authors have applied the 35% threshold to predict pathological response after completion of neo-adjuvant chemotherapy, when PET-CT is repeated approximately 3 months after staging. Results of studies show variable benefit25. Currently, no evidence exists to support the use of PET metrics to accurately predict pathological response. In our study, the 35% threshold did not show a significant association with pathological response, which suggests that alternative thresholds used to define metabolic PET response after completion of neo-adjuvant chemotherapy should be sought. However, the study identified an optimised threshold of 46.5% in this cohort of OAC patients. This threshold was significantly associated with pathological response, RFS and OS. These results are promising however they must be validated in external cohorts before being applied in clinical practice.

Findlay et al.26 investigated PET response between staging and post neo-adjuvant chemotherapy. The authors demonstrated that a larger reduction of SUVmax after treatment completion may be more predictive of pathological response. Whilst only a single-centre study, a SUVmax reduction of 77.8% performed better than the PERCIST threshold of 30%27 and the 35% threshold. Findlay et al. suggested using a more pragmatic threshold of 75% would result in a sensitivity of 73.6% and a specificity of 84.0%. Again, these results must be validated in external centres, but in combination with our results, suggest that a threshold above the commonly cited 35% threshold used after completion of neo-adjuvant treatment should be considered.

There are some limitations of the present study; this is a retrospective, single-centre study conducted over a 10-year period. These data are consistent and homogenous but require validation in a multi-centre study. Relatively few pathological responders were included in each DDIR status group. Early models of PET-CT scanners have inferior spatial resolution and signal-to-noise ratio compared to more modern systems, which may affect quantification of SUVmax, particularly for small tumours. The findings of this study must be validated in larger external cohorts using pre-defined PET response thresholds. The added value of PET metrics to the DDIR assay must also be tested in alternative neo-adjuvant therapy regimens including peri-operative FLOT (docetaxel, oxaliplatin, leucovorin, and 5-fluorouracil)28 and neo-adjuvant chemoradiotherapy29. Combining these biomarkers, whilst predictive of pathological response, does not inform the use of neo-adjuvant treatment.

In conclusion, this study has demonstrated the additional value of PET metrics, in combination with a novel transcriptomic DDIR assay, to predict pathological response in OAC patients treated with neo-adjuvant chemotherapy. Furthermore, an optimised SUVmax reduction threshold was calculated and was independently significant for RFS and OS. Combining imaging and molecular biomarkers could lead to improved stratification and precision therapeutic approaches in oesophageal adenocarcinoma.