Introduction

Neoadjuvant chemotherapy as part of a multimodal approach has been widely used for decades in patients with inflammatory, locally advanced, or inoperable breast cancer to achieve or facilitate operability and increase both local and systemic control1. In operable breast cancer, a recently published meta-analysis by the EBCTCG demonstrated that neoadjuvant chemotherapy was as effective as adjuvant chemotherapy in reducing the risk of distant recurrence and death from breast cancer. The rates of breast-conserving surgery were increased with the neoadjuvant approach and clinical response was highest in patients with HER2-positive and triple-negative breast cancer (TNBC)2.

Several individual trials and two patient-level meta-analyses showed that a pathological complete response (pCR) after neoadjuvant therapy is associated with improved event-free and overall survival3,4,5. The magnitude of this effect varied by molecular subtype and was most likely seen in patients with HER2-positive or triple-negative disease but also some luminal B-like tumors. In contrast, the long-term outcome of patients without a pCR is generally poor. Two recent randomized trials demonstrated that in these patients adding additional treatment after surgery - in case of TNBC capecitabine and in case of HER2-positive breast cancer TDM-1 - improved disease-free survival (DFS) and/or overall survival6,7. Although pCR rates have significantly increased with the implementation of targeted treatments and patients with pCR have a better long-term outcome than those without pCR, still 15–20% of patients with a pCR will relapse within the first 5 years. A better understanding of which patients will relapse despite a pCR may guide further treatment following surgery especially in patients with more aggressive tumors (like triple-negative or HER2-positive disease). Thus, we aimed to identify factors predicting relapse in patients with a pCR.

The presented study is a pooled analysis of the five neoadjuvant GBG/AGO-B trials GeparTrio, GeparQuattro, GeparQuinto, GeparSixto, and GeparSepto8,9,10,11,12,13,14,15,16,17,18,19,20,21 investigating factors predicting a relapse despite a pCR in the entire cohort and in subgroups according to tumor types.

Results

Patient characteristics

Between 2002 and 2013, a total of 7933 patients were recruited within the GeparTrio, GeparQuattro, GeparQuinto, GeparSixto, and GeparSepto trial, of whom 2066 (26%) had a pCR. The median age in patients with a pCR was 48 (range 21–75) years, 55% (n = 1125) were cN0 and 19% (n = 385) had a more advanced cT3/4 tumor at diagnosis. Most patients had a non-lobular histological tumor type (97%, n = 1997) while 40% (n = 805) and 39% (n = 780) of the patients had TNBC or HER2-positive disease, respectively. 1,41 (62%) patients had a high-grade tumor (G3) (Table 1).

Table 1 Patient, tumor and treatment characteristics.

After a median follow-up of 58.0 months (IQR 47.0–73.6), 269 events for DFS, 184 for DDFS, and 118 for OS were observed. Survival curves showing patients at risk by one year increment are provided in Supplementary Figure 1a–c. Residual in situ disease was associated with a significantly worse DFS, but not DDFS or OS. Median follow-up for the most recent studies GeparSixto and GeparSepto was shorter (48.4 and 49.7 months, respectively), thus an estimation of 4-year survival rates was reported in the following.

Association of potential risk factors and outcome

1892 patients with a pCR and non-missing risk factors were included in multivariate Cox regression analyses of potential risk factors. In the entire cohort, DFS, DDFS, and OS were significantly worse for patients with a positive cN status at baseline (cN+ vs cN0 hazard ratio 1.94, 95%CI 1.48–2.54, p < 0.001 for DFS; hazard ratio 2.29, 95%CI 1.64–3.19, p < 0.001 for DDFS; hazard ratio 1.98, 95%CI 1.31–2.98, p = 0.001 for OS) (Fig. 1a–c). A worse DDFS and OS was seen in patients with lobular tumor type (lobular vs other tumor type hazard ratio 1.95, 95% CI 1.02–3.70, p = 0.043 for DDFS; hazard ratio 2.47, 95%CI 1.19–5.10, p = 0.015 for OS) (Fig 1b, c). The results for the various biological subtypes were inconsistent for DFS, DDFS, and OS (Fig. 1a–c). Patients with TNBC patients and patients with HER2 + /HR- tumors had a significantly shorter DFS compared to the rest (TNBC vs HER2-/HR + hazard ratio 1.49, 95%CI 1.03–2.17, p = 0.036 and HER2 + /HR- vs HER2-/HR + hazard ratio 1.61, 95%CI 1.06–2.45, p = 0.027). OS was significantly worse in TNBC patients compared to HR + /HER2- patients (TNBC vs HER2-/HR + hazard ratio 1.84, 95%CI 1.06–3.21, p = 0.031).

Fig. 1: Multivariate Cox regression models for disease-free survival (a), distant disease-free survival (b) and overall survival (c) for the total population. Error bars represent the 95%CI.
figure 1

HR hazard ratio, CI confidence interval, HER2 human epidermal growth factor receptor 2, TNBC triple-negative breast cancer.

Potential risk factors and outcome in TNBC

Multivariate Cox regression analyses in biological tumor subtypes revealed that TNBC patients with a pCR were at higher risk for a DFS, DDFS, and OS event in case of lobular histology (DFS hazard ratio 3.55, 95%CI 1.53–8.23, p = 0.003, DDFS hazard ratio 3.55, 95%CI 1.30–9.73, p = 0.014 and OS hazard ratio 3.67, 95%CI 1.19–11.4, p = 0.024). Clinically positive lymph nodes at baseline were also associated with a higher risk for a shorter DFS, DDFS, and OS (DFS hazard ratio 2.45, 95%CI 1.59–3.79, p < 0.001; DDFS hazard ratio 3.83, 95%CI 2.22–6.61, p < 0.001 and OS hazard ratio 2.95, 95%CI 1.58–5.50, p < 0.001) (Fig. 2a–c). Interestingly, patients with >6 scheduled cycles of neoadjuvant treatment had a longer DDFS (hazard ratio 0.21, 95%CI 0.04–0.96, p = 0.044) (Fig. 2b) compared to patients with a shorter therapy.

Fig. 2: Multivariate Cox regression models for disease-free survival (a), distant disease-free survival (b) and overall survival (c) in TNBC cohort. Error bars represent the 95%CI.
figure 2

HR hazard ratio, CI confidence interval, HER2 human epidermal growth factor receptor 2, TNBC triple-negative breast cancer.

Potential risk factors and outcome in patients with HER2 + disease

Patients with HER2-positive cT3/4 tumors were at a significantly higher risk for a DFS, DDFS, and OS event compared to HER2-positive cT1 tumors (DFS hazard ratio 2.07, 95%CI 1.06–4.03, p = 0.033; DDFS hazard ratio 3.28, 95%CI 1.38–7.83, p = 0.007; OS hazard ratio 3.42, 95%CI 1.08–10.8, p = 0.036; multivariate Cox regression analyses). This effect was mainly seen in the HER2 + /HR- subtype (DFS hazard ratio 3.93, 95%CI 1.30–11.8, p = 0.015; DDFS: hazard ratio 7.75, 95%CI 1.70–35.2, p = 0.008; OS: hazard ratio 9.29, 95%CI 1.14–75.7, p = 0.037; multivariate Cox regression analyses) (Fig. 3a–c). The risk for a DDFS event was lower in patients with HER2 + /HR- tumors receiving >6 cycles of planned treatment (hazard ratio 0.06, 95%CI 0.01–0.51; p = 0.010; multivariate Cox regression analyses). This was not the case for DFS or OS.

Fig. 3: Multivariate Cox regression models for disease-free survival (a), distant disease-free survival (b) and overall survival (c) in HER2 + /HR- cohort. Error bars represent the 95%CI.
figure 3

HR hazard ratio, CI confidence interval, HER2 human epidermal growth factor receptor 2, TNBC triple-negative breast cancer.

In the subgroup of HER2 + /HR + breast cancer, none of the potential risk factors significantly predisposed patients to a DFS or OS event (Fig. 4a–c). Only a positive nodal status at baseline significantly increased the risk for a DDFS event in all pts (hazard ratio 2.28, 95%CI 1.02–5.12, p = 0.046; multivariate Cox regression analyses).

Fig. 4: Multivariate Cox regression models for disease-free survival (a), distant disease-free survival (b) and overall survival (c) in HER2 + /HR + cohort. Error bars represent the 95%CI.
figure 4

HR hazard ratio, CI confidence interval, HER2 human epidermal growth factor receptor 2, TNBC triple-negative breast cancer.

In the subgroup of HER2-/HR + breast cancer receiving neoadjuvant chemotherapy, pCR patients with cN+ at baseline had a higher risk for a DFS event than cN0 patients (hazard ratio 2.24, 95%CI 1.18–4.25, p = 0.013; multivariate Cox regression analyses) but none of the other potential risk factors significantly increased the risk for a DDFS or OS event.

4-year DFS, DDFS, and OS rates overall and in subgroups

The 4-year DFS, DDFS, and OS rates are presented in Table 2. At 4 years DFS rates were 91.9% for HER2-/HR + , 90.2% for TNBC, 89.0% for HER2 + /HR + and 87.9% for HER2 + /HR- patients. Lobular compared to other histological subtypes was associated with worse DFS, DDFS, and OS rates. Lower DFS, DDFS, and OS rates at 4 years were also detected for cT3/4 tumors compared to cT1 or cT2 tumors and for cN+ compared to cN0 status at baseline. 4-year survival rates of breast cancer subtypes according to HER2 and HR status are presented in detail in Table 3. Of clinical interest, the risk of having a DFS event within 4 years was > 10% in HER2 + patients with cT3/4 tumors or nodal involvement at primary diagnosis.

Table 2 Four-year rates for DFS, DDFS, and OS according to subgroups.
Table 3 Four-year rates for DFS, DDFS, and OS in subgroups by biological subtype.

Discussion

This analysis identified potential clinical risk factors in women with a pCR after neoadjuvant chemotherapy combined with anti-HER2 therapy in cases of HER2-positive disease. Overall, nodal involvement at diagnosis,cT3/4 tumors as well as lobular histology were identified being the most adverse factors in patients with a pCR, underlining that tumor burden at the time of diagnosis is important. In patients with TNBC, an initial positive nodal status and lobular histology were predicting a significantly higher risk of relapse or death. In patients with HER2-positive disease tumors initially classified as cT3/4 tumors indicated a higher risk of relapse and death, again very consistently for DFS, DDFS, and OS. This effect was strongest in the HER2-positive hormone receptor-negative subgroup. Four-year DFS and DDFS rates for patients included in this pooled analysis were lowest in those patients with lobular histology, cN+, and cT3/4 tumors.

Several individual trials and meta-analyses showed that a pCR following neoadjuvant chemotherapy improves long-term outcomes in terms of DFS, DDFS, and OS and this was in particular seen in patients with more aggressive tumor types like triple-negative or HER2-positive tumors3,4,5,22. However, up to 20% of patients with pCR will eventually relapse.

In individual studies long-term outcome is mainly driven by pCR vs no pCR, but also tumor size and nodal involvement assessed before neoadjuvant therapy resulted in an inferior outcome23,24. This is also reflected by the fact that these factors are included in the clinical pathological stage (CPS) score with poorer outcome in case of a more advanced tumor extent in HR positive BC25,26,27. However, none of these studies analyzed data of patients with and without pCR separately due to limitations in the number of patients and events. Several neoadjuvant trials have shown that patients with small tumors or no nodal involvement at diagnosis are more likely to have a pCR23,28,29. It is of interest that in patients diagnosed with a more advanced clinical stage before neoadjuvant treatment a pCR was less likely and in case a pCR was present the relapse rates were still higher compared to those patients with a pCR and initially small tumors or negative nodes. An explanation might be that these patients may have more systemic disseminated tumor cells at diagnosis, which are resistant to treatment. In fact, trials investigating the role of circulating tumor cells (CTCs) in neoadjuvantly treated patients demonstrated that the presence of CTCs before and not after neoadjuvant therapy negatively influenced the outcome in terms of OS, DDFS, and locoregional relapse-free survival30,31. The number of CTCs before the start of neoadjuvant therapy was also important with the worst outcome seen in patients with the highest CTC counts. In the meta-analysis of Bidard et al., the detection of CTCs at diagnosis was significantly associated with greater tumor size30. A retrospective analysis on tissues from the primary and the metastases from patients with a recurrent disease despite pCR and relapse matched to controls of pts with a pCR but no relapse showed the potential of transcriptomic analyses in this understudied cohort32.

Earlier analyses demonstrated that patients with lobular subtypes have lower pCR rates after neoadjuvant chemotherapy compared with invasive ductal carcinomas33. However, if they had no pCR their outcome was better compared to the non-lobular types. The low responses of lobular histology can be explained at least in part by their particular biologic profile with low proliferation rates, positive hormone receptors, and low grade34. Thus, patients with pure lobular histology (G2 and HR + /HER2-) are usually not candidates for the use of neoadjuvant chemotherapy and in fact only 3–4% of the patients included in the five GBG neoadjuvant trials were diagnosed with an invasive lobular subtype. From the GeparSepto study onwards patients with tumors of pure lobular histology were excluded from trial participation. However, there are also subtypes of lobular histology being high grade, hormone receptor-negative, and HER2-positive and in those lobular tumors with more aggressive biological features pCR rates were significantly higher with up to 20%33. In our investigation relapse rates were significantly higher in the non-classical lobular cohort (TNBC or HER2-positive) which may confirm that this histologic subtype is different from the pure lobular cancers. These high-risk lobular types may be enriched with HER2 mutations35,36.

A pooled analysis has several limitations. One of them is that the treatment regimens were quite different since the included trials were conducted between 2002 and 2013. However, the chemotherapy backbone in every trial was an anthracycline-taxane-based regimen, which can still be considered the standard of care, except for the paclitaxel and liposomal doxorubicin treatment applied in the GeparSixto trial. However, as this study focuses on patients with a pCR after neoadjuvant treatment, it was considered justifiable to also include patients from this trial as an anthracycline was administered. Taking the chemotherapy backbone in consideration, the findings may not be extrapolated to patients achieving a pCR with different treatment regimens. The limitation could only be overcome by conducting this analysis in patients that were treated with the same chemotherapy regimen. As relapses in breast cancer can frequently be seen beyond 5 years, especially in those of HR + subtype, our median follow-up time of nearly 5 years may be still too short. However, our trial population was enriched with biologically aggressive tumors, where typically most of the relapses occur during the first 5 years37,38, therefore our follow-up time most likely covers the majority of relapses. To overcome this limitation a longer follow-up would be necessary and this would implement that the analysis should be repeated in the future to confirm the conclusions. The strength of our analysis is that we could include almost 2,000 patients with a pCR in this pooled analysis. Our analysis was based on five prospectively randomized phase 3 neoadjuvant trials of a study collaboration (GBG/AGO-B), ensuring a very homogenous assessment of all study procedures like pre-treatment tumor assessments, central review of histopathological reports, and definition of pCR but also of additional study procedures including further post-surgical treatments which are usually not part of neoadjuvant trials. Furthermore, lymph node status before treatment was consistently assessed throughout all trials not only clinically but also with ultrasound.

Taken together this is a pooled analysis identifying factors predicting a relapse after pathological complete response following neoadjuvant chemotherapy. In our analysis lobular histology, larger tumor size and initially involved lymph nodes indicated a higher risk for DFS, DDFS, and OS events after a pCR following neoadjuvant treatment. The importance of these risk factors varied by intrinsic subtype, with nodal status and lobular histology as predictors in the triple-negative cohort, and the tumor size in the HER2-positive cohort. Thus, factors predicting a higher risk of relapse after a pCR were tumor extent before therapy and histological type. These data might become of clinical relevance, especially after validation, as treatments are available now for patients at high risk after neoadjuvant therapy, namely those with no pCR, i.e. T-DM1 for HER2 + , capecitabine, and pembrolizumab for TNBC and most recently, PARP inhibitors for patients with BRCA1/2 mutations and CDK 4/6 inhibitors for HER2-/HR + patients. So far, most trials recruiting in the postneoadjuvant setting focus on patients that did not experience a pCR. Taking the results of the here presented analysis into consideration there is also a group of patients with pCR that have a higher risk of relapse and should therefore be considered for an additional treatment despite pCR in postneoadjuvant studies to improve long-term outcome of this otherwise neglected patient population.

Patients and methods

Patients

Data from the five neoadjuvant trials GeparTrio (NCT00544765) (ethics committee of the University of Frankfurt), GeparQuattro (NCT00288002) (ethics committee of the University of Frankfurt), GeparQuinto (NCT00567554) (ethics committee of the University of Frankfurt), GeparSixto (NCT01426880) (ethics committee Nordrhein, Duesseldorf) and GeparSepto (NCT01583426) (ethics committee Berlin) conducted between 2002 and 2013 were pooled and only patients with a pCR (defined as no microscopic evidence of residual invasive tumor cells in any resected specimens of the breast and axillary nodes with in-situ residuals being allowed (ypT0/ypTis, ypN0)) were considered. Individual results and study designs of these studies have previously been reported8,9,10,11,12,13,14,15,16,17,18,19,20,21. All trials were approved by the respective ethics committees and patients had given written informed consent for study participation and data collection. All trials had comparable main eligibility criteria and used an anthracycline-taxane-based chemotherapy backbone (Supplementary table 1). The GeparSixto study enrolled only patients with triple-negative and HER2-positive breast cancer. Patients with HER2-positive disease received anti-HER2 treatment as part of their neoadjuvant therapy within the GeparQuattro, GeparQuinto, GeparSixto, and GeparSepto study. In the GeparTrio trial patients with HER2-positive tumors did not receive any anti-HER2 therapy as part of their neoadjuvant or adjuvant treatment since this was the clinical practice when the study was conducted between 2002 and 2005. Patients with HER2-positive disease from GeparTrio were therefore excluded from the analysis. After surgery, anti-HER2 therapy, endocrine therapy in patients with hormone receptor-positive breast cancer, and radiotherapy were given according to current national guidelines. The tumor subtype was centrally tested.

Objectives and endpoints

The primary objective of this pooled analysis was to characterize patients at higher risk for relapse despite a pCR after neoadjuvant chemotherapy for early breast cancer. Therefore, the influence of predefined potential risk factors as biological subtype (HER2-negative/Hormone receptor (HR)-positive, TNBC, HER2-positive/HR-positive, HER2-positive/HR negative), histological tumor type (lobular subtype, other), tumor grade (G1/G2, G3), tumor stage at baseline (cT1, cT2, cT3/4), nodal stage at baseline (cN0, cN + ), age (≤40, 41–59, ≥ 60 years), BMI ( < 25, 25–29, ≥ 30), menopausal status (pre-, postmenopausal), scheduled number of chemotherapy cycles (≤6, >6) and clinical response after 2–4 cycles (stable disease, partial response, complete response, progressive disease) on DFS was analysed. Secondary objectives were to assess the influence of the same risk factors on distant disease-free survival (DDFS) and overall survival (OS). Ki-67 (≤20% vs higher) was considered as a covariate in preliminary analyses but was excluded due to too many missing values in GeparTrio, GeparQuattro, and GeparQuinto where Ki-67 was not centrally assessed.

DFS was defined as the time in months from randomization to first relapse (local or distant), secondary malignancy, or death from any cause, whichever occurred first39. DDFS was defined as the time in months from randomization to any distant recurrence of disease, any secondary malignancy, or death due to any cause, whichever occurred first; OS as the time in months from randomization to death due to any cause. Patients without an event were censored at the date of the last contact.

Statistical analysis

Multivariate (including all potential risk factors and study) Cox proportional hazards models were used to report hazard ratios with 95% confidence intervals (CIs), adjusted for study to account for possible heterogeneity between the trials. Patients with missing values in risk factors were excluded from multivariate Cox regression models. Patients with missing values for the variable defining the subgroup were excluded from the analyses in this subgroup.

For every potential risk factor, 4-year DFS, DDFS, and OS rates and the corresponding 95% CIs were estimated using the Kaplan–Meier method.

All endpoints were additionally analyzed in the subgroups according to biological subtype.

All reported p-values with p < 0.05 were considered statistically significant. No adjustment for multiple testing was performed. SAS versions 9.2 and 9.4 under SAS Enterprise Guide 4.3 and 8.5 were used to perform the analyses.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.