Main

In the United States, breast cancer is the most common newly diagnosed cancer and the second most common cause of cancer-related death in women (Siegel et al, 2015). Human epidermal growth factor receptor 2 (HER2) is overexpressed in 20% of patients with primary invasive breast cancer. HER2 amplification or overexpression is associated with an aggressive tumour phenotype, and patients with HER2-positive breast cancer had shorter survival durations than those with normal HER2 expression until the monoclonal antibody for HER2, trastuzumab (TmAb) (Herceptin, Genentech Inc., South San Francisco, CA, USA), was developed (Slamon et al, 1987, 1989; Press et al, 1997).

Neoadjuvant systemic therapy (NST) in combination with anti-HER2-targeted therapy has been well established as a treatment strategy for HER2-positive breast cancer. Patients with HER2-positive breast cancer obtain additional clinical benefit from NST when it is combined with TmAb; higher pathologic complete response (pCR) rates and better survival outcomes are achieved in these patients compared with those treated with cytotoxic agents alone (Gianni et al, 2010). Moreover, anti-HER2 doublet therapy combined with cytotoxic agents has recently become a standard NST regimen for HER2-positive breast cancer. Neoadjuvant chemotherapy combined with TmAb plus lapatinib demonstrated improved pCR rates compared with chemotherapy combined with single-agent TmAb (Baselga et al, 2012). More recently, prospective randomised controlled trials (NeoSphere and TRYPHAENA trials) have reported that anti-HER2 doublet therapy of TmAb and pertuzumab yielded higher pCR rates and better survival outcomes than single-agent anti-HER2-targeted therapy (Gianni et al, 2012; Schneeweiss et al, 2013). Currently, we recommend the anti-HER2 doublet therapy of TmAb and pertuzumab as a treatment in neoadjuvant setting in HER2+ breast cancer.

Sensitivity to systemic therapy is known to vary depending on hormone receptor (HR) status among patients with HER2-positive breast cancer. Previous studies reported that HR-positive tumours were less sensitive to combination systemic therapies. The pCR rate in patients with HR-positive and HER2-positive disease was 20%, compared with 36% in those with HR-negative disease (Gianni et al, 2012). Another study showed that HR-negative status significantly increased the duration of event-free survival in patients with HER2-positive disease (hazard ratio 0.46, 95% confidence interval (CI) 0.27–0.80), but HR-positive status did not affect event-free survival (hazard ratio 0.87, 95% CI 0.43–1.74) (Gianni et al, 2010). pCR rates after NST with regimens that include TmAb tend to be high in patients with HR-negative and HER2-positive breast cancer. However, the optimal way of predicting which patients in this group are most likely to achieve pCR after NST with regimens that include anti-HER2-targeted therapies is still unknown. In addition, oestrogen receptor (ER) expression is currently considered as a categorical variable in clinical settings; it is unclear whether ER expression level as a continuous variable is associated with pCR.

Recently, our group showed that a high HER2/CEP17 ratio is a predictor of high pCR rates among patients with HER2-positive locally advanced breast cancer who received NST that included TmAb (Kogawa et al, 2016). Given that HR status tends to predict pCR rates, considering ER expression (as a continuous variable) together with HER2/CEP17 ratio may provide a better prediction of pCR rates than either variable alone. We hypothesised that high HER2/CEP17 ratios and low ER expression levels are associated with high pCR rates in patients with HER2-positive breast cancer treated with NST. We constructed a nomogram to predict pCR rates in patients with primary HER2-positive breast cancer treated with NST.

Materials and methods

Study design

This retrospective single-centre study was approved by the Institutional Review Board at The University of Texas MD Anderson Cancer Center (protocol number: PA 12-1173) and the University of Texas Health Science Center at Houston School of Public Health (protocol number: HSC-SPH-15-0038). Patients were identified from the Breast Cancer Management System database (protocol number: 2004-0541) at MD Anderson. We retrospectively reviewed patients with stages I–III HER2-positive invasive breast cancer who had definitive surgery in 1999–2015 and received NST. HER2-positive disease was defined as a HER2/CEP17 ratio of 2.0 according to fluorescence in situ hybridisation (FISH) or an immunohistochemistry (IHC) staining score of 3 or higher (Wolff et al, 2013). Patients for whom continuous ER and PR levels or HER2/CEP17 ratios were not available were excluded from our analysis.

We collected the following information from patient charts and the database: age, patient-reported race, menopausal status, body mass index (BMI), ER and progesterone receptor (PR) status (as a continuous variable from 0 to 100%), HER2/CEP17 ratio, histologic subtype and nuclear grade (determined from a biopsy specimen), clinical stage inflammatory breast cancer (IBC) or non-IBC, NST regimens, and treatment response (pCR or non-pCR). A waiver of informed consent was authorised because the study was a retrospective chart review involving no additional diagnostic or therapeutic intervention.

Pathologic evaluation

Diagnostic biopsy and resected specimens were evaluated by a dedicated breast pathologist at MD Anderson. ER and PR expression were evaluated in the diagnostic biopsy samples using IHC staining; the percentage of expression, ranging from 0 to 100%, was calculated at the time of diagnosis. pCR after NST was defined as no evidence of residual invasive cancer in the breast and no residual cancer in the axilla at the time of definitive surgery.

Statistical analysis

The primary endpoint of interest was pCR. Variables of interest included age, race, BMI prior to NST, menopausal status, histologic findings, clinical stage, IBC or non-IBC, ER expression, PR expression, HER2/CEP17 ratio, and NST regimen (containing TmAb, TmAb plus pertuzumab (PmAb), or cytotoxic agents only). Data were summarised using descriptive statistics. Associations between categorical variables were examined using the χ2 test or Fisher exact test when appropriate. The Wilcoxon rank-sum test or Kruskal–Wallis test was used to examine differences in continuous variables between or among patient groups. Multivariate logistic regression models were applied to assess the effect of variables of interest on pCR status. Backward stepwise selection was applied to determine which variables were included in the final multivariable model. A nomogram was built to estimate the probability of pCR, given the risk factors in the final multivariable model. A bootstrap validation method based on 200 bootstrap samples was employed to estimate the bias-corrected or overfitting-corrected predictive discriminative ability of the model, which was presented as the concordance index (C-index; Efron and Gong, 1983). The accuracy of the logistic regression model was assessed graphically with a calibration curve by plotting the predicted probabilities of pCR on the x axis with the observed probabilities of pCR on the y axis. Two-sided P<0.05 was considered statistically significant in all statistical analyses. All computations were carried out in SAS 9.3 (SAS Institute Inc., Cary, NC, USA) and R 3.1.2.

Results

Patient characteristics

A total of 793 patients were included in the analysis, and 345 of them (43.5%) achieved pCR. Demographic and clinicopathologic characteristics are summarised in Table 1. Four hundred ten patients (51.7%) were postmenopausal, and postmenopausal women had significantly higher pCR rates than premenopausal women (P=0.03). The mean ER expression level was 47.1% in the non-pCR group and 28.2% in the pCR group (P<0.001), and the mean PR expression level was 26.2% in the non-pCR group and 14.6% in the pCR group (P<0.001). Patients with non-IBC had a higher pCR rate (44.5%) than those with IBC (34.2%; P=0.09), although not statistically significant. Patients who received NST regimens containing TmAb or TmAb plus PmAb had significantly higher pCR rates than those receiving NST regimens containing cytotoxic agents alone (TmAb: 277 out of 596 patients, 46.5%; TmAb plus PmAb: 42 out of 81 patients, 51.9%; cytotoxic agents alone: 26 out of 116 patients, 22.4%; P<0.001). There was no significant interaction between NST regimen and HER2/CEP17 ratio by pCR rate.

Table 1 Baseline patient characteristics

Logistic regression model of pCR and nomogram to predict pCR

In multivariate logistic regression analysis, ER expression level, HER2/CEP17 ratio, IBC or Non-IBC, and NST regimen remained in the final model were associated with pCR rates. PR was included in the model as a variable of interest. Low ER expression was associated with higher pCR rates (odds ratio (OR) 0.99, 95% CI 0.98–0.99, P<0.001). High HER2/CEP17 ratio and non-IBC were also associated with higher pCR rates (HER2/CEP17 ratio: OR 1.05, 95% CI 1.004–1.10, P=0.03; non-IBC: OR 2.23, 95% CI 1.30–3.82, P=0.003; Table 2). NST regimens containing TmAb and TmAb plus PmAb were associated with significantly higher pCR rates than NST regimens containing cytotoxic agents alone (TmAb: OR 3.41, 95% CI 2.08–5.61, P<0.001; TmAb plus PmAb: OR 4.03, 95% CI 2.05–7.91, P<0.001; Table 2). The final regression model of pCR remained similar in the analysis without IBC patients (N=314, Supplementary Table 1). Since majority of patients with HER2+tumour are treated with TmAb with or without PmAb, we created a two different multivariate logistic model to see the effect of cytotoxic agents alone in the model. The first model removed cytotoxic agent alone from the model in Table 2. Even after removing cytotoxic agents, ER, PR, HER2 FISH ratio, and IBC remained same in the model and NST regimen was not significant (Supplementary Table 2). In the second model, we conducted a new model selection after removing cytotoxic agent alone, ER, PR, HER2 FISH ratio, and IBC again remained same and NST regimen was not included in the final model. (Supplementary Table 3).

Table 2 Multivariate logistic regression model of pathologic complete response (345 out of 793 patients; 43.5%)

A nomogram constructed on the basis of the final multivariable model is shown in Figure 1. To test the discriminative ability of the model, we calculated the C-index using bootstrap methods with 200 replications. The C-index was 0.69 (95% CI 0.66–0.73). The calibration curve used to assess the goodness-of-fit of the model (Figure 2) showed good agreement between the predicted and observed probabilities of pCR.

Figure 1
figure 1

Nomogram to predict the probability of pathologic complete response (pCR). ER=oestrogen receptor; IBC=inflammatory breast cancer; PR=progesterone receptor.

Figure 2
figure 2

Calibration curve of observed and predicted probabilities. The x axis is the predicted probabilities measured by the final logistic regression model and the y axis is the actual probabilities. pCR=pathologic complete response.

Discussion

Our results demonstrated that high HER2/CEP17 ratios and low ER expression levels were associated with high pCR rates in patients with HER2-positive breast cancer treated with NST. On the basis of these results, we created a nomogram to predict the probability of pCR in patients with stages I–III HER2-positive primary breast cancer treated with NST using five covariates: ER expression level, PR expression level, HER2/CEP17 ratio, IBC or non-IBC, and NST regimen.

Previous studies have shown that the response to systemic therapy in breast cancer varies depending on the disease characteristics. Our group previously showed that a high HER2/CEP17 ratio was a significant predictor of pCR in patients with locally advanced disease who had received NST (Kogawa et al, 2016). In patients with HER2-positive breast cancer who had received NST-containing cytotoxic agents and TmAb, negative ER/PR status was an independent predictor of pCR in multivariate analysis (adjusted OR 3.42, 95% CI 2.42–4.86, P<0.001; Takada et al, 2014). Another study demonstrated that in patients not treated with NST-containing TmAb, pCR was highest for those with HR-negative and HER2-positive disease (45%) and lowest for those with HR-positive and HER2-negative disease (9%) (Esserman et al, 2012). A retrospective study showed that high HER2 expression was associated with high pCR rates after treatment with the combination of lapatinib and TmAb, irrespective of HR status (Scaltriti et al, 2015). Some prospective studies have shown that patients with HR-negative disease were sensitive to chemotherapies, allowing them to achieve pCR more than those with HR-positive disease, which is also consistent with our results (Esserman et al, 2012; Alba et al, 2014).

The current study is, to the best of our knowledge, the first to generate a nomogram using various patient and disease characteristics to predict pCR after NST in HER2-positive breast cancer. Our study cohort also included those treated with PmAb-containing regimens; PmAb was recently (September 2013) approved by the US Food and Drug Administration for neoadjuvant treatment of HER2-positive breast cancer and has been the standard of care since then. In addition, in our model, ER, PR, and HER2/CEP17 ratios were treated as continuous variables. This nomogram can provide the estimated probability of achieving pCR after the planned NST using clinicopathologic features.

Several limitations to our findings should be noted. First, this is a retrospective study and thus we could not account for unknown factors related to pCR. Second, we could not assess this nomogram in terms of survival outcomes because of the short duration of follow-up in patients in the cohort who received NST with PmAb. This is mainly because PmAb was just approved in 2013 and we did not have enough events to analyse survival outcomes based on the predicted probability obtained from the nomogram. Third, the quality of supportive care could have varied because of the long duration of the inclusion period. In particular, those who received NST-containing cytotoxic agents alone might have been treated longer ago than those who received anti-HER2-targeted therapy. In addition, relatively low-quality supportive care may have led to delay or discontinuation of the systemic therapy, leading to low pCR rates in the group receiving NST-containing cytotoxic agents alone. However, all patients were treated at MD Anderson and any differences in quality of the care are therefore likely to be small.

Another issue is the reproducibility of ER/PR expression level. Because we treated ER and PR as continuous variables, the percentage might change depending on the pathologist who reviewed the slides. We included clinical stage in the analysis instead of T stage and N stage separately. In ER-positive breast cancer in particular, node status can be an important indicator of survival outcomes (Sparano et al, 2015). However, our primary endpoint in the current study was pCR. In addition, all patients in our cohort had HER2-positive breast cancer and hormone status was one of the variables used in the analysis. In the current study, tumour size and node status were not assessed independently. However, the effect of it was likely small and tumour size and nodal status were included in staging that was included in analysis. Also neoadjuvant systemic treatment strategy is the same regardless of tumour size or nodal stage.

Lastly, although we used a bootstrap method to validate the model internally, we did not have an independent cohort outside of MD Anderson to validate the nomogram externally. Without external validation, whether the reported nomogram can be generalised in other patient cohorts such as different patient demographic characteristics and treatment outside of MD Anderson is not clear. Also the causality of the variables are still not proved. We will continue to seek such data sets for future validation study.

The C-index of our model was 0.69. This means that our model can distinguish a patient with pCR from a patient without pCR 69% of the time. C-index values range from 0.5 to 1; a value of 0.5 means the discrimination is just random chance and a value of 1 indicates a perfect discriminative ability. Some important variables were not available to be considered in the model. For example, we could not evaluate the heterogeneity of the HER2-positive tumours. One study demonstrated that heterogenous tumours with a HER2/CEP17 ratio of more than 2.2 in <80% of the tumour cells were associated with poor response to TmAb (Mcghan et al, 2014). We could not consider intratumor heterogeneity because we performed a retrospective chart review. This may have contributed to the low discriminative ability. Other factors may also lead to treatment resistance; several studies have investigated both intrinsic resistance and acquired resistance to treatment (Gottesman, 2002; Sui et al, 2007; Colleoni et al, 2009; Takada et al, 2014). For example, one study demonstrated that the ERα gene could increase drug resistance in breast cancer cells by activating MDR1 transcription (Shi et al, 2014).

By using the model developed in the current study, we can estimate the probability of pCR before the start of NST. However, to answer the critical clinical question of who definitely needs PmAb in addition to TmAb, we need to identify biological and other clinical factors to improve the predictability of pCR in future studies.

In conclusion, patients with high HER2/CEP17 ratios and low ER expression levels are most likely benefit from NST. Our nomogram predicted pCR rate after NST among patients with HER-2 positive primary breast cancer using clinicopathologic factors. To improve the predictability of this nomogram, we need to identify other clinical and molecular factors. Further studies, especially an external validation, are warranted.