Main

Adjuvant systemic therapy has made a significant contribution to the improved outcomes for patients diagnosed with oestrogen receptor (ER)-positive early breast cancer over the past three decades. Five years of adjuvant tamoxifen therapy has been estimated to improve breast cancer mortality by about a third (Early Breast Cancer Trialists Collaborative G, 2005) with superior benefit seen with the use of aromatase inhibitors in post-menopausal women (Breast International Group 1-98 Collaborative G et al, 2005; Howell et al, 2005; Cuzick et al, 2010). More recently, extending hormonal therapy beyond 5 to 10 years has shown further benefit using either tamoxifen (Davies et al, 2013; Gray et al, 2013) or 5 years of an aromatase inhibitor after an initial 5 years of tamoxifen (Goss et al, 2007).

Polychemotherapy provides a further reduction in breast cancer mortality when given in addition to endocrine therapy, improving breast cancer mortality on average by a third (RR 0·78, SE 0·07, 2p=0·001 in women 55–69 years) (Early Breast Cancer Trialists' Collaborative G et al, 2012). However its use is associated with potential toxicities both in the short term such as alopecia, nausea, vomiting and neutropenic sepsis and in the longer term rarer effects such as infertility, cardiac impairment and haematological malignancies.

Not all women with ER-positive disease will have substantial gains from chemotherapy; this includes those with such a good prognosis, taking into consideration their endocrine therapy that they could not benefit meaningfully from additional treatment. It would be beneficial for such patients to be spared these toxicities. Traditionally tumour size, grade and nodal burden have dominated decision-making, particularly through the use of decision aids such as the Nottingham prognostic index and Adjuvant! Online (AoL). However a number of prognostic indices have been developed that improve prognostic accuracy.

The IHC4+C score is a prognostic tool that estimates the residual risk of distant recurrence at 10 years in post-menopausal women with ER-positive breast cancer who have received 5 years of endocrine therapy. It incorporates both immunohistochemical parameters of ER, progesterone receptor (PgR), HER2 and Ki67 as well as the clinicopathological parameters of tumour size, grade, nodal status and type of endocrine therapy administered for 5 years (tamoxifen vs aromatase inhibitor). It was developed from a retrospective analysis of 1125 patients with ER-positive disease from the TransATAC cohort who did not received chemotherapy, validated in an independent cohort of 786 patients and has been shown to perform similarly to the Genomic Health recurrence score (RS) in predicting distant recurrence (Cuzick et al, 2011). In this journal, Stephen et al, 2014 showed that in an endocrine treated population from the TEAM trial, the IHC4 provided stronger prognostic information when compared with the Mammostrat score.

When used in adjuvant decision-making, AoL, RS and IHC4+C, categorise patients into low-, intermediate- and high-risk categories. Patients in the low-risk range are usually recommended endocrine therapy alone, and those in the high-risk range are offered chemotherapy in addition to endocrine therapy. There is no consensus as to the optimal treatment for those in the intermediate-risk range and while clinical trials are currently underway to address this uncertainty (MINDACT and TAILORX), many patients are currently offered chemotherapy. In a retrospective analysis of 101 patients who had both AoL and IHC4+C risk estimates calculated, the IHC4+C score reclassified more than half of the patients stratified as being of intermediate risk using AoL to low risk (Barton et al, 2012).

The need to study the clinical utility of the IHC4+C score has recently been identified by the National Institute for Health and Care Excellence (NICE) in its 2013 guidance paper on gene expression profiling and expanded immunohistochemistry tests for guiding adjuvant chemotherapy decisions in early breast cancer management (NICE diagnostic guidance 10, 2013). The aim of this prospective study was to determine the effect of the IHC4+C score on adjuvant systemic decision-making in the setting of the multidisciplinary team meeting (MDT) at our institution, the forum for making adjuvant treatment recommendations. The MDT has been shown to provide consistent and evidence-based decision-making and is now part of the standard of care in the management of breast cancer (Saini et al, 2012).

Patients and methods

Patient population

Post-operative ER-positive breast cancer patients were prospectively selected for IHC4+C testing during the breast unit MDT at The Royal Marsden Hospital between January 2013 and April 2014 (Figure 1). Patients were eligible if they were 50 years or older at the time of diagnosis with an ER-positive, HER2-negative (HER2 0 or 1+ on immunohistochemistry or DDISH-negative if HER2 2+) breast tumour with pathological lymph node stage N0/1. All patients were treated with primary surgery. Nodal involvement with micrometastases and/or ITCs was classified as node-negative for the purpose of scoring. Patients with T4 disease, incomplete axillary staging, before neoadjuvant therapy and patients in whom the adjuvant decision would be in no way changed by the availability of the score (e.g., significant co-morbidities, patient’s known intention against chemotherapy), were excluded.

Figure 1
figure 1

Patient decision pathway. *Time between MDT2 and MDT3 1 week.

Study design

Adjuvant treatment decisions were recorded at three time points during this study (Figure 1). The initial recommendation by the MDT clinicians for adjuvant systemic therapy was recorded (Decision 1). The IHC4+C score was requested by clinicians to be derived only for patients in whom it was felt the IHC4+C could affect decision-making. The IHC4 staining was requested and the IHC4+C prediction of the risk of distant recurrence at 10 years was presented to the MDT 1 week later. Their subsequent recommendation with information provided from the score was recorded (Decision 2). The patient’s final decision after discussion in the clinic as to their accepted choice of adjuvant therapy was also recorded (Decision 3).

This decision was reported in one of three ways: (i) for endocrine therapy alone (no chemotherapy recommended); (ii) for chemotherapy (in addition to endocrine therapy); or (iii) for a discussion of chemotherapy (i.e., a less definitive indication for chemotherapy but that it should be discussed with the patient; that discussion then could lead to the patient accepting chemotherapy). The study was approved as a service evaluation by the institution’s clinical audit committee.

Study endpoints

The primary endpoint was the proportion of patients who had a change in their adjuvant systemic treatment recommendation by the MDT due to the availability of the score. Secondary goals were to correlate the IHC4+C score with the AoL score in predicting risk of recurrence with endocrine therapy and to calculate the proportion of patients who have a change in their actual choice of adjuvant treatment (Decision 3) from the MDT recommended treatment (Decision 2).

The IHC4+C score calculation

Clinicopathological data was collected prospectively from the MDT including age and menopausal status and type of endocrine therapy intended. Immunohistochemistry for ER, PgR, HER2 and Ki67 was performed as previously described (Dowsett et al, 2008). ER was quantified by using the H-score, which is defined as the percentage of cell staining weakly plus 2 × the percentage of cell staining moderately, plus 3 × the percentage of cell staining strongly, and was considered positive if >1. The variable ER10 was derived by dividing the H-score by 30 to obtain a variable with a range of 0–10. PgR was scored as the percentage of cells staining positive, and was divided by 10 to obtain a variable between 0 and 10 (PgR10). HER2 status was assessed as either positive (3+ or 2+ and CISH or FISH positive) or negative (0/1+ or 2+ and CISH or FISH negative). Ki-67 scores were recorded as the percentage of positively staining cancer cells using the MIB1 antibody, in five representative high-power fields (Zabaglo et al, 2010).

The IHC4+C score was calculated using the addition of two algorithms:

(i) The IHC4 score

using an adjustment for Ki67 as previously described because of the differences in scoring from manual methods (Barton et al, 2012).

(ii) The clinical score (C)

with N, T, Gr and Age representing categories of nodal status, tumour size, grade and patient age, respectively, and Ana representing treatment with anastrozole as opposed to tamoxifen (Cuzick et al, 2010; Barton et al, 2012).

AoL distant recurrence calculation

An adjustment to AoL was required in order to compare the estimated residual risk of distant recurrence at 10 years between AoL and IHC4+C. As previously described (Barton et al, 2012) the AoL residual risk of breast cancer recurrence was calculated using the AoL’s estimate of breast cancer mortality at 10 years and subtracting the estimated benefit of 5 years of endocrine therapy, leaving the 10-year breast cancer-specific mortality. This was then multiplied by 1.25, to account for only the distant recurrence events (estimated to be 25% greater than the breast cancer mortality events in the ATAC study at 10 years (Cuzick et al, 2010)).

Statistical analysis

The risk cutoffs used of 10 and 20% residual risk of distant recurrence at 10 years were chosen to correspond with the 18 and 30 RS risk of distant recurrence at 10 years that has previously been is used to classify patients as being at intermediate risk (Howell et al, 2005; Dowsett et al, 2010; Barton et al, 2012). The planned sample size of 100 patients at this single institution was previously sufficient to demonstrate the reclassification by IHC4+C of patients into a lower risk group (Barton et al, 2012).

The relationship between AoL and IHC4+C was assessed on a continuous basis graphically and by Spearman’s Rank correlation. The agreement of the AoL and IHC4+C on a categorical basis was tested using κ statistic where a κ statistic<0.4 is considered poor agreement, 0.4–0.75 is considered moderate agreement and >0.75 is considered a strong agreement.

Results

Patient characteristics

Between January 2013 and April 2014, 124 patients met eligibility criteria for the study with tissue suitable for IHC4+C assessment. Clinical and tumour characteristics are summarised in Table 1. The median patient age was 59 years, the median tumour size was 20mm and most were grade 2 (65%). Twenty-six percent of patients had macroscopic lymph node involvement and most (85%) patients had an aromatase inhibitor as their recommended endocrine therapy leaving 19 patients (15%) recommended for tamoxifen, predominantly because of perimenopausal status.

Table 1 Clinicopathological characteristics

Comparison between MDT decisions with and without the availability of IHC4+C score (Decisions 1 and 2)

The IHC4+C score led to more endocrine therapy alone recommendations being made by the MDT. Figure 2 shows the concordant and discordant decisions with IHC4+C scoring. There was concordance in the MDT’s recommendation between Decisions 1 and 2 in 90 cases (72%). In the 34 discordant cases, all but one led to endocrine therapy alone being recommended rather than chemotherapy plus endocrine therapy after the IHC4+C score was given to the MDT. In Table 2: from Decision 1, 50 (40%) patients had an endocrine therapy alone recommendation and this increased to 82 (66%) patients at Decision 2. There were no ‘discuss chemotherapy’ recommendations at Decision 2.

Figure 2
figure 2

Concordant and disconcordant MDT decisions between Decisions 1 and 2.

Table 2 Change in MDT decision-making without (Decision 1) and with (Decision 2) the IHC4+C score

Figure 3 shows individual changes in treatment recommendations for all patients at Decisions 1,2 and 3. At Decision 1, chemotherapy or at least its discussion was recommended in 74 (59%) patients. This fell to 42 (34%) at Decision 2, potentially sparing 32 (26%) patients a chemotherapy recommendation. In the 32 patients with node-positive disease (not including micrometastases), a subgroup traditionally thought to be of increased risk of relapse, 11 (34%) patients had their initial chemotherapy recommendation changed to endocrine therapy alone.

Figure 3
figure 3

Adjuvant systemic therapy treatment decisions. Fifteen patients who were recommended chemotherapy in Decision 2, chose not to have it at Decision 3. Five patients who were recommended endocrine therapy alone in Decision 2 chose to have chemotherapy at Decision 3. Two patients who were recommended endocrine therapy at Decision 2 chose to have no adjuvant therapy in Decision 3.

Comparison between MDT recommendation to patient and patient’s decision (Decisions 2 and 3)

Patients adhered to the recommendation by the clinician 82% of the time (102 out of 124) of cases (Figure 3). Fifteen patients who were recommended chemotherapy in Decision 2, chose not to have it at Decision 3. Five patients who were recommended endocrine therapy alone in Decision 2 chose to have chemotherapy at Decision 3. Two patients who were recommended endocrine therapy at Decision 2 chose to have no adjuvant therapy in Decision 3.

Comparison between IHC4+C score and AOL

Figure 4 shows the relationship between AoL and IHC4+C estimates of risk of distant recurrence at 10 years for the 124 patients. The Spearman’s Rank correlation was 0.73. Agreement between the categories created for the IHC4+C score and the AoL score was 54% (CI 45.7–63.8%; κ =0.32). AoL generally estimated patients at higher risk than IHC4+C. AoL estimated 31% of patients as high risk, while IHC4+C rated only 10% patients as high risk. From the 38 patients classified as high risk by AoL, 58% were reclassified as intermediate risk by IHC4 and another 8% reclassified in the low-risk range. From the 46 patients in the intermediate AoL range, 63% were reclassified into the low-risk range using IHC4+C. IHC4+C rated 56% of patients as low risk, as opposed to AoL which rated only 30% as low risk. Only two patients were reclassified from low-to-intermediate risk by the use of IHC4+C score, and none were reclassified to high risk.

Figure 4
figure 4

Spearman’s rank correlation between AoL and estimates of risk of distant recurrence at 10 years. Agreement: 54.8% (CI 45.7–63.8%; κ =0.32).

Discussion

To our knowledge, this is the first prospective study of the clinical utility of the IHC4+C score. The population was clinically relevant: all patients in whom the score was performed were felt by the MDT to have either a minimal or low benefit from chemotherapy, such that additional information provided by the score may help to reinforce their initial recommendation of endocrine therapy alone, or change their initial recommendation from chemotherapy to endocrine therapy alone. Over the 15-months period of recruitment for this study, 850 women over 50 years of age with ER-positive breast cancer were presented at the post-operative MDT for decision-making, leading to 124 (15%) patients having had an IHC4+C test requested by clinicians. For patients in whom it was felt the availability of the score could not have any impact on decision-making, it was not requested (e.g., co-morbidities would preclude chemotherapy anyway). Long-term follow-up in these patients is planned.

The IHC4+C score provided useful information to the MDT in two ways.

  1. 1

    Reducing chemotherapy recommendations in a borderline population of patients. From those patients in whom the initial recommendation was either for a discussion or clear recommendation of chemotherapy, 26% of these patients were subsequently recommended endocrine therapy alone, as guided by the residual recurrence estimates provided by the score, and hence sparing a significant proportion of women the potential short and long-term toxicities of chemotherapy, as well as sparing health-care providers the costs in providing such treatment. It is estimated that in the United Kingdom the cost of six cycles of second generation chemotherapy with fluorouracil, epirubicin and cyclophosphamide is £3500 (Campbell et al, 2011). In our series, 32 patients were potentially spared chemotherapy, amounting to a direct chemotherapy cost saving of £112 000 over a period of 15 months of patient recruitment, not withstanding the cost of hospital admissions for the treatment of febrile neutropenia as well as sparing of undesirable side effects of chemotherapy for patients.

  2. 2

    Providing reassurance in concordant cases. Patients in this study were requested for an IHC4+C on the basis that the clinician felt that more information provided by the score could assist in their decision-making. Most patients (73%) had concordant decisions recommended before and after scoring. We estimated that the MDT considered their initial recommendation for treatment to be ‘borderline’ for 47 out of these 91 concordant cases. Hence concordant scoring helped to confirm their decision and provide reassurance to the clinician.

The IHC4+C score is a prognostic tool and does not offer predictive information. In our study, the use of this prognostic tool enabled a large proportion of patients to be treated with endocrine therapy alone. As shown in Figure 3, the initial MDT recommendation confidently spared chemotherapy in 50 (40%) patients (Decision 1). With the availability of the score 90 (72%) patients ultimately decided on endocrine therapy as their treatment of choice (Decision 3).

One criticism of this study may be that the benefit of the IHC4+C score in reducing chemotherapy recommendations may be overestimated by the group in whom a discussion of chemotherapy was recommended. In 45 out of the 124 (36%) patients in whom an IHC4+C score was requested by the MDT, the original recommendation was for a discussion regarding chemotherapy. With the availability of the score, 21 of these were still recommended chemotherapy, leaving 24 patients to be recommended endocrine therapy only. Without the availability of the IHC4+C score to the MDT, there can be only an estimation of how many of these patients who were recommended for a discussion would have actually had chemotherapy. Using our best clinical estimate and based on prior experience, in the absence of the IHC4+C score, chemotherapy would most likely have been given in 36 (80%) patients in whom a discussion of chemotherapy was the initial recommendation. This estimate is based on the fact that in the ‘discuss chemotherapy’ cohort, 13 cases were >3cm; an additional 12 were grade 3 and an additional 11 patients who were not grade 3 or over 3 cm had nodal involvement. It was felt that it would be inappropriate to ask patients, in what is often a stressful time at the point of deciding on chemotherapy or not, what their decision would be without and with the availability of the IHC4+C score.

The IHC4+C score is currently only validated in the post-menopausal patient. However it may be seen that the information provided by the score would be equally if not more usefully applied to premenopausal women, particularly given the potential effects chemotherapy may have on fertility in this cohort. This would require testing the calculator in a large set of premenopausal women, treated with tamoxifen but not chemotherapy

In this prospective single institution study, we have shown that the IHC4+C score can be performed in real time, making it available to the MDT for decision-making. The IHC4+C score was made available to the MDT within 1 week and in time for medical oncology appointments where most adjuvant systemic therapy is discussed. Scoring requires a dedicated breast pathologist, and scoring of immunohistochemical markers provided routinely by most service histopathology laboratories in the United Kingdom. Clinical information using the patient age and intended endocrine therapy (aromatase inhibitor vs tamoxifen) are included in the calculator. It is important to note that this information should be gathered on pretreatment samples (i.e., no neoadjuvant therapy or other presurgical treatment administered).

Commercially available molecular signatures (Oncotype DX (Genomic Health, Redwood City, CA, USA); Endopredict (Sividon Diagnostics, Cologne, Germany); Prosigna (Nanostring, Seattle, WA, USA)), remain inaccessible to the majority of NHS patients at this time. Oncotype DX may be available through an access scheme for some eligible NHS patients with early stage hormone-receptor positive, HER2-negative breast cancer who are considered to be at intermediate risk (NICE diagnostic guidance 10, 2013). The IHC4+C score has the potential to fulfil these unmet needs by providing similar prognostic information to assist in adjuvant decision-making without the need to send tissue externally, and without the significant financial cost, to the patient, insurer or trust for doing so. The cost of performing the score is estimated to be less than £200 per case at our institution.

All scoring was performed within our single institution by our research team, which has extensive experience in conducting the tests and assessing the results in order to produce standardised input for the IHC4 algorithm and this may be seen as a limitation of the study. If IHC4+C scoring is to be routinely used across the UK, there will be a need to validate local procedures and to quality assure performance on a continuing basis. The standardisation of Ki67 assessment in particular has been shown to be problematic (Dowsett et al, 2011). However there is evidence, at least from the UK, that Ki67 immunohistochemistry is technically reproducible when laboratories validate pre-analytical and technical aspects appropriately and participate in an external quality assessment scheme (Parry et al, 2014), and efforts are ongoing by an international study group to improve its analytical reproducibility (Polley et al, 2013). We have recently established that the IHC4 score is tolerant of multiple different IHC platforms (Dodson et al, 2015). Nonetheless it is important that labs should establish that their own methodology yields acceptable concordance with reference methodology (Cuzick et al, 2011) before implementation. Ultimately we envisage that the IHC4+C calculator may be universally available in an online format like AoL where the individual components can be entered by the treating team.