Introduction

The mortality rates in breast cancer patients have decreased dramatically in the past two decades, due mainly to the administration and improvement of adjuvant systemic therapy.1,2 Clinical decision-making regarding administration of adjuvant systemic therapy in breast cancer patients is guided by conventional prognostic factors such as age, tumor size, tumor grade, status of axillary lymph nodes, and hormone receptor status. Clinicopathological risk indexes, such as Adjuvant!Online and the Nottingham Prognostic Index, use these factors to estimate the risk of recurrence and the benefit of adjuvant systemic treatment.3,4 Nevertheless, breast cancer tumors with comparable clinicopathological characteristics may have considerable different outcomes, reflecting the heterogeneity of the disease.5

Several gene expression profiles were developed to predict the risk of dissemination in breast cancer patients, and these gene expression profiles have drawn attention as an accurate alternative or adjunct for predicting outcome in individual breast cancer patients. The European Society of Medical Oncology suggests the use of gene expression profiles to gain additional prognostic and/or predictive information to complement pathology assessment, particularly in patients with estrogen receptor (ER)-positive (+) breast cancer.6 The current Dutch national (NABON) guideline for breast cancer (2012) recommends the use of a validated gene expression profile in patients with an invasive ductal carcinoma with ER-positive disease and a questionable indication for adjuvant chemotherapy (CT) based on conventional prognostic factors.7

To date, research of gene expression profiles mainly focuses on the value of the test for the individual patient. The 70-gene signature (70-GS; MammaPrint) is the most commonly used gene expression profile in the Netherlands.8 The aim of the present study is to evaluate the impact of the 70-GS on the administration of adjuvant CT at a nationwide level in a subgroup of patients for whom the additional value of CT is debated and national guidelines suggest the use of a gene expression profile.

Materials and Methods

Data on patient, tumor, and treatment characteristics were obtained from the Netherlands Cancer Registry (NCR). The NCR is a nationwide database managed by the Netherlands Comprehensive Cancer Organization (IKNL) that prospectively registers clinicopathological and treatment characteristics of all cancer patients treated in the Netherlands since 1989. The NCR started registering the use of gene expression profiles in 2011. Between February 2007 and July 2011, the 70-GS became available in the Netherlands and was offered to patients enrolled in the Microarray in Node-Negative Disease May Avoid Chemotherapy (MINDACT) trial.9 In this trial patients were randomized between administration of adjuvant CT based on the gene signature or conventional prognostic clinicopathological factors.

Since accrual of the MINDACT trial ended, the 70-GS is increasingly used as an adjunct to conventional clinicopathological factors for clinical decision-making. It is the most widely used gene-expression profile in the Netherlands. Although OncotypeDX recently has become commercially available in the Netherlands, this gene-expression profile was not frequently used during the study period, and its use was therefore not taken into account.

Study population

Female patients with primary breast cancer who were older than 17 years of age, surgically treated between November 2011 and October 2013, and had no prior history of malignancy, neoadjuvant treatment, or distant metastasis upon diagnosis were identified in the NCR. According to the Dutch national guidelines, adjuvant CT should be administered to all lymph node–positive patients (≥N1a) and to patients without lymph node involvement but with unfavorable clinicopathological tumor features (grade III tumors >1 cm, any tumor >2 cm, or HER2 + tumors) as well as to patients younger than 35 years of age. The current Dutch guideline advises against the administration of CT in patients who do not fulfill the aforementioned criteria, in patients more than 70 years of age, and in patients with grade I tumors smaller than 2 cm.

According to the Dutch guidelines, “Validated gene expression profiles may be used in individual cases with a hormone receptor–sensitive invasive ductal carcinoma, if there is doubt about the indication for adjuvant chemotherapy on the basis of traditional prognostic factors.”7 In line with this guideline we identified three groups of patients, all younger than age 70 years with ER-positive and HER2-negative invasive ductal carcinoma, where controversy exists regarding the administration of adjuvant CT: N0, grade I, >2 cm (group A); N0, grade II, >1 cm (group B); and N1mi, grade I/II (group C).

Statistical analysis

Frequencies of baseline characteristics were compared between patients who received a 70-GS versus patients who did not receive the test within the indicated area for 70-GS use, i.e., in whom controversy existed regarding the benefit of adjuvant CT, using a χ2 test for differences in categorical data. For normally distributed continuous variables (age and size), means were calculated and a t-test was performed. Concordance of the test result with the administration of CT in the overall study population and the aforementioned subgroups was assessed by dividing the sum of patients with a low-risk test result in whom adjuvant CT was omitted and patients with a high-risk test result who received adjuvant CT by all patients with a known test result.

For the whole group and the three categories of patients, the proportions receiving CT were assessed in relation to 70-GS use, and adherence to the low-risk or high-risk test result was evaluated. To provide the most valid results using observational data taking patient clustering in hospitals into account and to address potential “confounding by indication,” linear mixed-effect regression models and instrumental variable (IV) analyses were used.

Linear mixed-effects regression analysis. The association between 70-GS use and the administration of adjuvant CT was assessed in a linear mixed model adjusting for possible observed confounders, taking into account patient clustering within hospitals. For this, we included both a random intercept per hospital (thus taking baseline differences in CT use among hospitals into account) and a random 70-GS slope per hospital (thus taking potential differences in the effect of the 70-GS on CT administration between hospitals into account). We adjusted for age and tumor size (continuously), and for grade, axillary status, and incidence year (categorically). Linearity of the relation between the continuous variables and CT use was inspected using a LOWESS smoother and was concluded to be linear for both age and size (the latter after truncation of 0.5% of the data above 5 cm). Because the dependent variable (CT) was coded as 0 (no) and 1 (yes), the results of the linear mixed-effects analyses are on the risk difference scale, i.e., showing absolute differences in proportion of adjuvant CT use. The proportions of absolute differences were multiplied by 100 to present the results of these linear mixed-effects models as percentages in absolute risk differences. Besides investigating the association between 70-GS use on the administration of adjuvant CT in the overall study population, we assessed the effect of 70-GS within the aforementioned subgroups of patients (A, B, and C) and age categories (<50 years, 50–59 years, and 60–69 years). To test for differences in the association between 70-GS and CT use between these subgroups, we calculated P values for interaction by including interaction terms to the models.

IV analyses. In an attempt to optimally control confounding by indication and further assess the validity of the linear mixed-effect models, we performed IV analyses (two-stage least square (2SLS) using ordinary linear regression). Confounding by indication is a well-known phenomenon in studies with observational data, and it may be only partly resolved by multivariable regression analyses, such as mentioned above, because these methods cannot adjust for unmeasured confounders. An IV may serve as a substitute for randomization (pseudo-randomization) in non-randomized studies under the assumption that the IV (i) is strongly associated with the exposure (in our case 70-GS use), (ii) is unrelated to confounders, and (iii) has no direct association with outcome (the administration of adjuvant CT).10 We considered percentage of 70-GS use within the indicated area per hospital per year (IV 1), 70-GS use in the previous patient within the indicated area treated in the same hospital (IV 2), and a combination of both in the first stage of 2SLS analyses as IVs. All are measures for hospital 70-GS preference. First, the association of these IVs with 70-GS use, the administration of adjuvant CT, or possible confounders was assessed by univariable logistic regression analysis. We considered using incidence year as an IV, however, incidence year was strongly related with CT use, also in patients treated in hospitals were the 70-GS was never used, and therefore failed the third assumption.

We present the results of the IV analyses with and without adjustment for the same potential confounders as in the aforementioned linear mixed-effects models (these were then included in both stages of the 2SLS approach). The results of the 2SLS analysis can be interpreted as the absolute change in percent CT use due to 70-GS use.

All P values and 95% confidence intervals (CIs) for the linear mixed-effects and the 2SLS analyses were based on 5,000-fold bootstrap resampling. A P value < 0.05 was considered statistically significant. All analyses were performed in R (version 3.1.3 for Mac OS) using the lme4 package (version 1.1–7) for the linear mixed-effects models.

Results

Between November 2011 and October 2013, 2,043 women with primary breast cancer without prior malignancy or having received neoadjuvant treatment, within the indicated area for 70-GS use were surgically treated in the Netherlands. Of these patients, 298 (15%) actually received 70-GS. Patients who received 70-GS were younger, had more limited axillary involvement, and suffered from smaller tumors compared with patients within the indicated area who did not receive 70-GS ( Table 1 ). Furthermore, an increase in 70-GS use over time was observed.

Table 1 Patient and tumor characteristics according to 70-gene signature use in 2,043 patients within the indicated area for 70-GS use (all younger than 70 years of age with ER+/HER2-, invasive ductal carcinoma)

Adherence to test result

The majority of patients were assigned to a low-risk test result (64%), and high adherence rates to the 70-GS test result in the administration of adjuvant CT were observed in the overall study population (86%). In the predefined guideline delineated subgroups A, B, and C, the majority of the patients were assigned to the low-risk group by the 70-GS (83, 63, and 63%, respectively), and the administration of CT was in line with the 70-GS result in 89, 86 and 83% of the patients in subgroups A, B, and C, respectively ( Table 2 ). The 70-GS did not affect the administration of adjuvant endocrine therapy within these subgroups.

Table 2 Administration of adjuvant CT and ET in patients within the indicated area for 70-GS use who received the 70-GS, i.e., in whom controversy exists regarding the administration of adjuvant CT based on conventional prognostic factors

Linear mixed-effects regression analysis

In the group of patients where the 70-GS was considered indicated according to the national guideline (n = 2,043), use of the 70-GS was associated with a 9.5% (95% CI: −15.7 to −3.3; P = 0.003) absolute reduction in the administration of CT after adjustment for measured confounders in linear mixed-effect model analyses ( Table 3 ). A younger age, intermediate invasive tumor grade, and micro metastatic involvement of the axillary lymph nodes showed a strong positive association with the administration of adjuvant CT (Supplementary Table S1 online).

Table 3 The effect of 70-GS use on the administration of adjuvant CT as observed with linear mixed-effect modeling, in the overall study population (n = 2,043), guideline- and age-delineated subgroups of patients within the indicated area for 70-GS use, with and without adjustment for potential confounders

Subgroup analyses revealed a significant reduction in the administration of adjuvant CT after 70-GS use in subgroups A and C (−24.0%; 95% CI: −46.5 to −1.6; P = 0.035 and −16.0%; 95% CI: −28.0 to −4.0; P = 0.009, respectively) but no significant reduction in subgroup B (−5.8%; 95% CI: −13.0 to 1.3; P = 0.11). Although the association between 70-GS use and the administration of adjuvant CT is lower in subgroup B than in subgroup A, this difference in association was not significant (P for interaction 70-GS*group B: 0.122; Figure 1 ).

Figure 1
figure 1

Absolute risk of receiving adjuvant CT according to the (fully adjusted) linear mixed model in guideline-delineated subgroups of patients within the indicated area for 70-GS use. All patients younger than 70 years of age with ER+/HER2- invasive ductal carcinoma (subgroup A: N0, BRI, >2 cm; subgroup B: N0, BRII, >1 cm; subgroup C: N1mi, BRI/II). CT, chemotherapy; 70-GS, 70-gene signature. *Calculated using the fully adjusted linear mixed model; age, tumor size, invasive tumor grade, axillary involvement, and incidence year were included as covariables in the model. Projected values are the percent chemotherapy use for those without and with the 70-GS as derived from the linear mixed models’ regression equations filled in at the mean of all other covariables for the entire (sub-)group (i.e., the projected difference between 70-GS use and non-use is adjusted for confounding). **P values for interaction between 70-GS use and the subgroups.

The strongest relation between 70-GS use and the administration of adjuvant CT was seen in younger patients. In patients younger than 50 years of age, a 26.9% reduction in the administration of CT was observed in patients who received 70-GS (95% CI: −38.3 to −15.6; P < 0.001). In patients between 50 and 59 years of age, 70-GS use led to a 9.2% reduction in the administration of adjuvant CT; however, this was not significant (95% CI: −18.7 to 0.3; P = 0.058; Table 3 ). In the older age category a reverse relation is observed: 70-GS use resulted in a 3.5% (95% CI: −5.9 to 12.9; P = 0.48) increase in the administration of adjuvant CT (P for interaction 70-GS* 60–69 year of age: <0.001; see Supplementary Figure S1 online).

IV analysis

To assess the validity of the two IVs as an instrument in this study, we confirmed that both percentage of 70-GS use within the indicated area per hospital per year (IV1) and use of the 70-GS in the previous patient within the indicated area (IV2) had a strong positive association with 70-GS use. The 70-GS was used in 5.6% of the patients within the indicated area in hospitals that less frequently used 70-GS (0–25% 70-GS use within the indicated area per year), which was 47.9% for patients treated in hospitals who used 70-GS in >25% of the patients within the indicated area (odds ratio: 14). If the 70-GS was used in the previous patient within the indicated area, then 33.6% of the subsequently treated patients also received a 70-GS compared with 12.2% if the prior patient did not receive a 70-GS (odds ratio: 18). Combining IV1 and IV2 in the first stage of the 2SLS regression method showed the strongest association with 70-GS use (5.2 vs. 45.3%; odds ratio: 22; comparing the best scoring 25% of patients for this IV vs. the rest). Both IV1 and IV2 were generally less associated with potential confounders (age at diagnosis and tumor characteristics) than actual 70-GS use. However, incidence year was not equally distributed between levels of the three IVs (Supplementary Table S2 online).

The results of the IV analyses are shown in Table 4 , with and without adjustment for potential confounders. The analyses of both IV1 and IV2 revealed a significant reduction in the administration of adjuvant CT in patients who received a 70-GS (−27.6%; 95% CI: −54.5 to −0.6%; P = 0.045 and −10.2%; 95% CI: −19.6 to −0.8; P = 0.033, for IV 1 and 2, respectively). The results of the IV analysis with the combined IVs were similar to the results of the adjusted linear mixed-effects regression model (−9.9%; 95% CI: −19.3 to −0.4; P = 0.040).

Table 4 The effect of 70-GS use on the administration of adjuvant CT in 2,043 patients within the indicated area for 70-GS use as derived from different instrumental variable (IV) analyses (two-stage least-square regression analysis)

Discussion

In this nationwide study, patients in whom the 70-GS was used received 10% less adjuvant CT compared with patients who did not receive the 70-GS in a cohort of Dutch breast cancer patients for whom controversy exists regarding the benefit of CT based on clinicopathological characteristics. Furthermore, in this selection of patients, compliance with the test result was high.

We observed a significant and clinically relevant decrease in the administration of adjuvant CT after use of the 70-GS in patients with an uncertain indication for adjuvant CT based on conventional prognostic factors after correction of all measured confounders. Currently, there is only limited evidence concerning the impact of the 70-GS on the administration of CT. Various studies have reported the impact of gene expression profiles within cohorts of patients who received the test.11,12 A theoretical change in adjuvant systemic treatment decisions was reported in ~30% of patients after use of the 70-GS, resulting in a more tailored adjuvant systemic treatment plan for the individual patient.13 Current evidence is based on a few impact studies with relatively small numbers of patients, predominantly describing a hypothesized decrease in the administration of adjuvant CT after 70-GS use in early breast cancer patients.12,14 Similar results were reported regarding the impact of the 21-gene Recurrence Score (OncotypeDx) on the administration of adjuvant CT.15,16,17 In addition, high compliance rates with the test results in patients with early-stage ER+, node-negative breast cancer have been reported.18 Yet, to our knowledge, there are no nationwide studies assessing the impact on the administration of adjuvant CT for either OncotypeDx or the 70-GS.

Recently, Cusumano et al. conducted a European inter-institutional impact study of the 70-GS (n = 194). Cases were presented to clinician panels in four different countries asking for an adjuvant treatment advice, first without and then with knowledge of the 70-gene test result. In the subset of ER-positive and HER2Neu-negative patients (n = 100), a 21% (73 vs. 52%) expected absolute decrease in the administration of adjuvant CT was reported when the 70-GS result was taken into account by a Dutch multidisciplinary team.19 Our results are in line with this report, although we saw a smaller decrease in the administration of adjuvant CT when the 70-GS was used. This difference may be explained by differences in the design of the present study in which patients were categorized by a combination of hormone status, HER2-status, tumor size, differentiation grade, and lymph node involvement. In addition, the smaller proportion of patients receiving adjuvant CT in the control group in our study (44 vs. 73%) illustrates the different national attitudes toward the administration of adjuvant CT in the absence of a gene signature test. In fact, the relative reduction of administered adjuvant CT was comparable between our population-based study and Cusumano’s questionnaire study.

Approximately 45% of patients in whom controversy exists regarding the administration of adjuvant CT received adjuvant CT without use of the 70-GS. This reflects the current controversy regarding the administration of adjuvant CT in these patients. High compliance rates with the 70-GS test result were seen demonstrating the propensity to adhere to a reproducible advice in this group of patients. The 70-GS assigned the majority of these patients to the low risk category (83% in group A, 63% in group B, and 63% in group C). In particular, in group B, consisting of patients with grade II breast cancer, the proportion of low-risk test results was higher than reported in previous studies assigning ~50% to the high risk category and 50% to the low risk category.20 Especially in subgroup A (N0, BRI, >2 cm) and subgroup C (N1mi, BRI/II), 70-GS use was associated with less administered CT, whereas in subgroup B (N0, BRII, >1 cm) only a non-significant trend was observed. Nevertheless, the P values for interaction between the subgroups indicate that the effect of 70-GS use on the administration of CT does not significantly differ between the various subgroups. Presumably, also considering the prevalence of genomic high-risk patients, the current sample size of subgroup B is too small to reach significance.

This is the first nationwide study to report the independent association between the use of the 70-GS and the administration of adjuvant CT in patients for whom controversy exists according to the national guidelines, i.e., patients with ER-positive, HER2-negative invasive ductal tumors of low or intermediate malignancy grade in the absence of overt lymph node metastases. This was supported by linear mixed-effect modeling, which has as its main advantage that no independence is assumed among observations, allowing correlated observations within a unit or cluster. Because hospital preference or even clinician preference plays an important role in 70-GS use and the administration of adjuvant CT, patient clustering might have influenced the results. Therefore, this statistical approach is ideally suited for assessment of the association between 70-GS use and the administration of adjuvant CT and makes the observed association more valid. In this study, other factors also influenced the decision to administer CT in the subset of patients where CT was considered controversial. In particular, the effect of age was remarkable. The high proportion of young women who received adjuvant CT in the controversy group reflects an understandably more aggressive attitude in younger women. Nonetheless, although the overwhelming majority of younger women received adjuvant CT when 70-GS was not used (83%; Supplementary Figure S2 online), 53% of these young women received adjuvant CT when 70-GS was used, resulting in the largest relative and absolute reduction of administered CT. By contrast, in the older age category (60–69 years of age), 70-GS use was associated with an increased administration of adjuvant CT (Supplementary Figure S2). Although this effect in the older age category is not statistically significant in the linear mixed-effect analyses, the P value for interaction (P < 0.001) implies that the effect of 70-GS use on the administration of CT in older patients is different from those in patients younger than 50 years of age. These findings might indicate that the physician’s intention for 70-GS use is associated with age, but the retrospective observational design of the current study precludes detailed analyses of these findings.

Confounding by indication is a well-known phenomenon in studies using observational data. In an attempt to control this form of bias, we performed additional IV analyses.10 An IV is an external factor that influences outcome (in our case the administration of CT) exclusively through its effect on exposure (70-GS use), unrelated to potential confounders. We decided to use facility-prescribing patterns as IVs, an approach adopted previously by others.20 In our case, a high-quality IV should be strongly related to 70-GS use, which was the case when combining both IVs. Furthermore, a robust IV should not be associated with other confounders of the administration of CT. Because confounders can be measured or unmeasured, it is impossible to assess the direct relation between an IV and all possible confounders. Therefore, most authors using IVs report on the relation between IVs and measured confounders (for example, patient characteristics) under the assumption that if the IV is not related to a measured confounder, it likely is also not related to unmeasured confounders.21 In our case, all IVs (IV1: 70-GS use in the previous patient within the indicated area; IV2: proportion of 70-GS use per hospital within the indicated area) and a combination of both resulted in a better balance for measured confounders compared with actual 70-GS use (Supplementary Table S2 online), but a disbalance remained for the variable incidence year. Therefore, we also adjusted the IV effects for the measured potential confounders, and the combined IV analysis resulted in a strikingly similar absolute risk reduction for the administration of CT after 70-GS as observed in the standard linear mixed-effect regression analyses.

The prognostic value of the 70-GS has been validated in multiple retrospective and one prospective patient series.8,13,22,23,24,25 Evidence supporting the predictive value of the 70-GS on CT benefit is limited; therefore, its clinical utility mainly lies in its prognostic capacity.26,27,28 Currently, the MINDACT trial is being conducted and the first results are expected by the end of this calendar year, which will further assess the predictive capacity of the 70-GS in a randomized prospective setting.9 In the present study, i.e., in patients who were treated within the indicated area for gene expression profiles characterized as having a fairly good prognosis, it was the prognostic value of the 70-GS that was used to discern a group of patients with such good outcome that no substantial benefit was to be expected of adjuvant CT. The observed effect in the present study is not attributable to a hitherto unproven predictive value of the 70-GS.

In conclusion, in Dutch early-stage breast cancer patients with a dubious indication for adjuvant CT based on clinicopathological factors, 70-GS use is associated with a 10% decrease in the administration of adjuvant CT.

Disclosure

The authors declare no conflict of interest.