Main

In non-metastatic breast cancer, knowledge of HER2 status at diagnosis is recommended to determine the adjuvant therapy strategy. It is therefore necessary to have standardized and validated procedures for this evaluation before making any therapeutic decision.

For the HER2 status determination, core-needle biopsies may be less reliable than surgical specimens due to the smaller volume of the tissue sample, the possible sampling error on a tumor with a heterogeneous distribution of the antigens within the tumor, and potential crush/edge artifacts in the core-needle biopsies. However, counter arguments suggest that core-needle biopsies may be better fixed than lumpectomies and should be used to assess the patient's biomarker status. Furthermore, HER2 status is usually homogenous (<3% of heterogeneous tumors)1 and there is a concrete clinical need for testing core-needle biopsies insomuch as neoadjuvant therapy strategies (and then HER2 status) may be determined for both operable and non-operable breast cancers.

Regarding the available guidelines, ASCO/CAP (American Society of Clinical Oncology/College of American Pathologists) recommends that cores entirely involved by retraction artifacts or crush artifacts should not be used as a sample to perform/interpret HER2 immunohistochemistry2, 3 and UK recommendations which precise that observers should be aware of the range of common artifacts, including edge artifacts, which can be problematic in small biopsy samples.4 French guidelines authorize testing on core biopsies5 and Canadian recommendations do not specifically discuss core biopsy processing/testing.6

Recently, bright-field in-situ hybridization techniques such as chromogenic in-situ hybridization (CISH) and silver-enhanced in situ hybridization (SISH), which combine features of immunohistochemical analysis and in-situ hybridization (ISH), have been introduced for the determination of HER2 status. These new techniques allow results to be visualized by standard bright-field microscopy, and signals do not decay over time.

In most studies, a high correlation (>90%) was shown between the first available procedures designed to evaluate HER2 status (immunohistochemistry, fluorescence in-situ hybridization (FISH), considered as the gold standard, and CISH, first described in 2000)7 but concordance between these methods might depend, at least in part, on centers (methodology, instrumentation and experience of the laboratories performing the testing), particularly for immunohistochemistry and FISH.8, 9, 10, 11

SISH was implemented more recently and three studies reported a 97, 96 and 87% concordance between SISH and FISH results on excision samples12, 13, 14 taking into account that discordant cases could be partly explained by intratumor heterogeneity of amplification. One study showed an 89% concordance of SISH results between surgical specimens and core-needle biopsies but the number of studied cases was limited (n=56).15

Since FISH on surgical specimens is considered as the gold standard for her2 gene status evaluation in breast cancer in the ASCO/CAP guidelines,2, 3 concordance with other available HER2 tests (immunohistochemistry, SISH and CISH), on core-needle biopsies, is essential because the results could be used for patient's management, and will be the only material available in the neoadjuvant setting.

Materials and methods

Study Design

This French multicenter, cross-sectional, histopathological study was based on the diagnostic core-needle biopsies and surgical specimens, in women suffering from a previously untreated early-stage breast cancer.

Twenty histopathological laboratories (10 cancer centers, 5 university hospitals and 5 private laboratories) involved in breast cancer diagnosis participated in the study. They used immunohistochemistry and were trained to perform CISH assay. There were also four reference centers (2 cancer centers, 1 university hospital and 1 private laboratory) experienced in FISH and CISH and in the analysis of discordant cases. Among these 24 centers, 18 performed SISH assays. Participating centers had to recruit patients suffering, at first diagnosis, from non-metastatic, invasive breast carcinoma that had not been treated before surgical removal, and whose core-needle biopsies and surgical specimens were available at the center. Samples had to contain sufficient invasive tumor material to perform the different study assays, and histological tissues were fixed, depending of the structure's procedures, by neutral buffered formalin, alcohol-formalin-acetic acid, Hollande's fixative or alcohol-formalin.

It was decided to make an a priori selection of the patients, to reach significance in the statistical analysis taking into account the estimation that only 15% of patients with early breast cancer have an HER2-positive status. Patient screening was based on immunohistochemistry results obtained on surgical specimens and each center had to recruit 5–6 patients scored 3+, 2 patients scored 2+ and 3 patients scored 0/1+, by immunohistochemistry. The retrospective deadline for the oldest selected cases was fixed at 1 January 2003. If there were insufficient retrospective cases, prospective cases could be included. Collected data were patient and tumor characteristics, HER2 evaluation method (fixative procedures, type of autostainers, antigens retrieval methods, types of antibodies, with the interpretation guideline used for immunohistochemistry; CISH, SISH or FISH procedures for hybridization techniques) according to the type of specimen (core-needle biopsy or surgical specimens), and results.

In accordance with French legislation regarding non-interventional studies, the study protocol was approved by the ‘Comité Consultatif sur le Traitement de l’Information en Matière de Recherche dans le Domaine de la Santé’ (Consultative Committee on Information Processing for Research in the Field of Health) and validated by the ‘Commission Nationale de l’Informatique et des Libertés’ (Independent administrative authority protecting privacy and personal data), which guarantees subject confidentiality.

Study Objectives

The primary study objective was to assess the concordance of HER2 status determined by immunohistochemistry, CISH and SISH, performed on core-needle biopsies, with FISH results obtained on surgical specimens. The scoring was performed according to different guidelines used: test performed in routine practice by the participant centers (Herceptest (pathologists’ scoring); cutoff of 10% for 3+ category16), ASCO/CAP criteria (cutoff of 30% for 3+ category2), and GEFPICS (Groupe d’Etude des Facteurs Pronostiques par Immunohistochimie dans le Cancer du Sein) criteria (cutoff of 60% of positive cells intermediate intensity for 2+ category8). The secondary objectives of the study were to assess the concordance of HER2 status assessed on core-needle biopsies with that determined on surgical specimens (by immunohistochemistry, SISH, CISH and FISH), and to describe the clinical and histopathological characteristics of patients and tumors.

Statistical Methods

In all, 220 enrolled patients were needed to obtain 196 evaluable cases with the hypothesis of 15% discordant cases between CISH results on core-needle biopsies and FISH results on surgical specimens, using a two-sided 95% confidence interval with 5% precision.

The percentages of discordant cases between immunohistochemistry, CISH and SISH on preoperative specimens, and FISH on surgical specimens were described with their 95% confidence intervals as well as sensitivity, specificity, false positive and negative rates, and positive and negative predictive values. Concordance results for immunohistochemistry were compared between different interpretation guidelines, using coefficient of correlation κ, as well as the correlation results with CISH and SISH. The same statistical methods were used for concordance between results of each method according to the type of tested sample (core-needle biopsies vs surgical specimens). Clinical and pathological characteristics of patients were described in the overall studied population.

All tests were two-sided with an α risk at 5%. Statistical analyses were performed using SAS Version 8.2 (SAS Institute, Cary, NC, USA).

Results

Analysis Population

Breast tumor specimens were tested between 14 April 2003 and 27 August 2007. The analysis population is described in Figure 1. In all, 261 patients were enrolled by 24 centers including 4 referent laboratories and 20 local centers; all the 20 local centers (83%) recruited the 11 expected patients. Among the 260 analyzed patients (1 exclusion for previous neoadjuvant treatment), 193 had also an SISH evaluation of their tumor, performed by 18 centers equipped with a Benchmark immunostainer (Ventana, Tucson, USA) according to the manufacturer guidelines.

Figure 1
figure 1

Study populations and data sets analyzed according to performed tests. SS, surgical specimen; CNB, core-needle biopsy; IHC, immunohistochemistry; FISH, fluorescence in-situ hybridization; CISH, chromogenic in-situ hybridization; SISH, silver in-situ hybridization. (1) As judged by referent center, surgical specimen was not sufficient for FISH in 12 cases. (2) As judged by referent center, preoperative biopsy specimen was not sufficient for FISH in 9 cases.

Patient and Tumor Characteristics

Women were aged 58±13 years at inclusion and the median time between the first diagnosis (based on the results of immunohistochemistry performed on preoperative biopsy specimen) and inclusion was 32 days.

On the basis of pretreatment and post-surgical histopathological analyses, main tumor characteristics were as follows: invasive ductal carcinoma (90% of cases), median tumor size (18 mm (range: 3–80)), (p)TNM stage (T1 (60%) or T2 (35%), pN0 (61%)), SBR (Scarff Bloom and Richardson) grade (I 13%, II 43%, III 45%), hormonal status (ER+ (positive estrogen receptor) or PR+ (positive progesterone receptor) 73%, ER+ and PR+ 49%).

Technical Evaluation Method of HER2 Status

Fixation of Specimens

Whatever the test performed (immunohistochemistry, CISH, FISH or SISH), analyzed core-needle biopsies specimens were mostly fixed with a formalin-based fixative (neutral buffered formalin in 47–54% or Hollande's fixative in 4 and 6%, depending on the test), and less frequently with alcohol-based fixative (alcohol-formalin-acetic acid in 34–44% or alcohol-formalin in 4–6%, depending on the test).

In contrast, surgical specimens were mostly fixed with alcohol-based fixative (alcohol-formalin-acetic acid in 52–54% or alcohol-formalin in 4–6%, depending of the test) and less frequently with formalin-based fixative (neutral buffered formalin in 35–40% or Hollande's fixative in 4–6%, depending on the test).

Evaluation of HER2 Status with Immunohistochemistry and/or FISH

Whatever the type of sample (core-needle biopsies or surgical specimens), the applied method was fully automated in 96% of cases and partially (not the deparaffinization, nor the antigen retrieval) in 69%. Slides were pretreated by using polyclonal A0485 anti-HER2 primary antibody (Dako, Glostrup, Denmark) in 75% of samples; 4B5 (Ventana) and monoclonal CB11 (Novocastra, Newcastel, UK) antibodies were used less frequently (<20 and <10%, respectively).

Positive (3+) HER2 status was observed in 50–51% of cases and negative (0/1+) results were observed in 29–43% of cases, depending on the immunohistochemistry interpretation guidelines used and the type of specimen (Figure 2). The rate of equivocal cases (2+) ranged between 7 and 20% depending on the guideline used but it was lower for results observed on core-needle biopsies and surgical specimens when evaluated according to GEFPICS guidelines (7 and 8%, respectively). Most often, FISH could be used to determine any immunohistochemistry equivocal cases (2+), with <3% remaining doubtful (Figure 3).

Figure 2
figure 2

HER2 status according to immunohistochemistry results on preoperative biopsies and surgical specimens (n=260). ASCO, American Society of Clinical Oncology; GEFPICS, Groupe d’Etude des Facteurs Pronostiques par Immunohistochimie dans le Cancer du Sein.

Figure 3
figure 3

HER2 status of equivocal immunohistochemistry results (2+) according to FISH on surgical specimens. ASCO, American Society of Clinical Oncology; GEFPICS, Groupe d’Etude des Facteurs Pronostiques par Immunohistochimie dans le Cancer du Sein.

Technical Failures

Technical failure occurred in 6 and 5% of FISH tests, 5 and 3% of CISH tests, and 8 and 5% of SISH tests performed respectively on core-needle biopsies and surgical specimens.

Concordance of HER2 Status Determined by Immunohistochemistry, CISH and SISH (on Core-Needle Biopsies) with FISH Results (on Surgical Specimens)

Whatever the immunohistochemistry interpretation guideline applied, excellent concordance (κ: 0.92–0.97) was shown between immunohistochemistry on core-needle biopsies and FISH on surgical specimens (discordance rates: 2–4%) (Table 1). Specificity (97–98%) and sensitivity values (95–99%) of immunohistochemistry/FISH correlations did not differ significantly according to the immunohistochemistry interpretation guidelines used. Sensitivity was particularly high when results were determined by pathologists’ scoring and using ASCO guidelines (sensitivity: 99 and 98%, respectively). Although lower, the sensitivity of immunohistochemistry results interpreted using GEFPICS guidelines (sensitivity: 95%) were mainly related to higher false negative results (7%) but did not differ significantly from immunohistochemistry results for sensitivity using the other guidelines.

Table 1 Concordance of HER2 status determined by IHC, CISH and SISH (on core-needle biopsy specimens) with FISH results (on surgical specimens)

Regarding ISH methods on core-needle biopsies, CISH and SISH showed a strong correlation with FISH on surgical specimens (κ: 0.96 and 0.94, respectively), even if a lower false negative rate was observed for CISH than for SISH results (2 vs 5%). The characteristics of CISH/FISH and SISH/FISH correlations did not differ significantly, both methods being highly sensitive and specific in determining HER2 status on core-needle biopsies (sensitivity: 99 and 96%—specificity: both 98%).

Concordance of HER2 Status Determined by Each Test (Immunohistochemistry, FISH, CISH and SISH) on Core-Needle Biopsies and Surgical Specimens

Whatever the test used, excellent concordance was shown between core-needle biopsies and surgical specimens (κ 0.97 and discordance rates between 1 and 2%) (Table 2).

Table 2 Concordance of HER2 status determined by each test (IHC, FISH, CISH and SISH) on core-needle biopsy specimens and surgical specimens

For immunohistochemistry, although the discordance rate between core-needle biopsies and surgical specimens was higher for results interpreted using GEFPICS guidelines (2%), it did not differ significantly from discordance rates interpreted using pathologists’ scoring and ASCO guidelines (both: 1%). Sensitivity results were similar between immunohistochemistry interpretation guidelines (98% for GEFPICS and 99% for pathologists’ scoring and ASCO) and there was no significant difference in specificity results (specificity: 100% for pathologists’ scoring and ASCO, 98% for GEFPICS).

For FISH, CISH and SISH, discordance rates were <1% and did not differ significantly. Correlations between core-needle biopsies and surgical specimens also showed excellent sensitivity values (99% for all three methods, always related to false negative rates: 1, 2 and 2%, respectively) and 100% specificity with no significant difference observed between these correlations.

Discussion

Several studies compared HER2 status by immunohistochemistry and FISH on breast core-needle and excisional biopsies17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27 and showed generally a very good concordance between these samplings (average 90%), suggesting that core-needle biopsies can be used with confidence for HER2 status determination. However, most of these studies assessed only a small number of samples with a limited statistical power to detect discordances, leading to considerable differences between their results (range of concordance rate: 64–100%). More recent studies28, 29, 30 assessing higher samples (n=500, 332 and 225 paired samples of core-needle biopsies and subsequent surgical specimens, respectively) showed excellent concordance rates (90, 99 and 89%, respectively). This is confirmed in our hands. Thus, when comparing the results of each test on core-needle biopsies to these obtained on surgical specimens, no significant difference was observed between discordance rates, sensitivity, specificity, false positive and false negative rates for immunohistochemistry, irrespective of the guidelines used, or for ISH methods (FISH, CISH and SISH). Whatever the test performed, concordance between core-needle biopsies and surgical specimens on the same tissue sample remained excellent, κ coefficients of correlation ranging from 0.97 (for immunohistochemistry interpreted using GEFPICS guidelines) to 0.99 (for immunohistochemistry interpreted using pathologists’ scoring and ASCO guidelines or FISH, and SISH). These results, in accordance with previous findings,19, 20, 24, 31, 32 suggest that intratumoral heterogeneity of HER2 is not a significant confounding factor when analyzing small sized samples, if ISH is used. In spite of excellent concordance between core-needle biopsies and surgical specimens, some pathologists retest for HER2 status on surgical specimens in case of rare heterogeneous cases.32

Furthermore, we have shown that the non-fluorescent immunohistochemistry techniques on core-needle biopsies can generally be used with confidence, an excellent concordance rate between immunohistochemistry on core-needle biopsies and FISH on surgical specimens (κ: 0.92–0.97) being observed whatever the immunohistochemistry interpretation guideline used. Likewise, CISH and SISH on core-needle biopsies showed a strong correlation with FISH on surgical specimens (κ: 0.96 and 0.94, respectively). These results are consistent with previous data available on surgical specimens.12, 13, 14, 33

In conclusion and based on our results, immunohistochemistry status assessed on cores biopsy is highly concordant with FISH in cores and surgical specimens. Furthermore, CISH and/or SISH even represents a safe alternative method to determine HER2 status on these cores biopsies to confirm any ambiguous immunohistochemistry results (2+) or to perform HER2 status screening, if ISH is used as first-line screening method. CISH or SISH may also be used for calibration or quality controls of immunohistochemistry, either on cores biopsies or on surgical specimens.