INTRODUCTION

Breast cancer results from molecular alterations that are genetically and/or environmentally induced resulting in uncontrolled cell proliferation. One of the significant molecular alternations noted is amplification of the HER-2/neu gene that occurs in 20–30% of cases. This marker has been extensively studied and it has been shown that HER2 is a significant prognostic marker in both node positive and node negative breast cancer patients. Equally important is the predictive role of this marker in identifying patients that will or will not respond to hormonal or some chemotherapeutic regimens. The significance of HER-2/neu as a prognostic or predictive marker in breast cancer has been addressed in two literature reviews (1, 2). The role of HER-2/neu has been extended with the release of Herceptin, a humanized antibody against HER-2/neu that is being used to treat breast cancer patients (3). The drug was released in the US in 1998, in Canada in the fall of 1999 and will be released in Europe in 2000.

In spite of the release of the drug there is no agreement as to the best method of assessing amplification and/or over-expression of HER-2/neu gene. There is also great variation in the published data vis-a-vis the scoring systems and the positive cut off points used (1, 2). HER-2/neu can be measured in different ways: 1) Oncoprotein over-expression: This could be performed by immunohistochemistry (IH) on either frozen or paraffin embedded tissue sections using monoclonal or polyclonal antibodies to the 185 kd protein or the phosphorylated product; by ELISA (using cytosol or serum) or by Western blot. 2) Gene amplification: This could be performed by Southern hybridization; by polymerase chain reaction (PCR) (competitive, differential or real time); or by fluorescence in situ hybridization (FISH). 3) mRNA by Northern blot.

The Herceptin efficacy phase 2 and 3 trials, utilized a cocktail of two antibodies (CB11 and D5) to assess HER-2/neu status by immunohistochemistry (CTA) (3). They considered the case positive when weak to moderate (2+) or strong staining (3+) was seen in 10% of cells in a membranous pattern. Dako produced a polyclonal antibody and compared the immunohistochemical results with their antibody to the CTA antibodies in a different subset of patients. There was 79% overall concordance between results obtained with the CTA and Dako antibodies using 580 breast cancer specimens that differed from the ones used in the phase 2 and 3 trials of Herceptin. Forty two percent of cases in the 2+ category were discordant with the CTA results. In addition, 168 cases were examined for gene amplification and protein over-expression and the results compared with the Dako HercepTest. This showed a positive agreement of 62% and a negative agreement of 100% indicating that there was a significant number of false positive cases (Dako Press Release). Penault-Llorca (4) et al. compared the results of HER-2/neu testing in 58 tumors using different methods, different fixatives and different antibodies: Tab250, 9G6, NCL CB11 and OA-11–854. Their results showed than on paraffin sections NCLCB11 and Tab250 offered a good balance between sensitivity and specificity although CB11 was better on frozen tissue and Tab250 was most consistent in tissue fixed in phosphate buffered formalin. They also found that alcoholic formalin appeared to be the best fixative for HER-2/neu immunodectection.

We undertook a study to establish a reproducible, sensitive and specific method for testing for HER-2/neu that could be used in diagnostic pathology laboratories. We compared IH detection of HER-2/neu protein with differential PCR to assess gene amplification. FISH was used as the gold standard to resolve discrepant cases. We selected PCR instead of FISH for comparison with IH since it is less subjective in defining the copy number and could be automated to study large numbers of cases. In addition, assessment of appropriate cut off points were established.

MATERIALS AND METHODS

Two hundred and fifty consecutive cases of in situ and invasive breast cancer were prospectively examined during 1998 in the Department of Pathology at Women’s College Campus of Sunnybrook and Women’s College Health Sciences Center in Toronto. These included 220 invasive and 30 in situ breast cancers. Thirteen of the invasive carcinomas were locally recurrent. Of the primary invasive cancers, 196 were infiltrating duct, 5 infiltrating lobular and 6 mucinous carcinomas. The nodal status of the 207 cases of primary invasive cancer was known. The tissue was fixed in 10% buffered formalin and usually processed within 24 hours.

Immunohistochemical assay

Immunohistochemical staining was performed on formalin fixed, paraffin embedded tissue sections of each case using the monoclonal antibody CB11 to the HER-2/neu oncoprotein (Nova Castra Laboratories, UK). After deparaffinization, endogenous peroxidase was blocked using 3% hydrogen peroxide. The primary antibody was applied at a 1:50 dilution for 2 hours followed by the Biotin-Streptavidin peroxidase complex. 3.3′ diaminobenzidine was used as the chromagen. The slides were counterstained with Harris’s hematoxylin.

The percentage of tumor cells showing either a complete or partial membranous pattern was scored in each case. The intensity of membrane staining was assessed as strong, moderate or weak. Strong membranous staining was easily seen at low magnification (Fig 1a). Weak staining was pale and not clearly visible at 400 × magnification (Fig 1b). Moderate staining was less intensive than strong staining but easily visible at 400 × magnification (Fig. 1c). Partial membrane staining was defined as incomplete positivity around the membrane of a cell (Fig. 1b).

FIGURE 1
figure 1

A, Invasive ductal carcinoma showing diffuse strong complete membranous staining for HER-2/neu (mag × 100). B, Invasive ductal carcinoma showing weak incomplete membranous staining for HER-2/neu (arrows) (mag × 400). C, Invasive ductal carcinoma showing variable intensity of membrane staining. Some cells show moderate staining and others showing weak staining for HER-2/neu (mag × 400).

PCR

We used a competitive PCR method to assess the average number of HER-2/neu gene copies, using asparagine synthase (AS) as a reference gene. A parallel 10 micron section of the same block was assessed for gene amplification using differential PCR. One hundred nanograms of target DNA was used in each determination. The HER-2/neu primers (neu1 5[prime]-CTC ACA ACC AAG TGA GGC AG-3[prime] and neu2 5[prime]-CAG GGG TGG TAT TGT TCA GC-3[prime]) were used at a concentration of 0.8 um. The control gene (AS) primers (AS1 5[prime]-ACA TTG AAG CAC TCC GCG AC-3[prime] and AS2 5[prime]-CCA CAT TGT CAT AGA GGG CG-3[prime]) were used at a concentration of 0.4 um. Differential PCR was performed on Perkin Elmer Cetus Thermal Cycler-480. Touchdown PCR technique was used to improve PCR specificity by decrementing the annealing temperature during initial cycles, starting at 60°C, and arriving at a “touchdown” temperature of 50°C. 20 μl of the PCR reaction solution was separated by electrophoresis on 4% agarose gel, stained with ethidium bromide, and imaged on a Vector Bio-system imaging system. The fluorescent density of resulting amplicon bands was assessed using the ImagequaNT Software Version 4.2. HER-2/neu gene amplification copy number was determined by comparing the ratio of HER2:AS amplicon density to the established calibration curve generated from similar ratios plotted against known HER-2/neu gene copy numbers in using breast cancer cell-line DNA.

A standard curve of HER-2/neu gene copy number against HER-2/neu:AS PCR amplicon ratios was generated using cell-lines, SKBR-3, T47-D and normal spleen DNA giving 10, 3 and 1 HER-2/neu copies respectively. Serial dilutions of DNA from SKBR-3 cell-line was performed to obtain additional data points.

For PCR the ratio of HER2:AS amplicon density of each case was recorded and interpreted as follows: ≥2 is amplified, less than 2 not amplified.

FISH

FISH analysis was performed on formalin fixed paraffin sections on 28 of cases which included amplified, not amplified and all the discrepant cases using the Vysis probe (Intermedico) according to the method described by Pauletti et al. (5) and according to the manufacturer’s specifications. The results were scored as follows: 2–4 copies = not amplified, ≥5 copies = amplified. In addition, the number of copies of chromosome 17 was evaluated to assess ploidy and the ratio of HER-2/neu gene copies to the number of chromosome 17 noted. A ratio ≥2 was considered amplified.

Thirteen randomly selected positive and negative concordant cases from this series were studied by FISH. Twenty previous cases of breast cancer that were concordant by PCR, IH and FISH were used as controls.

Statistical Analysis

Statistical analysis was performed using Fisher’s exact test. Regression analysis was carried out for comparison of the HER-2/neu protein overexpression by IH and amplification of the gene by differential PCR.

RESULTS

IH showed either complete membranous staining (Fig. 1a) partial membrane staining (Fig. 1b) and/or cytoplasmic staining. The intensity of membrane staining was either weak, moderate or strong (Fig. 1a, b, c). Linear correlation of results between PCR and IH showed that the best concordance was obtained using 10% complete membrane staining and moderate or strong intensity of staining as the cut off point for positivity for IH and ≥2 density score for PCR (Fig. 2). Although our graph shows that the cut point is 15% complete membrane staining we opted for 10% for practical reasons. Moderate to strong complete membrane staining was noted in 44/250 cases. Twelve cases with weak staining and/or partial membrane staining were considered negative. When partial membrane staining was present it was noted in less than 10% of tumor cells. Cytoplasmic staining alone did not correlate with gene amplification. Figure 3 illustrates cases positive and negative for HER-2/neu amplification.

FIGURE 2
figure 2

Relationship between IH and PCR for HER-2/neu detection in invasive breast cancer.

FIGURE 3
figure 3

PCR results of HER-2/neu amplification in positive and negative cases and controls. Agarose gel image showing HER-2/neu and AS amplified bands, 125 and 162 bp respectively. PCR reactions were done as described in Material and Method section. Samples were analyzed after 26(1), 28(2) and 30(3) cycles of amplification. Negative samples show both the HER-2/neu and AS bands while positive samples show predominance of the HER-2/neu band. Cases 1 and 2 are negative, whereas 3 and 4 are positive for HER-2/neu amplification.

The overall concordance between IH and PCR for all 250 cases (invasive and in situ breast carcinoma) was 94% (Table 1). Of the 235 cases that were concordant, 41 (16%) of cases were positive and 194 (78%) of cases were negative by both methods. The HER-2/neu status of 13/235 concordant cases (9 positive and 4 negative) were confirmed using FISH (Fig. 4).

TABLE 1 Results of HER2/Neu Testing by IH, PCR, and FISH
FIGURE 4
figure 4

Invasive ductal carcinoma showing clusters of HER-2/neu signals in tumor cell nuclei using FISH.

The overall concordance between IH and PCR of the 220 invasive breast cancer was 95%. Of the 209 concordant cases, 32 (14.5%) were IH and PCR positive whereas 177 (80.5%) were negative by both methods. Of the 11 discordant cases 8 (4%) were IH negative and PCR positive, and FISH was positive in 2 out of these 8 cases. All 3 cases that were IH positive/PCR negative were FISH positive. These results indicate that using FISH as the gold standard, 3/11 cases were false negative and 6 were false positive by PCR, whereas 2/11 cases were false negative by IH and none of the cases were false positive by IH.

The sensitivity of CB11 as compared with PCR in the invasive cancer was 80% and the specificity 98%. The positive predictive value was 91% and the negative predictive value was 96%.

Amplification and/or over-expression of HER-2/neu was associated with the invasive ductal carcinoma phenotype. None of the 5 invasive lobular carcinomas showed HER-2/neu amplification or over-expression of the protein whereas 1/6 mucinous carcinomas showed amplification and over-expression of HER-2/neu.

Twenty six of the 30 (87%) in situ cancers were concordant. Nine cases (30%) were concordant IH positive/PCR positive and 17 cases (57%) were concordant IH negative/PCR negative. In the 4 discordant cases FISH results indicated that IH was false negative in 2/3 and false positive in 1/1 case. PCR was false positive in 1/3 cases and there ware no false negatives (Table 1).

Of the 220 invasive cancers, 78 were node positive and 17 (22%) of these were HER-2/neu positive; 113 were node negative and 14 (12.5%) were HER-2/neu positive; in 29 cases axillary dissection was not performed and 5 (17%) were HER-2/neu positive (Table 2). Only 2/13 recurrent cases expressed HER-2/neu protein, one case was node negative and the other was node positive.

TABLE 2 HER2/neu Expression and Nodal Status in Invasive Breast Cancer

DISCUSSION

The interest in Herceptin, a humanized monoclonal antibody, stems from the results of randomized phase 3 trials showing that patients whose tumor cells have increased levels of HER-2/neu protein expression do significantly better when Herceptin is added to chemotherapy (3). In addition to its prognostic significance in breast cancer patients, HER-2/neu protein over-expression may gain greater values if current ongoing studies confirm its role in predicting response to hormonal and radiotherapy as well as the type of combination and/or dose of chemotherapy (6, 7, 8, 9). However, the utility of this biological marker can be enhanced and better evaluated if we can reach consensus on the appropriate methodology and the most clinically significant cut off points (10).

The principal methods for assessing HER-2/neu status are immunohistochemistry using antibodies to detect over-expression of the protein and molecular techniques specifically PCR or fluorescence in situ hybridization (FISH) to assess amplification of the gene. The marked variability in the specificity and sensitivity of different HER-2/neu antibodies (11) and the variability in interpreting the results (12, 13) even when the same antibody is used requires comprehensive standardization studies. In addition, use of frozen versus fixed tissue and the different fixatives used can influence the specificity and sensitivity of the antibodies used (4). According to Paulette et al. the gold standard of choice for HER-2/neu testing is IH on frozen material (5). However, since frozen material is not always available, they demonstrated that FISH is a reliable and reproducible method for testing for HER-2/neu (5).

The standardization of HER-2/neu testing and appropriate cut off points have to be achieved by both technical and clinical validation of the results. This point was well addressed by G. Allred and Swanson in a recent editorial (14). The CTA is the only clinically validated test in predicting response to Herceptin. More recently, the tumors tested by the CTA were assessed by FISH and there was 82% concordance between the two methods (15).

In our study, moderate or strong complete membranous staining was the significant pattern of HER-2/neu over-expression. Incomplete membrane staining occurred in 12 cases and was not associated with gene amplification. None of the cases showing cytoplasmic staining by IH were amplified by PCR. The significance of cytoplasmic staining seen with CB11 is controversial although most studies agree that it is a nonspecific finding. However, a few studies have demonstrated concordance between strong cytoplasmic staining and poor prognosis in a subset of node positive breast cancer patients (16). Thus for the present time only the membrane pattern of staining should be evaluated.

In this study, the most appropriate cut off points for percentage membrane staining by IH that resulted in the best concordance between IH and PCR, was 10% P = .0001. The percentage of cells showing moderate or strong positivity is a more appropriate method of expressing the HER-2/neu status rather than using the scoring system of 1+ 2+ 3+ since the latter can be confusing because of different definitions by different groups. In addition, the intensity of IH can be manipulated by modifying the immunohistochemical technique. The assessment of the percentage of positive cells, although subjective, is reproducible as pathologists are trained to assess other biological markers e.g., estrogen and progesterone receptors in a similar fashion.

Concerns related to the high rate of “false positive” results by the HercepTest were expressed by Roche et al. (17). Although the HercepTest provides the opportunity to strictly adhere to a unified methodology by the users, it is relatively expensive and there are concerns regarding the clinical significance of the 2+ positive group. This was further substantiated by Tubbs et al. (18) who showed that 15% of cases that were 2+ positive by HercepTest did not show gene amplification by either FISH or by mRNA detection.

Mass et all (15) showed that the cases scored as 2+ in the CTA were found to have gene amplification using FISH in only 24% of cases as compared with 89% in the 3+ cases confirming the high false positive rate in the 2+ category. Moreover, the cases that had the 3+ score showed significant response to Herceptin and Taxol or Adriamycin and cyclophosphamide (AC) when compared with 2+ cases (3). These results indicate that the Herceptest is specific but too sensitive and that the 2+ score (10% weak to moderate complete membrane staining) include a subset of tumors that do not have HER-2/neu gene amplification.

From the study of Jacobs et al. (19) it is reasonable to conclude that experienced labs using the same antibody with different methodologies and scoring methods can obtain concordant and reasonable results (97% concordance). Attempts to standardize scoring of IH staining in general and specifically for HER-2/neu were adopted by Allred et al. (20). The score of 1–5 incorporates the intensity of staining and the percentage of positive cells. Using that scoring methodology Molina et al. showed that there was good concordance between IH and Western blotting data and that both methods showed prognostic significance (21).

The variability of specificity and sensitivity of the various antibodies has been addressed by Pauletti et al. in 1984 (5) and by Penault-Laurca (4) and recently O’Malley (22). The latter study showed the high specificity of CB11, TAB250 and HercepTest (97%, 95%, 82%) and the sensitivity of these three antibodies (73%, 81%, 92%). The positive predictive value of HercepTest, however, was low at 63% compared with TAB250 at 84% and CB11 at 91%.

In our study there was excellent overall concordance between IH and differential PCR for assessing HER-2/neu status in 235/250 cases of in situ and invasive breast cancer, (94%) [IH+/PCR+ (16%) and IH-/PCR- (78%)]. In the invasive cancers (220 cases) the overall concordance between IH and PCR was 95%. The sensitivity was 91% and the specificity was 96%. Similar results correlating PCR and IH were obtained by P. Decremoux (23) and O’Malley (22).

FISH was useful in resolving some of the discrepancies between IH and differential PCR in the 11 discordant cases of invasive carcinoma. Three of the 11 discrepant cases were IH +ve/PCR –ve and FISH +ve indicating that the PCR was a false negative result. All three cases showed few nests of tumor cells surrounded by abundant stroma. The excessive amount of normal tissue compared with the tumor could account for the false negative PCR results. PCR and FISH were positive in two cases, whereas IH was negative indicating a false negative IH result. In these two cases the HER-2/neu copy number by FISH was low (i.e., five to six copies) suggesting that in cases with low level amplification IH, on formalin fixed material may be negative or weak. The six remaining discrepant cases were IH-ve/FISH –ve but PCR +ve indicating a false positive PCR result. This is difficult to explain but may be due to technical aspects of PCR. It is possible that real time PCR could overcome this problem (24).

Pauletti et al. (5) using the Vysis probe found FISH to be highly sensitive (96.5%) and specific (100%) as compared with IH on frozen tissue. Similar results were obtained by Mitchel et al. (25). Press et al. (26) found FISH to be 98% sensitive and 100% specific compared with PCR and that FISH results were an independent predictor of poor outcome. Similar data showing concordance between IH and FISH were obtained by P. Jimminez et al. (27) and Jacobs et al. (28). These studies suggest that very few cases (3–5%) show protein over-expression without gene amplification indicating that FISH or PCR could be false negative in about 5% of cases.

The percentage of cases with HER-2/neu amplification or over-expression varies considerably in the literature from as low as 10% to 54% (1). Our overall frequency of HER-2/neu over-expression/amplification in invasive cancer was 16%. The frequency detected in the node positive patients was 22% and in the node negative patients 12.5%. The latter group was predominant in our series. The higher percentage positivity in the node positive patients supports the more aggressive behavior of the HER-2/neu positive tumors.

In summary, we have demonstrated that IH using CB11 is a reproducible, sensitive and specific method for assessing HER-2/neu status in breast cancer. The current study indicates that moderate or strong complete membrane staining in ≥10% of tumor cells in formalin fixed paraffin embedded tissue sections provides the appropriate cut off point when compared with PCR and FISH. IH is easily applicable in most laboratories and is relatively inexpensive. Routine use of FISH for HER-2/neu testing that recently has been advocated by Slamon at ASCO2000 (Slamon D, personal communication) is not practical for a widely required test for breast cancer management since FISH is expensive and only available in large specialized labs. Positive and negative IH controls should always be performed and all cases with indeterminate results should be reassessed by either FISH or PCR in the specialized laboratory providing this service. Clinical validation with outcome data would strengthen our proposed practice of using ≥10% complete, moderate strong membrane staining as a positive cut off for IHC testing for HER-2/neu oncoprotein over-expression.