Main

The collection of tissue material is an important component of many clinical trials studying the efficiency of different cancer therapies. It is hoped that molecular analyses of tissues of patients that were enrolled in studies will eventually allow a distinction of these patients that responded favorably to therapy. Tissue materials retrieved from clinical trials are often limited to a small number of unstained sections that are submitted by local pathologists and subsequently stored at a central place. A collection and central storage of tissue blocks has proved to be impractical because many institutions dislike to lose control over the tissue, which might be needed for additional diagnostic procedures to be performed years after the initial diagnosis and enrolling the patient in a study.

The current praxis of collecting unstained sections for later use is overshadowed by strong evidence suggesting that long-time storage of unstained sections will significantly deteriorate the results of molecular analyses.1, 2, 3, 4 The problem is relevant for many molecular techniques including fluorescence in situ hybridization (FISH),5 RNA in situ hybridization,6 and immunohistochemistry (IHC).7 IHC is the most frequently used technology in translational research studies. Studies have shown that the staining intensity of immunohistochemical reactions deteriorates for most antibodies with time,1, 2, 7, 8, 9, 10, 11, 12 that a significant decrease of the staining occurs as early as 2 weeks after sectioning,1 and that sophisticated methods of slide storage only incompletely avoid deterioration of IHC stainings.1, 13, 14 These data question the utility of current slide collection schemes in clinical trials as a deterioration of IHC stainings can result in false-negative results.

No study has so far addressed the question of the influence of slide aging on the outcome of translational research studies using IHC. To investigate the impact of age-related IHC deterioration on statistical analyses, it is necessary to analyze a sufficiently large number of cases on tissue sections of different age. Obviously, performing a large IHC study twice with slides of different age would be impractical if traditional methods were used. Our recently developed tissue microarray (TMA) technology is highly suited for such methodological comparisons.15, 16 In this method, minute samples (diameter 0.6 mm) from up to 1000 different tissues are placed on one microscope glass slide. Many minor parameters with a potential influence on the outcome of immunohistochemical analyses such as slide thickness, slide age, incubation time, reagent concentrations are fully standardized if consecutive sections from a TMA block are used in one experiment. In this study, we utilized fresh and old sections from a breast cancer TMA containing samples from 522 patients with clinical follow-up data for a systematic analysis of the influence of slide aging on the outcome of translational research projects using IHC.

Materials and methods

Patients

Formalin-fixed, paraffin-embedded tumor samples from 522 breast carcinomas had previously been included into a TMA. All slides from all tumors were reviewed by one pathologist (GS) to define the histologic grade according to Elston and Ellis17 (BRE) and the histologic tumor type. The pathologic stage and nodal status were obtained from the primary pathology reports. The series included 333 ductal, 82 lobular, nine medullary, 19 mucinous, 29 cribriform, 10 tubular, 14 papillary, eight apocrine, eight clear cell, two metaplastic, five atypical medullar, one histiocytic, one adenoid-cystic carcinoma, and one carcinosarcoma. There were 31.2% grade 1, 38.5% grade 2, and 30.3% grade 3 cancers. The pT stage was pT1 in 28.2%, pT2 in 53.9%, pT3 in 8.1%, and pT4 in 9.8%. Axillary lymph nodes had been examined in 458 patients (51.5% pN0, 43.9% pN1, 4.6% pN2). Raw survival information was available from the Cancer Registry of Basel. Advanced stage, high grade and positive nodal status were strongly associated with poor survival of our patients (P<0.0001 each). The mean follow-up time was 90 months (range 3–152).

IHC

Two sets of slides taken from one TMA block were utilized for this study. The first set (old slides) was cut 6 months before the immunohistochemical staining and stored at 4°C. The second set was cut 1 week before IHC staining (fresh slides) and stored at room temperature. The used antibodies, their dilutions, and pretreatment conditions as well as the definitions used for definition of positivity are summarized in Table 1. For all antibodies, protocols were used that had previously been established either for routine use on ‘large sections’ or for scientific studies on TMAs. No further protocol optimization was attempted. All tumors were analyzed for ER, PR, and HER2. The analysis was restricted to the first two quadrants (306 tumors) of our TMA for cyclin D1 and E cadherin staining because of mechanical damage on the old sections. All TMA sections were scored by one pathologist (GS) who was unaware of the section age information in 1 day in a consecutive session to minimize intraobserver variability.

Table 1 Antibodies

Statistics

Contingency table analysis and χ2 tests were used to study the relationship between IHC results, grade, stage, and nodal status. Survival curves were plotted according to Kaplan–Meier. A log-rank test was applied to examine the relationship between IHC data and tumor-specific survival. Surviving patients were censored 3 months before the date of inquiry to the Cancer Registry.

Results

Frequency of Positivity

The average frequency of positive cells and the average staining intensity was lower on the old slides than in the fresh sections for all examined antibodies. Even though the staining intensity difference did not result in a different classification in all cases, significant differences were visible in almost all tumors with positive staining. Examples of tumors with a weaker staining on old slides than on fresh sections are shown in Figure 1. These cases illustrate the magnitude of the staining differences in individual cases that are beyond the possible level of analysis subjectivity. For all five antibodies, this resulted in a higher fraction of positive cases in the fresh section group as compared to the old section group (Figure 2) according to our preselected definitions. On freshly cut sections, the frequency of positivity (including weak staining) increased from 46 to 65% for ER (P<0.0001), from 18.5 to 33% for PR (P<0.0001), from 9.6 to 16.3% for HER2 (P=0.0047), from 37.7 to 45.1% for cyclin D1 (P=0.10), and from 32.9 to 58.9% for E-cadherin (P<0.0001).

Figure 1
figure 1

IHC staining results on old and fresh TMA sections. These images show examples of IHC stainings obtained on consecutive sections from individual tumors. (a) HER2 staining on freshly cut section. (b) HER2 staining on old section (same case as (a)). (c) HER2 staining on freshly cut section. (d) HER2 staining on old section (same case as (c)). (e) ER staining on freshly cut section. (f) ER staining on old section (same case as (e)). (g) ER staining on freshly cut section. (h) ER staining on old section (same case as (g)). (i) E-cadherin on freshly cut section. (j) E-cadherin on old section (same case as (i)). (k) E-cadherin on freshly cut section. (l) E-cadherin on old section (same case as (k)).

Figure 2
figure 2

Influence of slide aging on the fraction of positive cases. For each antibody, the frequency of positive cases is shown as separate bars for old (O) and fresh (F) sections.

Association with Tumor Phenotype

We then analyzed previously established associations between gene expression and tumor phenotype. In particular, we were interested whether the known associations between PR and cyclin D1 expression with low tumor grade and positive ER status as well as the association between HER-2 positivity and high tumor grade and negative ER status could be confirmed on old and fresh TMA sections. The results of this analysis are shown in Table 2. Despite marked differences in the staining intensity of individual samples, all expected associations could be found on old and fresh TMA sections, although the level of significance was sometimes better for new section analyses than for old section results. The use of different cutoff levels for definition of positivity such as 1% positive nuclei for ER positivity or 2+/3+ for HER2 positivity did not change the overall message of our results (data not shown).

Table 2 Associations with phenotype and HER2/ER/PR/cyclin D1 status

Association with Survival

In a further analysis, we investigated the association between our IHC data for ER, PR, HER2 and raw patient survival. These results are shown in Figure 3. Significant association with prognosis was detected for all parameters in the fresh section analysis. The association with poor survival was always weaker if old sections were analyzed. For HER2, statistical significance was still achieved while significance was not reached for ER (P=0.09) and PR (P=0.11). No association with patient survival was found for E-cadherin and cyclin D1 expression, neither on fresh nor on old sections.

Figure 3
figure 3

Influence of slide aging on survival analyses. The survival curves show the outcome of survival analyses performed with data obtained on fresh and old sections. (a) ER staining on freshly cut sections. (b) ER staining on old sections. (c) PR staining on freshly cut sections. (d) PR staining on old sections. (e) HER2 staining on freshly cut sections. (f) HER2 staining on old sections.

Discussion

Previous studies have demonstrated that the intensity of immunohistochemical stainings decreases for many antibodies after long-time storage of tissue sections.1, 2, 3, 4, 7, 10, 11, 12, 13, 14 These studies have examined a variety of different antibodies but were always limited to a small number of tissues. Therefore, none of the earlier studies has provided data on the influence of slide aging on the results of research studies looking at associations between IHC results and other parameters. The use of TMAs enabled the rapid analysis of a sufficiently large set of ‘old’ and ‘freshly cut’ tumors and allowed us to investigate to what extent the age of tissue sections would affect associations between IHC results, tumor phenotype, and clinical outcome.

Our data show the expected clear differences between freshly cut and old sections in the staining intensity for all five antibodies. However, despite a clearly decreased immunoreactivity, most of the expected associations between molecular findings and clinical or pathological parameters could still be found in the old section analysis. Importantly, this indicates that slides stored for a long period of time are still valuable for research purposes. The obvious ability to gather meaningful biologic information despite suboptimal staining quality also reflects a general phenomenon in IHC studies. If the same protein is studied with different antibodies, protocols, or scoring criteria, investigators often find highly discrepant data with respect to the absolute number of positive cases but a much better concordance of their results with respect to clinicopathological associations. The use of slides with an old—but standardized—age is comparable to other protocol variations that may either increase or decrease the sensitivity of an IHC setting.

Major research efforts are warranted to seek solutions that allow a reliable IHC analysis after long-term tissue storage. The slide aging problem is likely to become more important in the future, because of the increasing importance of retrospective diagnostic molecular tissue analysis and also because of the increasing use of TMAs in research. Possible strategies to overcome the slide aging problem may include protocol adjustments, perhaps involving a harsher antigen-retrieval procedure. No attempts to improve existing protocols were made in this study, mainly because of a shortage of old TMA sections. It is also possible that not all antibodies for one protein will show a similar degree of age dependence. Bertheau et al2 found no obvious impact of the age of the section on the staining results of seven of their 11 analyzed antibodies. Improved methods for section storage would be another possible solution to the aging problem. However, previous studies had suggested that IHC deterioration cannot satisfactorily be prevented by section storage under paraffin coating or at low temperatures.18

Jacobs et al1 showed a significant decrease of staining intensity already after 2 weeks of storage, suggesting that deterioration of IHC starts early after sectioning. All tissue samples on our ‘old slides’ were of identical age (6 months) in this study. This is in contrast to the situation within clinical trials where tumor tissue sections are continuously collected over a period of several months or years. It will be important to determine to what extent IHC quality decreases between 2 weeks and 6 months of storage and also after more than 6 months storage time. In the optimal case, a plateau will be reached for IHC deterioration after a certain amount of time. In such a scenario, the slide aging effect would be identical for all sections older than the time period in which immunoreactivity keeps decreasing. Appropriate protocol adjustments might then enable IHC analysis of old sections with sufficient quality.

TMAs constitute an optimal tool for such protocol developments. TMAs are highly suited for comparing different procedures, because of a maximal standardization of many parameters that are otherwise difficult to control. An entire TMA study does often involve the analysis of only one slide. In that case, section thickness, slide age, the type and concentration of all reagents, the temperature of incubation or during slide storage and all incubation times are absolutely identical. Even if multiple TMA sections are included in a TMA study, most of these parameters are still identical since the entire study will typically be conducted on 1 day using identical reagents. It can be speculated that other minor parameters such as section thickness or variations in incubation times that are often allowed in IHC protocols (for example: incubation ‘over night’) may have—at least for some antibodies—a significant influence on the outcome of IHC analyses. Overall, the influence of the protocols on IHC data is paramount. For example, the use of three different antibodies resulted in a more than fivefold difference in the positivity rate of the EGFR gene in a recent study.19 As compared to the impact of the experimental procedures on TMA results, the influence of the staining interpretation is only minimal. The small size of the tissue area to be analyzed results in a more reproducible staining interpretation of arrayed tissue samples as compared to large sections.20 If IHC results are evaluated on large sections, the interpretation bias involves the selection of the ‘decisive’ tissue area, a step that is excluded in the TMA setting. For example, in a study comparing p53 analysis on TMAs and on large sections of >500 breast cancers, the area selection bias had prevented us from finding prognostic associations on large sections while these were clearly identified on four different TMAs.19, 20

In summary, the data of this study confirm a major impact of the age of tissue sections on the outcome of IHC analyses. Importantly, such a deterioration of IHC staining intensity may be acceptable for many research projects since significant association between molecular changes and tumor phenotype can often be detected in case of suboptimal staining conditions. It is hoped that optimized staining procedures may compensate for the slide aging problem, at least in some instances. However, a decreased slide quality after long-time storage will not be acceptable if IHC results are used for clinical decision-making for individual patients. Every effort should be taken to use freshly cut sections for potentially clinically relevant IHC analyses.