Main

Since the publication of the human genome and the development of high-throughput array-based gene expression profiling platforms, knowledge about breast cancer and its genetic background has increased enormously. On the basis of the gene expression, invasive breast cancers can be classified into three major subtypes: luminal, basal-like and Her2/neu-overexpressing.1 Many investigators have used surrogate immunohistochemical markers to classify these tumors: estrogen receptor, progesterone receptor, and HER2 negative breast cancers (=triple-negative breast cancer profile) are classified as normal breast-like if basal cytokeratins and epidermal growth factor receptor are lacking, and basal-cell-like cancers when basal cytokeratins (cytokeratin 5 and 6 and/or cytokeratin 14) are expressed.2 Most breast cancer series contain 8–15% triple-negative tumors and 10–15% basal-like tumors. Triple-negative and basal-like breast cancers are often (but not always) associated with high proliferation, high grade, young age, BRCA1 and aggressive clinical behavior (30–50% mortality in lymph node negative cancers).3 Furthermore, certain gene profiles correlate with estrogen receptor expression, therapy response and resistance and survival outcome. Some of these gene profiles are currently being validated in large prospective multi-national studies.

It is interesting that the gene signatures correlating with outcome contain high numbers of genes correlated to proliferation. A recent study even found that a signature based upon cell-cycle-related genes only, was a more accurate predictor of breast cancer clinical outcome than another FDA approved signature containing other genes as well.4 Others5 found that the simplest model defining the risk score as the expression of a single proliferation gene, yielded similar or even better performance than models fitted from genome-wide data, and also outperformed classical factors, such as histological grade. This is in agreement with previous retrospective and prospective studies performed on large numbers of lymph node negative breast cancer patients showing that proliferation measured by thymidine labeling index, mitotic activity index, or phosphohistone H3 are stronger prognosticators than classical predictors.6 Moreover, in two independent studies, adjuvant chemotherapy was significantly beneficial for patients with rapidly proliferating tumors, but not for patients with slowly proliferating tumors.7, 8 A comparison between phosphohistone H3/mitotic activity index and gene signatures, however, has not yet been performed.

Recently an extra level of gene regulation was discovered: small non-coding RNA molecules called microRNAs. They have an important role in gene silencing by binding directly and specifically to mRNA molecules and enabling their degradation. The microRNAs are 19–25 nucleotides in length and compose the largest family of non-coding RNA's involved in gene silencing. Their functions are exerted through translational inhibition of the targeted mRNA by binding to the 3′-untranslated region (imperfect match) and degradation of target mRNA (perfect match).9 The microRNAs are downregulated in a number of different tumors,8, 10 and in some cases the re-introduction of these microRNAs has been shown to impair the viability of cancer cells.11

In the current study, we investigated in microRNA arrays the correlation between microRNA expression and several important biological and prognostic breast cancer features, such as proliferation, estrogen receptor/progesterone receptor/Her2/cytokeratin 5 and 6 expression to better understand the fundamentals of the regulation of these features.

Materials and methods

Patients

The study was approved by the Regional Ethics Committee, the Norwegian Social Science Data Service, and the Norwegian Data Inspectorate. According to national guidelines from the Norwegian Breast Cancer Group, during the period 1993–1997, fresh frozen tumor tissue from each breast tumor patient was stored at the Department of Pathology of the Stavanger University Hospital for hormone receptor determination. In total, material from 235 individual patients was stored, of which 135 were lymph node negative. The following patients had to be excluded because of either bilateral disease (n=3), breast tumors in earlier years (n=8), lack of follow-up (n=3), lack of adequate material (n=10) or too poor quality RNA (n=8). This left 103 patients with adequate material and follow-up. Although these cases were selected on the basis of the presence of fresh frozen tissue, the distribution over the different classes was similar to the overall distribution in population-based studies in the province of Rogaland-South, Norway.6 All patients had been treated with either modified radical mastectomy, or breast-conserving therapy with postoperative radiation. Additional adjuvant treatment, including peri-operative chemotherapy, postoperative radiation (also in selected patients treated with modified radical mastectomy) and endocrine adjuvant treatment (ie tamoxifen) was offered according to the national guidelines of the Norwegian Breast Cancer Group at the time of diagnosis.

Histopathology

The post-surgical size of the tumor was measured in fresh specimens; the tumors were sliced (0.5 cm), fixed in buffered 4% formaldehyde, and embedded in paraffin. Paraffin sections (4 μm) were stained with haematoxylin, eosin, and saffran. Histological type was assessed according to World Health Organization criteria.12 Grade was assessed according to the Nottingham modification13 on the basis of the careful examination by two pathologists with considerable experience in breast pathology, using the criteria mitotic activity index 0–5=1, mitotic activity index 6–10=2, and mitotic activity index >10=3; nuclear atypia mild=1, moderate=2, and marked=3; and tubular formation majority (ie, >75%)=1, moderate (10–75%)=2, and little or none (<10%)=3. Grade was the sum of mitotic activity index, nuclear atypia, and tubular formation values. Thus, a sum of 3–5 was Grade I, 6–7 was Grade II, and 8–9 was Grade III. Mitotic activity index was assessed as previously described.14 Briefly, all unambiguous mitoses were counted with a conventional transmission light microscope with a × 40 objective (450 μm at specimen level) in 10 consecutive neighboring fields of vision in the invasive, most cell-dense area in the periphery of the tumor (representing a total area of 1.59 mm2 at the specimen level).

Immunohistochemistry

Antibody dilution and immunohistochemistry protocols were optimized before the study onset. To ensure uniform handling of samples, all sections were processed simultaneously. Paraffin sections adjacent to the haematoxylin, eosin and saffran sections used for assessment of the mitotic activity index and histology were mounted onto Superfrost Plus slides (Menzel, Braunschweig, Germany) and dried overnight at 37°C followed by 1 h at 60°C. Sections were deparaffinized in xylene and rehydrated in decreasing concentrations of alcohol. Antigen was retrieved with a highly stabilized retrieval system (ImmunoPrep, Instrumec, Oslo, Norway) using 10 mM TRIS/1 mM EDTA (pH 9.0) as the retrieval buffer. Sections were heated for 3 min at 110°C followed by 10 min at 95°C and cooled to 20°C. Rabbit polyclonal anti-phosphohistone H3 (ser 10) (Upstate 06-570; Lake Placid, NY, USA) was used at a dilution of 1:1500. Cytokeratin 5 and 6 (Clone D5/16 B4, Dako, Glostrup, Denmark) was used at a dilution of 1/100. Estrogen receptor (clone SP1, Neomarkers/LabVision, Fremont, CA, USA) was used at a dilution 1/400. Progesterone receptor (Clone SP2, Neomarkers/LabVision) was used at a dilution of 1/1000. Anti-phosphohistone H3 was incubated for 60 min at 22°C. All the other antibodies were incubated for 30 min at 22°C Dako antibody diluent (S0809) was used. Endogenous peroxidase activity was blocked with a peroxidase-blocking reagent (S2001; Dako) for 10 min. The immune complex was visualized with the Dako REAL EnVision Detection System, Peroxidase/DAB, Rabbit/Mouse (K5007; Dako). Sections were incubated with EnVision/HRP, Rabbit/Mouse for 30 min and diaminobenzidine chromogen for 10 min. The sections were counterstained with haematoxylin, dehydrated, and mounted. All steps were performed using a Dako autostainer and TBS (S1968; Dako) with 0.05% Tween 20 as wash buffer. For HER2 assessments, Dako Herceptest was used according to the manufacturer's procedures.

Quantification of Immunohistochemical Stainings

The phosphohistone H3 index was assessed using the same counting protocol as for the mitotic activity index.15 Two independent pathologists counted the number of phosphohistone H3-positive objects (nuclei and mitoses) in 10 adjacent fields of vision, with a × 40 objective, as described above for mitoses. Nuclei with fine granular phosphohistone H3 staining were not counted, as these cells are not in the G2 phase.16 Phosphohistone H3-rich areas are usually localized in the periphery (ie, growing zone) of the cancers. If the counts of two observers differed by more than three figures, the count was repeated with a multi-head microscope and a consensus score was obtained. In addition to performing subjective counts, phosphohistone H3 expression was evaluated using the fully automated Visiopharm Integrator System (VIS) analysis system (Visiopharm, Hørsholm, Denmark), following the same image processing principles described before.6 Reproducibility of the phosphohistone H3 measurements between subjective counts by two observers, and between subjective and digital image analysis results was high (R=0.94–0.98). Not surprisingly, the reproducibility of the phosphohistone H3 counts by the automated digital image analysis on different days by different observers was close to perfect (R=0.99). For this reason, in the statistical analysis the image analysis counts were used.

The percentage of cytokeratin 5 and 6 positive tumor cells in each cancer was scored using a continuous scale of 0–100%. Estrogen receptor and progesterone receptor were scored as positive when nuclear staining was present in >10% of the tumor cells. HER2 was scored according to the Dako Hercep Test scoring protocol. All 2+ and 3+ cases were regarded as positive. All sections were independently scored by two pathologists.

RNA isolation/Labeling/Hybridization

At least two 10 μm cryosections were cut; to assess the number of tumor cells in the tissue all sections were evaluated by an experienced breast pathologist. Where possible, only material from the tumor area was isolated by means of macrodissection. All tissues contained at least 50% tumor cells. The RNA isolation method was chosen after careful comparison of different RNA isolation procedures. The MirVANA total RNA isolation kit (Ambion/Applied Biosystems, Austin, TX, USA) appeared to be the best-suited isolation method for use with Exiqon LNA microRNA profiling (unpublished results). Total RNA was isolated according to the protocol provided by the manufacturers. For quality control, all (n=109) samples were analyzed by both Agilent Bioanalyzer 2100 (RNA and microRNA chips) and RNA measurement on the Nanodrop instrument. The samples were labeled using the miRCURY Hy3/Hy5 power labeling kit and hybridized on the miRCURY LNA Array (v.11.0, Exiqon A/S, Vedbaek, Denmark). These arrays contain melting temperature (Tm)-normalized capture probes for 2090 microRNA's (all human and viral microRNA's (n=838) annotated in miRBase 11, as well as 427 proprietary microRNAs not yet included in miRBase 11.0), including the corresponding pre-microRNAs. The samples were hybridized on a hybridization station, where analysis of the scanned slides showed that the labeling was successful as all capture probes for the control spike-in oligonucleotides produced signals in the expected range. The quantified signals (background correction) were normalized using the global LOcally WEighted Scatterplot Smoothing (Lowess) regression algorithm, which produced the best within-slide normalization to minimize the intensity-dependent differences between the dyes. All hybridizations were made against a common reference pool consisting of all patient material combined.

microRNA Analysis

Data analysis was performed with only those microRNAs present in 10 or more samples (n=604). Using the free downloadable software package Dchip (version 31 March 2009; Dana-Farber Cancer Institute, Boston, MA, USA),17 absolute correlations (including genes with opposite gene profiles) were calculated using analysis of variance with a P-value of ≤0.01. Gene lists created in this way were used to classify samples by linear discriminant analysis (LDA) and cluster analysis. Classification accuracy was tested by performing cross validation analysis (by leaving one sample out and reselecting genes). Gene lists and classification results were chosen according to the highest classification accuracy after cross validation.

Results

An overview of the clinicopathological features for all patients included in this study is given in Table 1. Unsupervised hierarchical cluster analysis divides the patients into four main groups (Figure 1), of which the basal-like/triple-negative group is the most prominent (11% of all cases), the luminal A cancers containing the Her2 negative and estrogen receptor/progesterone receptor-positive tumors is the largest group (57%), and the group of luminal B (32%) is more heterogeneous and contains the Her2 positive/estrogen receptor-negative patients as well. In the luminal B cancers, two smaller subgroups can be distinguished where only age is significantly different (67.6 vs 57.1), and although insignificant, there is also a trend towards higher proliferation in the older patient group (Table 2). analysis of variance P-values were lowest for cytokeratin 5 and 6 (P=0.00000001) and estrogen receptor (P=0.0000001). The microRNAs that correlated with the different clinicopathological features are listed in Table 3. It is interesting that mir-106b, mir-29, and their pre-microRNAs (mir-106b* and mir-29*) are present in several of the signatures. Hierarchical cluster analysis shows that the estrogen receptor-negative patients tend to overlap with the cytokeratin 5 and 6 positive patients, these patients are often under 55 years of age (median age=49.5) and all have high proliferation (mitotic activity index≥10 and phosphohistone H3≥13). The following microRNAs are positively correlated with proliferation (independent of which method (mitotic activity index or phosphohistone H3) or threshold used to measure proliferation) miR-25, miR-106b, miR-130b, miR1274a, and miRPlus-1030. On the other hand, miR-29c is negatively correlated with proliferation and also strongly correlates with estrogen receptor. Estrogen receptor-negative tumors have very low concentrations of microRNAs that are located at 1q, such as miR-190b.18 Another interesting observation is the presence of several microRNAs (miR532-5p, miR-500, miR362-5p, and miR502-3p) located at Xp11.23 in cancers with a triple-negative signature and the upregulation of several miR-17 cluster members in estrogen receptor-negative tumors.

Table 1 Overview for all clinicopathologic features
Figure 1
figure 1

Unsupervised hierarchical clustering for 103 samples and 603 valid microRNAs. The heat map diagram shows the result of the two-way hierarchical clustering of microRNAs and samples. Each row represents a microRNA and each column represents a sample. The microRNA clustering tree is shown on the left, and the sample clustering tree appears at the top. The color scale shown at the bottom illustrates the relative expression level of a microRNA across all samples: red represents an expression level above mean, and blue represents expression lower than the mean. Gray color means that the specific microRNA on a given slide has a signal below background. Numbers for clinicopathological features indicate the following: Tsize2 (Tumor size: 0≤2 cm, 1>2 cm), Nottgrade (Nottingham grade: 1=grade 1, 2=grade 2, 3=grade 3), histologic type (1=tubular, 2=colloid, 3=medullary, 4=lobular, 5=ductal, 6=mix ductal/lobular), MAI10 (mitotic activity index with threshold 10: 0<10, 1≥10), MAI3_9 (mitotic activity index with thresholds 3 and 9: 0<3, 1=≥3–≤9, 2>9), MAI2 (mitotic activity index with threshold 2: (0<3, 1≥3), H3_13 (phosphohistone H3 with threshold 13: 0<13, 1≥13), Her2pos_neg (Her2: 0=0 or 1+, 1=2+ or 3+), PR (progesterone receptor: 0≤10% positive tumor cells, 1>10% positive tumor cells), ER (estrogen receptor) (estrogen receptor: 0≤10% positive tumor cells, 1>10% positive tumor cells), Triple-negative (0=positive for either ER/PR/Her2, 1=negative for ER and PR and Her2), cytokeratin 5 and 6 (cytokeratin 5 and 6: 0=no staining, 1=any percentage of positive tumor cells).

Table 2 Classification results for each clinicopathological feature
Table 3 List of all microRNA's correlating with the different clinicopathological features (only those giving the highest classification value are shown).

As proliferation is age dependent,19 we repeated all the analyses for patients aged <71 years (n=70 patients). Again, highest overall classification values were found for estrogen receptor, triple-negative and cytokeratin 5 and 6, with 91, 85, and 84% accuracy, respectively, and the analysis of variance P-values were highest for estrogen receptor (0.000001) and cytokeratin 5 and 6 (0.00001) (Figure 2). Also in unsupervised hierarchical clustering, the basal-like/triple-negative group is the most easily recognizable. Even though the group of patients under 71 years of age is smaller than the patient group as a whole, the signatures correlating to estrogen receptor/triple-negative/cytokeratin 5 and 6-positive and proliferation are still very stable and highly significant. MiR-106b is prominently present in all of these signatures and correlates strongest with high proliferation.

Figure 2
figure 2

Supervised hierarchical clustering for cytokeratin 5 and 6. The heat map diagram shows the result of the two-way hierarchical clustering of microRNAs and samples. Each row represents a miRNA and each column represents a sample. The microRNA clustering tree is shown on the left, and the sample clustering tree appears at the top. The color scale shown at the bottom illustrates the relative expression level of a microRNA across all samples: red represents an expression level above mean, blue represents expression lower than the mean. Gray color means that the specific microRNA on a given slide has a signal below background. Numbers for clinicopathological features indicate the following: Tsize2 (Tumor size: 0≤2 cm, 1>2 cm), Nottgrade (Nottingham grade: 1=grade 1, 2=grade 2, 3=grade 3), histologic type (1=tubular, 2=colloid, 3=medullary, 4=lobular, 5=ductal, 6=mix ductal/lobular), MAI10 (mitotic activity index with threshold 10: 0<10, 1≥10), MAI3_9 (mitotic activity index with thresholds 3 and 9: 0<3, 1=≥3–≤9, 2>9), MAI2 (mitotic activity index with threshold 2: (0<3, 1≥3), H3_13 (phosphohistone H3 with threshold 13: 0<13, 1≥13), Her2pos_neg (Her2: 0=0 or 1+, 1=2+ or 3+), PR (progesterone receptor: 0≤10% positive tumor cells, 1>10% positive tumor cells), ER (estrogen receptor: 0≤10% positive tumor cells, 1>10% positive tumor cells), Triple-negative (0=positive for either ER/PR/Her2, 1=negative for ER and PR and Her2), cytokeratin 5 and 6 (cytokeratin 5 and 6: 0=no staining, 1=any percentage of positive tumor cells).

Discussion

Most microRNA profiling studies have been performed in mixed groups with both lymph node negative and positive breast cancer patients, which are inevitably very heterogeneous. The results of such studies, therefore, must be interpreted with great care. The current study is one of the largest microRNA array studies published regarding lymph node negative breast cancer and also compares microRNA with several important biological features, such as estrogen receptor/progesterone recpetor/Her2/cytokeratin 5 and 6, and proliferation to create more insight into their fundamental regulation. We found that estrogen receptor negativity and cytokeratin 5 and 6 positivity are the most prominent biological features with unique microRNA profiles. Furthermore, several microRNAs were found that correlate with high proliferation, which is important as proliferation is the strongest single prognosticator in this group of lymph node negative patients.14 Being able to modulate the level of microRNAs involved in proliferation, such as mir-106B, might, therefore, offer new possibilities for treating patients.

Although most of the microRNA studies have been performed using mixed groups of both lymph node negative and -positive breast cancer patients with earlier versions of arrays (thereby, including fewer microRNAs), several microRNAs have been described before in relation to estrogen receptor/progesterone receptor/triple-negative/cytokeratin 5 and 6 and proliferation. miR-342 was found to be higher in luminal B subtype breast cancer and lowest in the triple-negative/basal-like subtype,20 and furthermore, miR-342 level as measured by quantitative reverse transcriptase-PCR was found to positively correlate with Her2 positivity. In our study, miR-342 was especially low in estrogen receptor-negative tumors and cytokeratin 5 and 6 positive tumors. Although miR-342 has also been found to be downregulated in tamoxifen- resistant breast cancer cells compared with tamoxifen-sensitive cells,21 a clear target with a link to the triple-negative/basal-subtype and tamoxifen resistance has not been described as yet.

In vitro experiments have shown that miR-18a can directly bind to the 3′-untranslated region of estrogen receptor-α, both overexpression of the precursor pre-miR-18a, and mature miR-18a could result in a considerable reduction of estrogen receptor protein.22 Adding estradiol to estrogen receptor-α-positive cell lines resulted in immediate upregulation of pri-mir17-92 expression, most likely by recruiting c-MYC to the mir-17-92 promoter. Furthermore, in women with a hepatocellular carcinoma, miR-18a expression correlated with reduced estrogen receptor-α and caused increased proliferation,23 thereby supporting our findings that miR18a is highly expressed in high-proliferating estrogen receptor-α negative tumors. MiR-106b, also found to be upregulated in high-proliferating estrogen receptor-α negative tumors, has been described to be able to negatively regulate AIB1 protein translation by a direct interaction with the 3′-untranslated region of AIB1 mRNA, and is also able to downregulate p21.24 Transfection with mir-106b could even prevent p53-induced blockage of the cell cycle at the G1-S checkpoint, after doxorubicin-induced DNA-damage, probably by silencing p21.25 Other members of the mir-106b family, eg miR-93 and miR-106a, are also positively correlated with proliferation; transfection of mir-106a resulted in an increase by 14–15% in the number of S-phase cells.25 Both mir-93 and mir-130b have been described to target the 3′ untranslated region of the mRNA for a tumor suppressor protein, tumor protein 53-induced nuclear protein 1.26 Overexpression of tumor protein 53-induced nuclear protein 1 induces cell-cycle arrest and apoptosis in several cell lines, even in the absence of p53. In this case, tumor protein 53-induced nuclear protein 1 is functionally associated with p73 and allows regulation of cell-cycle progression and apoptosis, independent of p53.27

In agreement with Lowery et al,20 mir135b was also inversely correlated with estrogen receptor in our study: one possible target for mir-135b is the estrogen-related receptor α according to PicTar. This protein is an orphan member of the superfamily of hormone nuclear receptors, and has been shown to have a key role in the regulation of estrogen-responsive genes by efficiently binding estrogen-responsive elements leading either to modulation of the response to estrogens or functional substitution for estrogen receptor as a constitutive activator of estrogen responsive elements-dependent transcription. In estrogen receptor-negative cell lines, estrogen-related receptor-α has been shown to act as an constitutive, estrogen-independent activator of transcription of estrogen responsive elements-dependent transcription.28 One of the target genes for such activation is Osteopontin, a secreted protein that is over-expressed in a number of human cancers, and has been associated with increased metastatic burden and poor prognosis in breast cancer patients.29 Other microRNAs that inversely correlate with estrogen receptor-α expression are mir-505 and mir-181a-2*, and although little is known about these miRs, mir-181a has been described to target p27 in myeloid leukemia cells.30

MiR-29c is downregulated in estrogen receptor-α negative, cytokeratin 5 and 6 positive and high proliferating tumors, and has recently been described to upregulate p53 and directly suppress p85α (the regulatory subunit of PI3 kinase) and CDC42 (a Rho family guanosine triphosphatase), both of which negatively regulate p53.31

The current study shows that estrogen receptor negativity and cytokeratin 5 and 6 expression are important biological processes in lymph node negative breast cancer as the correlations with specific microRNAs are strongest in the estrogen receptor negative and cytokeratin 5 and 6 positive tumors. Although proliferation is a strong prognosticator in lymph node negative breast cancer, it is clearly the end product of different biological processes, thereby making it more difficult to separate the low from the high proliferative group. As mRNA profiling studies have shown, larger numbers of tumors need to be analyzed to confirm these results and each individual microRNA needs to be investigated more thoroughly in both formaldehyde-fixed paraffin embedded tissues and in cell cultures, for localization and biological function.