Main

The cfms proto-oncogene encodes the only known receptor for the macrophage colony-stimulating factor (CSF1). CSF1R is a transmembrane tyrosine kinase receptor, and its ligand, CSF1, has soluble, membrane-bound and cell matrix-associated isoforms.1, 2, 3 The CSF1R/CSF1 receptor/ligand pair has essential physiologic functions in monocyte and macrophage differentiation,4, 5 embryonic implantation and placental development, and lactogenic differentiation of the human breast.6, 7, 8 Abnormally high CSF1R expression has clinically been associated with aggressive breast, ovarian, endometrial, and prostate cancer.9, 10, 11, 12, 13, 14, 15, 16

Most invasive breast carcinoma cells express readily detectable levels of activated CSF1R.13 Studies17 have shown that invasive breast cancer cells coexpress the ligand CSF 1, but adjacent in situ carcinoma cells do not. Moreover, in early-stage breast cancer patients treated with local therapy only, high levels of CSF1R expression also have been associated with a higher likelihood of ipsilateral recurrence.18

To further evaluate the role of CSF1R in invasion and metastasis, we studied a mouse cell line model19 that utilized the HC11 cell line, a spontaneously immortalized mammary epithelial cell line cultured from lactating BALB/c mice, which expressed high levels of CSF1. When HC11 was stably transfected with the mouse wild-type CSF1R oncogene, coexpression of the endogenous ligand produced an autocrine system in all transfectants. Compared with the parental HC11 cells, transfectants expressing the wild-type CSF1R (HC11-CSF1R) invaded 100-fold more efficiently through A barrier reconstituted basement membrane (Matrigel). Moreover, the transfectants were capable of forming colonies in soft agar. When HC11-CSF1R cells were injected intravenously in mice, they produced 10 times more lung metastases than the parental HC11 cell line.

At least six sites of tyrosine autophosphorylation have been observed after the activation of CSF1R, of which two (tyr-721 and tyr-807) have been shown to be important for coupling the CSF1R to intracellular signal transduction pathways.20, 21 The tyr-807 phosphorylation site plays a critical role in CSF1-dependent differentiation in monocytes, and tyr-807 phosphorylation is required for binding of elements of the STAT1 and Src pathways,22 and the activation of the ras-GAP pathway.23 Phosphorylation of tyr-721 is required for the binding of the p100 subunit of PI3-kinase and for CSF1-dependent mitogenesis.24 CSF1R was mutated tyr->phe individually at these two autophosphorylation sites.20 When the HC11 cells were transfected with CSF1R mutated at the 721 phosphorylation site, we obtained a cell line (HC11-CSF1R-721) with high capability for in vitro protease production and local invasion, but no capacity for anchorage-independent growth and in vivo metastasis after the intravenous injection.19 Conversely, transfection of HC11 cells with CSF1R mutated at the 807 phosphorylation site created a phenotype (HC11-CSF1R-807) incapable of in vitro invasion, but capable of anchorage-independent growth and capable of producing in vivo metastasis when injected intravenously into mice.19 Confirmation of presence of high levels of CSF-1R in these cell lines has been performed by Northern and Western blots.19

cDNA microarray technology enables us to examine patterns of expression of thousands of genes simultaneously.25, 26, 27 We hypothesize that transfection of HC11 cells with CSF1R and with the two modified receptors discussed above induces changes in the expression of numerous genes associated with invasion and tumorigenesis. Our goals were: (1) to identify genes that were upregulated or downregulated with transfection of HC11 with wild-type CSF1R and (2) to identify gene expression changes associated with mutation of CSF1R at the 721 and 807 autophosphorylation sites.

We have pursued our goals using this unique cell line model that enables us to compare gene expression profiles in a spontaneously immortalized parental cell line with those observed after transfection with a single gene, the wild-type or mutated CSF1-R. Other cell lines such as BT20 and SKBR3, which are human cell lines, also express CSF1 and CSF1R. However, these cell lines lack counterparts that are isogenic except the CSF-1R expression, and comparisons across different cell lines would reveal gene expression differences that are unrelated to CSF-1R. Therefore, our mouse model serves as a basis for further studies in human cell lines and in human tissues. Furthermore, our model enables us to study changes in gene expression caused by point mutation of critical autophosphorylation sites in CSF1R, necessary for the activation of specific downstream signal transduction pathways.

Materials and methods

Cell Lines and Cell Culture

HC11 cells originated from mammary glands of BALB/c mice in mid-pregnancy as described by Hynes et al.28 HC11 had been stably transfected with the pcDNA3 expression vector encoding cDNAs for either the wild-type CSF1R or the Tyr->Phe721 mutation or the Tyr->Phe807 mutation as described previously.19 We selected the clones that invaded most efficiently through the barrier reconstituted basement membrane (Matrigel) in our previous experiments. All four cell lines (HC11, HC11-CSF1R, HC11-CSF1R-721, and HC11-CSF1R-807) were incubated at 37°C, with 5% CO2 in DME/F12 medium (Sigma Chemical Co, St Louis, MO, USA) containing 10% fetal bovine serum (Life Technologies Inc.), 5 μg/ml bovine insulin (Sigma Chemical Co.), and 10 ng/ml murine epidermal growth factor (Sigma). HC11 cells that had been transfected with either the wild type or the mutant types of CSF1R were maintained in media containing 300 μg/ml gentamicin sulfate (Gemini Bio-Products, Woodland, CA, USA).

RNA Extraction

Total RNA was extracted from the four cell lines using the guanidine-cesium chloride method.19 The samples were electrophoresed in a 1% agarose gel containing 2.2 M formaldehyde to verify the integrity of the RNA.

Probe Preparation and Labeling

Probe preparation and labeling were performed as described previously.29 Briefly, 25 μg of total RNA was used for each cell line for each array. cDNA probes were synthesized by reverse transcription using Superscript RT (Gibco, Tulsa, OK, USA) with oligo-dT as a primer, incorporating allyl amine-deoxyuridine triphosphate (aa-dUTP; Sigma) into synthesized cDNA. Coupling of cyanine-3 (cy3) or cyanine-5 (cy5) dyes to aa-dU-modified cDNAs was carried out using NHS-ester cy-3 or cy-5 dye (Pharmacia, New York, NY, USA) by incubation at 25°C for 90 min in subdued light.

cDNA Microarray Slide Hybridization

Microarray slides of (4.6 K cDNA) were purchased from the Keck Microarray Facility at Yale University. A full list of genes included in this microarray is available upon request. Hybridization of fluorescently labeled probes to the glass slides was performed with hybridization buffer (50% deionized formamide, 12.5% SSPE, 0.625% SDS, 1.5 × Denhardt's with blockers (0.5 μg/μl mouse Cot 1 DNA, 0.1 μg/μl poly-A (15A), 0.2 μg/μl yeast tRNA) at 42°C for 18–24 h. After hybridization, the slides were washed first with 1 × SSC/0.1% SDS at 25°C for 15 min, then with 0.2 × SSC, 0.1% SDS, and finally with 0.2 × SSC.

Microarray Slide Scanning

The slides were scanned with a GSI Lumonics or Axon Model laser scanner. The analysis of the fluorescent hybridization signal of the microarray slide was performed with Genepix software (Axon Instruments, Inc., Union City, CA, USA).

Microarray Experimental Design

HC11-CSF1R was chosen as the reference sample, to which the other three cell lines were compared. HC11, HC11-CSF1R-721, and HC11-CSF1R-807 were each hybridized four times on arrays with HC11-CSF1R. On half of the arrays HC11-CSF1R was coupled with cy3 and the index sample with cy5, on the other half the dyes were reversed. Each array contains duplicate copies of cDNAs representing the same gene. Two different RNA isolations were used to perform four hybridizations per sample pair, generating eight ratios per gene per sample pair.

Microarray Data Preprocessing

As a preliminary step in searching for differentially expressed genes between HC11 (A) and HC11-CSF1R (B), we excluded genes for which we did not have good hybridization in at least one of the eight measurements. Similarly, in comparing HC11-CSF1R-721 (C) or HC11-CSF1R-807 (D) cell lines to HC11-CSF1R (B), we analyzed genes that have eight complete measurements.

Genes that had complete sets of 24 ratios with expression levels greater than 100 pixels per spot from these sets of experiments were selected for a principal component data visualization technique and for unsupervised clustering. In each of the 24 experiments, gene expression levels of the index sample (A, C and D) and of the reference sample (B) were normalized by the median expression level of all genes in that sample. These normalized expression levels were then used to calculate gene expression ratios between the index and reference sample. For projection of the data onto a pair of the best partitioning gene principal components (described below), we selected a subset of genes that satisfied the above filtering procedure (over 2500 genes), and had at least two-fold expression changes in one or more of the 24 ratios.

Differentially Expressed Genes

To identify the differentially expressed genes, we applied the analysis of variance (ANOVA)30 package provided by the Jackson Laboratory (http://www.jax.org/research/churchill/software/anova/index.html).

We used the dye-swap design to assess the differential expression between HC11 and HC11-CSF1R. Similarly, we compared the differential expression of HC11-CSF1R-721 and HC11-CSF1R-807 with HC11-CSF1R. In the terminology of Kerr et al,30 the four cell lines in our experiments are referred to as ‘varieties’. The ANOVA model partitions the variation in gene expression over genes and varieties into a number of effects, attempting to isolate the effects of interest while appropriately adjusting for other systematic effects. For example, the model includes an average effect for each gene and an average effect for each variety. The model also includes gene–variety interactions, which are the parameters of primary interest. These gene–variety interactions model the different effects of the varieties for each individual gene. Other effects, such as systematic differences between the dyes, were accounted for in other parameters in the ANOVA. We used 20 000 iterations of the bootstrap to calculate confidence intervals for the gene–variety interactions. This use of the bootstrap provides results that are more robust than t-tests and t confidence intervals, when error distributions are skewed or long-tailed or otherwise non-normal.30 Our criterion for selecting genes that were upregulated (or downregulated) relative to HC11-CSF1R was that both end points of their 99% confidence intervals be positive (or negative, respectively). We thus generated three lists of genes that are differentially expressed with respect to HC11-CSF1R. The eight ratios per sample pair for each gene were then combined to form a single estimate of the true ratio.

Microarray Data Analysis

In order to verify that the 24 sets of ratios are separable into three distinctive groups despite the sample similarity, we visualized their distribution in a reduced two-dimensional space. For dimension reduction, we used the method of principal components to project the data onto a lower dimensional space spanned by the principal component direction vectors, which are the linear combinations of the expression ratios of the various genes that capture most of the variance of the complete data.

This projection is based on singular value decomposition, coupled with a preprocessing normalization step, as described in recent work on simultaneous clustering of genes and samples.31 We selected a particular normalization procedure involving a repeated row and column normalization of the matrix representing the data (whose rows correspond to genes and columns correspond to the 24 ratios mentioned above). This two-way rescaling was designed to factor out overall gene and sample effects. In this case, we projected the data onto the second and third principal components because the normalization procedure causes the first principal component to be uninformative. An alternative normalization procedure that leads to similar results and is suitable for extracting a gene–sample interaction matrix is obtained by taking the log transformation of the data matrix followed by the subtraction of the row and column means and the addition of the overall mean.

Northern Blot Analysis for Confirmation of Gene Expression

Northern blot analysis was performed as described previously.19 Briefly, total RNA (10 μg) was electrophoresed on a 1% agarose/0.4 M formaldehyde gel, and was blotted onto a nylon membrane. Ethidium bromide staining of RNA was used as a loading control. Filters were prehybridized for 1 h in a PerfectHyb Plus hybridization buffer (Sigma-Aldrich Co, St Louis, MO, USA) at 68°C with salmon sperm DNA (Sigma). Hybridization was performed with the same hybridization buffer and 32P-labeled DNA probes (ResGen, Carlsbad, CA, USA) at 68°C for 16 h. Signals on the washed filter were visualized by Kodak XAR-5 film autoradiography.

Cell Line Tissue Microarray Construction and Immunohistostaining

SKBR-3, BT-20, BT-549, BT-474, MDA-MB-468, and MDA-MB-435S cell lines were purchased from the American Type Culture Collection (Manassas, VA, USA), and grown per ATCC recommendations (http://www.atcc.org/). Tissue microarrays containing the cell blocks were made as described elsewhere.32 In all, 10 cores measuring 0.6 mm per cell line were obtained and spaced 0.8 mm apart on the tissue microarrays. Immunohistostaining of the tissue arrays was performed for the products of a subset of genes for which there were commercially available antibodies. For mouse and rabbit primary antibodies, we used a standard protocol established in the Department of Pathology.33 The protocol was modified for goat primary antibodies by using a biotinylated secondary anti-goat IgG (Vector Laboratories, Inc., Burlingame, CA, USA) followed by washing with phosphate-buffered saline and incubation for 30 min with ABC complex (Vector Laboratories, Inc.). The rest of the staining procedure was as described.33 Slides were stained for CSF-1R (mouse anti-human IgG at 1:2000, Santa Cruz Biotechnology, Santa Cruz, CA, USA), secretory leukoprotease inhibitor (goat anti-human IgG at 1:500, R&D Systems, Inc., Minneapolis, MN, USA), MAP kinase phosphatase-1 (rabbit anti-human IgG at 1:100, Santa Cruz Biotechnology, Santa Cruz, CA, USA), tumor-associated calcium signal transducer-2 Trop 2 (mouse anti-human IgG at 1:20, BD Biosciences, San Diego, CA, USA), ceruloplasmin (goat anti-human IgG at 1:10 000, Bethyl Laboratories, Inc., Montgomery, TX, USA), chaperonin 10 (rabbit anti-human IgG at 1:10 000, Calbiochem, San Diego, CA, USA), and tissue inhibitor of metalloproteinase 2 (rabbit anti-human IgG, 1:100, Santa Cruz Biotechnologies, Santa Cruz, CA, USA). A cell line tissue microarray was also stained for Her2/neu (rabbit anti-human IgG at 1:300, DAKO Corporation, Carpinteria, CA, USA) to serve as positive and negative controls, utilizing the published data on Her2/neu expression in these cell lines.

Results

We studied gene expression profiles in our four cell lines: the parental cell line HC11 (A), HC11 transfected with CSF1R (B), HC11 transfected with CSF1R at the 721 phosphorylation site (C), and HC11 transfected with CSF1R mutated at the 807 phosphorylation site (D), using a 4.6 K mouse cDNA microarray. The reference sample, to which the other three cell lines were compared, was HC11-CSF1R. The index samples, HC11, HC11-CSF1R-721, and HC11-CSF1R-807, were each hybridized four times on arrays with HC11-CSF1R. Each array contains duplicate spots of cDNAs representing the same gene, thus the four arrays per sample pair produced eight data points per ratio per gene, resulting in a total of 24 ratios per gene from the three reference samples.

After preprocessing the data as described above, we projected the gene expression profiles of the eight data sets for the three index samples relative to the reference sample onto a pair of leading gene principal components. As demonstrated in the scatter plot shown in Figure 1, the different samples are clearly separable. Separation of these samples was also seen using our spectral bidirectional clustering method.31

Figure 1
figure 1

Projection of the samples onto a subspace spanned by two principal components. Clustering of samples is evident, even though the identities of the samples were not used in performing the projection. The data matrix was preprocessed by an iterative procedure of rescaling rows and columns, leading to a normalized matrix having all row sums equal and all column sums equal. Using the normalized matrix, projections of the samples onto the second and third principal components for expression ratios of HC11(A) to HC11-CSF1R (B) are shown by red dots, ratios of HC11-CSF1R-721(C) to HC11-CSF1R by blue dots, and ratios of HC11-CSF1R-807(D) to HC11-CSF1R by green dots.

Table 1 includes genes that consistently revealed differential expression of 1.8-fold or more between one or more of the index cell lines and HC11-CSF1R, with a confidence interval of 99%. The values shown are based on the gene–variety interactions estimated by the ANOVA. The numbers represent the log ratio (base 2) of the index sample to HC11-CSF1R. The genes have been sorted by Gene Ontology Molecular Function groups (http://www.geneontology.org).

Table 1 Differentially expressed genes

Table 2 includes some of the genes that consistently revealed differential expression between the parental cell line HC11 and HC11-CSF1R, with a confidence interval of 99%. Likewise, Table 3 lists genes that were differentially expressed when CSF1R was mutated at the 721 phosphorylation site compared with HC11-CSF1R. Differential expression of these genes correlates with the loss of ability for anchorage-independent growth and formation of metastases when injected intravenously into mice. Table 4 includes genes that consistently revealed differential expression with a confidence interval or 99% when comparing HC11-CSF1R-807 with HC11-CSF1R. Differential expression of these genes correlates with the loss of ability for in vitro invasion.

Table 2 Genes that are differentially expressed with stable transfection of HC11 with CSF1R
Table 3 Genes that are differentially expressed with mutation of CSF1R at the 721 phosphorylation site
Table 4 Genes that are differentially expressed with mutation of CSF1R at the 807 phosphorylation site

Genes that had at least a two-fold difference in the expression based on the microarray analysis, and for which cDNA probes were readily available, were chosen for confirmatory Northern blot analysis. These included chaperonin 10, procollagen type IV alpha, MAP kinase phosphatase, prenylated snare protein, secretory leukoprotease inhibitor, Trop 2, WDNM1, and ceruloplasmin as shown in Figure 2. RNA levels in the Northern blots were consistent with the microarray data.

Figure 2
figure 2

Northern blot analysis of gene expression of selected genes. Total RNA (10 μg) was fractionated on 1% agarose–formaldehyde gels, blotted onto nylon membrane, and hybridized with the probes indicated. The bottom panel represents the 18S and 28S RNA in the ethidium bromide-stained agarose gel.

Immunohistostaining was performed for a subset of genes using a tissue microarray containing nonisogenic human breast cancer cell lines. The staining patterns are summarized in Table 5. Examples of negative, weak and strong staining are shown in Figure 3.

Table 5 Immunhistochemical staining of a subset of differentially expressed genes using a human cell line tissue microarray
Figure 3
figure 3

Examples of immunohistostaining of the human cell line tissue micorarray. Immunhistostaining of SKBR-3 was weakly positive for CSF1R (a) and negative for MAP kinase phosphatase-1 (b). Immunohistostaining of BT-20 was strongly positive for CSF1R (c) and weakly positive for MAP kinase phosphatase-1 (d).

Discussion

In this communication, we present data on changes in gene expression observed after mouse HC11 mammary epithelial cells were transfected with either a wild-type CSF1R gene or one of two tyr to phe point mutated versions, which interrupted specific downstream signal transduction pathways. Microarray analysis was employed to quantitate the differences in gene expression among the four cell lines. The gene expression changes presented above were consistent across numerous repetitions, using cells harvested at different times. Moreover, the results were consistent with the Northern blot analysis performed on a subset of genes. As detailed below, some of the changes in gene expression were as expected, based on information about these genes in the literature (eg MAP kinase phosphatase, tissue inhibitor of metalloproteinase 2), and other genes are ‘novel’ in terms of their involvement in tumor invasion and metastasis.

As a first step in the analysis of the changes in gene expression profiles in our cell lines, we performed a clustering analysis. We were successful in clustering the samples in an unsupervised fashion, which succeeded despite sample similarity (all cell lines having originated from the same parental cell line and having been transfected with a single gene or one of two point mutated variants).

The microarrays used in this study contained probes for 4600 genes representing a broad spectrum of cellular functions, as well as numerous expressed sequence tags. Other investigators have used varying sizes of microarrays to analyze gene expression profiles in breast cancer specimens, some with hundreds of genes and some with thousands.34, 35, 36, 37, 38 Comparisons between our data set and other published data sets are rather difficult to make, because the different experimental models studied have been analyzed on arrays that include different gene sets and different species. Moreover, most publications discuss only the subset of genes that were differentially expressed, and do not focus on genes that might have been included on the array, but were not differentially expressed. Some of the differentially expressed genes in our gene set have not previously been associated with aggressive malignant behavior. Many of the other breast cancer cell line microarray analyses published in the literature make comparisons between completely different cell lines,35, 39, 40, 41, 42 which is problematic both methodologically and fundamentally, since many of the differences between cell lines may represent differences that are unrelated to invasiveness or tumorigenesis. We compare a spontaneously immortalized cell line with the same cell line after stable transfection with a single oncogene (creating an autocrine system since the parent cell produces the ligand), which alters its invasive and tumorigenic properties. Although HC11 is a mouse cell line, it has been used in numerous model systems to study invasion, tumorigenicity, and anticancer drug testing related to genes known to be involved in human tumor progression such as p53 mutation and ErbB2 transfection.43, 44

We sorted the genes by Gene Ontology functional classes (Table 1). Many genes encode for catalytic proteins. Other classes include smaller numbers of genes, and many genes and ESTs do not have assigned Gene Ontology classes.

For some genes, our study confirms data found in the literature. For example, WDNM 1 (also known as extracellular proteinase inhibitor) is a gene involved in breast involution. It has been shown45 to be downregulated in a metastatic breast phenotype, while being upregulated in a nonmetastatic phenotype. Others have described WDNM1 as a metastasis suppressor gene.46 The expression of WNDM1 has recently been shown to be related to p53 dysregulation in breast cell lines.47 Our results reveal the downregulation of WDNM1 in HC11 transfected with CSF1R, as well as in HC11 transfected with CSF1R mutated at phosphorylation site 807. However, it was not downregulated in the cell line transfected with the 721 mutation. This suggests that phosphorylation of the 807 site is necessary for the downregulation of WDNM1, and that the downregulation of WDNM1 might be related to tumorigenicity.

The MAP kinase phosphatase-1 expression level was significantly lower in HC11-CSF1R than in HC11. It was not expressed in SKBR-3 cells, and weakly expressed in BT-20 cells. CSF1R activates the RAS, RAF, and MEK pathways, activating MAP kinase.22 MAP kinase phosphatase-1 is a downregulator of MAP kinase, and its levels are suppressed when MAP kinase is activated. HC11-CSF1R-807 also had a higher expression level of this gene than HC11-CSF1R, suggesting that phosphorylation of tyr-807 is necessary for the activation of MAP kinase. The downregulation of MAP kinase phosphatase-1 in melanocytes by a receptor tyrosine kinase results in suppressed differentiation and a transformed phenotype.48 Our results suggest that MAP kinase phosphatase-1 downregulation might be involved in CSF-1R transformation of the HC11 cell line, with the results suggesting that tyr-807 phosphorylation is necessary for this process.

Ceruloplasmin was downregulated in the two cell lines transfected with the mutated CSF1R relative to the wild-type HC11-CSF1R. It is a copper transport protein and an acute phase reactant that has been shown to be upregulated in human breast cancer cell lines,49 and in human ovarian cancer specimens.50 Moreover, ceruloplasmin levels are increased in the serum of patients with breast cancer.51 It has been proposed that ceruloplasmin transports iron into malignant cells.50 Ceruloplasmin was expressed in most of the human cell lines that we stained. The fact that it is downregulated with mutations of both phosphorylation sites suggests that its downregulation might be mediated through either the PI3 kinase pathway or the MAP kinase pathway or other pathways related to both sites.

Trop 2 (also known as tumor-associated calcium signal transducer) is a calcium signal transducer expressed at high levels by numerous human cancers.52 Trop 2 is upregulated in HC11-CSF1R relative to the other cell lines, with a higher level in HC11-CSF1R-721 than in the phenotypes that lack capacity for in vitro invasion (Figure 2). The expression pattern of Trop 2 in most of the human cell lines studied mimicked that of CSF1R (Table 5). The specific role for Trop 2 in breast cancer invasion and metastasis is not known; however, it is a cell surface receptor with a cytoplasmic domain that appears to play a regulatory role in the growth of carcinoma cells.53 Our data suggest a role for Trop 2 in tumor invasion.

Tissue inhibitor of metalloproteinase 2 (TIMP 2) was higher in HC11 than in HC11-CSF1R, and was the same in the two cell lines with the mutated CSF1R as in HC11-CSF1R. Previous studies by Sapi et al19 have shown that TIMP 2 inhibits in vitro invasion of HC11-CSF1R. Hence, low levels of TIMP 2 are necessary for HC11-CSF1R to maintain its invasive capabilities. TIMP 2 has also been shown to inhibit the development of mammary tumors in mice,54 consistent with our findings.

Secretory leukoprotease inhibitor is a gene with no known role in invasiveness of breast cancer, and warrants further evaluation. The mRNA level of this gene is higher in HC11-CSF1R than in the other cell lines. The protein product of this gene was expressed in most of the invasive human cell lines studied by immunohistostaining. Secretory leukoprotease inhibitor has inhibitory activity against chymotrypsin, trypsin, neutrophil elastase, and cathepsin G,55 and has been shown in monocytes to suppress the induction of matrix metalloproteinases.56 High expression levels of secretory leukoprotease inhibitor have been found in human ovarian carcinoma specimens.50

Procollagen type IV alpha is a precursor to collagen type IV alpha, a component of the basement membrane in mammary glands. Its level was higher in HC11 than in HC11-CSF1R, and the mutation of CSF1R did not change its expression. Previously, we have shown that urokinase expression was upregulated by CSF1R expression in HC11 cell lines.19 Urokinase has a major role in the collagenase pathway, therefore our recent findings confirm the involvement of CSF1R in collagen regulation.

Chaperonin 10 (also known as heat shock 10 kDa protein 1) is upregulated in HC11-CSF1R relative to HC11, and is somewhat lower in the cell lines with the mutated CSF1R than in HC11-CSF1R. All of the invasive human breast cell lines studied by immunohistostaining were positive for chaperonin 10. It is involved in mitochondrial protein folding.57 Its extracellular homologue, early pregnancy factor, is associated with cell growth and division.58 CSF1R is also upregulated in normal placental trophoblast epithelium.19 However, the specific relationship between chaperonin 10 and CSF1R is unknown, and no role for chaperonin 10 in metastasis and invasion was found in the literature.

Based on these data, we have initiated studies to examine the specific roles of some of these genes in invasion and metastasis. We will use siRNA technology to inhibit genes that were upregulated in the transfectants relative to the parental cell line to study whether inhibition of these genes abolishes invasive capability of these cells. Likewise, we will study whether inhibition by siRNA of genes that were upregulated in HC11-CSF1R and HC11-CSF1R-807 relative to the other cell lines abolishes the ability for anchorage-independent growth and in vivo tumorigenicity. These data serve as the initial step towards understanding the role of CSF-1R, its phosphorylation sites and its downstream genes in breast cancer, and the roles of these genes need to be confirmed in human tumors.

In summary, cDNA microarray analysis of our cell line model reveals a set of genes involved in in vitro invasion and in in vivo tumorigenesis in mice. The reduction of our data into a two-dimensional space shows distinctive clusters of the different samples, despite their similarity. This suggests biological significance of our differential gene expression. Our data confirm some findings in the literature with regard to genes known to be involved in the invasive process. We have also identified novel genes, which have not previously been associated with malignant invasion and metastasis. cDNA microarray analysis of a well-controlled model system involving a highly relevant oncogene is a useful screening tool for identifying novel differentially expressed genes. These genes can provide a basis for further studies of metastatic progression and local invasiveness in human tumors, and can be evaluated as therapeutic targets.