The landscape of chromosomal aberrations in breast cancer mouse models reveals driver-specific routes to tumorigenesis

Ben-David, Uri; Ha, Gavin; Khadka, Prasidda; Jin, Xin; Wong, Bang; Franke, Lude; Golub, Todd R.

doi:10.1038/ncomms12160

Download PDF

Article
Open access
Published: 04 July 2016

The landscape of chromosomal aberrations in breast cancer mouse models reveals driver-specific routes to tumorigenesis

Uri Ben-David¹,
Gavin Ha^1,2,
Prasidda Khadka¹,
Xin Jin¹,
Bang Wong¹,
Lude Franke ORCID: orcid.org/0000-0002-5159-8802³ &
…
Todd R. Golub^1,2,4,5

Nature Communications volume 7, Article number: 12160 (2016) Cite this article

7564 Accesses
33 Citations
9 Altmetric
Metrics details

Subjects

Abstract

Aneuploidy and copy-number alterations (CNAs) are a hallmark of human cancer. Although genetically engineered mouse models (GEMMs) are commonly used to model human cancer, their chromosomal landscapes remain underexplored. Here we use gene expression profiles to infer CNAs in 3,108 samples from 45 mouse models, providing the first comprehensive catalogue of chromosomal aberrations in cancer GEMMs. Mining this resource, we find that most chromosomal aberrations accumulate late during breast tumorigenesis, and observe marked differences in CNA prevalence between mouse mammary tumours initiated with distinct drivers. Some aberrations are recurrent and unique to specific GEMMs, suggesting distinct driver-dependent routes to tumorigenesis. Synteny-based comparison of mouse and human tumours narrows critical regions in CNAs, thereby identifying candidate driver genes. We experimentally validate that loss of Stratifin (SFN) promotes HER2-induced tumorigenesis in human cells. These results demonstrate the power of GEMM CNA analysis to inform the pathogenesis of human cancer.

Integrated analyses of murine breast cancer models reveal critical parallels with human disease

Article Open access 22 July 2019

Jonathan P. Rennhack, Briana To, … Eran R. Andrechek

Single allele loss-of-function mutations select and sculpt conditional cooperative networks in breast cancer

Article Open access 02 September 2021

Nathan F. Schachter, Jessica R. Adams, … Sean E. Egan

Analysis pipelines for cancer genome sequencing in mice

Article 06 January 2020

Sebastian Lange, Thomas Engleitner, … Roland Rad

Introduction

The understanding of cancer biology has benefitted tremendously from large-scale analyses of genomic data. Resources of comprehensive molecular characterizations of human tumours, best illustrated by The Cancer Genome Atlas (TCGA), have become indispensable for contemporary cancer research¹. However, the utility of such data is limited by the extensive genetic diversity of the human population and by the complexity of late-stage tumours that harbour true driver events buried in a majority of passenger alterations. The study of aneuploidy and large copy-number alterations (CNAs), affecting on average ∼25% of the tumour genome^2,3, is particularly challenging. As these CNAs often encompass hundreds of genes, it is difficult to distinguish driver from passenger genes within such aberrations. It is equally challenging to associate tumour-initiating events (for example, point mutations) with unique CNAs that cooperate with them on tumorigenesis.

In principle, genetically engineered mouse models (GEMMs) provide a strategy to overcome the limitations of human genomic data. GEMMs have been successfully used to dissect cellular and molecular aspects of tumorigenesis, to identify and validate candidate cancer genes, and to test new therapeutic approaches^4,5. Mouse copy-number data at large scale could therefore facilitate the study of multiple aspects of tumour biology. However, the landscape of chromosomal aberrations in GEMMs has been underexplored, even in breast cancer, for which GEMMs have been generated and studied extensively^4,5,6,7,8,9. We therefore set out to generate a comprehensive catalogue of chromosomal aberrations in breast cancer GEMMs, and to mine this resource to address multiple aspects of tumour development. We find that CNA prevalence, as well as the recurrence of specific events, are largely determined by the initiating perturbations. Building on this finding, we compare context-specific recurrent events between mouse models and human patients, and identify candidate co-driver genes. We experimentally validate the relevance of one such gene, Stratifin (SFN), to human HER2-induced tumorigenesis.

Results

Gene expression profiles reveal CNAs in breast cancer GEMMs

As copy-number data from breast cancer GEMMs are scarce, whereas genome-wide gene expression profiles from these GEMMs are abundant^10,11,12, we first asked whether we could infer CNAs from their coordinated gene expression biases^{13,14,15,16,17}. To examine this possibility, we modified the e-karyotyping method¹⁵ to analyse the mammary tumour gene expression data. For affymetrix microarray platforms, we also applied the functional genomic messenger RNA (mRNA) profiling (FGMP) method¹⁷. CNAs were estimated by analysing the differences in gene expression between normal mammary tissues and tumour samples (Methods). Analysis of 567 normal tissue samples led to 100% being accurately identified as diploid, suggesting a very low false detection rate of chromosomal aberrations (FDR<0.008, 95% CI). Furthermore, a comparison of expression-inferred CNAs to those estimated by comparative genomic hybridization (CGH) arrays from matched tumours, showed high concordance between these platforms: 26 out of 27 (96.3%) large (>5 Mb) CNAs identified by the RNA expression data were confirmed by DNA data (Supplementary Fig. 1). Therefore, gene expression-based analyses can capture the landscape of aneuploidy and large CNAs in tumours from breast cancer GEMMs, at approximately cytoband resolution (Fig. 1a).

**Figure 1: Analysing aneuploidy and large CNAs in breast cancer GEMMs using gene expression profiles.**

Having validated the methodology, we were able to map the landscape of aneuploidy and large chromosomal aberrations in breast cancer GEMMs. For this aim, we collected and analysed gene expression profiles of 2,697 samples from 36 unique breast cancer mouse models: 567 normal tissue samples, 100 premalignant mammary tissues/lesions, 1,910 primary mammary tumours, 17 breast cancer metastases, and 103 breast cancer cell lines and cell line-derived tumours (Fig. 1b). These data were collected from 87 studies, across multiple experimental platforms, genetic backgrounds and transgene delivery methods, representing all major breast cancer GEMMs generated to date (Supplementary Data 1–6). The availability of this large-scale CNA resource allowed us to address a number of fundamental questions in cancer biology, as described below.

CNAs arise late in breast cancer tumorigenesis

We first explored the time course of CNA acquisition during breast cancer tumorigenesis in GEMMs. To address the fundamental, yet unanswered question of when CNAs arise, we analysed multiple studies for which data were available from distinct stages of mammary tumour development, including normal mammary tissues, premalignant lesions, ductal carcinomas in situ and invasive carcinomas. In SV40Tag-induced tumours, chromosomal aberrations were rarely detected in hyperplasias or ductal carcinomas in situ, but commonly found at the invasive carcinoma stage (Fig. 2a,b; Supplementary Fig. 2a). These findings suggested that chromosomal aberrations accumulated late during tumorigenesis. This observation was confirmed in five other GEMMs from which premalignant samples were available (Supplementary Fig. 2b; Supplementary Data 4). Therefore, aneuploidy and large CNAs are preferentially acquired, or become clonally dominant, during the progression of non-invasive lesions to invasive carcinomas, in line with recent findings from lung¹⁸ and skin^19,20 cancer mouse models. In accordance, we observed a few instances, in which a genomic region did not meet our strict cutoff for CNA detection at early stages of tumorigenesis, but careful examination suggested that an aberration already existed at these time points in a subpopulation of cells, but was clonally selected only at the final stages of the tumour development (for example, loss of chromosome 2 in Supplementary Fig. 2).

**Figure 2: Chromosomal aberrations are a late event in breast cancer tumorigenesis and further aberrations are acquired during the derivation of cell lines.**

We also examined aneuploidy and CNAs in metastases from Polyoma Middle T (PymT) and allografted tumours to determine whether additional chromosomal aberrations were required for the development of metastases (Supplementary Data 5). We did not detect an increased burden of chromosomal aberrations in these samples (Fig. 2c; Supplementary Fig. 2c), suggesting that further acquisition of such aberrations is not required for the metastatic phenotype.

Cancer cell lines harbour more CNAs than primary tumours

As cancer cell lines are commonly used in the breast cancer research, it is important to assess the degree to which their genomic landscape faithfully represents that of primary tumours. Analysis of 103 samples from cell lines and cell line-derived tumours revealed that mouse breast cancer cell lines, as well as tumours generated following their transplantation, harbour many more CNAs compared with primary tumours (98% of the cell lines are aneuploid, compared with 29% of the tumours; Fig. 2d). Freshly derived cell lines are more than nine times more likely than their parental tumours to harbour chromosomal aberrations (Supplementary Fig. 2d), suggesting that cell line derivation is associated with the acquisition or selection of CNAs. Of note, distinct chromosomal aberrations are often detected in samples of the same established cell line (Supplementary Data 6), suggesting that additional chromosomal aberrations commonly arise during culture propagation. Similar to our findings in GEMMs, we found significantly more CNAs in human breast cancer cell lines, compared with human primary breast tumours of the respective subtype (Fig. 2e; Supplementary Fig. 3), in line with a previous finding with a much smaller data set²¹. Taken together, our analysis of breast cancer GEMMs reveals that the major wave of chromosomal aberrations occurs during the progression of a premalignant tissue to an invasive carcinoma; and that the prevalence of chromosomal aberrations in cell lines is much higher than in tumours (Supplementary Fig. 4). We therefore focused further analyses on the primary tumour samples.

CNA prevalence is determined by the initial perturbation

A key question in cancer biology is whether particular initiating oncogenic events determine the eventual CNA landscape of the tumour. This question is particularly well suited to mouse models, where genetic background can be controlled, tumours can be generated by manipulating a single gene and the initiating event is known a priori. We therefore measured CNA prevalence in the 11 most common breast cancer GEMMs, and used it as an index of their degree of genomic instability, commonly referred to as DGI¹⁷. We found that DGI markedly differed between breast cancer GEMMs (P<10⁻¹⁶, χ²-test of independence), with the prevalence of large CNAs ranging from ∼4% to ∼80% of the tumours (>17-fold difference, for the PyMT and p53^−/− models, respectively; Fig. 3a). The three most unstable models (p53^−/−, SV40Tag and Brca1^−/−) often harboured multiple chromosomal aberrations per tumour. In contrast, the most stable models (PyMT, Wnt/βcat, Pik3ca_mut and Her2/Neu) primarily gave rise to diploid tumours, and almost never developed tumours with more than one CNA (Supplementary Fig. 5). The other four models (c-Myc, Pten^−/−, Etv6–Ntrk3 and Met) showed intermediate prevalence of chromosomal aberrations. As expected, not only was p53^−/− the least stable model, but p53 status (^+/+ or ^+/−) was also a predictor of genomic instability across various models (Supplementary Fig. 6a). We assessed DGI in two additional ways: by computing ‘autocorrelation values’ to determine the instability from correlated expression of neighbouring genes¹⁷, and by counting the number of CNA-encompassed genes (Methods). These analyses corroborated the significant DGI differences between the various mouse models (Fig. 3b; Supplementary Fig. 6b,c; Supplementary Table 1).

**Figure 3: Driver-specific degree of genomic instability in breast cancer GEMMs.**

The availability of large-scale CNA data allowed us to revisit questions that had been previously addressed at smaller scale. For example, a recent study of 82 mice from three models of lung cancer reported a significantly higher level of aneuploidy and CNAs in GEMM tumours compared with chemically induced tumours, leading to the conclusion that genetically engineered and carcinogen-induced models develop tumours through different routes¹⁸. We re-addressed this question in 1,910 mouse mammary tumours from which the CNA data were inferred. In contrast to the reported result, we found that mammary tumours induced by the strong carcinogen 7,12-dimethylbenzanthracene fell well within the DGI spectrum of the genetic models. In fact, some genetic models exhibited even fewer chromosomal aberrations than the carcinogen-induced model (Supplementary Fig. 6d), arguing against a dichotomous distinction between chemical and genetic tumorigenesis routes.

Our large-scale data set also allowed us to dissect several variables that could affect driver-specific DGI. For example, as each GEMM has its typical tumour latency²², the DGI of each model might merely reflect the time it takes from transgene activation to tumour development. However, while DGI and average tumour latency correlated well (R²=0.6) across models (Fig. 3c), tumour latency had no effect on DGI within models (Fig. 3d). Similarly, DGI was not associated with mouse genetic background (Supplementary Fig. 6e,f), or with the method used for genetic perturbation (that is, the promoter used for transgene activation or excision; Fig. 3d). Therefore, DGI is intrinsic to the introduced perturbation, and is consistent within each model across genetic backgrounds, tumour latencies and activating promoters—an observation nearly impossible to make in human tumours given the diversity of genetic backgrounds and the diversity of inciting oncogenes. In contrast, we found DGI to be associated with tumour histological subtypes, in cases where the same transgene could give rise to histologically distinct tumours: a statistically significant DGI difference exists between subtypes of tumours induced by Myc^10,23 (P=0.017, χ²-test of independence; Fig. 3e), and between subtypes of tumours induced by mutated Pik3ca²⁴ (P=0.02, Fisher’s exact test; Fig. 3f). We conclude that distinct tumour subtypes, generated by activating the same transgene, differ in their tendency to acquire CNAs.

Specific drivers are associated with unique recurrent CNAs

We next asked whether the large-scale CNA data would enable us to identify statistically significant recurrent CNAs. Associating specific CNAs with specific tumour-initiating events could have far-reaching implications for understanding oncogenesis, with potential impact on targeted therapies. We therefore asked whether breast cancer GEMMs differ in their patterns of recurrent events. Indeed, we found distinct landscapes of chromosomal aberrations across models (Fig. 4a; Supplementary Fig. 7). To determine the recurrent CNAs in each model, we combined absolute and relative criteria: aberrations were determined as recurrent if present in at least 10% of the tumours, or if statistically significant in a binomial test (Bonferroni corrected P<0.05; Methods). Thirty five recurrent events were identified in the 11 common GEMMs, and 34 of them were confirmed to be significant by GISTIC2.0 analysis (Supplementary Fig. 8; Supplementary Table 2). To distinguish between cross-model and model-specific recurrent aberrations, we applied a χ²-test of independence, thus identifying 15 unique, GEMM-specific CNAs (Fig. 4a; Supplementary Table 2). These analyses revealed that each GEMM is associated with a characteristic chromosomal landscape, suggesting that these CNAs are not simply passenger events, but rather play a functional role in promoting tumorigenesis.

**Figure 4: The landscapes of aneuploidy and large CNAs in breast cancer GEMMs reveal driver-specific recurrent events.**

To examine whether recurrent driver-specific CNAs are also tissue dependent, we took advantage of common oncogenes that can induce cancer in multiple tissues. Specifically, we analysed 319 Myc-induced lymphoma and prostate tumours (Supplementary Data 7), as well as 92 SV40Tag-induced prostate tumours (Supplementary Data 8). With both transgenes, some recurrent CNAs were observed only in a particular cancer type, whereas others recurred across multiple cancers (Fig. 4a–c), suggesting that a subset of driver-specific CNAs cooperate with the initial driver independently of the targeted tissue. Interestingly, the lymphoma and prostate data also recapitulated our findings in breast cancer that aneuploidy occurs late during cancer progression, and that DGI is inherent to the driver gene (Supplementary Fig. 9).

Cross-species analysis identifies candidate co-driver genes

As genes important for tumorigenesis are likely to reside within recurrent CNAs, we next asked whether integrated analysis of CNAs and gene expression could uncover such driver genes. To identify oncogenes or tumour suppressor genes that promote tumorigenesis across models, we compared recurrent events shared by multiple GEMMs. Amplification of 11qE1–E2 is a recurrent event in three of the GEMMs: PyMT, Met and Brca1^−/− (Fig. 4a). We therefore searched for genes that reside within this region and that are significantly overexpressed in each of these models¹⁰. The anti-apoptotic gene Survivin (Birc5) was the only overlapping gene between the three GEMMs (Supplementary Fig. 10a), suggesting its potential involvement in breast cancer tumorigenesis. Interestingly, we found that three additional GEMMs (SV40Tag, Myc and Her2/Neu) in which 11qE1–E2 amplification was not recurrent, also significantly overexpressed Birc5 (ref. 10), suggesting that its expression may be dysregulated through multiple mechanisms.

In line with a driving role for BIRC5 in human mammary tumorigenesis, this gene is commonly amplified in human invasive ductal breast carcinomas²⁵, in human invasive lobular breast carcinomas²⁶ and in human breast cancer xenografts²⁷ (Supplementary Fig. 10b). Moreover, we found high expression of BIRC5 to be associated with worse clinical outcome in human breast cancer patients (Supplementary Fig. 10c), consistent with previous analyses of much smaller cohorts^28,29. Lastly, knockdown of BIRC5-induced apoptosis and/or reduced colony formation capacity of breast cancer cell lines of the basal³⁰, HER2³¹ and luminal³² subtypes, further supporting its subtype-independent oncogenic role in breast cancer.

To identify genes that promote tumorigenesis in a particular genomic context, we performed an integrated cross-species analysis. Model-specific CNAs may be driven by genes that cooperate, or interfere, with the initial driver event, and may be important for human tumorigenesis in the same genetic context. We therefore sought to take advantage of the incomplete synteny between the mouse and human genomes to narrow critical regions of interest. We compared the recurrent aberrations identified in GEMMs to those that characterize human breast cancers with activation of the same pathway³³ (Methods). This comparison identified several syntenic recurrent events, enabling a focus on substantially smaller regions within large CNAs in both species (Fig. 4d; Supplementary Table 3). For example, we identified monosomy 4 as a recurrent event in the Her2/Neu GEMM. Mouse chromosome 4 is syntenic to four human chromosomes; of these, only chromosome 1p is commonly deleted in human tumours with a HER2 amplification gene expression signature³³. This approach led to a considerable narrowing of the critical region of deletion (60 and 45% reduction for mouse and human chromosomes, respectively; Fig. 4d). Focusing on this syntenic region, we next compiled a list of orthologous genes that reside within it and are downregulated in Her2/HER2-induced tumours (Supplementary Data 9; Methods). These candidate genes, together with HER2 itself, were then subjected to unbiased gene network analyses using the GeNets platform (Methods), which identified SFN as a strong candidate gene to cooperate with HER2 during tumorigenesis (Supplementary Fig. 11a).

Loss of SFN promotes human HER2-induced tumorigenesis

SFN (Stratifin, also known as 14-3-3σ) has been described as a putative tumour suppressor involved in cell cycle progression and epithelial polarity³⁴. However, in human breast cancer of the basal subtype, its expression has also been reported to promote invasiveness³⁵, suggesting that its role in tumorigenesis (either oncogene or tumour suppressor) may be contingent on cellular context. To address this, we set out to determine whether deletion of SFN promotes or inhibits tumorigenesis in the human HER2-enriched breast cancer subtype. We found an inverse association between SFN mRNA expression levels and the protein levels of HER2, as well as that of multiple other proteins in the HER2 pathway, both in human breast tumours and in human breast cancer cell lines (Fig. 5a; Supplementary Fig. 11b). Furthermore, low SFN expression levels were associated with the decreased overall survival of breast cancer patients, specifically within the HER2-enriched human subtype (Fig. 5b; Supplementary Fig. 11c), most consistent with a loss-of-function, tumour suppressive role of SFN.

**Figure 5: Downregulation of SFN promotes *HER2*-induced human breast cancer tumorigenesis.**

To functionally validate SFN’s role in human HER2-induced tumorigenesis, we turned to a model of HER2-overexpressing human mammary epithelial cells. Overexpression of HER2 was not sufficient to transform MCF10A cells, unless combined with overexpression of 14-3-3ζ, another member of the 14-3-3 protein family³⁶. Whereas control and HER2-overexpressing MCF10A cells expressed 14-3-3σ, its expression was lost upon overexpression of 14-3-3ζ, so that the transformed cells did not express it at all (Fig. 5c). Importantly, restoring SFN expression in the transformed cells significantly reduced their anchorage-independent colony formation capacity and their in vitro migration and invasion capabilities (Fig. 5d–g). Furthermore, we found that knockdown or knockout of SFN decreased the in vitro tumorigenicity of the basal subtype cell line MDA-MB-231, but had an opposite effect on two cell lines of the HER2-enriched subtype (MDA-MB-453 and EFM-192A) (Supplementary Fig. 12). Taken together, these results suggest that SFN acts as a tumour suppressor gene in the context of HER2-mediated transformation, in line with previous data from the mouse³⁷ and in contrast to its role in non-HER2-driven human mammary tumours³⁵. More broadly, these results delineate a comparative oncogenomics strategy to identify genes that co-drive tumorigenesis in specific genomic contexts (Supplementary Fig. 13). Applying the same strategy to the recurrent CNAs in additional mouse models yielded a list of candidate genes that may underlie each of these aberrations (Supplementary Data 9).

Discussion

GEMMs make a powerful tool for in vivo modelling of human breast cancer. However, as GEMMs do not always recapitulate the progression of the human disease, comprehensive genomic characterizations of these models should inform their proper use in cancer research, and guide the selection of the most suitable GEMMs for addressing a particular biological question. Unlocking the copy-number information hidden in thousands of gene expression profiles allowed us to perform the first comprehensive study of aneuploidy and large CNAs in GEMMs. By systematically mining this novel resource (available as band-level aberration matrices in Supplementary Data 10), we uncovered a complex landscape of chromosomal aberrations in breast cancer GEMMs, indicative of driver-specific genomic routes to tumour development. We used this data set to address several long-standing questions in cancer research, and demonstrated its relevance to the human disease.

Several of our findings are of particular interest: First, we show that CNA prevalence varies extensively across mouse models, depending on the inciting oncogene. Westcott et al.¹⁸ recently concluded, based on the analysis of 82 tumours from three mouse lung cancer models, that there were systematic differences in CNA prevalence between genetically induced and chemically induced models of cancer. Our analysis of 1,910 mammary tumours, the largest ever reported, clearly shows that the variation across GEMMs is similar to that seen between some GEMMs and chemical models. It therefore emphasizes the importance of performing such analyses at the appropriate scale. Second, we demonstrate the feasibility of associating specific inciting oncogenic events with specific aneuploidies, even in mouse models that are otherwise genomically stable. These relationships can thus serve as a basis for discovering the multi-step pathogenesis of cancer. Third, we illustrate how cross-species CNA analyses can tease out driver genes within a large region of amplification or deletion in human tumours. Therefore, our findings demonstrate a novel approach to harness GEMM data to the understanding of human cancer pathogenesis.

Our findings reveal the context-dependent role of SFN (14-3-3σ) in human breast cancer. As 14-3-3 proteins interact with hundreds of binding partners and regulate multiple cellular processes, the molecular underpinnings of this unique behaviour remain to be elucidated. Previous studies showed that upregulation of transforming growth factor beta (TGFβ) is required for HER2-induced transformation of MCF10A cells³⁸. Indeed, overexpression of 14-3-3ζ promotes HER2-induced tumorigenesis by activating the TGFβ pathway³⁶. Here we report that overexpression of 14-3-3ζ also abolishes 14-3-3σ expression. Interestingly, 14-3-3ζ and 14-3-3σ play an opposite role in TGFβ-induced growth inhibition³⁹, and 14-3-3σ was recently found to be a direct target of TGFβ⁴⁰. This raises the intriguing possibility that loss of SFN promotes HER2-induced tumorigenesis through its modulation of the TGFβ pathway.

The same approach that identified SFN as a tumour suppressor in HER2-induced tumorigenesis was also applied to the systematic exploration of recurrent CNAs in other models, yielding a list of candidate genes that may underlie these driver-specific events. Interesting examples are the translation initiation factors and the ribosomal proteins that are co-amplified with Myc in both mouse and human Myc-induced tumours (Supplementary Data 9), which may collectively underlie the recurrence of 8q amplifications in human MYC-induced tumorigenesis; and PDZ binding kinase (Pbk), a gene previously shown to interact with p53 and modulate the expression of its transcriptional targets⁴¹, which is intriguingly deleted in both mouse and human p53-mutant tumours (Supplementary Data 9). The potential context-specific roles of these candidate genes await experimental validation. Importantly, as our approach for prioritizing candidate genes focuses on genes that interact with the driver event or belong to the same pathway (Supplementary Fig. 13), additional candidate genes may be identified by applying complementary approaches that focus on genes from alternative pathways.

In summary, our findings demonstrate the power of large-scale analyses of mouse models to inform the pathogenesis of mouse and human cancer. Further exploration of this resource, as well as its expansion to additional cancer types, should yield further insights into tumour biology.

Methods

Data assembly and processing

Gene expression profiles were obtained from the GEO (Gene Expression Omnibus) (http://www.ncbi.nlm.nih.gov/geo) and EMBL-EBI (European Molecular Biology Laboratory - European Bioinformatics Institute) (http://www.ebi.ac.uk) databases. Accession numbers are provided in Supplementary Data 1. Normalized matrix files were downloaded, and samples were curated manually according to the information available for each of them to identify the tissue type (normal, premalignant, primary tumour, metastasis, cell line or cell line-derived tumour), the tumour-initiating event, the promoter used for transgene activation or perturbation, and the mouse background strain. Arrays were analysed for quality control and the outliers were removed. The final database consisted of 567 normal tissue samples, 100 premalignant lesions, 1,910 invasive carcinomas, 103 cell lines and cell line-derived tumours, and 17 metastases from breast cancer GEMMs, as well as 319 samples and 92 samples from lymphoma and prostate GEMMs, respectively. GEMMs were defined according to the introduced/perturbed gene. The analysis was performed in batches, and normal tissue samples included in each study served as internal controls, whenever available. Data was processed using the R statistical software (http://www.r-project.org/)⁴²: probe sets were organized by their chromosomal location, and the expression values were log2 transformed, if needed. Probe sets without annotated chromosomal location were removed. For genes with multiple probe sets, all the probe sets of the gene were averaged (as well as the chromosomal location) to obtain one intensity value per gene. Next, a threshold expression value was set, and genes with lower expression values were collectively raised to that level: flooring values were 6.5–7 for the Affymetrix and Illumina platforms, and −0.5 for the Agilent platforms. Probe sets not expressed in >20% of the samples within a batch were removed. The 10% of the probe sets with the most variable expression levels were also excluded, to reduce expression noise. Normalized CGH array data were also downloaded from the GEO website, and probe sets were organized by their chromosomal location.

Inference of copy-number alterations

To infer CNAs from coordinated gene expression biases, the protocol developed by Ben-David et al.¹⁵ was applied. In each batch of analysis, the median expression of each gene across all normal tissue samples, or across the entire batch (if normal tissue samples were not available), was subtracted from the expression value of that gene in each sample to obtain a comparative value. The data were processed using a CGH analysis software program, CGH-Explorer (http://heim.ifi.uio.no/bioinf/Projects/CGHExplorer/)⁴³. Gene expression regional biases were detected using the program's piecewise constant fit (PCF) algorithm, with the following set of parameters: least allowed deviation=0.15–0.4; least allowed aberration size=50–80; winsorize at quantile=0.001; penalty=12–18; and threshold=0.01. Moving average plots were generated with the moving average fit tool, with a window size of 200 genes. The DGI was subsequently determined for each sample based on the PCF results: either by counting the number of discrete aberrations within each sample (CNA prevalence-based DGI), or by counting the number of altered genes within each sample and dividing it by the total number of genes (gene-based DGI).

Functional genomic mRNA profiling

For Affymetrix microarray platforms, mouse genome 430A, 430A 2.0 and 430 2.0 (which correspond to 53 of the 83 studies analysed), the FGMP method, proposed by Fehrmann et al.¹⁷ was also used. This procedure first estimates a set of transcriptional components that explain the majority of gene expression variation using a set of non-cancer samples. Upon correcting the gene expression data of cancer samples for these transcriptional components, the residual gene expression data strongly correlates with the copy number. We applied this approach to the mouse data, and corrected the mouse gene expression data for the first 25 principal components (PCs) that had been identified in a heterogeneous set of 17,081 mouse samples. The corrected data was then subjected to the same processing steps and CGH-PCF analysis described above to detect CNAs. The DGI was subsequently determined for each sample, by first sorting the 19,115 probe sets present in each of the three analysed Affymetrix platforms according to their genomic position, and then calculating (using a lag of 10 probe sets) the autocorrelation per sample, as described in Fehrmann et al.¹⁷

Frequency plots and heat maps

CNAs were visualized using the Integrative Genomics Viewer (https://www.broadinstitute.org/igv/). The lists of segmented CNAs of all studies within each GEMM (received as outputs from the CGH-Explorer analyses) were united, and chromosomal locations were modified to match the mouse mm8 assembly. These lists were then uploaded to Integrative Genomics Viewer to generate frequency plots and heat maps.

Recurrence analysis

To detect recurrent CNAs, the lists of segmented CNAs (CGH-Explorer output) for all studies within each GEMM were united and matched to the mouse chromosomal cytobands obtained from Ensembl 67 Archive for Mus musculus mm9 (May 2012). Each cytoband was assigned with the copy number of the segment(s) that correspond(s) to it (−1, deletion; 0, neutral; and 1, gain). The frequency of gains and losses of each chromosomal cytoband was computed within each GEMM. Aberrations were determined as recurrent if their prevalence was >10% in the N tumour samples, or if statistically significant (Bonferroni adjusted P<0.05) in binomial test for observing an alteration frequency K_c that is higher than expected. For the binomial test, the expected probability p_c of an event was computed as the background event frequency across all other cytobands within the GEMM, excluding the cytoband in the test: . The test was performed separately for gains and for losses. To further improve our confidence in detecting recurrent CNAs, we applied GISTIC2.0 (version 2.0.22) using the Mus musculus (mm9) refSeq gene annotations (ftp://ftp.broadinstitute.org/pub/GISTIC2.0/refgenes/mm9_v0.2_refgene.tgz). As input segments already contained CNA calls (−1, deletion; 0, neutral; and 1, gain), the deletion and amplification thresholds were set at ±0.5, respectively. Other GISTIC parameters were the following: genegistic=1, maxseg=2,000, js=2, cap=1.5, broad=1, brlen=0.7, conf=0.99, armpeel=1, rx=0 and gcm=extreme. The q value of each cytoband was determined by the significant focal analysis (q<0.05), and if there were no significant focal overlaps, by the significant broad analysis (q<0.05). To determine model-specific recurrent CNAs, the Pearson’s χ²-test of independence was applied, and aberrations were determined as model-specific if statistically significant (Bonferroni adjusted P<0.05) in this test.

Detection of arm-level CNAs in human tumours and cell lines

The prevalence of aneuploidy and large CNAs was compared between 1,097 human breast cancer tumours from the TCGA project²⁵ and 57 human breast cancer cell lines from the Cancer Cell Line Encyclopaedia (CCLE) cohort⁴⁴, using GISTIC2.0 analysis of arm-level events⁴⁵. Normalized, segmented Affymetrix single-nucleotide polymorphism 6.0 copy-number data for the cell lines were obtained from the CCLE (http://www.broadinstitute.org/ccle/data/browseData,2012-04-05hg18dataset). Normalized, segmented single-nucleotide polymorphism 6.0 copy-number data for the TCGA breast adenocarcinoma samples were obtained from the TCGA/GDAC Firehose stddata__2014_10_17 data set (http://gdac.broadinstitute.org/runs/stddata__2014_10_17/data, doi:10.7908/C1K64H78). The data were median-centered and converted from log2 ratio to relative copy number by GISTIC2.0 (with cap=1.5). The median relative copy number across each chromosome arm was computed for every sample and compared with a threshold of ± 0.1 copies. Using the standard GISTIC2.0 noise threshold, arm median values exceeding 0.1 were assigned 1 in the output table, arm median values below −0.1 were assigned −1 and arm median values within the range were assigned 0.

Synteny–orthology and gene networks analyses

Recurrent driver-specific aberrations identified in each GEMM were compared with recurrent aberrations identified in human breast tumours with high expression score of the same pathway³³. Synteny between the mouse and human genome was determined and drawn using the synteny location-based display of the Ensemble Genome Browser (release 80) (http://www.ensembl.org). Genes that were significantly differentially regulated in the GEMM, compared with the normal tissue samples or compared with all other GEMMs¹⁰, were then filtered to include only the ones that have human orthologues that reside within the respective human syntenic region. Orthology between the mouse and human genes was examined using the HUGO (the Human Genome Organisation) Gene Nomenclature Committee comparison of orthology predictions search (http://www.genenames.org/cgi-bin/hcop). This list was then subjected to a gene network analysis, using GeNets: The Broad Institute Web Platform for Genome Networks (https://www.broadinstitute.org/genets). For each analysis, the original initiating gene of the model (for example, HER2, MYC, TP53 and so on) was added to the list of orthologous genes dysregulated within the recurrent CNAs of that particular model, and these gene lists were then subjected to a protein–protein interaction analysis using the InWeb3 network, and to a pathway analysis using the ConsensusPathDB network.

Survival analysis

Survival data were obtained from the Kaplan–Meier Plotter breast cancer survival analysis database⁴⁶, 2014 version (http://kmplot.com/analysis/index.php?p=service&cancer=breast). The mean expression values of the two ERBB2 probe sets (210930_s_at, 216836_s_at), the three SFN probe sets (33322_i_at, 33322_r_at, 209260_at) and the three BIRC5 probe sets (202094_at, 202095_s_at, 210334_x_at) were used. The P value was calculated using a log-rank test.

Tumour latency analysis

Average tumour formation latencies were derived from The Jackson Laboratory website (http://jaxmice.jax.org/cancer/featured.html), and were confirmed by a recent review paper comparing latencies between different GEMMs²². The latencies characteristic of the Pten^−/−, the Brca1^−/− and the Etv6–Ntrk3 GEMMs were derived from Liu et al.⁴⁷, Li et al.⁴⁸ and Diaz-Cruz et al.⁴⁹, respectively.

Association analysis of gene expression and protein levels

To assess the degree of association between mRNA expression levels of relevant genes, for example, SFN, and RPPA protein levels in cell lines and tumours we used an information-theoretic measure of association: the information coefficient. This quantity is a rescaling of the differential mutual information to make it lie in the interval [−1, 1] in a way similar to a correlation coefficient. The differential mutual information is a sensitive metric to detect linear and non-linear relationships between variables. The information coefficient, the matching score shown on the side of Fig. 5a; Supplementary Fig. 9c, is computed using standard kernel estimation procedures and its statistical significance (that is, nominal P values and false discovery rates) is assessed using an empirical permutation test. Similar association analysis has been applied in other problems, such as correlating drug sensitivities, mRNA levels, pathway profiles and genomic alterations^50,51.

Cell culture and genetic manipulations

MCF10A stable cell lines overexpressing an empty vector (control), ERBB2 alone, 14-3-3ζ alone or both (transformed MCF10A cells) were a kind gift from Dihua Yu and colleagues³⁶. Cell lines were tested for mycoplasma contamination, and their morphology and performance in a functional (colony formation) assay were confirmed. MCF10A cells were cultured in MEGM Mammary Epithelial Cell Growth Medium (Lonza CC-3151), supplemented with the MEGM Bulletkit (Lonza CC-3151) and with 5 μg ml⁻¹ human transferrin (Lonza CC-4205). MDA-MB-231, MDA-MB-453 and EFM-192A breast cancer cell lines were obtained from the Broad Institute CCLE repository⁴⁴, and cultured in RPMI medium 1640 GlutaMAX (Thermo Fisher Scientific 61870-036). Lentiviral vector and its packaging vectors were transfected into 293T cells using FuGENE HD transfection reagent (Promega E2311). 293T cells were split into 6-cm² plates, and were transfected the following day with 1 μg of vector, together with 100 ng of pCMV-VSV-G and 900 ng of psPAX2 packaging plasmids. For overexpression of SFN, the introduced vector was the CCSB-Broad Lentiviral Expression clone of Human SFN ORF (ccsbBroad304_06302; ccsbBroad304_99991 luciferase clone was used as control). For short hairpin RNA (shRNA)-mediated knockdown of SFN, the introduced vector was the GeneCopoeia HSH007802-LVRH1H shRNA-1; CSHCTR001-LVRH1H was used as a scrambled control shRNA. For CRISPR/Cas9-mediated knockout of SFN, the introduced vectors were the lentiCas9_blast and the lentiGuide_Puro into which a guide RNA (gRNA) against SFN was cloned; a gRNA against green fluorescent protein was used as control. The morning following transfection, the medium was replaced with fresh culture medium. Forty-eight and 72 h later, the lentivirus containing media was collected from transfection, filtered through a 0.45-μm filtre and the target cells were infected with the fresh lentivirus containing media (supplemented with 8 μg ml⁻¹ polybrene). The next day, the medium was replaced with fresh culture medium containing selection antibiotics. MCF10A stable clones were selected with 5–10 μg ml⁻¹ blasticidin (Life Technologies A11139-03); MDA-MB-231, MDA-MB-453 and EFM-192A stable clones were selected with 100–200 μg ml⁻¹ hygromycin (for shRNAs; Life Technologies 10687-010), 1 μg ml⁻¹ puromycin (for Cas9; Life Technologies A1113803) or 5–10 μg ml⁻¹ blastocidin (for gRNAs; Life Technologies A11139-03). The sequences of the shRNAs are the following: shRNA-scrambled: GCTTCGCGCCGTAGTCTTA and shRNA-SFN: GCGAAACCTGCTCTCAGTA. The sequences of the gRNAs are the following: gRNA-green fluorescent protein: GGGCGAGGAGCTGTTCACCG and gRNA-SFN: CGAGATCGCCAACAGCCCCG.

Immunoblotting

Total cell lysates were collected with a mix of 4 × protein loading buffer (Li-Cor 928-40004) and 10 × NuPAGE sample reducing agent (Life Technologies NP0009). The lysate was boiled for 5 min at 96 °C and frozen at −20 °C. Protein concentration was normalized between samples by cell counting. Cell lysates were subjected to electrophoresis using SDS–polyacrylamide gel electrophoresis and transferred to a nitrocellulose membrane with the iBlot2 dry blotting system (Life Technologies IB23001). Membrane was then blocked with Odyssey blocking buffer (Li-Cor 927-40100) for 1 h at room temperature, followed by an overnight primary antibody incubation at 4 °C in Odyssey blocking buffer with 0.1% Tween-20. For detection of SFN/14-3-3σ, we used the anti-human 14-3-3σ (E-11) mouse monoclonal antibody (Santa Cruz Biotechnologies, sc-166473, 1:200). For detection of β-actin, we used the anti-human β-actin rabbit polyclonal antibody (Santa Cruz Biotechnologies, sc-130656, 1:200). Following primary antibody staining, membranes were washed three times with Tris-Buffered Saline with Tween 20 (TBST) and incubated with the appropriate IRDye secondary antibody (Li-Cor) for 1 h at room temperature in Odyssey blocking buffer with 0.1% Tween-20 and 0.02% SDS. Membrane was then washed three times with TBST and twice with phosphate-buffered saline, and the signal was detected with a Li-Cor Odyssey CLx imaging machine and quantitated with the Image Studio software. Three biological replicates of the experiments were performed. Uncropped scans are presented in Supplementary Fig. 14.

Soft-agar colony formation assay

Cells were suspended in 0.35% agar with their culture media, plated into six-well plates pre-coated with 0.5% agar at a density of 25 k cells per well and incubated at 37 °C. Once a week, 200 μl of media was added to each well. At 2 weeks, cells were stained for 1 h with 0.005% crystal violet (Sigma-Aldrich V5265) in phosphate-buffered saline with 4% formaldehyde, washed three times and images of the entire wells were taken using a Leica automated microscope with an ACE light source (Schott A20500). Images were analysed and colonies (>10 pixel units in size) were automatically counted using the Cell Profiler imaging software. Three biological replicates of the experiments were performed.

Cell migration and invasion assays

CytoSelect 96-well cell migration assay (Cell Biolabs CBA-106) and CytoSelect 96-well cell invasion assay (Cell Biolabs CBA-112-COL) were performed according to the manufacturer’s protocol. In short, cells were suspended in low-serum (0.5% fetal bovine serum) DMEM medium, and added to the top chambers of the 96-well cell migration plates or the collagen-coated 96-well cell invasion plates at a density of 50 k cells per well. Complete media was added to the bottom chambers as attractant. Twenty-four hours after incubation, migrating/invading cells were detached from the underside of the membrane using cell detachment solution, lysed with lysis buffer and stained with CyQuant GR dye solution. Fluorescence intensity was determined with Envision plate reader at 485/535 nm. Five biological replicates of the experiments were performed.

Statistical analyses

The significance of the differences in the prevalence of CNAs between different stages of tumorigenesis, between primary tumours and cell lines, between the various GEMMs, between activating promoters, between genetic backgrounds, between histological subtypes, between tumours with different p53 status and between mouse strains was determined using the Pearson’s χ²-test of independence, or using the Fisher’s exact test (whenever the number of samples for one of the conditions was <10). The significance of the difference in the average number of arm-level CNAs between human primary tumours and cell lines, and the significance of the difference in the performance in the migration, invasion and colony formation assays were determined using the two-tailed Student’s t-test. The significance of the difference between the autocorrelation distributions of GEMMs was determined by a Mann–Whitney U test. The significance of the difference between the DGI of the various GEMMs, as determined by the fraction of altered genes or by the number of discrete CNAs was determined by a Kruskal–Wallis rank-sum test, followed by a post hoc Dunn’s test of multiple comparisons. Bar plots represent the mean ± s.d. Box plots were generated using the R statistical software, so that the boxes show the median, 25th and 75th percentiles, lower whiskers show data within 25th percentile −1.5 times the interquartile range (IQR), upper whiskers show data within 75th percentile +1.5 times the IQR and circles show outliers.

Data availability

Data referenced in this study and their associated accession codes are available in Supplementary Data 1. The authors declare that any other data supporting the findings of this study are available within the article, its Supplementary Information files or available from the author upon request.

Additional information

How to cite this article: Ben-David, U. et al. The landscape of chromosomal aberrations in breast cancer mouse models reveals driver-specific routes to tumorigenesis. Nat. Commun. 7:12160 doi: 10.1038/ncomms12160 (2016).

References

Garraway, L. A. & Lander, E. S. Lessons from the cancer genome. Cell 153, 17–37 (2013).
Article CAS Google Scholar
Zack, T. I. et al. Pan-cancer patterns of somatic copy number alteration. Nat. Genet. 45, 1134–1140 (2013).
Article CAS Google Scholar
Beroukhim, R. et al. The landscape of somatic copy-number alteration across human cancers. Nature 463, 899–905 (2010).
Article CAS ADS Google Scholar
van Miltenburg, M. H. & Jonkers, J. Using genetically engineered mouse models to validate candidate cancer genes and test new therapeutic approaches. Curr. Opin. Genet. Dev. 22, 21–27 (2012).
Article CAS Google Scholar
Menezes, M. E. et al. Genetically engineered mice as experimental tools to dissect the critical events in breast cancer. Adv. Cancer Res. 121, 331–382 (2014).
Article CAS Google Scholar
Weaver, Z. A. et al. A recurring pattern of chromosomal aberrations in mammary gland tumours of MMTV-cmyc transgenic mice. Genes Chromosomes Cancer 25, 251–260 (1999).
Article CAS Google Scholar
Weaver, Z. et al. Mammary tumours in mice conditionally mutant for Brca1 exhibit gross genomic instability and centrosome amplification yet display a recurring distribution of genomic imbalances that is similar to human breast cancer. Oncogene 21, 5097–5107 (2002).
Article CAS Google Scholar
Fabris, V. T. From chromosomal abnormalities to the identification of target genes in mouse models of breast cancer. Cancer Genet. 207, 233–246 (2014).
Article CAS Google Scholar
Silva, G. O. et al. Cross-species DNA copy number analyses identifies multiple 1q21-q23 subtype-specific driver genes for breast cancer. Breast Cancer Res. Treat. 152, 347–356 (2015).
Article CAS Google Scholar
Hollern, D. P. & Andrechek, E. R. A genomic analysis of mouse models of breast cancer reveals molecular features of mouse models and relationships to human breast cancer. Breast Cancer Res. 16, R59 (2014).
Article Google Scholar
Herschkowitz, J. I. et al. Comparative oncogenomics identifies breast tumours enriched in functional tumour-initiating cells. Proc. Natl Acad. Sci. USA 109, 2778–2783 (2012).
Article CAS ADS Google Scholar
Herschkowitz, J. I. et al. Identification of conserved gene expression features between murine mammary carcinoma models and human breast tumours. Genome Biol. 8, R76 (2007).
Article Google Scholar
Carter, S. L., Eklund, A. C., Kohane, I. S., Harris, L. N. & Szallasi, Z. A signature of chromosomal instability inferred from gene expression profiles predicts clinical outcome in multiple human cancers. Nat. Genet. 38, 1043–1048 (2006).
Article CAS Google Scholar
Patel, A. P. et al. Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma. Science 344, 1396–1401 (2014).
Article CAS ADS Google Scholar
Ben-David, U., Mayshar, Y. & Benvenisty, N. Virtual karyotyping of pluripotent stem cells on the basis of their global gene expression profiles. Nat. Protoc. 8, 989–997 (2013).
Article Google Scholar
Mayshar, Y. et al. Identification and classification of chromosomal aberrations in human induced pluripotent stem cells. Cell Stem Cell 7, 521–531 (2010).
Article CAS Google Scholar
Fehrmann, R. S. et al. Gene expression analysis identifies global gene dosage sensitivity in cancer. Nat. Genet. 47, 115–125 (2015).
Article CAS Google Scholar
Westcott, P. M. et al. The mutational landscapes of genetic and chemical models of Kras-driven lung cancer. Nature 517, 489–492 (2015).
Article CAS ADS Google Scholar
Nassar, D., Latil, M., Boeckx, B., Lambrechts, D. & Blanpain, C. Genomic landscape of carcinogen-induced and genetically induced mouse skin squamous cell carcinoma. Nat. Med. 21, 946–954 (2015).
Article CAS Google Scholar
McCreery, M. Q. et al. Evolution of metastasis revealed by mutational landscapes of chemically induced skin cancers. Nat. Med. 21, 1514–1520 (2015).
Article CAS Google Scholar
Neve, R. M. et al. A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes. Cancer Cell. 10, 515–527 (2006).
Article CAS Google Scholar
Fantozzi, A. & Christofori, G. Mouse models of breast cancer metastasis. Breast Cancer Res. 8, 212 (2006).
Article Google Scholar
Andrechek, E. R. et al. Genetic heterogeneity of Myc-induced mammary tumours reflecting diverse phenotypes including metastatic potential. Proc. Natl Acad. Sci. USA 106, 16387–16392 (2009).
Article ADS Google Scholar
Van Keymeulen, A. et al. Reactivation of multipotency by oncogenic PIK3CA induces breast tumour heterogeneity. Nature 525, 119–123 (2015).
Article CAS ADS Google Scholar
Cancer Genome Atlas, N. Comprehensive molecular portraits of human breast tumours. Nature 490, 61–70 (2012).
Ciriello, G. et al. Comprehensive molecular portraits of invasive lobular breast cancer. Cell 163, 506–519 (2015).
Article CAS Google Scholar
Eirew, P. et al. Dynamics of genomic clones in breast cancer patient xenografts at single-cell resolution. Nature 518, 422–426 (2015).
Article CAS ADS Google Scholar
Marsicano, S. R. et al. Survinin expression in patients with breast cancer during chemotherapy. Tumour Biol. 36, 3441–3445 (2015).
Article CAS Google Scholar
Tanaka, K. et al. Expression of survivin and its relationship to loss of apoptosis in breast carcinomas. Clin. Cancer Res. 6, 127–134 (2000).
CAS PubMed Google Scholar
Wang, C., Zheng, X., Shen, C. & Shi, Y. MicroRNA-203 suppresses cell proliferation and migration by targeting BIRC5 and LASP1 in human triple-negative breast cancer cells. J. Exp. Clin. Cancer Res. 31, 58 (2012).
Article Google Scholar
Li, Q. X. et al. Survivin stable knockdown by siRNA inhibits tumour cell growth and angiogenesis in breast and cervical cancers. Cancer Biol. Ther. 5, 860–866 (2006).
Article CAS Google Scholar
Xia, W. et al. Regulation of survivin by ErbB2 signaling: therapeutic implications for ErbB2-overexpressing breast cancers. Cancer Res. 66, 1640–1647 (2006).
Article CAS Google Scholar
Gatza, M. L., Silva, G. O., Parker, J. S., Fan, C. & Perou, C. M. An integrated genomics approach identifies drivers of proliferation in luminal-subtype human breast cancer. Nat. Genet. 46, 1051–1059 (2014).
Article CAS Google Scholar
Lodygin, D. & Hermeking, H. The role of epigenetic inactivation of 14-3-3sigma in human cancer. Cell Res. 15, 237–246 (2005).
Article CAS Google Scholar
Boudreau, A. et al. 14-3-3sigma stabilizes a complex of soluble actin and intermediate filament to enable breast tumour invasion. Proc. Natl Acad. Sci. USA 110, E3937–E3944 (2013).
Article CAS Google Scholar
Lu, J. et al. 14-3-3zeta Cooperates with ErbB2 to promote ductal carcinoma in situ progression to invasive breast cancer by inducing epithelial-mesenchymal transition. Cancer Cell 16, 195–207 (2009).
Article CAS Google Scholar
Ling, C., Su, V. M., Zuo, D. & Muller, W. J. Loss of the 14-3-3sigma tumour suppressor is a critical event in ErbB2-mediated tumour progression. Cancer Discov. 2, 68–81 (2012).
Article CAS Google Scholar
Seton-Rogers, S. E. et al. Cooperation of the ErbB2 receptor and transforming growth factor beta in induction of migration and invasion in mammary epithelial cells. Proc. Natl Acad. Sci. USA 101, 1257–1262 (2004).
Article CAS ADS Google Scholar
Hong, H. Y. et al. 14-3-3 sigma and 14-3-3 zeta plays an opposite role in cell growth inhibition mediated by transforming growth factor-beta 1. Mol. Cells 29, 305–309 (2010).
Article CAS Google Scholar
Hong, H. Y., Jeon, W. K., Kim, S. J. & Kim, B. C. 14-3-3 sigma is a new target up-regulated by transforming growth factor-beta1 through a Smad3-dependent mechanism. Biochem. Biophys. Res. Commun. 432, 193–197 (2013).
Article CAS Google Scholar
Hu, F. et al. PBK/TOPK interacts with the DBD domain of tumour suppressor p53 and modulates expression of transcriptional targets including p21. Oncogene 29, 5464–5474 (2010).
Article CAS Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing R Foundation for Statistical Computing (2014).
Lingjaerde, O. C., Baumbusch, L. O., Liestol, K., Glad, I. K. & Borresen-Dale, A. L. CGH-Explorer: a program for analysis of array-CGH data. Bioinformatics 21, 821–822 (2005).
Article CAS Google Scholar
Barretina, J. et al. The Cancer Cell Line Encyclopaedia enables predictive modelling of anticancer drug sensitivity. Nature 483, 603–607 (2012).
Article CAS ADS Google Scholar
Mermel, C. H. et al. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol. 12, R41 (2011).
Article Google Scholar
Gyorffy, B. et al. An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Res. Treat. 123, 725–731 (2010).
Article Google Scholar
Liu, J. C. et al. Combined deletion of Pten and p53 in mammary epithelium accelerates triple-negative breast cancer with dependency on eEF2K. EMBO Mol. Med. 6, 1542–1560 (2014).
Article CAS Google Scholar
Li, Z. et al. ETV6-NTRK3 fusion oncogene initiates breast cancer from committed mammary progenitors via activation of AP1 complex. Cancer Cell 12, 542–558 (2007).
Article CAS Google Scholar
Diaz-Cruz, E. S., Cabrera, M. C., Nakles, R., Rutstein, B. H. & Furth, P. A. BRCA1 deficient mouse models to study pathogenesis and therapy of triple negative breast cancer. Breast Dis. 32, 85–97 (2010).
Article Google Scholar
Konieczkowski, D. J. et al. A melanoma cell state distinction influences sensitivity to MAPK pathway inhibitors. Cancer Discov. 4, 816–827 (2014).
Article CAS Google Scholar
Stewart, M. L. et al. KRAS genomic status predicts the sensitivity of ovarian cancer cells to decitabine. Cancer Res. 75, 2897–2906 (2015).
Article CAS Google Scholar

Download references

Acknowledgements

We thank Pablo Tamayo, Steven Schumacher, Craig Bielski and John Mercer for assistance with the data assembly; Dihua Yu for the MCF10A cells; and Gad Getz, Rameen Beroukhim and Zuzana Tothova for the helpful discussions. U.B.-D. is supported by a Human Frontiers Science Program postdoctoral fellowship.

Author information

Authors and Affiliations

Cancer Program, Broad Institute of Harvard and MIT, Cambridge, 02142, Massachusetts, USA
Uri Ben-David, Gavin Ha, Prasidda Khadka, Xin Jin, Bang Wong & Todd R. Golub
Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, 02115, Massachusetts, USA
Gavin Ha & Todd R. Golub
Department of Genetics, University of Groningen, Groningen, 9711, The Netherlands
Lude Franke
Harvard Medical School, Harvard University, Boston, 02115, Massachusetts, USA
Todd R. Golub
Howard Hughes Medical Institute, Chevy Chase, 208115, Maryland, USA
Todd R. Golub

Authors

Uri Ben-David
View author publications
You can also search for this author in PubMed Google Scholar
Gavin Ha
View author publications
You can also search for this author in PubMed Google Scholar
Prasidda Khadka
View author publications
You can also search for this author in PubMed Google Scholar
Xin Jin
View author publications
You can also search for this author in PubMed Google Scholar
Bang Wong
View author publications
You can also search for this author in PubMed Google Scholar
Lude Franke
View author publications
You can also search for this author in PubMed Google Scholar
Todd R. Golub
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

U.B.-D. conceived the project, performed the experiments, collected the data and carried out the analyses; G.H. assisted with the computational analyses; P.K. assisted with the SFN overexpression experiments; X.J. assisted with the data analysis of human cancer cell lines; B.W. assisted with the figure design and preparation. L.F. assisted with the FGMP and autocorrelation computations; T.R.G. directed the project; and U.B.-D. and T.R.G. wrote the manuscript.

Corresponding author

Correspondence to Todd R. Golub.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-14 and Supplementary Tables 1-3. (PDF 2016 kb)

Supplementary Data 1

Breast cancer datasets analyzed in the current study. (XLS 22 kb)

Supplementary Data 2

List of breast cancer tumor samples. (XLS 129 kb)

Supplementary Data 3

List of normal mammary tissue samples. (XLS 38 kb)

Supplementary Data 4

List of premalignant mammary samples. (XLS 17 kb)

Supplementary Data 5

List of breast cancer metastase. (XLS 13 kb)

Supplementary Data 6

List of breast cancer cell lines. (XLS 18 kb)

Supplementary Data 7

Myc-induced tumors in lymphoma and prostate cancer GEMMs. (XLS 34 kb)

Supplementary Data 8

8: SV40Tag-induced tumors in prostate cancer GEMMs. (XLS 17 kb)

Supplementary Data 9

Candidate genes underlying recurrent driver-specific CNAs. (XLS 71 kb)

Supplementary Data 10

Band-level aberration matrix files. (XLS 2553 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Ben-David, U., Ha, G., Khadka, P. et al. The landscape of chromosomal aberrations in breast cancer mouse models reveals driver-specific routes to tumorigenesis. Nat Commun 7, 12160 (2016). https://doi.org/10.1038/ncomms12160

Download citation

Received: 30 March 2016
Accepted: 06 June 2016
Published: 04 July 2016
DOI: https://doi.org/10.1038/ncomms12160

This article is cited by

Machine-learning analysis reveals an important role for negative selection in shaping cancer aneuploidy landscapes
- Juman Jubran
- Rachel Slutsky
- Esti Yeger-Lotem
Genome Biology (2024)
Dissecting metastasis using preclinical models and methods
- Jess D. Hebert
- Joel W. Neal
- Monte M. Winslow
Nature Reviews Cancer (2023)
Insights from transgenic mouse models of PyMT-induced breast cancer: recapitulating human breast cancer progression in vivo
- Sherif Attalla
- Tarek Taifour
- William Muller
Oncogene (2021)
Aneuploidy as a promoter and suppressor of malignant growth
- Anand Vasudevan
- Klaske M. Schukken
- Jason M. Sheltzer
Nature Reviews Cancer (2021)
Cross-species oncogenic signatures of breast cancer in canine mammary tumors
- Tae-Min Kim
- In Seok Yang
- Sangwoo Kim
Nature Communications (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.