An independent poor-prognosis subtype of breast cancer defined by a distinct tumor immune microenvironment

Tekpli, Xavier; Lien, Tonje; Røssevold, Andreas Hagen; Nebdal, Daniel; Borgen, Elin; Ohnstad, Hege Oma; Kyte, Jon Amund; Vallon-Christersson, Johan; Fongaard, Marie; Due, Eldri Undlien; Svartdal, Lisa Gregusson; Sveli, My Anh Tu; Garred, Øystein; Frigessi, Arnoldo; Sahlberg, Kristine Kleivi; Sørlie, Therese; Russnes, Hege G.; Naume, Bjørn; Kristensen, Vessela N.

doi:10.1038/s41467-019-13329-5


Article
Open access
Published: 03 December 2019


Xavier Tekpli ORCID: orcid.org/0000-0003-4042-8290¹,
Tonje Lien¹,
Andreas Hagen Røssevold ORCID: orcid.org/0000-0002-7212-0111^2,3,
Daniel Nebdal¹,
Elin Borgen⁴,
Hege Oma Ohnstad ORCID: orcid.org/0000-0001-6932-0297³,
Jon Amund Kyte^2,3,
Johan Vallon-Christersson ORCID: orcid.org/0000-0002-2195-0385⁵,
Marie Fongaard¹,
Eldri Undlien Due¹,
Lisa Gregusson Svartdal⁴,
My Anh Tu Sveli⁴,
Øystein Garred⁴,
OSBREAC,
Arnoldo Frigessi⁶,
Kristine Kleivi Sahlberg^1,7,
Therese Sørlie^1,8,
Hege G. Russnes ORCID: orcid.org/0000-0001-8724-1891^1,4,
Bjørn Naume^3,9 &
…
Vessela N. Kristensen^1,8,10

Nature Communications volume 10, Article number: 5499 (2019)


Abstract

How mixtures of immune cells associate with cancer cell phenotype and affect pathogenesis is still unclear. In 15 breast cancer gene expression datasets, we invariably identify three clusters of patients with gradual levels of immune infiltration. The intermediate immune infiltration cluster (Cluster B) is associated with a worse prognosis independently of known clinicopathological features. Furthermore, immune clusters are associated with response to neoadjuvant chemotherapy. In silico dissection of the immune contexture of the clusters identified Cluster A as immune cold, Cluster C as immune hot while Cluster B has a pro-tumorigenic immune infiltration. Through phenotypical analysis, we find epithelial mesenchymal transition and proliferation associated with the immune clusters and mutually exclusive in breast cancers. Here, we describe immune clusters which improve the prognostic accuracy of immune contexture in breast cancer. Our discovery of a novel independent prognostic factor in breast cancer highlights a correlation between tumor phenotype and immune contexture.

Introduction

The tumor microenvironment influences cancer initiation and progression^1,2. In breast cancer, clinicopathological characteristics such as age, grade, stage, and molecular subtypes associate with prognosis and drive treatment decisions. High-throughput gene expression analyses led to a molecular classification of breast cancers^3,4. The five clinically relevant molecular subtypes (Luminal A, Luminal B, Her2-enriched, Basal-like, and Normal-like) have different incidences, survival, prognosis, and tumor biology. Such patient stratification has clinical and economic utility in breast cancer management⁵.

In addition to cancer cell biology, an inflammatory microenvironment influences initiation and progression⁶. The immune microenvironment surrounding cancer cells can recognize and inhibit tumor growth⁷ or promote progression⁸. It is crucial to characterize the quality and quantity of immune response at the tumor site, as it may help to pinpoint patients who could benefit from immunotherapies and will improve our understanding of the tumor–host biology.

In breast cancer, high immune infiltration has been associated with better clinical outcome^9,10. In particular, high CD8+ T cell infiltration associates with better overall survival (OS) in estrogen receptor (ER)-negative patients^11,12. In addition, high immune infiltration has been associated with an increased response to neoadjuvant and adjuvant chemotherapy¹³.

Recently, we and others have demonstrated that transcriptomic data can be leveraged to dissect the tumor microenvironment^14,15,16,17,18,19. Such methods have shown that elevated expression of leukocyte marker genes associates with a lower risk of breast cancer recurrence^14,17,20,21. Notably, Ali et al. and Bense et al. recently reported through comprehensive studies how specific immune cell types influence breast cancer outcome^14,22. In these studies, the authors assessed each predicted cell type individually and did not consider the immune microenvironment as a whole. More studies are needed to specify the role and the clinical relevance of the immune contexture in breast cancer.

In the present study, we discover clinically relevant immune clusters with gradual immune infiltration. In 15 breast cancer cohorts, spanning 6101 breast cancer samples, the group of patients with intermediate levels of tumor immune infiltration has a worse prognosis independently of known prognostic molecular and clinicopathological features. Through characterization of the immune composition of the clusters, we find a pro-tumorigenic immune infiltration associated with the poor prognosis group. Further phenotypical analyses show two mutually exclusive aggressive tumor phenotypes in breast cancers, one linked to epithelial–mesenchymal transition (EMT) and the other to proliferation. Both phenotypes are found in the poor prognosis cluster, in the context of an inactive/pro-tumorigenic immune microenvironment.

Results

Immune clusters in breast cancer

The expression of 760 genes in 95 formalin-fixed, paraffin-embedded (FFPE) tumor samples of the MicMa cohort was measured using the nCounter® PanCancer Immune Profiling array, an array designed to profile immune infiltration in solid tumors. Seventy-nine of these 95 samples have been previously profiled by Agilent whole-genome 4 × 44K oligo array²³. We first compared the expression obtained with the two platforms using Pearson and Spearman correlations and found a high degree of positive correlation between the genes’ expression values (Supplementary Fig. 1A).
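The cross-platform comparison above relies on two standard correlation coefficients. The following is a minimal sketch of both, written with the Python standard library only; the expression values are hypothetical and chosen for illustration, and this is not the authors' code.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def ranks(values):
    """1-based ranks; tied values receive the average of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result

def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(ranks(x), ranks(y))

# Hypothetical expression of one gene in six samples on two platforms
# (illustrative numbers, NOT data from the MicMa cohort).
ncounter = [5.1, 6.3, 7.8, 4.2, 9.0, 6.9]
agilent = [0.4, 1.1, 2.9, 0.1, 3.2, 1.6]
r_pearson = pearson(ncounter, agilent)
r_spearman = spearman(ncounter, agilent)
```

Pearson measures linear agreement of the values, whereas Spearman only asks whether the two platforms order the samples the same way, which makes it less sensitive to platform-specific scaling.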

In order to group patients according to their similarity in expression of the immune-related genes, we performed unsupervised hierarchical clustering of the correlation matrix (Fig. 1a: 95 MicMa-nCounter and Supplementary Fig. 1B; 104 MicMa-Agilent samples). Silhouette plot analysis from 3 to 10 clusters indicated that 3 clusters best captured the segmentation of both the nCounter and the Agilent datasets (Supplementary Fig. 1C, D). We therefore continued our analyses based on three clusters of patients. We compared the clustering obtained from FFPE tissue (MicMa-nCounter, 95 samples; correlation matrix obtained from the expression of 760 genes on the Immune Profiling array) to the clustering performed on fresh frozen tissue (MicMa-Agilent, 104 samples; correlation matrix obtained from the expression of the 509 genes on the Immune Profiling array found in all datasets used in this study). Seventy-nine samples overlapped between these two datasets. Despite the different platforms used to measure gene expression, and the incomplete overlap in gene lists and samples used to perform unsupervised clustering, the cluster assignments for the 79 overlapping samples were significantly similar (Supplementary Table 1; Fisher's exact test, p < 0.0001).
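To illustrate how a silhouette analysis can compare candidate partitions, here is a minimal sketch. It assumes a distance of 1 minus the Pearson correlation between sample profiles, which is one common choice and not necessarily the one used in the study; the profiles and the two candidate label sets are hypothetical.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def correlation_distance(profiles):
    """Pairwise distance matrix, assuming distance = 1 - Pearson correlation."""
    n = len(profiles)
    return [[1 - pearson(profiles[i], profiles[j]) for j in range(n)]
            for i in range(n)]

def mean_silhouette(dist, labels):
    """Average silhouette width of a partition, given a distance matrix."""
    n = len(labels)
    clusters = sorted(set(labels))
    total = 0.0
    for i in range(n):
        own = [dist[i][j] for j in range(n) if labels[j] == labels[i] and j != i]
        if not own:  # singleton cluster: silhouette is defined as 0
            continue
        a = sum(own) / len(own)  # mean distance to the sample's own cluster
        b = min(                 # mean distance to the nearest other cluster
            sum(dist[i][j] for j in range(n) if labels[j] == c)
            / sum(1 for j in range(n) if labels[j] == c)
            for c in clusters if c != labels[i]
        )
        total += (b - a) / max(a, b)
    return total / n

# Hypothetical expression profiles (6 samples x 5 genes), three obvious groups.
profiles = [
    [1, 2, 3, 4, 5], [1.1, 2.2, 2.9, 4.1, 5.3],
    [5, 4, 3, 2, 1], [5.2, 3.9, 3.1, 2.2, 0.8],
    [1, 5, 1, 5, 1], [1.2, 4.8, 0.9, 5.1, 1.3],
]
dist = correlation_distance(profiles)
sil_three = mean_silhouette(dist, [0, 0, 1, 1, 2, 2])  # three clusters
sil_two = mean_silhouette(dist, [0, 0, 1, 1, 0, 0])    # two clusters
```

The partition with the higher average silhouette width separates the samples more cleanly; in this toy case the three-group labelling wins, mirroring the kind of comparison described above.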

To confirm that the clusters were associated with the tumor immune microenvironment (Fig. 1b), we used the algorithm Nanodissect to score for total lymphocyte and myelocyte infiltration^17,24,25. Nanodissect scores were first validated in the MicMa cohort using the evaluation of immune infiltration of matched hematoxylin and eosin (H&E) sections analyzed by experienced pathologists (Fig. 1c and Supplementary Fig. 1E).

We found the three clusters significantly correlated with Nanodissect lymphocyte (Fig. 1b) and myelocyte (Supplementary Fig. 1F) scores. In addition, a chi-squared test showed a significant association between clusters and immune infiltration assessed by experienced pathologists (p < 0.0001). We concluded that Clusters A–C reflect gradual immune infiltration, and we therefore called them immune clusters.
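The chi-squared test of association between cluster membership and a categorical infiltration grade can be sketched as follows. The 3 × 3 table of counts is hypothetical, and the closed-form p value used here is only valid for an even number of degrees of freedom (a simplification chosen to avoid external libraries).

```python
import math

def chi_square_test(table):
    """Pearson chi-squared test of independence for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    if df % 2:
        raise ValueError("the closed-form p value below only covers even df")
    # For even df, the chi-squared survival function has a finite closed form:
    # P(X > x) = exp(-x/2) * sum_{k < df/2} (x/2)^k / k!
    half = stat / 2
    p_value = math.exp(-half) * sum(
        half ** k / math.factorial(k) for k in range(df // 2)
    )
    return stat, df, p_value

# Hypothetical counts: rows = Clusters A, B, C;
# columns = low, intermediate, high infiltration as graded on H&E sections.
table = [
    [30, 8, 2],
    [10, 20, 5],
    [3, 9, 18],
]
stat, df, p_value = chi_square_test(table)
```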

Clusters reflect gradual immune infiltration

We validated the association between the clusters and lymphoid/myeloid infiltration using the expression data from nine other cohorts (Supplementary Table 2). As stated above, 509 of the 760 genes on the nCounter® PanCancer Immune Profiling array were found in all datasets studied; the expression of these 509 genes was used in the unsupervised clustering (Fig. 1d and Supplementary Fig. 2A for the clustering of the METABRIC and The Cancer Genome Atlas (TCGA) cohorts, respectively). In each cohort, the three clusters obtained were significantly associated with lymphoid and myeloid Nanodissect scores (Lymphoid score: METABRIC, Fig. 1e; TCGA, Supplementary Fig. 2B).

Lymphoid and myeloid infiltrations gradually increased from Cluster A (blue; low infiltration; cold tumors) to Cluster B (light blue; intermediate infiltration) and Cluster C (pink; high infiltration; hot tumors).

For an additional layer of validation, we used the pathological assessment of immune infiltration in the METABRIC cohort²⁶, which was significantly associated with the Nanodissect scores (Fig. 1f and Supplementary Fig. 2C) and with the immune clusters (chi-squared test between immune clusters and pathological assessment of immune infiltration, p < 0.0001). We therefore concluded that unsupervised hierarchical clustering using genes of the PanCancer Immune Profiling array allows breast tumors to be grouped according to gradual levels of immune infiltration.

Immune clusters associate with prognosis

We examined the immune clusters with respect to survival using Kaplan–Meier analysis and log-rank tests. For the two largest cohorts, METABRIC (n = 1904) and TCGA (n = 981), we found Cluster B (with intermediate levels of immune infiltration) associated with worse prognosis (Supplementary Fig. 3A, B). This worse outcome for Cluster B cases was also observed when analyzing ER-negative (Supplementary Fig. 3C, D) and ER-positive cases (Supplementary Fig. 3E, F) separately. To refine our observation, we plotted patient survival according to Cluster B (light blue) vs Clusters A and C (purple) and confirmed a clear and significantly worse prognosis for patients in Cluster B (Fig. 2). We further validated this result in four additional cohorts with relevant survival data: TAI (n = 327), VDX (n = 344), STK (n = 159), and UPP (n = 251) (Supplementary Fig. 4). We concluded that immune clusters associate with prognosis in both ER-negative and ER-positive breast cancers.
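The Kaplan–Meier estimator behind such survival curves is short enough to sketch directly. The follow-up times and event indicators below are hypothetical, and the log-rank test used to compare groups is not shown.

```python
def kaplan_meier(times, events):
    """Kaplan-Meier estimate as a list of (time, survival probability).

    events[i] is 1 if the event was observed at times[i] and 0 if the
    observation was censored at that time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        at_this_time = [e for tt, e in data if tt == t]
        n_events = sum(at_this_time)
        if n_events:
            # Multiply by the conditional probability of surviving past t.
            survival *= 1 - n_events / at_risk
            curve.append((t, survival))
        # Both events and censored cases leave the risk set after time t.
        at_risk -= len(at_this_time)
        i += len(at_this_time)
    return curve

# Hypothetical follow-up in years (NOT data from any cohort in the study).
cluster_b = kaplan_meier([2, 3, 3, 5, 8], [1, 1, 0, 1, 0])
other_clusters = kaplan_meier([4, 6, 7, 9, 10], [0, 1, 0, 0, 0])
```

Censored patients contribute to the risk set up to their last follow-up but never trigger a drop in the curve, which is why the estimate only changes at observed event times.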

Predicting immune clusters with binomial logistic regression

Motivated by the clinical relevance of the immune clusters, we aimed at developing a general method that could precisely and sensitively predict the classification of patients to the worse prognosis group without having to rely on unsupervised clustering. We developed a model through training on 10 cohorts (4546 samples) and testing on 5 others (1555 samples). We used binomial logistic regression penalized by the lasso method to obtain a set of genes (Supplementary Data 1) that sensitively and specifically predict whether a sample is part of Cluster B or not, as assessed by receiver operating characteristic curve and area under the curve (AUC) analysis (Fig. 3a). Our model predicted the immune clusters with an AUC = 85.8% (82.8%–88.7%). We found that 96.3% of the samples assigned to Clusters A and C by clustering were predicted to be A and C by the model, while 68.8% of the samples assigned to Cluster B through clustering were found in Cluster B using the lasso method (Fig. 3b). The lasso method thus decreased the number of samples in Cluster B (Fig. 3b). As unsupervised clustering is less reliable in small cohorts, and because learning the cluster assignment from several cohorts should help to refine the phenotype underlying the immune clusters, we hypothesized that the lasso-derived classification would be a better prognostic factor than the clustering method. Indeed, by comparing the survival log-rank test p values, we found that the lasso classification generally strengthened the associations between the immune clusters and survival (Supplementary Table 3). The lasso model was validated in five additional cohorts: Fig. 3c–e for STAM (n = 856), MAINZ (n = 200), and UPSA (n = 289) and Supplementary Fig. 5A, B for CAL (n = 118) and PNC (n = 92).
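As a rough illustration of lasso-penalized binomial logistic regression and its evaluation by AUC, the sketch below fits the model by proximal gradient descent (a gradient step followed by soft-thresholding). The solver, the penalty strength `lam`, the step size `lr`, and the two-gene toy data are all assumptions made for illustration; the study's actual software and settings are not described in this section.

```python
import math

def soft_threshold(w, t):
    """Shrink w towards zero by t; this step produces exact zeros (sparsity)."""
    return math.copysign(max(abs(w) - t, 0.0), w)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_lasso_logistic(X, y, lam=0.05, lr=0.5, n_iter=2000):
    """L1-penalized logistic regression; the intercept b is not penalized."""
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(n_iter):
        preds = [sigmoid(b + sum(wj * xj for wj, xj in zip(w, row))) for row in X]
        err = [pi - yi for pi, yi in zip(preds, y)]
        grad_b = sum(err) / n
        grad_w = [sum(e * row[j] for e, row in zip(err, X)) / n for j in range(p)]
        b -= lr * grad_b
        w = [soft_threshold(wj - lr * gj, lr * lam) for wj, gj in zip(w, grad_w)]
    return w, b

def auc(scores, labels):
    """Probability that a random positive scores higher than a random negative."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum(1.0 if sp > sn else 0.5 if sp == sn else 0.0
               for sp in pos for sn in neg)
    return wins / (len(pos) * len(neg))

# Toy data: gene 0 carries signal (label 1 = "Cluster B"), gene 1 is
# uninformative by construction (each sample appears with +1 and -1).
base = [(1.0, 1), (2.0, 1), (0.5, 1), (-0.3, 1),
        (-1.0, 0), (-2.0, 0), (-0.5, 0), (0.4, 0)]
X = [[x0, sign] for x0, _ in base for sign in (1.0, -1.0)]
y = [label for _, label in base for _ in (0, 1)]

weights, intercept = fit_lasso_logistic(X, y)
scores = [intercept + sum(w * x for w, x in zip(weights, row)) for row in X]
model_auc = auc(scores, y)
```

The lasso penalty keeps the informative gene and sets the coefficient of the uninformative one to exactly zero, which is how a compact gene list can emerge from a larger panel.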

As the binomial logistic regression only predicted two classes (Cluster B vs Clusters A and C), we performed another round of binomial logistic regression to distinguish between Clusters A and C with high accuracy (Supplementary Fig. 5C, D). In conclusion, binomial logistic regression penalized by the lasso method refined Cluster B and provided a single sample predictor that could be applied to each new patient in the clinic. In the subsequent analyses, we use the categories given by the lasso method, as they have a more significant association with survival.

Immune clusters, an independent prognostic factor

We further investigated how the immune clusters were related to well-known clinicopathological features in breast cancer (size, age, grade, stage, lymph node involvement, and molecular subtypes (PAM50)). Cluster A (with low immune infiltration) was enriched in ER-positive and Luminal cases, while a higher proportion of ER-negative and Basal-like cases was found in Cluster C (with high immune infiltration) (Fig. 4a, b). ER-negative and ER-positive samples as well as the PAM50 subtypes were equally represented in the poor prognosis Cluster B (Fig. 4a, b).

We tested the prognostic impact of the immune clusters while accounting for other prognostic factors using multivariable Cox regression analysis. The variables available for each cohort (ER status, PAM50 subtypes, age, nodal status, size, and grade) were entered into each model. The hazard ratios and p values associated with each variable in each model are shown in Supplementary Table 4. We found that immune clusters were an important factor to model survival, as shown by the significant p values associated with immune clusters in each cohort's Cox model. Indeed, when we removed the immune clusters from the model, the Akaike Information Criterion (AIC) increased (Supplementary Table 5), demonstrating the value of immune clusters on top of all other variables for explaining breast cancer survival.

To further test the strength of the immune clusters as an important prognostic biomarker, we used a stepwise backward selection. From the initial Cox models containing all variables, we removed the weakest predictor variable only if this did not weaken the model (as monitored by the calculation of AIC index). This allowed us to find for each cohort the set of variables explaining survival best. For all cohorts, the immune clusters were kept in the best fitted minimal model, and in 9 out of 11 cohorts, the immune clusters were a significant prognostic variable (Table 1). To further emphasize and illustrate the clinical relevance of the immune clusters and their independence from the PAM50 molecular subtypes, we plotted for the METABRIC and TCGA cohorts the Kaplan–Meier survival curve for each PAM50 subtype (Supplementary Fig. 6).
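The logic of AIC-guided backward selection can be sketched generically. In the version below, the variable whose removal yields the lowest AIC is dropped as long as the AIC does not increase; this selection rule, the one-parameter-per-variable simplification, and the additive log-likelihood values are assumptions for illustration (a real analysis would refit a Cox model at every step).

```python
def aic(log_likelihood, n_params):
    """Akaike Information Criterion: lower values indicate a better trade-off."""
    return 2 * n_params - 2 * log_likelihood

def backward_select(variables, fit_loglik):
    """Drop variables one at a time while doing so does not increase the AIC.

    fit_loglik(vars) must return the maximized log-likelihood of a model
    fitted with exactly those variables (one parameter each, by assumption).
    """
    current = list(variables)
    best = aic(fit_loglik(current), len(current))
    while len(current) > 1:
        candidates = []
        for v in current:
            reduced = [u for u in current if u != v]
            candidates.append((aic(fit_loglik(reduced), len(reduced)), reduced))
        cand_aic, cand_vars = min(candidates, key=lambda c: c[0])
        if cand_aic > best:  # every possible removal weakens the model: stop
            break
        best, current = cand_aic, cand_vars
    return current, best

# Hypothetical log-likelihood gains per variable (NOT results from the study).
GAINS = {"immune_cluster": 12.0, "node": 9.0, "grade": 6.0,
         "size": 0.7, "age": 0.4}

def toy_loglik(variables):
    """Stand-in for refitting a survival model on the given variables."""
    return -500.0 + sum(GAINS[v] for v in variables)

kept, best_aic = backward_select(list(GAINS), toy_loglik)
```

Because each parameter costs 2 AIC units, a variable survives only if it improves the log-likelihood by more than 1; in this toy setting the two weakest variables are removed and the remaining three are kept.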

Table 1 Summary statistics of the Cox regression analysis and stepwise backward selection.

Validation in a new RNA-seq dataset with risk of recurrence (ROR) scores

We generated a new dataset, EMIT0, which is a subset of the OSLO2 cohort study. The OSLO2-EMIT0 samples were assessed with the Food and Drug Administration-approved Prosigna risk of recurrence (ROR) score. As recently demonstrated, ROR scores add significant prognostic information above standard clinicopathological features^3,27. We assessed whether the immune clusters could add prognostic value to ROR scores. We found Cluster B composed of samples with intermediate ROR scores compared to Clusters A and C (Fig. 4c). This suggested that the poor prognosis associated with Cluster B was not likely to be explained by the information contained in the ROR scores. This observation also held when assessing the ER-negative (Supplementary Fig. 7A) and ER-positive (Supplementary Fig. 7B) cases separately. For all cohorts, we calculated the ROR scores following the method of Parker et al.³, which is related to PAM50 subtyping, and confirmed that Cluster B was composed of intermediate ROR scores, as exemplified in the METABRIC cohort (Fig. 4d and Supplementary Fig. 7C, D).

Multivariable regression analysis confirmed that the immune clusters bring additional prognostic value to the ROR scores (Supplementary Table 6) as demonstrated by the significant p values for the immune clusters when modeling survival with ROR scores and immune clusters. Through computation of net reclassification improvement (NRI) and integrated discrimination improvement (IDI) indexes²⁸, we emphasized the additional value of immune clusters to classify patients according to survival when taken together with ROR scores, as indicated by the positive NRI and IDI coefficients in all cohorts. Bootstrapping for confidence interval (CI) construction for NRI and IDI showed that, for several cohorts, the immune clusters significantly improved patient classification according to prognosis when added to the ROR scores (Supplementary Table 6). Using complementary statistical analyses, we demonstrate the clinical relevance of the immune clusters in breast cancer.
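For readers unfamiliar with NRI and IDI, the sketch below computes the category-free versions for a binary outcome: NRI rewards predicted risks that move up for events and down for non-events, and IDI measures the change in the gap between mean predicted risk in events and non-events. The binary-outcome simplification and all risk values are assumptions; the study applies these indexes to survival data, with bootstrapped confidence intervals that are not reproduced here.

```python
def nri_and_idi(risk_old, risk_new, events):
    """Category-free NRI and IDI when a marker is added to a baseline model.

    risk_old / risk_new: predicted risks without / with the new marker.
    events: 1 if the patient had the event, 0 otherwise.
    """
    up_e = down_e = up_n = down_n = 0
    for old, new, e in zip(risk_old, risk_new, events):
        if new > old:
            if e:
                up_e += 1
            else:
                up_n += 1
        elif new < old:
            if e:
                down_e += 1
            else:
                down_n += 1
    n_events = sum(events)
    n_nonevents = len(events) - n_events
    nri = (up_e - down_e) / n_events + (down_n - up_n) / n_nonevents

    def mean_risk(risks, flag):
        selected = [r for r, e in zip(risks, events) if e == flag]
        return sum(selected) / len(selected)

    idi = ((mean_risk(risk_new, 1) - mean_risk(risk_new, 0))
           - (mean_risk(risk_old, 1) - mean_risk(risk_old, 0)))
    return nri, idi

# Hypothetical predicted risks for seven patients (illustrative only):
# "old" = ROR score alone, "new" = ROR score plus immune cluster.
events = [1, 1, 1, 0, 0, 0, 0]
risk_old = [0.5, 0.6, 0.4, 0.3, 0.5, 0.2, 0.4]
risk_new = [0.7, 0.6, 0.5, 0.2, 0.4, 0.3, 0.4]
nri, idi = nri_and_idi(risk_old, risk_new, events)
```

Positive values of both indexes indicate that adding the marker moves predictions in the right direction on balance.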

Immune clusters and response to neoadjuvant chemotherapy

We further assessed the association between the immune clusters and response to neoadjuvant chemotherapy, using gene expression data from studies in which patients were treated in the neoadjuvant setting (chemotherapy before surgery). The endpoint of these studies was pathological complete response (pCR), which means complete eradication of cancer cells at the end of the chemotherapeutic regimen before surgery (see Supplementary Table 2 for datasets used in this section). We used gene expression data from 8 studies (1377 samples) and assigned each sample to an immune cluster using the lasso method. As shown in Fig. 4e, we found the highest percentage of responders in Cluster C (59%), followed by Cluster A (30%), and the lowest percentage of responders in Cluster B (11%). Since Cluster B is also the smallest cluster in terms of patient numbers, we also calculated the percentage of responders within each cluster. Cluster C was composed on average of 42% responders and 58% patients with residual disease, whereas Cluster B had 18%/82% and Cluster A 13%/87% responders/residual disease cases, respectively.
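The paragraph above reports two different percentages: each cluster's share of all responders, and the response rate within each cluster. A small worked example with hypothetical counts shows why both are needed when cluster sizes differ.

```python
# Hypothetical counts per cluster: (responders, total patients).
# These numbers are illustrative only and are NOT taken from the study.
counts = {
    "A": (20, 100),
    "B": (4, 20),
    "C": (25, 50),
}

total_responders = sum(responders for responders, _ in counts.values())

# Share of all responders that fall in each cluster (depends on cluster size).
share_of_responders = {
    cluster: responders / total_responders
    for cluster, (responders, _) in counts.items()
}

# Response rate within each cluster (independent of cluster size).
response_rate = {
    cluster: responders / total
    for cluster, (responders, total) in counts.items()
}
```

In this toy case Cluster B contributes the smallest share of responders simply because it is small, while its within-cluster response rate equals that of Cluster A; only the second quantity is comparable across clusters of unequal size.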

As the pCR rate differs as a function of ER status²⁹, we also calculated the percentage of responders in ER-positive and ER-negative cases independently and found the lowest rate of responders in Cluster B regardless of ER status (Supplementary Fig. 8A, B, respectively).

For each cohort with response to neoadjuvant chemotherapy, we assessed the distribution (chi-square p values, Supplementary Table 7) of the pCR and non-pCR cases across the immune clusters taking into account all cases, or ER-positive and ER-negative cases independently. When considering the whole cohort, we found the distribution of the responders significantly different across immune clusters, with the fewest responders in Cluster B and the most responders in Cluster C. When splitting by ER status, the same tendency was observed, although it was not always significant.

These results demonstrate that patients in Cluster C have a higher probability of being responders, which corroborates previous studies reporting a higher pCR rate for cases with high immune infiltration and/or proliferative phenotype^29,30. Our results also highlight a low response rate in Cluster B, suggesting that such patients may be candidates for testing of new neoadjuvant therapeutic options.

In silico dissection of the immune clusters

To assess whether the gradual immune infiltration in the clusters could explain the association with prognosis, we tested which of the immune clusters or total immune infiltration scores was more predictive of survival in a Cox multivariable regression analysis (Supplementary Table 8). Nanodissect lymphocyte scores were poorly associated with survival; we therefore hypothesized that specific immune cell-type mixtures, rather than the total number of immune cells in the tumor microenvironment, may explain the poor prognosis in Cluster B.

We estimated the proportions of 22 distinct immune cell types using the CIBERSORT algorithm¹⁹. We calculated per cohort and cluster the median infiltration of each immune cell type and performed unsupervised clustering of such cell-type-specific median infiltration scores (Fig. 5a). We found that the CIBERSORT inferred immune infiltration recapitulated the immune clusters. Cluster C cases were enriched, among other cell types, for macrophages M1, memory activated T cells, and follicular T helper cells (Fig. 5a), as also illustrated by the distribution of the CIBERSORT scores in the METABRIC and the TCGA cohorts (Fig. 5b and Supplementary Fig. 9). Cluster A had, as expected, very low levels of immune cells. In the poor response and prognosis Cluster B, higher levels of macrophages M2, resting mast cells, and resting memory T cells were found (Fig. 5a), as also illustrated by density plots for the METABRIC and TCGA cohorts (Fig. 5b and Supplementary Fig. 9).

Using generalized linear models, we specified the immune cell types distinguishing Cluster B from Clusters A and C and identified resting and pro-tumorigenic immune cell types explaining Cluster B (Fig. 5c). We also tested which immune cell types explained the differences between Cluster A and Cluster B (Supplementary Fig. 10A) and between Cluster B and Cluster C (Supplementary Fig. 10B). When comparing Cluster A to Cluster B, all immune cell types could explain Cluster B, because Cluster A has no or low immune infiltration. When comparing Cluster B to Cluster C, we again found the pro-tumorigenic cell types macrophages M2 and resting mast cells explaining Cluster B. These results suggest that pro-tumorigenic immune infiltration in Cluster B may favor tumor growth. In conclusion, Cluster A is composed of immune-cold tumors, Cluster C contains immune-hot tumors, and cases in Cluster B have a pro-tumorigenic immune infiltration.

Phenotypic analysis of the immune clusters

To further characterize the phenotype associated with the poor prognosis in Cluster B, we identified through differential gene expression analysis the genes significantly overexpressed in Cluster B. We found 909 genes upregulated in Cluster B when compared to Cluster A and Cluster C separately (Bonferroni-corrected p value < 0.0001; Supplementary Data 3). These genes were associated with stem cell biology and EMT, as shown by gene set enrichment analysis (GSEA) using the H and C2 collections of the MsigDB³¹ (Fig. 6a).

To further characterize the relationship between the immune clusters and cancer cell phenotype, we used gene sets associated with EMT, stem cells, hypoxia, and proliferation. In total, 11 gene sets from the MsigDB and an additional EMT-related signature from Tan et al.³² were selected (Supplementary Data 3). We calculated per cluster and cohort an average gene set enrichment score using the GSVA method; this score reflects the activity of each pathway/gene set in an immune cluster³³. Unsupervised clustering of averaged-gene-set scores clearly separated the immune Clusters A and C, while Cluster B was divided into two subgroups (Fig. 6b). These results suggested an association between immune clusters and the stem cell/EMT-related gene signatures.
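To give an idea of what a per-sample gene set score and its per-cluster average look like, the sketch below uses a deliberately simpler stand-in for GSVA: the mean z-score of the genes in the set. This is not the GSVA algorithm, and the gene names, expression values, and cluster labels are hypothetical placeholders.

```python
import statistics

def zscores(values):
    """Standardize one gene across samples (assumes the gene is not constant)."""
    mu = statistics.mean(values)
    sd = statistics.pstdev(values)
    return [(v - mu) / sd for v in values]

def gene_set_scores(expr, gene_set):
    """Per-sample mean z-score over the genes of a set.

    expr maps gene name -> list of expression values (one per sample).
    This is a simplified stand-in, NOT the GSVA method itself.
    """
    z = {g: zscores(expr[g]) for g in gene_set if g in expr}
    n_samples = len(next(iter(z.values())))
    return [statistics.mean(z[g][i] for g in z) for i in range(n_samples)]

# Hypothetical expression matrix: placeholder gene names, four samples.
expr = {
    "G1": [1, 2, 3, 4],
    "G2": [2, 4, 6, 8],
    "G3": [5, 5, 1, 1],
}
toy_gene_set = ["G1", "G2"]          # placeholder set, not an MSigDB signature
clusters = ["A", "A", "B", "B"]      # hypothetical cluster of each sample

scores = gene_set_scores(expr, toy_gene_set)
cluster_mean = {
    c: statistics.mean(s for s, label in zip(scores, clusters) if label == c)
    for c in sorted(set(clusters))
}
```

Averaging sample-level scores within each cluster and cohort yields a small matrix of gene sets by clusters, which is the kind of input that can then be clustered without supervision.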

Two mutually exclusive phenotypes in breast cancer

Through unsupervised clustering of GSVA enrichment scores, we identified two mutually exclusive gene signatures in breast cancer: (i) one associated with proliferation and an embryonic stem cell-like phenotype and (ii) the other with EMT and a mammary stem cell phenotype.

A proliferative phenotype dominated Cluster C (Supplementary Fig. 11A); the same was observed when gene set scores were calculated for each METABRIC sample (Supplementary Fig. 11B). In Cluster B, the average gene set scores were high for either EMT or proliferation-related signatures (Supplementary Fig. 11C). At the sample level in METABRIC, we observed a similar pattern, with samples having one or the other state activated (Supplementary Fig. 11D). Cluster A showed low scores for both the EMT and proliferative states (Supplementary Fig. 11E, F).

To formally identify which gene set scores explained Cluster B, we tested how each gene set contributes to Cluster B vs Clusters A and C using generalized linear models. EMT signatures contributed positively to Cluster B while proliferation and cell motility were associated with Clusters A and C (Fig. 6c). We also tested which gene set score explained Cluster B when compared separately to Cluster A (Supplementary Fig. 12A) or Cluster C (Supplementary Fig. 12A). We found in both cases that EMT scores were a significant explanatory variable of Cluster B. However, EMT signature scores alone were not of strong prognostic value according to Cox regression analysis (Supplementary Table 9). Overall, these results suggest mutual exclusivity between EMT and proliferation in breast cancers. They also suggest that the EMT or the proliferative phenotype results in poor prognosis only when accompanied by a certain immune contexture.

Correlation between tumor phenotype and immune infiltration

As immune clusters were associated with both (i) immune cell types and (ii) gene set signatures, we formally assessed the relation between immune infiltration (CIBERSORT) and cancer cell characteristics (gene set scores). Figure 6d shows that the proliferation and EMT scores correlate significantly with different types of immune cells. Notably, high EMT scores are associated with macrophages M2, resting mast cells, and resting memory T cells, while high proliferation is correlated with a more active adaptive tumor microenvironment (macrophages M1, T helper cells, activated dendritic cells, and active memory T cells). These data suggest a continuum between the cancer cell phenotype and the composition of the tumor microenvironment.

Heterogeneity in gene set scores within Cluster B

Cluster B was dominated by samples with pro-tumorigenic immune infiltration and high EMT signal; however, ~35% of Cluster B samples also exhibited a proliferative phenotype. To explore this heterogeneity within Cluster B, we grouped samples according to the gene signature scores in an unsupervised manner into B1 dominated by the EMT phenotype and B2 by the proliferation (Fig. 6e).

In the METABRIC and TCGA cohorts, B2 cases with the proliferative phenotype had a worse outcome (Fig. 6f, g; see also Supplementary Fig. 13, in which survival probabilities of B1 and B2 are plotted with Cluster A and Cluster C). While we were able to identify a difference in survival between Clusters B1 and B2 in METABRIC and TCGA, it was difficult to draw conclusions for the other, smaller cohorts, as further splitting Cluster B resulted in small groups. To further assess whether the heterogeneity in gene set scores was accompanied by heterogeneity in immune contexture, we looked for differences in specific immune cell types between sub-clusters B1 and B2. Unsupervised clustering in Supplementary Fig. 14 showed that the two sub-clusters B1 and B2 both have a pro-tumorigenic/resting immune microenvironment.

Altogether, the two mutually exclusive states within Cluster B may be relevant with regard to prognosis; however, a unifying factor of Cluster B is the presence of a pro-tumorigenic/resting immune microenvironment.

Discussion

The tumor microenvironment plays an important role in breast cancer pathogenesis. We provide a new immune-related subtype in breast cancer with relevance for prognosis and response to neoadjuvant chemotherapy in both ER-positive and ER-negative cases. The herein described immune clusters are dependent on both the abundance and composition of the immune infiltrate and are independent of other prognostic factors, including PAM50.

Through unsupervised clustering using the expression of genes from the nCounter® PanCancer Immune Profiling Panel, we identified three clusters of patients in FFPE and fresh frozen breast tumors. These clusters (i) were associated with total levels of immune infiltration and with a specific immune microenvironment, (ii) provided independent prognostic information, and (iii) revealed two mutually exclusive breast cancer phenotypes.

As the immune clusters provided an independent prognostic value in breast cancer, we developed a simple method that refined and accurately predicted whether a sample falls in the poor prognosis cluster (Cluster B) or not. We tested our method successfully in 15 cohorts, spanning 6101 breast cancer samples. We demonstrate using different and complementary statistical approaches the strength of the immune cluster as a new prognostic biomarker.

Through phenotypical characterization of the immune clusters, we also identified two mutually exclusive states in breast cancers, one associated with EMT and the other with proliferation. A similar observation of two mutually exclusive states: proliferative and EMT, was recently reported in a pan-cancer genomic analysis of metastatic tumors³⁴. Our study therefore suggests that such a mutual exclusion could be extended to primary breast tumors and possibly to other primary cancer types. The EMT process has often been associated with metastasis³⁵; it has been also previously suggested that transcription factors such as TWIST1, which may drive the EMT process, need to be turned off for the cancer cell to proliferate³⁶. Such a mechanism may explain why these two processes could not coexist in cancer cells.

Samples with the EMT or proliferative phenotype were found in the poor prognosis cluster (Cluster B). About 65% of the samples in Cluster B had an EMT-like phenotype. We further found that this dominating phenotype could help explain Cluster B when compared to Clusters A and C using generalized linear models. As opposed to that, 35% of the samples in Cluster B had a proliferative phenotype like most of the Cluster C samples. Proliferation in Cluster C associated with infiltration of active anti-tumorigenic immune cells, while proliferative samples in Cluster B had infiltration of immune cells less likely to eradicate cancer cells (macrophages M2, resting mast cells). This indicates that in breast cancer a proliferative phenotype associated with a non-adapted, pro-tumorigenic, resting immune microenvironment relates to an adverse outcome as indicated by the Kaplan–Meier analysis (Fig. 6f, g).

Many studies have suggested that EMT drives an aggressive tumor phenotype in breast cancer^37,38. However, recent studies have questioned the role of EMT in tumorigenesis, progression, and metastasis³⁹. Importantly, we show here that specific immune infiltration is associated with the EMT process in breast cancer. As a recent study also suggests⁴⁰, we highlight that immune contexture is an important factor to consider when evaluating the role of EMT during cancer pathogenesis.

Using CIBERSORT¹⁹ to infer specific immune cell infiltration, we found the EMT state highly correlated with infiltration of resting mast cells, macrophages M2, natural killer (NK) cells, and resting memory T cells. It has been previously suggested that EMT could be associated with a pro-tumorigenic microenvironment in lung cancer⁴¹. In esophageal squamous cell carcinoma, M2 macrophages promote migration and invasion and enhance EMT⁴². Mast cells, in turn, have been associated with angiogenesis in breast cancer⁴³. Based on several gene expression datasets, our current results demonstrate that the EMT process is accompanied by infiltration of pro-tumorigenic/resting immune cell types. The presence of antitumorigenic immune cells, like NK cells, has also been found to be highly correlated with EMT in melanoma⁴⁴.

In Cluster C, a proliferative phenotype was found to be correlated with infiltration of activated dendritic cells, T helper cells, macrophages M1, and CD4 memory T cells. These cell types reflect an antitumoral microenvironment. Cluster C is dominated by both a highly proliferative phenotype and high infiltration of antitumoral cell types. One may argue that chemotherapies successfully eradicate such proliferative tumors with the support of an antitumoral microenvironment, which would explain the better outcome of these patients and the higher rate of responders to neoadjuvant chemotherapy in Cluster C (Fig. 4e).

Low immune infiltration in Cluster A was associated with neither the proliferative nor the EMT state, which may indicate a less aggressive tumor phenotype.

Previous studies have suggested that basal-like breast cancers display a high metastatic ability associated with mesenchymal features⁴⁵. Sarrio et al.⁴⁶ further showed that several markers of EMT were upregulated in basal-like breast cancers. Using recent algorithms, our study shows that the EMT phenotype is enriched in Cluster B. In breast cancer, a recent transcriptional profiling study identified an EMT gene expression signature associated with claudin-low and metaplastic breast cancers⁴⁷. However, the claudin-low subtype in the METABRIC cohort did not correlate with Cluster B.

Our study suggests that targeting the primary pathways involved in EMT, such as transforming growth factor-β⁴⁸, E-cadherin⁴⁹, the WNT/β-catenin pathway⁵⁰, Notch⁵¹, hypoxia, or tumor necrosis factor-alpha⁵², offers interesting opportunities for therapeutic intervention in patients with the worst prognosis (Cluster B). More importantly, the macrophage re-education strategy, which proposes to remodel M2-type macrophages into an antitumor, “M1-like” mode⁵³, could be beneficial for Cluster B patients.

In the era of modern immunotherapy, a few clinical trials using immune checkpoint inhibitors have been conducted in breast cancer, and combinations with immunogenic chemotherapy or radiation therapy are planned. The results of the first clinical trials using monoclonal antibodies against immune checkpoints have recently been communicated and show some degree of response, especially in certain subpopulations^54,55. Our study suggests that considering both the immune cell types infiltrating the tumor and the main state of the tumor (EMT or proliferative) will refine treatment decisions and improve response to these new treatment strategies.

Methods

Gene expression analysis from FFPE

Operable early breast cancer patients were included in the Oslo1 micrometastasis observational study between 1995 and 1998⁵⁶. Informed consent was obtained from all participants and the study was approved by the local ethical committee (S-97103). FFPE samples were collected for a subset of patients who also had fresh primary tumors collected for detailed molecular analyses, a cohort called MicMa. Only patients within the MicMa subset (n = 96) were included in the current analysis. FFPE tissue was first examined with H&E staining to determine the tumor area, and dissection was performed to mainly include tumor tissue. RNA purification was performed using the Roche® High Pure FFPET RNA Isolation Kit; ≥1–5 10-μm FFPE slides were used for each tumor. A minimum of ∼100 ng of total RNA was used on the nCounter platform (Nanostring Technologies, Seattle, WA, US) with the PanCancer Immune Profiling Panel⁵⁷. Data were normalized using all housekeeping genes and log base 2 transformed.

RNA-seq analysis of the OSLO2-EMIT0 cohort

The OSLO2 breast cancer cohort is a study collecting material from breast cancer patients with primary operable disease in several hospitals in south-eastern Norway. Inclusion of patients started in 2006 and is still ongoing. The study was approved by the Norwegian Regional Committee for Medical Research Ethics (approval number 1.2006.1607, amendment 1.2007.1125). Patients gave written consent for the use of material for research purposes. All experimental methods performed are in compliance with the Helsinki Declaration. Tumor tissue was cut into pieces and mixed before distribution to RNA extraction. RNA was isolated using the AllPrep DNA/RNA/miRNA Universal kit on the QIAcube instrument (Qiagen). Quality control was performed by Nanodrop ND-1000 (NanoDrop Technologies) and BioAnalyzer 2100 (Agilent) analysis. All RNA had an RNA Integrity Number (RIN) ≥ 6. We used Illumina's TruSeq Stranded mRNA Library Prep Kit for the automated NeoPrep Library Prep System (Illumina). The starting amount was 120 ng total RNA, and sequencing was performed on Illumina NextSeq500 sequencers (2 × 75 bp). Raw sequencing read data were demultiplexed and filtered using Bowtie2 against ribosomal, phiX174, and UCSC RepeatMasker sequences. The sequence data were processed as described previously⁵⁸. Log-transformed FPKM RNA-seq gene expression data are available at GEO under accession GSE135298. Raw data are available at EGAS00001003631.

Data collection and processing

Publicly available expression data from breast cancer cohorts were used in this study. Patients’ consents and ethical approval are available in the respective original articles the datasets were published with (Supplementary Table 2). Expression data were obtained from Gene Expression Omnibus, the European Genome-phenome Archive, ArrayExpress, or TCGA data portals. For survival analyses, we selected studies with >100 samples and relevant survival data from patients with invasive breast tumors sampled at the time of surgical resection without neoadjuvant treatment. Survival data were of four types: relapse-free survival, distant metastasis-free survival, OS, or breast cancer-specific survival.

For analysis of response to neoadjuvant chemotherapy, we selected cohorts of patients treated with a chemotherapeutic regimen and for which gene expression has been profiled from the primary tumor prior to treatment. pCR was assessed at the time of surgery at the end of treatment and refers to the total elimination of cancer cells at surgery.

Except for the METABRIC cohort, for which ER status has been extensively used and defined, we systematically inferred ER status from gene expression data by fitting a two-component Gaussian finite mixture model by maximum likelihood with the R function optim, as previously described⁵⁹. Classification into the PAM50 intrinsic molecular subtypes was performed based on gene expression data using the genefu package in R³.
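The mixture-model step can be illustrated with a minimal sketch. It is not the authors' implementation: it fits the two Gaussian components with a plain expectation-maximization loop rather than a call to optim, and it assumes the input is a single per-sample expression value for an ER-related gene (the variable name esr1_expression is hypothetical). Samples are called positive when the higher-mean component is the more likely one.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution with the given mean and standard deviation."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def fit_two_gaussians(values, n_iter=200):
    """Fit a two-component Gaussian mixture by EM; returns (weights, means, sds).

    Assumes at least two distinct values so the starting standard deviation is non-zero.
    """
    xs = sorted(values)
    half = len(xs) // 2
    # Start from the means of the lower and upper halves of the data.
    means = [sum(xs[:half]) / half, sum(xs[half:]) / (len(xs) - half)]
    grand_mean = sum(xs) / len(xs)
    grand_sd = math.sqrt(sum((x - grand_mean) ** 2 for x in xs) / len(xs))
    sds = [grand_sd, grand_sd]
    weights = [0.5, 0.5]
    for _ in range(n_iter):
        # E-step: posterior probability that each value belongs to component 1.
        resp = []
        for x in values:
            p0 = weights[0] * normal_pdf(x, means[0], sds[0])
            p1 = weights[1] * normal_pdf(x, means[1], sds[1])
            total = p0 + p1
            resp.append(p1 / total if total > 0 else 0.5)
        # M-step: re-estimate weight, mean, and standard deviation of each component.
        for k in (0, 1):
            r = resp if k == 1 else [1.0 - p for p in resp]
            mass = sum(r)
            if mass == 0:
                continue
            weights[k] = mass / len(values)
            means[k] = sum(ri * x for ri, x in zip(r, values)) / mass
            var = sum(ri * (x - means[k]) ** 2 for ri, x in zip(r, values)) / mass
            sds[k] = max(math.sqrt(var), 1e-6)
    return weights, means, sds


def call_er_status(values):
    """Label each value 'positive' if the higher-mean component is more likely."""
    weights, means, sds = fit_two_gaussians(values)
    hi = 0 if means[0] > means[1] else 1
    lo = 1 - hi
    calls = []
    for x in values:
        p_hi = weights[hi] * normal_pdf(x, means[hi], sds[hi])
        p_lo = weights[lo] * normal_pdf(x, means[lo], sds[lo])
        calls.append("positive" if p_hi > p_lo else "negative")
    return calls


# Hypothetical toy values: five low and five high expression measurements.
esr1_expression = [1.0, 1.2, 0.8, 1.1, 0.9, 6.0, 6.2, 5.8, 6.1, 5.9]
er_calls = call_er_status(esr1_expression)
```

On real cohorts the two components overlap, so the posterior probabilities themselves may be more informative than a hard call.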

Gene set enrichment analysis

Gene set enrichment analysis was performed using the Molecular Signatures Database v4.0 (MSigDB³¹) H and C2 collections. Enrichment was assessed by hypergeometric testing.
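As an illustration of the hypergeometric test mentioned above, the following sketch computes the upper-tail probability of observing at least a given overlap between a gene list and a gene set. The function and argument names are hypothetical, and the choice of gene universe, which the text does not specify, is left to the caller.

```python
from math import comb


def hypergeom_enrichment_p(universe_size, set_size, list_size, overlap):
    """P(X >= overlap) for X ~ Hypergeometric(universe_size, set_size, list_size).

    universe_size: number of genes that could have been selected
    set_size:      number of those genes belonging to the gene set
    list_size:     number of genes in the list being tested
    overlap:       number of listed genes that fall in the gene set
    """
    max_overlap = min(set_size, list_size)
    total = comb(universe_size, list_size)
    tail = sum(
        comb(set_size, k) * comb(universe_size - set_size, list_size - k)
        for k in range(overlap, max_overlap + 1)
    )
    return tail / total


# Toy example: 10 genes in the universe, 4 in the set, 3 listed, 2 overlapping.
p_value = hypergeom_enrichment_p(10, 4, 3, 2)
```

When many gene sets are tested, as with the H and C2 collections, the resulting p values would normally be corrected for multiple testing.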

Unsupervised clustering to obtain immune clusters

First, a correlation matrix was calculated to assess the dependence between samples, initially based on the expression of the 760 genes in the nCounter® PanCancer Immune Profiling Panel and later on the 509 genes that are present in all clustered datasets (training in Supplementary Table 2). Hierarchical clustering of the patients’ correlation matrix was performed using the R package pheatmap v1.0.12, with correlation as clustering distance and ward.D as linkage. Clusters were identified using the cutree function. To determine the optimal number of clusters for each cohort, we used silhouette analysis of KMeans with the cluster R package; for most of the cohorts assessed, three clusters fitted better than a larger number of clusters.
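The silhouette criterion used to compare cluster numbers can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: it takes cluster labels as given (for example from cutree), uses 1 minus the Pearson correlation as the distance between samples, and all function names are hypothetical.

```python
import math


def pearson(a, b):
    """Pearson correlation between two equally long expression profiles."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def correlation_distance_matrix(samples):
    """Pairwise distance 1 - r between samples (each sample is a list of gene values)."""
    n = len(samples)
    return [[1.0 - pearson(samples[i], samples[j]) for j in range(n)] for i in range(n)]


def mean_silhouette(dist, labels):
    """Average silhouette width for a labeling with at least two clusters."""
    n = len(labels)
    widths = []
    for i in range(n):
        own = [dist[i][j] for j in range(n) if j != i and labels[j] == labels[i]]
        if not own:
            widths.append(0.0)  # singleton clusters get width 0 by convention
            continue
        a = sum(own) / len(own)  # mean distance to the sample's own cluster
        b = min(                 # mean distance to the nearest other cluster
            sum(dist[i][j] for j in range(n) if labels[j] == other) / labels.count(other)
            for other in set(labels)
            if other != labels[i]
        )
        widths.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(widths) / n


# Toy data: two pairs of samples with opposite expression trends.
toy_samples = [[1, 2, 3, 4], [2, 4, 6, 8.5], [4, 3, 2, 1], [8, 6, 4, 1.5]]
toy_dist = correlation_distance_matrix(toy_samples)
good = mean_silhouette(toy_dist, [0, 0, 1, 1])
bad = mean_silhouette(toy_dist, [0, 1, 0, 1])
```

Computing this average for labelings with two, three, four, or more clusters and keeping the number with the highest value mirrors the selection step described in the text.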

Nanodissect analysis, lymphoid and myeloid scores

The algorithm Nanodissect (http://nano.princeton.edu) was used as previously described to predict lymphoid and myeloid infiltration^24,25. Breast collection data (May 2013), which contains 17,940 genes measured on 622 arrays, was inspected for genes specifically expressed in lymphoid or myeloid cell types and not expressed in mammary gland or mammary epithelium. The genes with >65% probability of being positive lymphocyte- or myelocyte-specific standard genes, as opposed to mammary gland or epithelium genes, were used in downstream analysis. Nanodissect scores for lymphocyte or myelocyte infiltration reflect the average expression of the respective genes (Supplementary Data 4) in a sample.

CIBERSORT analysis

The algorithm CIBERSORT was used on normalized expression data to infer the absolute proportions of 22 types of infiltrating immune cells. CIBERSORT is a deconvolution algorithm that uses a set of reference gene expression values (547 genes) to predict 22 immune cell type proportions from bulk tumor sample expression data by using support vector regression¹⁹. To assess the reliability of the deconvolution method, CIBERSORT derives a p value for each sample. CIBERSORT software package was obtained from the developers, and analysis was performed by using the default signature matrix at 1000 permutations.

Single-sample GSEA (GSVA)

Gene set analysis was carried out using the GSVA Bioconductor package v1.30.0³³. We curated gene sets for various epithelial mesenchymal transition, stem cell, proliferation, and cell cycle-related pathways (Supplementary Data 3). For each sample, a score for the enrichment of a set of genes using gene expression profile was obtained.

Binomial logistic regression to predict immune clusters

We used binomial logistic regression through the glmnet v2.0-16 R package⁶⁰ to develop a method that assigns any given sample either to the group with the worst prognosis or to the remaining samples, without resorting to unsupervised clustering. This predictor method is highly efficient for smaller cohorts and allows a class to be assigned to single samples. To perform the analysis, we mean-centered the datasets and set up a logistic regression using the binomial distribution to predict the categorical response with two possible outcomes: being in the bad-prognosis group or not. This approach gave a signature of target genes that together captured the variation associated with the two categories (Supplementary Data 1).

Patients were divided into Cluster B or Clusters A and C groups according to the following index for patient i:

$$\mathrm{Index}_i = \sum_{g=1}^{n} \beta_g X_{gi} \qquad (1)$$

where g is the target (gene), n is the number of targets, β_g is the lasso coefficient for target gene g, and X_gi is the expression value of gene g in sample i. If the index for patient i is higher than the intercept (1.206538657), the sample is assigned to Cluster B.
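The index and the assignment rule translate directly into a few lines of code. In this sketch the gene names and coefficients are hypothetical placeholders; the actual signature and lasso coefficients are those listed in Supplementary Data 1, and the expression values are assumed to be mean centered as described above. Only the threshold is taken from the text.

```python
INTERCEPT_THRESHOLD = 1.206538657  # threshold stated in the text


def cluster_b_index(coefficients, expression):
    """Sum of coefficient x expression over the signature genes (Eq. 1)."""
    return sum(beta * expression[gene] for gene, beta in coefficients.items())


def assign_cluster(coefficients, expression, threshold=INTERCEPT_THRESHOLD):
    """Assign a sample to Cluster B when its index exceeds the threshold."""
    index = cluster_b_index(coefficients, expression)
    return "Cluster B" if index > threshold else "Clusters A/C"


# Hypothetical two-gene signature and two hypothetical mean-centered samples.
toy_coefficients = {"GENE_1": 0.5, "GENE_2": -0.25}
sample_high = {"GENE_1": 4.0, "GENE_2": 2.0}  # index = 1.5
sample_low = {"GENE_1": 1.0, "GENE_2": 2.0}   # index = 0.0
```

A sample lacking one of the signature genes raises a KeyError here, which makes incomplete gene coverage visible rather than silently changing the index.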

Pathological assessment of immune infiltration

Vascular invasion, inflammatory cell infiltrate, and necrosis, including relation of tumor cells/tumor stroma, were evaluated on slides stained with H&E as previously described⁶¹. Using a simple microscope, subjective categorization of inflammatory cell infiltrate into the categories of “low,” “moderate,” “high,” and “severe” was performed based on the frequency of mononuclear inflammatory cell infiltration observed in the invasive tumor.

ROR score calculation

ROR scores for each sample were calculated as described in ref. ³, ROR-Score = 0.05 × Basal + 0.12 × Her2-enriched − 0.34 × Luminal A + 0.23 × Luminal B; where Basal, Her2-enriched, Luminal A, and Luminal B are the correlation of each sample to the centroid obtained using the genefu package in R.
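For reference, the ROR formula above can be written as a small function. The weights are those given in the text; the argument names are hypothetical, and the four inputs are assumed to be a sample's correlations to the respective subtype centroids.

```python
def ror_score(basal, her2_enriched, luminal_a, luminal_b):
    """Risk-of-recurrence score from the four subtype centroid correlations."""
    return 0.05 * basal + 0.12 * her2_enriched - 0.34 * luminal_a + 0.23 * luminal_b


# Hypothetical sample that correlates mostly with the Luminal A centroid.
toy_ror = ror_score(basal=0.2, her2_enriched=0.1, luminal_a=0.8, luminal_b=0.3)
```

The negative Luminal A weight means that samples resembling this centroid receive lower scores.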

Statistical, survival, multivariable Cox regression analysis

All analyses were performed in R version 3.3.2. Unless otherwise stated, results were considered statistically significant if the p value was < 0.05. Kaplan–Meier estimation and log-rank tests were performed using the functions Surv, survfit, and survdiff (R package survival v2.42-3). Multivariable Cox regression analyses were used to test the independent prognostic value of the immune clusters using the R package survival and the coxph function. Mann–Whitney U or Kruskal–Wallis tests were used to assess statistical significance within boxplots.
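The survival analyses in the study were run with the R survival package. As a language-neutral illustration of what the Kaplan–Meier estimator computes, the sketch below implements the product-limit formula in plain Python. The function name and the toy data are hypothetical, and events are coded 1 for an observed event and 0 for censoring.

```python
def kaplan_meier(times, events):
    """Return [(time, survival probability)] at each distinct event time."""
    order = sorted(zip(times, events))
    at_risk = len(order)
    survival = 1.0
    curve = []
    i = 0
    while i < len(order):
        t = order[i][0]
        n_events = 0
        n_leaving = 0
        # Group all subjects sharing the same follow-up time.
        while i < len(order) and order[i][0] == t:
            n_events += order[i][1]
            n_leaving += 1
            i += 1
        if n_events > 0:
            # Product-limit step: multiply by the fraction surviving this time point.
            survival *= 1.0 - n_events / at_risk
            curve.append((t, survival))
        # Both events and censored subjects leave the risk set after time t.
        at_risk -= n_leaving
    return curve


# Toy follow-up data: three events and two censored observations.
toy_curve = kaplan_meier([1, 2, 2, 3, 4], [1, 1, 0, 1, 0])
```

Censored subjects contribute to the risk set up to their last follow-up time but never lower the curve themselves.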

In the box-and-whisker plots, the line within each box represents the median. Upper and lower edges of each box represent 75th and 25th percentile, respectively. The whiskers represent the lowest datum still within [1.5 × (75th − 25th percentile)] of the lower quartile and the highest datum still within [1.5 × (75th − 25th percentile)] of the upper quartile.
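The box and whisker definitions above can be checked numerically with a short sketch. It assumes linearly interpolated quartiles (the "inclusive" method of Python's statistics module); R's boxplot uses hinges, which can differ slightly for small samples. The function name is hypothetical.

```python
import statistics


def box_stats(values):
    """Median, quartiles, and whisker ends following the 1.5 x IQR rule."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        # Whiskers end at the most extreme data points still inside the fences.
        "lower_whisker": min(v for v in values if v >= lower_fence),
        "upper_whisker": max(v for v in values if v <= upper_fence),
        "outliers": [v for v in values if v < lower_fence or v > upper_fence],
    }


toy_box = box_stats([1, 2, 3, 4, 5, 6, 7, 8, 9, 30])
```

In this toy example the value 30 lies beyond the upper fence, so the upper whisker stops at 9 and 30 is reported as an outlier.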

To identify differentially expressed genes between clusters, we used a t test followed by Bonferroni correction of the p value. A strict corrected p value (p < 0.0001) was used to identify differentially expressed genes.
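The correction and filtering step can be sketched as follows, assuming per-gene p values from the t test are already available. The gene names and p values are hypothetical; the 0.0001 cut-off is the one stated above.

```python
def bonferroni(p_values):
    """Bonferroni adjustment: multiply each p value by the number of tests, capped at 1."""
    n_tests = len(p_values)
    return [min(1.0, p * n_tests) for p in p_values]


def significant_genes(p_by_gene, alpha=0.0001):
    """Genes whose Bonferroni-corrected p value is below alpha."""
    genes = list(p_by_gene)
    adjusted = bonferroni([p_by_gene[g] for g in genes])
    return [g for g, p in zip(genes, adjusted) if p < alpha]


# Hypothetical raw p values for three genes.
toy_p_values = {"GENE_1": 1e-8, "GENE_2": 5e-5, "GENE_3": 0.5}
toy_hits = significant_genes(toy_p_values)
```

GENE_2 would pass an uncorrected threshold of 0.0001 but not the corrected one, which illustrates how conservative this procedure is.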

NRI and IDI were calculated using the survIDINRI v1.1-1 R package. To assess the 95% CI and p values for the IDI and NRI, a standard bootstrap method was used with resampling performed 500 times. To assess the improvement in performance of the survival model, NRI and IDI were evaluated at the maximum follow-up time presented in the Kaplan–Meier survival analysis.

Forest plots were obtained using the forestplot v1.7.2 R package and show the hazard ratios and their 95% CIs for the univariate and multivariate analyses. Boxes represent hazard ratios, with box size inversely proportional to the width of the CI; horizontal lines represent the 95% CI.

Correlation plots were generated with the corrplot v0.84 package and visualize Spearman correlations; only correlations that remain significant after false discovery rate correction are shown, colored according to the direction of the rho value. The size of each dot is proportional to the rho value.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All data used in this study are publicly available or can be downloaded through the European Genome-phenome Archive (EGA) EMBL-EBI portal. We mainly used gene expression datasets from breast cancers, summarized in Supplementary Table 2. Data were downloaded with the study-specific normalization process. For initial clustering of the correlation matrix, no other normalization was performed. For binomial logistic regression and further downstream analysis (CIBERSORT, GSVA, differential expression), the datasets were mean centered. The source data underlying all Figures and Supplementary Figures are available in a source data file. The newly generated RNA-seq gene expression data for the breast cancer cohort OSLO2-EMIT0 are available at EGA with accession number EGAS00001003631. Log-transformed FPKM RNA-seq gene expression data are available at GEO under accession GSE135298. Newly generated, normalized log 2-transformed nCounter counts for the MicMa cohort can be found in Supplementary Data 5.

Code availability

To reproduce all figures published in this study, we provide all code and relevant data in a source data file. In addition, the immune cluster subtyping tool is available online at http://eurostar.nebdal.no:5000/, and the code to subtype samples using R or Python is available at https://github.com/dnebdal/clusterscore.

References

Denkert, C. et al. Tumour-infiltrating lymphocytes and prognosis in different subtypes of breast cancer: a pooled analysis of 3771 patients treated with neoadjuvant therapy. Lancet Oncol. 19, 40–50 (2018).
Quail, D. F. & Joyce, J. A. Microenvironmental regulation of tumor progression and metastasis. Nat. Med. 19, 1423–1437 (2013).
Parker, J. S. et al. Supervised risk predictor of breast cancer based on intrinsic subtypes. J. Clin. Oncol. 27, 1160–1167 (2009).
Prat, A. et al. Clinical implications of the intrinsic molecular subtypes of breast cancer. Breast 24(Suppl 2), S26–S35 (2015).
Blok, E. J. et al. Systematic review of the clinical and economic value of gene expression profiles for invasive early breast cancer available in Europe. Cancer Treat. Rev. 62, 74–90 (2018).
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: the next generation. Cell 144, 646–674 (2011).
Burnet, F. M. The concept of immunological surveillance. Prog. Exp. Tumor Res. 13, 1–27 (1970).
Ostrand-Rosenberg, S. Immune surveillance: a balance between protumor and antitumor immunity. Curr. Opin. Genet. Dev. 18, 11–18 (2008).
Manuel, M. et al. Lymphopenia combined with low TCR diversity (divpenia) predicts poor overall survival in metastatic breast cancer patients. Oncoimmunology 1, 432–440 (2012).
Papatestas, A. E., Lesnick, G. J., Genkins, G. & Aufses, A. H. Jr. The prognostic significance of peripheral lymphocyte counts in patients with breast carcinoma. Cancer 37, 164–168 (1976).
Ali, H. R. et al. Association between CD8+ T-cell infiltration and breast cancer survival in 12,439 patients. Ann. Oncol. 25, 1536–1543 (2014).
Mahmoud, S. M. et al. Tumor-infiltrating CD8+ lymphocytes predict clinical outcome in breast cancer. J. Clin. Oncol. 29, 1949–1955 (2011).
Pruneri, G., Vingiani, A. & Denkert, C. Tumor infiltrating lymphocytes in early breast cancer. Breast 37, 207–214 (2018).
Ali, H. R., Chlon, L., Pharoah, P. D., Markowetz, F. & Caldas, C. Patterns of immune infiltration in breast cancer and their clinical implications: a gene-expression-based retrospective study. PLoS Med. 13, e1002194 (2016).
Aran, D., Hu, Z. & Butte, A. J. xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol. 18, 220 (2017).
Clancy, T. et al. Bioinformatics approaches to profile the tumor microenvironment for immunotherapeutic discovery. Curr. Pharm. Des. 23, 4716–4725 (2017).
Dannenfelser, R. et al. Data-driven analysis of immune infiltrate in a large cohort of breast cancer and its association with disease progression, ER activity, and genomic complexity. Oncotarget 8, 57121–57133 (2017).
Galon, J., Angell, H. K., Bedognetti, D. & Marincola, F. M. The continuum of cancer immunosurveillance: prognostic, predictive, and mechanistic signatures. Immunity 39, 11–26 (2013).
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12, 453–457 (2015).
Alexe, G. et al. High expression of lymphocyte-associated genes in node-negative HER2+ breast cancers correlates with lower recurrence rates. Cancer Res. 67, 10669–10676 (2007).
Teschendorff, A. E., Miremadi, A., Pinder, S. E., Ellis, I. O. & Caldas, C. An immune response gene expression module identifies a good prognosis subtype in estrogen receptor negative breast cancer. Genome Biol. 8, R157 (2007).
Bense, R. D. et al. Relevance of tumor-infiltrating immune cell composition and functionality for disease outcome in breast cancer. J. Natl Cancer Inst. https://doi.org/10.1093/jnci/djw192 (2016).
Enerly, E. et al. miRNA-mRNA integrated analysis reveals roles for miRNAs in primary breast tumors. PLoS ONE 6, e16915 (2011).
Fleischer, T. et al. DNA methylation at enhancers identifies distinct breast cancer lineages. Nat. Commun. 8, 1379 (2017).
Ju, W. et al. Defining cell-type specificity at the transcriptional level in human disease. Genome Res. 23, 1862–1873 (2013).
Curtis, C. et al. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature 486, 346–352 (2012).
Dowsett, M. et al. Comparison of PAM50 risk of recurrence score with oncotype DX and IHC4 for predicting risk of distant recurrence after endocrine therapy. J. Clin. Oncol. 31, 2783–2790 (2013).
Pencina, M. J., D’Agostino, R. B. Sr., D’Agostino, R. B. Jr. & Vasan, R. S. Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Stat. Med. 27, 157–172 (2008). discussion 207–112.
Silwal-Pandit, L. et al. The longitudinal transcriptional response to neoadjuvant chemotherapy with and without bevacizumab in breast cancer. Clin. Cancer Res. 23, 4662–4670 (2017).
Tabchy, A. et al. Evaluation of a 30-gene paclitaxel, fluorouracil, doxorubicin, and cyclophosphamide chemotherapy response predictor in a multicenter randomized trial in breast cancer. Clin. Cancer Res. 16, 5351–5361 (2010).
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA 102, 15545–15550 (2005).
Tan, T. Z. et al. Epithelial-mesenchymal transition spectrum quantification and its efficacy in deciphering survival and drug responses of cancer patients. EMBO Mol. Med. 6, 1279–1293 (2014).
Hanzelmann, S., Castelo, R. & Guinney, J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics 14, 7 (2013).
Robinson, D. R. et al. Integrative clinical genomics of metastatic cancer. Nature 548, 297–303 (2017).
Tsai, J. H. & Yang, J. Epithelial-mesenchymal plasticity in carcinoma metastasis. Genes Dev. 27, 2192–2206 (2013).
Tsai, J. H., Donaher, J. L., Murphy, D. A., Chau, S. & Yang, J. Spatiotemporal regulation of epithelial-mesenchymal transition is essential for squamous cell carcinoma metastasis. Cancer Cell 22, 725–736 (2012).
Labelle, M., Begum, S. & Hynes, R. O. Direct signaling between platelets and cancer cells induces an epithelial-mesenchymal-like transition and promotes metastasis. Cancer Cell 20, 576–590 (2011).
Sjoberg, E. et al. A novel ACKR2-dependent role of fibroblast-derived CXCL14 in epithelial-to-mesenchymal transition and metastasis of breast cancer. Clin. Cancer Res. 25, 3702–3717 (2019).
Sikandar, S. S. et al. Role of epithelial to mesenchymal transition associated genes in mammary gland regeneration and breast tumorigenesis. Nat. Commun. 8, 1669 (2017).
Weng, Y. S. et al. MCT-1/miR-34a/IL-6/IL-6R signaling axis promotes EMT progression, cancer stemness and M2 macrophage polarization in triple-negative breast cancer. Mol. Cancer 18, 42 (2019).
Lou, Y. et al. Epithelial-mesenchymal transition is associated with a distinct tumor microenvironment including elevation of inflammatory signals and multiple immune checkpoints in lung adenocarcinoma. Clin. Cancer Res. 22, 3630–3642 (2016).
Zhou, J. et al. IL-1beta from M2 macrophages promotes migration and invasion of ESCC cells enhancing epithelial-mesenchymal transition and activating NF-kappaB signaling pathway. J. Cell. Biochem. 119, 7040–7052 (2018).
Cimpean, A. M. et al. Mast cells in breast cancer angiogenesis. Crit. Rev. Oncol. Hematol. 115, 23–26 (2017).
Huergo Zapico, L. et al. NK cell editing mediates epithelial to mesenchymal transition via phenotypic and proteomic changes in melanoma cell lines. Cancer Res. 78, 3913–3925 (2018).
Prat, A. & Perou, C. M. Deconstructing the molecular portraits of breast cancer. Mol. Oncol. 5, 5–23 (2011).
Sarrio, D. et al. Epithelial-mesenchymal transition in breast cancer relates to the basal-like phenotype. Cancer Res. 68, 989–997 (2008).
Taube, J. H. et al. Core epithelial-to-mesenchymal transition interactome gene-expression signature is associated with claudin-low and metaplastic breast cancer subtypes. Proc. Natl Acad. Sci. USA 107, 15449–15454 (2010).
Taylor, M. A., Parvani, J. G. & Schiemann, W. P. The pathophysiology of epithelial-mesenchymal transition induced by transforming growth factor-beta in normal and malignant mammary epithelial cells. J. Mammary Gland Biol. Neoplasia 15, 169–190 (2010).
Lombaerts, M. et al. E-cadherin transcriptional downregulation by promoter methylation but not mutation is related to epithelial-to-mesenchymal transition in breast cancer cell lines. Br. J. Cancer 94, 661–671 (2006).
Abdulla, T., Luna-Zurita, L., de la Pompa, J. L., Schleich, J. M. & Summers, R. Epithelial to mesenchymal transition-the roles of cell morphology, labile adhesion and junctional coupling. Comput. Methods Prog. Biomed. 111, 435–446 (2013).
Li, Y. et al. Regulation of EMT by Notch signaling pathway in tumor progression. Curr. Cancer Drug Targets 13, 957–962 (2013).
Ho, M. Y. et al. TNF-alpha induces epithelial-mesenchymal transition of renal cell carcinoma cells via a GSK3beta-dependent mechanism. Mol. Cancer Res. 10, 1109–1119 (2012).
Mantovani, A., Marchesi, F., Malesci, A., Laghi, L. & Allavena, P. Tumour-associated macrophages as treatment targets in oncology. Nat. Rev. Clin. Oncol. 14, 399–416 (2017).
Adams, S. et al. Phase 2 study of pembrolizumab (pembro) monotherapy for previously treated metastatic triple-negative breast cancer (mTNBC): KEYNOTE-086 cohort A. J. Clin. Oncol. https://doi.org/10.1200/jco.2017.35.15_suppl.1008 (2017).
Rugo, H. S. et al. Preliminary efficacy and safety of pembrolizumab (MK-3475) in patients with PD-L1 positive, estrogen receptor-positive (ER+)/HER2-negative advanced breast cancer enrolled in KEYNOTE-028. Cancer Res. https://doi.org/10.1158/1538-7445.SABCS15-S5-07 (2016).
Naume, B. et al. The prognostic value of isolated tumor cells in bone marrow in breast cancer patients: evaluation of morphological categories and the number of clinically significant cells. Clin. Cancer Res. 10, 3091–3097 (2004).
Cesano, A. nCounter((R)) PanCancer Immune Profiling Panel (NanoString Technologies, Inc., Seattle, WA). J. Immunother. Cancer 3, 42 (2015).
Saal, L. H. et al. The Sweden Cancerome Analysis Network—Breast (SCAN-B) Initiative: a large-scale multicenter infrastructure towards implementation of breast cancer genomic analyses in the clinical routine. Genome Med. 7, 20 (2015).
Lehmann, B. D. et al. Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies. J. Clin. Invest. 121, 2750–2767 (2011).
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
Dhakal, H. P. et al. Expression of cyclooxygenase-2 in invasive breast carcinomas and its prognostic impact. Histol. Histopathol. 27, 1315–1325 (2012).

Acknowledgements

This study was supported by funding from the KG Jebsen Centre for Breast Cancer Research (SKGJ-MED-004) and the South Eastern Norway Health Authority (grant 2011042 to V.N.K.). Expression profiling was performed with funding from the Research Council of Norway (grant 193387/H10 to Anne-Lise Børresen-Dale and V.N.K.). X.T. is a postdoc fellow funded by the Norwegian Cancer Society (grant no. 419616111190). RNA-sequencing of frozen tumor samples was performed in the SCAN-B laboratory at Lund University, supported by grants from the Mats Paulsson Foundation and Mrs Berta Kamprad Foundation (2012/3657).

Author information

Authors and Affiliations

Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway
Xavier Tekpli, Tonje Lien, Daniel Nebdal, Marie Fongaard, Eldri Undlien Due, Anne-Lise Børresen-Dale, Gry Aarum Geitvik, Anita Langerød, Kristine Kleivi Sahlberg, Therese Sørlie, Hege G. Russnes & Vessela N. Kristensen
Department of Cancer Immunology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway
Andreas Hagen Røssevold & Jon Amund Kyte
Department of Oncology, Division of Cancer Medicine, Oslo University Hospital, Oslo, Norway
Andreas Hagen Røssevold, Hege Oma Ohnstad, Jon Amund Kyte, Olav Engebråten & Bjørn Naume
Department of Pathology, Division of Laboratory Medicine, Oslo University Hospital, Oslo, Norway
Elin Borgen, Lisa Gregusson Svartdal, My Anh Tu Sveli, Øystein Garred & Hege G. Russnes
Division of Oncology and Pathology, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Scheelegatan 2, Medicon Village, 22185, Lund, Sweden
Johan Vallon-Christersson
Department of Biostatistics, Oslo Centre for Biostatistics and Epidemiology, University of Oslo and Research Support Services, Oslo University Hospital, Oslo, Norway
Arnoldo Frigessi
Department of Research, Vestre Viken Hospital Trust, Drammen, Norway
Kristine Kleivi Sahlberg
Centre for Cancer Biomarkers CCBIO, Bergen, Norway
Therese Sørlie & Vessela N. Kristensen
Institute of Clinical Medicine, University of Oslo, Oslo, Norway
Jürgen Geisler, Olav Engebråten, Rolf Kåresen & Bjørn Naume
Department of Clinical Molecular Biology, Division of Medicine, Akershus University Hospital, Lørenskog, Norway
Vessela N. Kristensen
Section for Breast and Endocrine Surgery, Oslo University Hospital, Ullevål, Oslo, Norway
Ellen Schlichting
Department of Pathology, Akershus University Hospital, Lørenskog, Norway
Torill Sauer
Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway
Torill Sauer
Department of Oncology, Akershus University Hospital, Lørenskog, Norway
Jürgen Geisler
Division of Medicine, Akershus University Hospital, Lørenskog, Norway
Jürgen Geisler
Cancer Registry of Norway, Oslo, Norway
Solveig Hofvind
Oslo and Akershus University College of Applied Sciences, Faculty of Health Science, Oslo, Norway
Solveig Hofvind
Department of Circulation and Medical Imaging, Norwegian University of Science and Technology (NTNU), Trondheim, Norway
Tone F. Bathen
Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway
Olav Engebråten & Gunhild Mari Mælandsmo
Department of Breast and Endocrine Surgery, Division of Surgery, Cancer and Transplantation, Oslo University Hospital, Oslo, Norway
Rolf Kåresen
Department of Pharmacy, Faculty of Health Sciences, University of Tromsø, Tromsø, Norway
Gunhild Mari Mælandsmo
Centre for Cancer Biomedicine, University of Oslo, Oslo, Norway
Ole Christian Lingjærde
Department of Computer Science, University of Oslo, Oslo, Norway
Ole Christian Lingjærde
Breast and Endocrine Surgery, Department of Breast and Endocrine Surgery, Vestre Viken Hospital Trust, Drammen, Norway
Helle Kristine Skjerven
Department of Pathology, Vestre Viken Hospital Trust, Drammen, Norway
Daehoon Park
Østfold Hospital, Østfold, Norway
Britt Fritzman

Authors

Xavier Tekpli
Tonje Lien
Andreas Hagen Røssevold
Daniel Nebdal
Elin Borgen
Hege Oma Ohnstad
Jon Amund Kyte
Johan Vallon-Christersson
Marie Fongaard
Eldri Undlien Due
Lisa Gregusson Svartdal
My Anh Tu Sveli
Øystein Garred
Arnoldo Frigessi
Kristine Kleivi Sahlberg
Therese Sørlie
Hege G. Russnes
Bjørn Naume
Vessela N. Kristensen

Consortia

OSBREAC

Anne-Lise Børresen-Dale, Ellen Schlichting, Torill Sauer, Jürgen Geisler, Solveig Hofvind, Tone F. Bathen, Olav Engebråten, Gry Aarum Geitvik, Anita Langerød, Rolf Kåresen, Gunhild Mari Mælandsmo, Ole Christian Lingjærde, Helle Kristine Skjerven, Daehoon Park & Britt Fritzman

Contributions

X.T.: designed the study, performed analysis, wrote the manuscript. T.L.: performed analysis. D.N.: developed the webservice. A.H.R., T.S. and H.G.R.: provided critical points of view. E.B.: scoring of FFPE for inflammation. H.O.O. and K.K.S.: provided tissue samples. J.A.K.: provided critical points of view. J.V.-C.: bioinformatic analysis. M.F. and E.U.D.: prepared samples for RNA-seq. L.G.S. and M.A.T.S.: prepared samples for nCounter analysis. A.F.: counseling on statistical analysis. O.G.: pathological inspection of FFPE samples. B.N.: designed the study, provided tissue samples. V.N.K.: designed the study, wrote the manuscript, supervised all steps of the study.

Corresponding author

Correspondence to Vessela N. Kristensen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Members of the OSBREAC are listed at the end of the paper.

Supplementary information

Peer Review File

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Tekpli, X., Lien, T., Røssevold, A.H. et al. An independent poor-prognosis subtype of breast cancer defined by a distinct tumor immune microenvironment. Nat Commun 10, 5499 (2019). https://doi.org/10.1038/s41467-019-13329-5

Received: 11 December 2018
Accepted: 30 October 2019
Published: 03 December 2019
DOI: https://doi.org/10.1038/s41467-019-13329-5
