Classification of triple negative breast cancer by epithelial mesenchymal transition and the tumor immune microenvironment

Font-Clos, Francesc; Zapperi, Stefano; La Porta, Caterina A. M.

doi:10.1038/s41598-022-13428-2

Download PDF

Article
Open access
Published: 10 June 2022

Classification of triple negative breast cancer by epithelial mesenchymal transition and the tumor immune microenvironment

Francesc Font-Clos^1,2,
Stefano Zapperi^1,2,3 &
Caterina A. M. La Porta^1,4,5

Scientific Reports volume 12, Article number: 9651 (2022) Cite this article

3298 Accesses
12 Citations
20 Altmetric
Metrics details

Subjects

Abstract

Triple-negative breast cancer (TNBC) accounts for about 15–20% of all breast cancers and differs from other invasive breast cancer types because it grows and spreads rapidly, it has limited treatment options and typically worse prognosis. Since TNBC does not express estrogen or progesterone receptors and little or no human epidermal growth factor receptor (HER2) proteins are present, hormone therapy and drugs targeting HER2 are not helpful, leaving chemotherapy only as the main systemic treatment option. In this context, it would be important to find molecular signatures able to stratify patients into high and low risk groups. This would allow oncologists to suggest the best therapeutic strategy in a personalized way, avoiding unnecessary toxicity and reducing the high costs of treatment. Here we compare two independent patient stratification strategies for TNBC based on gene expression data: The first is focusing on the epithelial mesenchymal transition (EMT) and the second on the tumor immune microenvironment. Our results show that the two stratification strategies are not directly related, suggesting that the aggressiveness of the tumor can be due to a multitude of unrelated factors. In particular, the EMT stratification is able to identify a high-risk population with high immune markers that is, however, not properly classified by the tumor immune microenvironment based strategy.

A breast cancer classification and immune landscape analysis based on cancer stem-cell-related risk panel

Article Open access 08 December 2023

Practical classification of triple-negative breast cancer: intratumoral heterogeneity, mechanisms of drug resistance, and novel therapies

Article Open access 16 October 2020

Role of CXCL10 in the progression of in situ to invasive carcinoma of the breast

Article Open access 09 September 2021

Introduction

Breast cancer accounts for 25% of all newly diagnosed cancer cases in women around the world. Despite clinical improvements introduced in the past decades, predicting the clinical outcome of individual patients is still an open challenge¹. This is a very important goal since current treatments are costly and have important side effects, which are detrimental for the patients quality of life. Hence, being able to predict which patients will be most likely to benefit from a given treatment would help establish personalized therapies and avoid overtreatment. The difficulty of this challenge stems from to the considerable heterogeneity of breast cancer, even within the standard molecular subtypes in which this tumor is usually classified^2,3. Breast cancer subtypes are based on the expression level of estrogen receptor (ER), progesteron receptor (PR), the human epidermal growth factor receptor 2 (HER2) and the proliferation marker Ki67. In particular, the four subtypes that are mostly used to classify breast cancer are Luminal A (ER and/or PR+, HER2−, Ki67 low), Luminal B (ER and/or PR+, HER2−, Ki67 high), HER2 positive (HER2+) and triple negative (ER−, PR−, HER2−)^2,3,4,5.

Stratification of breast cancer patients within each of the four subtypes has been attempted using a plethora of gene expression tests based on the expression level of gene panels either empirically selected^6,7 or resulting from machine learning classification of whole transcriptomic data^8,9. Older tests have mostly been applied to to the Luminal A breast cancer subtype and essentially measure the proliferation level. Machine learning based methods have been shown to suffer from overfitting^10,11. The problem arises when a machine learning algorithm tries to classify a high-dimensional object by using a small training set¹¹. When the dimension of the object (around 20000 genes in the case of the human transcriptome) is larger than the number of samples in the training set, the predictive power of the resulting classifier is poor.

Stratifying patients with triple negative breast cancer (TNBC) is an issue that has been addressed only in recent years^12,13,14. This breast cancer subtype is the most aggressive of the four, showing poor prognosis, particularly when metastasis are present, and is highly heterogeneous¹⁵. The classification scheme proposed by Lehmann et al.¹² and later refined by the same group¹⁶ is based on the clustering of gene expression data, leading to six subtypes which display differential response to treatment¹², but no statistically significant differences in relapse-free survival¹⁶.

In a recent work, we introduced and validated ARIADNE, a general algorithmic strategy to assess the risk of metastasis of patients with TNBC based on the identification of hybrid epithelial/mesenchymal phenotypes from gene expression data¹⁷. The method is based on a Boolean network model that is able to efficiently classify cell phenotypes by mapping gene expression data into a complex landscape whose topographic features represent important biological aspects of the cells¹⁸. The epithelial–mesenchymal transition (EMT) describes how polarized epithelial (E) cells transform into mesenchymal (M) cells by losing cell polarity and down-regulating adhesion molecules, such as E-cadherin. M cells tend to be more motile, suggesting that EMT could be associated with metastatic capabilities^19,20,21,22. Recent work shows that the EMT can also involve hybrid E/M states²³ where cells display a mix of markers, characteristic of E and M cells^24,25,26. These hybrid states combine invasive capabilities and intracellular adhesion^27,28 and are associated to extremely aggressive tumors^23,29,30,31. Several EMT scores have been proposed to determine the E or M character of a tumor sample based on gene expression data³². We showed that ARIADNE correlates with other EMT scores but it is more specific in identifying hybrid phenotypes, which is essential to stratify patients¹⁷.

Due to the possible involvement of the immune system in modulating the phenotype of tumor cells, a recent paper suggested that immunological metasignatures could stratify TNBC patients^33,34. In particular, the authors focus on the tumor immune microenvironment, considering gene expression profiles of matched tumor, epithelial and stromal compartments from TBNC patients³³. Using these data, the authors classify patients according to specific combinations of gene expression metasignatures that are able to stratify patients clinical outcomes³³. The paper also shows that each of these immunological subtypes expresses distinct patterns of immune related gene markers (i.e. immune suppression, IL-17 induction and production, cell death, neutrophils, type I Interpherons (IFN), cytotoxic activity and antigen presentation).

Given that previous papers show that the same TNBC patients can effectively be stratified by two independent strategies, one based on the EMT¹⁷ and the other based on the tumor immune microenvironment³³, we decided to investigate whether the two strategies are related. In other words, do patients considered at high/low risk according to the EMT based approach also show distinct immunological signatures? To address this question, we use ARIADNE to analyze gene expression data from patients included in the study of the TNBC tumor immune microenvironment³³ and then check if the groups selected by ARIADNE show any peculiar differences in the expression of immune-related genes.

Methods

Matching different datasets

Gene expression data analyzed in Gruosso et al.³³ are accessible in the GEO database under accession numbers GSE88715 (for gene expression from stromal and epithelial compartments) and GSE88847 (for bulk tumor gene expression). Survival data can be obtained from an earlier dataset (GSE58644) which contains gene expression data for the same patients together with others³⁵. We could not find an indication of how gene expression data from GSE88847 can be matched with the survival data in GSE58644. To solve this problem, we compute the correlation of pairs of transcriptomes from GSE88847 and GSE58644. We find that for each sample in GSE88847 there is a matching sample in GSE58644 that has a much larger correlation coefficient than the rest. We then verify the potential matching with clinical data (i.e. tumor size and age), and exclude four cases where the matching is not reliable because clinical data do not agree.

Data normalization

We normalize data from GSE88847 and GSE88715 by following the procedure adopted by Karn et al.³⁶ for GSE31519. To be precise:

1.
log2 transformation of MAS5 values
2.
median centering of arrays
3.
magnitude normalization of arrays.

where magnitude normalization must be understood as setting the sum of squares of all samples to one. This is because ARIADNE was trained on GSE31519¹⁷, and in this manuscript we do not re-train ARIADNE, but rather tackle the challenge of reusing the parameters obtained in Font-Clos et al.¹⁷ to compute the score of a new dataset. Therefore, it is crucial to use the same normalization as in the training data. When comparing different datasets one should also keep in mind that additional sources of variability could come from differences in sample preparation across different studies. Figure 1 shows the distribution of normalized expression for the genes used in the score computation, comparing the training data of ARIADNE (i.e. GSE31519) with the data newly analyzed in the present paper (i.e. GSE88847 and GSE88715). We then compute the ARIADNE score as explained in Font-Clos et al.¹⁷ for the samples in GSE88847 and GSE88715. After computing the raw ARIADNE score, which is an integer value, we define the high and low groups simply by sorting and splitting the dataset into two groups, high and low.

Calculation of pathway deregulation scores

Pathway Deregulation Scores (PDS) were first introduced by Drier et al.³⁷ as a way of quantifying the overall deregulation of a given pathway with respect to a reference sample by fitting a non-parametric, non-linear one-dimensional curve through the “middle” of the transcriptomic data, in the subspace generated by the genes of that pathway. In practice, this is usually done via the principal curve algorithm³⁸, although other procedures would be acceptable. We follow the steps of Drier et al.³⁷, except for the following modification that we introduced in a previous paper³⁹. We place the value of 0 the mean value of the reference sample, instead of at the extremal point of the curve. This modification alters the resulting PDS only by a linear shift, but makes the results more robust against the variability of the reference samples, as discussed in Font-Clos et.³⁹. We compute PDS for the immunological gene sets reported in Gruosso et al.³³ and for a subset of immunologically related “hallmark gene sets” obtained from msigdb⁴⁰. Boxplots show the distribution of PDS values, for each pathway, both for “ARIADNE low” samples (green) and for “ARIADNE high” samples (red).

Tumor immune microenvironment metasignatures

The list of genes corresponding to the metasignatures proposed by Gruosso et al.³³ are obtained from https://github.com/bhklab/EpiStromaImmune/. We focus our analysis on the “Immune” (CDSig1), “Fibrosis (CDSig3), “Cholesterol” (EDSig2) and “Interferon (IFN)” (EDSig5) metasignatures and use them to classify patients into groups following the algorithm described in Gruosso et al.³³ and reported in https://github.com/bhklab/EpiStromaImmune/. In particular, we first construct two groups—“Immune high/Fibrosis low” and “Immune low/Fibrosis high”—representing 60% and 40% of the samples respectively. We define samples that end up in both groups as “Intermediate”. This differs slightly from Gruosso et al.³³ where those samples are later re-assigned to one of the two classes. We then refine the classification for the samples in the “Immune high/Fibrosis low” group by constructing two additional groups based on the “Cholesterol” and “Interferon” metasignatures, each containing 50% of the samples³³. We compute the metasignatures for GSE88847 and GSE31519. When comparing the metasignature with ARIADNE, we consider two groups for GSE88847 (high and low) and three groups for GSE31519 (low, med and high, as in Font-Clos et al.¹⁷) owing to the larger sample size of the second dataset.

TNBC subtypes

We establish the subtype (TNBCtype) of the samples in GSE88847 according to Lehmann et al.¹² submitting the GSE88847 gene expression dataset to the TNBCtype server (https://cbc.app.vumc.org/tnbc/).

Computation of survival curves

We use the lifelines python package to compute survival curves in Fig. 2a using the Kaplan-Meyer approach.

Statistical analysis

In correlation plots, statistical significance is established through linear regression. Statistical differences in the distributions of ARIADNE scores for immune-related groups and among tissues are established using the Kolmogorov-Smirnov (KS) test.

Statement

All methods were carried out in accordance with relevant guidelines and regulations.

Results

We access gene expression data from TNBC patients taken from the tumor (GSE88847) and from adjacent tissues (stroma and epithelium) already analyzed in Gruosso et al.³³ and match them to survival data³⁵ as described in the Methods section. We then stratify the patients according to the score provided by the ARIADNE algorithm¹⁷ which maps gene expression data into the states of a Boolean network model simulating gene regulatory interactions responsible for the EMT¹⁸. The algorithm was already trained and cross-validated on a large cohort of TNBC patients (GSE31519⁴¹) and was able to identify low and high risk patients based on the presence of hybrid E/M characteristics¹⁷. As shown in Fig. 2a, ARIADNE successfully stratifies patients in two risk classes, a low risk class with high survival and a high risk class with lower survival. We then applied ARIADNE also to gene expression data measured in tissues adjacent to the bulk tumor (i.e. stroma and epithelium). As shown in Fig. 2b, the ARIADNE score, which measures the presence of hybrid E/M cells, is larger in the tumor bulk and smaller in the epithelium with intermediate scores found in the stroma. The differences are statistically significant as demonstrated by the KS test. This suggests an increasing presence of hybrid E/M phenotypes from the epithelium to the stroma and finally to the bulk tumor. We also establish the TNBC subtype of the tumor samples according to Lehmann et al.¹². As shown in Fig. 2c, samples are scattered across the six subtypes independently of their ARIADNE score.

Having confirmed that this cohort of patients can be effectively stratified by ARIADNE based on the EMT status of the tumor, we consider signatures related to the tumor immune microenvironment. To this end, we first consider the immunological gene sets considered in Gruosso et al.³³ and analyze if their expression correlates with the score produced by ARIADNE. As shown in Fig. 3a, we can not see any clear pattern in the gene expression values measured from bulk tumor samples when those are sorted according to their ARIADNE scores. To be more quantitative, we compute the cross-correlation between the ARIADNE score and the mean expression value within each gene set. The results displayed in Fig. 3b show that correlation coefficients are rather small and not statistically significant, even when the significance level is not particularly strict (i.e. \(\alpha =0.05\) without multiple testing correction). These negative results hold for all sample types: Bulk tumor, stroma and epithelium.

To obtain a more precise assessment of the possible relation between the stratification obtained by ARIADNE and the tumor immune microenvironment, we compute pathway deregulation scores (PDS)³⁷. The method quantifies the overall deregulation of a given pathway with respect to a reference sample, by fitting a non-parametric non-linear one-dimensional curve through the gene expression data relative to each pathway (see Methods for details). We apply the method using again the same gene sets (Fig. 4a) and then compute a cross-correlation between PDS and ARIADNE score. Again correlations are weak and not statistically significant (Fig. 4)b. We also repeat the same analysis for a set of immune related hallmark pathways⁴⁰. As shown in Fig. 5, we do not detect any significant correlation between PDS and ARIADNE score.

Finally, we consider the immunological metasignatures defined in Gruosso et al.³³ and compare their value with ARIADNE. In particular, we consider the “Immune” (CDSig2), “Fibrosis” (CDSig4), “Cholesterol” (EDSig2) and “Interferon” (EDsig5) metasignatures used in Gruosso et al.³³ to stratify patients. Cross-correlation analysis for the data in GSE88847 does not reveal significant correlations between the scores, except in one case (see Fig. 6a). To check if the lack of correlation is due to the relatively small size of the dataset, we also consider a larger dataset (i.e. GSE31519). As shown in Fig. 6b, the group of patients with high ARIADNE score displays a small but statistically significant enrichment in all the metasignatures. We then proceed as in Gruosso et al.³³ and define groups based on combinations of the metasignatures. In particular, we first divide patients in two classes: “Immune high”/“Fibrosis low” and “Immune low”/“Fibrosis high” (see Fig. 7a). As shown in Fig. 7b, there is a small but statistically significant difference in ARIADNE score between the two classes. Remarkably, the largest differences in ARIADNE score are observed in patients that fall in both groups and that we classify as “intermediate” (Fig. 7a). Our result is consistent with Fig. 6b showing that a number of patients with high ARIADNE score and also high immune and fibrosis markers. We also consider a sub-classification of the “Immune high”/“Fibrosis low” group into “Cholesterol low”/“Interferon high” and “Cholesterol high”/“Interferon low” groups (Fig. 7c), finding no significant association with ARIADNE score (Fig. 7d).

Conclusions

The possibility to stratify TBNC patients is a crucial aspect to build personalized treatments, which would be particularly relevant for this breast cancer subtype where no specific therapeutic strategy is available. Several patients stratification strategies based on gene expression data have been proposed in the literature. The most widely used classification of TNBC was proposed by Lehmann et al.¹² and it is based on automatic clustering of gene expression data and resulted in six subgroups, later refined into four¹⁶. The Lehmann classification showed promising results in identifying patients who respond to treatment¹², but limited success in identifying relapse-free surviving patients¹⁶.

Alternative patient stratification strategies for TNBC are built on specific biological processes known to affect clinical outcome, rather than performing an unsupervised analysis of gene expression data as in the case of Lehmann et al.¹². In this paper, we compared two of these strategies, one based on the EMT, which we introduced in a recent paper¹⁷, and the other based on the tumor immune microenvironment³³. Our analysis suggests that our EMT based stratification successfully identifies high risk patients in a way that is largely independent of the tumor immune microenvironment and the Lehmann subtyping. Our analysis, however, reveals a small fraction of patients with high ARIADNE score and large metasignature scores that is not properly classified according the the categories proposed in Gruosso et al.³³. This point is particularly interesting since it illustrates the potential of ARIADNE in identifying patients that fall into a grey area when classified with immune categories. Apart from this subpopulation, other patients classified as high or low risk by ARIADNE do not display a peculiar profile in terms of their tumor immune microenvironment.

Data availability

The datasets analyzed during the current study are available in the GEO repository under accession numbers GSE88847 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE88847, GSE88715 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE88715, and https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE31519 GSE31519.

References

Waks, A. G. & Winer, E. P. Breast cancer treatment: a review. JAMA 321, 288–300 (2019).
Article CAS Google Scholar
Perou, C. M. et al. Molecular portraits of human breast tumours. Nature 406, 747–752 (2000).
Article ADS CAS Google Scholar
Koboldt, D. et al. Comprehensive molecular portraits of human breast tumours. Nature 490, 61–70 (2012).
Article ADS CAS Google Scholar
Sims, A. H., Howell, A., Howell, S. J. & Clarke, R. B. Origins of breast cancer subtypes and therapeutic implications. Nat. Clin. Pract. Oncol. 4, 516–525 (2007).
Article CAS Google Scholar
Kennecke, H. et al. Metastatic behavior of breast cancer subtypes. J. Clin. Oncol. 28, 3271–3277 (2010).
Article Google Scholar
Paik, S. et al. A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N. Engl. J. Med. 351, 2817–2826 (2004).
Article CAS Google Scholar
Paik, S. et al. Gene expression and benefit of chemotherapy in women with node-negative, estrogen receptor-positive breast cancer. J. Clin. Oncol. 24, 3726–3734 (2006).
Article CAS Google Scholar
Van’t Veer, L. J. et al. Gene expression profiling predicts clinical outcome of breast cancer. Nature 415, 530–536 (2002).
Buyse, M. et al. Validation and clinical utility of a 70-gene prognostic signature for women with node-negative breast cancer. J. Natl. Cancer Inst. 98, 1183–1192 (2006).
Article CAS Google Scholar
Ein-Dor, L., Kela, I., Getz, G., Givol, D. & Domany, E. Outcome signature genes in breast cancer: is there a unique set?. Bioinformatics 21, 171–178 (2005).
Article CAS Google Scholar
Drier, Y. & Domany, E. Do two machine-learning based prognostic signatures for breast cancer capture the same biological processes?. PLoS ONE 6, e17795 (2011).
Article ADS CAS Google Scholar
Lehmann, B. D. et al. Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies. J. Clin. Investig. 121, 2750–2767 (2011).
Article CAS Google Scholar
Burstein, M. D. et al. Comprehensive genomic analysis identifies novel subtypes and targets of triple-negative breast cancer. Clin. Cancer Res. 21, 1688–1698 (2015).
Article CAS Google Scholar
Yu, G. et al. Predicting relapse in patients with triple negative breast cancer (tnbc) using a deep-learning approach. Front. Physiol. 11 (2020). https://www.frontiersin.org/article/10.3389/fphys.2020.511071. https://doi.org/10.3389/fphys.2020.511071.
Shah, S. P. et al. The clonal and mutational evolution spectrum of primary triple-negative breast cancers. Nature 486, 395–399 (2012).
Article ADS CAS Google Scholar
Lehmann, B. D. et al. Refinement of triple-negative breast cancer molecular subtypes: implications for neoadjuvant chemotherapy selection. PLoS ONE 11, e0157368 (2016).
Article Google Scholar
Font-Clos, F., Zapperi, S. & La Porta, C. A. M. Classification of triple-negative breast cancers through a boolean network model of the epithelial–mesenchymal transition. Cell Syst. 12, 457-462.e4 (2021).
Article CAS Google Scholar
Font-Clos, F., Zapperi, S. & La Porta, C. A. M. Topography of epithelial–mesenchymal plasticity. Proc. Natl. Acad. Sci. USA 115, 5902–5907 (2018).
Article ADS CAS Google Scholar
Huber, M. A., Kraut, N. & Beug, H. Molecular requirements for epithelial–mesenchymal transition during tumor progression. Curr. Opin. Cell Biol. 17, 548–58. https://doi.org/10.1016/j.ceb.2005.08.001 (2005).
Article CAS PubMed Google Scholar
Rhim, A. D. et al. Emt and dissemination precede pancreatic tumor formation. Cell 148, 349–61. https://doi.org/10.1016/j.cell.2011.11.025 (2012).
Article CAS PubMed PubMed Central Google Scholar
Sarrió, D. et al. Epithelial–mesenchymal transition in breast cancer relates to the basal-like phenotype. Cancer Res. 68, 989–97. https://doi.org/10.1158/0008-5472.CAN-07-2017 (2008).
Article CAS PubMed Google Scholar
Aleskandarany, M. A. et al. Epithelial–mesenchymal transition in early invasive breast cancer: an immunohistochemical and reverse phase protein array study. Breast Cancer Res. Treat. 145, 339–48. https://doi.org/10.1007/s10549-014-2927-5 (2014).
Article CAS PubMed Google Scholar
Grosse-Wilde, A. et al. Stemness of the hybrid epithelial/mesenchymal state in breast cancer and its association with poor survival. PLoS ONE 10, e0126522. https://doi.org/10.1371/journal.pone.0126522 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bitterman, P., Chun, B. & Kurman, R. J. The significance of epithelial differentiation in mixed mesodermal tumors of the uterus. A clinicopathologic and immunohistochemical study. Am. J. Surg. Pathol. 14, 317–28 (1990).
Article CAS Google Scholar
Haraguchi, S., Fukuda, Y., Sugisaki, Y. & Yamanaka, N. Pulmonary carcinosarcoma: immunohistochemical and ultrastructural studies. Pathol. Int. 49, 903–8 (1999).
Article CAS Google Scholar
Paniz Mondolfi, A. E. et al. Primary cutaneous carcinosarcoma: insights into its clonal origin and mutational pattern expression analysis through next-generation sequencing. Hum. Pathol. 44, 2853–60. https://doi.org/10.1016/j.humpath.2013.07.014 (2013).
Article CAS PubMed Google Scholar
Revenu, C. & Gilmour, D. Emt 2.0: shaping epithelia through collective migration. Curr. Opin. Genet. Dev. 19, 338–342. https://doi.org/10.1016/j.gde.2009.04.007 (2009).
Article CAS PubMed Google Scholar
Yu, M. et al. Circulating breast tumor cells exhibit dynamic changes in epithelial and mesenchymal composition. Science 339, 580–4. https://doi.org/10.1126/science.1228522 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Jolly, M. K. et al. Stability of the hybrid epithelial/mesenchymal phenotype. Oncotarget 7, 27067–27084. https://doi.org/10.18632/oncotarget.8166 (2016).
Article PubMed PubMed Central Google Scholar
George, J. T., Jolly, M. K., Xu, S., Somarelli, J. A. & Levine, H. Survival outcomes in cancer patients predicted by a partial emt gene expression scoring metric. Cancer Res. 77, 6415–6428. https://doi.org/10.1158/0008-5472.CAN-16-3521 (2017).
Article CAS PubMed PubMed Central Google Scholar
Pastushenko, I. et al. Identification of the tumour transition states occurring during emt. Nature 556, 463–468. https://doi.org/10.1038/s41586-018-0040-3 (2018).
Article ADS CAS PubMed Google Scholar
Chakraborty, P., George, J. T., Tripathi, S., Levine, H. & Jolly, M. K. Comparative study of transcriptomics-based scoring metrics for the epithelial-hybrid-mesenchymal spectrum. Front. Bioeng. Biotechnol. 8, 220. https://doi.org/10.3389/fbioe.2020.00220 (2020).
Article PubMed PubMed Central Google Scholar
Gruosso, T. et al. Spatially distinct tumor immune microenvironments stratify triple-negative breast cancers. J. Clin. Investig. 129, 1785–1800 (2019).
Article Google Scholar
Bareche, Y. et al. Unraveling triple-negative breast cancer tumor microenvironment heterogeneity: towards an optimized treatment approach. JNCI J. Natl. Cancer Inst. 112, 708–719 (2020).
Article Google Scholar
Tofigh, A. et al. The prognostic ease and difficulty of invasive breast carcinoma. Cell Rep. 9, 129–142 (2014).
Article CAS Google Scholar
Karn, T. et al. Control of dataset bias in combined affymetrix cohorts of triple negative breast cancer. Genom Data 2, 354–356 (2014).
Article Google Scholar
Drier, Y., Sheffer, M. & Domany, E. Pathway-based personalized analysis of cancer. Proc. Natl. Acad. Sci. 110, 6388–6393 (2013).
Article ADS CAS Google Scholar
Hastie, T. & Stuetzle, W. Principal curves. J. Am. Stat. Assoc. 84, 502–516. https://doi.org/10.1080/01621459.1989.10478797 (1989).
Article MathSciNet MATH Google Scholar
Font-Clos, F., Zapperi, S. & La Porta, C. A. Integrative analysis of pathway deregulation in obesity. NPJ Syst. Biol. Appl. 3, 1–10 (2017).
Article Google Scholar
Liberzon, A. et al. The molecular signatures database hallmark gene set collection. Cell Syst. 1, 417–425 (2015).
Article CAS Google Scholar
Rody, A. et al. A clinically relevant gene signature in triple negative and basal-like breast cancer. Breast Cancer Res. 13, R97 (2011).
Article Google Scholar

Download references

Author information

Authors and Affiliations

Center for Complexity and Biosystems, University of Milan, via Celoria 16, 20133, Milano, Italy
Francesc Font-Clos, Stefano Zapperi & Caterina A. M. La Porta
Department of Physics, University of Milan, Via Celoria 16, 20133, Milano, Italy
Francesc Font-Clos & Stefano Zapperi
CNR - Consiglio Nazionale delle Ricerche, Istituto di Chimica della Materia Condensata e di Tecnologie per l’Energia, Via R. Cozzi 53, 20125, Milano, Italy
Stefano Zapperi
Department of Environmental Science and Policy, University of Milan, via Celoria 26, 20133, Milano, Italy
Caterina A. M. La Porta
CNR - Consiglio Nazionale delle Ricerche, Istituto di Biofisica, via Via De Marini 6, 16149, Genova, Italy
Caterina A. M. La Porta

Authors

Francesc Font-Clos
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Zapperi
View author publications
You can also search for this author in PubMed Google Scholar
Caterina A. M. La Porta
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

F.F.C. and S.Z. analyzed data. C.A.M.L.P. and S.Z. designed the study and wrote the paper.

Corresponding author

Correspondence to Caterina A. M. La Porta.

Ethics declarations

Competing interests

The authors declare the following competing interests: Complexdata S.R.L has filed an Italian patent application related to the present work. Inventors: F. Font-Clos, S. Zapperi, C. A. M. La Porta. Patent status: granted. Date of application: 13/12/2019. Application number: 102019000023946. The patent concerns a method to screen breast cancer patients using transcriptomic data and Boolean networks. FFC, SZ, CAMPL hold 13.25%, 8.83% and 17.67% shares of Complexdata S.R.L., respectively.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Font-Clos, F., Zapperi, S. & La Porta, C.A.M. Classification of triple negative breast cancer by epithelial mesenchymal transition and the tumor immune microenvironment. Sci Rep 12, 9651 (2022). https://doi.org/10.1038/s41598-022-13428-2

Download citation

Received: 22 February 2022
Accepted: 09 May 2022
Published: 10 June 2022
DOI: https://doi.org/10.1038/s41598-022-13428-2

This article is cited by

Effects of exercise on immune cells with tumor-specific activity in breast cancer patients and survivors: a systematic review and meta-analysis
- Ana Myriam Lavín-Pérez
- Daniel Collado-Mateo
- Amir Hossein Ahmadi Hekmatikar
Supportive Care in Cancer (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.