A novel approach to triple-negative breast cancer molecular classification reveals a luminal immune-positive subgroup with good prognoses

Prado-Vázquez, Guillermo; Gámez-Pozo, Angelo; Trilla-Fuertes, Lucía; Arevalillo, Jorge M.; Zapater-Moros, Andrea; Ferrer-Gómez, María; Díaz-Almirón, Mariana; López-Vacas, Rocío; Navarro, Hilario; Maín, Paloma; Feliú, Jaime; Zamora, Pilar; Espinosa, Enrique; Fresno Vara, Juan Ángel

doi:10.1038/s41598-018-38364-y

Download PDF

Article
Open access
Published: 07 February 2019

A novel approach to triple-negative breast cancer molecular classification reveals a luminal immune-positive subgroup with good prognoses

Guillermo Prado-Vázquez^1,2,
Angelo Gámez-Pozo^1,2,
Lucía Trilla-Fuertes²,
Jorge M. Arevalillo⁴,
Andrea Zapater-Moros^1,2,
María Ferrer-Gómez¹,
Mariana Díaz-Almirón³,
Rocío López-Vacas¹,
Hilario Navarro⁴,
Paloma Maín ORCID: orcid.org/0000-0002-7418-5381⁵,
Jaime Feliú^6,7,
Pilar Zamora⁶,
Enrique Espinosa^6,7 &
…
Juan Ángel Fresno Vara ORCID: orcid.org/0000-0003-1527-1252^1,7

Scientific Reports volume 9, Article number: 1538 (2019) Cite this article

11k Accesses
46 Citations
40 Altmetric
Metrics details

Subjects

Breast cancer

Abstract

Triple-negative breast cancer is a heterogeneous disease characterized by a lack of hormonal receptors and HER2 overexpression. It is the only breast cancer subgroup that does not benefit from targeted therapies, and its prognosis is poor. Several studies have developed specific molecular classifications for triple-negative breast cancer. However, these molecular subtypes have had little impact in the clinical setting. Gene expression data and clinical information from 494 triple-negative breast tumors were obtained from public databases. First, a probabilistic graphical model approach to associate gene expression profiles was performed. Then, sparse k-means was used to establish a new molecular classification. Results were then verified in a second database including 153 triple-negative breast tumors treated with neoadjuvant chemotherapy. Clinical and gene expression data from 494 triple-negative breast tumors were analyzed. Tumors in the dataset were divided into four subgroups (luminal-androgen receptor expressing, basal, claudin-low and claudin-high), using the cancer stem cell hypothesis as reference. These four subgroups were defined and characterized through hierarchical clustering and probabilistic graphical models and compared with previously defined classifications. In addition, two subgroups related to immune activity were defined. This immune activity showed prognostic value in the whole cohort and in the luminal subgroup. The claudin-high subgroup showed poor response to neoadjuvant chemotherapy. Through a novel analytical approach we proved that there are at least two independent sources of biological information: cellular and immune. Thus, we developed two different and overlapping triple-negative breast cancer classifications and showed that the luminal immune-positive subgroup had better prognoses than the luminal immune-negative. Finally, this work paves the way for using the defined classifications as predictive features in the neoadjuvant scenario.

A breast cancer classification and immune landscape analysis based on cancer stem-cell-related risk panel

Article Open access 08 December 2023

Cancer genomics predicts disease relapse and therapeutic response to neoadjuvant chemotherapy of hormone sensitive breast cancers

Article Open access 18 May 2020

Identification of a five genes prognosis signature for triple-negative breast cancer using multi-omics methods and bioinformatics analysis

Article 26 April 2022

Introduction

Breast cancer (BC) causes 450,000 deaths every year worldwide¹. BC is clinically and genetically heterogeneous², and this heterogeneity has led to subdivisions in an attempt to treat patients more efficiently. The classical categorization considers the expression of hormonal receptors (estrogen receptors [ERs], and progesterone receptors [PRs]) and human epidermal growth factor receptor 2 (HER2) expression, because this determines the possibility of treatment with hormones and anti-HER2 therapies, respectively.

Triple-negative breast cancer (TNBC) is defined by a lack of ER and PR expression and a lack of HER2 overexpression. TNBC comprises a heterogeneous group of tumors. In 2000, Perou et al. proposed a classification of BC based on gene expression patterns. Most triple-negative tumors are included in the so-called basal-like molecular subgroup³, although both categories have up to 30% discordance⁴.

Several studies have developed specific molecular classifications for TNBC. For example, Rody et al. defined metagenes that distinguished molecular subsets within the group⁵. Lehmann et al. identified seven molecular subgroups: unstable; basal-like 1; basal-like 2; immunomodulatory; mesenchymal (MES)-like; mesenchymal stem-like (MSL); and luminal androgen receptor (LAR)⁶. The Immunomodulatory and MSL subtypes have recently been refined⁷. Burstein et al. applied non-negative matrix factorization and defined four subgroups: basal-like immune active; basal-like immune suppressed; mesenchymal; and luminal AR⁸. Other classifications have also been proposed by Sabatier⁹, Prat¹⁰, Jézéquel¹¹, and Milioli¹². Despite these extensive studies, the designation of TNBC molecular subtypes has had little impact in the clinical setting.

The so-called cancer stem cell hypothesis could provide a different way to categorize BC. It theorizes that cancer derives from a stem cell compartment that undergoes an abnormal and poorly regulated process of organogenesis analogous to many aspects of normal stem cells^13,14,15. Depending on the activation point of these cancer stem cells, tumors will have varying characteristics. Poorly differentiated breast tumors would arise from the most primitive stem cells¹⁴. This hypothesis contextualizes BC molecular groups¹ in a development framework. Moreover, molecular characterization of the claudin (CLDN)-low subtype reveals that these tumors are significantly enriched in epithelial-mesenchymal transition and stem cell-like features, while showing a low expression of luminal and proliferation-associated genes¹⁶.

In the present study, we applied probabilistic graphical models to a previously published TNBC cohort⁵. This technique allows exploring the molecular information from a functional perspective. Our aim was to tackle the molecular analysis of TNBC from a broad perspective, such as the cancer stem cell hypothesis, to provide a classification with clearer clinical implications.

Methods

TNBC gene expression and clinical data

Gene expression data from TNBC tumors and available clinical follow-up information were obtained from GSE31519. Gene expression values were magnitude normalized, and then log₂ was calculated. The Limma R package¹⁷ was applied to avoid the batch effect. Finally, the complete dataset was mean centered. The probe with the highest variance of each gene within all patients was selected. The results obtained with the first database were then applied to a second database of patients treated with neoadjuvant chemotherapy, GSE25066. GSE25066 data was magnitude normalized and log₂ was calculated just as with GSE31519.

Probabilistic graphical model analysis

A probabilistic graphical model compatible with a high-dimensionality approach to associate gene expression profiles, including the most variable 2000 genes, was performed as previously described¹⁸. Briefly, the resulting network, in which each node represents an individual gene, was split into several branches to identify functional structures within the network. Then, we used gene ontology analyses to investigate which function or functions were overrepresented in each branch, using the functional annotation chart tool provided by DAVID 6.8 beta¹⁹. We used “homo sapiens” as a background list and selected only GOTERM-DIRECT gene ontology categories and Biocarta and KEGG pathways. Functional nodes were composed of nodes presenting a gene ontology enriched category. To measure the functional activity of each functional node, the mean expression of all the genes included in one branch related to a concrete function was calculated. Differences in functional node activity were assessed by class comparison analyses. Finally, metanodes were defined as groups of related functional nodes using nonsupervised hierarchical clustering analyses.

Sparse k-means classification

Sparse k-means was used to establish the optimal number of tumor groups. This method uses the genes included in each node and metanode, as previously described²⁰. Briefly, classification consistency was tested using random forest. An analysis using the consensus clustering algorithm²¹ as applied to the data containing the variables that were selected by the sparse K-means method²² has provided an optimum classification into two subtypes in previous studies²⁰. In order to transfer the newly defined classification from the main dataset to other datasets, we constructed centroids for each defined subgroup, using genes included in various metanodes.

Assignation to groups defined by other molecular classifications

Tumors in the main dataset were assigned to a single group according to previously defined molecular classifications: PAM50 + CLDN low was assigned using the single sample predictor¹⁰. Burstein’s four subtypes were assigned using an 80-gene signature⁸. The TNBC4 type was performed in two steps: first, Lehmann’s seven subtypes were assigned using centroids constructed from 77 tumors included in the dataset that was previously assigned, and then Immunomodulatory and MSL groups were redefined as previously described⁷.

Statistical analyses and software suites

Survival curves were estimated using Kaplan–Meier analyses and compared with the log-rank test, using relapse free survival (RFS) as the end point. RFS was defined as the time between the day of surgery and the date of distant relapse or last date of follow-up. Correlations were assessed using Pearson’s r and linear regression. Differences in functional node activity between groups were assessed by the Kruskal–Wallis test, and multiple comparisons were assessed using the Dunn’s multiple comparisons test. Box-and-whisker plots are Tukey boxplots. All p-values were two-sided, and P < 0.05 was considered statistically significant. Expression data and network analyses were performed in MeV and Cytoscape software suites²³. The SPSS v16 software package, GraphPad Prism 5.0 and R v2.15.2 (with the Design software package 0.2.3) were used for all the statistical analyses.

Results

Gene expression and clinical data

Gene expression data and clinical information from 579 TNBC tumors were obtained from GSE31519. Some 85 samples were excluded because the patients had been treated with neoadjuvant chemotherapy or a different platform had been used. As a consequence, the data from 494 TNBC tumors from GSE31519 were used in subsequent analyses. Gene expression was normalized, the batch effect was corrected and the most variant probe was selected for each gene. The resulting dataset, including expression values from 13,146 genes will be referred to as the main dataset from now on.

Gene expression data from 508 breast cancer samples treated with neoadjuvant taxane-anthracycline chemotherapy were retrieved from GSE25066. A total 153 of these 508 samples were identified as TNBC.

Clinical features

All available clinical features of the main dataset and the neoadjuvant dataset are presented in Table 1. The main dataset’s population of tumors tended to be large (>T1 in 56% of the population), poorly differentiated (G3 in 57% of the samples), with no node invasion (N0 in 51% of the samples) and most of the patients were not treated with adjuvant chemotherapy (52%). The neoadjuvant dataset’s population of tumors tended to be T2 (44%) and T3 (32%), poorly differentiated (G3 in 81% of the samples), and N1 (46%) with 32% of the patients achieving a complete pathological response after neoadjuvant treatment.

Table 1 Clinical features of the main and neoadjuvant datasets.

Full size table

Molecular characterization of TNBC

A gene expression-based network, including the 2000 most variant genes in the development dataset, was constructed using a probabilistic graphical model (PGM) (Fig. 1). The functional structure of the network was explored using gene ontology analyses, and 26 functional nodes were defined (Fig. 1 and Sup. File 1). Functional node activity was calculated and relationships between nodes were assessed using a hierarchical clustering (HCL) analysis (Sup. File 2). Functional node 1 is composed of 34 genes, including the CLDN3, CLDN4 and CLDN7 genes. On the other hand, functional nodes 15 (chemokine activity), 16 (major histocompatibility complex class II receptor activity), 17 (immune response) and 18 (antigen binding) were related to various aspects of the immune response and clustered together as an “immune metanode” in the HCL analysis (Sup. File 2). Additionally, functional node 19 contained genes related to the peroxisome proliferator-activated receptor (PPAR) signaling pathway, and functional node 24 contained genes involved in the G1/S transition of mitotic cell cycle (Sup. File 1).

We then used the method described by Rody et al. to assess 15 metagenes (series of genes known to be related to one specific biological function or characteristic)⁵. Genes within a given metagene appeared close to each other in our network. Additionally, related metagenes, i.e., B-cell and IL-8 metagenes, also appeared close to each other (Fig. 2).

Functional nodes 5, 6, 7, 8 and 10 in our network had different gene ontologies related to an integral component of the plasma membrane, extracellular matrix, desmosomes, keratinization, and extracellular space, respectively. However, these five nodes appeared to correlate in the HCL analysis (Sup. File 2) and included genes from Rody’s basal-like metagene (Fig. 2). Thus, from now on, these five functional nodes were grouped as the basal metanode (Fig. 1). In the same way, functional nodes 0, 9, 11, 14, 22 and 23 were related to protein binding, extracellular exosomes, sequence-specific DNA binding, metabolic pathways and nucleosomes, respectively, again grouped together in the HCL analysis and including genes from Rody’s apocrine/luminal metagene, so they were defined as the luminal metanode (Fig. 1).

Cellular classification

The sparse k-means method was used to group samples into a limited number of clusters based on functional nodes and metanodes. Samples from the basal and luminal metanodes and the CLDN-enriched functional node were each divided into two groups. Mimicking the cancer stem cell hypothesis, we established the following workflow (Fig. 3): Samples with high luminal metanode activity were classified as the luminal androgen receptor group (LAR). Tumors showing low luminal metanode activity and high basal metanode activity from the basal subgroup were classified as basal. Finally, tumors with low activity in both the basal and luminal metanodes were screened for CLDN-enriched node expression. Samples showing low activity for the CLDN-enriched functional node were categorized as CLDN-low, whereas samples showing high activity for CLDN-enriched functional node were labeled as CLDN-high (Fig. 3).

From the 494 samples in the main dataset, the cellular classification defined 91 (18%) LAR, 53 (11%) CLDN-low, 310 (63%) basal and 40 (8%) CLDN-high samples. Only 7 (1.5%) samples showed high activity in both the luminal and basal metanodes (Table 2).

Table 2 Number of tumors classified in each metanode sparse k-means group and in the cellular classification.

Full size table

Clinical characteristics from the various entities of cellular classification are shown in Table 3. Basal subtype tumors were mostly small-sized, poorly differentiated and without lymph node infiltration. The CLDN-high subtype tumors were large, had poor differentiation and no lymph node infiltration. The CLDN-low as well as the LAR tumors were large, more differentiated and showed more infiltration than the basal and CLDN-high tumors. Cellular classification does not show a significant relationship to RFS (Sup. File 3), nor did basal and luminal metanode activities show prognostic value. CLDN-high tumors showed a trend toward a poorer prognosis than CLDN-low, but again, the differences were not significant.

Table 3 Number of tumors with clinical characteristics.

Full size table

Activity of functional nodes in cellular groups

The activity of the main functional nodes was assessed in each cellular group. CLDN-low tumors had lower activity than every other tumor subgroup in the functional nodes related to alpha-amylase activity and regulation of actin cytoskeleton, and higher activity than the other subgroups in the haptoglobin binding functional node. CLDN-high tumors had lower activity than basal tumors in the actin binding functional node, higher activity than tumors belonging to any other subgroup in chemokine activity functional node and lower activity than CLDN-low and LAR subtypes in the haptoglobin binding functional node. Basal tumors had higher activity than any other tumor in the functional nodes related to cell adhesion and regulation of the actin cytoskeleton. Finally, LAR tumors had lower activity in the nodes related to cell adhesion, G1/S transition of mitotic cell cycle and chemokine activity (Sup. File 4).

Immune metanode activity: Immune characteristics

On the other hand, taking the immune metanode into account, tumors were split according to their immune (IM) activity. High/low immune activity was defined with the sparse K-means method using genes included in the IM metanode. Some 259 (52%) samples were included in the IM-positive (IM+) group and 235 (48%) were included in the IM-negative (IM−) group (Table 4).

Table 4 Immune characteristic interaction with cellular classification. According to the chi-squared test, IM characteristics and cellular classification are dependent.

Full size table

IM+ tumors had a better prognosis than IM- tumors (hazard ratio [HR], 0.7286; 95% confidence interval [CI] 0.5329–0.9961; P < 0.05) (Fig. 4A). In addition, the immune metanode activity had a prognostic impact on the groups defined by the cellular classification. Patients with IM+/LAR subtype tumors had a better prognosis than those with IM−/LAR tumors (HR, 0.3474; 95% CI 0.1657–0.7284; P < 0.05). Also, patients with IM+/CLDN-high tumors had a better prognosis than those with IM−/CLDN−, although these differences did not reach statistical significance (HR, 0.3556; 95% CI 0.04115–0.9828; P = 0.057). IM activity had no impact on the prognosis of the basal and CLDN-low subtypes (Fig. 4B).

Comparison between Cellular classification and PAM50, TNBC4-type and Burstein’s classifications

Cellular classification and previous classifications were compared (Fig. 5). The basal subtype is highly enriched in basal-like immune suppressed (BLIS) and basal-like immune associated (BLIA) (Burstein 2015), basal (PAM50 + CLDN-low) and M (Lehmann 2016) subtypes, and it is poorly represented in the LAR subtypes from the Burstein and Lehmann classifications. The CLDN-high subtype is highly enriched in BLIA (Burstein 2015) and BL2 (Lehmann 2016). The CLDN-low subtype is highly enriched in MES (Burstein 2015), LumA (PAM50 + CLDN-low) and BL2 (Lehmann 2016). The LAR subtype is highly enriched in LAR (Burstein 2015), LumA (PAM50 + CLDN-low) and LAR (Lehmann 2016). The LAR subtype is not present in Basal (PAM50) and BL1 (Lehmann) assignations (Fig. 5 and Table 5).

Table 5 Shows comparisons between Cellular classification and PAM50, Lehmann’s and Burstein’s classifications.

Full size table

Immune characteristics and previous classifications

The Mesenchymal subtype from the TNBC4 type⁷ was highly enriched in IM- samples (148 samples of 187, 80% of all M subtype samples). Also, BL2 was enriched in IM+ samples (135 samples of 185, 72% of all BL2 subtype samples). The IM+ and IM- groups showed no prognostic value for the BL1, BL2 and M groups (Fig. 6). However, patients with IM+ tumors had better prognosis than those with IM− in the LAR group (HR, 0.2896; 95% CI 0.1125–0.7273; P < 0.05).

The IM+ and IM- subgroups were evenly distributed in the subtypes defined by PAM50 and CLDN-low, with the exception of the HER2 subtype, which was enriched in IM+ (Table 6).

Table 6 Shows immune characteristics in the PAM50+CLDN-low subgroups.

Full size table

LumA immune-positive tumors had a better prognosis than immune-negative tumors (HR, 2.638; 95% CI 1.098–6.341; P < 0.05). Basal Immune and normal-like immune-positive tumors also showed a trend toward a better prognosis than immunenegative, but the differences were not statistically significant. Finally CLDN-low, LumB and HER2 tumors showed no differences in prognosis related to their immune status (Fig. 7).

Finally, the Burstein subtype BLIA was highly enriched in the IM+ (106 samples of 140, 75%) and the BLIS was highly enriched in the IM- tumors (119 samples of 156, 76%).

Immune-positive and immune-negative tumors had different outcomes in each of the Burstein’s subgroups. BLIA, BLIS and LAR immune-positive tumors as well as MES immune-negative tumors had a better prognosis, although the differences were not statistically significant (Fig. 8).

Implications of the cellular classification and the immune characteristic in response to neoadjuvant treatment

Cellular classification was transferred using genes from the basal and luminal metanodes and the CLDN-enriched functional node. Of 153 triple-negative breast cancer tumors, 79 were assigned to the basal subgroup (51%), 8 were assigned to the CLDN-high subgroup (5%), 19 were assigned to the CLDN-low subgroup (12%) and 47 were assigned to the LAR subgroup (31%). The immune characteristic was transferred using genes from the immune metanode. Some 80 samples were immune-negative (52%) and 73 samples were assigned to the immune-positive subgroup (47%) (Table 7).

Table 7 Shows the cellular classification and the immune characteristic in the neoadjuvant dataset.

Full size table

The CLDN-high subgroup presented the poorest prognosis among the cellular classification subgroups. Immune-positive tumors had a better prognosis (Fig. 9).

Discussion

TNBC constitutes a heterogeneous disease with various molecular entities. The study of this heterogeneity has thus far not conferred significant advances in the treatment of patients. The application of probabilistic graphical models (PGMs) provides deep insight into high-throughput data¹⁸. In the present study, we used PGMs to unravel specific molecular information concerning various biological entities, such as the immune status or the developmental point when the breast stem cell turns carcinogenic.

Previous studies used differences in gene expression to define TNBC subtypes^3,6,7,8,10. Subtypes emerged from clustering methods such as HCL or non-negative matrix factorization, which group genes around specific functions. On the contrary, we hereby applied an unsupervised analysis, without knowledge of the functions of the genes selected in each step of the process. We ultimately identified the genes involved in 26 different molecular functions, which agreed with the metagenes described by Rody et al.⁵. This approach provides two different classifications (immune and cellular), each related to particular genes and functions.

Once the PGM functional structure was established, we defined four subgroups: CLDN-low, CLDN-high, basal-like and LAR, agreeing with the cancer stem cell hypothesis^2,13,14,15. These four groups identify the point of the differentiation process where the stem cell becomes carcinogenic: the less differentiated tumors will be CLDN-low, and the most differentiated tumors will be LAR.

Functional node activities confirm that there are differences among cellular subgroups, and some of these differences could have therapeutic utility. For example, the activity of node 19 (PPAR signaling pathway) showed meaningful differences between the CLDN-low subgroup and the other three, suggesting that PPAR-directed therapies might have a different effect on the CLDN-low subgroup. Finally, we observed that cellular subgroups had different clinical features.

On the other hand, the immune layer was described in this study as a compendium of functional nodes, each of which related to a specific immune function. However, when taking all these nodes together as a metanode we were able to establish an immune classification with prognostic value among all the series.

The immune and cellular classifications reflected unrelated biological identities. As shown in Fig. 4, the LAR and CLDN-high subgroups presented different prognoses when split by the immune layer. LAR immune-negative tumors were associated with a 30% 5-year survival rate compared with 70% in the LAR immune-positive group. The immune-based subtype might also influence the response to immunotherapy. Ongoing trials are evaluating anti-PD1 antibodies in breast cancer, particularly in triple-negative disease²⁴. It would be interesting to assess the efficacy of anti-PD1 therapy in subtypes defined by immune layer.

We also compared the cellular classification with other classifications previously described^7,8,10. LAR is overrepresented in every luminal subgroup regardless of the classification, which demonstrates that this is a homogeneous and reproducible group. Similarly, the basal cellular subgroup is overrepresented in basal subgroups across classifications. There is also a high correlation (83%) in the CLDN-low cellular groups, which confirms the existence of a CLDN-low subgroup independent of the expression of ER, PR and HER2, as previously suggested¹⁶.

Our results show that immune features appear across different subtypes. Interestingly, the luminal immune-positive group did much better than the luminal immune-negative group. Regardless of the classification^7,8,10, the immune layer added prognostic information to the luminal subtypes. The immune layer had been previously defined as a separate group in these classifications, but it appears to intersect with other biological features, providing additional prognostic value.

With regard to the cellular classification, our CLDN-low cellular subgroup had an 89% concordance with the basal-like 2 Lehmann’s subgroup, which puts BL2 in the stem cell hypothesis context, suggesting that basal-like 2 tumors might be caused by early differentiated carcinogenic stem cells. The CLDN-high subgroup does not appear in other classifications, which suggests that this is an intermediate group between CLDN-low tumors (stem cell not yet expressing CLDN genes) and basal tumors. It might be difficult to draw the line between groups in this continuous, cellular differentiation-based classification, although Burnstein’s basal-like immune-active corresponded to the CLDN-high immune-negative in our classification. Regardless of the classification, there was always a luminal subgroup, one or two basal subgroups and some mesenchymal or CLDN subgroup.

Our classification could also provide some predictive information. CLDN-high tumors had a poor response to neoadjuvant chemotherapy. Much effort has been devoted to the prediction of response to chemotherapy in TNBC. Cell-free DNA²⁵, tumor-infiltrating lymphocytes²⁶, microRNA signatures²⁷ and proteomics²⁸, among others, have recently been proposed as useful methods in this regard. Further research is needed before the cellular classification described in the present paper could be considered in the selection of therapy.

This study has some limitations. The 2010 American Society of Clinical Oncology guidelines established the 1% threshold for the expression of PR and ER²⁹; however, our tumor series was assessed before that date, so we cannot ensure that all the TNBC tumors fulfilled this criterion. Another limitation to our study is that the cellular classification is based on a continuum, which makes it difficult to set categories. Finally, these results should be validated in additional cohorts to evaluate the robustness of our cellular and immune classification. However, we believe that our findings serve as an important hypothesis in generating findings that can be explored in future studies.

Conclusion

In conclusion, the use of probabilistic graphical models in TNBC suggests that there are at least two independent biological layers, cellular and immune. We propose a new way to characterize TNBC taking these two dimensions into account, and leading to the result that the luminal immune-positive subgroup had a better prognosis than the luminal immune-negative.

Availability of Data and Material

The datasets analyzed during the current study, GSE31519 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc = GSE31519], and GSE25066 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc = GSE25066], are available in the GEO Datasets repository.

References

Network, C. G. A. Comprehensive molecular portraits of human breast tumours. Nature 490, 61–70 (2012).
Article ADS Google Scholar
Stingl, J. & Caldas, C. Molecular heterogeneity of breast carcinomas and the cancer stem cell hypothesis. Nat Rev Cancer 7, 791–799 (2007).
Article CAS Google Scholar
Perou, C. M. et al. Molecular portraits of human breast tumours. Nature 406, 747–752 (2000).
Article ADS CAS Google Scholar
Yersal, O. & Barutca, S. Biological subtypes of breast cancer: Prognostic and therapeutic implications. World J Clin Oncol 5, 412–424 (2014).
Article Google Scholar
Rody, A. et al. A clinically relevant gene signature in triple negative and basal-like breast cancer. Breast Cancer Res 13, R97 (2011).
Article Google Scholar
Lehmann, B. D. et al. Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies. J Clin Invest 121, 2750–2767 (2011).
Article CAS Google Scholar
Lehmann, B. D. et al. Refinement of Triple-Negative Breast Cancer Molecular Subtypes: Implications for Neoadjuvant Chemotherapy Selection. PLoS One 11, e0157368 (2016).
Article Google Scholar
Burstein, M. D. et al. Comprehensive genomic analysis identifies novel subtypes and targets of triple-negative breast cancer. Clin Cancer Res 21, 1688–1698 (2015).
Article CAS Google Scholar
Sabatier, R. et al. Kinome expression profiling and prognosis of basal breast cancers. Mol Cancer 10, 86 (2011).
Article Google Scholar
Prat, A. & Perou, C. M. Deconstructing the molecular portraits of breast cancer. Mol Oncol 5, 5–23 (2011).
Article CAS Google Scholar
Jézéquel, P. et al. Gene-expression signature functional annotation of breast cancer tumours in function of age. BMC Med Genomics 8, 80 (2015).
Article Google Scholar
Milioli, H. H., Tishchenko, I., Riveros, C., Berretta, R. & Moscato, P. Basal-like breast cancer: molecular profiles, clinical features and survival outcomes. BMC Med Genomics 10, 19 (2017).
Article Google Scholar
Shipitsin, M. & Polyak, K. The cancer stem cell hypothesis: in search of definitions, markers, and relevance. Lab Invest 88, 459–463 (2008).
Article CAS Google Scholar
Sims, A. H., Howell, A., Howell, S. J. & Clarke, R. B. Origins of breast cancer subtypes and therapeutic implications. Nat Clin Pract Oncol 4, 516–525 (2007).
Article CAS Google Scholar
Allegra, A. et al. The cancer stem cell hypothesis: a guide to potential molecular targets. Cancer Invest 32, 470–495 (2014).
Article Google Scholar
Prat, A. et al. Phenotypic and molecular characterization of the claudin-low intrinsic subtype of breast cancer. Breast Cancer Res 12, R68 (2010).
Article Google Scholar
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 43, e47 (2015).
Article Google Scholar
Gámez-Pozo, A. et al. Combined Label-Free Quantitative Proteomics and microRNA Expression Analysis of Breast Cancer Unravel Molecular Differences with Clinical Implications. Cancer Res 75, 2243–2253 (2015).
Article Google Scholar
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4, 44–57 (2009).
Article CAS Google Scholar
de Velasco, G. et al. Urothelial cancer proteomics provides both prognostic and functional information. Sci Rep 7, 15819 (2017).
Article ADS Google Scholar
Monti, S., Tamayo, P., Mesirov, J. & Golub, T. Consensus Clustering: A Resampling-Based Method for Class Discovery and Visualization of Gene Expression Microarray Data. Machine learning 52, 91–118 (2003).
Article Google Scholar
Witten, D. M. & Tibshirani, R. A framework for feature selection in clustering. J Am Stat Assoc 105, 713–726 (2010).
Article MathSciNet CAS Google Scholar
Saeed, A. I. et al. TM4: a free, open-source system for microarray data management and analysis. Biotechniques 34, 374–378 (2003).
Article CAS Google Scholar
Katz, H. & Alsharedi, M. Immunotherapy in triple-negative breast cancer. Med Oncol 35, 13 (2017).
Article Google Scholar
Sung, J. S. et al. Detection of somatic variants and. Oncotarget 8, 106901–106912 (2017).
PubMed PubMed Central Google Scholar
Wein, L. et al. Clinical Validity and Utility of Tumor-Infiltrating Lymphocytes in Routine Clinical Practice for Breast Cancer Patients: Current and Future Directions. Front Oncol 7, 156 (2017).
Article Google Scholar
García-Vazquez, R. et al. A microRNA signature associated with pathological complete response to novel neoadjuvant therapy regimen in triple-negative breast cancer. Tumour Biol 39, 1010428317702899 (2017).
Article Google Scholar
Gámez-Pozo, A. et al. Prediction of adjuvant chemotherapy response in triple negative breast cancer with discovery and targeted proteomics. PLoS One 12, e0178296 (2017).
Article Google Scholar
Hammond, M. E., Hayes, D. F., Wolff, A. C., Mangu, P. B. & Temin, S. American society of clinical oncology/college of american pathologists guideline recommendations for immunohistochemical testing of estrogen and progesterone receptors in breast cancer. J Oncol Pract 6, 195–197 (2010).
Article Google Scholar

Download references

Acknowledgements

This study was funded by Instituto de Salud Carlos III, Spanish Economy and Competitiveness Ministry, Spain and co-funded by the FEDER program, “Una forma de hacer Europa” (PI15/01310). LT-F is supported by the Spanish Economy and Competitiveness Ministry (DI-15-07614). GP-V is supported by Conserjería de Educación, Juventud y Deporte of Comunidad de Madrid (IND2017/BMD7783).

Author information

Authors and Affiliations

Molecular Oncology & Pathology Lab, INGEMM, La Paz University Hospital Health Research Institute-IdiPAZ, Madrid, Spain
Guillermo Prado-Vázquez, Angelo Gámez-Pozo, Andrea Zapater-Moros, María Ferrer-Gómez, Rocío López-Vacas & Juan Ángel Fresno Vara
R&D department, Biomedica Molecular Medicine SL, Madrid, Spain
Guillermo Prado-Vázquez, Angelo Gámez-Pozo, Lucía Trilla-Fuertes & Andrea Zapater-Moros
Biostatistics Unit, La Paz University Hospital Health Research Institute-IdiPAZ, Madrid, Spain
Mariana Díaz-Almirón
Department of Statistics, Operational Research and Numerical Analysis, National University of Distance Education (UNED), Madrid, Spain
Jorge M. Arevalillo & Hilario Navarro
Department of Statistics and Operations Research, Faculty of Mathematics, Complutense University of Madrid, Madrid, Spain
Paloma Maín
Medical Oncology Service, La Paz University Hospital Health Research Institute-IdiPAZ, Madrid, Spain
Jaime Feliú, Pilar Zamora & Enrique Espinosa
Biomedical Research Networking Center on Oncology-CIBERONC, ISCIII, Madrid, Spain
Jaime Feliú, Enrique Espinosa & Juan Ángel Fresno Vara

Authors

Guillermo Prado-Vázquez
View author publications
You can also search for this author in PubMed Google Scholar
Angelo Gámez-Pozo
View author publications
You can also search for this author in PubMed Google Scholar
Lucía Trilla-Fuertes
View author publications
You can also search for this author in PubMed Google Scholar
Jorge M. Arevalillo
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Zapater-Moros
View author publications
You can also search for this author in PubMed Google Scholar
María Ferrer-Gómez
View author publications
You can also search for this author in PubMed Google Scholar
Mariana Díaz-Almirón
View author publications
You can also search for this author in PubMed Google Scholar
Rocío López-Vacas
View author publications
You can also search for this author in PubMed Google Scholar
Hilario Navarro
View author publications
You can also search for this author in PubMed Google Scholar
Paloma Maín
View author publications
You can also search for this author in PubMed Google Scholar
Jaime Feliú
View author publications
You can also search for this author in PubMed Google Scholar
Pilar Zamora
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Espinosa
View author publications
You can also search for this author in PubMed Google Scholar
Juan Ángel Fresno Vara
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All the authors have directly participated in the preparation of this manuscript and have read and approved the final version submitted. J.M.A., H.N. and P.M. contributed the probabilistic graphical models. M.D.-A. and G.P.-V. contributed the statistical analyses. G.P.-V., L.T.-F., A.Z.-M., M.F.-G. and R.L.-V. performed the probabilistic graphical model interpretation and the gene ontology analyses. G.P.-V. drafted the manuscript. G.P.-V., A.G.-P., J.A.F.V., J.F., P.Z. and E.E. conceived of the study and participated in its design and interpretation. A.G.-P., J.A.F.V., E.E. and L.T.-F. supported the manuscript drafting. A.G.-P. and J.A.F.V. coordinated the study.

Corresponding author

Correspondence to Juan Ángel Fresno Vara.

Ethics declarations

Competing Interests

A.F.V., A.G.-P. and E.E. are shareholders of Biomedica Molecular Medicine S.L. L.T.-F. and G.P.-V. are employees of Biomedica Molecular Medicine S.L. The other authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Sup Files 2-4

Sup. Info 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Prado-Vázquez, G., Gámez-Pozo, A., Trilla-Fuertes, L. et al. A novel approach to triple-negative breast cancer molecular classification reveals a luminal immune-positive subgroup with good prognoses. Sci Rep 9, 1538 (2019). https://doi.org/10.1038/s41598-018-38364-y

Download citation

Received: 11 June 2018
Accepted: 19 December 2018
Published: 07 February 2019
DOI: https://doi.org/10.1038/s41598-018-38364-y

This article is cited by

Mesenchymal-like immune-altered is the fourth robust triple-negative breast cancer molecular subtype
- Pascal Jézéquel
- Hamza Lasla
- Mario Campone
Breast Cancer (2024)
Data projections by skewness maximization under scale mixtures of skew-normal vectors
- Jorge M. Arevalillo
- Hilario Navarro
Advances in Data Analysis and Classification (2020)
Biological molecular layer classification of muscle-invasive bladder cancer opens new treatment opportunities
- Lucía Trilla-Fuertes
- Angelo Gámez-Pozo
- Juan Ángel Fresno Vara
BMC Cancer (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Methods

TNBC gene expression and clinical data

Probabilistic graphical model analysis

Sparse k-means classification

Assignation to groups defined by other molecular classifications

Statistical analyses and software suites

Results

Gene expression and clinical data

Clinical features

Molecular characterization of TNBC

Cellular classification

Activity of functional nodes in cellular groups

Immune metanode activity: Immune characteristics

Comparison between Cellular classification and PAM50, TNBC4-type and Burstein’s classifications

Immune characteristics and previous classifications

Implications of the cellular classification and the immune characteristic in response to neoadjuvant treatment

Discussion

Conclusion

Availability of Data and Material

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links