Functional proteomics outlines the complexity of breast cancer molecular subtypes

Gámez-Pozo, Angelo; Trilla-Fuertes, Lucía; Berges-Soria, Julia; Selevsek, Nathalie; López-Vacas, Rocío; Díaz-Almirón, Mariana; Nanni, Paolo; Arevalillo, Jorge M.; Navarro, Hilario; Grossmann, Jonas; Gayá Moreno, Francisco; Gómez Rioja, Rubén; Prado-Vázquez, Guillermo; Zapater-Moros, Andrea; Main, Paloma; Feliú, Jaime; Martínez del Prado, Purificación; Zamora, Pilar; Ciruelos, Eva; Espinosa, Enrique; Fresno Vara, Juan Ángel

doi:10.1038/s41598-017-10493-w

Download PDF

Article
Open access
Published: 30 August 2017

Functional proteomics outlines the complexity of breast cancer molecular subtypes

Angelo Gámez-Pozo ORCID: orcid.org/0000-0002-8931-0624¹,
Lucía Trilla-Fuertes²,
Julia Berges-Soria¹,
Nathalie Selevsek³,
Rocío López-Vacas¹,
Mariana Díaz-Almirón⁴,
Paolo Nanni ORCID: orcid.org/0000-0001-8429-3557³,
Jorge M. Arevalillo⁵,
Hilario Navarro⁵,
Jonas Grossmann³,
Francisco Gayá Moreno⁴,
Rubén Gómez Rioja⁶,
Guillermo Prado-Vázquez¹,
Andrea Zapater-Moros¹,
Paloma Main⁷,
Jaime Feliú⁸,
Purificación Martínez del Prado⁹,
Pilar Zamora⁸,
Eva Ciruelos¹⁰,
Enrique Espinosa⁸ &
…
Juan Ángel Fresno Vara¹

Scientific Reports volume 7, Article number: 10100 (2017) Cite this article

3066 Accesses
33 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Breast cancer is a heterogeneous disease comprising a variety of entities with various genetic backgrounds. Estrogen receptor-positive, human epidermal growth factor receptor 2-negative tumors typically have a favorable outcome; however, some patients eventually relapse, which suggests some heterogeneity within this category. In the present study, we used proteomics and miRNA profiling techniques to characterize a set of 102 either estrogen receptor-positive (ER+)/progesterone receptor-positive (PR+) or triple-negative formalin-fixed, paraffin-embedded breast tumors. Protein expression-based probabilistic graphical models and flux balance analyses revealed that some ER+/PR+ samples had a protein expression profile similar to that of triple-negative samples and had a clinical outcome similar to those with triple-negative disease. This probabilistic graphical model-based classification had prognostic value in patients with luminal A breast cancer. This prognostic information was independent of that provided by standard genomic tests for breast cancer, such as MammaPrint, OncoType Dx and the 8-gene Score.

Proteomic analysis of archival breast cancer clinical specimens identifies biological subtypes with distinct survival outcomes

Article Open access 16 February 2022

Survival analysis in breast cancer using proteomic data from four independent datasets

Article Open access 18 August 2021

Pan-cancer molecular subtypes revealed by mass-spectrometry-based proteomic characterization of more than 500 human cancers

Article Open access 12 December 2019

Introduction

Breast cancer is a major health issue in developed countries. Early diagnosis and the use of adjuvant therapies have contributed to improve survival; nevertheless, 87,000 women died of breast cancer in the European Union in 2011¹. Knowledge of the molecular biology of breast cancer has recently challenged the way in which oncologists make decisions about systemic treatment².

Breast cancer is a heterogeneous disease comprising a range of entities with various genetic backgrounds. Clinical decisions are currently based on classical factors, such as the extent of the disease and the expression of hormonal receptors and human epidermal growth factor receptor 2 (HER2). Genomic classifications have also been described, the better-known encompassing four major categories: luminal A, luminal B, basal-cell and HER2-enriched³. Most patients included in the categories of estrogen receptor-positive (ER+)/HER2-negative (HER2-) disease with luminal A breast cancer have a favorable prognosis; however, some eventually relapse, which suggests some heterogeneity within these categories. Patients in the categories of triple-negative disease — i.e., no expression of hormonal receptors, HER2- or basal-cell disease — have a poorer prognosis^{4, 5}.

In recent years, proteomic approaches have been incorporated into the study of clinical samples as a way to complement the information provided by classical factors and genomics. Mass spectrometry-based proteomics has emerged as preferred component of a strategy for discovering diagnostic and prognostic protein biomarkers as well as for establishing new therapeutic targets⁶. Although these investigations are encouraging^{7, 8}, the number of tumor biomarkers discovered with this approach is still limited⁹. MicroRNAs are key regulators in the genesis and progression of cancer. MicroRNA profiling, together with genomics and proteomics, could lead to unraveling regulatory networks of biological processes related to cancer¹⁰.

In this study, we used high-throughput proteomics and microRNA profiling to characterize two subtypes of breast cancer with various prognoses: ER+/progesterone receptor-positive (PR+) HER2- breast cancer and triple-negative breast cancer (TNBC). We applied probabilistic graphical models and flux balance analyses to explore molecular differences between these subtypes to unveil differences not detected by immunohistochemistry or genomics.

Results

Patient characteristics

A total of 106 patients with breast cancer from two different hospitals were included in the discovery cohort. All the patients had node-positive disease, all the tumors were negative for HER2 and all had received adjuvant chemotherapy and hormonal therapy for patients with ER+ disease (patients showing estrogen and/or progesterone receptor expression). Forty-six additional patients from a third hospital with ER+ disease and nodal involvement were eligible for the verification cohort: all had received anthracycline-based adjuvant chemotherapy followed by hormone therapy (Table 1 and Sup. Fig. S1).

Table 1 Patient’s characteristics.

Full size table

Protein extraction and shotgun-mass spectrometry analyses of formalin-fixed, paraffin-embedded breast cancer tumors

After mass spectrometry (MS) workflow, 25 TNBCs and 71 ER+ tumors from the discovery cohort were analyzed. Raw data normalization was performed as previously described¹⁰. Four samples were excluded due to poor protein extraction and six were excluded due to data quality. Of 3,239 protein groups identified using Andromeda, 1095 presented at least two unique peptides and detectable expression in at least 75% of the samples in either the ER+ or TNBC groups. No decoy protein passed through these additional filters. Label-free quantification data were obtained using MaxQuant as previously described¹⁰.

Protein expression analyses of breast cancer tumors

Protein expression values were analyzed using Significance Analysis of Microarrays (SAM). A total of 224 proteins were differentially expressed between the ER+ and TNBC samples with a false discovery rate (FDR) <5% (Sup. Table S1). Hierarchical clustering analysis split the samples into two main clusters: cluster I comprised 70.4% of ER+ tumors (labeled ER-true), and cluster II included both ER+ and triple-negative (TN) tumors. The ER+ tumors included in cluster II, representing 29.6% of all ER+ tumors, were labeled as TN-like tumors. The distant metastasis-free survival (DMFS) rate at 5 years was 88.2% for ER-true and 71.4% for TN-like patients (p = 0.21). The clinical evolution of TN-like breast cancer was similar to that of TNBC (DMFS rate at 5 years 65.4%, p = 0.7) (Fig. 1).

Characterization of ER-true and TN-like subtypes

A significance analysis of microarrays (SAM), excluding TNBC tumors, was performed to further characterize ER-true and TN-like subtypes. We found 44 proteins showing differential expression between both subgroups, with an FDR < 5% (Sup. Table S2 and Sup. Fig. 2). Four proteins presented deleted records in Uniprot and were excluded. Among the proteins with higher expression in ER-true tumors, we found 7 extracellular small leucine-rich canonical proteoglycans (SLRPs) (biglycan, decorin, asporin, lumican, prolargin, fibromodulin and osteoglycin), three proteins produced by mast cells (cathepsin G, mast cell carboxypeptidase A and chymase), COEA1, PRDBP, and both the PIP and ZA2G proteins. On the other hand, TN-like tumors showed greater expression of HS90B and STIP1 from the chaperone pathway, EF2 and THEM6 proteins. Gene ontology analyses showed that proteins defining the TN-like subtype were related to cell adhesion processes (Sup. Table S3). Regarding clinical factors, we found that TN-like tumors showed higher molecular grade (G1-2 vs. G3, p = 0.03). No differences between ER-true and TN-like tumors regarding age at diagnosis, tumor size, number of affected nodes, and ER, PR or Ki67 pathological assessment were found.

MicroRNA expression analysis of breast cancer tumors

MicroRNA expression profiling was available for 42 ER-true and 23 TN-like tumors from the discovery cohort. One microRNA was excluded from subsequent analyses due to absence of expression in most of the samples. Nine microRNAs showed significant higher expression in the ER-true compared with the TN-like tumors (p < 0.05; FDR < 5%) (Sup. Fig. S3).

Systems biology of ER+ breast cancer

Both label-free protein quantification and microRNA expression data were available for 16 TNBC and 63 ER+ tumors from the discovery cohort. A probabilistic graphical model was constructed with these values as previously described¹⁰. Differences in functional node activity between ER-true and TN-like tumors were found (Figs 2 and S4). These differences were corroborated in the external dataset (p < 0.05), except for the protein synthesis node. All metabolism and mitochondria nodes present higher activity in TN-like tumors. The “metabolism A” node includes proteins related to glutamine and glucose metabolism and LDHB. The “metabolism B” node includes GAPDH, PGK1, LDHA and pyruvate kinase proteins, among others; and also miR-449a, whose expression showed a negative correlation with the functional node activity (Sup. Fig. S6). The “mitochondria A” node includes proteins related to the mitochondrial oxidation/reduction process, whereas the “mitochondria B” node comprises tricarboxylic acid (TCA) cycle proteins. The “ECM & focal adhesion” node showed higher activity in ER-true tumors, and includes miR-139-5p, miR-149, miR-766, miR-342, miR-214* and miR-31. Both miR-214* and miR-31 expression showed positive correlation with functional node activity (Sup. Fig. S5). The “response and membrane” node includes proteins related to cellular response to external stimuli and cholesterol homeostasis, and shows higher activity in ER-true tumors. The “proteasome” node includes proteins from the proteasome core complex, and showed higher activity in TN-like tumors. This functional node includes miR-489 and miR-99a, although no correlation was found between their expression and functional node activity.

Flux balance analysis of breast tumors

We performed a flux balance analysis (FBA) using the E-Flux algorithm to evaluate the impact of the proteomics profile on tumor growth capability¹¹. Our Recon 2-based model includes 7440 reactions, from which we found gene-protein-reaction (GPR) rule values mediating 1085 reactions. All the tumors fulfilled the Warburg effect, redirecting pyruvate generated by glycolysis and glutaminolysis to lactic fermentation through lactate dehydrogenase. The predicted tumor growth rate was higher in both the TN-like and TNBC tumors compared with the ER-true tumors (Fig. 3).

Targeted proteomics of TN-like/ER-true subtypes

To corroborate the prognostic value of the TN-like/ER-true classification, 33 proteins differentially expressed between TN-like and ER-true subtypes were assessed, using a targeted proteomics approach via selected reaction monitoring (SRM) in a new cohort comprising 46 ER+ breast cancer tumors (Table 1)¹². One sample was excluded due to poor protein extraction and two due to data quality. Nineteen samples from the discovery cohort were also tested. SRM was able to detect differences between ER-true and TN-like samples from the discovery cohort (Sup. Fig. S6). An ER-true/TN-like classifier, including 14 proteins, was used to assign new samples to ER-true or TN-like (sup. info). DMFS rates at 5 years were 81.6% and 57.8% for the ER-true and TN-like groups, respectively (p < 0.17) (Fig. 4).

Assessing ER-true/TN-like subtypes using a meta-genomics external dataset

We used gene expression data from 1296 breast cancer tumors, obtained from public repositories, as an independent cohort to validate the prognostic value of the ER-true/TN-like stratification^{13, 14}. Among them, 935 tumors were ER+ and had follow-up information available. Tumors were labeled as ER-true or TN-like using 35 of 44 proteins from SAM analyses. Survival analyses using 421 tumors with ER+ and node positive characteristics showed that DMFS rates were 81.8% and 72.5% for the ER-true and TN-like groups, respectively (p < 0.005, HR = 0.5769, Sup. Fig. S7).

ER-true/TN-like subtypes and breast cancer molecular subtypes

We applied our TN-like classifier to the entire population and performed survival analyses independently for each breast cancer molecular subtype³. ER-true/TN-like subtyping provided additional prognostic information in luminal A tumors, but not in luminal B, basal or HER2-enriched tumor subtypes (Fig. 5).

ER-true/TN-like subtypes and molecular prognostic signatures

The clinical utility of the ER-true/TN-like subtypes was evaluated in combination with three prognostic gene signatures: the 70-gene signature¹⁵, the Recurrence Score¹⁶ and the 8-gene Score¹⁷. The prognostic value of the three tests in 935 patients with ER+ tumors was corroborated, followed by the application of the ER-true/TN-like class predictor (Fig. 6). The TN-like tumors were associated with a lower DMFS compared with ER-true tumors in each low-risk category, defined by prognostic signatures (Sup. Table S4). The high-risk categories were not further refined through ER-true/TN-like subtyping. The multivariate analyses, including each prognostic signature and the ER-true and TN-like subtypes, showed that the ER-true and TN-like subtypes were related to prognosis, independent of the prognostic gene signatures (Table 2). Multivariate analyses including the ER-true/TN-like subtypes and available clinical variables (grade and N) showed that both grade and ER-true/TN-like subtypes, along with lymph node status, provided significant and independent prognostic information.

Table 2 Univariate and multivariate analyses including clinical variables, prognostic signatures and the TN-like subtype.

Full size table

Discussion

In this study, a new subtype of breast cancer was identified using a proteomics approach. The clinical classification of breast cancer does not fully reflect cancer heterogeneity; thus, individuals receiving the same diagnosis can have markedly different outcomes. Genomics and proteomics approaches complement the information provided by routine determinations, and coupled with new data analysis techniques, they help to expand the information obtained. In this case, information provided by pure protein expression was organized into functional nodes involving specific biological processes and pathways. The new TN-like ER+ subtype defined has molecular features common with TNBC tumors and exhibits a similar clinical evolution. Patients with either TN-like or TNBC tumors have shorter DMFS than patients with ER-true breast cancer. Both SRM verification and meta-validation confirmed the findings obtained in the discovery series. These results might help to explain why the prognosis of patients with ER+ breast cancer is not uniformly favorable.

ER-true tumors present molecular features that could explain the favorable prognosis of this subtype, such as increased expression of proteins related to cell adhesion and greater activity of the “ECM & focal adhesion” node. Increased expression of decorin and lumican in breast cancer is associated with lower tumor size, decreased risk and rate of relapse, positive ER/PR status and better survival^{18, 19}. A stromal gene set including DCN and FBLN1 genes has demonstrated prognostic value independent of clinical information and a proliferation gene set²⁰. COEA1, asporin, osteoglycin and lumican showed increased expression in low-risk vs. high-risk tumors defined by MammaPrint²¹. With regard to miRNAs included in the “ECM & focal adhesion” node, miR-342 expression correlates with ER expression and tamoxifen sensitivity in breast tumors^22,23,24. Both miR-149 and miR-342 have been included in a prognostic signature for breast cancer²⁵. Our results suggest that miR-31 and miR-214* could be indirect regulators of cell attachment function in breast tumors. These results indicate that ER-true tumors harbor a limited metastatic potential compared with TN-like tumors.

There is more limited information on some other molecular features defining ER-true tumors. This subtype has high expression for proteins produced by mast cells, related to ER and PR positivity, low-grade and a good prognosis in breast cancer^26,27,28,29. High expression levels of PIP and AZGP1 genes have been related to a good prognosis and correlate with ER, PR and AR expression^{21, 30,31,32,33,34,35,36,37,38}. ZA2G is part of a panel of 13 proteins predicting recurrence in breast cancer, showing decreased expressions when recurrence occurred³⁹. PRDBP protein appears to dictate the balance between ERK and Akt signaling with consequences for cell metabolism (induction of Warburg metabolism), apoptosis and cell proliferation⁴⁰. Loss of the 11p15 region, where the PRKCDBP gene is located, is common in breast cancer metastases⁴¹.

TN-like tumors showed molecular features associated with an unfavorable prognosis. High HSP90AB1 expression is related to poor overall survival and with an increased distant metastasis relapse rate in breast cancer^{42, 43} which is consistent with our results, showing a higher expression of this protein in TN-like tumors. HS90B has been included in a panel of 13 proteins predicting recurrence in breast cancer³⁹. STIP1 interacts with HS90B in the folding of a number of proteins, including the androgen and estrogen receptors^{44, 45}. Additionally, greater expression of eEF2 was significantly associated with node positivity in breast cancer⁴⁶.

On the other hand, all metabolism and mitochondria nodes had higher activity in the TN-like subtype. The “mitochondria B” and “metabolism B” nodes include proteins related to the TCA cycle and glycolysis, respectively, suggesting that both TN-like and TNBC tumors are highly glycolytic. The TN-like tumors showed high activity in both the “metabolism A” and “mitochondria A” nodes compared with the ER-true and TNBC tumors, suggesting a unique metabolic profile for the TN-like subtype. FBA indicates that all breast cancer types fulfill the Warburg hypothesis and that glutamine-derived αKG refuels the TCA cycle (anaplerosis) and maintains constant levels of biosynthetic precursors, while the surplus turns to lactate⁴⁷. However, ER-true tumors had a predicted growth rate significantly lower than TN-like and TNBC tumors, both of which had comparable growth rates.

Molecular differences between TN-like and ER-true tumors resemble those previously described between ER+ and TNBC tumors¹⁰. SAM analysis identified 44 proteins differentially expressed between both subtypes, 24 of which were also differentially expressed between ER+ and TNBC samples. Moreover, miR-139-5p, miR-149, miR-449a and miR-342 were overexpressed in ER+ tumors with regard to TNBCs^{10, 24, 48, 49}. Interestingly, we found equivalent differences in the “ECM & focal adhesion,” “metabolism B,” “mitochondria B” and “protein synthesis” nodes when comparing ER-true versus TN-like tumors and ER+ versus TNBC tumors. Differences in the “protein synthesis” node could not be confirmed in the external dataset in both analyses, suggesting that some features observed at the protein level do not appear at the gene expression level¹⁰. On the other hand, no differences regarding the “proliferation” node activity between ER-true and TN-like samples were found, although they were present between the ER+ and TNBC tumors. We also found differences not described between ER+ and TNBC tumors: the “mRNA processing” and “protein transport” nodes showed higher activity in ER-true tumors, whereas the “response and membrane” node had higher activity in TN-like tumors.

The TN-like subtype added prognostic information in luminal A disease but not in the other molecular subtypes. Likewise, the TN-like subtype further subdivided low-risk categories defined by gene signatures, such as the 70-gene Score, Recurrence Score and 8-gene Score. These gene signatures are related to cell proliferation, whereas the TN-like subtype primarily depends on other drivers, such as cell attachment and metabolism, thus providing complementary information⁵⁰. New molecular information could improve the accuracy of gene signatures and help to determine the best treatment for patients with ER+ breast cancer. Additionally, the TN-like subtype prognostic information is independent of that provided by clinical variables such as lymph node status and grade. Adjuvant treatment of breast cancer is determined by two main factors: risk of relapse and the molecular characteristics of the tumor. Molecular tools developed in this setting — such as MammaPrint, OncoType or the 8-gene Score — have attempted to optimize the use of adjuvant chemotherapy, which is toxic and benefits a limited number of patients. Patients in the low-risk categories of these gene tests do not require chemotherapy, but our results indicate that these low-risk categories can be further subdivided. The presence of a TN-like subtype worsens the outcome; therefore, chemotherapy should be considered in these patients. The recommendation would be valid for luminal A tumors having features of the TN-like subtype, which could contribute to reducing the number of relapses in this population.

The TN-like subtype provided prognostic information in ER+ disease not only with the original proteomics approach, but also with other techniques, including the translation of proteins back to gene expression. This result supports the robustness of this new breast cancer subtype. In addition, some of the components defining the subtype could become potential therapeutic targets in the future. Hormonal receptors and HER2 are the only molecular features allowing targeted therapy in breast cancer. Gene subtyping into luminal A, luminal B, basal and HER-2 enriched groups has not revealed other features that can be used to develop new drugs. A proteomics approach unravels molecular processes not detected by genomics, with the advantage that proteins are the real effectors of genomic changes.

Our study has some limitations. The discovery series was limited to patients with node-positive disease, who have a poorer outcome than their node-negative counterparts. However, the meta-validation series is more heterogeneous regarding clinical stage, which suggests that the TN-like subtype is a clinical entity and not just a marker of advanced disease. Also, relevant clinical differences in the discovery and verification cohorts did not reach the statistical boundary due to the limited sample size and the fact that many relapses in this group appeared after 5 years of follow-up. This problem was overcome in the in-silico series, which is more representative of a population of patients with breast cancer. On the technical side, despite the informative value of proteomics, there is still room for improvement in the number of proteins detected. Moreover, SRM assays are complex to develop and analyze in comparison with other platforms such as quantitative polymerase chain reaction (qPCR), and its use in the clinical routine is still challenging. Finally, these results should be validated in additional cohorts to evaluate the TN-like subtype robustness.

High-throughput proteomics generate clinically useful protein-based molecular profiles, which can complement information provided by gene expression analysis. In this study, a proteomics approach allowed the identification of a new subtype of breast cancer using FFPE samples. The molecular characteristics of this new subgroup have been assessed using probabilistic graphical models. This subtype is included in the group of hormonal receptor-positive, HER2-negative tumors, but has molecular features and a poor clinical outcome similar to that of TNBC. This new TN-like subtype has the capability to add prognostic information to current clinical practice. Because proteins are the final effectors of genes, some proteins and biological processes defining TN-like tumors could become therapeutic targets. This possibility should be further explored in future studies.

Methods

Sample selection

A total of 106 patients with breast cancer were included in the discovery cohort. FFPE samples were retrieved from the I+12 Biobank (RD09/0076/00118) and from the IdiPAZ Biobank (RD09/0076/00073), both integrated in the Spanish Hospital Biobank Network (RetBioH; www.redbiobancos.es). Forty-six patients were included in the verification cohort, and FFPE samples were retrieved from the Basque Biobank/O+EHUN (RD09/0076/00140). Informed consent was obtained from all the patients. All the experiments were performed in accordance with relevant guidelines and regulations. The histopathological features of each sample were reviewed by an experienced pathologist to confirm diagnosis and tumor content. Eligible samples included at least 50% tumor cells. Approvals from the Ethics Committees of Hospital Doce de Octubre, La Paz University Hospital and Euskadi were obtained for the conduct of the study.

Total protein preparation and digestion

Proteins were extracted from FFPE samples as previously described⁵¹. Briefly, FFPE sections were deparaffinized in xylene and washed twice with absolute ethanol. Protein extracts from the FFPE samples were prepared in 2% sodium dodecyl sulfate (SDS) buffer using a protocol based on heat-induced antigen retrieval⁵². Protein concentration was determined using the MicroBCA Protein Assay Kit (Pierce-Thermo Scientific). Protein extracts (10 µg) were digested with trypsin (1:50) and SDS was removed from digested lysates using Detergent Removal Spin Columns (Pierce). Peptide samples were further desalted using ZipTips (Millipore), dried, and resolubilized in 15 µL of a 0.1% formic acid and 3% acetonitrile solution before MS analysis.

Liquid chromatography - mass spectrometry shotgun analysis

The samples were analyzed on an LTQ-Orbitrap Velos hybrid mass spectrometer (Thermo Fischer Scientific, Bremen, Germany) coupled to a NanoLC-Ultra system (Eksigent Technologies, Dublin, CA, USA) as previously described previously¹⁰. Briefly, after separation, peptides were eluted with a gradient of 5% to 30% acetonitrile in 95 minutes. The mass spectrometer was operated in data-dependent mode (DDA), acquiring a full-scan MS spectra (300–1700 m/z) at a resolution of 30,000 at 400 m/z after accumulation to a target value of 1,000,000, followed by collision-induced dissociation (CID) fragmentation on the 20 most intense signals per cycle. The samples were acquired using internal lock mass calibration on m/z 429.088735 and 445.120025. The acquired raw MS data were processed by MaxQuant (version 1.2.7.4)⁵³, followed by protein identification using the integrated Andromeda search engine⁵⁴. Briefly, spectra were searched against a forward UniProtKB/Swiss-Prot database for human, concatenated to a reversed decoyed FASTA database (NCBI taxonomy ID 9606, release date 2011-12-13). The maximum FDR was set to 0.01 for peptides and 0.05 for proteins. Label-free quantification was calculated on the basis of the normalized intensities (LFQ intensity). Quantifiable proteins were defined as those detected in at least 75% of samples in at least one type of sample (either ER+ or TNBC samples) showing two or more unique peptides. Only quantifiable proteins were considered for subsequent analyses. Protein expression data were log2 transformed, and missing values were replaced using data imputation for label-free data, as explained in Deeb et al.⁵⁵, using default values. Finally, protein expression values were z-score transformed. Batch effects were estimated and corrected using ComBat⁵⁶. All the mass spectrometry raw data files acquired in this study can be downloaded from Chorus (http://chorusproject.org) under the project name Breast Cancer Proteomics.

RNA extraction and MicroRNA expression

RNA isolation from the FFPE tumor specimens and microRNA expression profiling was performed as previously described¹⁰. Briefly, microRNA expression profiling was obtained using a custom TaqMan Array MicroRNA Card (Applied Biosystems) containing 95 FFPE-reliable assays, including four housekeeping miRNAs identified used NorMean⁵⁷. ΔCq values were normalized using two reference miRNAs (hsa-let-7d and hsa-let-7g).

Differential expression analysis of label-free proteomics and microRNA profiling

SAM⁵⁸ was performed to find differentially expressed proteins and miRNAs between sample groups with an FDR below 5%. Hierarchical clusters were constructed with the differentially expressed proteins or miRNAs between predefined samples groups identified by SAM, using Pearson’s correlation and the average-linkage method.

Functional network construction

A functional network to associate miRNAs and protein expression profiles was constructed as previously described¹⁰. Briefly, we chose probabilistic graphical models compatible with high-dimensionality. The result is an undirected graphical model with a local minimum Bayesian Information Criterion (BIC)⁵⁹. Methods are implemented in the open-source statistical programming language R⁶⁰; in particular, the functions minForest and stepw in the gRapHD package⁶¹. To identify functional nodes within the network, we split it into several branches or functional nodes. We then used gene ontology analyses to investigate which function or functions were overrepresented in each branch. To measure the functional activity of each functional node, we calculated the mean expression of all the proteins included in one branch related to a specific function. Differences in functional node activity were assessed by class comparison analyses.

Gene ontology analyses

Protein-to-gene ID conversion was performed using Uniprot (http://www.uniprot.org) and DAVID^{62, 63}. Gene ontology analyses were performed using the functional annotation chart tool provided by DAVID. We used “homo sapiens” as a background list and selected only GOTERM-FAT gene ontology categories and Biocarta, KEGG and Panther pathways.

Flux balance analyses

Flux balance analysis (FBA) is a widely used approach for studying biochemical networks by calculating the flow of metabolites through the network, including 7440 reactions from Recon 2⁶⁴. With this method, it is possible to predict the growth rate of an organism or the rate of production of a metabolite⁶⁵. The estimation of the GPR rule values was performed using a variation of the method described by Barker et al.⁶⁶. The mathematical operations used to calculate the numerical value were the sums for “OR” expressions and minimums for “AND” expressions. Finally, the GPR rule values were normalized, dividing by the maximum value in each tumor, and were included in the Recon 2 model using the E-Flux algorithm¹¹. Normalized GPR rule values have been used to establish both lower and upper reaction bounds if the reaction is reversible. If the reaction is irreversible, low bound is set to 0 in all cases. To calculate biomass production, the biomass objective function included in Recon 2 was optimized. FBA was performed using the COBRA Toolbox available for MATLAB⁶⁷.

Selected reaction monitoring analyses

The SRM design was based on both experimental data from our shotgun analysis and the PeptideAtlas⁶⁸. SRM-triggered MS2 was performed on a QTRAP 5500 instrument (ABSciex, Concord, Ontario), and SRM measurements were analyzed on a TSQ Vantage Triple Quadrupole Mass Spectrometer (ThermoFisher, San Jose, CA, USA), both equipped with a nanoelectrospray ion source. Chromatographic separations of peptides were performed on a NanoLC-2D HPLC system (Eksigent, Dublin, CA) coupled to a 15-cm fused silica emitter, 75-μm diameter, packed with a ReproSil-Pur C18-AQ 120 A and 1.9-μm resin (Dr. Maisch HPLC GmbH). Peptides were loaded on the column from a cooled (4 °C) Eksigent autosampler and separated with a linear gradient of acetonitrile/water, containing 0.1% formic acid, at a flow rate of 300 nl/min. A gradient from 5% to 35% acetonitrile in 40 minutes was used. For the SRM-triggered MS2 measurements, MS2 spectra were recorded upon detection of an SRM trace above a threshold of 1000 ion counts. An average of 100 transitions (scan time 10 ms/transition) per run was used and Q1 and Q3 were obtained at 0.7 unit mass resolution. MS2 spectra were recorded in enhanced product ion (EPI) mode for the highest MRM transitions, using dynamic fill time, Q1 resolution unit, scan speed 10,000 amu/s, m/z range 300–1000. Collision energies used for both acquisition modes were calculated according to the formulas: CE = 0.044 * m/z + 5.5 and CE = 0.051 * m/z + 4 (CE: collision energy; m/z: mass-to-charge ratio of the precursor ion) for doubly and triply charged precursor ions, respectively. In SRM, the mass spectrometer was operated in SRM scan mode, in which Q1 and Q3 were obtained at 0.7 unit mass resolution. Collision energies for each transition were calculated according to the following equations: CE = 0.034 * (m/z) + 3.314 and CE = 0.044 * (m/z) + 3.314 for doubly and triply charged precursor ions, respectively. Three SRM transitions were monitored for each endogenous (light) and internal standard (heavy) peptide. SRM data were processed using SRM skyline software⁶⁹. Peptides with the following criteria were used for the quantification: (i) correlation between ion ratios obtained for the heavy and the light form; (ii) correlation between the ion ratios obtained for both forms and the ion ratios obtained in the MS/MS spectra present in the SRM spectral library; and (iii) transition intensities of the heavy and the light form of >10. The three transitions for each heavy-light pair were used to quantify the peptide unless signals of coeluting interferences were detected. Punctual measurements for light peptides below the background measurement value were ignored. A light/heavy peptide ratio was calculated for all transitions. Protein expression values were calculated by the median expression from the three transitions for each heavy-light pair of their peptides.

Development of classifiers

We developed protein expression-based signatures to predict the class of future samples using the compound covariate predictor. The model incorporates proteins that were differentially expressed among classes at the 0.05 significance level as assessed by the random variance t-test⁷⁰, with protein-to-gene ID positive in the meta-validation dataset (see below). We estimated the prediction error of each model using leave-one-out cross-validation (LOOCV)⁷¹. For each LOOCV training set, the entire model building process was repeated, including the gene selection process. We also evaluated whether the cross-validated error rate estimate for a model is significantly less than the random prediction. The class labels were randomly permuted and the entire LOOCV process was repeated. The significance level is the proportion of the random permutations that gave a cross-validated error rate no greater than the cross-validated error rate obtained with the original data. The same workflow was performed using the SRM data. For more details, see the Simon R and Lam A. BRB-ArrayTools User Guide, version 3.2. BRB-ArrayTools v4.2.1, developed by R. Simon and A. Peng.

External dataset validation

A total of 1296 primary breast carcinoma data were collected from two independent datasets^{13, 14}. The Guedj dataset and associated clinical annotations were downloaded from the ArrayExpress Archive (http://www.ebi.ac.uk/arrayexpress/). The Miller dataset and associated clinical annotations were downloaded from the Cancer Research website. Batch effects were corrected using ComBat⁵⁶. Protein-to-gene ID was performed using Uniprot (http://www.uniprot.org) and DAVID^{62, 63}. All the probes in the dataset for each gene were retrieved. Probes with higher coefficients of variation were selected when multiple probes were found for a single gene, then expression values of each gene were z-score transformed. Samples with clinical characteristics similar to those in our discovery cohort were then assigned to various groups using the developed predictor. The 70-gene signature¹⁵, Recurrence Score¹⁶ and 8-gene Score predictors were calculated for all the samples in the dataset as described previously^{14, 17, 72}. Molecular subtype annotation was performed using the Single Sample Predictor described by Hu et al.^{72, 73}. To apply protein expression-based signatures to gene expression values, per-gene normalization was applied as previously described¹⁷.

Statistical analyses and software suites

Survival curves were estimated using a Kaplan-Meier analysis and compared with the log-rank test, using DMFS at 5 years as the end point. Univariate and multivariate Cox proportional hazard analyses were also employed to evaluate the defined prognosis predictors. Correlations were assessed using Pearson’s r and linear regression. The SPSS v16 software package, GraphPad Prism 5.0 and R v2.15.2 (with the Design software package 0.2.3) were used for all the statistical analyses. Correlation between node activity and microRNA expression was evaluated using linear regression analyses and Pearson’s correlation. Comparisons between different populations’ characteristics were assessed using Fisher’s exact test, the chi-squared test or the Mann-Whitney test as appropriate. All p-values were two-sided, and p < 0.05 was considered statistically significant. Expression data and network analyses were performed with the MeV and Cytoscape software suites^{74, 75}. Class comparison analyses were performed using BRB-ArrayTools v4.2.1.

References

Malvezzi, M. et al. European cancer mortality predictions for the year 2011. Ann Oncol 22, 947–956, doi:10.1093/annonc/mdq774 (2011).
Article CAS PubMed Google Scholar
Espinosa, E. et al. The present and future of gene profiling in breast cancer. Cancer Metastasis Rev 31, 41–46, doi:10.1007/s10555-011-9327-7 (2012).
Article CAS PubMed Google Scholar
Perou, C. M. et al. Molecular portraits of human breast tumours. Nature 406, 747–752, doi:10.1038/35021093 (2000).
Article ADS CAS PubMed Google Scholar
Parker, J. S. et al. Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol 27, 1160–1167, doi:10.1200/JCO.2008.18.1370 (2009).
Article PubMed PubMed Central Google Scholar
Prat, A., Ellis, M. J. & Perou, C. M. Practical implications of gene-expression-based assays for breast oncologists. Nat Rev Clin Oncol 9, 48–57, doi:10.1038/nrclinonc.2011.178 (2012).
Article CAS Google Scholar
Hanash, S. Disease proteomics. Nature 422, 226–232, doi:10.1038/nature01514 (2003).
Article ADS CAS PubMed Google Scholar
Marko-Varga, G. et al. Personalized medicine and proteomics: lessons from non-small cell lung cancer. J Proteome Res 6, 2925–2935, doi:10.1021/pr070046s (2007).
Article CAS Google Scholar
Pastwa, E., Somiari, S. B., Czyz, M. & Somiari, R. I. Proteomics in human cancer research. Proteomics Clin Appl 1, 4–17 (2007).
Article CAS PubMed Google Scholar
Rifai, N., Gillette, M. A. & Carr, S. A. Protein biomarker discovery and validation: the long and uncertain path to clinical utility. Nat Biotechnol 24, 971–983, doi:10.1038/nbt1235 (2006).
Article CAS PubMed Google Scholar
Gamez-Pozo, A. et al. Combined label-free quantitative proteomics and microRNA expression analysis of breast cancer unravel molecular differences with clinical implications. Cancer Research, doi:10.1158/0008-5472.CAN-14-1937 (2015).
Colijn, C. et al. Interpreting expression data with metabolic flux models: predicting Mycobacterium tuberculosis mycolic acid production. PLoS Comput Biol 5, e1000489, doi:10.1371/journal.pcbi.1000489 (2009).
Article MathSciNet PubMed PubMed Central Google Scholar
Picotti, P., Bodenmiller, B., Mueller, L. N., Domon, B. & Aebersold, R. Full Dynamic Range Proteome Analysis of S. cerevisiae by Targeted Proteomics. Cell 138, 795–806, doi:10.1016/j.cell.2009.05.051 (2009).
Article CAS PubMed PubMed Central Google Scholar
Guedj, M. et al. A refined molecular taxonomy of breast cancer. Oncogene 31, 1196–1206, doi:10.1038/onc.2011.301 (2012).
Article CAS PubMed Google Scholar
Miller, L. D. et al. An iron regulatory gene signature predicts outcome in breast cancer. Cancer Res 71, 6728–6737, doi:10.1158/0008-5472.CAN-11-1870 (2011).
Article CAS PubMed PubMed Central Google Scholar
van de Vijver, M. J. et al. A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med 347, 1999–2009, doi:10.1056/NEJMoa021967 (2002).
Article PubMed Google Scholar
Paik, S. et al. A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N Engl J Med 351, 2817–2826, doi:10.1056/NEJMoa041588 (2004).
Article CAS PubMed Google Scholar
Sanchez-Navarro, I. et al. An 8-gene qRT-PCR-based gene expression score that has prognostic value in early breast cancer. BMC Cancer 10, 336, doi:10.1186/1471-2407-10-336 (2010).
Article CAS PubMed PubMed Central Google Scholar
Troup, S. et al. Reduced expression of the small leucine-rich proteoglycans, lumican, and decorin is associated with poor outcome in node-negative invasive breast cancer. Clin Cancer Res 9, 207–214 (2003).
CAS PubMed Google Scholar
Cawthorn, T. R. et al. Proteomic analyses reveal high expression of decorin and endoplasmin (HSP90B1) are associated with breast cancer metastasis and decreased survival. PloS one 7, e30992, doi:10.1371/journal.pone.0030992 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Mefford, D. & Mefford, J. Stromal genes add prognostic information to proliferation and histoclinical markers: a basis for the next generation of breast cancer gene signatures. PloS one 7, e37646, doi:10.1371/journal.pone.0037646 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Muraoka, S. et al. Strategy for SRM-based verification of biomarker candidates discovered by iTRAQ method in limited breast cancer tissue samples. Journal of proteome research 11, 4201–4210, doi:10.1021/pr300322q (2012).
Article CAS PubMed Google Scholar
Cittelly, D. M. et al. Downregulation of miR-342 is associated with tamoxifen resistant breast tumors. Mol Cancer 9, 317, doi:10.1186/1476-4598-9-317 (2010).
Article CAS PubMed PubMed Central Google Scholar
Miller, T. E. et al. MicroRNA-221/222 confers tamoxifen resistance in breast cancer by targeting p27Kip1. J Biol Chem 283, 29897–29903, doi:10.1074/jbc.M804612200 (2008).
Article CAS PubMed PubMed Central Google Scholar
He, Y. J. et al. miR-342 is associated with estrogen receptor-alpha expression and response to tamoxifen in breast cancer. Exp Ther Med 5, 813–818, doi:10.3892/etm.2013.915 (2013).
Article PubMed PubMed Central Google Scholar
Perez-Rivas, L. G. et al. A microRNA signature associated with early recurrence in breast cancer. PLoS One 9, e91884, doi:10.1371/journal.pone.0091884 (2014).
Article ADS PubMed PubMed Central Google Scholar
Dabiri, S. et al. The presence of stromal mast cells identifies a subset of invasive breast cancers with a favorable prognosis. Mod Pathol 17, 690–695, doi:10.1038/modpathol.3800094 (2004).
Article PubMed Google Scholar
Rajput, A. B. et al. Stromal mast cells in invasive breast cancer are a marker of favourable prognosis: a study of 4,444 cases. Breast Cancer Res Treat 107, 249–257, doi:10.1007/s10549-007-9546-3 (2008).
Article Google Scholar
Amini, R. M. et al. Mast cells and eosinophils in invasive breast carcinoma. BMC cancer 7, 165, doi:10.1186/1471-2407-7-165 (2007).
Article PubMed Central Google Scholar
della Rovere, F. et al. Mast cells in invasive ductal breast cancer: different behavior in high and minimum hormone-receptive cancers. Anticancer Res 27, 2465–2471 (2007).
PubMed Google Scholar
Baniwal, S. K., Chimge, N. O., Jordan, V. C., Tripathy, D. & Frenkel, B. Prolactin-induced protein (PIP) regulates proliferation of luminal A type breast cancer cells in an estrogen-independent manner. PloS one 8, e62361, doi:10.1371/journal.pone.0062361 (2014).
Article ADS PubMed Google Scholar
Darb-Esfahani, S. et al. Gross cystic disease fluid protein 15 (GCDFP-15) expression in breast cancer subtypes. BMC cancer 14, 546, doi:10.1186/1471-2407-14-546 (2014).
Article PubMed PubMed Central Google Scholar
Luo, M. H. et al. Expression of mammaglobin and gross cystic disease fluid protein-15 in breast carcinomas. Hum Pathol 44, 1241–1250, doi:10.1016/j.humpath.2012.10.009 (2013).
Article CAS PubMed Google Scholar
Parris, T. Z. et al. Clinical implications of gene dosage and gene expression patterns in diploid breast carcinoma. Clin Cancer Res 16, 3860–3874, doi:10.1158/1078-0432.CCR-10-0889 (2010).
Article CAS PubMed Google Scholar
Parris, T. Z. et al. Additive effect of the AZGP1, PIP, S100A8 and UBE2C molecular biomarkers improves outcome prediction in breast carcinoma. Int J Cancer 134, 1617–1629, doi:10.1002/ijc.28497 (2014).
Article CAS PubMed Google Scholar
Jablonska, K. et al. Prolactin-induced protein as a potential therapy response marker of adjuvant chemotherapy in breast cancer patients. American journal of cancer research 6, 878–893 (2016).
PubMed PubMed Central Google Scholar
Naderi, A. & Meyer, M. Prolactin-induced protein mediates cell invasion and regulates integrin signaling in estrogen receptor-negative breast cancer. Breast cancer research: BCR 14, R111, doi:10.1186/bcr3232 (2012).
Article CAS PubMed PubMed Central Google Scholar
Naderi, A. & Vanneste, M. Prolactin-induced protein is required for cell cycle progression in breast cancer. Neoplasia 16(329–342), e321–314, doi:10.1016/j.neo.2014.04.001 (2014).
Google Scholar
Lehmann-Che, J. et al. Molecular apocrine breast cancers are aggressive estrogen receptor negative tumors overexpressing either HER2 or GCDFP15. Breast cancer research: BCR 15, R37, doi:10.1186/bcr3421 (2013).
Article CAS PubMed PubMed Central Google Scholar
Johansson, H. J. et al. Proteomics profiling identify CAPS as a potential predictive marker of tamoxifen resistance in estrogen receptor positive breast cancer. Clin Proteomics 12, 8, doi:10.1186/s12014-015-9080-y (2015).
Article PubMed PubMed Central Google Scholar
Hernandez, V. J. et al. Cavin-3 dictates the balance between ERK and Akt signaling. Elife 2, e00905, doi:10.7554/eLife.00905 (2013).
Article PubMed PubMed Central Google Scholar
Wikman, H. et al. Clinical relevance of loss of 11p15 in primary and metastatic breast cancer: association with loss of PRKCDBP expression in brain metastases. PloS one 7, e47537, doi:10.1371/journal.pone.0047537 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Cheng, Q. et al. Amplification and high-level expression of heat shock protein 90 marks aggressive phenotypes of human epidermal growth factor receptor 2 negative breast cancer. Breast cancer research: BCR 14, R62, doi:10.1186/bcr3168 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Pick, E. et al. High HSP90 expression is associated with decreased survival in breast cancer. Cancer research 67, 2932–2937, doi:10.1158/0008-5472.CAN-06-4511 (2007).
Article CAS PubMed Google Scholar
Echeverria, P. C., Bernthaler, A., Dupuis, P., Mayer, B. & Picard, D. An interaction network predicted from public data as a discovery tool: application to the Hsp90 molecular chaperone machine. PloS one 6, e26044, doi:10.1371/journal.pone.0026044 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Scheufler, C. et al. Structure of TPR domain-peptide complexes: critical elements in the assembly of the Hsp70-Hsp90 multichaperone machine. Cell 101, 199–210, doi:10.1016/S0092-8674(00)80830-2 (2000).
Article CAS PubMed Google Scholar
Meric-Bernstam, F. et al. Aberrations in translational regulation are associated with poor prognosis in hormone receptor-positive breast cancer. Breast cancer research: BCR 14, R138, doi:10.1186/bcr3343 (2012).
Article CAS PubMed PubMed Central Google Scholar
DeBerardinis, R. J. et al. Beyond aerobic glycolysis: transformed cells can engage in glutamine metabolism that exceeds the requirement for protein and nucleotide synthesis. Proc Natl Acad Sci USA 104, 19345–19350, doi:10.1073/pnas.0709747104 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Krishnan, K. et al. miR-139-5p is a regulator of metastatic pathways in breast cancer. Rna 19, 1767–1780, doi:10.1261/rna.042143.113 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lowery, A. J. et al. MicroRNA signatures predict oestrogen receptor, progesterone receptor and HER2/neu receptor status in breast cancer. Breast Cancer Res 11, R27, doi:10.1186/bcr2257 (2009).
Article PubMed PubMed Central Google Scholar
Gyorffy, B. et al. Multigene prognostic tests in breast cancer: past, present, future. Breast Cancer Res 17, 11, doi:10.1186/s13058-015-0514-2 (2015).
Article PubMed PubMed Central Google Scholar
Gamez-Pozo, A. et al. Shotgun proteomics of archival triple-negative breast cancer samples. Proteomics Clin Appl 7, 283–291, doi:10.1002/prca.201200048 (2013).
Article CAS PubMed Google Scholar
Gamez-Pozo, A. et al. Protein phosphorylation analysis in archival clinical cancer samples by shotgun and targeted proteomics approaches. Mol Biosyst 7, 2368–2374, doi:10.1039/c1mb05113j (2011).
Article CAS PubMed Google Scholar
Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 26, 1367–1372, doi:10.1038/nbt.1511 (2008).
Article CAS PubMed Google Scholar
Cox, J. et al. Andromeda: a peptide search engine integrated into the MaxQuant environment. J Proteome Res 10, 1794–1805, doi:10.1021/pr101065j (2011).
Article ADS CAS PubMed Google Scholar
Deeb, S. J., D’Souza, R. C. J., Cox, J., Schmidt-Supprian, M. & Mann, M. Super-SILAC Allows Classification of Diffuse Large B-cell Lymphoma Subtypes by Their Protein Expression Profiles. Molecular & Cellular Proteomics 11, 77–89, doi:10.1074/mcp.M111.015362 (2012).
Article CAS Google Scholar
Johnson, W. E., Li, C. & Rabinovic, A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 8, 118–127, doi:10.1093/biostatistics/kxj037 (2007).
Article PubMed MATH Google Scholar
Sanchez-Navarro, I. et al. Comparison of gene expression profiling by reverse transcription quantitative PCR between fresh frozen and formalin-fixed, paraffin-embedded breast cancer tissues. Biotechniques 48, 389–397, doi:10.2144/000113388 (2010).
Article CAS PubMed Google Scholar
Tusher, V. G., Tibshirani, R. & Chu, G. Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA 98, 5116–5121, doi:10.1073/pnas.091062498 (2001).
Article ADS CAS PubMed Central MATH Google Scholar
Schwarz, G. Estimating the dimension of a model. Annals of Statistics 6, 461–464 (1978).
Article ADS MathSciNet MATH Google Scholar
R Core Team. (R Foundation for Statistical Computing, Vienna, Austria., 2013).
Abreu, G. C. G., Edwards, D. & Labouriau, R. High-Dimensional Graphical Model Search with the gRapHD R Package. Journal of Statistical Software 37, 1–18 (2010).
Article Google Scholar
Huang da, W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4, 44–57, doi:10.1038/nprot.2008.211 (2009).
Article PubMed Google Scholar
Huang da, W., Sherman, B. T. & Lempicki, R. A. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 37, 1–13, doi:10.1093/nar/gkn923 (2009).
Article Google Scholar
Thiele, I. et al. A community-driven global reconstruction of human metabolism. Nature biotechnology 31, 419–425, doi:10.1038/nbt.2488 (2013).
Article CAS PubMed Google Scholar
Orth, J. D., Thiele, I. & Palsson, B. O. What is flux balance analysis? Nature biotechnology 28, 245–248, doi:10.1038/nbt.1614 (2010).
Article CAS PubMed Central Google Scholar
Barker, B. E. et al. A robust and efficient method for estimating enzyme complex abundance and metabolic flux from expression data. Computational Biology and Chemistry 59(Part B), 98–112, doi:10.1016/j.compbiolchem.2015.08.002 (2015).
Article CAS PubMed PubMed Central Google Scholar
Schellenberger, J. et al. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0. Nat Protoc 6, 1290–1307, doi:10.1038/nprot.2011.308 (2011).
Article CAS PubMed Central Google Scholar
Deutsch, E. W., Lam, H. & Aebersold, R. PeptideAtlas: a resource for target selection for emerging targeted proteomics workflows. EMBO Rep 9, 429–434, doi:10.1038/embor.2008.56 (2008).
Article CAS PubMed PubMed Central Google Scholar
MacLean, B. et al. Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics 26, 966–968, doi:10.1093/bioinformatics/btq054 (2010).
Article CAS PubMed PubMed Central Google Scholar
Wright, G. W. & Simon, R. M. A random variance model for detection of differential gene expression in small microarray experiments. Bioinformatics 19, 2448–2455, doi:10.1093/bioinformatics/btg345 (2003).
Article CAS PubMed Google Scholar
Simon, R., Radmacher, M. D., Dobbin, K. & McShane, L. M. Pitfalls in the Use of DNA Microarray Data for Diagnostic and Prognostic Classification. Journal of the National Cancer Institute 95, 14–18, doi:10.1093/jnci/95.1.14 (2003).
Article CAS PubMed Google Scholar
Fan, C. et al. Concordance among gene-expression-based predictors for breast cancer. N Engl J Med 355, 560–569, doi:10.1056/NEJMoa052933 (2006).
Article CAS PubMed Google Scholar
Hu, Z. et al. The molecular portraits of breast tumors are conserved across microarray platforms. BMC Genomics 7, 96, doi:10.1186/1471-2164-7-96 (2006).
Article PubMed PubMed Central Google Scholar
Saeed, A. I. et al. TM4: a free, open-source system for microarray data management and analysis. Biotechniques 34, 374–378 (2003).
CAS PubMed Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome research 13, 2498–2504, doi:10.1101/gr.1239303 (2003).
Article CAS PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to acknowledge funding from grants PI12/00444, PI12/01016 and PI15/01310 from the Instituto de Salud Carlos III, Spanish Economy and Competitiveness Ministry, Spain, and co-funded by the FEDER program, “Una forma de hacer Europa”. This study has also been supported by the PRIME-XS project, grant agreement number 262067, funded by the EU’s Seventh Framework Program for Research. AG-P and RL-V are supported by Instituto de Salud Carlos III, and the Spanish Economy and Competitiveness Ministry grants, CA12/00258 and CA12/00264, respectively. We want to particularly acknowledge the patients in this study for their participation and to the IdiPAZ, I+12 and O+EHUN Biobanks for the generous gifts of clinical samples used in this study. LT-F is supported by the Spanish Economy and Competitiveness Ministry (DI-15-07614). IdiPAZ, I+12 and O+EHUN Biobanks are supported by Instituto de Salud Carlos III, Spanish Economy and Competitiveness Ministry (RD09/0076/00073, RD09/0076/00118 and RD09/0076/00140, respectively) and FarmaIndustria, through the Cooperation Program in Clinical and Translational Research of the Community of Madrid and Basque Autonomous Community.

Author information

Authors and Affiliations

Molecular Oncology & Pathology Lab, Institute of Medical and Molecular Genetics-INGEMM, La Paz University Hospital-IdiPAZ, Madrid, Spain
Angelo Gámez-Pozo, Julia Berges-Soria, Rocío López-Vacas, Guillermo Prado-Vázquez, Andrea Zapater-Moros & Juan Ángel Fresno Vara
Biomedica Molecular Medicine SL, Madrid, Spain
Lucía Trilla-Fuertes
Functional Genomics Center Zürich, University of Zürich/ETH Zürich, Zürich, Switzerland
Nathalie Selevsek, Paolo Nanni & Jonas Grossmann
Department of Statistics, Biostatistics Unit, La Paz University Hospital - IdiPAZ, Madrid, Spain
Mariana Díaz-Almirón & Francisco Gayá Moreno
Operational Research and Numerical Analysis, National Distance Education University (UNED), Madrid, Spain
Jorge M. Arevalillo & Hilario Navarro
Medical Laboratory Service, La Paz University Hospital Health Research Institute-IdiPAZ, Madrid, Spain
Rubén Gómez Rioja
Department of Statistics and Operations Research, Faculty of Mathematics, Complutense University of Madrid, Madrid, Spain
Paloma Main
Medical Oncology Service, La Paz University Hospital-IdiPAZ, Madrid, Spain
Jaime Feliú, Pilar Zamora & Enrique Espinosa
Medical Oncology Service, Basurto Hospital, Bilbao, Spain
Purificación Martínez del Prado
Medical Oncology Service, Hospital 12 de Octubre (i+12) Health Research Institute, Madrid, Spain
Eva Ciruelos

Authors

Angelo Gámez-Pozo
View author publications
You can also search for this author in PubMed Google Scholar
Lucía Trilla-Fuertes
View author publications
You can also search for this author in PubMed Google Scholar
Julia Berges-Soria
View author publications
You can also search for this author in PubMed Google Scholar
Nathalie Selevsek
View author publications
You can also search for this author in PubMed Google Scholar
Rocío López-Vacas
View author publications
You can also search for this author in PubMed Google Scholar
Mariana Díaz-Almirón
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Nanni
View author publications
You can also search for this author in PubMed Google Scholar
Jorge M. Arevalillo
View author publications
You can also search for this author in PubMed Google Scholar
Hilario Navarro
View author publications
You can also search for this author in PubMed Google Scholar
Jonas Grossmann
View author publications
You can also search for this author in PubMed Google Scholar
Francisco Gayá Moreno
View author publications
You can also search for this author in PubMed Google Scholar
Rubén Gómez Rioja
View author publications
You can also search for this author in PubMed Google Scholar
Guillermo Prado-Vázquez
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Zapater-Moros
View author publications
You can also search for this author in PubMed Google Scholar
Paloma Main
View author publications
You can also search for this author in PubMed Google Scholar
Jaime Feliú
View author publications
You can also search for this author in PubMed Google Scholar
Purificación Martínez del Prado
View author publications
You can also search for this author in PubMed Google Scholar
Pilar Zamora
View author publications
You can also search for this author in PubMed Google Scholar
Eva Ciruelos
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Espinosa
View author publications
You can also search for this author in PubMed Google Scholar
Juan Ángel Fresno Vara
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All the authors have directly participated in the preparation of this manuscript and have approved the final version submitted. They declare no ethical conflicts of interest. J.B.-S., R.L.-V. and A.G.-P. contributed the RNA and protein extraction. P.N., N.S. and J.G. contributed the mass spectrometry data. J.M.A., H.N. and P.M. contributed the probabilistic graphical models. M.D.-A. and F.G.M. contributed the GPR rule method. P.M.d.P., P.Z., J.F., E.C. and E.E. contributed the clinical data and the analyses related. A.G.-P., G.P.-V., A.Z.-M., and J.B.-S. contributed to the design of the study and the statistical and gene ontology analyses. A.G.-P. drafted the manuscript. L.T.-F. and R.G.-R. contributed the FBA analyses. J.A.F.V., P.M.d.P., P.Z., J.F., E.C. and E.E. conceived of the study and participated in its design and interpretation. J.A.F.V. coordinated the study. All the authors have read and approved the final manuscript.

Corresponding author

Correspondence to Juan Ángel Fresno Vara.

Ethics declarations

Competing Interests

J.A.F.V., E.E. and A.G.-P. are shareholders in Biomedica Molecular Medicine S.L. L.T.-F. is an employee of Biomedica Molecular Medicine S.L. The other authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary information

Supplementary table

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gámez-Pozo, A., Trilla-Fuertes, L., Berges-Soria, J. et al. Functional proteomics outlines the complexity of breast cancer molecular subtypes. Sci Rep 7, 10100 (2017). https://doi.org/10.1038/s41598-017-10493-w

Download citation

Received: 04 April 2017
Accepted: 10 August 2017
Published: 30 August 2017
DOI: https://doi.org/10.1038/s41598-017-10493-w

This article is cited by

Functional proteomics of colon cancer Consensus Molecular Subtypes
- Jaime Feliu
- Angelo Gámez-Pozo
- Lucía Trilla-Fuertes
British Journal of Cancer (2024)
Genome-wide methylation analyses identifies Non-coding RNA genes dysregulated in breast tumours that metastasise to the brain
- Rajendra P. Pangeni
- Ivonne Olivaries
- Mark R. Morris
Scientific Reports (2022)
How is cancer complex?
- Anya Plutynski
European Journal for Philosophy of Science (2021)
Computational models applied to metabolomics data hints at the relevance of glutamine metabolism in breast cancer
- Lucía Trilla-Fuertes
- Angelo Gámez-Pozo
- Juan Ángel Fresno Vara
BMC Cancer (2020)
Biological molecular layer classification of muscle-invasive bladder cancer opens new treatment opportunities
- Lucía Trilla-Fuertes
- Angelo Gámez-Pozo
- Juan Ángel Fresno Vara
BMC Cancer (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.