Molecular classification of breast cancer using the mRNA expression profiles of immune-related genes

Mei, Juan; Zhao, Ji; Fu, Yi

doi:10.1038/s41598-020-61710-y

Download PDF

Article
Open access
Published: 16 March 2020

Molecular classification of breast cancer using the mRNA expression profiles of immune-related genes

Juan Mei¹,
Ji Zhao¹ &
Yi Fu¹

Scientific Reports volume 10, Article number: 4800 (2020) Cite this article

4902 Accesses
7 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Breast cancer is the most lethal cancer in women and displaying a broad range of heterogeneity in terms of clinical, molecular behavior and response to therapy. Increasing evidence demonstrated that immune-related genes were an important source of prognostic information for several types of tumors. In this study, the k-mean clustering was applied to gene expression data from the immune-related genes, two molecular clusters were identified for 1980 breast cancer patients. The prognostic significance of the immune-related genes based classification was confirmed in the log-rank test. These clusters were also associated with immune checkpoints, immune-related features and tumor infiltrating levels. In addition, we used the shrunken centroid algorithm to predict the cluster of a given breast cancer sample, and good predictive results were obtained by this algorithm. These results indicated that the proposed classification method is a promising method, and we hope that this method may improve the treatment stratification of breast cancer in the future.

Machine learning-based cluster analysis of immune cell subtypes and breast cancer survival

Article Open access 03 November 2023

An independent poor-prognosis subtype of breast cancer defined by a distinct tumor immune microenvironment

Article Open access 03 December 2019

A breast cancer classification and immune landscape analysis based on cancer stem-cell-related risk panel

Article Open access 08 December 2023

Introduction

Breast cancer is one of the most aggressive cancers with an estimated 2100000 new cases and 627000 deaths worldwide in 2018¹. During the past years, multidisciplinary treatment regimen, such as surgery, chemotherapy, radiotherapy, hormonal therapy and targeted therapy had been made much progress for breast cancer^2,3. The five-year survival rate of breast cancer was approximately 85% and even worse for breast cancer patients with advanced stage. In recent years, the gene expression profiles in breast cancer patients had been investigated by many studies, and found that this cancer was composed of distinct molecular subtypes^2,4,5,6,7. These distinct molecular subtypes may underlie the high variability of clinical outcomes in breast cancer patients. Therefore, breast cancer should not be considered as a homogeneous entity, and molecular classification of breast cancer into clinically and biologically meaningful subtypes was needed.

At present, several researches had shown that the immune system was one of the determining factors during tumor initiation and progression^8,9,10,11. Several studies illustrated that the presence of tumor infiltrating lymphocytes was often associated with better prognosis several cancer types, including breast cancer^{12,13,14,15,16,17,18,19,20,21}. Thus, inclusion of immune signatures in the molecular subtyping may provide additional information beyond routine prognosis in breast cancer. However, until now, no attempt has been made to use these immune signatures to stratify breast cancer.

In this study, by using the gene expression profiles of immune-related genes with favorable prognosis, the k-means clustering was applied on the breast cancer samples to establish a robust molecular classification. Then, the associations between the molecular clusters and prognosis, clinicopathological factors, immune-related features and tumor infiltrating levels were assessed. The shrunken centroid algorithm was used to classify the clusters by the gene expression profiles of immune-related genes with favorable prognosis as the input parameters, and good predictive results were obtained in this study.

Results

Immune landscape of 17 immune cell types

We first examined whether the tumor infiltrating levels of 17 immune cell types were the prognostic factors in overall survival of breast cancer. The breast cancer patients were classified into two equal groups by using the median the ssGSEA score as the cutoff point. The univariate Cox analysis indicated that the higher tumor infiltrating levels of cytokine receptors, interleukins, and TGFb family member receptor were significantly associated with favorable prognosis in breast cancer patients (Fig. 1A–D). For example, patients with the high tumor infiltrating level of cytokine receptors had about 0.13 reduced risk of death compared with patients with the low tumor infiltrating level of cytokine receptors.

Using the ESTIMATE algorithm, the tumor purity, ESTIMATE score, immune score and stromal score were estimated, and the Spearman’s correlations between them with the tumor infiltrating levels of 17 immune cell types were calculated (Fig. 1E). The tumor purity, ESTIMATE score, immune score and stromal score were weakly to moderately correlated with the tumor infiltrating levels of 17 immune cell types. In addition, most of the 17 immune cell types shown significant correlations with CYT. The significant associations between tumor purity, ESTIMATE score, immune score, stromal score and CYT were also illustrated in Fig. 1F.

The cluster analysis of breast cancer

Then the univariate Cox regression analysis was conducted for assessing the correlation between the expression levels of 2498 immune-related genes and overall survival in the breast cancer cohort. 117 immune-related genes were considered to be correlated with overall survival of breast cancer with the criterion of P-value < 0.05 and hazard ratio (HR) < 1. These survival-related genes were selected for further cluster analysis. The k-means clustering was applied to cluster breast cancer samples based on the expression levels of 117 immune-related genes, and Nbclust testing was applied to determine the optimal number of stable clusters. According to the average silhouette width from the k-means clustering, 2 clusters were chosen as the optimal number of clusters (Fig. 2A). Patients in cluster 1 had significantly longer median overall survival than those in the cluster 2 (180 months versus 127 months; log-rank test P-value < 0.0001) (Fig. 2B). The multivariate Cox regression analysis revealed that the 117 immune-related genes derived clusters, together with progesterone receptor (PGR), HER2, node and size remained an independent prognostic factor (Fig. 2C). We then investigated the distribution of intrinsic molecular subtypes within the clusters. An imbalance in term of intrinsic molecular subtype was noticed (Fig. 2D). Her2 tumors and Luminal B tumors were more likely to be enriched in cluster 2, and Normal like tumors were more likely to be enriched in cluster 1.

Characterization of immune infiltration profiles between two clusters

The relative tumor infiltrating levels of 1980 breast cancer samples were quantified by using mRNA expression data of 2498 immune-related genes related to 17 immune cell types obtained from the ImmPort database (Fig. 3A). As illustrated in Fig. 3A, the cluster 1 samples were marked by the high level of immune infiltration level, whereas by contrast, the cluster 2 samples were characterized by the low immune infiltration level. These results revealed a comprehensive picture of tumor infiltrating levels, the heterogeneity across samples and differences in immune cell types. Then, the differences in the tumor infiltrating levels of 17 immune cell types between two clusters were also investigated. According to the Wilcoxon test, there were statistically significant differences in these immune cell types between the two clusters in the breast cancer patients (Figs. 3B and S1). The mean tumor infiltrating levels of the cluster 1 were significantly higher than those of the cluster 2.

We then compared the gene expression profile of cluster 1 samples with cluster 2 samples by the GSEA algorithm to determine how the tumor infiltrating levels differed between two clusters. 2498 immune-related genes of 17 immune cell types were selected as the reference gene set. The GSEA for the enriched and depleted immune cell types were illustrated in Figs. 4A–D and S2. Compared with the cluster 2 samples, the cluster 1 samples were significantly enriched with antimicrobials, cytokines, cytokine receptors, antigen processing and presentation, natural killer cell cytotoxicity, chemokines, TCR signaling pathway, BCR signaling pathway, interleukins, interleukins receptor, chemokine receptors, TNF family members and TNF family members receptors.

Immune checkpoints are critical modulators in the immune system, allowing the initiation of a productive immune response and preventing the onset of autoimmunity. Among these immune checkpoints, PD-1, PD-L1 and CTLA-4 were the most important immune checkpoints. In this study, we wanted to know whether the expression levels of PD-1, PD-L1 and CTLA4 were different between two breast cancer clusters. For doing this, the Wilcoxon test was applied to calculate the difference between the two breast cancer clusters in the expression levels of PD-1, PD-L1 and CTLA4. As illustrated in Fig. 5A, the expression levels of PD-1, PD-L1 and CTLA4 of the cluster 1 were significantly higher than those of the cluster 2.

The differences between the two breast cancer clusters in CYT, immune score, ESTIMATE score, stromal score and tumor purity were also investigated in the breast cancer patients (Fig. 5B). Among these four indices, the CYT, calculated from the geometric mean of the expression of the genes GZMA and PRF1, was used to reflect the patient’s antitumoral immune cytolytic activity and the immune score was used to reflect the infiltration of leukocytes. As illustrated in Fig. 5B, the average values of the CYT, immune score, ESTIMATE score and stromal score in the cluster 1 were significantly higher than those in the cluster 2. These results were expected, as the tumor infiltrating levels of 17 immune cell types in the cluster 1 were higher than those in the cluster 2, and the CYT, immune score, ESTIMATE score and stromal score were significantly correlated with most of immune cell types (all P-values < 2.20E-16; Wilcoxon test). Tumor purity is defined as the proportion of cancer cells in the tumor tissue²². The average tumor purity of the cluster 2 was significantly higher than that of the cluster 1 (P-value < 2.20E-16; Wilcoxon test). This result was expected, as previous published works suggested that the immune cells were negatively correlated with tumor purity at the pan-cancer level²².

Generation of the breast cancer classifier

In this study, we wanted to build a classifier that could identify the cluster of the breast cancer patients by using the expression profile of 117 immune-related genes. For doing this, the shrunken centroid algorithm²³ that implemented in the R package pamr (version 1.55) was used to learn a classifier for discriminating between cluster 1 and cluster 2. The ten-fold cross-validation was performed to select the optimal threshold for centroid shrinkage. The shrunken centroid algorithm identified a set of 117 signature genes with the most robust model that minimized the overall misclassification error. These 117 signature genes were used to predict the cluster of the breast cancer samples with the misclassification rate of 2.68% of the two tumor clusters (Fig. 6A). The predictive ability of the shrunken centroid algorithm for prediction each cluster was illustrated in Fig. 6B. These predictive results clearly indicated that the shrunken centroid algorithm was suitable to prediction the cluster of breast cancer samples (Fig. 6B).

Discussion

The breast cancer patients often display a heterogeneous clinical outcome. Given the heterogeneity of breast cancer patients, it is important to determine the appropriate treatment for patients diagnosed with breast cancer. Therefore, understanding the heterogeneity of breast cancer is one of the most fundamental goals in breast cancer. In the past few years, using mRNA expression profiles to stratify tumors into different molecular subtypes have been applied in several types of tumors^{24,25,26,27,28}. Here, in this study, by using the mRNA expression profile of 117 immune-related genes with favorable prognostic factor, the k-means clustering was applied to cluster the breast cancer patients without applying any clinical or biological information. By analyzing the associations between the clusters and clinical outcome of breast cancer patients, we found that the 117 immune-related genes derived clusters were significantly associated with the overall survival, and the clusters was an independent prognostic factor in the multivariate Cox proportional-hazard analysis.

Compared with patients in cluster 2, patients in cluster 1 had higher tumor infiltrating levels, CYT, immune score and stromal score. The expression levels of PD-1, PD-L1 and CTLA4 in the cluster 1 were also significantly higher than those in cluster 2. Benefit from the meaningful results of clustering breast cancer patients, we will strive to use the mRNA expression profile of 117 immune-related genes in other tumors to classify patients into distinct clusters in our future work. To this end, a computational model was built to predict the molecular clusters defined by the expression levels of the immune-related genes.

The naïve bayes, logistic, IBK, J48, random forest and libSVM that implemented in Weka (version 3.8.2)²⁹ were applied to compare the predictive results with the shrunken centroid classifier, the default parameters of these algorithms in Weka were used. The expression profile of 117 immune-related genes was used as the parameters of these classifiers. The ten-fold cross-validation was used to evaluate the performance of these classifiers. The overall accuracies of the naïve bayes, logistic, IBK, J48, random forest, libSVM and shrunken centroid were 95.05%, 97.12%, 89.39%, 88.79%, 95.61%, 96.57% and 97.32%, respectively (Table 1). These predictive results clearly demonstrated that the overall accuracy of the shrunken centroid classifier was higher than those of other classifiers, and the shrunken centroid classifier was a promising algorithm in prediction of the clusters of breast cancer patients.

Table 1 The predictive results of different input parameters.

Full size table

In this study, the Minimum Redundancy Maximum Relevance (mRMR)^30,31, the analysis of variance (ANOVA) and Maximum Relevance Maximum Distance (MRMD)^32,33 were applied on the expression profile of 117 immune-related genes, and the top 50 features, 84 features and 112 features were selected by these feature selection algorithms, respectively. These features were used as the input parameters of the naïve bayes, logistic, IBK, J48, random forest and libSVM. In the ten-fold cross validation, the overall accuracies of these algorithms with the default parameters in Weka were shown in Table 1. As shown in Table 1, the predictive results clearly indicated that these feature selection algorithms may improve the predictive results for IBK, J48 and libSVM.

In order to perform the cross platform data examination, the dataset of breast cancer was downloaded from TCGA, 1095 patients were contained in this cohort. Based on the expression levels of 117 immune-related genes, 2 clusters of TCGA breast cancer were identified by the Nbclust. Then, the expression profile of 117 immune-related genes was used as the input parameters of the shrunken centroid algorithm. In the ten-fold cross-validation, the overall accuracy of the shrunken centroid algorithm was 96.27%. The overall accuracies of the naïve bayes, logistic, IBK, J48, random forest, libSVM and shrunken centroid with the default parameter were 88.95%, 92.69%, 91.05%, 89.22%, 95.61%, 95.06% and 96.27%, respectively. Based on these results, we can conclude that our proposed model was suitable to predict the breast cancer patients in other platform data and its performance was better than other algorithms.

In this study, several limitations should be acknowledged. First, in our study, only the METABRIC breast cancer cohort was included in our analysis. Although, 1980 breast cancer patients were included in the METABRIC cohort, the dataset used here represented part but not all of the possible breast cancer presents. Since the TCGA breast cancer cohort and several GEO breast cancer cohorts were available on the website, more breast cancer cohorts were needed to confirm the effectiveness of our analysis. Second, the biological information on the mechanisms behind the immune-related genes was not clear, more experimental researches were needed to further understand their functional roles. Finally, there were more types of survival, such as progression-free survival, disease-free survival, and overall survival in the breast cancer cohorts, however, only the overall survival was used in this study. To yield more comprehensive analysis results for breast cancer patients, more types of survival should be used in our future work.

In summary, by employing the mRNA expression profile of the immune-related genes, our study demonstrated for the first time that the two molecular clusters of breast cancer patients. The approaches described here can conceivably be adapted for other tumors, and will provide a powerful tool for the systematic identification of immune-related biomarkers in clinical oncology. Prospective studies are needed to further validate our findings in prospectively planned clinical trials, and to test its clinical utility in individualized management of breast cancer.

Material and Methods

Breast cancer cohort

The relevant clinical data and gene expression data were retrieved from the METABRIC (Molecular Taxonomy of Breast Cancer International Consortium) breast cancer cohort⁵. Both the METABRIC training cohort and the METABRIC test cohort were used in this study. 1980 breast cancer patients with the clinical data, overall survival data and gene expression data were contained in our final cohort.

The mRNA expression data and clinical data of the breast cancer patients were downloaded from The METABRIC (Molecular Taxonomy of Breast Cancer International Consortium), which is public dataset for breast cancer patients, no experiments on humans and/or the use of human tissue samples were used in our study.

Immune-related genes and immune infiltration signatures

2498 immune-related genes were downloaded from the ImmPort database³⁴. 17 immune gene categories, such as antigen processing and presentation, antimicrobials, BCR signaling pathway, cytokine, interleukins, T-cell receptor signaling pathway, B-cell receptor signaling pathway and TNF family receptors were included in these immune-related genes. Subsequently, the single sample gene set enrichment analysis (ssGSEA)^35,36 was used to calculate the abundance level of each gene category for each sample, and the normalized abundance level was considered as the tumor-infiltrating level (TIL) of each gene category for each sample.

The tumor purity, ESTIMATE score, immune score and stromal score were calculated by the ESTIMATE algorithm⁸. The CYT was calculated as the mean expression level of the granzyme A (GZMA) and perforin 1 (PRF1) for assessing the intratumoral immune cytolytic activity in tumors¹⁰.

Breast cancer patients clustering

The subtypes of the breast cancer patients were identified by using the k-means clustering algorithm and the Nbclust that implemented in the R package factoextra (version 1.0.5) was used to determine the optimal number of stable breast cancer clusters. Silhouette width was computed to confirm the most stable samples within each cluster.

Gene set enrichment analysis (GSEA)

To determine how the immune cell types differ between two breast cancer clusters in the tumor microenvironment, GSEA was performed by the R package clusterProfiler (version 3.4.1)³⁷. All the immune-related genes that downloaded from the ImmPort database³⁴ were selected as the reference gene set. Gene sets with the P-value less than 0.05 after 1000 permutations were considered to be significantly enriched or depleted. The normalized enrichment score (NES) was used to examine gene set enrichment results across different gene sets.

Statistical analysis

Survival differences between two breast cancer clusters were assessed by the Kaplan-Meier estimate, and the differences between them were compared using the two-sided log-rank test. The univariable analysis and multivariate analysis were performed with the Cox proportional-hazards regression model. All statistical analyses were performed using R (version 3.6.1). All of the statistical tests were two-sided, and the significance was defined as P-values being less than 0.05.

References

Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics, 2018. CA Cancer J. Clin. 68, 7–30, https://doi.org/10.3322/caac.21442 (2018).
Article PubMed Google Scholar
Ali, H. R. et al. Genome-driven integrated classification of breast cancer validated in over 7,500 samples. Genome Biol. 15, 431 (2014).
Article PubMed PubMed Central Google Scholar
Ciriello, G. et al. Comprehensive molecular portraits of invasive lobular breast cancer. Cell. 163, 506–519 (2015).
Article CAS PubMed PubMed Central Google Scholar
Chuang, H. Y., Lee, E., Liu, Y. T., Lee, D. & Ideker, T. Network-based classification of breast cancer metastasis. Mol. Syst. Biol. 3, 140 (2007).
Article PubMed PubMed Central Google Scholar
Curtis, C. et al. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nat. 486, 346–352, https://doi.org/10.1038/nature10983 (2012).
Article CAS Google Scholar
Prat, A., Parker, J., Fan, C. & Perou, C. PAM50 assay and the three-gene model for identifying the major and clinically relevant molecular subtypes of breast cancer. Breast cancer Res. Treat. 135, 301–306 (2012).
Article CAS PubMed PubMed Central Google Scholar
Haibe-Kains, B. et al. A three-gene model to robustly identify breast cancer molecular subtypes. J. Natl Cancer Inst. 104, 311–325 (2012).
Article CAS PubMed PubMed Central Google Scholar
Yoshihara, K. et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat. Commun. 4, 2612, https://doi.org/10.1038/ncomms3612 (2013).
Article ADS CAS PubMed Google Scholar
Şenbabaoğlu, Y. et al. Tumor immune microenvironment characterization in clear cell renal cell carcinoma identifies prognostic and immunotherapeutically relevant messenger RNA signatures. Genome Biol. 17, 231, https://doi.org/10.1186/s13059-016-1092-z (2016).
Article CAS PubMed PubMed Central Google Scholar
Rooney Michael, S., Shukla Sachet, A., Wu Catherine, J., Getz, G. & Hacohen, N. Molecular and genetic properties of tumors associated with local immune cytolytic activity. Cell. 160, 48–61, https://doi.org/10.1016/j.cell.2014.12.033 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ikeda, Y. et al. Clinical significance of T cell clonality and expression levels of immune-related genes in endometrial cancer. Oncol. Rep. 37, 2603–2610 (2017).
Article CAS PubMed PubMed Central Google Scholar
Yang, L. et al. Clinical significance of the immune microenvironment in ovarian cancer patients. Mol. Omics 14, 341–351, https://doi.org/10.1039/c8mo00128f (2018).
Article CAS PubMed Google Scholar
Stanton, S. E. & Disis, M. L. Clinical significance of tumor-infiltrating lymphocytes in breast cancer. J. Immunother. Cancer 4, 59 (2016).
Article PubMed PubMed Central Google Scholar
Sato, E. et al. Intraepithelial CD8+ tumor-infiltrating lymphocytes and a high CD8+/regulatory T cell ratio are associated with favorable prognosis in ovarian cancer. Proc. Natl Acad. Sci. USA 102, 18538–18543 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Gooden, M. J., de Bock, G. H., Leffers, N., Daemen, T. & Nijman, H. W. The prognostic influence of tumour-infiltrating lymphocytes in cancer: a systematic review with meta-analysis. Br. J. Cancer 105, 93 (2011).
Article CAS PubMed PubMed Central Google Scholar
Tomšová, M., Melichar, B., Sedláková, I. & Šteiner, I. Prognostic significance of CD3+ tumor-infiltrating lymphocytes in ovarian carcinoma. Gynecol. Oncol. 108, 415–420 (2008).
Article PubMed Google Scholar
Nguyen, N. et al. Tumor infiltrating lymphocytes and survival in patients with head and neck squamous cell carcinoma. Head. neck 38, 1074–1084 (2016).
Article PubMed PubMed Central Google Scholar
Santoiemma, P. P. & Powell, D. J. Jr. Tumor infiltrating lymphocytes in ovarian cancer. Cancer Biol. Ther. 16, 807–820, https://doi.org/10.1080/15384047.2015.1040960 (2015).
Article CAS PubMed PubMed Central Google Scholar
Deschoolmeester, V. et al. Tumor infiltrating lymphocytes: an intriguing player in the survival of colorectal cancer patients. BMC Immunol. 11, 19 (2010).
Article PubMed PubMed Central Google Scholar
Dadmarz, R. D. et al. Tumor-infiltrating lymphocytes from human ovarian cancer patients recognize autologous tumor in an MHC class II-restricted fashion. Cancer J. Sci. Am. 2, 263–272 (1996).
CAS PubMed Google Scholar
Ward, M. et al. Tumour-infiltrating lymphocytes predict for outcome in HPV-positive oropharyngeal cancer. Br. J. Cancer 110, 489 (2014).
Article CAS PubMed Google Scholar
Rhee, J. K. et al. Impact of tumor purity on immune gene expression and clustering analyses across multiple cancer types. Cancer Immunol. Res. 6, 87–97, https://doi.org/10.1158/2326-6066.CIR-17-0201 (2018).
Article CAS PubMed Google Scholar
Tibshirani, R., Hastie, T., Narasimhan, B. & Chu, G. Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc. Natl Acad. Sci. 99, 6567, https://doi.org/10.1073/pnas.082099299 (2002).
Article ADS CAS PubMed PubMed Central Google Scholar
Tomlins, S. A. et al. Characterization of 1577 primary prostate cancers reveals novel biological and clinicopathologic insights into molecular subtypes. Eur. Urol. 68, 555–567, https://doi.org/10.1016/j.eururo.2015.04.033 (2015).
Article PubMed PubMed Central Google Scholar
Markert, E. K., Mizuno, H., Vazquez, A. & Levine, A. J. Molecular classification of prostate cancer using curated expression signatures. Proc. Natl. Acad. Sci. USA 108, 21276–21281 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Golub, T. R. et al. Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring. Science 286, https://doi.org/10.1126/science.286.5439.531 (1999).
Article CAS PubMed Google Scholar
Sboner, A. et al. Molecular sampling of prostate cancer: a dilemma for predicting disease progression. BMC Med. Genomics 3, 8, https://doi.org/10.1186/1755-8794-3-8 (2010).
Article CAS PubMed PubMed Central Google Scholar
Marisa, L. et al. Gene Expression Classification of Colon Cancer into Molecular Subtypes: Characterization, Validation, and Prognostic Value. PLOS Med. 10, e1001453, https://doi.org/10.1371/journal.pmed.1001453 (2013).
Article CAS PubMed PubMed Central Google Scholar
Frank, E., Hall, M., Trigg, L., Holmes, G. & Witten, I. H. Data mining in bioinformatics using Weka. Bioinforma. 20, 2479–2481, https://doi.org/10.1093/bioinformatics/bth261 (2004).
Article CAS Google Scholar
Peng, H. C., Long, F. H. & Ding, C. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 27, 1226–1238 (2005).
Article PubMed Google Scholar
Ding, C. & Peng, H. C. Minimum redundancy feature selection from microarray gene expression data. J. Bioinf Comput. Biol. 3, 185–205 (2005).
Article CAS Google Scholar
Zou, Q., Zeng, J., Cao, L. & Ji, R. A novel features ranking metric with application to scalable visual and bioinformatics data classification. Neurocomputing 173, 346–354, https://doi.org/10.1016/j.neucom.2014.12.123 (2016).
Article Google Scholar
Zou, Q., Wan, S., Ju, Y., Tang, J. & Zeng, X. Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy. BMC Syst. Biol. 10, 114, https://doi.org/10.1186/s12918-016-0353-5 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bhattacharya, S. et al. ImmPort: disseminating data to the public for the future of immunology. Immunologic Res. 58, 234–239, https://doi.org/10.1007/s12026-014-8516-1 (2014).
Article CAS Google Scholar
Barbie, D. A. et al. Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1. Nat. 462, 108, https://doi.org/10.1038/nature08460 (2009).
Article ADS CAS Google Scholar
Hänzelmann, S., Castelo, R. & Guinney, J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinforma. 14, 7, https://doi.org/10.1186/1471-2105-14-7 (2013).
Article Google Scholar
Yu, G. C., Wang, L. G., Han, Y. Y. & He, Q. Y. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics 16, 284–287, https://doi.org/10.1089/omi.2011.0118 (2012).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (No. 18KJB520046), the Qing Lan Project of Jiangsu Province (Year 2016), the Scientific Research Foundation of Wuxi City College of Vocational Technology, the 333 High-level Talents Training Project of Jiangsu Province (No. BRA2018147).

Author information

Authors and Affiliations

School of Internet of Things Engineering, Wuxi City College of Vocational Technology, Wuxi, 214153, China
Juan Mei, Ji Zhao & Yi Fu

Authors

Juan Mei
View author publications
You can also search for this author in PubMed Google Scholar
Ji Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Yi Fu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.M. conceived and designed the experiments. J.M. and Y.F. performed the experiments. J.Z. analyzed the data. J.M. contributed materials/analysis tools. J.M. wrote the paper.

Corresponding author

Correspondence to Juan Mei.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mei, J., Zhao, J. & Fu, Y. Molecular classification of breast cancer using the mRNA expression profiles of immune-related genes. Sci Rep 10, 4800 (2020). https://doi.org/10.1038/s41598-020-61710-y

Download citation

Received: 19 December 2019
Accepted: 02 March 2020
Published: 16 March 2020
DOI: https://doi.org/10.1038/s41598-020-61710-y

This article is cited by

Identification of pyroptosis related subtypes and tumor microenvironment infiltration characteristics in breast cancer
- Guo Huang
- Jun Zhou
- Guowen Liu
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.