Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Oncogenomics
  • Published:

A gene-expression signature to predict survival in breast cancer across independent data sets

Abstract

Prognostic signatures in breast cancer derived from microarray expression profiling have been reported by two independent groups. These signatures, however, have not been validated in external studies, making clinical application problematic. We performed microarray expression profiling of 135 early-stage tumors, from a cohort representative of the demographics of breast cancer. Using a recently proposed semisupervised method, we identified a prognostic signature of 70 genes that significantly correlated with survival (hazard ratio (HR): 5.97, 95% confidence interval: 3.0–11.9, P=2.7e−07). In multivariate analysis, the signature performed independently of other standard prognostic classifiers such as the Nottingham Prognostic Index and the ‘Adjuvant!’ software. Using two different prognostic classification schemes and measures, nearest centroid (HR) and risk ordering (D-index), the 70-gene classifier was also found to be prognostic in two independent external data sets. Overall, the 70-gene set was prognostic in our study and the two external studies which collectively include 715 patients. In contrast, we found that the two previously described prognostic gene sets performed less optimally in external validation. Finally, a common prognostic module of 29 genes that associated with survival in both our cohort and the two external data sets was identified. In spite of these results, further studies that profile larger cohorts using a single microarray platform, will be needed before prospective clinical use of molecular classifiers can be contemplated.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1
Figure 2

Similar content being viewed by others

References

  • Bair E, Tibshirani R . (2004). Semi-supervised methods to predict patient survival from gene expression data. PLoS Biology 2: 503–511.

    Article  Google Scholar 

  • Brenton JD, Carey LA, Ahmed AA, Caldas C . (2005). Molecular classification and molecular forecasting of breast cancer: ready for clinical application? J Clin Oncol 23: 7350–7360.

    Article  CAS  Google Scholar 

  • Chang HY, Nuyten DSA, Sneddon JB, Hastie T, Tibshirani R, Sorlie T et al. (2005). Robustness, scalability, and integration of a wound-response gene expression signature in predicting breast cancer survival. Proc Natl Acad Sci USA 102: 3738–3743.

    Article  CAS  Google Scholar 

  • Cox DR, Oakes D . (1984). Analysis of Survival Data. Chopman and Hall: London.

    Google Scholar 

  • Dai H, van‘t Veer L, Lamb J, He YD, Mao M, Fine BM et al. (2005). A cell proliferation signature is a marker of extremely poor outcome in a subpopulation of breast cancer patients. Cancer Res 65: 4059–4066.

    Article  CAS  Google Scholar 

  • Degot S, Le Hir H, Alpy F, Kedinger V, Stoll I, Wendling C et al. (2004). Association of the breast cancer protein MLN51 with the exon junction complex via its speckle localizer and RNA binding module. J Biol Chem 279: 33702–33715.

    Article  CAS  Google Scholar 

  • Eden P, Ritz C, Rose C, Ferno M, Peterson C . (2004). ‘Good Old’ clinical markers have similar power in breast cancer prognosis as microarray gene expression profilers. Eur J Cancer 40: 1837–1841.

    Article  CAS  Google Scholar 

  • Ein-Dor L, Kela I, Getz G, Givol D, Eytan D . (2005). Outcome signature genes in breast cancer: is there a unique set? Bioinformatics 21: 171–178.

    Article  CAS  Google Scholar 

  • Galea MH, Blamey RW, Elston CE, Ellis IO . (1992). The Nottingham Prognostic Index in primary breast cancer. Breast Cancer Res Treat 22: 207–219.

    Article  CAS  Google Scholar 

  • Irizarry RA, Warren D, Spencer F, Kim IF, Biswal S, Frank BC et al. (2005). Multiple-laboratory comparison of microarray platforms. Nat Methods 2: 345–350.

    Article  CAS  Google Scholar 

  • Michiels S, Koscielny S, Catherine H . (2005). Prediction of cancer outcome with microarrays: a multiple random validation strategy. Lancet 365: 488–492.

    Article  CAS  Google Scholar 

  • Naderi A, Ahmed AA, Barbosa-Morais NL, Aparicio S, Brenton JD, Caldas C . (2004). Expression microarray reproducibility is improved by optimising purification steps in RNA amplification and labelling. BMC Genomics 5: 9.

    Article  Google Scholar 

  • Naderi A, Ahmed AA, Wang Y, Brenton JD, Caldas C . (2005). Optimal amounts of fluorescent dye improve expression results in tumor specimens. Mol Biotechnol 30: 151–154.

    Article  CAS  Google Scholar 

  • Olivotto IA, Bajdik CD, Ravdin CD, Speers CH, Coldman AJ, Norris BD et al. (2005). Population-based validation of the prognostic model ADJUVANT! for early breast cancer. J Clin Oncol 23: 2716–2725.

    Article  Google Scholar 

  • Paik S, Shak S, Tang G, Kim F, Baker J, Cronin M et al. (2004). A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N Engl J Med 351: 2817–2826.

    Article  CAS  Google Scholar 

  • Pawitan Y, Bjohle J, Amler L, Borg AL, Egyhazi S, Hall P et al. (2005). Gene expression profiling spares early breast cancer patients from adjuvant therapy: derived and validated in two population-based cohorts. Breast Cancer Res 7: R953–R964.

    Article  CAS  Google Scholar 

  • Royston P, Sauerbrei W . (2004). A new approach to modelling interactions between treatment and continuous covariates in clinical trials by using fractional polynomials. Stat Med 23: 723–748.

    Article  Google Scholar 

  • Shen R, Ghosh D, Chinnaiyan AM . (2004). Prognostic meta-signature of breast cancer developed by two-stage mixture modeling of microarray data. BMC Genomics 5: 94.

    Article  Google Scholar 

  • Sollich P, Krogh A . (1996). Learning with ensembles: how over-fitting can be useful. In: Touretzky DS, Mozer MC, Hasselmo ME (eds). Advances in Neural Information Processing Systems. MTT press: Cambridge, MA, vol. 8. pp 190–196.

    Google Scholar 

  • Sotiriou C, Neo SY, McShane LM, Korn EL, Long PM, Jazaeri A et al. (2003). Breast cancer classification and prognosis based on gene expression profiles from a population-based study. Proc Natl Acad Sci USA 100: 10393–10398.

    Article  CAS  Google Scholar 

  • Tan PK, Downey TJ, Spitzangel ELJ, Xu P, Fu D, Dimitrov DS et al. (2003). Evaluation of gene expression measurements from commercial microarray platforms. Nucleic Acids Res 31: 5676–5684.

    Article  CAS  Google Scholar 

  • Tibshirani R, Hastie T, Narasimhan B, Chu G . (2002). Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc Natl Acad Sci USA 99: 6567–6572.

    Article  CAS  Google Scholar 

  • van de Vijver MJ, He YD, van‘t Veer L, Dai H, Hart AAM, Voskuil DW et al. (2002). A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med 347: 1999–2009.

    Article  CAS  Google Scholar 

  • van‘t Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AA, Mao M et al. (2002). Gene expression profiling predicts clinical outcome of breast cancer. Nature 415: 530–536.

    Article  Google Scholar 

  • Wang Y, Klijn JG, Zhang Y, Sieuwerts AM, Look MP, Yang F et al. (2005). Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet 365: 671–679.

    Article  CAS  Google Scholar 

Download references

Acknowledgements

Research in the Cancer Genomics Program is funded by grants from Cancer Research UK, Cambridge MIT Institute and Isaac Newton Trust. JDB is a CR-UK Senior Clinical Research Fellow. We thank Dr Patrick Royston for advice on survival analysis and Ms Claire Paish, Nottingham City Hospital for collecting the tissue samples. NLB-M is the recipient of a Praxis XXI doctoral fellowship from FCT, Ministry of Science, Portugal.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to J D Brenton or C Caldas.

Additional information

Supplementary Information accompanies the paper on the Oncogene website (http://www.nature.com/onc).

Supplementary information

Rights and permissions

Reprints and permissions

About this article

Cite this article

Naderi, A., Teschendorff, A., Barbosa-Morais, N. et al. A gene-expression signature to predict survival in breast cancer across independent data sets. Oncogene 26, 1507–1516 (2007). https://doi.org/10.1038/sj.onc.1209920

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/sj.onc.1209920

Keywords

This article is cited by

Search

Quick links