Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Short Communication
  • Published:

Predicting the site of origin of tumors by a gene expression signature derived from normal tissues

Abstract

Multiple expression signatures for the prediction of the site of origin of metastatic cancer of unknown primary origin (CUP) have been developed. Owing to their limited coverage of tumor types and suboptimal prediction accuracy on distinct tumors, there is still room for alternative CUP gene expression signatures. Whereas in past studies, CUP classifiers were trained solely on data from tumor samples, we now use expression patterns from normal tissues for classifier training. This approach potentially avoids pitfalls related to the representation of genetically heterogeneous tumor subtypes during classifier training. Two expression data sets of normal human tissues have been reanalyzed to derive an expression signature for liver, prostate, kidney, ovarian and lung tissues. In reciprocal validation, classifiers trained on either data set achieved overall accuracies greater than 97%. Classifiers trained on combined expression data from both normal tissue data sets were able to predict the site of origin in a cohort of 652 primary tumors with 90% accuracy. Prediction accuracies of primary cancer-based classifiers were in the same range, as determined by cross-validation on this cohort. For individual tumor types, normal tissue-based classifiers achieved sensitivities in the range of 64–99% and specificities in the range of 92–100%. Primary origins for 12 of 20 metastases were predicted correctly, with false predictions highlighting the need for accurate sample preparation to avoid contaminations by metastases-surrounding tissue. We conclude that gene expression patterns of normal tissues harbor phenotypic information that is retained in tumors and can be sufficient to recover the type of primary tumor from expression patterns alone.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1
Figure 2

Similar content being viewed by others

References

  • Abbruzzese JL, Abbruzzese MC, Hess KR, Raber MN, Lenzi R, Frost P . (1994). Unknown primary carcinoma: natural history and prognostic factors in 657 consecutive patients. J Clin Oncol 12: 1272–1280.

    Article  CAS  PubMed  Google Scholar 

  • Bloom G, Yang IV, Boulware D, Kwong KY, Coppola D, Eschrich S et al. (2004). Multi-platform, multi-site, microarray-based human tumor classification. Am J Pathol 164: 9–16.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Bridgewater J, van LR, Floore A, Van'T VL . (2008). Gene expression profiling may improve diagnosis in patients with carcinoma of unknown primary. Br J Cancer 98: 1425–1430.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Canales RD, Luo Y, Willey JC, Austermiller B, Barbacioru CC, Boysen C et al. (2006). Evaluation of DNA microarray results with quantitative gene expression platforms. Nat Biotechnol 24: 1115–1122.

    Article  CAS  PubMed  Google Scholar 

  • Dennis JL, Hvidsten TR, Wit EC, Komorowski J, Bell AK, Downie I et al. (2005). Markers of adenocarcinoma characteristic of the site of origin: development of a diagnostic algorithm. Clin Cancer Res 11: 3766–3772.

    Article  CAS  PubMed  Google Scholar 

  • Dennis JL, Vass JK, Wit EC, Keith WN, Oien KA . (2002). Identification from public data of molecular markers of adenocarcinoma characteristic of the site of origin. Cancer Res 62: 5999–6005.

    CAS  PubMed  Google Scholar 

  • Dumur CI, Lyons-Weiler M, Sciulli C, Garrett CT, Schrijver I, Holley TK et al. (2008). Interlaboratory performance of a microarray-based gene expression test to determine tissue of origin in poorly differentiated and undifferentiated cancers. J Mol Diagn 10: 67–77.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Fellenberg K, Hauser NC, Brors B, Neutzner A, Hoheisel JD, Vingron M . (2001). Correspondence analysis applied to microarray data. Proc Natl Acad Sci USA 98: 10781–10786.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Horlings HM, van Laar RK, Kerst JM, Helgason HH, Wesseling J, van der Hoeven JJ et al. (2008). Gene expression profiling to identify the histogenetic origin of metastatic adenocarcinomas of unknown primary. J Clin Oncol 26: 4435–4441.

    Article  CAS  PubMed  Google Scholar 

  • Ma XJ, Patel R, Wang X, Salunga R, Murage J, Desai R et al. (2006). Molecular classification of human cancers using a 92-gene real-time quantitative polymerase chain reaction assay. Arch Pathol Lab Med 130: 465–473.

    CAS  PubMed  Google Scholar 

  • Monzon FA, Lyons-Weiler M, Buturovic LJ, Rigl CT, Henner WD, Sciulli C et al. (2009). Multicenter validation of a 1550-gene expression profile for identification of tumor tissue of origin. J Clin Oncol 27: 2503–2508.

    Article  PubMed  Google Scholar 

  • Oien KA, Evans TR . (2008). Raising the profile of cancer of unknown primary. J Clin Oncol 26: 4373–4375.

    Article  CAS  PubMed  Google Scholar 

  • Pentheroudakis G, Golfinopoulos V, Pavlidis N . (2007). Switching benchmarks in cancer of unknown primary: from autopsy to microarray. Eur J Cancer 43: 2026–2036.

    Article  PubMed  Google Scholar 

  • Ramaswamy S, Tamayo P, Rifkin R, Mukherjee S, Yeang CH, Angelo M et al. (2001). Multiclass cancer diagnosis using tumor gene expression signatures. Proc Natl Acad Sci USA 98: 15149–15154.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Roth RB, Hevezi P, Lee J, Willhite D, Lechner SM, Foster AC et al. (2006). Gene expression analyses reveal molecular relationships among 20 regions of the human CNS. Neurogenetics 7: 67–80.

    Article  CAS  PubMed  Google Scholar 

  • Sorlie T, Tibshirani R, Parker J, Hastie T, Marron JS, Nobel A et al. (2003). Repeated observation of breast tumor subtypes in independent gene expression data sets. Proc Natl Acad Sci USA 100: 8418–8423.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Su AI, Cooke MP, Ching KA, Hakak Y, Walker JR, Wiltshire T et al. (2002). Large-scale analysis of the human and mouse transcriptomes. Proc Natl Acad Sci USA 99: 4465–4470.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Su AI, Welsh JB, Sapinoso LM, Kern SG, Dimitrov P, Lapp H et al. (2001). Molecular classification of human carcinomas by use of gene expression signatures. Cancer Res 61: 7388–7393.

    CAS  PubMed  Google Scholar 

  • Talantov D, Baden J, Jatkoe T, Hahn K, Yu J, Rajpurohit Y et al. (2006). A quantitative reverse transcriptase–polymerase chain reaction assay to identify metastatic carcinoma tissue of origin. J Mol Diagn 8: 320–329.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Tothill RW, Kowalczyk A, Rischin D, Bousioutas A, Haviv I, van Laar RK et al. (2005). An expression-based site of origin diagnostic method designed for clinical application to cancer of unknown origin. Cancer Res 65: 4031–4040.

    Article  CAS  PubMed  Google Scholar 

  • van de Wouw AJ, Janssen-Heijnen ML, Coebergh JW, Hillen HF . (2002). Epidemiology of unknown primary tumours; incidence and population-based survival of 1285 patients in Southeast Netherlands, 1984–1992. Eur J Cancer 38: 409–413.

    Article  CAS  PubMed  Google Scholar 

  • van Laar RK, Ma XJ, de JD, Wehkamp D, Floore AN, Warmoes MO et al. (2009). Implementation of a novel microarray-based diagnostic test for cancer of unknown primary. Int J Cancer 125: 1390–1397.

    Article  CAS  PubMed  Google Scholar 

  • Varadhachary GR, Talantov D, Raber MN, Meng C, Hess KR, Jatkoe T et al. (2008). Molecular profiling of carcinoma of unknown primary and correlation with clinical evaluation. J Clin Oncol 26: 4442–4448.

    Article  CAS  PubMed  Google Scholar 

  • Yeang CH, Ramaswamy S, Tamayo P, Mukherjee S, Rifkin RM, Angelo M et al. (2001). Molecular classification of multiple tumor types. Bioinformatics 17 (Suppl 1): S316–S322.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

We thank Anja von Heydebreck for comments on an earlier version of the paper.

Author contributions: ES, HJB and JG conceived the study. ES prepared and analyzed the data and drafted the paper. HJB and JG reviewed and refined the paper. ES, HJB and JG released the final paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to E Staub.

Ethics declarations

Competing interests

The authors declare no conflict of interest.

Additional information

Supplementary Information accompanies the paper on the Oncogene website

Supplementary information

Rights and permissions

Reprints and permissions

About this article

Cite this article

Staub, E., Buhr, HJ. & Gröne, J. Predicting the site of origin of tumors by a gene expression signature derived from normal tissues. Oncogene 29, 4485–4492 (2010). https://doi.org/10.1038/onc.2010.196

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/onc.2010.196

Keywords

This article is cited by

Search

Quick links