Abstract
Refractoriness to induction chemotherapy and relapse after achievement of remission are the main obstacles to cure in acute myeloid leukaemia (AML)1. After standard induction chemotherapy, patients are assigned to different post-remission strategies on the basis of cytogenetic and molecular abnormalities that broadly define adverse, intermediate and favourable risk categories2,3. However, some patients do not respond to induction therapy and another subset will eventually relapse despite the lack of adverse risk factors4. There is an urgent need for better biomarkers to identify these high-risk patients before starting induction chemotherapy, to enable testing of alternative induction strategies in clinical trials5. The high rate of relapse in AML has been attributed to the persistence of leukaemia stem cells (LSCs), which possess a number of stem cell properties, including quiescence, that are linked to therapy resistance6,7,8,9,10. Here, to develop predictive and/or prognostic biomarkers related to stemness, we generated a list of genes that are differentially expressed between 138 LSC+ and 89 LSC− cell fractions from 78 AML patients validated by xenotransplantation. To extract the core transcriptional components of stemness relevant to clinical outcomes, we performed sparse regression analysis of LSC gene expression against survival in a large training cohort, generating a 17-gene LSC score (LSC17). The LSC17 score was highly prognostic in five independent cohorts comprising patients of diverse AML subtypes (n = 908) and contributed greatly to accurate prediction of initial therapy resistance. Patients with high LSC17 scores had poor outcomes with current treatments including allogeneic stem cell transplantation. The LSC17 score provides clinicians with a rapid and powerful tool to identify AML patients who do not benefit from standard therapy and who should be enrolled in trials evaluating novel upfront or post-remission strategies.
This is a preview of subscription content, access via your institution
Relevant articles
Open Access articles citing this article.
-
Single-cell chromatin accessibility profiling of acute myeloid leukemia reveals heterogeneous lineage composition upon therapy-resistance
Communications Biology Open Access 21 July 2023
-
Removing the association of random gene sets and survival time in cancers with positive random bias using fixed-point gene set
Scientific Reports Open Access 29 May 2023
-
Integrative molecular subtypes of acute myeloid leukemia
Blood Cancer Journal Open Access 08 May 2023
Access options
Subscribe to this journal
Receive 51 print issues and online access
$199.00 per year
only $3.90 per issue
Rent or buy this article
Prices vary by article type
from$1.95
to$39.95
Prices may be subject to local taxes which are calculated during checkout




Accession codes
References
Ferrara, F. & Schiffer, C. A. Acute myeloid leukaemia in adults. Lancet 381, 484–495 (2013)
Grimwade, D. et al. Refinement of cytogenetic classification in acute myeloid leukemia: determination of prognostic significance of rare recurring chromosomal abnormalities among 5876 younger adult patients treated in the United Kingdom Medical Research Council trials. Blood 116, 354–365 (2010)
Döhner, H. et al. Diagnosis and management of acute myeloid leukemia in adults: recommendations from an international expert panel, on behalf of the European LeukemiaNet. Blood 115, 453–474 (2010)
Röllig, C. et al. Long-term prognosis of acute myeloid leukemia according to the new genetic risk classification of the European LeukemiaNet recommendations: evaluation of the proposed reporting system. J. Clin. Oncol. 29, 2758–2765 (2011)
Walter, R. B. et al. Resistance prediction in AML: analysis of 4601 patients from MRC/NCRI, HOVON/SAKK, SWOG and MD Anderson Cancer Center. Leukemia 29, 312–320 (2015)
Kreso, A. & Dick, J. E. Evolution of the cancer stem cell model. Cell Stem Cell 14, 275–291 (2014)
Saito, Y. et al. Identification of therapeutic targets for quiescent, chemotherapy-resistant human leukemia stem cells. Sci. Transl. Med. 2, 17ra9 (2010)
Li, L. et al. SIRT1 activation by a c-MYC oncogenic network promotes the maintenance and drug resistance of human FLT3-ITD acute myeloid leukemia stem cells. Cell Stem Cell 15, 431–446 (2014)
Fong, C. Y. et al. BET inhibitor resistance emerges from leukaemia stem cells. Nature 525, 538–542 (2015)
Lechman, E. R. et al. miR-126 regulates distinct self-renewal outcomes in normal and malignant hematopoietic stem cells. Cancer Cell 29, 214–228 (2016)
Eppert, K. et al. Stem cell gene expression programs influence clinical outcome in human leukemia. Nature Med. 17, 1086–1093 (2011)
Sarry, J. E. et al. Human acute myelogenous leukemia stem cells are rare and heterogeneous when assayed in NOD/SCID/IL2Rγc-deficient mice. J. Clin. Invest. 121, 384–395 (2011)
Laurenti, E. et al. The transcriptional architecture of early human hematopoiesis identifies multilevel control of lymphoid commitment. Nature Immunol. 14, 756–763 (2013)
Novershtern, N. et al. Densely interconnected transcriptional circuits control cell states in human hematopoiesis. Cell 144, 296–309 (2011)
Verhaak, R. G. et al. Prediction of molecular subtypes in acute myeloid leukemia based on gene expression profiling. Haematologica 94, 131–134 (2009)
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010)
Simon, N., Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for Cox’s proportional hazards model via coordinate descent. J. Stat. Softw. 39, 1–13 (2011)
Cancer Genome Atlas Research Network. Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. N. Engl. J. Med. 368, 2059–2074 (2013)
Metzeler, K. H. et al. An 86-probe-set gene-expression signature predicts survival in cytogenetically normal acute myeloid leukemia. Blood 112, 4193–4201 (2008)
Grimwade, D., Ivey, A. & Huntly, B. J. Molecular landscape of acute myeloid leukemia in younger adults and its clinical relevance. Blood 127, 29–41 (2016)
Papaemmanuil, E. et al. Genomic classification and prognosis in acute myeloid leukemia. N. Engl. J. Med. 374, 2209–2221 (2016)
Levine, J. H. et al. Data-driven phenotypic dissection of AML reveals progenitor-like cells that correlate with prognosis. Cell 162, 184–197 (2015)
Gentles, A. J., Plevritis, S. K., Majeti, R. & Alizadeh, A. A. Association of a leukemic stem cell gene expression signature with clinical outcomes in acute myeloid leukemia. J. Am. Med. Assoc. 304, 2706–2715 (2010)
Jung, N., Dai, B., Gentles, A. J., Majeti, R. & Feinberg, A. P. An LSC epigenetic signature is largely mutation independent and implicates the HOXA cluster in AML pathogenesis. Nature Commun. 6, 8489 (2015)
Geiss, G. K. et al. Direct multiplexed measurement of gene expression with color-coded probe pairs. Nature Biotechnol. 26, 317–325 (2008)
Cornelissen, J. J. et al. The European LeukemiaNet AML Working Party consensus statement on allogeneic HSCT for patients with AML in remission: an integrated-risk adapted approach. Nature Rev. Clin. Oncol. 9, 579–590 (2012)
Kohlmann, A. et al. Gene expression profiling in AML with normal karyotype can predict mutations for molecular markers and allows novel insights into perturbed biological pathways. Leukemia 24, 1216–1220 (2010)
Castaigne, S. et al. Effect of gemtuzumab ozogamicin on survival of adult patients with de-novo acute myeloid leukaemia (ALFA-0701): a randomised, open-label, phase 3 study. Lancet 379, 1508–1516 (2012)
Hills, R. K. et al. Addition of gemtuzumab ozogamicin to induction chemotherapy in adult patients with acute myeloid leukaemia: a meta-analysis of individual patient data from randomised controlled trials. Lancet Oncol. 15, 986–996 (2014)
Klco, J. M. et al. Association between mutation clearance after induction therapy and outcomes in acute myeloid leukemia. J. Am. Med. Assoc. 314, 811–822 (2015)
Du, P., Kibbe, W. A. & Lin, S. M. lumi: a pipeline for processing Illumina microarray. Bioinformatics 24, 1547–1548 (2008)
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015)
Qiao, W. et al. PERT: a method for expression deconvolution of human blood samples from varied microenvironmental and developmental conditions. PLOS Comput. Biol. 8, e1002838 (2012)
Gautier, L., Cope, L., Bolstad, B. M. & Irizarry, R. A. affy—analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 20, 307–315 (2004)
Wu, J., Irizarry, R., MacDonald, J. & Gentry, J. Gcrma: background adjustment using sequence information. R package version 2.36.0 (2016)
Dai, M. et al. Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data. Nucleic Acids Res. 33, e175 (2005)
Macrae, T. et al. RNA-seq reveals spliceosome and proteasome genes as most consistent transcripts in human cancer cells. PLoS ONE 8, e72884 (2013)
Scott, D. W. et al. Determining cell-of-origin subtypes of diffuse large B-cell lymphoma using gene expression in formalin-fixed paraffin-embedded tissue. Blood 123, 1214–1217 (2014)
Nielsen, T. et al. Analytical validation of the PAM50-based Prosigna Breast Cancer Prognostic Gene Signature Assay and nCounter Analysis System using formalin-fixed paraffin-embedded breast tumor specimens. BMC Cancer 14, 177 (2014)
R Development Core Team. A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2014)
Cheson, B. D. et al. Revised recommendations of the International Working Group for Diagnosis, Standardization of Response Criteria, Treatment Outcomes, and Reporting Standards for Therapeutic Trials in Acute Myeloid Leukemia. J. Clin. Oncol. 21, 4642–4649 (2003)
Gray, R. J. A class of K-sample tests for comparing the cumulative incidence of a competing risk. Ann. Stat. 16, 1141–1154 (1988)
Fine, J. P. & Gray, R. J. A proportional hazards model for the subdistribution of a competing risk. J. Am. Stat. Assoc. 94, 496–509 (1999)
Gray, B. cmprsk: subdistribution analysis of competing risks. R package version 2.2-7 (2014)
Gerds, T. A. & Scheike, T. H . riskRegression: risk regression for survival analysis. R package version 0.0.8 (2016)
Kanda, Y. Investigation of the freely available easy-to-use software ‘EZR’ for medical statistics. Bone Marrow Transplant. 48, 452–458 (2013)
Mantel, N. & Byar, D. Evaluation of response-time data involving transient states: an illustration using heart transplant data. J. Am. Stat. Assoc. 69, 81–86 (1974)
Andersen, P. & Gill, R. D. Cox’s regression model for counting processes: a large sample study. Ann. Stat. 10, 1100–1120 (1982)
Simon, R. & Makuch, R. W. A non-parametric graphical representation of the relationship between survival and the occurrence of an event: application to responder versus non-responder bias. Stat. Med. 3, 35–44 (1984)
Therneau, T. M. & Grambsch, P. M. Modeling Survival Data: Extending the Cox Model (Springer, 2000)
Harrell, F. E. Jr. rms: regression modeling strategies. R package version 4.4-1 (2016)
Robin, X. et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 12, 77 (2011)
Kundu, S., Aulchenko, Y. S., van Duijn, C. M. & Janssens, A. C. PredictABEL: an R psackage for the assessment of risk prediction models. Eur. J. Epidemiol. 26, 261–264 (2011)
Acknowledgements
This work was supported by grants from the Ontario Institute for Cancer Research with funds from the province of Ontario, the Cancer Stem Cell Consortium with funding from the Government of Canada through Genome Canada and the Ontario Genomics Institute (OGI-047), and the Canadian Institutes of Health Research (CSC-105367), Canadian Cancer Society, Terry Fox Foundation, a Canada Research Chair to J.E.D., the Philip S. Orsino Chair in Leukemia Research to M.D.M., and a Collaborative Translational Cancer Research Grant from the Princess Margaret Cancer Centre (formerly Ontario Cancer Institute). This research was funded in part by the Leukemia & Lymphoma Society of Canada (493946) and the Stem Cell Network (492019), Ontario Graduate Scholarships, and the Ontario Ministry of Health and Long Term Care (OMOHLTC). The views expressed do not necessarily reflect those of the OMOHLTC. L.B. was supported in part by the Deutsche Forschungsgemeinschaft (Heisenberg-Professur BU 1339/8-1). T.H. was supported by the Wilhelm-Sander-Stiftung (grant 2013.086.1). K.M. and W.H. received grant support from Deutsche Forschungsgemeinschaft (DFG SFB 1243). We thank The Centre for Applied Genomics (Hospital for Sick Children) and the Princess Margaret Genomics Centre for the generation of GE data for the PM sorted cell fractions and validation cohort. We thank M. Pintilie for discussions regarding time-dependent covariates in survival analysis. We thank S. Geffroy for technical support and running the microarrays for the ALFA-0701 trial cohort.
Author information
Authors and Affiliations
Contributions
S.W.K.N. developed the signature derivation workflow, identified, refined and validated prognostic and predictive signatures, designed the custom NanoString assay, processed and analysed GE data, and performed statistical analyses and bioinformatics. A.M., W.C.C., J.M. and A.P. carried out functional xenograft transplantation, RNA extraction for GE analysis, and provided technical support for experiments. J.A.K., N.I., A.A., V.G., A.D.S., A.C.S., K.W.Y. and M.D.M. provided clinical annotations for the PM AML cohort. M.D.M. provided PM AML samples. S.W.K.N., J.C.Y.W., J.E.D. and M.D.M. interpreted the data. W.H., W.E.B., B.W., T.B., D.G., L.B., K.M., T.H. and C.B. provided clinical annotations for the GSE15434 and GSE12417 data sets. M.C., C.P. and H.D. provided GE and clinical data for the ALFA-0701 trial cohort. P.J.M.V. and B.L. provided clinical annotations for the GSE6891 data set. J.C.Y.W. and J.E.D. supervised the study. S.W.K.N. and J.C.Y.W. wrote the paper. A.M., J.A.K., P.W.Z., J.E.D. and M.D.M. revised the paper.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Additional information
Reviewer Information Nature thanks F. Holstege, G. Schuurhuis and the other anonymous reviewer(s) for their contribution to the peer review of this work.
Extended data figures and tables
Extended Data Figure 1 Overview of LSC signature training and testing.
a, Clinical characteristics of the 78 patients analysed by xenotransplantation and microarray GE analysis. CMML, chronic myelomonocytic leukaemia; t-AML, therapy-associated AML; CN, cytogenetically normal. b, Schematic of the experimental protocol. c, d, Summary of functionally defined LSC+ and LSC− fractions in each phenotypic cell population as a whole (c) and for each patient (d). Red and blue denote LSC+ and LSC−, respectively. In d, each row represents fractions sorted from one patient sample. White boxes denote fractions that were not included in the analysis due to insufficient cell numbers for xenotransplantation and/or insufficient RNA. e, Strategy used to identify and test the 17 LSC signature genes. f, Key clinical characteristics of the GSE6891 signature training cohort. *P value calculated using the Wilcoxon rank-sum test; †P value calculated using the Student’s t-test; ‡P value calculated using Pearson’s chi-squared test; §P value calculated using log-rank test; ||P value calculated using Fisher’s exact test; ¶cytogenetic risk groups were defined as per GSE6891 investigators15.
Extended Data Figure 2 LSC17 and LSC3 scores are associated with survival in multiple AML cohorts.
a–n, q, Kaplan–Meier estimates of OS, EFS or RFS according to LSC17 scores in various patient cohorts, as indicated. In c, patients were also analysed according to whether or not CR was achieved after initial treatment (no CR, dotted lines; CR, solid lines). i, The subset of patients in the TCGA AML cohort with no clear genomic classification as defined previously21. o, Simon and Makuch estimates of OS, according to LSC17 scores and whether or not patients received aSCT (no aSCT, dotted lines; aSCT, solid lines). p, Kaplan–Meier estimates of OS of CN-LMR patients, according to LSC3 scores. In a–q, patients with scores above and below the median in each cohort are shown by red and blue lines, respectively. r, s, Kaplan–Meier estimates of RFS for patients with high (r) or low (s) LSC17 scores treated with standard chemotherapy with (red lines) or without (blue lines) addition of GO.
Supplementary information
Supplementary Information
This file contains a Supplementary Discussion of the 17 LSC signature genes. (PDF 84 kb)
Rights and permissions
About this article
Cite this article
Ng, S., Mitchell, A., Kennedy, J. et al. A 17-gene stemness score for rapid determination of risk in acute leukaemia. Nature 540, 433–437 (2016). https://doi.org/10.1038/nature20598
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nature20598
This article is cited by
-
FBXO22 promotes leukemogenesis by targeting BACH1 in MLL-rearranged acute myeloid leukemia
Journal of Hematology & Oncology (2023)
-
Expansion of interferon inducible gene pool via USP18 inhibition promotes cancer cell pyroptosis
Nature Communications (2023)
-
Single-cell chromatin accessibility profiling of acute myeloid leukemia reveals heterogeneous lineage composition upon therapy-resistance
Communications Biology (2023)
-
The expression level of Neuronal Calcium Sensor 1 can predict the prognosis of cytogenetically normal AML
The Pharmacogenomics Journal (2023)
-
Longitudinal single-cell profiling of chemotherapy response in acute myeloid leukemia
Nature Communications (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.