Introduction

Pediatric acute myeloid leukemia (pedAML) is a rare hematological disease that accounts for 20% of all pediatric leukemias.1 Cytogenetic risk stratification combined with response-guided therapeutic decisions considerably improved prognostication.2,3 Unfortunately, still 30–40% of the good responders experience relapse.2 During the past decade, ample evidence showed that relapse is associated with a high leukemic stem cell (LSC) load at diagnosis as well as LSC persistence during an apparent state of remission.4,5,6,7,8 However, relapse still occurs in a considerable part of LSClow pedAML patients identified by flow cytometry,7 emphasizing a high need for a more profound molecular and phenotypic characterization of LSC.

Hitherto, most pedAML gene expression profiles (GEPs) were established in bulk leukemic samples,9,10,11,12,13,14,15 not taking into account cellular heterogeneity, and thus fail to identify critical LSC-specific genes and pathways. In adult AML, by contrast, several LSC gene signatures have been developed these past years.16,17,18,19,20,21,22,23 Interestingly, the LSC17 signature by Ng et al.16 also held a prognostic value in pedAML.24,25 Moreover, it was used to develop a pediatric-specific LSC6 score able to identify high-risk (HR) pedAML patients.26 Unfortunately, this LSC signature also contains genes that are expressed in hematopoietic stem cells (HSCs) and lack the inclusion of downregulated targets.27 It was however shown that adding PCD17, an LSC-specific downregulated tumor suppressor gene (TSG), to the LSC17 score improved risk stratification in adult AML.28 Hence, the identification of novel differentially expressed genes (DEGs) in pedAML leukemic subpopulations could aid in providing novel biomarkers for risk stratification, follow-up, and targeted therapy.

Here, we describe LSC and leukemic blast (L-blast) targets in pedAML discovered by microarray profiling followed by quantitative PCR (qPCR) validation following a cancer vs. normal (CvN) approach. We reveal deregulated pathways that have not yet been addressed in children, and aid in a further understanding of the pedAML molecular biology.

Methods

Patients and controls

Bone marrow (BM) and/or peripheral blood (PB) from a total of 28 pedAML patients were selected based on cell availability (>50 × 106 after routine work-up) and CD34 positivity (≥1%). For 21/28 patients, both LSC and L-blast fractions were available, whereas for 3/28 and 4/28 patients only LSC or L-blast, respectively, could be evaluated. Demographics of patients used for LSC (n = 24) and L-blast (n = 25) expression evaluation are shown in Table 1. Details on treatment protocols and outcome definitions are described in Supplementary information. In addition, HSC and control myeloblasts (C-blast) were sorted from 19 and 20 healthy controls, respectively. Pediatric normal BM (NBM, n = 9, 12–18 years) was collected from posterior iliac crest during scoliosis surgery. Cord blood (CB, n = 11) was obtained after full-term delivery. All subjects gave their informed consent for inclusion before participation. The study was conducted in accordance with the Declaration of Helsinki, and the protocol was approved by the Ethics Committee of the University Hospital of Ghent (EC2015-1443 and EC2019-0294).

Table 1 Demographics of the pedAML patient cohort evaluated by qPCR.

Cell sorting

Mononuclear cells were isolated by Ficoll density gradient (Axis-Shield), complemented by CD34 isolation if expression was <50% (CD34 MicroBead Kit, Miltenyi). Cell sorting was performed to isolate CD34+/CD38− and CD34+/CD38+ cells from patients and controls, defined as LSC and HSC, and L-blast and C-blast, respectively. Availability of both PB and BM yielded a total of 35 cell fractions in the LSC cohort (24 patients, 11 PB–BM couples) and in the L-blast cohort (25 patients, 10 PB–BM couples). The LSC compartment of seven patients could additionally be phenotypical distinguished based on differential expression of CD45RA (n = 3), CLL-1 (n = 2), CD123 (n = 1), or GPR56 (n = 1), yielding a total of 42 LSC fractions. Staining and sorting strategy are described in Supplementary information. Monoclonal antibodies are described in Supplementary Table S1. Sorted cells were collected in RPMI supplemented with 50% fetal calf serum, with a minimum post-sort purity >90%, spun down (10 min, 3000 r.p.m., 4 °C), and resuspended in 700 µL TRIzol for RNA extraction and cDNA synthesis as described in Supplementary information.

Microarray downstream analyses

Microarray profiling was performed on LSC and L-blast sorted from 3/24 and 4/25 pedAML patients (Supplementary Table S2), respectively, next to two HSC and three C-blast fractions. Technical details are described in Supplementary information. DEGs were identified based on |log 2 FC| > 2 and adjusted P values (adj. P) < 0.05. Functional networks between protein–protein associations encoded by DEGs were identified by STRING at a high evidence level.29 Only KEGG (Kyoto Encyclopedia of Genes and Genomes) annotated pathways were derived from significant pathway analysis. Gene set enrichment analysis (GSEA) was performed by combining independent omics datasets through pathway enrichment meta-analysis in order to obtain gene set enrichment signatures.30 Unsupervised organization and visualization of enriched gene sets was performed in Cytoscape.31 At least two clustered gene sets based on P < 0.05, false discovery rate (FDR) < 0.25 and Jaccard overlap combined index 0.375 were required for node visualization.

In addition, we re-analyzed the publicly available GSE 17054 microarray dataset,17 containing GEPs of nine LSC from adult AML patients and four HSC from healthy adults.

Real-time quantitative PCR

Ninety-four targets were selected for qPCR validation based on the magnitude and significance of differential expression and feasibility of primer development. Available cell fractions were subdivided into three cohorts based on sample availability, while respecting a balanced distribution of the genetic variations, and subsequently measured by qPCR in three steps. Cell fractions with only limited material available were reserved for the most significant DEGs and evaluated in the third step, hereby avoiding sample paucity to present as an issue. Analytical details of qPCR experiments are described in Supplementary information, primers are shown in Supplementary Table S3. Data analysis were performed according to state-of-the-art methods30 as follows: Ct values were corrected for primer efficiency and expressed as relative quantities. Normalized relative quantities (NRQs) were calculated against the expression of housekeeping genes GAPD, HPRT1, and TBP. To allow inter-run comparison, calibrated NRQ values (CNRQ) were generated by taking into account the expression of an inter-run calibrator. Target-specific cut-offs for overexpression were calculated based on the average expression plus two standard deviations measured in the respective normal counterparts.

Results

In order to identify novel targets in LSC and L-blast, a CvN approach integrating different experimental and analysis techniques was used (Fig. 1). Each set of DEGs was explored in the context of intertwining pathways and databases to define the top deregulated pathways, and GSEA to gain insight into (anti-) correlated biological pathways. Full lists of significant DEGs and (anti-)correlated gene sets identified between LSC vs. HSC and L-blasts vs. C-blasts are shown in Supplementary Tables S4S13. Below, we discuss the top-ranked significant DEGs and (anti-)correlated gene sets for each comparison separately.

Fig. 1: Experimental setup and data processing steps.
figure 1

Step-by-step workflow illustrating the experimental and data processing steps pursued to filter out highly significant DEGs in leukemic subpopulations starting from a microarray profiling dataset. PedAML pediatric acute myeloid leukemia, CB cord blood, FC fold change, P P value, DEGs differentially expressed genes, LSC leukemic stem cell, HSC hematopoietic stem cell, L-blast leukemic blast, C-blast control blast.

Immune dysregulation separates LSC in pedAML from HSC

The expression of 295 targets significantly differed between LSC and HSC, with 83 targets up- and 212 downregulated in LSC. The top 50 ranked DEGs is shown in Fig. 2a. Well-known oncogenes were present among the highest LSC-overexpressed targets (e.g., CFD, ANXA2, NLRP3) next to genes with yet undefined roles in AML (e.g., PLIN2, CRIP1). The top 10 most downregulated genes also contained targets for which no role was yet reported in pedAML (e.g., ATP9A, PLCB4, COL5A1).

Fig. 2: Transcriptional differences and (anti-)correlated gene sets between LSC and HSC.
figure 2

a Visualization of DEGs identified between LSC (n = 3) and HSC (n = 2). Genes are plotted in a volcano plot as log 2 FC values against −log10 adj. P values. Thresholds |log 2 FC| > 2 and −log 10 adj. P < 0.05 are shown as dashed lines. Genes selected as significantly different are highlighted in red. The top 50-ranked downregulated genes (left) and upregulated genes (right), annotated with gene symbols, are sorted by log 2 FC values. b Top 10 most correlated (top) and anti-correlated (bottom) gene sets identified through GSEA. The number of concordantly expressed (CE) genes/total genes and normalized enrichment scores (NESs) is shown for each gene set individually. FC fold change, DEGs differentially expressed genes, LSC leukemic stem cell, HSC hematopoietic stem cell.

Analysis of functional protein associations (STRING) showed upregulated pathways in LSC related to (breast) cancer, osteoclast differentiation, and apoptosis, whereas transcriptional misregulation, and Rap1/MAPK signaling were downregulated (Table 2). Myeloid cell activation networks involved in immune response were enriched in LSC (FDR 4.6e − 6), whereas networks related to stimuli responses, signaling, and cell communication were suppressed (FDR 5.9e − 3). From a total of 3650 signatures available through GSEA, 240 and 18 gene sets were significantly enriched or suppressed in LSC, respectively. The top 10 LSC-enriched gene sets involved LSC signatures, inflammatory response, apoptosis, immune suppression, and adipogenesis, whereas HSC signatures were anti-correlated (Fig. 2b). Unsupervised visualization (Cytoscape) identified LSC-upregulated pathways related to abnormal cell division, quiescence, autoimmune regulation, and environmental stress, while gene sets involved in normal quiescence and cell death signaling were downregulated (Supplementary Fig. S1A). Altogether, these data suggest that dysregulation of the immune system contributes to the leukemic transformation of stem cells in pedAML.

Table 2 Enriched and suppressed pathways identified by functional protein association.

As pediatric and adult AML represent two genetically distinct diseases,32 we wondered if this heterogeneity is also reflected in the stem cell transcriptome. To this end, we re-analyzed the GSE 17054 microarray dataset from Majeti et al.17 and identified 486 significant DEGs between adult LSC and HSC (Supplementary Fig. S2A). Comparing the set of LSC-HSC DEGs identified in pediatric vs. adult AML revealed 71 common downregulated targets (Supplementary Fig. S2B), which was translated into mutual repressed pathways, for example, tight junction and MAPK signaling17 (Table 2). In sharp contrast, only three common LSC-upregulated transcripts were identified (TYROBP, CFP, and PTH2R).

Metabolic changes in pedAML L-blasts enhance proliferation compared to normal counterparts

We identified 157 and 332 significantly up- and downregulated targets in L-blast vs. C-blast. The top 50 upregulated transcripts is shown in Fig. 3a. Pathways enriched in L-blasts involved cancer transcriptional misregulation, FoxO signaling, and cytokine–cytokine receptor interaction (Table 2).

Fig. 3: Transcriptional differences and (anti-)correlated gene sets between L-blast and C-blast.
figure 3

a Visualization of genes identified to be upregulated in L-blast (n = 4) compared to C-blast (n = 3). Genes are plotted in a volcano plot as log 2 FC values against −log10 adj. P values. Thresholds |log 2 FC| > 2 and −log 10 adj. P < 0.05 are shown as dashed lines, DEGs are highlighted in red. The top 50-ranked upregulated genes, sorted by log 2 FC values, are annotated with gene symbols on the right. b Top 10 most correlated (top) and anti-correlated (bottom) gene sets identified through GSEA, based on the DEGs between L-blast and C-blast. The number of concordantly expressed (CE) genes/total genes and normalized enrichment scores (NESs) is shown for each gene set individually. FC fold change, DEGs differentially expressed genes, L-blast leukemic blast, C-blast control blast.

Functional protein network analysis (STRING) illustrated a significant enrichment in L-blasts of stimuli responses (FDR 6.8e − 11) and metabolic processes (FDR 1.4e − 05). GSEA identified 163 enriched and 23 suppressed gene sets in L-blast vs. C-blast. The top-ranked adipogenesis gene set correlates with metabolic dysregulation, whereas increased EGF signaling and decreased stemness signatures relate to high proliferation (Fig. 3b). Among others, upregulated cancer and EGFR signaling, and downregulated death signaling, were confirmed by unsupervised clustering (Supplementary Fig. S1B).

Interestingly, the top-ranked (anti-)correlated gene sets identified in LSC (Fig. 2b) and L-blast (Fig. 3b) partially overlapped, and also enriched and suppressed pathways were recurrent (Table 2). Therefore, we sought similarities in the DEGs identified in LSC and L-blast. From the 83 and 157 significantly upregulated genes in LSC and L-blasts, respectively, 49 genes appeared to be common (Supplementary Fig. S3A). On the other hand, 134 targets were mutually downregulated from a total set of 212 and 332 transcripts, respectively (Supplementary Fig. S3B). Taken together, we conclude that LSC and L-blast share pan-leukemic molecular aberrancies compared to their normal counterparts.

Novel candidate targets in pedAML leukemic subpopulations validated by qPCR

A selection of the top-ranked up- and downregulated targets in LSC vs. HSC and upregulated targets in L-blast vs. C-blast, as identified by microarray analysis, were validated by qPCR (29/83, 23/212, and 42/157, respectively). A total of 94 targets were evaluated according to a three-step exclusion strategy, allowing the most significant DEGs to be evaluated in the highest number of cell fractions. Patients were dichotomized as high and low for the highest differentially expressed targets. Per target, overexpression was correlated to cytogenetic and molecular markers, and if patients were included in the NOPHO-DBH AML2012 protocol, correlated with event-free survival (EFS).

First, differential expression was confirmed at a significant level (P < 0.05) for 24/29 LSC upregulated targets (Supplementary Fig. S4A). Moreover, differential expression was significant at P < 0.01 with concomitant low HSC expression for 12/24 targets. Expression of these 12 targets was further evaluated using additional LSC and HSC fractions (Supplementary Fig. S4B). Too low LSC expression, or too high HSC expression, led to the exclusion of 4/12 targets. Finally, expression of PLIN2, CFD, EMP1, DUSP5, ANXA2, CRIP1, CDKN1A, and CFP was evaluated in all fractions and shown to be highly significantly overexpressed in LSC (n = 42) compared to HSC (n = 20) (Fig. 4a). Overexpression of these eight targets was observed in 38–67% of the patients, with CDKN1A, CRIP1, CFP, and CFD overexpressed in more than half of the patients (Table 3). Expression was averagely 4- to 12-fold higher in LSC compared to HSC, with CFD and CRIP1 the most upregulated targets. CFP overexpression significantly correlated to FLT3-ITD mutations, for example, 46% in CFP-high (n = 14) vs. 10% in CFP-low (n = 10) patients (P = 0.043). High ANXA2 levels beneficially impacted EFS at a borderline significant level (P = 0.061, 4 ANXA2-high vs. 10 ANXA2-low patients, Supplementary Fig. S5A).

Fig. 4: qPCR validation of significantly DEGs in LSC vs. HSC.
figure 4

Targets with significant differential expression in LSC compared to HSC, shown by microarray analysis, were evaluated by qPCR analysis. Eight upregulated targets (a) and eleven downregulated targets (b) were evaluated in all available LSC (n = 42) and HSC (n = 20) fractions. Mean values are shown by horizontal lines, error bars indicate the 95% confidence interval of the mean, and the dotted line indicates the cut-off used to define overexpression, with the respective numbers of patients classified as high or low indicated above or below the line, respectively. P values (one-tailed) were calculated by the Mann–Whitney U test, and *, **, ***, or **** indicate the level of significance (P < 0.05, P < 0.01, P < 0.001 and P < 0.0001, respectively). LSC leukemic stem cell, HSC hematopoietic stem cell, DEG differentially expressed gene, CNRQ calibrated normalized relative quantities.

Table 3 Frequency and magnitude of overexpressed targets in LSC and L-blast.

Second, significant LSC downregulation was confirmed for 21/23 targets (Supplementary Fig. S4C, P < 0.05). Among these, 15/21 targets were even downregulated at P < 0.01, with virtual absent expression in LSC. Including more sample fractions led to the exclusion of 4/15 targets due to too low differential expression levels (Supplementary Fig. S4D). Evaluation of all fractions illustrated that MECOM, HLF, PLCB4, PLAG1, ATP9A, PTPRD, COL5A1, BEX2, DSG2, MYCT1, and PBX1 transcripts are highly significantly repressed in LSC compared to HSC (P < 0.0001, Fig. 4b). These targets appeared to be suppressed in 75–100% of all patients. BEX2 downregulation was significantly anticorrelated to KMT2A-rearrangements (P < 0.01), as previously demonstrated using cell lines.33 On the other hand, the previously reported association between PLAG1 and inv16 (p13q22), or between MYCT1 and FAB classifications M1/M5/M6, was not confirmed (P > 0.05).34,35 Interestingly, 9/11 pedAML LSC-downregulated targets were also significantly suppressed in adult LSC (Supplementary Fig. S1B). Furthermore, PBX1, MYCT1, HLF, ATP9A, and PLCB4 appeared to be also suppressed in other pediatric malignancies, suggesting a possible tumor suppressor role (Supplementary Fig. S6).

Third, qPCR confirmed a significant upregulation in L-blast vs. C-blast for 16/42 targets (Supplementary Fig. S4E, P < 0.05). Further analysis using more samples showed that expression was too low in L-blasts, or too high in C-blasts, for 7/16 targets (Supplementary Fig. S4F). Evaluation of the remaining nine targets in all samples illustrated that DUSP6, HOMER3, ANXA2P1, CTSA, RHBDF2, EMP1, GADD45B, TYROBP, and PNP transcripts were highly significantly overexpressed in L-blasts (n = 36) vs. C-blasts (n = 19) (Fig. 5). Five out of nine targets (RHBDF2, HOMER3, ANXA2P1, GADD45B, and TYROBP) were overexpressed in more than two-thirds of the patients, with HOMER3 showing the highest differential expression (Table 3). Interestingly, HOMER3-high cases (n = 21) showed significantly less inv16 (p13q22) (P = 0.014) and FAB M4 (P = 0.031), compared to HOMER3-low pedAML (n = 4). DUSP6, overexpressed in one-third of the patients, was previously shown to be significantly associated with FLT3-ITD in adult AML,36 which we could not confirm in a pediatric setting (P = 0.49). PNP-high patients showed a significant lower EFS (Supplementary Fig. S5B), which was confirmed by Cox log-rank univariate analysis (hazard ratio 9.24, P = 0.04), but did not remain significant in multivariate analysis.

Fig. 5: qPCR validation of significantly DEGs in L-blast vs. C-blast.
figure 5

Targets with significant differential expression in L-blast compared to C-blast, shown by microarray analysis, were evaluated by qPCR analysis. Nine upregulated targets were evaluated in all available L-blast (n = 35) and C-blast (n = 19) fractions. Mean values are shown by horizontal lines, error bars indicate the 95% confidence interval of the mean, and the dotted line indicates the cut-off used to define overexpression, with the respective numbers of patients classified as high or low indicated above or below the line, respectively. P values (one-tailed) were calculated by the Mann–Whitney U test, and *, **, ***, or **** indicate the level of significance (P < 0.05, P < 0.01, P < 0.001 and P < 0.0001, respectively). L-blast leukemic blast, C-blast control blast, DEG differentially expressed gene, CNRQ calibrated normalized relative quantities.

Discussion

We here describe a novel set of differentially expressed targets in LSC and L-blast of pedAML patients identified based on a CvN approach. Moreover, we reveal previously unexplored deregulated pathways in these leukemic subpopulations of children with AML.

Eight targets were found highly significantly overexpressed in LSC compared to HSC. CDKN1A, ANXA2, EMP1, and CFD were previously linked to leukemogenesis, whereas the role of PLIN2, DUSP5, CRIP1, and CFP in AML remains elusive. CDKN1A, CRIP1, CFP, and CFD were found to be most frequently overexpressed (>50% of the patients), with CRIP1 and CFD showing the highest differential expression. CDKN1A might represent an interesting target for LSC eradication, since elevated expression was reported to maintain LSC activity,37,38 and CDKN1A knockdown indirectly reversed stem cell quiescence.39 Overexpression of CFP and CFD in LSC of pedAML patients suggest a disturbed complement pathway regulation. CFP overexpression was significantly associated with FLT3-ITD mutations in pedAML (P = 0.043) and concomitantly overexpressed in adult AML, suggesting a role as age-independent LSC marker in (high-risk) AML. CFD expression was previously linked to poor outcome in adult AML,40 and its prognostic value in children awaits validation. Last, we found that patients with high ANXA2 LSC expression (38%, 4.8-fold higher expression than HSC) show a trend towards prolonged EFS (P = 0.061). This finding is in agreement with a previously reported favorable prognostic effect of ANXA2 in bulk pedAML cells.41 Hence, ANXA2 could hold promise as a prognostic biomarker in pedAML. Although only the top-ranked DEGs were validated by qPCR in our study, microarray analysis additionally revealed interesting targets for flow cytometric validation. Our data confirm upregulated CD96 and CLECL12 expression in pedAML LSC compared to HSC.42,43,44 A potential role for targeting CD180 and CD68 in LSC, or their qualification as follow-up marker, deserves further attention.

Suppression of LSC-specific downregulated targets was highly consistent across the different genetic subgroups (75–100% of all patients). Since several of these targets endow tumor suppressor roles in other cancer entities and are inactivated upon promoter hypermethylation, further investigation whether hypomethylating therapy could result into LSC eradication in pedAML is warranted. MYCT1 was already identified as a TSG in AML,35 and deserves the highest attention since expression levels are 100-fold lower in LSC compared to HSC. PBX1 was previously reported to act as both an oncogene and TSG.45 Suppressed PTPRD levels in LSC is in agreement with a previous report on PTPRD suppression in pedAML bulk leukemic cells.46

Transcriptional misregulation in cancer, osteoclastogenesis, and tight junction pathways were dysregulated in LSC. The “cancer transcriptional misregulation” pathway is associated with myeloid leukemogenesis and held responsible for tumorigenic epigenetic abnormalities.47,48 Distortion of osteoclastogenesis and tight junction pathways might provide LSC an advantage over HSC during homing towards the endosteal–vascular niche. The observed immune dysregulation, separating LSC from HSC, strokes with a previous statement that multiple inflammatory signaling pathways are involved in the generation of pre-LSCs.49

L-blast upregulated targets were overexpressed in a larger portion of patients compared to LSC upregulated targets (36–88%, median 72% vs. 38–67%, median 50%, respectively). Among these, only DUSP6 and HOMER3 were previously addressed in AML. DUSP6 is an important cellular signaling regulator overexpressed in AML.36 HOMER3 relates to the occurrence and development of AML,50 and increased levels were significantly associated with favorable cytogenetics in adult AML.51 Since HOMER3 also showed the highest differential expression compared to C-blasts, targeting could be of therapeutic value. CTSA also represent an interesting target, since several other cathepsins were shown to have a diagnostic, prognostic, or therapeutic significance in AML.52,53,54,55 GADD45B is known to be involved in negative growth control during myeloid differentiation,56 and suggested to play a role in the tumorigenesis of colorectal carcinoma.57 Finally, high PNP expression was significantly associated with a worse EFS. Studies with PNP inhibitors in relapsed and refractory leukemias are ongoing,58 and investigation of their applicability in pedAML might be worthwhile.

Deregulated pathways identified in L-blast compared to their normal counterparts suggest that disturbed regulation of cell cycling, apoptosis, glycolysis/gluconeogenesis, and oxidative stress resistance promote the maintenance and proliferation of blasts in a leukemic setting. Indeed, adapting to hypoxic conditions and switching from oxidative phosphorylation towards glycolysis was shown to correlate with an aggressive disease course in solid cancers.59 Gluconeogenesis blocking, previously proposed as general antitumor therapeutic strategy, should be further explored in pedAML.60

EMP1 was the only target highly significantly upregulated in LSC and L-blast (38% and 68% of patients, respectively, P < 0.0001). Although Ng et al.16 did not elaborate on its role, EMP1 was included in the LSC17 score,25 but not retained in the pedAML-specific LSC6-score.26 However, based on the here observed CD38-independent overexpression, and the previously reported in vitro targetability of EMP1 in B-ALL,61 its role as a therapeutic target in pedAML should be further explored.

We detected a high molecular heterogeneity between pediatric and adult AML at the stem cell level. LSC populations from both entities shared 71 suppressed transcripts, but only three mutual upregulated targets (TYROBP, CFP, and PTH2R) were identified. TYROBP and CFP have not been functionally associated with AML, and their role in LSC transformation should be further explored. PTH2R, on the other hand, is known to be upregulated in AML and MDS,62 including adult AML LSC.63 Further research is warranted to evaluate whether these three targets could serve as pan-LSC targets, irrespective of the age of onset. Although these findings further underline the distinct biology between pediatric and adult AML,32 it should be taken into account that, due to the small number of patients evaluated, the genetic subtypes might also play a role besides the age of the patients.

Our investigation has some limitations, including its design as a single-center study recruiting patients from different clinical trials, and the lack of protein expression data. We acknowledge that extensive in vitro and in vivo validation of several of the identified targets is needed to fully unlock the potential of the presented dataset. Nevertheless, we here provide a repository to the pediatric acute leukemia community allowing to fulfill the high need for alternative therapeutic strategies in pediatric AML. The limited number of patient samples asks for data to be interpreted with caution within the framework of generalizability. Although promising, these data need confirmation in larger, preferentially multicenter trials, as survival analyses were performed in a limited number patients. It is important to acknowledge that the pedAML cohort included one secondary AML evolved from juvenile myelomonocytic leukemia, one acute promyelocytic leukemia, and two relapsed patients, all excluded from outcome analysis.

In conclusion, we here report a unique set of LSC and L-blast-specific overexpressed genes in pedAML. Most targets have not been studied in AML, and are involved in immune regulation, apoptosis, adhesion, or intracellular signaling, making them attractive candidates for functional studies, refining signatures, and targeted therapy. Inflammatory pathways and immune regulation are critical biological networks perturbed in pedAML LSC, and L-blast presents a high proliferative cell cycle activity combined with metabolic dysregulation. In addition, we identified novel LSC-specific downregulated targets, often described as TSGs in solid tumors, of which some are relevant in adult AML LSC or other pediatric hematological diseases.