DNA sequencing-based measurable residual disease (MRD) detection has shown to be clinically relevant in AML. However, the same methodology cannot be applied to fusion gene-driven subtypes of AML such as core-binding factor AML (CBF-AML). Here in this study, we evaluated the effectiveness of using DNA and RNA sequencing in MRD detection and in tracking clonal dynamics in CBF-AML. Using RNA-seq, we were able to quantify expression levels of RUNX1-RUNX1T1 and CBFB-MYH11 at diagnosis and their levels of reduction during remission (P < 6.3e−05 and P < 2.2e−13). The level of reduction of RUNX1-RUNX1T1 as measured by RNA-seq and qPCR were highly correlated (R2 = 0.74, P < 5.4e−05). A decision tree analysis, based on 3-log reduction of RUNX1-RUNX1T1 and cKIT-D816mut at diagnosis, stratified RUNX1-RUNX1T1 AML patients into three subgroups. These three subgroups had 2-year overall survival rates at 87%, 74%, and 33% (P < 0.08) and 2-year relapse incidence rates at 13%, 42%, and 67% (P < 0.05). On the other hand, although low residual allelic burden was common, it was not associated with long-term outcome, indicating that mutation clearance alone cannot be interpreted as MRD-negative. Overall, our study demonstrates that the clinical utility of RNA sequencing as a potential tool for MRD monitoring in fusion gene-driven AML such as RUNX1-RUNX1T1 AML.
Leveraging on advances in next generation sequencing (NGS) technologies, a wide range of heterogenous genetic baseline characteristics has been discovered in acute myeloid leukemia (AML)1,2. Recent studies showed that targeted DNA sequencing-based measurable residual disease (MRD) detection is feasible in AML3,4,5,6,7. However, the weakness of targeted DNA sequencing based MRD detection is that it is not efficient to reliably track breakpoints of gene rearrangements using a targeted panel. The prevalence of gene rearrangement-driven AML is about 30%8. For instance, gene rearrangements affecting core binding factor (CBF) complex, RUNX1-RUNX1T1/t(8;21) or CBFB-MYH11/inv(16) are found in about 15% of AML patients (hereafter referred to as t(8;21) and inv(16) AML, respectively)9. Compared with other AML subtypes, CBF-AML is considered as a favourable risk subgroup due to favourable long-term outcomes following high-dose cytarabine-based chemotherapy8,10. However, up to 40% of CBF-AML patients still relapse11,12,13. As a post-remission monitoring of CBF-AML, a quantitative PCR (qPCR) assay targeting t(8;21) and inv(16) at remission can identify patients at high risk of relapse and has become a routine procedure14,15,16. Recently, a study by Dillon et al. demonstrated that MRD assessment using targeted RNA sequencing is feasible and correlates well with qPCR assays17.
While qPCR is highly sensitive and sequence-specific, it can only test a limited set of fusion events, making it neither scalable nor being used as a discovery tool. Thus, multiple tests have to be performed at diagnosis in order to characterize appropriate MRD targets to monitor post-treatment. In addition, standardization procedures make it challenging to compare qPCR results from different institutions and hospitals. On the other hand, RNA sequencing (RNA-seq) is scalable to quantify multiple targets in a single assay. In addition, it can also simultaneously quantify transcript expression, discover novel fusion transcripts, and even detect somatic mutations. Considering these advantages of RNA-seq and reductions in sequencing costs in recent years, we hypothesized that a single assay of RNA-seq can be utilized not only to detect mutations and fusion transcripts at diagnosis, but also quantify their reduction level at complete remission (CR).
Here we describe a study to evaluate DNA and RNA sequencing on longitudinal samples taken at diagnosis and at CR, as potential MRD detection tools for post-remission monitoring of CBF-AML. With this approach, we hoped to gain additional insights on how NGS can be applied to monitor MRD in CBF-AML, as well as reveal comprehensive dynamics of somatic mutations, transcripts, and gene rearrangements from diagnosis till CR. To address these questions, we conducted DNA sequencing (DNA-seq) on 223 DNA samples collected from 87 patients including 62 patients with t(8;21) and 25 patients with inv(16) taken at time points including diagnosis, CR, and relapse. For a subset of patients with available samples, we conducted RNA-seq on 90 samples consisted of 42 pairs of diagnosis-CR samples as well as 6 relapse samples.
The main goal of current study was to assess whether residual genetic alterations including primary (i.e. subtype-defining gene rearrangements) and secondary (i.e. single nucleotide variants and short indels) lesions at CR can be quantified using RNA and DNA-seq and are clinically relevant. Most notably, current study demonstrates how RNA-seq can be utilized to track gene rearrangements during remission and serves as a proof-of-concept study for RNA seq-based MRD detection for gene rearrangement-driven hematologic malignancies.
Materials and methods
Patient cohorts and acquisition of samples
Eighty-seven patients with newly diagnosed CBF-AML from January 2003 to September 2015 were included in this study. All patients had an Eastern Cooperative Oncology Group (ECOG) performance status ≤ 2 at the time of diagnosis. Patients treated with investigational drugs before induction therapy were excluded. Treatment and response assessments are summarized in Supplementary Information. Samples for NGS were obtained at the time of initial diagnosis (n = 87 and n = 42 for DNA and RNA-seq), CR (n = 53/87 and n = 42/42 for DNA and RNA-seq), and relapse (n = 15/87 and n = 6/42 for DNA and RNA-seq). CR samples were obtained either at day 28 (n = 45/53 and n = 34/42 for DNA and RNA-seq) or day 56 after re-induction (n = 8/53 and n = 8/42 for DNA and RNA-seq). Among 155 samples subjected for DNA sequencing, 151 samples were taken from bone-marrow (BM) and other 4 samples were taken from peripheral blood (PB). For patients with available serial samples, 90 samples (42 + 42 + 6) from 42 patients were also subjected to RNA-seq. All patient samples were collected after obtaining informed consent. Among 90 samples subjected for RNA sequencing, 86 samples were taken from BM and other 4 samples were taken from PB. All experiments and methods were carried out in accordance with relevant guidelines and regulations. This study was approved by the institutional ethics review boards at Chonnam National University Hwasun Hospital and Severance Hospital, Republic of Korea (CNUHH-2015-118).
Sample preparation and next generation sequencing
BM/PB samples used in this study were prospectively procured and banked. After DNA isolation and library preparation, all samples were subjected to targeted sequencing. Targeted gene panel was constructed using a custom Agilent Sureselect design, covering exonic regions of 83 genes (Table S1 and Table S2). The gene list for targeted sequencing was compiled from our previous studies as well as large-scale mutation profiling studies on AML and MDS1,2,18,19. For RNA-seq, the total RNA was extracted from BM/PB mononuclear cells using the RNeasy mini kit (Qiagen, Germany). Total RNA concentration was calculated by Quant-IT RiboGreen (Invitrogen, Carlsbad, CA, USA). To assess the integrity of the total RNA, samples were run on the TapeStation RNA screentape (Agilent Technologies, Santa Clara, CA, USA). A total of 100 ng of total RNA was subjected to a sequencing library construction using a TruSight RNA Pan-Cancer Panel (1385 genes) (Illumina, Inc., San Diego, CA USA) according to the manufacturer's protocols. Both DNA and RNA samples were multiplexed and sequenced using an Illumina HiSeq 2500 (Illumina, San Diego, California, USA).
Sequencing metrics can be found in Table S3 (DNA-seq) and Table S4 (RNA-seq) and all sequencing data have been deposited at the European Nucleotide Archive (Accession: PRJEB25960 for DNA-seq and PRJEB27973 for RNA-seq). The mean on-target coverage for 223 DNA samples was 1,645.1X. RNA sequencing of 90 samples yielded an average of 3.6 million reads (range 2.8–4.5 M) with an average 86.9% mapping rate (range 83.4–90.5%). Read processing and variant calling procedures for DNA-seq data were performed as in our previous study with minor modifications19. Detailed methods on variant calling, RNA-seq read processing, transcript quantification, differentially expressed gene analyses and, fusion transcript discovery in diagnostic samples and its tracking in CR samples are provided in the Supplementary Information.
Patient characteristics were summarized using descriptive statistics. Categorical variables were analyzed using a Chi-square test and comparison of continuous variables was performed using the Student’s t-test. The overall survival (OS) was defined as the time of diagnosis until death from any cause, which was analyzed using the Kaplan–Meier method. The groups were then compared using the log-rank test. The cumulative incidence of relapse was calculated using the Gray method, considering death without relapse as a competing risk. Non-relapse mortality was defined as any death while in remission with relapse as a competing risk and was assessed with the Gray’s test. The prognostic impact of risk factors on OS was determined using the Cox proportional hazard model. Fine-Gray proportional hazard regression with competing events were used to identify risk factors for relapse. P-values of less than 0.05 were considered significant. R statistical software 3.1.3 (the R foundation for Statistical Computing, Vienna, Austria; available at https://www.r-project.org) and EZR (version 1.33) were used to perform statistical analyses and generate all figures20.
Patient characteristics and treatment outcomes
Among the 87 patients with newly diagnosed CBF-AML, 62 patients were t(8;21) AML and 25 were inv(16) AML. The median age of the patients was 44 years (range 16–85 years) and 50 patients (57%) were male. Induction chemotherapy was administered for 76 out of 87 (87%) patients. CR was achieved in 73 out of 76 patients (96%). The CR rate was 96% in both t(8;21) AML (n = 49/51) and inv(16) AML (n = 24/25). Three patients who failed to achieve CR died of treatment-related mortality (n = 2) or disease progression (n = 1). Post-remission therapy with high dose cytarabine was administered at CR. The patient characteristics are summarized in Table 1.
Targeted DNA sequencing reveals mutation profiles at baseline and their clinical relevance in CBF-AML
At diagnosis, we detected 166 mutations in 79 patients (n = 79/87, 90.8%) (Fig. 1 and Table S5). The 166 mutations consisted of 120 nonsynonymous single nucleotide variants (SNVs), 21 frameshift insertions, 6 frameshift deletions, 5 non-frameshift insertions, 5 splicing events, 5 stop-gain SNVs, 2 non-frameshift deletions, and 2 synonymous SNVs. The median number of mutations per patient was 2 (range 1–7). Among the 83 targeted genes, 33 genes were mutated in at least one patient, and 18 genes were recurrently mutated. Notably, KIT (n = 34/87, 39%), NRAS (n = 29/87, 33%), ASXL2 (n = 12/87, 14%), and KRAS (n = 11/87, 13%) were frequently mutated. Frequencies of mutation occurrence within targeted gene panel between two subtypes of CBF-AML, irrespective of genes, were comparable (93.5% in t(8;21) AML and 88% in inv(16) AML, p < 0.29). When grouped by associated biological pathways, genes associated with signaling pathways (n = 70/87, 80%), chromatin modifiers (n = 19/87, 22%) and, the cohesin complex (n = 13/87, 15%) were mutated in more than 10% of the cohort and frequencies of mutations in RAS (KRAS or NRAS), chromatin modifiers (including ASXL2) and, cohesin complex were significantly different between the two subtypes of CBF-AML (P < 0.002, P < 0.01, and P < 0.02, respectively, Figure S1). In particular, members of cohesin complex and chromatin modifiers were nearly exclusively mutated in t(8;21) AML.
We performed survival analyses to investigate association of somatic mutation with patient outcome (n = 76/87 after excluding the patients who did not receive chemotherapy) (Table S6). In univariate analysis, KIT-D816mut was identified as an adverse prognostic factor for OS (Hazard ratio (HR) 2.31, 95% confidence interval (CI) 1.04–5.14, P = 0.035). Three-year OS rate was inferior in KIT-D816mut group (47.1 ± 12.9%) to KIT-D816wt group (68.6 ± 6.4%; P = 0.035). In terms of CIR (cumulative incidence of relapse), KIT-D816mut was an adverse factor for CIR (HR 2.76 [1.06–7.15], P = 0.04), while mutations in RAS (NRAS or KRAS) was favorable for CIR (HR 0.13 [0.03–0.56], P = 0.01). Three-year CIR was higher in KIT-D816mut group (43.8 ± 12.1%) than KIT-D816wt group (22.0 ± 5.6%; P = 0.035). CIR was lower in RASmut subgroup than RASwt (6.4 ± 4.5% vs. 41.9 ± 7.8%; P = 0.001), while NRM (non-relapse mortality) was higher in RASmut subgroup (22.8 ± 7.5% vs. 7.3 ± 4.2%; p = 0.001). Thus the 3-year OS rate was not different between the groups of RASmut and RASwt (70.0 ± 8.3% vs. 59.0 ± 8.0%; P = 0.999). Overall, KIT-D816mut was the only adverse prognostic mutation for OS.
Longitudinal tracking of somatic mutations from diagnosis to remission and their clinical relevance
Taking advantage of samples taken at diagnosis and at CR, we assessed whether allelic burden at CR is clinically relevant. Among the 53 patients with available CR samples, 49 patients carried 99 mutations at diagnosis. Overall, 99 mutations showed mean reduction at CR with mean reduction rate of 99.1% (ranges from 86.2% to 100%, p < 0.0001). Out of 99 mutations, 53 mutations were cleared at CR (i.e. no reads supporting the variant allele) (Fig. 2a). The 46 detectable mutations at CR from 32 patients were commonly located in KIT (n = 13), NRAS (n = 8), and KRAS (n = 3). Mutations with higher VAFs at diagnosis are more likely to be detected at CR (mean VAF 27.4% for mutations detectable at CR vs. 17.8% for mutations cleared at CR, p = 0.001). To estimate mutation-like sequencing errors, we computed the frequency of mutation-like nucleotide changes in CR samples from patients who did not have the mutation in their diagnosis specimens. Overall, we observed that median mutation-like error rate is 0.008% in our data (interquartile range 0.004–0.013%). We then investigated whether the mutation clearance at CR is associated with subsequent occurrence of relapse (Fig. 2b). Among them, 15 mutations were from the 9 relapsed patients and 31 mutations were from the 23 non-relapsed patients. When comparing the frequency of mutation clearance, we did not find significant differences between the patients who relapsed and did not relapse (12/27 mutations in relapsed patients vs. 41/72 mutations in non-relapsed patients, p = 0.27). In addition, complete clearance rate in relapsed patients was comparable to that in non-relapsed patients (5/14 relapsed patients vs. 12/35 non-relapsed patients, P = 0.92). We then assessed clinical relevance of mutation clearance from various perspectives. Using 0.3% as a cut-off for allelic burden at remission, we did not find association of mutation clearance at 0.3% (MC03) with OS (HR 0.63, [0.20–1.97], p = 0.43) or with relapse risk (HR 0.84, [0.23–3.04], P = 0.80) (Fig. 2c,d). We have applied various other cut-offs but observed likewise patterns of no correlation with OS. For example, complete mutation clearance also did not any association with OS or relapse risk (HR 0.39, [0.11–1.36], p = 0.14 and HR 0.88, [0.32–2.47], p = 0.81). Among the 29 patients with RUNX1-RUNX1T1 with available qPCR data, 20 patients were MRD-positive by qPCR at CR. Nine patients who achieved MRD-negative also achieved MC03 as well. When considering only MRD-positive patients, achievement of MC03 did not affect OS (HR 0.73, [0.15–3.52], P = 0.69) and relapse incidence (HR 0.85, [0.14–5.14], P = 0.86, Fig. 2e,f). Lastly, we assessed whether persistence of cKIT-D816mut at CR is associated with higher risk of relapse, but complete clearance of KIT-D816mut also did not affect OS and relapse incidence (HR 0.94, [0.16–5.18], P = 0.94 and HR 0.58, [0.16–2.09], P = 0.40, respectively, Fig. S2).
We next assessed the relationship between dynamics of the fusion transcript and mutation events. By comparing reduction level of the RUNX1-RUNX1T1 fusion transcript measured by qPCR with reduction of allelic burden from 29 patients, we inferred the relationship between them (Fig. 2g). We defined three clusters showing distinct patterns. Cluster 1 consisted of mutations for which both somatic mutations and fusion transcript levels were detected (purple, 24 mutations from 17 patients). Cluster 2 was defined by both allelic burden and fusion transcripts not detected (green, 6 mutations from 3 patients). Cluster 3 was a group in which only somatic mutations were not detectable (orange, 28 mutations from 20 patients). Altogether, our results confirmed that the reduction level of fusion transcripts is more appropriate measure to assess MRD than single nucleotide variants and short indels in CBF-AML.
Spectrum and dynamics of transcript expression in CBF AML from diagnosis to complete remission
After reads processing and removing genes with very low expression in most samples, we quantified expressions of 1293 genes in 90 samples (Table S7)21,22. At diagnosis, we identified 297 differentially expressed genes (DEGs) between t(8;21) AML and inv(16) AML (Fig. 3a,b, Table S7, Fig. S3). Notably, RUNX1T1 was the most differentially expressed gene between two subtypes, where t(8;21) AML expresses it nearly 4300 times higher than inv(16) AML (Fig. 3b). On the other hand, expression of MYH11 was 6.3 fold higher in inv(16) AML (Fig. 3b).When paired diagnostic-CR samples for each subtype were compared, we found 402 and 286 DEGs in t(8;21) and inv(16) AML, where 200 genes were shared (Fig. S3, colored region). These 200 shared DEGs including PD1, PD-L1, PD-L2, and CTLA4 were enriched in Kyoto Encyclopedia of Genes & Genomes (KEGG) terms related to immune response such as cytokine-cytokine receptor interaction (q < 2.2e−5), allograft rejection (q < 0.02) and, graft − versus − host disease (q < 0.02) (Figs. S3–S5, Table S8)23.
With respect to fusion transcripts, in addition to the subtype-defining fusion transcripts, we detected 26 additional fusion transcripts from 18 patients (9 inv(16) and 9 t(8;21) AML) including two recurrent fusion transcripts in t(8;21) AML (SLC45A3-ELK4 and CSNK1G2-JAK2) (Fig. 3c and Table S9). Six samples taken at relapse (all t(8;21) AML) expressed RUNX1-RUNX1T1 with breakpoints identified in their corresponding diagnostic samples and their transcriptome profiles were clustered with diagnostic t(8;21) AML samples (Fig. 3c). All CR samples were clustered together regardless of subtypes.
Feasibility of RNA sequencing -based fusion transcript tracking as an alternative tool for MRD monitoring in CBF-AML
Subtype-defining fusion transcripts were detected in all 42 diagnostic samples. In every CR sample, we tracked the fusion transcript identified in their corresponding diagnostic samples. As expected, both RUNX1-RUNX1T1 and CBFB-MYH11 showed significant reductions in all CR samples compared to their corresponding diagnostic samples (P < 6.3e−05 and P < 2.2e−13, paired t-test, Fig. 4a,b). Despite low expression, CBFB-MYH11 was detectable in 6/19 (32%) and RUNX1-RUNX1T1 was detectable in 15/23 (65%) CR samples. On average, 1618 reads were mapped to RUNX1-RUNX1T1(IQR 1284–1764 reads) and 239 reads were mapped to CBFB-MYH11 (IQR 201–416 reads) in diagnostic samples. As the number of aligned reads were comparable between diagnostic and remission samples (3.01 M vs. 2.92 M reads, p = 0.19), limits of detection of RUNX1-RUNX1T1 and CBFB-MYH11 fusion transcripts using RNA-seq in CR samples would be approximately 3-log (1 in 1000) and 2-log (1 in 100) reduction. Reduction level of RUNX1-RUNX1T1 measured by RNA-seq showed positive correlation with the reduction level measured by qPCR (Pearson’s Rho = 0.74, P < 5.4e−05, Fig. 4c and Table S10). Three-log or deeper reduction of RUNX1-RUNX1T1 was able to predict relapse incidence (HR 0.22 [0.03–1.69], p = 0.14). When compared with qPCR-based MRD, RNA-seq-based MRD showed comparable performance in terms of sensitivity, specificity, positive/negative predictive value using 3-log reduction as a cut-off in RUNX1-RUNX1T1 AML (Table S11). qPCR data were not available for CBFB-MYH11 AML. Survival analyses using mutation profile identified cKIT-D816mut at diagnosis as an adverse prognostic factor for long-term outcome. Just as cKIT-D816mut was all detected with high allelic burden using DNA-seq, it was reliably detected using RNA-seq as well (allelic burden ranges 20.8–46.4% in DNA-seq and 32.7–73.4% in RNA-seq) (Table S12).
Prognostic modeling utilizing RNA-sequencing data in CBF-AML
As RNA-seq can reliably measure reduction levels of fusion transcripts and detect high-risk mutations (i.e. cKIT-D816mut), we attempted to build a prognostic model using decision tree analysis incorporating genetic features obtained from RNA-seq. Decision tree analysis identified three distinct subgroups of RUNX1-RUNX1T1 AML on the basis of reduction in the level of RUNX1-RUNX1T1 transcript and mutation profile (Fig. 5a). Consistent with previous studies, 3-log or deeper reduction of RUNX1-RUNX1T1 transcript was the most significant prognostic factor (low risk group)14,15. The algorithm further divided the patients who failed to achieve 3-log reduction according to the presence of cKIT-D816 mutation at diagnosis (intermediate and high-risk groups). For three defined groups, 2-year OS rates were 87%, 74%, and 33% (p = 0.08, Fig. 5b) and 2-year relapse incidence rates were 13%, 42%, and 67% (p = 0.048, Fig. 5c).
Current study demonstrated that RNA-seq can measure reduction levels of RUNX1-RUNX1T1 and CBFB-MYH11 from diagnosis to CR. RUNX1-RUNX1T1 reduction levels measured by qPCR and RNA-seq showed high correlation and concordance. The prognostic mutation (i.e. KIT-D816mut) was reliably detected in both DNA and RNA-seq data. Solely using information from RNA-sequencing, we built a prognostic model that can predict long-term outcome for t(8;21) AML patients. On the other hand, complete clearance of secondary genetic lesions (i.e. single nucleotide variants and short indels) including KIT-D816mut alone was not associated with better long-term outcomes and mutation persistence at low level did not provide prognostic power in addition to reduction level of the fusion transcripts. Altogether, our data and analyses suggest the clinical utility of RNA-seq as a method to detect MRD in CBF-AML and potentially in other fusion-driven hematologic malignancies.
Leveraging on paired-diagnosis-CR samples, we assessed the clinical relevance of residual genetic lesions at remission in CBF-AML. Although several studies demonstrated clinical relevance of residual allelic burden in non-CBF-AML, none of the studies thus far focused on CBF-AML3,4,5,6,7. To the best of our knowledge, transcriptome along with mutation profile on paired diagnostic-CR samples have not been investigated in CBF-AML. By tracking junctions identified in corresponding diagnostic samples, we were able to quantify the level of RUNX1-RUNX1T1 and CBFB-MYH11 in CR samples. The level of RUNX1-RUNX1T1 reductions measured by RNA-seq was highly correlated with qPCR result (Pearson’s Rho = 0.74, p < 5.4e−05). RUNX1-RUNX1T1 was detectable in 15 samples by both techniques and in 6 samples only by qPCR. In 2 samples, RUNX1-RUNX1T1 was undetectable by neither method. qPCR was more sensitive at detecting trace amount of RUNX-RUNX1T1 compared to RNA-seq at the sequencing coverage in our study (avg. = 2.98 M mapped reads per sample). However, we believe that these obstacles can likely to be overcome by achieving deeper sequencing depth as well as more tailored sequencing strategies. CBFB-MYH11 was only detectable in 6/19 CR samples at the sequencing depth used in current study and qPCR results were not available for comparison. Future studies on CBFB-MYH11 cohort with deeper sequencing depth and comparative analyses with qPCR remain to be investigated.
With respect to single nucleotide variants and short indels, allelic burden of all mutations showed significant reduction from diagnosis to CR (ranges from 86.2 to 100%). At CR, only 5 mutations out of 99 mutations (2 in TET2, 1 in JAK3, U2AF1, and ASXL1 each) were observed with a VAF above 1%. This mutation clearance level is higher than what was reported in non-CBF-AML3,4. This could be due to differences in mutation profile between CBF and non-CBF-AML. Klco et al. only included intermediate and poor risk AML patients and most frequently persistent mutations were in DNA methylation or clonal hematopoiesis-associated genes (DNMT3A, TET2, IDH1/2)3. In CBF-AML, those genes are much less commonly mutated. Incorporation of mutation dynamics with RUNX1-RUNX1T1 fusion transcript reduction level showed that reduction of mutation burden from diagnosis to CR is deeper than or equal to the reduction of RUNX1-RUNX1T1 fusion transcript. This re-confirms that complete clearance of allelic burden (i.e. secondary lesion) during remission does not guarantee eradication of leukemic cells, but deeper reduction of primary lesion (i.e. fusion transcript) is a more appropriate surrogate marker for predicting relapse in CBF-AML. In addition to the assessment of the entire cohort with available CR samples, we also showed that residual allelic burden did not further stratify high risk patients among MRD-positive patients. This result indicates that DNA-based MRD detection, only targeting secondary lesions does not provide additional prognostic power in addition to reduction level of the fusion transcripts in CBF-AML.
Consistent with previous studies, mutation profiles of two CBF-AML subtypes were different24,25,26. For example, members of the cohesin complex and chromatin modifiers including ASXL2 were nearly exclusively mutated in RUNX1-RUNX1T1 AML. Genes associated with activated signaling such as KIT, N/KRAS, CBL, and FLT3 were frequently mutated in both subtypes. Survival analyses showed that KIT-D816mut was an adverse prognostic factor. As shown in Lavallée et al.27, the pattern of transcript expressions at diagnosis further distinguished these two related-subtypes of AML. CR samples were clustered regardless of their subtypes. We noticed that there are nine genes that are differentially expressed in CR samples depending on their AML subtypes. However, all nine genes were differentially expressed in at least one of three-way comparisons and this is likely due to the residual leukemia cells in CR samples.
There are several limitations in our study. First, the cohort size was relatively small, and the prognostic model developed using the current cohort could not be validated using an independent cohort. Second, the experimental setting of RNA sequencing utilized in this study such as the gene panel, amount of total RNA used, and sequencing depth was not optimized for MRD purposes. For example, detection of CBFB-MYH11 using RNA-seq in CR samples requires experimental optimizations to be compatible with qPCR. Lastly, as some of fusion calls were only supported by few reads, further validation using alternative techniques such as qPCR should be performed to validate them.
Altogether, the current study demonstrates that RNA-seq can be utilized to monitor RUNX1-RUNX1T1 and CBFB-MYH11 level during remission in CBF-AML. Combining all of these supporting evidences, our data and analyses confirmed that the reduction level of fusion transcripts and baseline characteristics at initial diagnosis are the most important genetic features in CBF-AML when predicting long-term outcome and they provide complementary prognostic information. On the other hand, residual allelic burden of secondary lesion at CR measured by DNA-seq alone does not provide clinically relevant information in addition to those two genetic features. We also showed that a single assay of RNA-seq can be used to stratify RUNX1-RUNX1T1 AML patients into three distinct risk groups according to their long-term prognosis. Our findings warrant prospective studies with larger cohorts on gene rearrangement-driven hematologic malignancies utilizing more tailored sequencing strategies.
Cancer Genome Atlas Research Network. Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. N. Engl. J. Med. 368, 2059–2074 (2013).
Papaemmanuil, E. et al. Genomic classification and prognosis in acute myeloid leukemia. N. Engl. J. Med. 374, 2209–2221 (2016).
Klco, J. M. et al. Association between mutation clearance after induction therapy and outcomes in acute myeloid leukemia. JAMA 314, 811–822 (2015).
Jongen-Lavrencic, M. et al. Molecular minimal residual disease in acute myeloid leukemia. N. Engl. J. Med. 378, 1189–1199 (2018).
Morita, K. et al. Clearance of somatic mutations at remission and the risk of relapse in acute myeloid leukemia. J. Clin. Oncol. 36, 1788–1797 (2018).
Thol, F. et al. Measurable residual disease (MRD) monitoring by NGS before allogeneic hematopoietic cell transplantation in AML. Blood https://doi.org/10.1182/blood-2018-02-829911 (2018).
Kim, T. et al. Next-generation sequencing based post-transplant monitoring of acute myeloid leukemia. Blood 132, 1604–1613 (2018).
Döhner, H. et al. Diagnosis and management of AML in adults: 2017 ELN recommendations from an international expert panel. Blood 129, 424–447 (2017).
Speck, N. A. & Gilliland, D. G. Core-binding factors in haematopoiesis and leukaemia. Nat. Rev. Cancer 2, 502–513 (2002).
O’Donnell, M. R. et al. Acute myeloid leukemia, version 3.2017, NCCN clinical practice guidelines in oncology. J. Natl. Compr. Canc. Netw. 15, 926–957 (2017).
Löwenberg, B. et al. Cytarabine dose for acute myeloid leukemia. N. Engl. J. Med. 364, 1027–1036 (2011).
Marcucci, G. et al. Prognostic factors and outcome of core binding factor acute myeloid leukemia patients with t(8;21) differ from those of patients with inv(16): a cancer and leukemia group B study. J. Clin. Oncol. 23, 5705–5717 (2005).
Schlenk, R. F. et al. Individual patient data-based meta-analysis of patients aged 16 to 60 years with core binding factor acute myeloid leukemia: a survey of the German Acute Myeloid Leukemia Intergroup. J. Clin. Oncol. 22, 3741–3750 (2004).
Yin, J. A. L. et al. Minimal residual disease monitoring by quantitative RT-PCR in core binding factor AML allows risk stratification and predicts relapse: Results of the United Kingdom MRC AML-15 trial. Blood 120, 2826–2835 (2012).
Jourdan, E. et al. Prospective evaluation of gene mutations and minimal residual disease (MRD) in patients with core binding factor acute myeloid leukemia (CBF-AML). Blood 121, 2221–2223 (2013).
Willekens, C. et al. Prospective long-term minimal residual disease monitoring using RQ-PCR in RUNX1-RUNX1T1-positive acute myeloid leukemia: results of the French CBF-2006 trial. Haematologica 101, 328–335 (2016).
Dillon, L. W. et al. Targeted RNA-sequencing for the quantification of measurable residual disease in acute myeloid leukemia. Haematologica 104, 297–304 (2019).
Papaemmanuil, E. et al. Clinical and biological implications of driver mutations in myelodysplastic syndromes. Blood 122, 3616–3627 (2013).
Kim, T. H. et al. Clonal dynamics in a single AML case tracked for 9 years reveals the complexity of leukemia progression. Leukemia https://doi.org/10.1038/leu.2015.264 (2015).
Kanda, Y. Investigation of the freely available easy-to-use software ‘EZR’ for medical statistics. Bone Marrow Transplant. 48, 452–458 (2013).
Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 31 (2014).
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
Duployez, N. et al. Comprehensive mutational profiling of core binding factor acute myeloid leukemia. Blood 127, 2451–2459 (2016).
Faber, Z. J. et al. The genomic landscape of core-binding factor acute myeloid leukemias. Nat. Genet. 48, 1551–1556 (2016).
Eisfeld, A.-K. et al. Mutations in the CCND1 and CCND2 genes are frequent events in adult patients with t(8;21)(q22;q22) acute myeloid leukemia. Leukemia 31, 1278–1285 (2017).
Lavallée, V.-P. et al. RNA-sequencing analysis of core binding factor AML identifies recurrent ZBTB7A mutations and defines RUNX1-CBFA2T3 fusion signature. Blood 127, 2498–2501 (2016).
This study was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) as funded by the Ministry of Science and ICT (NRF-2018R1A2A1A05078480), the National R&D Program for Cancer Control, Ministry of Health & Welfare, Republic of Korea (1720160). ZZ acknowledges funding from the Natural Science and Engineering Council of Canada (NSERC) (RGPIN-2017-06743) and from the National Natural Science Foundation of China (NSFC) (81672923). DK acknowledges funding from the Leukemia and Lymphoma Society of Canada (New Idea Award) and from the Princess Margaret Cancer Foundation. TK is supported by scholarship from NSERC (PGS-D). The biospecimens used for this study were provided by the Biobank of Chonnam National University Hwasun Hospital, a member of the Korea Biobank Network.
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kim, T., Moon, J.H., Ahn, JS. et al. RNA sequencing as an alternative tool for detecting measurable residual disease in core-binding factor acute myeloid leukemia. Sci Rep 10, 20119 (2020). https://doi.org/10.1038/s41598-020-76933-2
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.