HMGCLL1 is a predictive biomarker for deep molecular response to imatinib therapy in chronic myeloid leukemia

Achieving a deep molecular response (DMR) to tyrosine kinase inhibitor (TKI) therapy for chronic myeloid leukemia (CML) remains challenging and at present, there is no biomarker to predict DMR in this setting. Herein, we report that an HMGCLL1 genetic variant located in 6p12.1 can be used as a predictive genetic biomarker for intrinsic sensitivity to imatinib (IM) therapy. We measured DMR rate according to HMGCLL1 variant in a discovery set of CML patients (n = 201) and successfully replicated it in a validation set (n = 270). We also investigated the functional relevance of HMGCLL1 blockade with respect to response to TKI therapy and showed that small interfering RNA mediated blockade of HMGCLL1 isoform 3 results in significant decrease in viability of BCR-ABL1-positive cells including K562, CML-T1 or BaF3 cell lines with or without ABL1 kinase domain mutations such as T315I mutation. Decreased cell viability was also demonstrated in murine CML stem cells and human hematopoietic progenitor cells. RNA sequencing showed that blockade of HMGCLL1 was associated with G0/G1 arrest and the cell cycle. In summary, the HMGCLL1 gene polymorphism is a novel genetic biomarker for intrinsic sensitivity to IM therapy in CML patients that predicts DMR in this setting.


Introduction
Introduction of tyrosine kinase inhibitors (TKI) has revolutionized the clinical management of patients with chronic myeloid leukemia (CML) [1] and has significantly improved the life expectancy of this group of patients to up to 98% of that in the general population [2]. In patients who These authors contributed equally: Jong-Ho Park and Young Min Woo achieve a sustained deep molecular response (DMR; defined as a molecular response with 4 or 4.5-log reduction) with TKI therapy, discontinuation of TKI therapy has emerged as a potential treatment option [3,4]. However, early identification of patients with higher chances of achieving DMR remains challenging and no predictive biomarker of DMR following TKI therapy has so far been described in CML patients.
Intrinsic drug sensitivity to TKI therapy is widely variable among CML patients. Although~70% of CML patients can achieve optimal responses such as major molecular response (MMR) following TKI therapy, not all of them can achieve DMR. Also the speed to achieve DMR is very variable. Some patients can rapidly achieve DMR typically within a year or two following initiation of TKI therapy, whereas others cannot achieve it even with longer durations of treatment. Given the observed wide spectrum of intrinsic sensitivity to TKI therapy among patients with CML, identification of a genetic biomarker that can predict DMR following TKI therapy could be of clinical value [5].
In the present study, we performed a genome-wide association study (GWAS) to identify genetic markers associated with DMR. We propose that HMGCLL1 gene polymorphism is a novel predicative biomarker for DMR following imatinib (IM) therapy. We also demonstrate that HMGCLL1 polymorphism is functionally relevant for CML cell growth and viability, and that HMGCLL1 blockade is cytotoxic to CML cells.

Discovery and validation data sets
We performed a GWAS on peripheral blood samples from 202 CML patients with East Asian ethnicity as a discovery set. The discovery set had been utilized in a previous study to identify a germline polymorphism marker associated with increased susceptibility to CML. A separate set of samples from 272 CML patients of European descent recruited in Canada was used as validation set. All patients in the discovery and validation sets were treated with IM frontline therapy [6][7][8][9]40]. The study was approved by the Institutional Review Boards.

Genotyping and quality control in the discovery and validation sets
In the discovery set, 906,530 SNPs were genotyped using Genome-Wide Human SNP Array 6.0 (Affymetrix, Santa Clara, CA, USA). SNPs showing erroneous genotype clustering patterns were filtered out. One sample with a missing genotype rate of > 5% was excluded from the analysis. In addition, 39,033 SNPs were excluded owing to low genotyping (with > 5% missing genotypes per marker) and 198,553 SNPs, owing to minor allele frequency of < 1%. A total of 637,886 autosomal SNPs in the discovery set (n = 201) were included in the final analysis.
We evaluated treatment response parameters following IM therapy including complete cytogenetic response (CCyR), and DMR. Definition of response following IM therapy is described in the Supplementary Information. Candidate loci associated with DMR were selected with the following criteria: minimum p values of < 5.0 × 10 -5 , and > five SNPs with p < 10 −4 within 1 Mb. For validation, a total of 25 SNPs with p < 10 −4 in candidate regions (i.e., 6p12.1 and 16q23.2) were genotyped in the validation set using the MassARRAY iPLEX system (Agena Bioscience, San Diego, CA, USA). Two samples with high missing genotype rates ( > 50%) were excluded from the final analysis.

Bioinformatics analysis
Linkage disequilibrium structure was assessed using Haploview version 4.2.7 [10]. IMPUTE2 program version 2.3.0 was used to impute SNPs not covered by Affymetrix SNP array 6.0 [11,12]. We also used a pre-phasing method, SHAPEIT2 (version v2.r727), for small loss in accuracy with fast imputation [13,14]. The reference panel used for imputation comprised of 90 known JPT + CHB haplotypes from the International HapMap Project data (Phase II Public Release #22 NCBI Build 36) [15]. Genome coordinates were converted from hg18 to hg19 using LiftOver tool from UCSC browser [16]. Locus-Zoom software was used to depict candidate regions in detail [17].

Fine mapping and expression quantitative trait loci (eQTL) analysis
Fine mapping and eQTL analysis were performed as described in the Supplementary Information.

Functional analysis of HMGCLL1 based on in vitro methods
We performed functional analysis of HMGCLL1 in order to investigate the effects of HMGCLL1 isoform type 3 blockade on cell lines expressing BCR-ABL1, as well as on hematopoietic progenitor cells (HPCs) isolated from CML patients' samples. Phospho-CrkL (pCrkL) assay and cell cycle analysis were also conducted. We performed more than three independent experiments with at least triplicates to ensure accuracy. Detailed descriptions of in vitro experiments are described in the Supplementary Information.

Statistical analysis
Cumulative incidence of responses to IM therapy including CCyR, MMR, and DMR were calculated considering competing risks (i.e., switch to other TKI or death or progression). Gray's test was used for comparison according to TCGAATAC haplotype. The Fine-Gray model was adopted for multivariate analysis. Student's t test was used for independent samples, and the Wilcoxon rank sum or Kruskal-Wallis rank sum test was used to calculate difference in cell viability or for eQTL analysis. All statistical analyses were performed using PLINK Version 1.07 [41], R (R Foundation for Statistical Computing, Austria), and EZR software (https://www.jichi.ac.jp/saitama-sct/SaitamaHP. files/statmedEN.html) [18].

Results
GWAS identified a locus of 6p12.1 as a predictive marker for DMR following IM therapy In the discovery set of CML patients (n = 201), two loci (i.e., 6p12.1 and 16q23.3) were identified as potential  Fig. 1 Results of genome-wide association analysis. a Manhattan plot shows the genome-wide p value identified in the discovery set of 201 chronic myeloid leukemia (CML) patients following imatinib (IM) therapy. Two loci (i.e., 6p12.1 and 16q23.2) were selected as candidate loci, each including more than five SNPs with p < 10 −4 within 1 Mb and minimum p values of less than 5.0 × 10 −5 . b and c The plots show cumulative incidence of deep molecular response (DMR; defined as a molecular response with 4 or 4.5-log reduction) in the discovery and validation sets, respectively. The red line indicates the group with TCGAATAC. TT indicates TCGAATAC/TCGAATAC homozygote haplotype. TG represents TCGAATAC/GTCTGCGT heterozygote haplotype. GG indicates GTCTGCGT/GTCTGCGT homozygote haplotype. Two cases in the discovery set and three cases in the validation set did not have haplotype information due to missing data of genotype or different haplotype constructed. One case in the validation set had missing data on days of achieving DMR candidates associated with DMR based on our selection criteria ( Fig. 1a In the validation set (n = 270), we attempted to replicate these two candidate loci (6p12.1 and 16q23.3) that were identified in the discovery set. Demographics and disease characteristics of patients in the discovery/validation sets are summarized in Table 1. The locus 6p12.1 was successfully validated, including eight SNPs (rs10948926, rs10948927, rs9370435, rs4546489, rs4275061, rs9475323, rs9475327, and rs9296791; Table 2), all of which were located in HMGCLL1 intron 6 (between exon 6 and exon 7). These SNPs showed high linkage disequilibrium in the same block (r 2 ≥ 0.99; Supplementary Fig. 2). As a representative SNP, the frequency of G allele in rs9370435 in the responder group from the discovery set was lower than that in the non-responder group (0.37 vs 0.45). A similar trend was seen in the validation set (0.29 vs 0.35). We also evaluated the association of eight SNPs, which were previously validated as markers for DMR following IM therapy with MMR and CCyR parameters (Supplementary  Tables 2-4).
In the discovery and validation sets, two representative haplotypes were constructed based on these eight SNPs (TCGAATAC and GTCTGCGT). These haplotypes covered 99% of samples in both the discovery and validation sets. The TCGAATAC haplotype was associated with lower chances of achieving DMR both in the discovery set (hazard ratio [HR]: 0.52; 95% confidence interval (0.40-0.73)) and the validation set (HR: 0.75 (0.57-0.98)) ( Table 2). When the cumulative incidence of DMR was analyzed, the group with the TCGAATAC haplotype showed a lower DMR rate than the group without this haplotype both in the discovery ( Fig. 1b; Supplementary Fig. 3a) and validation sets ( Fig. 1c; Supplementary Fig. 3b). When analyzed for MMR, as shown in Supplementary Fig. 4, the group with TCGAATAC haplotype showed a lower MMR rate compared with those without this haplotype both in the discovery and validation sets.
In multivariate analysis, the group with TCGAATAC haplotype had a lower probability of achieving DMR following IM therapy as expected. TCGAATAC haplotype remained an independent risk factor for DMR both in the discovery (p = 0.044, HR: 0.65 (0.43-0.99)) and validation sets (p = 0.019, HR: 0.66 (0.47-0.93)) after adjusting for variables such as age, disease stage, gender, and additional cytogenetic aberrations.
In order to exclude the possibility that these SNPs associate with CML leukemogenesis, we examined their frequency between CML patients and healthy controls.  Minor allele type (i.e., TCGAATAC haplotype or G allele in rs9370435) frequency in CML patients showed similar frequency to control subjects (0.41 vs 0.39; Supplementary Fine mapping study reveals that HMGCLL1 polymorphism haplotypes correlates with DMR In order to find potential causal variants associated with a risk haplotype, we performed targeted re-sequencing of the HMGCLL1 gene [19]. We selected 30 cases of CML (15 from the discovery and 15 from the validation group). Of these patients, 16 achieved DMR, whereas 14 did not (Supplementary Table 6). To maximize the detection probability of a DMR-associated functional variant, we excluded the cases carrying heterogeneous haplotypes and only included cases that carried a homogeneous haplotype of TCGAATAC (adverse haplotype) or GTCTGCGT (referent haplotype) [20]. A total of 673 variants (626 SNPs and 47 indels) were detected from the discovery set and 980 variants (869 SNPs and 111 indels) from the validation set. Although a coding variant was not detected in targeted resequencing, a few SNPs were found to be located in regulatory elements of HMGCLL1 gene, such as DNase I hypersensitive site (DHS) or in a sensitive region as previously described by Khurana et al. [21,22]. Specifically, in intron 6 where the TCGAATAC haplotype is located, three SNPs (rs7769320, rs7751266, and rs4521577) overlapped with the sensitive region and DHS in both sets. This finding implies that the intron 6 region is an important regulatory site that may affect gene expression and transcript splicing of HMGCLL1 through genetic variations, such as TCGAATAC haplotype [23].
HMGCLL1 produces several subtypes of HMGCLL1 isoforms HMGCLL1 is known to be transcribed into various alternative isoforms. To understand the functional role of the haplotype, we performed eQTL analysis to investigate the expression of HMGCLL1 isoforms [24]. The eQTL has been broadly utilized to investigate the biological effects of causal variants located in non-coding regions [25]. Detailed isoform types of the HMGCLL1 gene were extracted from public databases (NCBI and Ensembl; Supplementary Table 7). Isoforms were named from IS1 to IS6 according to the exon count of the isoforms. Six variants with total gene products spanning the whole HMGCLL1 gene region, IS1 + IS2, IS3 + IS6, IS4, IS5, and IS6, were found at mRNA level in normal samples (Supplementary Fig. 5; Supplementary Table 8). Of note, eQTL signals were significantly associated with TCGAATAC haplotype in the total gene Table 2 List of eight SNPs significantly associated with deep molecular response achievement following imatinib therapy in chronic myeloid leukemia patients HMGCLL1 is a predictive biomarker for deep molecular response to imatinib therapy in chronic myeloid. . . and in isoform IS3 (Total Gene: p = 0.0098, IS3 + IS6: p = 0.024; Supplementary Fig. 5b). The genotypic model also confirmed a significant association between total gene expression and TCGAATAC haplotype (Total Gene: p = 0.015; Supplementary Fig. 5c). Subjects with TCGAATAC haplotype showed higher gene expression levels of HMGCLL1 compared with those with GTCTGCGT haplotype in IS3, suggesting that HMGCLL1 gene polymorphism affects the expression level of a specific isoform [26]. Based on this result, we further investigated the effects of HMGCLL1 blockade targeting IS3 on CML cell lines as well as HPCs.  Fig. 2 Results of cell viability assay using K562, CML-T1, and BaF3 cell lines with ABL1 kinase domain mutations. a K562, b CML-T1, c BaF3/WT, d BaF3/G250E mut , e BaF3/ T315I mut , and f BaF3/F317L mut . IS3-targeted small-interfering RNA (IS3 siRNA) reduced growth of K562, CML-T1, BaF3/WT, BaF3/G250E mut , BaF3/T315I mut , and BaF3/ F317L mut cells. Results of cell growth are presented as average ± standard deviation of optical density (OD) value (Y axis) as time progresses (X axis) from at least three independent wells. Cell viability was also determined for untreated control, negative control (NC) treated with non-silencing siRNA, IS3si-treated, IM-treated, and combination of IS3si with IMtreated cells. Statistical analysis was performed using Student's t test with equal variance at 72 h after transfection. Compared with controls, cell viability (% mean ± SD) was significantly reduced to 32.59 ± 10.01 in the group transfected with IS3si, 7.11 ± 2.35 in the group treated with IM, and 0.17 ± 0.38 in the group treated with both IM and IS3si (Fig. 2a). Bliss independence (BLISS) analysis [27] showed a synergistic effect for IM therapy and IS3si in reducing the viability of K562 cells compared with IM alone.
Next, we evaluated the effect of blockade of other HMGCLL1 isoforms on K562 cell line. Cells were transfected with siRNAs targeting various HMGCLL1 isoforms (Supplementary Fig. 6a; Supplementary Table 9). The results of eQTL signal (i.e., IS3) and siRNA-targeted blockade of HMGCLL1 isoforms were concordant and showed that expression level of HMGCLL1 gene is strongly affected by the siRNA specific for isoform 3 (Supplementary Fig. 6b). TCGAATAC haplotype was associated with high IS3 expression level, which was associated with reduced DMR rate with IM therapy in the GWAS. Also, IS3 blockade inhibited the growth of K562 cells in vitro. Another BCR-ABL1-positive (BCR-ABL1 + ) cell line, CML-T1, also showed a similar trend of decrease in cell viability after treatment with IS3si, IM, or both (Fig. 2b).
We next tested the effects of IS3 blockade on growth of BaF3 wild-type and IM-resistant BCR-ABL1 mutant cells. Sequence homology of IS3 cDNA between human and murine genomes by ClustalW2 Multiple Sequence Alignment is known to be 88.35% [28]. The IS3si binding site sequence showed a perfect match between human and murine ( Supplementary Fig. 6c), which enabled us to test murine-derived BaF3 cells using the protocols for human cell lines. As expected, all those cell lines (BaF3/G250E mut , BaF3/T315I mut , and BaF3/F317L mut ) showed resistance to IM treatment (Fig. 2c-f). After treatment with IS3si alone, viabilities (%mean ± SD) of these three cell lines were reduced to 60.21 ± 5.72, 36.45 ± 1.98, and 56.40 ± 10.20, respectively. When IM was added to IS3si treatment, the repression of cell viability was observed regardless of the presence of mutations within ABL1 kinase domain (Fig. 2df). In wild-type BaF3 cells, synergistic inhibition of cell viability was observed when second generation TKIs, nilotinib, or dasatinib, were combined with IS3si (Fig. 3a).  Fig. 3 Results of cell viability assay after treatment with IS3si, dasatinib, and nilotinib in BaF3 cell lines with ABL1 kinase domain mutations. a BaF3/WT, b BaF3/G250E mut , c BaF3/ T315I mut , and d BaF3/F317L mut . IS3si reduced growth of BaF3/ WT, BaF3/G250E mut , BaF3/ T315I mut , and BaF3/F317L mut cells. Results of cell growth are presented as average ± standard deviation of OD value (Y axis) as time progresses (X axis) from at least three independent wells. Cell viability was determined for untreated control, NC treated with non-silencing siRNA, IS3si-treated, dasatinib-or nilotinib-treated, and combination of IS3si with dasatinib-or nilotinib-treated cells. Statistical analysis was performed using Student's t test with equal variance at 72 h Dasatinib reduced cell growth to 23.17 ± 6.82% in BaF3/ F317L mut and to 42.70 ± 12.77% in BaF3/G250E mut . Furthermore, IS3si universally showed synergistic inhibition of BaF3/G250E mut and BaF3/F317L mut when applied in combination with dasatinib (Fig. 3b, d). At last, the BaF3/ G250E mut cells showed resistance to nilotinib but the resistance was reversed when the cell line was treated with IS3si alone or a combination of IS3si and nilotinib (Fig. 3b). Of interest, T315I BCR-ABL1 mutant (BaF3/T315I mut ) cells also showed significant reduction in cell viability after treatment with either IS3si alone (to 36.45 ± 1.98%) or IS3si in combination with nilotinib (to 39.90 ± 2.82%) or dasatinib (to 29.23 ± 3.67%) (Fig. 3c). These data suggest that blockade of IS3 can inhibit the growth of both TKIsensitive and TKI-resistant CML cells. The results of CrkL assay confirmed that IS3 blockade-induced CML cell killing effect is mediated through BCR-ABL1 inhibition both in wild-type and mutant type CML cell lines including BaF3/ T315I mut cells ( Supplementary Fig. 7). BaF3/T315I mut cells especially did not show any decrease in the pCrkL/CrkL ratio after IM treatment. However, a significant reduction in the pCrkL/CrkL ratio was observed when these cells were treated with either IS3si alone or in combination with IM.

Influence of IS3si on proliferation of the human K562/T315I mut cell line
We also confirmed the inhibitory effect of IS3si on K562/ T315I mut cell line that was developed using CRISPR/ Cas9 system (Supplementary Fig. 8). Similar to the results observed in the BaF3/T315I mut cells, treatment with either IS3si alone or in combination with IM significantly decreased the viability of K562/T315I mut cells (Fig. 4). K562/T315I mut cells were resistant to IM or nilotinib monotherapy, but the sensitivity to IM or nilotinib was reversed when these agents were combined with IS3 siRNA.

IS3si suppresses colony-forming capacity of murine CML progenitor cells in vitro
We used an established protocol to isolate Lineage − Sca1 + cKit + (LSK) cells from tetracycline-inducible CMLaffected mice by cell sorting. These LSK cells were transfected with Cy3-labeled (fluorescence-marked) IS3si. Cy3positive and Cy3-negative cells were then purified and assessed for their colony-forming capacity (Fig. 5a). Compared with Cy3-negative LSK cells, colony-forming capacity of Cy3-positive LSK cells was attenuated with or without IM. Thus, IS3 blockade decreases the colonyforming capacity of murine CML HPCs in vitro and may inhibit the repopulation capacity of murine CML-HPCs (Fig. 5b).

Effects of IS3si on CD34 + cells from CML patients and normal healthy individuals-derived primary cells
Next, we investigated the effect of IS3 blockade on growth of primary CML cells isolated from patient samples. CD34 + fractions were isolated from samples of two CML patients (CD34 + /CML) and one normal healthy donor (CD34 + /Normal) and treated with IS3si with or without IM.
Although marked cell growth inhibition was seen in both CD34 + /CML and CD34 + /Normal cells after IS3si treatment, there was a trend of greater inhibition of cell growth in the CD34 + /CML fraction compared with the CD34 + /Normal fraction (p = 0.060). IM therapy significantly decreased the growth of CD34 + /CML fraction compared with the CD34 + /Normal fraction (p < 0.001; Fig. 5c).

RNA-seq reveals a mechanism related to IS3 isoform-mediated inhibition of CML cells
To elucidate the downstream signaling pathways affected by IS3 blockade, we performed RNA-seq using K562 cell line that were treated with IS3si or vehicle control. In particular, the cyclin-dependent kinases, CDK4 (p < 0.001, FC = −2.39) and CDK6 (p < 0.001, FC = −2.13), expression was significantly decreased with IS3 siRNA treatment (Supplementary Table 10). CDK4 and CDK6 are frequently overexpressed and dysregulated in diverse types of cancers [33,34]. The cyclin-dependent kinase CDK6 is crucial gene associated with leukemia stem cells (LSCs) [35]. Using qPCR, we confirmed that CDK4/6 gene expression was reduced after IS3si treatment (Supplemental Fig. 9). Gene set enrichment analysis (GSEA) showed that cell cycle gene set was the most significantly enriched pathway in the KEGG pathway sets (Familywise-error rate p = 0.001, Fig. 6a−c, Supplemental Table 11). Compared with vehicle control, suppression of K562 cells by IS3si promoted cell cycle arrest at G0/G1 phase and decreased S1 phase (p = 0.0017). IS3si-treated cells also displayed more profound cell cycle arrest at G0/G1 phase with decreased S1 phase compared with cells treated with CDK4 or CDK6 targeted siRNA. In K562/T315I mut cells with high TKI resistance, increased cell cycle arrest at G0/G1 phase was observed by cell cycle assay (p < 0.001; Fig. 6d The plot shows the effect of IS3si on murine CML LSK cells. Colony number per 1000 cells (Y axis) was counted after 7 days. Cy3-positive and Cy3-negative murine CML stem cells with or without IM (1 µM) were co-cultured with OP-9 stromal cells in vitro for 72 h. Colony-forming capacity of murine CML stem cells after treatment with IM was then determined. c The plot shows the effect of IS3si on human CD34 + cells isolated from clinical samples of patients with CML (CD34 + /CML) and normal healthy individuals (CD34 + /normal). Cell viability relative to control cells (Y axis) was measured at 192 h as average ± standard deviation (SD) from at least three independent wells. Cell viability (%) was calculated as follows: (absorbance of the treated wells−absorbance of the blank wells)/ (absorbance of the control wells−absorbance of the blank wells). Examination of cell viability was performed for untreated control, negative control (NC) treated with non-silencing siRNA, IS3si-treated, IM-treated, and combination treatment with IS3si and IM-treated cells. Statistical analysis was performed using Student's t test with equal variance. Asterisk marked p values were calculated by comparing with control fraction in each CD34 + /normal or CD34 + /CML, respectively. ***p < 0.001, **p < 0.01, *p < 0.05

Discussion
The present study demonstrates that the HMGCLL1 genetic variant is a powerful predictive biomarker for intrinsic sensitivity to TKI therapy that correlates with DMR rate in CML patients. The following findings have been replicated in an independent cohort of CML patients with different ethnicities and their functional roles have been validated using siRNA-based inhibition in in vitro models: (1) The HMGCLL1 genetic variant was a powerful predictive genetic biomarker for DMR with IM therapy, (2) Blockade of IS3 significantly reduced the viabilities of K562, CML-T1, BaF3/wild-type, and mutant BaF3 cell lines including the cell lines harboring ABL1 kinase domain mutation such as T315I mutation, (3) IS3 blockade reduced colonyforming capacity of murine CML stem cells treated in vitro and selectively suppressed the CD34 + fractions isolated from CML patients compared with healthy control.
Of clinical relevance, this genetic biomarker can be used to identify the cases expected to have poor intrinsic sensitivity to IM therapy even prior to commencement of TKI therapy. Our data indicate that patients with TCGAATAC haplotype have a 35% and 34% lower chance of achieving DMR with IM therapy in the discovery set and validation set, respectively, and would be good candidates for frontline therapy with second-generation TKIs, if the goal of CML treatment is achieving treatment free remission, for which DMR is a prerequisite. As shown by GWAS, locus 6p12.1 that includes HMGCLL1 had a strong correlation with DMR to IM therapy. TCGAATAC haplotype of HMGCLL1 gene variant was also strongly associated with lower MMR rate with IM therapy (Supplementary Fig. 4 Table 2). Thus, this genetic biomarker can be utilized for prediction of both DMR and MMR. HMGCLL1, also known as HMG-CoA Lyase Like 1, was first characterized as a novel lyase activity enzyme localized in extra-mitochondrial region [36,37]. The role of HMGCLL1 in susceptibility or resistance of cancer cells to chemotherapies has never been extensively investigated. Also, the biological relevance of HMGCLL1 in HPCs has never been systemically elucidated. Accordingly, we have tested whether this gene is associated with susceptibility to CML development, and compared the frequency of this haplotype between CML and healthy control groups using the data previously published (Supplementary Table 5) [38].

and Supplementary
Theoretically, HMGCLL1 inhibition can be a potential therapeutic strategy to improve DMR in CML patients. In our study, blockade of HMGCLL1 using siRNA significantly reduced the growth of BCR-ABL1 + cells, including K562, CML-T1, BaF3 wild-type, BaF3/T315I mut , and K562/T315I mut cell lines. Blockade of IS3 could also decrease the HPC fraction isolated from CML patients and CML-affected mice (Fig. 5). Of surprise, IS3 blockade could overcome ABL1 kinase domain mutations in BaF3 murine cell lines. Also, IS3 blockade could overcome the burden of T315I mutation in K562/T315I mut cells generated by CRISPR/Cas9 system. These findings suggest that mechanistically IS3 blockade is not conflicting with ATPpocket binding activity of TKIs, but functions through a different pathway. Thus, IS3 blockade could potentially be exploited to overcome TKI resistance in CML.
The results of the present study suggest that HMGCLL1 blockade could potentially sensitize CML LSCs to TKI therapy and improve the DMR rate following this treatment. As a result, more patients may be eligible for TKI discontinuation, suggesting a possible role for pharmacological blockade of HMGCLL1 gene or IS3 in treatment of CML.
Our RNA-seq data demonstrate that blockade of IS3 by IS3si in CML cells significantly decreases the expression levels of CDK4 and CDK6, suggesting a role for HMGCLL1 in cell cycle regulation in CML cells. This is strongly supported by consistent results of GSEA (in silico) and cell cycle analysis (in vitro) showing that the fraction of cell cycle arrest in G0/G1 phase is increased after IS3 blockade. Scheicher et al. have suggested that CDK6 is a key regulator of hematopoietic and leukemic stem cell activation [39]. In addition, Ng et al. recently published a 17-gene LSC score (LSC17) signature that included CDK6 as a marker to predict outcome in AML [35]. Given that the effect of HMGCLL1 inhibition on BCR-ABL1 + cell growth was mediated by the CDK4/6 pathway, CDK4/6 inhibitors such as palbociclib should be further investigated as a potential therapeutic option for CML treatment (Supplementary Fig. 9). Further preclinical studies are needed to determine the clinical applicability of HMGCLL1 or CDK4/ 6 inhibition in CML to improve DMR rates and enhance LSCs eradication.
In summary, our results highlight that the HMGCLL1 gene polymorphism is a predictive genetic biomarker for intrinsic sensitivity to IM therapy in CML and can be used to identify high risk patients that may fail to achieve DMR with IM therapy even prior to initiation of treatment.