Main

Transcription factor (TF) networks regulate gene expression and are a key determinant of cell fate decisions and identity1. In cancer and other disorders, these networks are frequently corrupted and are often driven by genomic mutations that alter the function, expression or genomic localization of key TFs2. However, despite the importance of aberrant TF networks in disease development, our understanding of the intricacies of TF dynamics is limited and the repertoire of small molecules available to specifically target TF networks is inadequate3.

Given the importance of TF DNA binding site occupancy in dictating cell fate4,5,6,7, we decided to investigate a chemical approach to selectively block TF occupancy by using DNA-binding heterocyclic diamidines. These compounds are designed to non-covalently bind to adenine–thymine (AT)-rich minor grooves adjacent to ETS family core binding motifs and destabilize specific TF–DNA interactions, leading to TF displacement in a sequence-selective manner8,9. This pharmacological approach is unique in that it specifically inhibits the TF–DNA interaction, distinct from other approaches whereby the TF protein itself is inhibited or targeted for degradation10. We leverage select versions of these compounds to gain insight into TF dynamics and cell fate, with a particular focus on the hematopoietic TF PU.1 (also known as SPI1). This TF is an ideal candidate for investigation as it directs a well-characterized transcriptional program that governs cell fate4,11,12, has demonstrated proof-of-concept sensitivity to binding site inhibition, specifically compared to other ETS family TFs8,9,13,14, and is commonly dysregulated in leukemia and other diseases15,16,17,18,19,20,21,22,23,24,25,26.

We uncover that chemically driven binding site restriction drives a phenomenon that we refer to as ‘pharmacologically induced TF redistribution’. Binding site restriction caused a redistribution of PU.1 genomic binding and a seemingly paradoxical increase in PU.1-driven transcription at a subset of target genes. As a consequence, these chemical agents drive a robust myeloid differentiation program in both cell lines and primary samples from patients with acute myeloid leukemia (AML). We use CLICK chemistry-tagged versions of these PU.1-redistributing agents to catalog the sensitivity of the entire PU.1 cistrome to canonical binding site restriction and provide insight into the fast biology of endogenous PU.1, particularly into its pioneering ability and chromatin occupancy kinetics. In principle, the chemically induced TF redistribution we describe herein can be used not only to broaden our understanding of transcriptional processes but also as a mechanism to designate a new class of agents—‘TF redistributors’—which can be used in the fight against transcriptional aberrations in disease.

Results

Pharmacological binding site restriction repositions PU.1

Proof-of-concept studies into the inhibition of TF binding using synthetic heterocyclic diamidines in cell-free systems have established pharmacological displacement of PU.1 as a viable pharmacological strategy8,27 but have provided little insight into the molecular and cellular consequences of such treatment. To characterize the consequences of PU.1 binding site restriction in a cellular context, we investigated the genomic localization of PU.1 following exposure to the tool compound DB2115 (Fig. 1a). A 5 µM dose of DB2115 was administered to PU.1-expressing MOLM13 cells to exert maximal functional effect (~90% growth inhibition) without altering the levels of PU.1 transcript or protein (Fig. 1b,c) before PU.1-directed CUT&Tag was performed.

Fig. 1: PU.1 binding site blockade causes a GC content-biased repositioning of genomic PU.1 binding.
figure 1

a, Schematic of the experimental approach to preparing MOLM13 cells for CUT&Tag following a 12 h 5 µM DB2115 treatment. b,c, Representative western blot displaying protein (b) and RNA expression of PU.1 following 12 h of 5 µM DB2115 treatment (mean ± s.e.m., n = 3 experimental replicates) (c). d, Proportions of lost, gained and unchanged PU.1 peaks following DB2115 treatment. Differential binding was calculated by Diffbind with an FDR < 0.1, n = 3. e, Z-score heatmap of ETS motif enrichments. Known ETS motifs were identified from Homer analysis of lost, gained and unchanged PU.1 peaks. f, Log odds ratio score for the PU.1 consensus sequence (from a previous publication30) in lost, gained and unchanged PU.1 peaks. PWM, position weight matrix. g, Representative viewer tracks of genomic loci displaying lost (blue boxes), gained (red boxes) and unchanged (unmarked) PU.1 binding. h, Annotation in relation to distance to gene transcription start site of lost, gained and unchanged PU.1 peaks. UTR, untranslated region; TTS, transcription termination site. i, Proportion of GC content from the central 100 bp of lost, gained and unchanged PU.1 peaks. ****P < 0.0001, two-sided Mann–Whitney test. j, Comparison of log2fold change (FC) in PU.1 peak score (DB2115-treated/vehicle) versus GC content of central 100 bp of all peaks (colored according to peak groups). k, GC content position frequency matrix of lost, gained and unchanged peaks that have been centered on a short consensus ETS motif (GAGGAAGT) and examined ±25 bp. l, Single-nucleotide position frequency matrices for adenine, thymine, guanine and cytosine, comparing lost, gained and unchanged PU.1-centered peaks. m, Motif generation from PU.1-centered, lost (DB-sensitive) and gained (DB-resistant) peaks displaying a robust and extended upstream enrichment of A nucleotides in lost peaks. Also shown is the PU.1 motif used for centering peaks. VEH, vehicle.

Source data

Strikingly, robust redistribution of PU.1 binding rather than global PU.1 displacement was uncovered. CUT&Tag identified 34,904 total PU.1 binding sites, consistent with the number of PU.1 peaks found in prior localization studies28,29,30, with 19% of sites being lost and 15% being gained following DB2115 exposure (n = 3, Diffbind false discovery rate (FDR) < 0.1; Fig. 1d,g and Extended Data Fig. 1a,b,e). Log odds ratio scoring of a canonical PU.1 motif30 and Homer analyses confirmed the enrichment of high-affinity PU.1 motifs (Class III ETS) within PU.1 CUT&Tag peaks, indicating a direct interaction between PU.1 and the DNA at gained, lost and unchanged sites (Fig. 1e,f and Supplementary Data 1). Interestingly, fewer lost PU.1 sites were found in promoter regions (4.4%) compared to gained sites (21.9%) (Fig. 1h).

Given that DB2115 was designed to inhibit canonical PU.1 sites by blocking key AT-rich minor groove interactions surrounding the core GGAA8,9,13, we examined the guanine–cytosine (GC) content of the central 100 bp of lost, gained and unchanged PU.1 sites. As expected, the GC content was significantly less in lost PU.1 binding sites (Fig. 1i) compared to gained and unchanged sites. Indeed, the majority of PU.1 peak losses occurred in regions with <50% GC content (Fig. 1j), suggesting that DB2115 specifically displaced PU.1 from AT-rich binding sites.

To understand the AT bias surrounding PU.1 motifs, we aligned all peaks to a central motif (GAGGAAGT) and examined the GC content of the adjacent nucleotides (±25 bp). There was robust depletion of GC content surrounding the core GGAA ETS motif for the lost PU.1 sites compared to gained or unchanged sites (Fig. 1k). Further examination of the nucleotide composition identified that adenines were the predominant nucleotide enriched in the upstream and downstream regions of lost sites (Fig. 1l). This was reflected in a de novo motif generated from DB-sensitive (lost) PU.1 sites containing more than six adenines upstream of the core motif (Fig. 1m). Although heterocyclic diamidines do not have a preference for poly-A or poly-T minor grooves31, thymines were not equivalently enriched. This is partially explained by overall motif occurrence, with poly-A flanked ETS sites being more common than poly-T flanked sites (Extended Data Fig. 1f,g).

To confirm that pharmacological PU.1 redistribution was not an artefact of DB2115-mediated Tn5 transposase inhibition, we performed classical chromatin immunoprecipitation (ChIP). Despite some differences in peak number and distribution, the ChIP dataset corroborated the PU.1 redistributive phenomenon, characterized by profound losses and gains of PU.1 binding sites occurring with similar biases in GC content (Extended Data Fig. 2).

PU.1 repositioning is a class-specific and selective phenomenon

Pharmacological redistribution of PU.1 occurred consistently across several PU.1-expressing cell lines, THP1, HL60 and MV411, with similar GC content biases (Fig. 2a and Extended Data Fig. 3a–c). Despite differences in baseline PU.1 localization, drug-induced PU.1 redistribution showed similarity across cell lines (Extended Data Fig. 3d,e). This was most profound amongst losses (71% of peaks lost in MOLM13 were commonly lost in ≥1 other cell line) compared to gains (32% of peaks gained in MOLM13 were commonly gained in ≥1 other cell line; Fig. 2b). Examination of these common sites identifies AT-richness and PU.1 motif score as contributing factors for consistent redistribution (Extended Data Fig. 2f–i). We generated a reference list of 347 high-confidence PU.1 binding sites that are consistently displaced with DB2115 (Supplementary Data 2). Overall, we posit that PU.1 loss is context-independent in nature, driven predominantly by sequence-specific drug affinity, whereas PU.1 relocalization is less predictable by sequence and more dependent on cellular context.

Fig. 2: TF redistributors mediate selective, class-specific PU.1 repositioning in cell lines and primary AML samples.
figure 2

a, Comparison of log2fold change of PU.1 peak score (12 h of 5 µM DB2115-treated/vehicle) versus GC content of the central 100 bp for all peaks for THP1, HL60 and MV411, showing the number and percentage of peaks redistributed in each cell line (n = 2 for each cell line). b, Similarity between MOLM13 PU.1 peak changes with the other three cell lines. Peaks were classed as either common change to MOLM13 in one, two or three other cell lines or not commonly changed to MOLM13. Lost peaks were first filtered to exclude peaks not detected in other cell lines. c, Scatter plots comparing log2fold change of PU.1 peak score (DB2115-treated/vehicle) versus GC content of the central 100 bp for all peaks from two AML primary patient samples, including the number and percentage of peaks that are lost and gained in each sample. d, Motif generation from PU.1-centered, pooled primary AML sample redistributed peaks displaying a robust and extended upstream enrichment of A nucleotides in lost peaks. Also shown is the PU.1 motif used for centering peaks. e,f, Representative genomic loci displaying lost (e; blue boxes), gained (f; red boxes) and unchanged (unmarked) PU.1 binding in primary AML samples. g, Percentage of lost and gained CUT&Tag peaks of the TFs PU.1, RUNX1, ELF1 and GABPA in MOLM13 cells treated for 12 h with 5 µM DB2115. The dashed line represents the percentage of PU.1 lost or gained peaks. h, The percentage of lost or gained TF peaks that overlap with redistributed PU.1. N/A, not applicable. i, Percentage of lost and gained CUT&Tag peaks of the TFs GATA3, RUNX1, ELF1 and FLI1 in JURKAT cells treated for 12 h with 5 µM DB2115. The dashed line represents the percentage of PU.1 lost or gained peaks found in MOLM13 cells. j,k, Scatter plots comparing log2fold change of PU.1 peaks (drug-treated/vehicle) versus GC content of the central 100 bp following 12 h of 5 µM DB2373, DB2313 and DB2826 (j), or 400 nM cytarabine (ARA-C) and 200 nM daunorubicin (DAUN.; k). Shown as an inset in each graph are the numbers and percentage of lost or gained PU.1 peaks, n = 1.

Encouragingly, primary AML or myelodysplastic syndrome (MDS)/myeloproliferative neoplasm cells exhibited robust TF redistribution following 12 h of DB2115 treatment in liquid culture (Fig. 2c,e,f). Similar biases in lost PU.1 peak GC content, including the enrichment for upstream A nucleotides, were evident (Fig. 2d and Extended Data Fig. 3j). Comparison of primary AML against the MOLM13 dataset revealed that there was more similarity between lost regions (33%) than gained regions (6%) (Extended Data Fig. 3k).

To evaluate the selectivity of the compounds for causing PU.1 redistribution, we performed CUT&Tag on other TFs (non-ETS family: RUNX1; other ETS family factors: ELF1 and GABPA). DB2115 exerted minimal (ELF1, GABPA) to mild (RUNX1) TF redistribution in MOLM13 cells (Fig. 2g and Extended Data Fig. 4a). However, a significant proportion of these minor TF redistribution events could be attributable to the redistribution of PU.1 itself, with 39–52% co-occurring with PU.1 (Fig. 2h). Furthermore, we examined drug-induced TF redistribution (RUNX1, GATA3, ELF1 and FLI1) in JURKAT cells (which express minimal to no PU.1). Minor redistribution was evident (Fig. 2i and Extended Data Fig. 4b) but this was ~fivefold less than the redistribution observed for PU.1 in MOLM13. Overall, these data indicate that PU.1 is uniquely sensitive to DB2115-mediated binding site inhibition compared to these other TFs.

To determine whether PU.1 redistribution was a drug class effect, we examined newly developed heterocyclic diamidine PU.1 binding site inhibitors DB2373 and DB2826 plus one previously described compound, DB2313 (ref. 9) (Extended Data Fig. 3l). Encouragingly, all compounds caused PU.1 redistribution, characterized by robust GC-content-specific gains and losses (Fig. 2j). Furthermore, to confirm that this is a class-specific effect, we examined PU.1 localization in MOLM13 cells after 12 h of exposure to 400 nM cytarabine or 200 nM daunorubicin and found that neither triggered PU.1 redistribution (Fig. 2k).

CLICK-on-CUT&Tag identifies A-rich PU.1 site drug targeting

To test whether TF redistributors directly displace PU.1 and which genomic loci experience direct displacement, we generated an alkyne-linker-tagged version of DB2115 that is amenable to click chemistry (DB2750). This linker-tagged compound (Fig. 3a) was viable for copper-catalyzed cycloaddition with azide-AF488 and localized to the nucleus (Fig. 3b). Furthermore, the antiproliferative activity of the molecule was similar to DB2115 (inhibitory concentration 50 (IC50) DB2750, 270 ± 20 nM versus DB2115, 630 ± 8 nM; Fig. 3c) and we found that DB2750 caused GC-content-dependent PU.1 redistribution (Fig. 3d) with >70% equivalence in peak gains and losses compared to DB2115 treatment (Extended Data Fig. 5a–c).

Fig. 3: CLICK-on-CUT&Tag identifies sensitive, A-rich PU.1 sites as direct targets of drug binding.
figure 3

a, Chemical structure of DB2750, an alkyne-linker-tagged version of DB2115. b, Representative immunofluorescence image of MOLM13 cells treated with vehicle or 5 µM DB2750, and CLICK-chemistry stained with azide-AF488 (green) and DAPI (gray). c, Dose–response curve of MOLM13 proliferation (cell titer blue assay) for DB2750 and DB2115; n = 3 experimental replicates displaying mean ± s.e.m. d, Scatter plots comparing log2fold change of PU.1 peaks (DB2115-treated/vehicle) with GC content of the central 100 bp following 12 h of DB2750. Shown as an inset in the graph are the number and percentage of lost or gained PU.1 peaks; n = 1. e, Schematic of CLICK-on-CUT&Tag and artificial fragment CLICK pulldown experimental procedures. NGS, next-generation sequencing. f, Enrichment of the two artificial DNA fragments (AT-rich and AT-poor; see Extended Data Fig. 2) following pulldown with DB2750-coated or non-drug-coated magnetic beads. Data are represented as a percentage of input cycle threshold score ± s.e.m.; n = 3 experimental replicates, *P < 0.05 (P = 0.0312) with two-sided Student’s t-test. g, Proportion of DB2750 binding sites out of all PU.1 binding sites (drug binding was determined to be log2fold change > log2(0.5)). h, Representative genome viewer tracks of vehicle or DB2115-treated PU.1 CUT&Tag (gray) as well as CLICK-on-CUT&Tag (purple), showing drug binding at specific PU.1 loci only. i, Summary enrichment scores of CLICK-on-CUT&Tag data, represented as the log2fold change of DB2750-pulldown/input. j, Highest ranked de novo motif found enriched in drug binding and non-drug-binding PU.1 sites. k,l, Scatter plot comparing CLICK-on-CUT&Tag enrichment score versus GC content of central 100 bp of each peak (k) or log2fold change of DB2115 over vehicle-treated PU.1 peak score (l). m, Color-coded scatter plot of CLICK-on-CUT&Tag enrichment score versus GC content displaying only lost (blue) and gained (red) peaks.

To investigate the sequence specificity of TF redistributors, we developed an in vitro click chemistry approach to screen affinity in select synthesized PU.1 binding sites or the entire PU.1 cistrome (Fig. 3e). Initially, to test the viability of this approach, we mixed two 68-bp DNA fragments from either an AT-rich, DB2115-sensitive PU.1 binding site (SENP2 intergenic, TAAAAGGAGGAAGTG) or an AT-poor, DB2115-resistant PU.1 binding site (SPI1 upstream regulatory element, GCTGTTAGGGAAGGG) and performed a pulldown with DB2750-coated magnetic beads (Fig. 3e and Extended Data Fig. 5d,e). There was a strong enrichment of AT-rich over AT-poor fragments, demonstrating that the drug is indeed binding preferably to AT-rich DNA and that the CLICK-pulldown approach is viable (Fig. 3f).

To identify drug-binding sites amongst the entire PU.1 cistrome, we developed ‘CLICK-on-CUT&Tag’, in which the input DNA for the pulldown is the output from PU.1 CUT&Tag (Fig. 3e). CLICK-on-CUT&Tag identified drug binding at many PU.1 binding sites (16,447 out of 38,549 sites >log(0.5) enrichment; Fig. 3g,h). The strongest enrichment occurred at DB2115-lost PU.1 sites, denoting direct drug-mediated displacement of PU.1 (Fig. 3i). Furthermore, de novo motif enrichment of drug-bound sites identified an upstream A-rich ETS motif similar to the motif identified from DB2115-lost PU.1 sites (Figs. 3j and 1e). Conversely, motif enrichment of non-drug-bound sites generated a motif without upstream A-bias and predicted low-affinity PU.1 binding (Fig. 3j). There was an inverse relationship between CLICK-pulldown enrichment and both GC content and fold change of PU.1 peaks following DB2115 treatment (Fig. 3k,l). Notably, there was a pattern of lost PU.1 sites with low GC content corresponding with high CLICK enrichment compared to gained PU.1 sites (Fig. 3m). Overall, these findings support a model of direct compound-mediated displacement of PU.1 from AT-rich flanking sites.

PU.1 repositioning restructures chromatin and transcriptome

To determine whether pharmacological TF repositioning could alter the accessible chromatin landscape, we performed assay for transposase-accessible chromatin with sequencing (ATAC–seq) following 12 h of DB2115 exposure (Fig. 4a). Profound changes to chromatin accessibility were evident; 33% of the total 149,462 accessible regions were closing and 20% were opening (FDR < 0.1; Fig. 4b). ETS motifs were strongly enriched in closing, opening and unchanged regions, suggesting that PU.1/ETS factors are key chromatin accessibility-controlling factors in these cells (Fig. 4c). Closing sites displayed a lower occurrence in promoter regions than unchanged and opening sites (Fig. 4e). Similarly, closed sites displayed a profound deficiency of GC content compared to gained or unchanged sites (Fig. 4d).

Fig. 4: PU.1 repositioning restructures the accessible chromatin landscape and rewires the PU.1-driven transcriptome.
figure 4

a, Schematic of the experimental approach to preparing MOLM13 cells for ATAC and RNA sequencing. b, Venn diagram of chromatin accessibility changes following DB2115 treatment (analyzed using Diffbind with an FDR < 0.1, n = 2). c, De novo motifs found in closing, opening or unchanged accessible chromatin regions (binomial exact test, Homer). d, GC content of the central 100 bp of closing, opening and unchanged accessible chromatin. e, Annotation of closing, opening or unchanged accessible chromatin. f, Summary pie charts of lost, gained and unchanged PU.1 peaks displaying their chromatin accessibility status following treatment. g, Representative viewer tracks of genomic loci displaying lost (blue box), gained (red boxes) or unchanged (unmarked) PU.1 binding regions with both PU.1 CUT&Tag (top) and ATAC data (bottom). h, Enriched de novo motifs from the following categories of peaks: lost PU.1 and closing, lost PU.1 and unchanged, gained PU.1 and opening and gained PU.1 and unchanged (Homer analysis with background of lost PU.1 and unchanged for lost PU.1 and closing peaks (and vice versa) or background of gained PU.1 and unchanged for gained PU.1 and opening peaks (and vice versa)). i, Differentially expressed genes (DEGs) identified following 20 h of DB2115 treatment (DESeq2, log2fold change >/< 0.5 and FDR < 0.1, n = 3). j,k, Gene set enrichment analysis (GSEA; MSigDB) of upstream TF pathways (j) and chemical and genetic perturbations (k). l, Volcano plots of gained, lost and unchanged PU.1 peak-associated gene expression. Accompanying pie charts illustrate the proportions of DEGs that are up-regulated and down-regulated. m,n, Volcano plots of the transcriptional changes of gained (m) and lost (n) PU.1 peak-associated genes filtered on opening or closing chromatin, and in promoter–intron–exon regions only. Accompanying pie charts illustrate the proportion of DEGs that are up-regulated and down-regulated. o, The promoter–intronic–exonic DB2115-target genes that were lost and closing (194) or gained and opening (506) were analyzed for cell identity from the human gene atlas (Enrichr). Average expression, P value (Fisher’s exact test) and odds ratio (OR) are shown. RPKM, reads per kilobase per million mapped reads. p, K-means clustering of binarized human cell ATAC peaks32 uncovered ten clusters associating with differing cell identities. q, Using this k-means cluster as a reference, the ‘identity’ of PU.1 CUT&Tag lost, gained and unchanged peaks was determined; only clusters showing enrichment were labeled.

Next, we filtered PU.1 peaks on redistribution status and examined chromatin accessibility. The majority of PU.1-lost sites were found in closing areas of the chromatin (84%), whereas the majority of PU.1-gained sites were found in opening areas (78%; Fig. 4f,g and Extended Data Fig. 6a). Overall, these data support the notion that PU.1 exerts powerful pioneering control over chromatin accessibility29. Given that some sites with PU.1 loss remained unchanged in accessibility (Fig. 4f), we investigated whether other chromatin factors were cooperating with PU.1 to mediate chromatin accessibility changes. We examined enriched motifs in the four different classes of PU.1 changed sites (Fig. 4h). Interestingly, lost PU.1 and closing sites, as well as gained PU.1 and unchanged sites were enriched for A-rich motifs such as SOX5, IRF4 and ZNF384. An opposite effect was observed in the lost PU.1 and unchanged chromatin, and gained PU.1 and opening chromatin, with a strong enrichment of G-rich motifs such as ETS1, BORIS and SMAD3. These data suggest that surrounding co-factor binding may have a role in determining the chromatin accessibility changes of regions experiencing PU.1 redistribution.

To investigate the effects of PU.1 redistribution on the transcriptome, we performed RNA sequencing 20 h after exposure to DB2115 (Fig. 4a). Gene set enrichment analysis of differentially expressed genes identified the PU.1/ETS pathways as predominant upstream networks driven by DB2115 treatment (Fig. 4i,j). In addition, there was positive enrichment of differentiation signatures and negative enrichment of proliferative and stem cell signatures, which would be expected during myeloid differentiation driven by high levels of PU.1 (Fig. 4k).

Overlap of transcriptomic changes with PU.1 CUT&Tag was evaluated by examining gene expression near annotated CUT&Tag peaks (Extended Data Fig. 6b). PU.1-gained sites were enriched for up-regulation of gene expression, whereas PU.1-lost sites had an equivalent level of up-regulation and down-regulation (Fig. 4l). The low enrichment of down-regulated genes in PU.1-lost sites is probably a result of the time required to degrade pre-existing transcripts, with long half-life transcripts unlikely to show reductions at this time point.

To identify high-confidence redistributed PU.1 target genes, we filtered on PU.1-gained and PU.1-lost sites that occurred within close proximity to gene bodies (that is, promoter, intron, exon and transcription termination site regions only) and that resulted in respective changes in ATAC status or gene expression (that is, opening and up-regulation for gained PU.1 sites and vice versa) (Fig. 4m,n). This produced lists of 194 PU.1-lost genes that showed enrichments in homeostatic and cell-cycle-related pathways and signatures of bone marrow and hematopoietic progenitor cells, and 506 PU.1-gained genes that showed enrichment in developmental and maturation pathways and signatures of differentiated myeloid cells (Supplementary Data 3, Fig. 4o and Extended Data Fig. 6c).

To further investigate the signatures found in TF redistributor exposed cells, we generated a cellular identity map32 (Extended Data Fig. 6d). This generated ten distinct open chromatin region clusters that associated with human hematopoietic cell types (Fig. 4p). By applying our PU.1 CUT&Tag data, we identified a unique enrichment of lost PU.1 sites in the stem cell or progenitor cluster (progenitor no. 2), whereas gained PU.1 sites were enriched in ubiquitous regions found in all cell types, including myeloid cell types (common no. 1 and common no. 2). Thus, pharmacological PU.1 redistribution directs cells away from a progenitor or stem cell identity and towards a differentiated state (Fig. 4q). Taken together with the transcriptomic data, these findings indicate that DB2115 directly displaces PU.1 from progenitor or stem cell-essential gene regulatory regions, leading to PU.1 relocation at pro-differentiation regions.

Temporally resolved consequences of redistributed PU.1

Although PU.1 has been described as possessing ‘non-classical’ pioneering abilities29, to our knowledge, the ability of PU.1 to pioneer chromatin under steady-state endogenous conditions has not been investigated. To understand the kinetics of PU.1 redistribution, we examined PU.1 localization after 1 h, 4 h and 12 h of DB2115 exposure. A total of 30,683 PU.1 binding regions were identified, with 11,487 sites changing in occupancy. A substantial proportion of these changes occurred after just 1 h (10%) or 4 h (22%) of incubation (Fig. 5a). These early changes predominantly consisted of PU.1 losses (Fig. 5a, right panels), but equilibrated to equivalent gains and losses at the later time points. Examination of the relationship of drug affinity or GC content over the PU.1 redistribution time course identified that the earliest losses exhibited the lowest GC content and highest CLICK enrichment score (Extended Data Fig. 7a,b, left panels), reflective of ordered and affinity-driven PU.1 displacement. Early PU.1 gains also exhibited lower GC content and higher CLICK enrichment, suggesting that PU.1 is ushered to the most GC-rich, drug-free PU.1 binding sites over time (Extended Data Fig. 7a,b, right panels). These findings suggest that PU.1 is rapidly displaced from chromatin by DB2115 but then requires a ‘searching’ period to locate and bind to alternative lower-affinity chromatin sites.

Fig. 5: Temporally resolved pioneering of chromatin accessibility and nascent transcription by redistributed endogenous PU.1.
figure 5

MOLM13 cells were treated with 5 µM DB2115 for 1 h, 4 h or 12 h and CUT&Tag, ATAC–seq and PRO–seq were performed. a, Time of PU.1 peak changes and corresponding gains and losses at each time point. b, Time of chromatin accessibility changes and corresponding opening and closing at each time point. c, Known motif enrichment z-scores from closing and opening regions at 1 h, 4 h and 12 h (Homer analyses). d,e, Representative viewer tracks of genomic loci displaying both PU.1 CUT&Tag (top tracks) and ATAC–seq (bottom tracks) over the time course. Arrows indicate time of first detection of gain (d) or loss (e) of PU.1 or opening (d) and closing (e) chromatin. f,g, Heatmap depicting the time of chromatin opening for PU.1-gained sites (f) or depicting the time of chromatin closing for PU.1-lost sites over the time course (g). Sites were filtered to remove gained and lost PU.1 sites without changes in chromatin accessibility. h, Comparison of log2fold change of gene expression from PRO–seq data versus associated PU.1 peak log2fold change from PU.1 CUT&Tag conducted over the time course. The proportions of reduced DEGs out of all lost PU.1-associated DEGs (blue font) and increased DEGs out of gained PU.1-associated DEGs (red font) are shown in each dot plot. i,j, Representative IGV tracks of positive and negative sense PRO–seq data displaying loss (i) or gain (j) of transcription over the time course. k, Cumulative pie charts depicting time of gene up-regulation or down-regulation from PRO–seq data grouped by time (1 h, 4 h or 12 h) of associated PU.1 loss or gain. l, Representative smFISH images from MOLM13 cells for the lost and gained PU.1-associated genes MYC (upper panels), and FGR (lower panels) over the DB2115 time course. MYC and FGR transcripts are in white pseudo-color; DNA is in blue pseudo-color. m,n, Transcription site (TS) burst frequency per cell for MYC (m) and FGR (n) over the time course. o,p, Number of nascent mRNA molecules per cell of MYC (o) and FGR (p) over the time course. PU.1 CUT&Tag and ATAC sequencing peaks were created using Diffbind, n = 2 and FDR < 0.1. PRO–seq DEGs were called using the NRSA pipeline; n = 2 and Padj < 0.05.

To understand the dynamics of chromatin accessibility changes in relation to PU.1 redistribution, we performed an identical DB2115 time course with ATAC–seq. Approximately 78,271 sites exhibited changing chromatin status, with a substantial proportion occurring after just 1 h (10%) and 4 h (23%) (Fig. 5b). Interestingly, both the 1 h and 4 h changes predominated with closing events, but this equilibrated to equal opening and closing events by 12 h. Homer motif analysis revealed that the early closing regions and late opening regions were most highly enriched for ETS motifs (Fig. 5c).

Focusing specifically on chromatin accessibility changes at PU.1 redistributed sites highlighted that many gained sites displayed PU.1 binding before chromatin opening (FGR and CSF1R; Fig. 5d and Extended Data Fig. 7c), whereas lost sites predominantly exhibited simultaneous closing (RUNX1 and POMP; Fig. 5e and Extended Data Fig. 7c). Global examination of these redistribution versus chromatin accessibility events revealed that PU.1 gains were preceding the opening of the chromatin (Extended Data Fig. 7d). This effect was most noticeable when examining PU.1-gained sites at 4 h, with the majority opening at the later 12 h time point (Fig. 5f). Conversely, the relationship between loss of PU.1 and closing of chromatin appeared more synchronous (Extended Data Fig. 7e), with predominant simultaneous losses and closing (Fig. 5g).

To determine the effect of pharmacologically mediated PU.1 redistribution on nascent transcription, we performed precision nuclear run-on with sequencing (PRO–seq). Overall, the number of differentially transcribed genes increased from 185 to 2,358 to 8,387 at 1 h, 4 h and 12 h of DB2115 exposure, respectively (Extended Data Fig. 7f). Focusing on genes associated with changes in PU.1 binding, gained sites typically led to increases in transcription at all time points and vice versa for lost sites (Fig. 5h), consistent with our RNA sequencing data (Fig. 4m and Extended Data Fig. 7g). Redistributed PU.1-associated nascent transcripts changed at later time points than the corresponding changes in PU.1 binding or chromatin accessibility, with the majority of nascent RNA transcription fluctuations occurring at 12 h (Fig. 5k and Extended Data Fig. 7h; for example, MYC and FGR loci in Fig. 5i,j).

To examine the transcriptional dynamics of PU.1 redistribution at single-molecule resolution, we performed targeted single-molecule RNA-fluorescence in situ hybridization (smFISH) for genes associated with loss (MYC), gain (FGR) or unchanged (SPI1) PU.1 binding (Fig. 5l and Extended Data Fig. 7i). For MYC, we observed significant reductions in transcription site burst frequency (Fig. 5m) and total nascent transcripts per cell (Fig. 5o). By contrast, for FGR, we observed significant increases in transcription site burst frequency (Fig. 5n) and total nascent transcripts per cell (Fig. 5p). SPI1 nascent transcripts and transcription site burst frequency did not change at any time point analyzed (Extended Data Fig. 7j,k). Overall, the dynamics of pharmacologically mediated PU.1 redistribution provide evidence for both the pioneering ability of endogenous PU.1 within native chromatin and, in turn, the ability of DB2115 to redirect transcriptional activity.

PU.1 redistribution activates genes at alternate PU.1 target sites

To identify direct regulation of PU.1 at candidate target sites, we used a genetic system to imitate pharmacological PU.1 displacement, whereby a catalytically dead Cas9 (dCas9) occupies cis-regulatory regions to block TF binding33. We designed sgRNA against a DB2115-sensitive PU.1 binding site in the enhancer of the STRAP gene (Extended Data Fig. 8a) (Fig. 6a). Characteristic pharmacological displacement of PU.1 by DB2115 in MOLM13 cells was recapitulated in sgSTRAP+ dCas9 cells (Fig. 6b, top and middle panels). A dCas9 peak was evident at the identical location as PU.1 loss, indicating dCas9-mediated PU.1 displacement. Quantification of STRAP mRNA confirmed a DB2115-like reduction in expression in sgSTRAP+ dCas9 cells, thereby confirming that PU.1 displacement from cis-regulatory sites has direct effects on gene expression (Fig. 6c).

Fig. 6: Both genetic and pharmacological PU.1 redistribution activates gene expression at known and newly identified alternate PU.1 target sites.
figure 6

a, Experimental schematic for generation and evaluation of dCas9 and sgRNA-expressing MOLM13 cells b, Representative viewer tracks of genomic loci displaying PU.1 CUT&Tag of vehicle or DB2115-treated cells (top two tracks), sgNT-expressing or sgSTRAP-expressing cells (middle two tracks) and dCas9 CUT&Tag of sgNT-expressing and sgSTRAP-expressing cells (bottom two tracks). Arrows highlight the PU.1 or dCas9 binding at the STRAP enhancer. sgNT, non-targeting control sgRNA. c, Relative mRNA expression of STRAP by qPCR for DB2115-treated cells (for 24 h, compared to vehicle) and sgSTRAP dCas9+ cells (48 h of doxycycline, compared to sgNT dCas9+); n = 4 experimental replicates displaying mean ± s.e.m. d, Experimental schematic for the design and evaluation of the native PU.1-driven eGFP reporter assay in MOLM13 cells. e,f, Representative eGFP fluorescence histograms (e) and summary mean fluorescence intensity (MFI) data for the three eGFP reporter-transduced MOLM13 cells (unchanged-CD11b site, lost-POMP site and gained-CSF1R PU.1 binding sites; see Extended Data Fig. 8b) following 24 h treatment with 5 µM DB2115 or vehicle (f); n = 4–5 experimental replicates, two-sided, paired Student’s t-tests, **P < 0.001 (P = 0.0096), ***P < 0.0005 (P = 0.0005). g, Mean change in eGFP MFI for the three reporter MOLM13 cell lines following 24 h treatment with 5 µM DB2115 (black), DB2373 (purple) or DB2836 (orange) compared to vehicle ± s.e.m., n = 3–5 experimental replicates.

To further investigate the effects of PU.1 redistribution on target gene expression, we designed several genetic eGFP reporter systems in which eGFP fluorescence is driven by an endogenous PU.1 binding site (Fig. 6d). The PU.1 binding sites selected were from a gained (CSF1R alternative promoter), lost (POMP intergenic enhancer) and an unchanged (CD11b promoter) site (Extended Data Fig. 8b). Upon stable transduction of these reporters, we found that the baseline level of eGFP was different for each reporter but was reflective of our CUT&Tag results (Fig. 6e and Extended Data Fig. 8c). Upon DB2115 exposure, the lost-site reporter showed a robust decrease in eGFP fluorescence whereas the gained-site reporter showed a significant increase in fluorescence (Fig. 6e,f). Additionally, two second-generation compounds, DB2373 and DB2826, demonstrated comparable changes to eGFP expression in the three reporters (Fig. 6g). These data establish that pharmacological PU.1 redistribution has both positive and negative effects on gene expression.

Myeloid differentiation is driven by PU.1 repositioning

Many myeloid surface receptor transcripts were directly up-regulated by pharmacologically induced PU.1 redistribution (Supplementary Data 3). To understand whether PU.1 redistribution can enhance protein expression or function, we investigated two myeloid and immune surface receptors, CSF1R (known PU.1 target) and IL-4R (non-canonical PU.1 target). Encouragingly, surface expression of CSF1R drastically increased from 0% to 30% (Fig. 7a) and IL-4R increased by 15% following DB2115 exposure (Fig. 7b).

Fig. 7: Pharmacological TF redistribution induces myeloid lineage receptor responsiveness and promotes differentiation of leukemic cells.
figure 7

a,b, Proportion of CSF1R+ (a) and IL-4R+ MOLM13 cells (b) after 24 h or 48 h of treatment with vehicle or 1 µM DB2115, and a representative histogram of staining intensity after 48 h; n = 7 experimental replicates displaying mean ± s.e.m. **P < 0.01 (P = 0.0057 (24 h IL-4) and P = 0.0096 (48 h IL-4)) and ****P < 0.0001. c, Schematic of approach to invoke surface receptor expression on MOLM13 cells and assess response to ligands (CSF1 and IL-4). d, Fold change of pS6 MFI versus baseline for CSF1 stimulations of drug-invoked MOLM13 cells, including representative histograms; n = 3 experimental replicates displaying mean ± s.e.m.; *P < 0.05 (P = 0.0234) and **P < 0.01 (P = 0.005). e, Fold change of pSTAT6 MFI versus baseline for IL-4 stimulations of drug-invoked MOLM13 cells, including representative histograms; n = 3 experimental replicates displaying mean ± s.e.m.; *P < 0.05 (P = 0.0164) and **P < 0.01 (P = 0.006 (3 ng ml−1) and P = 0.0025 (10 ng ml−1)). f, Representative day 7 colony assay images of drug-invoked or vehicle-invoked MOLM13 cells grown in methylcellulose containing no growth factors, +100 ng ml−1 CSF1 or +100 ng ml−1 IL-4. g, Summary D7 colony numbers from DB2115-pretreated or vehicle-pretreated MOLM13 cells in the absence or presence of ligands, n = 3–4 experimental replicates displaying mean ± s.e.m, *P < 0.05 (P = 0.0248). h, May-Grünwald Giemsa cytospin image of MOLM13 cells treated with vehicle or 1 µM DB2115 for 5 days. The experiment was repeated independently three times with similar results. i,j, Representative histograms (i) and fold change in MFI versus vehicle for cell surface markers from 5-day treated cells, n = 7 experimental replicates displaying mean ± s.e.m (j). k, Experimental schema illustrating the evaluation of primary AML samples following vehicle or DB2115 exposure in methylcellulose colony assays. l, Representative light microscope images of AML colonies (sample 6) after vehicle or 5 µM DB2115 treatment for 8 days. Example image is from one experiment with two technical replicates with similar results. m, Colony counts from samples from the seven patients with AML exposed to vehicle, 1 µM or 5 µM DB2115 for 8–13 days, n = 2 technical replicates per sample. n, Summary heatmap of the percentage change in CD marker expression in primary AML cells following 5 µM DB2115 compared to vehicle. All statistical tests displayed were unpaired, two-sided Student’s t-tests.

To investigate the functional effects of pharmacological induction of CSF1R or IL-4R expression, we induced receptor expression with DB2115 before assessing ligand stimulation responses via phospho-protein flow cytometry (Fig. 7c). Vehicle-pretreated MOLM13 cells did not respond to CSF1 stimulation at any dose, whereas DB2115-pretreatment led to dose-dependent increases in pS6 levels (Fig. 7d). Similarly, IL-4 stimulations led to enhanced pSTAT6 activity in drug-invoked over vehicle-invoked MOLM13 cells (Fig. 7e). Next, we grew drug-invoked MOLM13 cells in methylcellulose containing either CSF1 or IL-4 for 1 week (Fig. 7c). DB2115-pretreatment led to reduced frequency and size of MOLM13 colonies compared to vehicle (Fig. 7f). Addition of CSF1 or IL-4 increased the number of colonies in the DB2115-invoked group (Fig. 7g); however, the surviving cells displayed high expression of the myeloid marker CD15, indicating a commitment to differentiation (Extended Data Fig. 9a–e).

Next, we sought to investigate whether MOLM13 cells would differentiate following longer-term TF redistributor exposure. After a 5 day exposure to low-dose DB2115 in liquid culture, MOLM13 cells displayed myeloid differentiation, gaining macrophage-like morphology (Fig. 7h) and increased surface expression of myeloid markers CD11b and CD86, but not the stem cell marker CD34 (Fig. 7i,j). Consistently, THP1 cells displayed a similar myeloid differentiation phenotype under TF redistributor exposure, with macrophagic morphology (Extended Data Fig. 9f) and changes to myeloid surface marker expression (Extended Data Fig. 9g,h).

To test whether pharmacological repositioning of PU.1 can drive myeloid differentiation in primary cells, we examined the effect of TF redistributor exposure on seven samples from patients with AML or MDS that were grown in methylcellulose for 8–13 days. Cell growth and colony formation were severely hampered by DB2115 in a dose-dependent manner in all seven primary AML or MDS samples (Fig. 7l,m). Flow cytometry showed that the remaining viable DB2115-treated cells exhibited increases in myeloid surface markers, including CD11b, CD14, CD15 and CD86 (Fig. 7n and Extended Data Fig. 10a–c). Overall, this provides proof of concept that TF redistributors can be wielded to redeploy PU.1 to drive a differentiation program and induce cellular cytokine sensitivity.

Discussion

In this study, we document evidence for TF redistribution caused by genome-wide, small-molecule blockade of cis-regulatory PU.1 binding sites (Fig. 8a). Consequentially, redistribution of PU.1 leads to robust and rapid changes in chromatin accessibility and rewires PU.1 transcriptional networks to drive increased expression of alternative PU.1 gene targets, which encourages myeloid differentiation of AML cells (Fig. 8b). Therefore, use of ‘TF redistributors’ such as DB2115, DB2313, DB2373 and DB2826 can provide unique insight into the behavior and dynamics of TFs while maintaining the stoichiometry of endogenous protein, and offers a molecular strategy to abrogate the aberrant transcriptional circuits in cancer and other diseases.

Fig. 8: The molecular mechanism of action and cellular consequences of pharmacological PU.1 redistribution.
figure 8

a, TF ‘redistributors’ (for example, DB2115, DB2313, DB2373 or DB2826) directly and rapidly displace PU.1 from canonical adenine-rich ETS binding sites and subsequently redistribute it to second-tier, sequence-unbiased ETS binding sites. b, Under steady-state conditions, canonical PU.1 binding and the ensuing PU.1-driven transcriptome is essential for leukemia cell survival; however, this is perturbed through the administration of TF redistributors. PU.1 is repositioned to alternate binding sites, redirecting its pioneer activity leading to subsequent chromatin opening and a rewiring of the PU.1-driven transcriptome, ultimately driving myeloid differentiation.

We identified that A-rich surrounding PU.1 binding motifs were most sensitive to inhibition by the described class of agents. One possible explanation for this could be that specific A-rich binding co-factors are aiding or stabilizing PU.1 binding at these sites. This could include the reported A-rich binding and PU.1 interacting TFs SATB1 (ref. 4), IRF4/8 (refs. 34,35) or C/EBP family factors36. It is possible that the anti-leukemic effects of the compounds are linked to the blockade of co-factor–PU.1 complexes forming on the DNA37,38. Thus, further investigation is required to uncover which co-factors are essential for stabilizing PU.1 binding to adenine-biased sites and whether the co-factors themselves are sensitive to TF redistributors, thus contributing to the observed anti-leukemic effects of the drug class9.

As a direct result of PU.1 displacement, PU.1 was redeployed to alternative binding sites throughout the genome. It is noteworthy that the motifs derived from the localization data were in line with in vitro PU.1 affinity assays39,40, with GC-rich, low-affinity ETS sites only becoming populated with PU.1 once the high-affinity, A-rich ETS sites were blocked. Additionally, our temporal investigations into PU.1 redistribution provided evidence for the pioneering ability of endogenous PU.1, which supports the ‘non-classical’ pioneering function of PU.1 and dependence on other remodeling factors to modify the chromatin landscape11,41,42.

Interestingly, some PU.1 binding loci were unchanged following pharmacological binding site blockade despite these loci containing a typical PU.1 binding motif. There are a number of possibilities to explain this finding, including that PU.1 co-factors or nearby chromatin binding proteins may stabilize the PU.1–DNA interaction and prevent pharmacological displacement (this could include any of the known PU.1 interactors; for example, HOX, IRF or C/EBP family factors)35,38,43,44 or that the drug is unequally distributed throughout the chromatin landscape. It has been reported that pharmacological compounds can be hindered in their ability to freely diffuse throughout the dense and fractal nuclear compartments45,46. Additional research will be required to interrogate the underlying forces driving sensitivity and insensitivity to PU.1 displacement.

TF redistribution is a biological phenomenon that can be driven by changes in the chromatin context and/or the protein millieu4,7,47. This study has highlighted an interesting form of TF redistribution: direct pharmacological TF displacement and redistribution, which not only has therapeutic potential but also allows for rapid and tunable investigations into the fast biology and temporal chromatin dynamics of endogenous TFs. Pharmacological redistribution of TF localization could be applied to investigate the global behaviors of many TFs, including those that have been documented to be displaced by DNA binding site inhibitors (for example EVI1, PIT-1 and HER2 (refs. 48,49,50)) or those unexplored disease regulatory factors which rely upon AT-rich minor-groove contacts (for example, FOXA1 in breast cancer51,52).

Although still in its infancy, the potential for TF redistributors to be used for therapeutic application is highly attractive and theoretically plausible. Pharmacological targeting of transcriptional dysregulation found in diseases such as MDS and AML has proven enigmatic, with most approaches limited to targeting protein–protein interactions in the chromatin rather than TF–DNA interactions24,53,54,55. Initial pre-clinical investigations in mouse models of AML have demonstrated that these tool TF redistributors display on-target efficacy for eliminating leukemia9, warranting further development of next-generation versions of these agents pursuing superior pharmacokinetics. Additionally, it will be important to determine how best to use the molecular PU.1 redistribution phenomenon therapeutically and investigate whether a combination with other drugs can synergize to better control the aberrant PU.1 transcriptional network. Thus, with further investigation, TF redistributors offer an exciting potential therapeutic class of small molecules that could have far-reaching effects for the treatment of leukemias and other aberrant TF-mediated diseases.

Methods

Study approval

Human samples were collected after informed consent was obtained from patients and upon approval of the Institutional Review Board of Albert Einstein College of Medicine (protocol 2005–536).

Small molecules

The synthesis of DB2115 (ref. 56) and DB2313 (ref. 9) has been previously reported; the synthesis of DB2373, DB2826 and DB2750 is provided in the Supplementary Information57,58. Following synthesis, all compounds were dissolved in 10% dimethylsulfoxide and water at a stock concentration of 5 mM and stored at −20 °C.

Cell culture and drug treatment

THP1, MOLM13, MV411, JURKAT and HL60 cells were originally purchased from ATCC. MOLM13, THP1, JURKAT and MV411 were cultured in RPMI 1640 medium supplemented with 10% heat-inactivated FBS and 1% penicillin–streptomycin. HL60 were cultured in IMDM medium supplemented with 20% heat-inactivated FBS and 1% penicillin–streptomycin. All cell lines were maintained in an incubator at 37 °C and 5% CO2.

Cells were treated with 5 µM of DB2115, DB2313, DB2373 or DB2750 at a starting concentration of one million cells per ml for the specified time.

CUT&Tag

CUT&Tag was performed as previously reported59 but with a few technical alterations. In brief, 5 × 105 cells per cell line were collected and, to prevent the osmotic displacement of PU.1 (or other factors) from chromatin during the high-salt downstream protocol60, lightly fixed with 2% formaldehyde for 2 min. The cells were bound to Concavalin A-coated beads (Bangs Laboratories) and incubated with the primary antibodies (anti-PU.1, Santa Cruz, sc-352; anti-RUNX1, Cell Signaling, 4334S; anti-ELF1, Proteintech, 22565-1-AP; anti-FLI1, Invitrogen, PA5-29597; anti-GAPBA, Invitrogen, PA5-27735; and anti-GATA3, Cell Signaling, 5852T) or IgG control (Santa Cruz, sc-3888; Extended Data Fig. 1e) at 4 °C overnight. Samples were then incubated with a secondary antibody (guinea pig anti-rabbit; Antibodies Online, ABIN101961) followed by a pre-loaded pA-Tn5 adaptor complex (generated in-house). Tagmentation buffer with magnesium was used to induce fragmentation. DNA was extracted by phenol–chloroform–isoamyl alcohol and amplified with NEBNext HiFi 2× PCR Master Mix and universal i5 and barcoded i7 primers61 for 13 cycles. AMPure XP beads (A63880) were used for post-PCR clean-up of libraries. Libraries were subjected to 35-bp paired-end sequencing using an Illumina NextSeq 500 platform on high output mode at the Einstein Epigenomics Core. Fastq files were generated using Picard tools (v.2.17.1), with adaptor trimming by TrimGalore (v.0.3.7) and QC assessment using FASTQC (v.0.11.4).

CLICK-on-CUT&Tag

To determine binding preferences for DB2115 within the genome, we examined an alkyne-linker-tagged DB2115 (called DB2750) for in vitro DNA binding ability. Preparation of drug-coated magnetic beads was performed as previously described62, whereby a 25 µM solution of DB2750 in H20 was incubated with 150 µM Biotin Azide (Click Chemistry tools), 100 µM E301 (Sigma-Aldrich) and 4 mM CuSO4 for 1 h at 4 °C before adding streptavidin-coupled Dynabeads (MyOne Streptavidin T1, Invitrogen) and incubating for 1 h. Coated beads were washed and incubated overnight with either a 25 mM equal mix of the two synthesized 68-bp dsDNA fragments (AT-rich and AT-poor; see Supplementary Table 1b) or DNA libraries from PU.1 CUT&Tag of vehicle-treated MOLM13 cells. Beads were washed three times with 25 mM Tris-HCl, 1% Triton-X, 0.5% SDS, 150 mM NaCl and 5 mM EDTA wash buffer before being eluted in 50 µl of 0.1 M NaHC03, 0.1% SDS solution. QiaQuick PCR Purification Kit (Qiagen) was used to purify fragments. Synthesized fragment purified samples were quantified by quantitative PCR with reverse transcription (RT–qPCR) (see Supplementary Table 1b for AT-rich and AT-poor primers). CUT&Tag pulldown samples were amplified with NEBNext HiFi 2× PCR Master Mix and universal i5 and barcoded i7 primers61 for five cycles before being sequenced with Illumina NextSeq 500 35-bp PE sequencing (Einstein Epigenomics Core). Bioinformatic processing was performed identically to CUT&Tag.

Classical PU.1 ChIP

ChIP was performed similarly to the previously described protocol9. In brief, MOLM13 cells were treated with 5 µM DB2115 for 12 h before cells were crosslinked in 0.5% formaldehyde for 7 min at room temperature (25 °C). Samples were sonicated for 12 min before being incubated with 5 µg of PU.1 antibody (Santa Cruz, sc-352) or rabbit IgG (Santa Cruz, sc-3888). Also included was Activ-motif Drosophila normalization control DNA and antibody (cat. nos. 53083 and 61686). Following pulldown with Dynabeads Protein A (Thermo Fisher Scientific), DNA fragments were purified using QIAquick PCR Purification Kit (Qiagen). Library preparation was carried out using the NEB Ultra II DNA Prep Kit following the manufacturer's recommendations. Next-generation sequencing was performed with Illumina NextSeq 500, 75-bp paired-end reads on high output at Einstein Epigenomics Core. Fastq files were generated using Picard tools (v.2.17.1), with adaptor trimming by TrimGalore (v.0.3.7) and QC assessment using FASTQC (v.0.11.4).

Cell titer blue proliferation assay

To assess the IC50 of the small molecules in MOLM13 cells, we used the resazurin cell viability assay (Cell Titer Blue, Promega) following the manufacturer's guidelines. In brief, cells were plated at a density of 1 × 103 per 100 µl and incubated with vehicle or compounds for 48 h at a range of concentrations (0.01–50 µM) before 20 µl of Cell Titer Blue reagent was added and fluorescence (560EX/590EM) was measured using a FLUOstar Omega instrument (BMG Labtech).

Omni-ATAC–sequencing

Omni-ATAC–seq was performed as previously described63. In brief, 50,000 MOLM13 cells treated with either 5 µM DB2115 or vehicle were subjected to nuclei isolation with ATAC resuspension buffer (containing 0.1% NP-40, 0.1% Tween20 and 0.01% digitonin) before transposition was performed using Tn5 transposase in TD buffer (Illumina Tagment DNA kit). Transposed DNA was purified using Minelute PCR purification (Qiagen) before DNA was amplified by PCR with custom PCR primers61 and cycle number was determined using a KAPA Library Quantification Kit (KAPA Biosystems). Libraries were sequenced on an Illumina NextSeq 500, with 75-bp paired-end reads on high output at Einstein Epigenomics Core. Fastq files were generated using Picard tools (v.2.17.1), with adaptor trimming by TrimGalore (v.0.3.7) and QC assessment using FASTQC (v.0.11.4).

GFP reporter assays

Reporter constructs were designed around the promoter–enhancer cassette as previously described8,9. In brief, a minimal promoter (TATA box) was placed upstream from an open reading frame encoding destabilized D2EGFP. The original enhancer element was a five-repeat tandem of the λB motif of the murine Igλ2-4 enhancer. For this study, the λB motifs were replaced by POMP, CSF1R-alt or NS sequences (Supplementary Table 1c). PU.1 binding site underlined between NdeI and SdaI/PstI cleavage sites upstream of the TATA box. For the myeloid CD11b promoter, the entire regulatory region, including the TATA box, was replaced by the −412 to +98 sequence of the 5′ flanking region of the human CD11b gene (GenBank accession no. M84477.1)64. The cassettes plus open reading frames were inserted between the NdeI/EcoRI sites of pLJM1–EGFP, a puromycin-selectable lentiviral vector (Addgene, plasmid no. 19319)65. The inserts replaced the CMV–EGFP construct in the vector and the resultant plasmids were notated with the suffix pLJM1. All insertions were sequenced-verified by Sanger sequencing in the forward direction with a hU6-F primer (5′-GAGGGCCTATTTCCCATGATT-3′) and/or in the reverse direction with a standard EGFP-NR primer.

Next, 10 μg of pLJM1 vectors were transfected to 293T cells spread in a 100 mm culture dish with helpers (10 μg psPAX2 and 4 μg VSV-G) using lipofectamine 2000 reagent (Invitrogen). Virus-containing supernatant was collected and filtered at 48 h from transfection and was transduced into MOLM13 cells by centrifuging at 1,000g, 37 °C, 1 h with Transdux reagent (System Biosciences). Following 3 days of puromycin selection, pLJM1 MOLM13 cell lines were treated for 24 h with 5 µM DB2115 (or other drugs) before analysis on an LSRII flow cytometer was performed (Becton–Dickinson). Forward scatter and side scatter gated, DAPI-negative cells were quantified for GFP fluorescence using FlowJo software.

CRISPRd: dCas9 blocking studies

Lentiviral vectors for constitutive TET3G expression driven by a CMV promoter (hygromycin selection marker), and TRE3G-inducible dCas9 (blasticidin selection marker) were designed and produced by VectorBuilder and transduced using Transdux (System Biosciences) into MOLM13 cells. Successful transduction was selected with 100 µg ml−1 hygromycin and 10 µg ml−1 blasticidin treatment for 1 week. Expression of dCas9 was confirmed following 48 h of 1 µg ml−1 doxycycline by western blot; however, partial or leaky dCas9 expression was evident in untreated cells. A constitutive sgRNA-expressing lentiviral vector against STRAP or a non-targeting control were synthesized by Cellecta and subsequently transduced into TET3G-dCas9-expressing MOLM13 cells. Selection was performed for 3 days post transduction with puromycin before cells were treated with 1 µg ml−1 doxycycline for 48 h and collected for CUT&Tag or RT–PCR analysis.

Western blotting

Whole-cell extracts were prepared using lysis buffer (150 mM NaCl, 50 mM Tris-HCl, 5 mM EDTA, 1% NP-40, 1% phosphatase inhibitor cocktail, 1× Protease Inhibitor Cocktail, 1 mM phenylmethyl sulfonyl fluoride (PMSF), 10% glycerol). SDS–PAGE was performed with equal protein per sample. PVDF membranes were used for western blot transfer, and immunoblotting using primary antibodies (1:5,000 for actin, Sigma-Aldrich, A2066; 1:1,000 for PU.1, Santa Cruz, sc-352; 1:500 Cas9, Cell Signaling, 14697S) followed by HRP-conjugated secondary antibodies (1:5,000). Imaging of western blots was performed using chemiluminescent ECL substrate on a LI-COR Odyssey Fc imager.

CLICK-chemistry immunofluorescence staining

MOLM13 cells were attached to lysine-coated coverslips following 6 h of 5 µM DB2750 or vehicle treatment and subjected to processing as directed by a Click-iT EdU imaging Kit (Invitrogen) but with a substitution of DB2750 for the Click-Edu reagent. In brief, cells were fixed (3.7% formaldehyde) and permeabilized (0.5% Triton-X) before being stained with a Click-iT reaction mix containing Alexa Fluor 488 Azide. Prolong gold containing DAPI (Thermo Fisher) was added to stain DNA before imaging on an Olympus BX83 microscope with an X-Cite 120 PC lamp (EXFO) and an ORCA-R2 digital charge-coupled device camera (Hamamatsu) with a ×100 objective.

RT–qPCR and mRNA sequencing

RNA was extracted from 5 µM DB2115-treated cells using an RNeasy Mini Kit (Qiagen) and RT–qPCR was performed using iScript cDNA Synthesis Kit (Bio-Rad). qPCR reactions were performed on a ViiA7 instrument using Power SYBR Green PCR Master Mix (Thermo Fisher). Expression levels were normalized to GAPDH, and the primers used are listed in Supplementary Table 1a. For mRNA sequencing, library preparation and transcriptome sequencing were performed by Novogene.

Primary AML samples

Adult AML or MDS bone marrow and peripheral blood samples were obtained after written informed consent and following Albert Einstein College of Medicine Institutional Review Board approval (2005–536). Characteristics of the primary MDS and AML samples used for experiments are listed in Supplementary Table 1f.

For CUT&Tag experiments, primary AML cells were subjected to CD34+ column enrichment (Miltenyi Biotech) before being viably frozen. Upon thawing, one to two million CD34+ enriched cells were thawed and incubated in 1 ml StemSpan SFEMII medium plus 1× CC100 growth cocktail (StemCell Tech) and TPO (50 ng ml−1, R&D Systems) in the presence of 5 µM DB2115 or vehicle for 12 h before viable cells were collected for PU.1 CUT&Tag.

For colony assays, 50–100,000 unfractionated mononuclear AML cells were plated per ml of Methocult H4435 enriched medium (StemCell Tech) along with 1 µM or 5 µM of DB2115 or vehicle. After 8–13 days, colonies were counted and cells were collected for flow cytometry.

Surface receptor induction colony assays

For MOLM13 colony assays, cells were first ‘invoked’ for receptor expression with a 1 µM dose of DB2115 or vehicle for 24 h in 10% FBS, RPMI medium. After a washout, 3,000 cells were immediately plated per ml of methylcellulose in HSC002SF (R&D Systems) containing added recombinant IL-4 (100 ng ml−1, R&D Systems) or CSF1 (100 ng ml−1, R&D Systems) and scored after 7 days.

Light microscopy images to detail colony morphology were obtained using an EVOS FL Auto microscope (Life Technologies) with an objective at ×4 or ×10 magnification.

Cytomorphology

Cell morphology was assessed after cytospin of 100,000 cells onto a glass slide (5 min at 500 rpm) and May-Grünwald Giemsa staining, according to standard protocols. Images were obtained using an EVOS FL Auto microscope (Life Technologies) with an objective at ×10 or ×50 magnification.

Flow cytometry

Antibody staining was performed for 30 min before flow cytometry analyses, using the following Biolegend antibodies; CD14 FITC (no. 325603), CD15 APCCy7 (no. 323048), CD86 PECy7 (no. 374210), CD34 PE (no. 343506), CD209 AF647 (no. 330112), CD11b PerCPCy5.5 (no. 101228), CSF1R APC (no. 347306), IL-4R BV421 (no. 355014) and DAPI for alive cell gating. Flow cytometry analysis was performed on a BD LSRII containing a yellow laser running FACSDiva 8 software. Flow cytometry data was analyzed using FlowJo (v.10.6.1). Log10 scales are used for plots showing flow cytometry fluorescence intensity data.

Phospho-flow cytometry

MOLM13 cells were ‘invoked’ for surface receptor tyrosine kinase (RTK) expression with a 1 µM DB2115 treatment for 24 h, followed by a washout and incubation with media-only for 24 h. Cells were then incubated for 1 h in serum-free IMDM at 37 °C before a 5 min stimulation with recombinant mouse huCSF-1 (10–100 ng ml−1, R&D Systems) or huIL-4 (3–100 ng ml−1, R&D Systems). After fixation with 1% PFA and permeabilization with ice-cold acetone, intracellular staining for phospho-S6 ribosomal protein (608604, Biolegend) and pSTAT6 (686012, Biolegend) before flow cytometry analysis.

PRO–seq

Initially, 30 million MOLM13 cells were treated with 5 µM DB2115 or vehicle and collected at the indicated time points for nuclei extraction. Cells were washed with ice-cold PBS and lysed with cell lysis buffer (10 mM Tris-Cl pH 7.4, 300 mM sucrose, 3 mM CaCl2, 2 mM MgCl2, 0.5% NP-40, 5 mM dithiothreitol, 1 mM PMSF, EDTA-free protease cocktail inhibitor tablet). Using Dounce homogenization, nuclei were pelleted by centrifugation and washed with nuclei storage buffer (50 mM Tris-Cl pH 8.3, 40% glycerol, 5 mM MgCl2, 5 mM dithiothreitol, 0.1 mM EDTA, 1 mM PMSF, EDTA-free protease cocktail inhibitor tablet). After counting, pelleted nuclei were resuspended in storage buffer and stored at −80 °C.

PRO–seq was performed in two biological replicates as previously described using approximately 20 million nuclei per run on with GTP, ATP, UTP and biotin-11-CTP (PerkinElmer) using 0.5% Sarkosyl (Fisher Scientific) to prevent transcription initiation66,67,68. RNA was reversed-transcribed and amplified to make the cDNA library for sequencing by the Vanderbilt University Medical Center (VANTAGE Genome Sciences Shared Resource on an Illumina NovaSeq 6000 (PE-150, 50 million reads). Following adaptor trimming with Cutadapt (v.1.18), the sequences were aligned and mapped using bowtie2 (v.2.5.1). Samtools (v.1.9) was used for the file format conversion before using the Nascent RNA Sequencing Analysis (NRSA)67 pipeline to determine the gene body changes.

smFISH

To design mRNA-specific probes for sequential smFISH, a full-length transcript of each gene was used as input for PaintSHOP (https://paintshop.io)69 to retrieve 10–29 primary targeting sequences (23–39 bp). Sequences were screened for off-target activity using NCBI Blast (https://blast.ncbi.nlm.nih.gov/Blast.cgi). Selected sequences were then concatenated on both 5′ and 3′ ends with flanking 20-mer sequences (RO1, ATACTGGAGCGACGCGTGAT; RO2, GTTTGAAGATTCGACCTGGA; or RO4, CTAAGGTACCTAATTGCCTAG), generating a final ‘primary probe’ (Supplementary Table 1d,e). SmFISH immunofluorescence staining procedure and analysis were performed as described previously70. In brief, MOLM13 cells treated with 5 µM DB2115 for 0 h, 1 h, 4 h and 12 h were fixed and permeabilized before primary and secondary hybridization reactions were performed. Samples were mounted in Prolong Diamond Antifade reagent plus DAPI (Invitrogen). Images were acquired using an oil immersion ×100 objective on a Leica Thunder fluorescence microscope. For data analysis, single-molecule mRNA and transcription site detection was performed using FISH-quant71 by 3D Gaussian fitting of thresholded spots implemented in MATLAB R2024a.

Data processing

CUT&Tag-generated fastq files were mapped to the human genome (hg38) using bowtie2 (v.2.2.3) with options –end-to-end –very-sensitive –no-mixed –no-discordant –phred33 −I 10 −X 700. Normalized bedgraph files were generated by using bedtools genomecov with a normalization factor of 1,000,000/oo. total human reads. Peak calling was performed using bdgpeakcall from MACS2 (v.2.1.0) with options −l 100 and −c 2. Bigwig files were generated using bedGraphToBigWig for visualization in IGV_2.4.15. Motif analysis and peak annotation was performed using the Homer package (v.4.11.1)36.

Differential peak analysis was performed in RStudio with either the Diffbind package (v.3.12.0) using DESEQ2 analyses72 if n > 2 or a custom script we termed ‘GoodpeaksScript’ (https://github.com/steidl-lab/rePU.1sitioning) if n = 1. In Diffbind, the FDR cutoff was set to 0.1 for determining significantly changed peaks, and technical variability between replicates is displayed in PCA and heatmap plots (Extended Data Fig. 1a,b). Additionally, Diffbind analysis was performed on a randomization of vehicle and DB2115 pairs, with no differential peaks identified (FDR < 0.1), providing confidence in the redistribution phenomenon (Extended Data Fig. 1c,d). For GoodpeakAnalysis, three stringent filters were used for the differential peak analysis of the average peak intensity: minimum intensity of >7.5, minimum fold change of >4 and minimum summit of >3.

For CLICK-on-CUT&Tag, the average MACS2 peak scores across three replicates were calculated and compared to input vehicle-treated PU.1 CUT&Tag peak scores to generate a log2fold change CLICK score. Peaks with CLICK scores above 0.5 were considered enriched for drug binding, whereas scores below 0.5 were considered non-drug-binding.

Classical ChIP-generated fastq files were mapped to the human or Drosophila melanogaster genome (hg38 or dm3) using bowtie2 (v.2.2.3). Duplicates were then removed with Picard, and normalization was performed by subsampling .bam files by a Drosophila reads ratio of vehicle over DB2115. The MACS2 package and callpeaks function was used to identify peaks in normalized files, and bigwig files were generated for Integrative Genomics Viewer (IGV) visualization. Differential peak analysis was performed in RStudio with ‘GoodpeakAnalysis’.

For RNA sequencing datasets, quality control was performed based on error distribution along the length or reads, GC distribution, N content, base quality and adaptor content. Reads were mapped to the hg38 transcriptome using STAR aligner73. Raw counts were subsequently normalized and analyzed for differential expression in R using the Bioconductor package DESeq2 (ref. 74). An enrichment score was generated using the negative logarithm of the adjusted P value multiplied by the sign of the fold change for each gene and input into Fast Gene Set Enrichment Analysis (FGSEA, Bioconductor75,76). Pre-ranked gene lists were queried against standard c1–8 and hallmark MSigDB gene lists (v.7.4, Broad Institute77). Additional gene list enrichments were conducted with Enrichr78,79,80.

For the k-means ATAC cell identity mapping, raw ATAC sequencing counts were obtained for healthy donor cell populations32. The raw counts were normalized using variance stabilizing transformation (DESeq2)74, and each locus was then binarized into open or closed chromatin regions using mean normalized count followed by k-means clustering of the binarized data (Extended Data Fig. 5). The loci in each cluster was overlapped with the PU.1 CUT&Tag data, and the significance of the overlap was calculated using hypergeometric distribution P value.

Statistics and reproducibility

Statistical tests were performed in RStudio (2023.12.1+402) or GraphPad Prism (v.9.5). No statistical method was used to predetermine sample size. No data were excluded from the analyses. The investigators were not blinded to allocation during experiments and outcome assessment. The experiments were not randomized.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.