Mechanism for DPY30 and ASH2L intrinsically disordered regions to modulate the MLL/SET1 activity on chromatin

Lee, Young-Tae; Ayoub, Alex; Park, Sang-Ho; Sha, Liang; Xu, Jing; Mao, Fengbiao; Zheng, Wei; Zhang, Yang; Cho, Uhn-Soo; Dou, Yali

doi:10.1038/s41467-021-23268-9

Download PDF

Article
Open access
Published: 19 May 2021

Mechanism for DPY30 and ASH2L intrinsically disordered regions to modulate the MLL/SET1 activity on chromatin

Young-Tae Lee ORCID: orcid.org/0000-0002-3522-9361¹^nAff5,
Alex Ayoub¹,
Sang-Ho Park²,
Liang Sha³,
Jing Xu¹,
Fengbiao Mao¹,
Wei Zheng ORCID: orcid.org/0000-0002-2984-9003⁴,
Yang Zhang^2,4,
Uhn-Soo Cho² &
…
Yali Dou ORCID: orcid.org/0000-0002-4040-1429^1,3

Nature Communications volume 12, Article number: 2953 (2021) Cite this article

4667 Accesses
17 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Recent cryo-EM structures show the highly dynamic nature of the MLL1-NCP (nucleosome core particle) interaction. Functional implication and regulation of such dynamics remain unclear. Here we show that DPY30 and the intrinsically disordered regions (IDRs) of ASH2L work together in restricting the rotational dynamics of the MLL1 complex on the NCP. We show that DPY30 binding to ASH2L leads to stabilization and integration of ASH2L IDRs into the MLL1 complex and establishes new ASH2L-NCP contacts. The significance of ASH2L-DPY30 interactions is demonstrated by requirement of both ASH2L IDRs and DPY30 for dramatic increase of processivity and activity of the MLL1 complex. This DPY30 and ASH2L-IDR dependent regulation is NCP-specific and applies to all members of the MLL/SET1 family of enzymes. We further show that DPY30 is causal for de novo establishment of H3K4me3 in ESCs. Our study provides a paradigm of how H3K4me3 is regulated on chromatin and how H3K4me3 heterogeneity can be modulated by ASH2L IDR interacting proteins.

Generation of Remosomes by the SWI/SNF Chromatin Remodeler Family

Article Open access 02 October 2019

A basic motif anchoring ISWI to nucleosome acidic patch regulates nucleosome spacing

Article 09 December 2019

Structural insights into assembly and function of the RSC chromatin remodeling complex

Article 07 December 2020

Introduction

Cells are complex, information-processing centers that handle an immense flow of signals often leading to fine tuning the expression of genes. To achieve exquisite regulation, chromatin post-translational modifications (PTMs) have evolved to demarcate, among a mosaic of functions, actively transcribed genes from the inactive ones¹. The mixed-lineage leukemia (MLL) family of histone methyltransferases (HMTs) catalyzes the deposition of histone H3 lysine 4 methylation (H3K4me) associated with active transcription^2,3. H3K4 methylation is highly enriched at gene promoters and distal regulatory enhancers, and plays a pivotal role in the recruitment of basal transcription machinery^4,5,6 and chromatin remodeling complexes^7,8,9. It also promotes long-range chromatin interactions and higher-order chromatin organization^10,11,12. The dynamic interplay between H3K4me and co-transcriptional processes has also been reported^13,14. Human genetic studies have corroborated the functional importance of the MLL family enzymes: heterozygous mutations in MLLs are reported in congenital human Kabuki^{15,16,17,18,19,20}, Wiedemann-Steiner, and Kleefstra spectrum syndromes^21,22,23. Furthermore, MLL family proteins are among the most frequently mutated genes in human malignancies²⁴.

The MLL/SET1 family enzymes interact with several evolutionarily conserved proteins, WDR5, ASH2L, RbBP5, and DPY30, through the C-terminal catalytic SET domain^25,26,27. We and others have previously shown that these core components are essential for MLL1 catalytic activity on histone H3^27,28,29. In particular, WDR5 is required to stabilize the trimeric RbBP5-ASH2L-MLL1 complex^30,31, a role exploited for the development of MLL1-specific inhibitors^32,33. In-depth biochemical studies also show that these core components have multiple relatively weak interactions amongst themselves^34,35,36. Recently, a co-crystal structure of the minimal trimeric complex (ASH2L^{276–500,Δ400–440}-RbBP5^330–375-MLL1/3^SET)³⁰ and cryo-EM structures of the MLL1-NCP (nucleosome core particle) complex^37,38 have revealed the overall architecture of the MLL1 core complex as well as its engagement with a physiological substrate (i.e., NCP). These studies, together with solution structures of MLL1³⁵, show a surprisingly dynamic nature of the MLL1 core complex, especially the MLL1^SET domain and the RbBP5-NCP interface. Despite these studies, regulation of structural dynamics of the MLL1 complex on the NCP and its functional implication remain largely unknown.

Compared to the well-studied WDR5, RbBP5, and ASH2L^SPRY proteins, the function of DPY30 and the extended intrinsically disordered regions (IDRs) of ASH2L in the MLL1 complex remains a mystery. The biochemically defined minimal core complex showed negligible DPY30 contribution to the activity of the MLL/SET1 family of enzymes using recombinant histone H3 or peptidic H3 as substrates^36,39,40. On the other hand, DPY30 is capable of regulating global H3K4 methylation in cells⁴¹ and DPY30 knockdown or knockout leads to global reduction of H3K4me3 in embryonic stem cells (ESCs) and hematopoietic stem cells^42,43. It is proposed as a potential therapeutic target for MLL1-rearranged leukemia⁴⁴. The conflicting reports of the minimal in vitro DPY30 activity versus its importance in regulating H3K4me3 in cells remain unresolved.

Here we show that DPY30 greatly stimulates MLL1 activity on the NCP. By combined NMR, SAXS, cryo-EM and biochemical approaches, we find that DPY30 functions through the extended IDRs of ASH2L to restrict the rotational dynamics of the MLL1 complex on the NCP, thereby promoting H3K4 methylation at higher methylation states. The NCP-specific regulation by DPY30 and ASH2L IDRs generally applies to all MLL/SET1 family enzymes. Cellular studies further confirm the importance of DPY30 in de novo establishment of H3K4me3 on chromatin. Taken together, we have established a paradigm of how the disordered regions in the chromatin-modifying complexes may exert loci-specific histone methylation and confer heterogeneity in the cellular epigenetic landscape.

Results

Activity of the MLL/SET1 family enzymes on the NCP requires DPY30

To examine the regulation of the MLL1 methyltransferase activity on the NCP in vitro, we performed the HMT assays using either recombinant histone H3 or NCP as substrates. The overall activity of the MLL1 core complex was much higher on the NCP as compared to recombinant histone H3 (Fig. 1a and Supplementary Fig. 1a). DPY30 was essential for the drastic increase of H3K4 methylation on the NCP (Fig. 1a), especially for higher H3K4 methylation states (i.e., H3K4me2 and H3K4me3) at the expense of H3K4me1 (Fig. 1a). In contrast, DPY30 had no effect on MLL1 activity or processivity when recombinant H3 was used as the substrate (Fig. 1a and Supplementary Fig. 1b), consistent with the previous studies^36,39,40. To test whether DPY30-dependent regulation on the NCP is a general mechanism for all MLL/SET1 family enzymes, we examined H3K4 methylation by MLL2-4 and SET1A/1B in the presence or absence of DPY30. As shown in Fig. 1b and Supplementary Fig. 1b, DPY30 was able to significantly enhance methylation activity of all MLL/SET1 complexes in an NCP-specific manner. Domain mapping confirmed that the dimerization domain (DD, 45–90) of DPY30, which forms a hydrophobic groove that directly interacts with the ASH2L Sdc-DPY30-Interacting domain (SDI, 504–534)⁴⁵, was sufficient to stimulate MLL1 activity on the NCP (Fig. 1c).

**Fig. 1: DPY30 specifically stimulates MLL1 activity on the NCP.**

DPY30-dependent stimulation requires IDR in ASH2L

The recent cryo-EM studies of the MLL1/3-NCP complexes show that DPY30 does not make direct contact with the NCP^37,38. Consistently, when we tested the binding of the MLL1 complex to the NCP with or without DPY30, DPY30 did not alter MLL1-NCP interaction in a gel mobility shift assay (Supplementary Fig. 1c). We next tested whether DPY30-mediated stimulation is redundant with that of H2BK120 ubiquitylation (H2BK120ub), which enhances H3K4 methylation without altering the binding affinity of the ySET1 complex to the NCP⁴⁶. As shown in Supplementary Fig. 1d, DPY30 was able to further enhance activities of SET1A and MLL1 on the H2BK120ub-containing NCP, suggesting that it functions through a distinct mechanism from that of H2BK120ub.

As ASH2L is only the direct binding partner of DPY30 in the MLL1 core complex, we examined the role of ASH2L in DPY30-dependent regulation. ASH2L contains the structurally defined N-terminal PHD/WH domain (aa 1–178)^47,48 and C-terminal split SPRY domain⁴⁹ as well as three IDRs (Fig. 2a), including Linker (aa 178–275), Loop (aa 400–440), and SDI (aa 504–534). The SDI of ASH2L directly interacts with DPY30^50,51, while both Linker and Loop are IDRs have not been previously characterized. In fact, Loop IDR was removed in the previous structural studies, as it does not contribute to the structural integrity of ASH2L SPRY^30,52. We made selective serial deletions for each of these domains or regions in ASH2L (Schematic in Fig. 2a) to test their respective contribution to DPY30-dependent stimulation in the in vitro HMT assays. As shown in Fig. 2b, while SDI deletion increased activity of the MLL1 complex, likely by reducing ASH2L aggregation through SDI dimerization⁵², it completely eliminated DPY30-dependent stimulation on the NCP (Fig. 2b, lane 2 versus lane 4). Deletion of PHD-WH-Linker or Loop, but not PHD-WH alone, also abolished DPY30-dependent regulation (Fig. 2c, d). Interestingly, both PHD-WH-Linker and Linker fragments were able to stimulate MLL1 activity in a DPY30-dependent manner in trans, albeit at a lower level compared to cis-regulation (Fig. 2e, f). This property was not shared by Loop IDR in the HMT assay (Fig. 2f). Furthermore, detailed mapping of ASH2L Linker IDRs (Fig. 3a) identified three highly conserved regions, 247–251, 252–263 and 275–285 (Supplementary Fig. 2a), that were critical for DPY30-dependent regulation (Fig. 3b–d). These results highlight a previously uncharacterized function of ASH2L IDRs in regulating MLL1 activity on the NCP.

**Fig. 2: DPY30 requires ASH2L IDRs to stimulate MLL1 activity on chromatin.**

**Fig. 3: Identification of essential ASH2L IDRs in DPY30-mediated stimulation.**

We next examined whether IDRs in other complex subunits are important for DPY30-mediated HMT stimulation. The potential IDRs in the MLL1 core complex include the RbBP5 C-terminus (aa. 382–538) and a segment of the SET domain between the WIN motif and the catalytic domain (aa. 3767–3812)³⁰. Sequential C-terminal RbBP5 truncations were tested and none of them abolished DPY30-mediated HMT stimulation (Supplementary Fig. 2b). Notably, larger deletion of RbBP5 C-terminus lowered the overall HMT activities (Supplementary Fig. 2b), consistent with previous studies for yeast homolog Swd1 in the SET1 complex^53,54. To test the MLL1 SET IDR, MLL^{SETIL (3813–3969)} was used so that the MLL1 complex remains active in the absence of the WIN motif or WDR5³⁰. Removal of the SET IDR did not affect DPY30-dependent stimulation (Supplementary Fig. 2b). Circumvention of WDR5 in the MLL1^{SETIL (3813–3969)}-containing core complex also indicates that WDR5 is dispensable for DPY30-mediated stimulation. These results suggest that ASH2L IDRs are necessary and sufficient for DPY30-dependent HMT stimulation on the NCP.

DPY30 induces widespread NMR spectra changes in ASH2L IDRs

To evaluate the effects of DPY30 binding on global ASH2L structure and to explore the mechanism by which DPY30 and ASH2L IDRs regulate MLL1 activity, we performed methyl-TROSY NMR on ¹³CH₃-labeled Ile-Leu-Val (ILV) ASH2L^202–534, in the presence of stoichiometric amount of unlabeled RbBP5 peptide (330–363), the minimal region for ASH2L binding (see Methods for details). We identified ~65% of the 100 anticipated peaks in ¹³CH₃-labeled ILV ASH2L^202–534 (Supplementary Fig. 3a, red). The majority of these peaks were also observed in the ¹³CH₃-labeled ILV ASH2L^276–534 (i.e., without Linker) sample (Supplementary Fig. 3a, black). Surprisingly, the addition of DPY30 triggered striking and widespread changes in the NMR spectrum, with the appearance of many new peaks with significantly dispersed chemical shifts (Fig. 4a and Supplementary Fig. 3b and 3c, red peaks). Chemical shift changes of some apo-state peaks were also observed (Fig. 4a). To further characterize these newly appeared peaks, we carried out residue-specific methyl-assignments by mutagenesis on the ASH2L^202–534-DPY30 complex (Supplementary Fig. 4a–d)⁵⁵. About 60% of total methyl peaks were unambiguously assigned (see Supplementary Table 1), owing to their dispersed chemical shifts. Interestingly, the majority of the DPY30-induced new peaks corresponded to residues in the ASH2L Linker and Loop IDRs (Fig. 4a, blue and orange, respectively). A number of peaks corresponding to residues in the SPRY domain (Fig. 4a, green) were also perturbed (e.g., I274, V287, I300, V322, I488) or newly appeared (e.g., L291, L350). Importantly, deletion of either Linker (blue) or Loop (orange) IDRs in ASH2L (modeled in Supplementary Figs. 5 and 6) abolished DPY30-induced changes in NMR spectra (Supplementary Fig. 6a–c, right). The NMR results suggest that DPY30 mainly affects ASH2L IDRs and the DPY30-dependent NMR changes require all ASH2L IDRs.

**Fig. 4: ASH2L IDRs undergo significant conformational changes upon DPY30 binding.**

Small-angle X-ray scattering (SAXS) of ASH2L and ASH2L/DPY30

The DPY30-dependent changes of ASH2L IDRs in NMR spectra can be due to alterations of inter-or intra-molecular interactions or stabilization of a particular conformation. To gain more insights into these possibilities, we performed SAXS experiment for ASH2L, DPY30, and the ASH2L/DPY30 complex. The molecular weight for ASH2L was estimated to be 65 KDa by the SAXS experiment. Since the combined mass of ASH2L (60.12 KDa) and RbBP5^330–363 (4.07 KDa), which was included in all ASH2L SAXS samples (see “Methods”), is ~64 kDa, ASH2L is likely monomeric in solution. This excludes the possibility that DPY30 functions through resolving intermolecular interactions of ASH2L IDRs. Furthermore, SAXS data show that pair distance distribution function of ASH2L had a peak around 30 Å and decreased smoothly (Supplementary Fig. 7a), suggesting that the structural domains in ASH2L were probably not locked in a rigid configuration. As shown in Supplementary Fig. 7a, ASH2L/DPY30 had a similar D_max (~140 Å) as compared to ASH2L despite a 30% increase in size (Supplementary Fig. 7a). It suggests that ASH2L in the DPY30/ASH2L complex is probably in a more compact conformation. Interestingly, analysis using ensemble-optimized method (EOM)⁵⁶ identified two distinguishable ASH2L populations in both the D_max and R_g plots (Supplementary Fig. 7b), suggesting that ASH2L is likely in a structural equilibrium between two largely different conformations, with one more extended than the other (Supplementary Fig. 7b). We were not able to perform EOM analysis for ASH2L/DPY30 due to method limitation⁵⁶. Taken together, we speculate that DPY30 binding may shift the structural equilibrium of ASH2L and stabilize ASH2L IDRs in a more compact conformation. This is consistent with the DPY30-dependent appearance of ASH2L NMR peaks with well-dispersed chemical shifts (Fig. 4a).

Molecular modeling of the DPY30-ASH2L complex

While it is challenging to determine the exact conformation(s) of the dynamic ASH2L IDRs in the apo-state, we were able to build a structural model to visualize ASH2L IDRs in the DPY30-bound state. The molecular model of the human ASH2L-DPY30 is based on the co-crystal structure of the ySET1 complex subunits Bre2-Sdc1 (PDB code: 6CHG)⁵³ as well as crystal structures of the human ASH2L SPRY domain (without Loop IDR, PDB code: 3TOJ)⁴⁷ (Fig. 4b, see “Methods”). When we mapped the residues that showed DPY30-dependent chemical shift in the NMR spectra onto this structural model, the close spatial proximity of these residues was apparent (Fig. 4b). They clustered together in the IDRs (Supplementary Fig. 5a) and SPRY regions (Supplementary Fig. 5b). In this model, ASH2L IDRs, the SPRY domain, and SDI adopt a compact triangular structural arrangement upon interacting with DPY30 (Fig. 4b). ASH2L IDRs form an ordered three-strand β-sheet, comprised of highly conserved residues 247–252 from Linker IDR and residues 416–428 from Loop IDR (Supplementary Fig. 8a, red box). In addition to the β-sheet structure, residues 252–263 and 275–286 of the Linker IDR also adopt a β-sheet-like conformation next to SDI (Supplementary Fig. 8a, blue box), enclosing a binding interface for the α-helical SDI (orange) and DPY30 (Supplementary Fig. 8b). Although this is only a computational model, many highlighted structural elements are essential for DPY30-dependent stimulation in the in vitro HMT assays. Removal of residues 247–253 or 400–440 completely abolished DPY30-dependent MLL1 regulation in vitro (Fig. 3b, d). Similarly, deletion of residues 252–263 or 275–285 in ASH2L also reduced DPY30-dependent activity (Fig. 3c, d) as well as the DPY30-dependent changes in NMR spectrum (Supplementary Fig. 6a–c).

DPY30/ASH2L IDRs restrict the rotational dynamics of the MLL1 complex on the NCP

Recently, we and others have solved the cryo-EM structure of the MLL1-NCP complex^37,38. It reveals the overall architecture of the five component MLL1 core complex with the NCP. In the MLL1-NCP structure, ASH2L binds to the NCP at DNA superhelical loop (SHL) 7 (Fig. 5a), which together with RbBP5 at SHL 1.5, allows MLL1^SET to bind above the nucleosome dyad³⁷. To understand the molecular mechanism by which DPY30 regulates MLL1 activity on the NCP, we determined the single-particle cryo-EM structure of the human recombinant MLL1^RWSA complex (4-MLL1), containing four of the five core proteins, i.e. RbBP5 (aa 1–538); WDR5 (aa 22–334); MLL1^SET (aa 3762–3969); and ASH2L^ΔSDI (aa 1–504), bound to the NCP (4-MLL1-NCP). Overall, a total of 1288 K particles were picked from 6242 micrographs collected from 300 keV Titan Krios equipped with the K2 summit direct director (Supplementary Fig. 9). After several rounds of heterogeneous refinement using cryoSPARC⁵⁷, we isolated four different subclasses of 4-MLL1-NCP (Class01, 02, 03, and 05). The best behaving particles were further selected from each subset of the 4-MLL1-NCP images after focused refinement and subsequent 3D classification in RELION (Supplementary Fig. 9)⁵⁸. In the end, we obtained three different subclasses of 4-MLL1-NCP structures (Class 01, 02, and 05, Fig. 5b–d and Supplementary Fig. 9). The overall resolution of these structures ranged from 4.6 Å to 6.9 Å (Supplementary Fig. 10), which were sufficient to dock coordinates of the MLL1 core components and the NCP from our previous MLL1^RWSAD-NCP structure (PDB ID: 6PWV [https://doi.org/10.2210/pdb6pwv/pdb])³⁷. In comparison to the MLL1^RWSAD-NCP complex (or 5-MLL1-NCP, Fig. 5a)³⁷, the 4-MLL1-NCP complexes displayed much higher dynamics at the ASH2L-NCP interface (Fig. 5b, c). While the majority of the 5-MLL1-NCP complexes anchored on the NCP with RbBP5 and ASH2L at DNA SHL 1.5 and 7, respectively, the 4-MLL1-NCP complex adopted multiple modes of interaction. With RbBP5 anchoring near SHL 1.5, ASH2L binding sites varied from SHL 7 to SHL 4.5 among different subclasses (Fig. 5b, d). Furthermore, local ASH2L binding dynamics on the NCP also increased significantly in the absence of DPY30, as demonstrated by extremely low or complete loss of ASH2L IDR density in a significant subset of the structures (Fig. 5c, d).

**Fig. 5: Cryo-EM structure of the 4-MLL1-NCP complexes.**

The molecular modeling using the iterative template-based fragment assembly refinement (I-TASSER) method^59,60 showed that ASH2L IDRs make multiple contacts with nucleosomal DNA (Supplementary Fig. 11a). In addition to the conserved basic residues (₂₀₅-KRK-₂₀₇) that contributes to overall MLL1 activity on the NCP³⁷, DPY30-induced ASH2L changes appear to enable ASH2L residues 419–421, which reside on a short loop between the newly formed three-stranded β-sheet, to provide another contact with DNA (Supplementary Fig. 11a). Consistent with the modeling, K419A/K421A mutation or deletion of 419–421 significantly reduced or abolished DPY30-dependent regulation of MLL1 activity, respectively (Supplementary Fig. 11b). These results are consistent with a model that DPY30 functions through ASH2L IDRs to restrict rotational dynamics of the MLL1 complex on the NCP and promote productive H3K4 methylation (see “Discussion”).

DPY30 is essential for establishing de novo H3K4me3 in E14 ESCs

To investigate the function of DPY30 in establishing H3K4me3 in cells, we first examined the correlation of DPY30 binding and H3K4me3 at MLL1 binding sites in E14 ESCs^41,60. We identified 4009 MLL1 peaks in ESCs⁶¹ and among them, 1070 (26.69%) MLL1 peaks overlapped with those of DPY30 (Fig. 6a)⁴¹. Selected loci were shown in Supplementary Fig. 12a. Strikingly, H3K4me3 was highly correlated with DPY30 binding at the MLL1 targets (Fig. 6a). A similar close correlation of DPY30 and H3K4me3 was also found at the 2431 ASH2L binding sites, 67% of which colocalized with DPY30 at gene regulatory regions in the E14 ESCs (Fig. 6b and Supplementary Figs. 12b and 13). These results showed that MLL1/ASH2L alone was ineffective for depositing H3K4me3 on chromatin. Instead, DPY30 was required for promoting high levels of H3K4me3 on chromatin. Next, we tested whether DPY30 plays a causal role in establishing de novo H3K4me3 on chromatin. To this end, we expressed catalytically inactive HA-dCas9 or HA-dCas9-DPY30 in E14 cells and targeted the fusion proteins to randomly selected genomic regions by gRNAs (Fig. 6b, left). The loci were selected from MLL1/ASH2L joint targets that had no prior DPY30 binding (Fig. 6b, right top). Upon HA-dCas9-DPY30 recruitment, there was a significant increase of H3K4me3 at these loci (Fig. 6b, bottom right). In contrast, no increase of H3K4me3 was observed for the no gRNA controls (Fig. 6b) or in cells expressing HA-dCas9 (Supplementary Fig. 12c). These results confirmed that DPY30 is required for de novo establishment of H3K4me3 in cells.

**Fig. 6: DPY30 regulates de novo establishment of H3K4me3 on chromatin.**

Discussion

Using the biochemical, structural, and cellular approaches, we have revealed the mechanism by which DPY30 regulates MLL//SET1 activity on chromatin. We show that DPY30 functions through ASH2L IDRs and DPY30-induced changes stabilize ASH2L-NCP interactions and restrict the rotational dynamics of the MLL1 complex on the NCP. Consequently, it promotes productive H3K4 methylation, especially at higher methylation states (i.e., H3K4me3 and H3K4me2). Our study has established a paradigm by which IDRs, the often-ignored segments in chromatin-interacting proteins, contribute to the heterogeneity of the epigenetic landscape in eukaryotic cells.

Previous studies have shown that DPY30 has negligible effects on H3 methylation in vitro^36,39,40, yet its deletion leads to global downregulation of H3K4me3 in cells⁴¹. Our study shows that DPY30 confers NCP-specific regulation of MLL1 activity by regulating ASH2L-NCP interactions. Combining complementary biophysical, structural, and biochemical experiments with computational modeling, we show that upon DPY30 binding, ASH2L IDRs converge to adopt a compact structural unit at the MLL1-NCP interface, enabling new contacts with the NCP. In support, deletion or mutating ASH2L IDRs greatly impaired DPY30-dependent methyltransferase activity in vitro (Fig. 3 and Supplementary Fig. 11b). The cryo-EM structure of the 4-MLL1-NCP complex shows significant rotational dynamics on the NCP as compared to the 5-MLL1-NCP, 5-MLL3-NCP (Fig. 5e) or ySET1-NCP complexes (Fig. 5f)^37,38,46,62. The 4-MLL1 complex is able to swing across the nucleosome disc with ASH2L binding near SHL4 in a subset of the cryo-EM structures (Fig. 5). Furthermore, ASH2L also exhibits higher local binding dynamics in the absence of DPY30. We envision that increased rotational dynamics of the 4-MLL1 complex or local ASH2L dynamics reduces the probability of the MLL1 SET domain positioning near nucleosome dyad. In this scenario, the MLL1^SET domain has to go through multiple spatial arrangements to optimally engage both H3 substrates in the NCP, which negatively affect MLL1 processivity³⁷. By limiting rotational dynamics of the MLL1 complex on the NCP, DPY30 as well as ASH2L IDRs promote productive enzyme-substrate engagement, which has a specific impact on higher methylation states.

Notably, DPY30/ASH2L IDRs regulate all MLL/SET1 family enzymes, regardless of their respective intrinsic activity and processivity (Fig. 1b). We find that despite its selective impact on global H3K4me3 in cells⁴¹, DPY30 is able to stimulate H3K4me1 by the MLL3 complex in vitro. The global reduction of H3K4me3, but not H3K4me1 or H3K4me2, after DPY30 deletion/depletion in cells is probably due to compounding effects of relative abundance and activity of different MLL family enzymes as well as the offset of H3K4me1 inhibition by blocking its conversion to higher methylation states. We also would like to point out that DPY30 is able to enhance human SET1 activity on the H2BK120ub-containing NCP (Supplementary Fig. 1d). Thus, it can probably cooperate with H2BK120ub in H3K4me3 regulation in vivo, which awaits future studies.

It is well established that intrinsically disordered proteins (IDPs), or proteins containing extensive IDRs, have unique biophysical properties^63,64. The undefined structures in the solution enable IDRs to adopt many possible conformations and meaningfully engage in versatile protein-protein interactions^65,66,67. As a result, IDRs or IDPs are often found at hubs of protein interaction networks and enable functional diversification and environmental responsiveness during the complex developmental processes^66,68. Recent studies also show that IDRs are able to facilitate phase transition and heterochromatin functions in cells⁶⁹. Our study here provides a paradigm for how IDRs in histone-modifying enzymes may regulate chromatin functions. We show that ASH2L IDRs and their interacting protein DPY30 can exert locus- and context-specific regulation of H3K4me3 in cells. While the exact conformation(s) of apo-state ASH2L IDRs remain to be determined, our study suggests that ASH2L IDRs are probably in a highly dynamic conformational equilibrium and DPY30 binding leads to stabilization of ASH2L IDRs in one of the more structurally organized conformations. Our study also raises the question of whether ASH2L IDRs can be modulated by other proteins beyond DPY30. We envision that proteins that are able to induce perturbations in ASH2L IDRs and/or stabilize ASH2L IDRs could potentially modulate MLL/SET1-NCP interactions, thereby regulating H3K4 methylation activity on chromatin. Aberrant expression of ASH2L has been reported in a wide spectrum of human tumors, and contributes to disease progression and prognosis^24,70,71,72. Notably, ASH2L cooperates with activating mutations of Ras in cellular transformation⁷³, recruits the oncogene MYC to target genes in conjunction with WDR5^74,75, and regulates p53 targeting gene expression⁷⁶. Future studies on ASH2L IDR and IDR interacting proteins will provide insights into the regulation of H3K4me3 heterogeneity in cells, and potentially shed light on human pathogenesis.

Finally, histone-modifying enzyme complexes usually contain multiple IDRs in both catalytic and non-catalytic subunits. Our survey indicates that IDR content can go up to 70–90% for some histone-modifying enzymes (Supplementary Table 3). Furthermore, 60% of lysine HMTs (HKMTs) contain IDRs of 80 residues or more, whereas only 20% of other annotated proteins have IDRs of similar length⁷⁷. It suggests that IDRs in the histone-modifying enzymes may have especially important regulatory roles, which may constitute a layer of complexity in epigenetic regulations. Inclusion of the IDRs in enzymes or enzyme complexes may be necessary to discover their regulation to the fullest extent.

Methods

Mouse and human ES cell lines

E14tg2a (E14) (ATCC, #30-2002) cell line was used for all cellular experiments. To generate the E14 cell line stably expressing HA-ASH2L, the plasmid expressing ASH2L from the pPiggybac-HA vector as well as plasmids carrying PBase transposase and rTTA element were co-transfected into E14 cells by electroporation. Geneticin was added one day after transfection and selection was carried out for 10 days. Single colonies were picked and screened for stable expression of HA-ASH2L in the presence of Doxycycline.

General protein expression and purification

All MLL1 complex subunits and their mutants were expressed using the pET-28a expression vector with N-terminal 6-histidine and SUMO tag²⁹. To make ASH2L mutants for methyl assignments, codon-optimized ASH2L^202–534 DNA (Integrated DNA Technologies) was used as a template for mutagenesis. Each Ile was changed to Leu, and each Leu or Val was changed to Ile. NEBaseChanger web tool (New England Biolabs) was used to design primers for single residue substitution. Mutant plasmids were constructed using Q5 Site-Directed Mutagenesis Kit (NEB, Cat#E0554S). All proteins were expressed in BL21(DE3) E. coli strain in LB media. Cells were grown initially at 37 °C till OD₆₀₀ reached 0.6–0.8 and shifted to 20 °C after IPTG was added at a final concentration of 0.2–0.4 mM. Cells were lysed by sonication and lysates were collected after centrifugation at 32,000 × g for 30 min at 4 °C. The supernatant was filtered through 0.45 μm syringe filter and purified through a Ni-NTA metal-affinity column (Qiagen and Goldbio). After extensive washing with 20 mM Tris (pH 8.0), 300–500 mM NaCl, 2 mM β-mercaptoethanol, and 10 mM imidazole (washing buffer), protein was eluted stepwise at 30, 60, 90, 120, 150, 210, and 300 mM imidazole. SUMO protease was added to the pooled fractions during dialysis at 4 °C overnight. Ni-NTA purification was repeated to remove 6-histidine tag and other bacterial impurities. Proteins were further purified on a HiLoad 16/60 Superdex 75 pg or 200 pg columns (GE Healthcare Life Sciences). All MLL complex subunits and their mutants are nicely expressed and well-behaved in solution with no noticeable differences in protein stability.

GST-fusion MLL and SET1 proteins

GST-tagged MLL (MLL1³⁷⁴⁵, MLL2²⁴⁹⁰, MLL3⁴⁶⁸⁹, MLL4⁵³¹⁹) and SET1 (SET1A¹⁴⁷⁴ and SET1B¹⁶⁸⁴) proteins were expressed using a pGEX-parallel 1 expression vector with N-terminal GST tag and TEV cleavage sequence³¹. Plasmids were transformed and expressed in BL21(DE3) E. coli in LB media. Cells were grown until OD₆₀₀ reached 0.6–0.8 when the temperature was reduced to 20 °C and, after temperature equilibration, protein expression was induced using 0.4 mM IPTG and grown for 16 h. Cells were harvested and lysed using sonication and the supernatant was collected by centrifugation at 32,000 × g, filtered through a 0.45 µm syringe, and loaded onto a pre-equilibrated Glutathione Sepharose 4B column (GE Healthcare Life Sciences). After several washes with 20 mM Tris HCl (pH 7.5), 300 mM NaCl, 2 mM DTT, 10% v/v glycerol (GST wash buffer), the protein was eluted off of the column using GST wash buffer with 10 mM reduced glutathione. Proteins were further purified over a HiLoad 16/60 Superdex 200 pg column (GE Healthcare Life Sciences). The purified SET domains remain soluble and stable for the in vitro assays.

In vitro HMT assay

Mixture of stoichiometric amounts of MLL1 core proteins was used for the in vitro HMT assay. Recombinant mono-nucleosome was prepared by salt dialysis of equal molar histone octamer⁷⁸ and 146 bp 601 DNA. The reaction was carried out in 20 μL of the HMT buffer of 20 mM Tris (pH 8.0), 50 mM NaCl, 5 mM Mg²⁺, 1 mM DTT and 10% v/v glycerol⁷⁹. The reaction was initiated by adding 1 μL of 100 μM S-adenosyl-L-methionine and incubated at room temperature for 1 h for the NCP substrates or 4 h for the recombinant H3 substrate. The 2× SDS-PAGE sample buffer was added to quench the reaction.

Western blotting

The histones were separated on a 10–15% polyacrylamide gel and transferred onto polyvinylidene difluoride membrane (Millipore). The membrane was blocked in blocking solution, consisting of 5% milk in 0.1% 1× Tween 20/TBS (TBST), followed by incubation at 4 ^oC overnight with the primary antibody in blocking solution. Membranes were washed three times in TBST and incubated with the HRP-conjugated anti-mouse/rabbit secondary antibodies at room temperature for 1 h. The membrane was developed using Pierce^TM ECL Western Blotting Substrate (Thermo Fisher Scientific, #32106), and images were captured by ChemiDoc^TM Touch Imaging System (Bio-rad).

Antibodies

The primary and secondary antibodies included: Rabbit anti-H3K4me1 (Abcam, cat # ab8895, 1:20,000), Rabbit anti-H3K4me2 (Millipore, cat # 07-030, 1:20,000), Rabbit-anti H3K4me3 (Millipore, cat # 07-473, 1:10,000), Rabbit anti-Histone H3 (Abcam, cat #ab1791, 1:20,000) and anti-Rabbit IgG Horseradish Peroxidase-linked whole antibody (GE Healthcare, cat #NA934, 1:10,000). Anti-HA (clone C29F4) rabbit monoclonal antibody (Cell signaling technology, cat #3724, 1:1000).

Preparation of ILV ¹³CH₃-labeled ASH2L NMR samples

The U-[²H] Ileδ1-[¹³CH₃] Leu, Val-[¹³CH₃, ¹³CH₃] ASH2L samples were produced using a previously developed protocol⁸⁰ with modifications. Freshly transformed single colony was inoculated into H₂O minimal media containing 6.5 g/L Na₂HPO₄, 3 g/L KH₂PO₄, 0.5 g/L NaCl, 120 mg/L MgSO₄, 11 mg/L CaCl₂, 10 mg/L biotin, 10 mg/L thiamine, 30 mg/L kanamycin, 2 g/L D-glucose and 1 g/L NH₄Cl. Cells were cultured at 37 °C until OD₆₀₀ reaches 0.25 and harvested to remove H₂O media. Then cells were resuspended in D₂O (99.9%, CIL, DLM-4-1000) minimal media containing the same salts in H₂O media in which plain glucose was replaced by D-[²H]-glucose (CIL, DLM-2062). Cells were cultured at 37 °C until OD₆₀₀ reaches 0.7–0.8. The temperature was lowered to 20 °C and 70 mg/L [¹³CH₃, 3,3-²H] α-ketobutyrate (Cambridge Isotope Laboratory, CDLM-7318) and 120 mg/L [3-¹³CH₃, 3,4,4,4-²H] α-ketoisovalerate (CIL, CDLM-73170) were added to the culture. After 1 h, IPTG dissolved in D₂O was added to the final concentration of 0.4 mM. Cells were cultured for another 24 h before harvesting. The labeled ASH2L proteins were purified through Ni-NTA column as described above. To prevent potential aggregation of ASH2L at high concentration^52,82, we added 1.5 molar excess of unlabeled RbBP5 (330–363) to all ASH2L samples. Acidic residue-rich RbBP5 (330–363) alleviates potential aggregation by masking the congregated basic residues in the ASH2L SPRY domain⁵². For simplicity, we use ASH2L to refer to ASH2L/RbBP5 (330–363) since only ASH2L was labeled and examined. To examine the effect of DPY30, additional 1.2 molar excess of unlabeled dimeric DPY30 was added. All the NMR samples were concentrated and buffer exchanged into 25 mM sodium phosphate, pH 6.5, 10 mM NaCl, 0.25 mM d₁₀-dithiothreitol (CIL, DLM-2622), and 1 mM NaN₃ in 99.99% D₂O (Aldrich Cat#151882).

NMR spectroscopy

NMR experiments were carried out on 800 MHz Bruker Ascend spectrometer equipped with pulsed-filed gradient 5 mm inverse triple resonance TXI probe and SampleCASE with 24 sample slots. IconNMR software was used for the automated collection of mutant samples for assignment. All HMQC experiments were acquired at 25 °C. Complex points of 2048 and 256 (¹H, ¹³C) were used for most of the experiments except for Ile mutants for assignment, for which 128 complex points in ¹³C dimension were used. The ¹H and ¹³C carrier frequencies were placed at 4.7 and 17 ppm, respectively. Spectral width was set to 12 and 20 ppm for ¹H and ¹³C dimensions, respectively. A recycle delay of 0.5 s was used with 32–256 scans depending on protein concentration. Residual water was suppressed by the WATERGATE method. ¹³C WALTZ-16 decoupling was employed during acquisition in the direct dimension. All spectra were processed using the NMRPipe program⁸¹. Gaussian broaden window and sine bell window functions were applied in ¹H and ¹³C dimensions. NMRFAM-Sparky was used to visualize NMR spectra⁷⁸.

Small-angle X-ray scattering

All SAXS data were collected at the 18-ID BioCAT Beamline (Biophysics Collaborative Access Team, Advanced Photon Source, Argonne National Laboratory) using the inline SEC-SAXS configuration, in which a flow cell was connected to a ÄKTApure FPLC system (GE Healthcare). To prevent potential aggregation, a stoichiometric amount of RbBP5 peptide (330–363) was added to the ASH2L samples as described above. About 200–500 μL of 1–2 mg/mL proteins were injected to a Superdex 200 column (10 × 300 mm, GE Healthcare) pre-equilibrated with 20 mM Tris (pH 7.5), 150 mM NaCl and 1 mM DTT. The flow rate was set to 0.7 mL/min during the data collection. The scattering data were collected every 2 s with 1 s exposure during the SEC elution between 5–24 ml. After data reduction, the strongest scattering data around the protein elution peak were selected for sample scattering. Several data points with minimal scattering near the elution peak were chosen for buffer-only scattering. PRIMUS⁸³ was used for data processing, including averaging scattering data, background subtraction, and calculation of the radius of gyration, R_g. and the Porod Volume. The molecular weight was estimated by dividing Porod Volume by 1.6. The pair distribution function was calculated by GNOM⁸⁴ in the GUI version of PRIMUS. For EOM analysis, a pool of 10,000 structures of ASH2L with N-terminal PHD-WH and C-terminal SPRY domains connected by the Linker and Loop IDRs was generated by RANCH⁵⁶. The sequence of RbBP5 (330–363) was not included in the EOM analysis given its small size, well-characterized interaction with the rigid SPRY domain⁵², as well as limitation of EOM for multiple polypeptide chains⁵⁶. GAJOE was used to select an ensemble that best fit the experimental data using a generic algorithm⁵⁶.

Molecular modeling of ASH2L IDRs

Human ASH2L protein consists of two domains, PHD-WH domain, and SPRY domain, that have homologous PDB structures, 3S32 (A-chains), 3TOJ, respectively. The crystal structure of yeast Bre2 determined in the COMPASS complex (PDB: 6CHG) contains the Linker and Loop IDRs. The three-dimensional (3D) model for the full-length human ASH2L protein (including PHD-WH domain, Linker-IDR and Loop-IDR regions and SPRY domain) was built by C-I-TASSER⁸⁵ using homologous PDB structures above. C-I-TASSER is a recently proposed protein structure prediction pipeline based on the classic I-TASSER protocol⁸⁶ with newly developed residue-residue contact predictors^87,88. LOMETS⁸⁹ threading is performed to align the query sequence to template structures from PDB database to extract continuous fragments. These fragments are used as initial models to assemble into full-length structure by a replica-exchange Monte Carlo (REMC) simulation guided by a composite force field consisting of deep learning-predicted contacts, template-derived distance restraints, and knowledge-based energy terms calculated by statistics of PDB database. The REMC simulation produces a variety of “decoy” conformations, which are then clustered by pairwise structure similarity⁹⁰. The centroid of the largest cluster is refined at the atomic level by FG-MD⁹¹ to obtain the final C-I-TASSER 3D model. The first model generated by C-I-TASSER was selected as the ASH2L model for the following analysis. The estimated TM-score of the entire model was 0.67 ± 0.13, indicating that it was a high-confidence model⁹². We removed the PHD-WH domain from the model during the cryo-EM fitting and refinement steps, since there is no density map collected for the PHD-WH domain.

Cryo-EM sample preparation and data collection

The GraFix method⁹³ was applied to the MLL1^RWSA- NCP complex to prepare for the cryo-EM grid. In brief, 30 μM of MLL1^RWSA was incubated with 10 μM NCP and 0.5 mM S-adenosyl-L-homocysteine for 30 min at 4 °C in the GraFix buffer (50 mM HEPES, pH 7.5, 50 mM NaCl, 1 mM MgCl₂, and 1 mM TCEP). The sample was centrifuged at 100,000 × g at 4 °C for 3 h after applying onto a centrifuge tube, which contained a gradient solution of 0–60% glycerol and 0–0.2% glutaraldehyde. After centrifugation, the crosslinked sample was quenched with 1 M Tris-HCl, pH 7.5. To remove glycerol from the GraFix buffer, we performed further buffer exchange using a centrifugal concentrator (Sartorius Vivaspin 500).

The sample at ~1 mg/ml was applied onto a glow discharged Quantifoil R1.2/1.3 grid (Electron Microscopy Sciences) at 4 °C with 100% humidity. The loaded grid was plunged-frozen in liquid ethane after 4 s blotting and 30 s waiting using a Mark IV Vitrobot (Thermo Fisher Scientific). The cryo-EM data were collected using Titan Krios (Thermo Fisher Scientific) operating at 300 keV with the K2 Summit direct electron detector. The movie data was recorded in a counting mode at a ×29,000 magnification and the pixel size of 1.01 Å/pixel, with a defocus range between −1.5 to −2.5 μm. A dose rate of 1.28 electrons/Å²/frame with a total 50 frames per 8 s was applied for data collection, resulting in a total dose of 64 electrons per Å². A total of 6,242 movies were collected.

Cryo-EM data processing and model refinement

Micrograph movies were aligned with whole-frame and local drift correction using MotionCorr2⁹⁴, and CTF was estimated with CTFFIND4.1⁹⁵. Micrographs with higher than 4.5 Å of the estimated resolution were further selected, which resulted in 6137 micrographs. A total of 1,287,771 particles were picked using Warp⁹⁶. The particles were extracted in RELION⁵⁸ and imported into cryoSPARC⁵⁷ for 2D classification. After excluding bad particles, a total of 1,194,542 particles were subjected to the first round of ab initio 3D classification into five classes (Supplementary Fig. 4). Two of five classes were subjected to the second round of ab initio 3D classification into five subclasses, and the subsequent heterogeneous refinement was performed. Four of the five subclasses displayed a well-defined map of the MLL and nucleosome complex after the heterogeneous refinement. They were exported for 3D classification. The focused 3D classification was performed at the MLL1^RWSA region without alignment (35 cycles, T = 4, binary mask: 10 pixels/soft mask: 10 pixels). The Class03 was excluded because it displayed a structurally heterogeneous and unresolvable EM density even after the focused 3D classification. The best behaving class selected from Class01 (13,086 particles), Class02 (27,730 particles), and Class05 (23,236 particles) was subjected to the 3D auto refinement and further post-processed to a resolution of 6.9, 4.6, and 6.0 Å, respectively. Each final cryo-EM map was locally filtered to avoid over-estimation. The resolution of all structures was estimated by RELION with Fourier shell correlation (FSC) at the criteria of 0.143.

For the model building, the rigid-body fitting was performed for each class using Chimera⁹⁷. The cryo-EM structure of MLL1^RWSAD-NCP (PDB ID: 6PWV [https://doi.org/10.2210/pdb6pwv/pdb])⁹⁸ was used for the rigid-body fitting for each individual class. For the model refinement, each class was subjected to the real-space refinement using PHENIX⁹⁹, and model validations were performed by MolProbity¹⁰⁰. Statistics for data collection, refinement, and validation were summarized in Supplementary Table 2.

ESC culture and transfection

E14tg2a (E14) (ACTT, #CRL-1821TM) were grown in the KnockOut™ DMEM medium containing 15% FBS, 2 mM glutamine, 1X non-essential amino acids, 0.1 mM 2-mercaptoethanol and 10³ U ml⁻¹ LIF (Millipore, #ESG1107), unless otherwise indicated. E14 cells were routinely tested for negative mycoplasma contamination using the LookOut® Mycoplasma PCR Detection Kit (SIGMA ALDRICH, #MP0035) according to the manufacturer’s instructions. For expressing dCas9 fusion proteins, E14 ESCs were transfected with pcDNA3-dCas9-HA and pcDNA3-dCas9-DPY30-HA plasmids using Fugene 6 (Promega, Cat# E2691) for 2 days and then selected with G418 (400 µg/ml, Gibco, Cat# 10131-035) for 5 days. After selection, the cells were split and transfected with a pool of three pspgRNA-gRNAs for selected genomic loci. The pspgRNA-gRNAs were co-transfected with a pBase vector (1:10) that confers puromycin resistance. After 2 days of puromycin selection (1.5 µg/ml, Gibco, Cat# A11138-03), the cells were subject to ChIP using anti-HA antibody (Cell Signaling Technology, cat# 3724) and anti-H3K4me3 antibody (Millipore, Cat# 07-473), respectively. ChIP-qPCRs were performed to detect the enrichment of H3K4me3 and HA in each location.

CUT&RUN

CUT&RUN was performed according to the protocol described previously¹⁰¹. HA-ASH2L E14 and the parental E14 (E14tg2a, ACTT, #CRL-1821TM) cell lines were cultured in presence of 1 μg/mL Doxycycline for 2 days. Biological duplicates were performed for HA-ASH2L and H3K4me3, respectively. For each experiment, 1 × 10⁶ cells were harvested, washed with wash buffer (20 mM HEPES pH7.5, 150 mM NaCl, 0.5 mM spermidine, 1× protease inhibitor cocktail), and incubated with Concanavalin A-coated beads (Bangs Laboratories, Inc. #BP531) for 15 min with rotation. Bead-bound cells were resuspended in solution (digitonin/wash buffer) and incubated with anti-HA (Cell Signaling, #3724) or anti-H3K4me3 (Millipore, #07-473) antibodies overnight at 4 °C. The beads were washed with digitonin/wash buffer three times before adding protein A-MNase (0.5 ng/μL) and incubating for 1 h at 4 °C. Following three washes, bound protein A-MNase was activated on ice for 30 min by the addition of 3 mM CaCl₂. The reaction was quenched with equal volume of 2× stop buffer (340 mM NaCl, 20 mM EDTA, 4 mM EGTA, 0.02% Digitonin (EMD Millipore #300410), 50 μg/mL RNase A (QIAGEN #19101), 50 μg/mL glycogen (Roche #10901393001), 2 pg/mL Drosophila spike-in DNA) at 37 °C for 30 min. The proteins were removed by incubating with 0.1% SDS and 0.15 mg/mL Proteinase K (Roche 3115879001) at 65 °C for 2 h. DNA fragments were purified by phenol-chloroform and ethanol precipitation and subjected to library preparation. The sequencing was performed at the University of Michigan Advance DNA Sequencing Core.

ChIP analysis and quantitative real-time PCR (qPCR)

E14 cells expressing dCas9 fusion proteins were transfected with or without pooled gRNAs (4~5 gRNAs for each selected region) (Supplementary Table 4) prior to the experiment. Cells were crosslinked with 1% paraformaldehyde at room temperature for 10 min and quenched by 250 mM glycine. After two washes with cold 1×PBS, cells were lysed, and the chromatin was sonicated for three times for 20 min each using Diagenode Bioruptor 300 for 3 rounds of 20 cycles with 30” on/off per cycle. The supernatant of the sonicated lysate was diluted with 5 volumes of ChIP dilution buffer (16.7 mM Tris-HCl pH 7.5, 12 mM EDTA, 1.1% Triton X-100, 167 mM NaCl, 0.01% SDS) and incubated with anti-H3K4me3 or anti-HA antibodies at 4 °C overnight. The immune complexes were purified on 30 µl of protein G magnetic beads (Invitrogen, Cat# 10003D) for 2 h at 4 °C, followed by three times of washes with low stringency buffer (50 mM HEPES pH 7.9, 5 mM EDTA pH 8.0, 1% NP-40, 0.2% DOC, 1×PBS) and high stringency buffer (50 mM HEPES pH 7.9, 5 mM EDTA pH 8.0, 1% NP-40, 0.7% DOC, 500 mM LiCl) as well as two times washes with Last Wash Buffer (5× TE pH 8.0, 0.3% NP-40). The beads were eluted twice with elution buffer (100 mM NaHCO3, 1% SDS) and reverse-crosslinked at 65 °C overnight. The samples were incubated with RNAse A at 37 °C for 30 min, followed by incubation with Proteinase K (20 mg/ml) at 45 °C for 1 h. DNA was recovered by phenol-chloroform extraction and ethanol precipitation. Real-time PCR was carried out using Radiant Green 2× QPCR mix (Alkali Scientific, Cat# QS1050) on Bio-Rad Real-time PCR machine. Primer information for real-time PCR is included in Supplementary Table 4.

ChIP-seq data mapping and normalization

ChIP-seq dataset for DPY30 and MLL1 were downloaded from GEO GSE26136 and GEO GSE107406, respectively. Paired-end sequencing reads were trimmed with trim_galore to remove adaptor sequences. We kept reads that were 20 bp or longer after trimming and paired between the mates. All ChIP-seq data were mapped to the mouse mm10 genome by using Bowtie2 (v2-2.2.4)¹⁰² with parameters “-q --phred33 --very-sensitive -p 10”. Duplicated reads were removed using SAMtools (v1.5)¹⁰³. The bigwig files for IP/input ratio were generated from BAM files by using deepTools3 (v3.2.1)¹⁰⁴ with command “bamCompare -b1 ChIP-bam -b2 Input-bam --ignoreDuplicates --minMappingQuality 30 --normalizeUsing RPKM --binSize 1 --operation ratio --scaleFactorsMethod None -p 20”. BAM files for mapping results were merged using SAMtools and converted to BED format using BEDTools¹⁰⁵. Peaks were called from bed files using MACS (v 1.4.2)¹⁰⁶ with parameters “-w -S -p 0.00001 -g mm”. The input signal was used as the control for peak calling. Heatmap of ChIP-seq signals were visualized using deepTools3.

CUT&RUN peak calling and visualization

HA or H3K4me3 CUT&RUN from two independent biological replicates were initially analyzed in parallel. Paired-end sequencing reads were processed as described above. The resulting alignments, recorded in BAM file, were sorted, indexed, and marked for duplicates with SAMtools¹⁰³. The analysis showed a good correlation and signal-noise ratio from replicates. The BAM files for mapping results from the replicates were used for further analysis. The overlapping peaks were merged as the union of all using SAMtools and converted to BED format using BEDTools¹⁰⁵. Fragments with size <120 bp were retained¹⁰⁷ by using subcommand “alignmentSieve” in deepTools3¹⁰⁴. Peaks were called from bed files using MACS (v 1.4.2)¹⁰⁶ with parameters “-w -S -p 0.00001 -g mm”. The bigwig files for visualization were generated from MACS. Heatmap of CUT&RUN signals were visualized using subcommand “computeMatrix” and “plotHeatmap” in deepTools3.

Statistical analysis and reproducibility

Statistical analysis was performed by two-tailed Student’s t-test using GraphPad Prism 7.0 software. Data were presented as the standard error of the mean (SEM). p value of <0.05 was considered statistically significant; *p < 0.05, **p < 0.01, ***p < 0.001. For all in vitro HMT experiments shown Figs. 1–3 as well as Supplementary Figs. 1, 2, and 11, a minimum of three independent experiments were performed and consistent results were obtained.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data that support this study are available from the corresponding author upon reasonable request. ChIP-seq datasets for HA-ASH2L and H3K4me3 generated in this study are accessible at GEO with accession code GSE146933. DPY30 and MLL1 datasets were downloaded from GEO GSE26136 and GEO GSE107406, respectively. Cryo-EM structures for the 4-MLL1-NCP reported in this study are available with accession numbers: Class01 – PDB 6W5I [https://doi.org/10.2210/pdb6w5i/pdb] and EMDB EMD-21542; Class02 – PDB 6W5M: [https://doi.org/10.2210/pdb6w5m/pdb] and EMDB: EMD-21543; Class05 – PDB 6W5N: [https://doi.org/10.2210/pdb6w5n/pdb] and EMDB: EMD-21544. Source data are provided with this paper.

References

Jenuwein, T. & Allis, C. D. Translating the histone code. Science 293, 1074–1080 (2001).
Article CAS PubMed Google Scholar
Calo, E. & Wysocka, J. Modification of enhancer chromatin: what, how, and why?. Mol. Cell 49, 825–837 (2013).
Article CAS PubMed Google Scholar
Bannister, A. J. & Kouzarides, T. Regulation of chromatin by histone modifications. Cell Res. 21, 381–395 (2011).
Article CAS PubMed PubMed Central Google Scholar
Vermeulen, M. et al. Selective anchoring of TFIID to nucleosomes by trimethylation of histone H3 lysine 4. Cell 131, 58–69 (2007).
Article CAS PubMed Google Scholar
Tang, Z. et al. SET1 and p300 act synergistically, through coupled histone modifications, in transcriptional activation by p53. Cell 154, 297–310 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lauberth, S. M. et al. H3K4me3 interactions with TAF3 regulate preinitiation complex assembly and selective gene activation. Cell 152, 1021–1036 (2013).
Article CAS PubMed PubMed Central Google Scholar
Ruthenburg, A. J., Allis, C. D. & Wysocka, J. Methylation of lysine 4 on histone H3: intricacy of writing and reading a single epigenetic mark. Mol. Cell 25, 15–30 (2007).
Article CAS PubMed Google Scholar
Wysocka, J. et al. A PHD finger of NURF couples histone H3 lysine 4 trimethylation with chromatin remodelling. Nature 442, 86–90 (2006).
Article ADS CAS PubMed Google Scholar
Taverna, S. D. et al. How chromatin-binding modules interpret histone modifications: lessons from professional pocket pickers. Nat. Struct. Mol. Biol. 14, 1025–1040 (2007).
Article CAS PubMed PubMed Central Google Scholar
Phillips, J. E. & Corces, V. G. CTCF: master weaver of the genome. Cell 137, 1194–1211 (2009).
Article PubMed PubMed Central Google Scholar
Tang, Z. et al. CTCF-mediated human 3D genome architecture reveals chromatin topology for transcription. Cell 163, 1611–1627 (2015).
Article CAS PubMed PubMed Central Google Scholar
Yan, J. et al. Histone H3 lysine 4 monomethylation modulates long-range chromatin interactions at enhancers. Cell Res. 28, 204–220 (2018).
Sims, R. J. 3rd et al. Recognition of trimethylated histone H3 lysine 4 facilitates the recruitment of transcription postinitiation factors and pre-mRNA splicing. Mol. Cell 28, 665–676 (2007).
Article CAS PubMed PubMed Central Google Scholar
Khan, D. H. et al. Dynamic histone acetylation of H3K4me3 nucleosome regulates MCL1 pre-mRNA splicing. J. Cell Physiol. 231, 2196–2204 (2016).
Article CAS PubMed Google Scholar
Ng, S. B. et al. Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome. Nat. Genet 42, 790–793 (2010).
Article CAS PubMed PubMed Central Google Scholar
Paulussen, A. D. et al. MLL2 mutation spectrum in 45 patients with Kabuki syndrome. Hum. Mutat. 32, E2018–E2025 (2011).
Article CAS PubMed Google Scholar
Wang, K. C. et al. A long noncoding RNA maintains active chromatin to coordinate homeotic gene expression. Nature 472, 120–124 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Micale, L. et al. Mutation spectrum of MLL2 in a cohort of Kabuki syndrome patients. Orphanet J. Rare Dis. 6, 38 (2011).
Article PubMed PubMed Central Google Scholar
Hannibal, M. C. et al. Spectrum of MLL2 (ALR) mutations in 110 cases of Kabuki syndrome. Am. J. Med Genet. A 155A, 1511–1516 (2011).
Article PubMed CAS Google Scholar
Kluijt, I. et al. Kabuki syndrome - report of six cases and review of the literature with emphasis on ocular features. Ophthalmic Genet. 21, 51–61 (2000).
Article CAS PubMed Google Scholar
Jones, W. D. et al. De novo mutations in MLL cause Wiedemann-Steiner syndrome. Am. J. Hum. Genet. 91, 358–364 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mendelsohn, B. A. et al. Advanced bone age in a girl with Wiedemann-Steiner syndrome and an exonic deletion in KMT2A (MLL). Am. J. Med. Genet. A 164A, 2079–2083 (2014).
Article PubMed CAS Google Scholar
Strom, S. P. et al. De Novo variants in the KMT2A (MLL) gene causing atypical Wiedemann-Steiner syndrome in two unrelated individuals identified by clinical exome sequencing. BMC Med. Genet. 15, 49 (2014).
Article PubMed PubMed Central Google Scholar
Rao, R. C. & Dou, Y. Hijacked in cancer: the KMT2 (MLL) family of methyltransferases. Nat. Rev. Cancer 15, 334–346 (2015).
Article CAS PubMed PubMed Central Google Scholar
Cho, Y. W. et al. PTIP associates with MLL3- and MLL4-containing histone H3 lysine 4 methyltransferase complex. J. Biol. Chem. 282, 20395–20406 (2007).
Article CAS PubMed Google Scholar
Cosgrove, M. S. & Patel, A. Mixed lineage leukemia: a structure-function perspective of the MLL1 protein. FEBS J. 277, 1832–1842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Dou, Y. et al. Regulation of MLL1 H3K4 methyltransferase activity by its core components. Nat. Struct. Mol. Biol. 13, 713–719 (2006).
Article CAS PubMed Google Scholar
Wu, L. et al. ASH2L regulates ubiquitylation signaling to MLL: trans-regulation of H3 K4 methylation in higher eukaryotes. Mol. Cell 49, 1108–1120 (2013).
Article CAS PubMed PubMed Central Google Scholar
Cao, F. et al. An Ash2L/RbBP5 heterodimer stimulates the MLL1 methyltransferase activity through coordinated substrate interactions with the MLL1 SET domain. PLoS ONE 5, e14102 (2010).
Article ADS PubMed PubMed Central CAS Google Scholar
Li, Y. et al. Structural basis for activity regulation of MLL family methyltransferases. Nature 530, 447–452 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Patel, A. et al. A conserved arginine-containing motif crucial for the assembly and enzymatic activity of the mixed lineage leukemia protein-1 core complex. J. Biol. Chem. 283, 32162–32175 (2008).
Article CAS PubMed Google Scholar
Cao, F. et al. Targeting MLL1 H3K4 methyltransferase activity in mixed-lineage leukemia. Mol. Cell 53, 247–261 (2014).
Article CAS PubMed PubMed Central Google Scholar
Vedadi, M. et al. Targeting human SET1/MLL family of proteins. Protein Sci. 26, 662–676 (2017).
Article CAS PubMed PubMed Central Google Scholar
Han, J. et al. The internal interaction in RBBP5 regulates assembly and activity of MLL1 methyltransferase complex. Nucleic Acids Res. 47, 10426–10438 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kaustov, L. et al. The MLL1 trimeric catalytic complex is a dynamic conformational ensemble stabilized by multiple weak interactions. Nucleic Acids Res. 47, 9433–9447 (2019).
CAS PubMed PubMed Central Google Scholar
Patel, A. et al. On the mechanism of multiple lysine methylation by the human mixed lineage leukemia protein-1 (MLL1) core complex. J. Biol. Chem. 284, 24242–24256 (2009).
Article CAS PubMed PubMed Central Google Scholar
Park, S. H. et al. Cryo-EM structure of the human MLL1 core complex bound to the nucleosome. Nat. Commun. 10, 5540 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Xue, H. et al. Structural basis of nucleosome recognition and modification by MLL methyltransferases. Nature 573, 445–449 (2019).
Article ADS CAS PubMed Google Scholar
Haddad, J. F. et al. Structural analysis of the Ash2L/Dpy-30 complex reveals a heterogeneity in H3K4 methylation. Structure 26, 1594–1603.e4 (2018).
Article CAS PubMed Google Scholar
Shinsky, S. A. & Cosgrove, M. S. Unique role of the WD-40 repeat protein 5 (WDR5) subunit within the mixed lineage leukemia 3 (MLL3) histone methyltransferase complex. J. Biol. Chem. 290, 25819–25833 (2015).
Article CAS PubMed PubMed Central Google Scholar
Jiang, H. et al. Role for Dpy-30 in ES cell-fate specification by regulation of H3K4 methylation within bivalent domains. Cell 144, 513–525 (2011).
Article CAS PubMed PubMed Central Google Scholar
Yang, Z. et al. The DPY30 subunit in SET1/MLL complexes regulates the proliferation and differentiation of hematopoietic progenitor cells. Blood 124, 2025–2033 (2014).
Article CAS PubMed PubMed Central Google Scholar
Yang, Z. et al. Dpy30 is critical for maintaining the identity and function of adult hematopoietic stem cells. J. Exp. Med. 213, 2349–2364 (2016).
Article CAS PubMed PubMed Central Google Scholar
Shah, K. K. et al. Specific inhibition of DPY30 activity by ASH2L-derived peptides suppresses blood cancer cell growth. Exp. Cell Res. 382, 111485 (2019).
Article CAS PubMed PubMed Central Google Scholar
Tremblay, V. et al. Molecular basis for DPY-30 association to COMPASS-like and NURF complexes. Structure 22, 1821–1830 (2014).
Article CAS PubMed PubMed Central Google Scholar
Hsu, P. L. et al. Structural basis of H2B ubiquitination-dependent H3K4 methylation by COMPASS. Mol. Cell 76, 712–723.e4 (2019).
Article CAS PubMed PubMed Central Google Scholar
Chen, Y. et al. Crystal structure of the N-terminal region of human Ash2L shows a winged-helix motif involved in DNA binding. EMBO Rep. 12, 797–803 (2011).
Article PubMed PubMed Central CAS Google Scholar
Ikegawa, S. et al. Cloning and characterization of ASH2L and Ash2l, human and mouse homologs of the Drosophila ash2 gene. Cytogenet. Cell Genet. 84, 167–172 (1999).
Article CAS PubMed Google Scholar
Roguev, A. et al. The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4. EMBO J. 20, 7137–7148 (2001).
Article CAS PubMed PubMed Central Google Scholar
Tremblay, V. et al. Molecular Basis for DPY-30 Association to COMPASS-like and NURF Complexes. Structure 22, 1821–1830 (2014).
South, P. F. et al. A conserved interaction between the SDI domain of Bre2 and the Dpy-30 domain of Sdc1 is required for histone methylation and gene expression. The. J. Biol. Chem. 285, 595–607 (2010).
Article CAS PubMed Google Scholar
Chen, Y. et al. Structure of the SPRY domain of human Ash2L and its interactions with RbBP5 and DPY30. Cell Res. 22, 598–602 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hsu, P. L. et al. Crystal structure of the COMPASS H3K4 methyltransferase catalytic module. Cell 174, 1106–1116.e9 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mersman, D. P. et al. Charge-based interaction conserved within histone H3 lysine 4 (H3K4) methyltransferase complexes is needed for protein stability, histone methylation, and gene expression. J. Biol. Chem. 287, 2652–2665 (2012).
Article CAS PubMed Google Scholar
Amero, C. et al. A systematic mutagenesis-driven strategy for site-resolved NMR studies of supramolecular assemblies. J. Biomol. NMR 50, 229–236 (2011).
Article CAS PubMed Google Scholar
Bernadó, P. et al. Structural characterization of flexible proteins using small-angle X-ray scattering. J. Am. Chem. Soc. 129, 5656–5664 (2007).
Article PubMed CAS Google Scholar
Punjani, A. et al. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
Article CAS PubMed Google Scholar
Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. Elife 7, e42166 (2018).
Zhang, Y. I-TASSER server for protein 3D structure prediction. BMC Bioinforma. 9, 40 (2008).
Article CAS Google Scholar
Roy, A., Kucukural, A. & Zhang, Y. I-TASSER: a unified platform for automated protein structure and function prediction. Nat. Protoc. 5, 725–738 (2010).
Article CAS PubMed PubMed Central Google Scholar
Zhang, H. et al. MLL1 inhibition and vitamin D signaling cooperate to facilitate the expanded pluripotency state. Cell Rep. 29, 2659–2671.e6 (2019).
Article CAS PubMed PubMed Central Google Scholar
Worden, E. J., Zhang, X. & Wolberger, C. Structural basis for COMPASS recognition of an H2B-ubiquitinated nucleosome. Elife 9, e53199 (2020).
Haynes, C. et al. Intrinsic disorder is a common feature of hub proteins from four eukaryotic interactomes. PLoS Comput. Biol. 2, e100 (2006).
Article ADS PubMed PubMed Central CAS Google Scholar
Kim, P. M. et al. The role of disorder in interaction networks: a structural analysis. Mol. Syst. Biol. 4, 179 (2008).
Article PubMed PubMed Central Google Scholar
Oldfield, C. J. & Dunker, A. K. Intrinsically disordered proteins and intrinsically disordered protein regions. Annu. Rev. Biochem. 83, 553–584 (2014).
Article CAS PubMed Google Scholar
Wright, P. E. & Dyson, H. J. Intrinsically disordered proteins in cellular signalling and regulation. Nat. Rev. Mol. Cell Biol. 16, 18–29 (2015).
Article CAS PubMed PubMed Central Google Scholar
van der Lee, R. et al. Classification of intrinsically disordered regions and proteins. Chem. Rev. 114, 6589–6631 (2014).
Article PubMed PubMed Central CAS Google Scholar
Dunker, A. K. et al. Flexible nets. The roles of intrinsic disorder in protein interaction networks. FEBS J. 272, 5129–5148 (2005).
Article CAS PubMed Google Scholar
Gibson, B. A. et al. Organization of chromatin by intrinsic and regulated phase separation. Cell 179, 470–484.e21 (2019).
Article CAS PubMed PubMed Central Google Scholar
Bochynska, A., Luscher-Firzlaff, J. & Luscher B. Modes of interaction of KMT2 histone H3 lysine 4 methyltransferase/COMPASS complexes with chromatin. Cells 7, 17 (2018).
Butler, J. S. et al. Low expression of ASH2L protein correlates with a favorable outcome in acute myeloid leukemia. Leuk. Lymphoma 58, 1207–1218 (2017).
Article CAS PubMed Google Scholar
Magerl, C. et al. H3K4 dimethylation in hepatocellular carcinoma is rare compared with other hepatobiliary and gastrointestinal carcinomas and correlates with expression of the methylase Ash2 and the demethylase LSD1. Hum. Pathol. 41, 181–189 (2010).
Article CAS PubMed Google Scholar
Luscher-Firzlaff, J. et al. The human trithorax protein hASH2 functions as an oncoprotein. Cancer Res. 68, 749–758 (2008).
Article PubMed CAS Google Scholar
Ullius, A. et al. The interaction of MYC with the trithorax protein ASH2L promotes gene transcription by regulating H3K27 modification. Nucleic Acids Res. 42, 6901–6920 (2014).
Article CAS PubMed PubMed Central Google Scholar
Thomas, L. R. et al. Interaction with WDR5 promotes target gene recognition and tumorigenesis by MYC. Mol. Cell 58, 440–452 (2015).
Article CAS PubMed PubMed Central Google Scholar
Mungamuri, S. K. et al. Ash2L enables P53-dependent apoptosis by favoring stable transcription pre-initiation complex formation on its pro-apoptotic target promoters. Oncogene 34, 2461–2470 (2015).
Article CAS PubMed Google Scholar
Lazar, T. et al. Intrinsic protein disorder in histone lysine methylation. Biol. Direct 11, 30 (2016).
Article PubMed PubMed Central Google Scholar
Lee, Y. T. et al. One-pot refolding of core histones from bacterial inclusion bodies allows rapid reconstitution of histone octamer. Protein Expr. Purif. 110, 89–94 (2015).
Article CAS PubMed PubMed Central Google Scholar
Dou, Y. et al. Physical association and coordinate function of the H3 K4 methyltransferase MLL1 and the H4 K16 acetyltransferase MOF. Cell 121, 873–885 (2005).
Article CAS PubMed Google Scholar
Tugarinov, V., Kanelis, V. & Kay, L. E. Isotope labeling strategies for the study of high-molecular-weight proteins by solution NMR spectroscopy. Nat. Protoc. 1, 749–754 (2006).
Article CAS PubMed Google Scholar
Delaglio, F. et al. NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 6, 277–293 (1995).
Article CAS PubMed Google Scholar
Zhang, Y. et al. Evolving catalytic properties of the MLL Family SET domain. Structure 23, 1921–1933 (2015).
Article CAS PubMed PubMed Central Google Scholar
Konarev, P. V. et al. PRIMUS: a Windows PC-based system for small-angle scattering data analysis. J. Appl. Crystallogr. 36, 1277–1282 (2003).
Article CAS Google Scholar
Svergun, D. Determination of the regularization parameter in indirect-transform methods using perceptual criteria. J. Appl. Crystallogr. 25, 495–503 (1992).
Article CAS Google Scholar
Zheng, W. et al. Deep-learning contact-map guided protein structure prediction in CASP13. Proteins: Struct., Funct., Bioinforma. 87, 1149–1164 (2019).
Article CAS Google Scholar
Zheng, W. et al. I-TASSER gateway: a protein structure and function prediction server powered by XSEDE. Future Gener. Computer Syst. 99, 73–85 (2019).
Article Google Scholar
Li, Y. et al. ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks. Bioinformatics 35, 4647–4655 (2019).
Article CAS PubMed PubMed Central Google Scholar
Li, Y. et al. Ensembling multiple raw coevolutionary features with deep residual neural networks for contact-map prediction in CASP13. Proteins: Struct., Funct., Bioinforma. 87, 1082–1091 (2019).
Article CAS Google Scholar
Zheng, W. et al. LOMETS2: improved meta-threading server for fold-recognition and structure-based function annotation for distant-homology proteins. Nucleic Acids Res. 47, W429–W436 (2019).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. & Skolnick, J. SPICKER: a clustering approach to identify near-native protein folds. J. Comput. Chem. 25, 865–871 (2004).
Article CAS PubMed Google Scholar
Zhang, J., Liang, Y. & Zhang, Y. Atomic-level protein structure refinement using fragment-guided molecular dynamics conformation sampling. Structure 19, 1784–1795 (2011).
Article CAS PubMed PubMed Central Google Scholar
Xu, J. & Zhang, Y. How significant is a protein structure similarity with TM-score = 0.5?. Bioinformatics 26, 889–895 (2010).
Article CAS PubMed PubMed Central Google Scholar
Kastner, B. et al. GraFix: sample preparation for single-particle electron cryomicroscopy. Nat. Methods 5, 53–55 (2008).
Article CAS PubMed Google Scholar
Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017).
Article CAS PubMed PubMed Central Google Scholar
Rohou, A. & Grigorieff, N. CTFFIND4: Fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 192, 216–221 (2015).
Article PubMed PubMed Central Google Scholar
Tegunov, D. & Cramer, P. Real-time cryo-electron microscopy data preprocessing with Warp. Nat. Methods 16, 1146–1152 (2019).
Article CAS PubMed PubMed Central Google Scholar
Pettersen, E. F. et al. UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Article CAS PubMed Google Scholar
Park, S. H. et al. Cryo-EM structure of the human Mixed Lineage Leukemia-1 complex bound to the nucleosome. Nat. Commun. 10, 5540 (2019).
Afonine, P. V. et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr. D. Struct. Biol. 74, 531–544 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D. Biol. Crystallogr 66, 12–21 (2010).
Article CAS PubMed Google Scholar
Skene, P. J., Henikoff, J. G. & Henikoff, S. Targeted in situ genome-wide profiling with high efficiency for low cell numbers. Nat. Protoc. 13, 1006–1019 (2018).
Article CAS PubMed Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central CAS Google Scholar
Ramirez, F. et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, W160–W165 (2016).
Article CAS PubMed PubMed Central Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Article PubMed PubMed Central CAS Google Scholar
Skene, P. J. & Henikoff, S. An efficient targeted nuclease strategy for high-resolution mapping of DNA binding sites. Elife 6, e21856 (2017).
Schneider, C. A., Rasband, W. S. & Eliceiri, K. W. NIH Image to ImageJ: 25 years of image analysis. Nat. Methods 9, 671–675 (2012).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We are grateful to Dr. Cosgrove for providing the plasmids for the expression of the SET domains of human MLL2, MLL3, MLL4, SET1A, and SET1B, to Dr. Guobing Li for valuable technical help. We thank Drs. Debashish Sahu and Erik Zuiderweg for help with the NMR experiments. We are grateful to the Rogel Cancer Center at the University of Michigan and Norris Comprehensive Cancer Center at the University of Southern California for the research support. This work is also supported by the NIGMS grant (GM082856) to Y.D. and the NCI grant (CA250329) to Y.D. and U.S.C. A.A. is supported in part by the MERIT fellowship from the Rackham graduate school at the University of Michigan. L.S. is supported in part by the Michigan Institute for Clinical and Health Research (MICHR) Postdoctoral Translational Scholar Program (PTSP) Fellowship.

Author information

Young-Tae Lee
Present address: Accent Therapeutics, 65 Hayden Avenue, Lexington, MA, USA

Authors and Affiliations

Department of Pathology, University of Michigan, Ann Arbor, MI, USA
Young-Tae Lee, Alex Ayoub, Jing Xu, Fengbiao Mao & Yali Dou
Department of Biological Chemistry, University of Michigan, Ann Arbor, MI, USA
Sang-Ho Park, Yang Zhang & Uhn-Soo Cho
Department of Medicine, Department of Biochemistry and Molecular Medicine, University of Southern California, Los Angeles, CA, USA
Liang Sha & Yali Dou
Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
Wei Zheng & Yang Zhang

Authors

Young-Tae Lee
View author publications
You can also search for this author in PubMed Google Scholar
Alex Ayoub
View author publications
You can also search for this author in PubMed Google Scholar
Sang-Ho Park
View author publications
You can also search for this author in PubMed Google Scholar
Liang Sha
View author publications
You can also search for this author in PubMed Google Scholar
Jing Xu
View author publications
You can also search for this author in PubMed Google Scholar
Fengbiao Mao
View author publications
You can also search for this author in PubMed Google Scholar
Wei Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Yang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Uhn-Soo Cho
View author publications
You can also search for this author in PubMed Google Scholar
Yali Dou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.-T.L. conceived, designed, performed most biochemical and NMR experiments and wrote the manuscript. A.A. performed in vitro histone methyltransferase assays and NMR experiments and wrote the manuscript. S.H.P. performed the cryo-EM studies under the supervision of U.-S.C. F.M. performed bioinformatics analyses; W.Z. performed molecular modeling experiment under the supervision of Y.Z. J.X. performed dCas9 transfection and ChIP experiments. L.S. performed CUT&RUN experiments for ASH2L and H3K4me3. Y.D. supervised the overall study and wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Yali Dou.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Tatiana Kutateladze, Christopher Douse and the other, anonymous, reviewer for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lee, YT., Ayoub, A., Park, SH. et al. Mechanism for DPY30 and ASH2L intrinsically disordered regions to modulate the MLL/SET1 activity on chromatin. Nat Commun 12, 2953 (2021). https://doi.org/10.1038/s41467-021-23268-9

Download citation

Received: 19 April 2020
Accepted: 16 April 2021
Published: 19 May 2021
DOI: https://doi.org/10.1038/s41467-021-23268-9

This article is cited by

ASH2L upregulation contributes to diabetic endothelial dysfunction in mice through STEAP4-mediated copper uptake
- Wen Zhong
- Ye-jun Dong
- Jun Chang
Acta Pharmacologica Sinica (2024)
lncCPSET1 acts as a scaffold for MLL2/COMPASS to regulate Bmp4 and promote the formation of chicken primordial germ cells
- Ying Ding
- Chen Zhang
- Bichun Li
Molecular Genetics and Genomics (2024)
ASH2L regulates postnatal neurogenesis through Onecut2-mediated inhibition of TGF-β signaling pathway
- Ya-Jie Xu
- Shang-Kun Dai
- Chang-Mei Liu
Cell Death & Differentiation (2023)
Methylation of histone H3 lysine 4 is required for maintenance of beta cell function in adult mice
- Ben Vanderkruk
- Nina Maeshima
- Brad G. Hoffman
Diabetologia (2023)
KMT2A-rearranged sarcoma with unusual fusion gene CBX6::KMT2A::PYGO1
- Harumi Nakamura
- Yoji Kukita
- Toshinari Yagi
Virchows Archiv (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.