Introduction

During the cell cycle, a cell replicates its genome into two identical copies and coordinates the chromosome segregation with cell division. The orderly progression through the different phases of the cell cycle relies on the cyclin-dependent kinase-cyclin (CDK-cyclin) oscillator that sequentially triggers cell cycle transitions by phosphorylation of specific targets including the downstream effectors of the cell cycle. The oscillating activity of the CDK-cyclins is achieved by cell cycle specific protein ubiquitination, phosphorylation and transcriptional regulation1,2.

Cell cycle-dependent transcription of cell cycle genes is controlled by the MuvB family of transcriptional regulators3,4,5. The MuvB core complex consists of a main scaffolding subunit called LIN9, the histone-binding protein RbBP4, the DNA-binding protein LIN54, and the smaller subunits LIN37 and LIN526. The DNA-binding domain of LIN54 recruits MuvB to its target genes at a specific sequence termed the cell cycle homology region (CHR)7,8. MuvB target promoters are TATA-less and the CHR is located directly upstream of the transcription start site (TSS) and the +1 nucleosome8,9. The presence of the +1 nucleosome generates a barrier for the assembly of the basal transcriptional apparatus10, and its positioning and dynamics are finely regulated to define the transcriptional status of each gene. Even though the mechanisms of regulation of the +1 nucleosome are not fully understood and are the object of intensive research11, the location of the CHR and several studies7,9,12 suggests that MuvB complexes, which perform both activating and repressive roles, may be involved in this process.

During cell cycle exit in quiescence and in senescence, also known as G0, the MuvB complex interacts with a retinoblastoma-like protein, thereby forming the DREAM complex3,4,13, which mediates transcriptional repression of about 1000 cell cycle genes6. Recognition of the target genes by the DREAM complex is combinatorial, as it involves binding of a CHR and an upstream cell-cycle dependent element (CDE)2,14,15.

When the cell is committed to divide, the increasing activity of CDK2-cylin D and E at the G1-S transition promotes the hyperphosphorylation of the retionoblastoma-like proteins16,17, thereby causing the disassembly of DREAM and cell cycle re-entry16,18,19. The remaining CHR-bound MuvB core complex sequentially associates with B-MYB and FOXM1 transcription factors (TFs) during S phase20. B-MYB and FOXM1 are oncogenes and are overexpressed in several cancer types21,22,23,24. Sequential assembly of B-MYB and FOXM1 onto the MuvB complex at CHR elements is required for triggering transcriptional activation of the mitotic gene programme at the G2 phase20,25,26. Importantly, the complex of B-MYB and MuvB (MMB) is required for FOXM1 recruitment at cell cycle genes, therefore B-MYB has been defined as a pioneer TF for cell cycle-dependent transcription of G2/M genes20,27.

The molecular mechanism for MuvB-dependent gene repression or activation is poorly understood, and the stoichiometry and overall architecture of this complex are unclear.

The presence of the histone-binding protein RbBP4, which is shared among several chromatin regulator complexes2,28,29,30, suggests that this complex is involved in the regulation of chromatin structure, but the lack of any ATPase domain in MuvB indicates that it is not involved in ATP-dependent chromatin remodelling. Many chromatin regulating complexes function in an ATP-independent fashion. For example, Polycomb repressive complex 1 (PRC1) utilises oligomerisation as part of its inhibitory mechanism31,32,33; pioneer transcription factors establish competence for gene expression by binding the nucleosomal DNA and initiating chromatin remodelling thereby exposing additional nucleosomal DNA to downstream TFs and to the transcriptional apparatus34,35,36,37,38. Histone chaperones assist nucleosome assembly and disassembly during transcription, DNA replication, and repair39,40. It is unknown if the MuvB complex could utilise any of the mechanisms listed above for actuating its transcriptional regulating functions.

Here, we set out to investigate whether MuvB uses one of these known ATP-independent chromatin remodelling mechanisms or possibly a previously undescribed mechanism. Therefore, we determined a structural snapshot of a MuvB complex in action using cryo-electron microscopy (cryo-EM), in the process of remodelling a nucleosome, and define its stoichiometry, assembly, and chromatin binding. Taken together, our results suggest that MMB is a pioneer transcription factor, and we derive a model according to which its nucleosome remodelling activity could be contributing to transcriptional activation in the context of cell proliferation.

Results

Biochemical characterisation of MuvB:nucleosome complexes

Given the compelling evidence that the MuvB complex is involved in nucleosome binding near the CHR-containing promoter of target genes7,9,12, we decided to attempt reconstitution of several MuvB complexes with the nucleosome. We recombinantly expressed all the subunits of the MuvB complex including LIN9, LIN37, LIN52, RbBP4, and a short version of LIN54 (LIN54sh) where the non-conserved N-terminus was removed (Fig. 1a) in a baculovirus/insect cell expression system. This resulted in the preparation of a highly pure protein complex that eluted in a sharp and symmetric peak in size exclusion chromatography (SEC) (Supplementary Fig. 1a).

Fig. 1: Structure of the MMB complex with the nucleosome core particle (NCP), overall complex assembly, and binding mode.
figure 1

a Schematics showing domains and functional regions on the subunits of the MMB complex represented in linear form. B-MYB N-terminus containing a DNA binding domain is disordered in the cryo-EM structure and it is represented with fading colours. LIN54 is represented with dashed lines for clarifying that this protein was not included in the core MMB (MMBcore):nucleosome structure. NCP subunit colour scheme used in this Fig. is indicated for each histone and DNA. b, c Cartoon representations of two main views of the MMBcore:nucleosome complex showing the overall architecture of the complex. Superhelical (SHL) positions in respect of the nucleosome dyad (0) are labelled in (b). DNA entry and exit is also indicated. MMBcore binding spans over three SHL positions (−8 to −6).

To analyse the interaction between MuvB and the nucleosome specifically and independently from the MuvB CHR binding function, we reconstituted a MuvB complex lacking LIN54, which is required for recognising the CHR sequence. We named this MuvB subcomplex core MuvB (MuvBcore). MuvBcore is as stable as the full MuvB complex (Supplementary Figs. 1a, b and 2a, b), demonstrating that the absence of LIN54 does not perturb the structural integrity of MuvB. We, therefore, used this complex to probe its interaction with a model nucleosome reconstituted with a 167 base pairs (bp) DNA containing the 601 Widom sequence by electrophoretic mobility shift assays (EMSA). The MuvBcore complex binds the nucleosome when present at 4-fold molar excess (Supplementary Fig. 2e). Expectedly, this complex binds a CHR-containing nucleosome with a lower affinity than the LIN54sh-containing MuvB complex (Supplementary Fig. 2g). Importantly, the affinity of MuvBcore for a nucleosome is higher than for free DNA (Supplementary Fig. 3a, b).

In order to define the subunits that directly interact with the nucleosome and the binding mode of the MuvB complex in respect of the nucleosome, we attempted reconstitution of a MuvBcore:nucleosome complex suitable for structural analysis by cryo-EM. The nucleosome complex with MuvBcore:B-MYB (MMBcore) (Supplementary Fig. 2f, lanes 6–9) manifested better homogeneity than nucleosome complexes with MuvBcore (Supplementary Fig. 2e, lanes 1–5) and with DREAMcore (Supplementary Fig. 2f, lanes 1–5), as shown by EMSA (Supplementary Fig. 2e, f). In fact MMBcore:nucleosome shifted band is sharper and more defined, unlike the other complexes tested (Supplementary Fig. 2e, f).

Cryo-EM structure of the MMBcore complex bound to a nucleosome

To determine the structure of the MMBcore:nucleosome complex (Fig. 1a), we used the GraFix protocol41 to minimise disassembly during vitrification (Supplementary Fig. 3c), and collected a large dataset on a Titan Krios electron microscope equipped with an electron-counting direct detector (see “Methods”, Supplementary Fig. 4a). Extensive two- and three-dimensional classification allowed us to resolve the overall structure of the highly heterogeneous and flexible MMBcore:nucleosome complex (see “Methods”, Supplementary Figs. 4, 5, and Supplementary Table 1). Our structure shows that a portion of MMBcore, which we call the MuvBTAIL, contacts the histone octamer and is embedded in the nucleosome disc, while a flexibly attached module that we refer to as the MuvBHEAD reaches across the nucleosomal entry DNA and sits on the outside of the DNA gyres (Fig. 1b, c). Due to the flexibility of the complex, we employed focused refinements to resolve the structures of MuvBHEAD and the MuvBTAIL at sub-nanometre resolution and the structure of nucleosome in the context of the MMBcore:nucleosome complex at 3.3 Å resolution (Supplementary Movie 1, Supplementary Fig. 5, Supplementary Table 1 and “Methods”).

By combining our cryo-EM reconstructions of the MMBcore:nucleosome complex with chemical crosslinking-mass spectrometry (XL-MS) data and an additional high-resolution cryo-EM structure of the MuvB apo-complex that we obtained (see below and Supplementary Figs. 69, Supplementary Table 1, and Supplementary Data 1), we were able to define the overall architecture of a MuvB core complex with the nucleosome (Fig. 1). Strikingly, our structure reveals that the nucleosome in the MMBcore:nucleosome complex is remodelled. Approximately 20 base pairs of the nucleosomal DNA are bent away from their unperturbed conformation and held between the MuvBHEAD and MuvBTAIL modules (Fig. 1b, c).

MuvBHEAD structure and assembly

Having identified the MuvBHEAD and MuvBTAIL modules in our reconstruction of the MMBcore:nucleosome complex, we sought to determine the detailed structure of MuvB to facilitate a mechanistic interpretation of the nucleosome-bound complex. Mass photometry measurements of the purified MuvB complex shows a main peak at around 176 kDa (Supplementary Fig. 1b), suggesting that the five MuvB subunits are present in a 1:1:1:1:1 stoichiometry in this complex. Interestingly a minor portion of the complex shows a higher molecular weight, suggesting that this complex could be forming oligomers (Supplementary Fig. 1b). In order to investigate this further, we performed SEC coupled with multi angle light scattering (SEC-MALS) on this complex at different concentrations. This analysis shows that MuvB can oligomerise in a concentration-dependent manner (Supplementary Fig. 1c). MuvB oligomers can be stabilised by briefly incubating the complex with glutaraldehyde causing them to elute as partially overlapping peaks in SEC (Supplementary Fig. 1a). A monomeric MuvB complex can be isolated from oligomers by SEC (Supplementary Fig. 1a). This cross-linking procedure was instrumental in obtaining a highly homogeneous sample that could be used for preparing cryo-EM grids (Supplementary Fig. 7a). We, therefore, obtained a 3D reconstruction map at 3.5 Å resolution (Supplementary Figs. 79). The excellent quality of the cryo-EM map (Supplementary Fig. 8 and 9) allowed the ab initio building of an atomic model of a ~70 kDa structure accounting for the WD40 domain-containing RbBP4, the Domain in Rb-related Pathway (DIRP), and tudor domain of LIN9, the middle domain (MD) and a proline-rich loop (PRL) of LIN37 (Figs. 1a–c and 2a–d). We name this portion of MuvB as RbBP4 subcomplex (Fig. 2a). Strikingly, the rest of the MuvB complex is not visible in our map indicating that these regions of the complex are highly mobile in respect of the RbBP4 subcomplex.

Fig. 2: Architecture and assembly of the MuvBHEAD complex.
figure 2

ad Cartoon representation showing the structure of the RbBP4 subcomplex. The position of this complex in relation to the entire model is illustrated in (a, b). Blades of the RbBP4 WD40 domain are numbered in (b). RbBP4 is shown is surface representation in c to highlight the extended interaction of LIN9 N-terminal domains (DIRP and tudor) and LIN37 middle domain proline-rich loop (PRL). e, f LIN37CTD bridges the RbBP4 complex to the nucleosomal DNA (SHL −7).

Except for the LIN37 PRL, we note that our results obtained using the full MuvB apo complex are consistent with the results of a recent crystallographic study reporting the structure of an isolated LIN9DIRP-Tudor-RbBP4-LIN37MD subcomplex9. The RMSD between our cryo-EM structure and the published crystal structure is of 0.658 Å.

In our structure, LIN9 amino-terminal region, including the DIRP and the tudor domain interact extensively with RbBP4 in four different sites around the conserved RbBP4 amino-terminal helix α1 (Fig. 2b–d), which is a hub of protein-protein interaction in several other RbBP4-containing complexes28,29,30. This interaction buries a total surface area of 2809 Å2.

RbBP4 WD40 domain consists of a propeller made of seven blades and it exhibits a so called “top” side, which is less wide than the bottom one (Fig. 2b). At the top side, blades one and seven form a first patch for LIN9 α1 and 2, which are oriented about 90 degrees in respect to each other (Figs. 1b, 2b–d and Supplementary Fig. 9b). Furthermore, LIN9 α2 packs against the RbBP4 amino-terminal helix (Figs. 1b, 2b–d and Supplementary Fig. 10c). LIN9 α3, 4 and 5 augment additional interactions with the carboxy-terminal part of the RbBP4 amino-terminal helix, where LIN9 reaches the bottom side of the WD40 domain at blade 6 (Fig. 2c, d). The basic loop following helix 5 occupies the histone H4 binding site of RbBP442, by being sandwiched between RbBP4 α1 and the PP loop coming from blade 6 (Fig. 2b,c). Consistently, a reconstituted MuvB complex is not able to bind H4 peptides9.

The LIN9 tudor domain is sandwiched between LIN9 α3 and 6, and the beginning of RbBP4 α1 (Figs. 1b, 2b–d and Supplementary Fig. 9e). The latter interaction is mediated by RbBP4 Arg15 and Glu14, which interacts with the Tyr269-Glu270, Arg229-Arg231 pairs coming from LIN9 tudor domain (Supplementary Fig. 9e). Similar arginine pairs in tandem tudor domains are involved in DNA binding43. Given that these arginine residues are heavily interacting with RbBP4, it is unlikely that LIN9 tudor domain is involved in DNA binding. Moreover, this domain is not involved in binding of histone peptides9, thereby suggesting that LIN9 tudor domain has exclusively a structural role in assembling the RbBP4 subcomplex and not a recruiting role to either DNA or histones. Consistently, in our MMBcore:nucleosome structure, the tudor domain is located far away from both histones or DNA and does not make any obvious contact with them (Fig. 1b).

LIN37MD is formed by 1 helix bearing a conserved PLYxCRxW sequence (Figs. 1b, 2a–d and Supplementary Figs. 9a–c10a). These residues interact with LIN9 α1 and 2, and with RbBP4 insertion loop, coming from blade 1 (Supplementary Fig. 9b, c). This is followed by a short loop and a small β-sheet, which are saddled onto LIN9 α1 (Supplementary Fig. 9c). This interaction involves LIN37 Glu111 and LIN9 Arg108 and extended hydrophobic contacts (Supplementary Fig. 9c).

Strikingly, a C-terminal PRL connects LIN37 to LIN9 α4 thereby buttressing LIN9 into RbBP4 (Supplementary Fig. 9a, and Figs. 1a, c, 2c, d). Our structure suggests that LIN37 has a key role in stabilising the MuvB complex and, consistently, MuvB reconstituted without LIN37 can still form but it is less stable as shown by mass photometry (Supplementary Fig. 2c, d). This is consistent with other findings showing that LIN37 is not required for MuvB assembly44, however, our structural-functional analysis shows in addition that it is indeed involved in complex stability.

Our RbBP4 subcomplex structure readily fits onto the MuvBHEAD module of the MMBcore cryo-EM map where it occupies most of this cryo-EM density (Supplementary Fig. 5a,c, d and Supplementary Movie 1). The docking is unambiguous as secondary structures visible in the cryo-EM map match with the atomic coordinates (Supplementary Fig. 5d and Supplementary Movie 1). Additional density extends from the PRL of LIN37 and could be assigned unambiguously to the conserved LIN37CTD, which folds into a kinked α-helix (Fig. 1b, c, Supplementary Figs. 5a, d, 10b, and Supplementary Movie 1). This assignment is consistent with our XL-MS data (Supplementary Fig. 6c) where several cross-links are observed between LIN37CTD and LIN9 α5 and 6, thereby supporting the fact that these portions of the complex are in proximity to each other.

LIN37CTD contains many basic residues which face the nucleosomal entry DNA of the MMBcore:nucleosome structure (Figs. 1b, c, 2e, f, 3a–c), suggesting that this domain is involved in anchoring the MuvBHEAD onto the outer side of the nucleosomal entry DNA. Consistent with this, a MuvBcore complex reconstituted without LIN37CTD (ΔLIN37CTD mutant) binds the nucleosome with lower affinity than the control MuvBcore (Fig. 3f, lanes 5–7 versus 1–4).

Fig. 3: MuvBTAIL complex forms a V-shaped channel with MuvBHEAD for the nucleosomal entry DNA.
figure 3

a, b close-up view from a on the V-shaped protein channel formed by B-MYBCTH:LIN9CC:LIN52CC and LIN37CTD. For clarity, LIN37CTD loops and the LIN9CTD are not shown. c This channel, which contains basic residues (indicated) clamping the entry DNA, is here represented as electrostatic surface potential representation (−/+5.000). d, e Structure based-mutational analysis to functionally validate interactions between LIN37, B-MYB, and the nucleosome in EMSA. Either MuvBcore or MMBcore containing LIN37 or not (ΔLIN37), or containing a LIN37 NTD truncation (LIN37ΔNTD), were assayed for their ability to bind NCP 167 (at a molar ratio of 1:1, 1:2, 1:4 NCP:MuvB). Quantifications for d are plotted in (e). Data are presented as mean values ± standard error of the mean of three independent experiments (n = 3). f MuvBcore containing FL-LIN37, or its NTD truncation (LIN37ΔNTD), or its CTD truncation (LIN37ΔCTD), were assayed as in (d). This experiment was repeated independently three times with similar results. gi Sequence alignment showing conservation on the LIN37 NTD. The RRKKRR motif region mutated in the assays in h and i is indicated. h, i MuvBcore containing LIN37, or LIN37 Arg to Glu point mutants i.e., R64E/R68E/R73E/R74E/R77E/R78E (RRKKRR), R19E/R37E (NTD_R) and R19E/R37E/K5A/K7A/K10A/K32A/K46A/K54A (NTD_RK), were assayed as in d for either nucleosome (h) or DNA (i) binding. These experiments were repeated independently three times with similar results. j EMSAs with NCP 167 and either MuvBcore or MuvBHEAD-LIN37FL or LIN9CC:LIN52 or LIN9CC-CTD:LIN52, to assay the ability of the MuvBHEAD and the MuvBTAIL to bind nucleosome. Molar ratios used here were 1:1, 1:2, 1:4 NCP:MuvB complex. This experiment was repeated independently three times with similar results.

In conclusion, RbBP4 subcomplex is formed at RbBP4 blade 1, and 6 where LIN9 DIRP and tudor bind, and this assembly is stabilised by LIN37 middle domain and PRL (Fig. 1a and Supplementary Fig. 9a). Binding to the nucleosome stabilises LIN37CTD in respect to the RbBP4 subcomplex, which extends from the LIN37PRL to the outer surface of the nucleosomal entry DNA (Fig. 1b), thereby contributing to nucleosome binding.

MuvBTAIL structure and assembly

MuvBTAIL module is anchored on the nucleosomal entry DNA extending towards the nucleosomal disc (Fig. 1b, c). Strikingly, MuvBHEAD and MuvBTAIL form a V-shaped tunnel which clamps onto the entry DNA (Figs. 1b, c3b, c and Supplementary Movie 1).

Most of the MuvBTAIL is composed of the LIN9 C-terminus including a coiled-coil domain interacting with LIN52 coiled-coil domain (hereafter named LIN9:52CC) and with the B-MYB C-terminal helix (CTH). The structure of this MMB subcomplex was previously solved by X-ray crystallography45 and it could be fit unambiguously in our MuvBTAIL:nucleosome cryo-EM reconstruction (Supplementary Fig. 5a, b, f and Supplementary Movie 1).

LIN9:52CC binds the entry DNA at the superhelical location (SHL) −8, located at the latest 10 base pairs at one end of the 167 nucleosomal DNA construct used. This interaction is mainly mediated by the B-MYBCTH, which contains two arginine residues oriented towards the DNA (i.e., Arg672 and Arg682) (Figs. 1b, c and 3b, c). Consistent with this, MMBcore binds the nucleosome better than MuvBcore (Fig. 3d, compare lanes 1–4 versus lanes 8–10 and Fig. 3e).

The basic LIN37NTD presents several crosslinks with MuvBTAIL mainly at the LIN9:52CC module (Supplementary Fig. 6c and Fig. 3g) suggesting that this region is in proximity of the nucleosomal DNA. Therefore, we investigated if LIN37NTD could contribute to nucleosome binding. Strikingly, deletion of this portion of LIN37 abolishes the binding of MuvBcore to nucleosomes (Fig. 3d, lanes 1–7 and Figs. 3e and 3f, lanes 1–4 vs lanes 8–10). Importantly deletion of LIN37NTD did not alter the structural stability and the assembly of MuvBcore (Supplementary Fig. 2h). Given the striking effect of this mutation, we narrowed down the portion of LIN37NTD that contributes to nucleosome binding and found that a sequence containing a RRKKRR motif (residues 64-78) is absolutely required by MuvBcore to bind both the nucleosome and free DNA (Fig. 3g–i, lanes 1–7). Strikingly, the same motif has DNA binding function in INCENP46.

In summary, the MMBcore complex clamps the nucleosomal entry DNA via a protein tunnel formed by MuvBHEAD and MuvBTAIL (Fig. 1b, c, and 3b, c), and DNA binding depends mainly on LIN37 and B-MYB.

Importantly, both MuvBHEAD and MuvBTAIL are required for full nucleosome binding activity. In fact, the MuvBHEAD-LIN37NTD construct, as well as constructs containing either a LIN9CC:LIN52 or a LIN9CC-CTD:LIN52 subcomplex, are not able to bind the nucleosome individually (Fig. 3j, lanes 1–4 versus lanes 5–13).

MMB remodels the nucleosome

In our MMBcore:nucleosome structure, the nucleosome is remodelled. Comparison between this nucleosome and an apo nucleosome reconstruction, isolated from our cryo-EM dataset by 3D classification (Fig. 4a–c and Supplementary Fig. 4c), shows that 20 base pairs of the entry DNA (from SHL −6 to SHL −8) are rotated 50 degrees over the nucleosomal disc and held in position by the MMBcore complex (Fig. 4a–d). This entry DNA is lifted about 25 Å upwards at its end as visible in our density. Moreover, the entry DNA is bent inwards in respect to a normal nucleosome (Fig. 4e). Strikingly, density extending from the N-terminus of LIN9CC passes under the entry DNA wedging it (Supplementary Fig. 5f and Fig. 1b, c), before continuing onto the LIN9 N-terminus in the MuvBHEAD module. This suggests that the MMB complex topologically entraps the nucleosomal exit DNA.

Fig. 4: MMB initiates chromatin remodelling. Details of distortions at the entry and exit DNA sites.
figure 4

Dyad view of the MMBcore:NCP structure is shown in (b), colour coded as specified in (a). The LIN9CTD is not shown for visualising better the distorted nucleosomal DNA. c cryo-EM density is displayed for the NCP complex within either the MMBcore:NCP (Supplementary Fig. 5g) or the apo NCP (Supplementary Fig. 4c). This comparison shows the conformational change at −/+7 SHL locations. df The DNA distortions are visualised in detail with different views of the NCP model (DNA in surface representation) from the MMBcore:NCP structure which is superposed to an apo NCP model (DNA in ribbon representation). g MNase digestion profile of NCP167 in the absence and presence of MMBcore. This experiment was repeated independently three times with similar results. h Cartoon illustration of nucleosome:H1 complex (PDB-ID: 5Nl0) i EMSA experiment shows H1 titration reduces NCP association with the MMBcore. This experiment was repeated independently three times with similar results.

On the other side, ~10 base pairs of the exit DNA at SHL +7 are not visible in our complex (Fig. 4f), indicating that this stretch of DNA is disordered. Interestingly, the unwrapping at SHL +7 is also observed in structures of ATP-dependent chromatin remodelling complexes with the nucleosome47,48.

Out of 167 bp, only 147 bp are visible in our MMBcore:nucleosome structure. Consistently with this, MNase digestion assay of the nucleosome in the presence of MMBcore shows the stabilisation of a 147 bp fragment (Fig. 4g).

Structural comparison of the MMBcore:nucleosome complex, with the histone H1:nucleosome complex49 shows that simultaneous MMB and H1 binding on the same nucleosome would be incompatible (Fig. 5e and h). This is because, firstly, similar to the linker H1 histone, MuvBHEAD localises near the nucleosome dyad (Fig. 4b, h); second, the path of the exit DNA is heavily rearranged in MMBcore:nucleosome complex, and not compatible with the formation of a H1:nucleosome complex (Fig. 4d, e and h). Consistent with this, adding H1 to a MMBcore:nucleosome complex reduces the amount of MMBcore:nucleosome, while the H1:nucleosome complex forms (Fig. 4i).

Fig. 5: Interactions of MMBcore with histones and model of pioneer transcription factor function of the MMB complex.
figure 5

The last helix of LIN9 (α12) interact with the nucleosome acidic patch and histone H4. A zoomed view of a is shown in (b). Although the resolution of our structural data in this area does not allow for resolving side-chains, possible side-chains interactions between the LIN9 C-terminal NRD sequence and residues from H2A and H2B histones (acidic residues are part of the nucleosome acidic patch) and H4 are modelled based on docking of secondary structure elements in the NCP:LIN9CTD map (Supplementary Fig. 5f, h) and here depicted. c LANA peptide competes with MuvB for binding the nucleosome. A complex of MuvB with a nucleosome containing a CHR sequence was run in an EMSA in either the absence or the presence of increasing concentrations of LANA peptide. This experiment was repeated independently three times with similar results. d Modelling of a MMB-FOXM1:nucleosome complex. The AlphaFold model from Supplementary Fig. 6d was superposed to the LIN9CTD. The interaction with the CHR element was modelled by superposing the DNA from PDB-ID: 5FD3, into the SHL −8 region of our MMBcore:nucleosome complex. FOXM1 was modelled by superposing the DNA from PDB-ID: 7FJ2 into the SHL −7 region of our MMBcore:nucleosome complex. The positioning of FOXM1 at the entry DNA is suggested by nucleosomal binding preferences of FOX TFs related to FOXM167. e Schematic cartoon of the complex in (d). Protein loops in a and d are hidden for reducing the complexity of the illustration. e, f A model of pioneer transcription factor function of the MMB complex is illustrated. CHR sites are located right upstream of a nucleosome, and they are exposed as parts of nucleosome depleted regions (NDRs)7,68. During S phase when MMB forms, remodelling of a nucleosome by the pioneer transcription factor function of MMB would allow exposure of cis acting sequences, including the TSS. This would allow establishment of competency for transcriptional activation at cell cycle genes. This, combined with the recruitment of downstream factors, and ATP-dependent chromatin remodellers, could promote formation of the preinitiation complex (PIC) and recruitment of the RNA polymerase. MMB can also participate in promoter–enhancer interactions56,57. Full activation of G2/M genes is a very complex process, which requires post-translational modifications on both B-MYB and FOXM120,27.

In conclusion, the MMBcore complex binds the entry DNA thereby distorting it, leading to the exposure of about 10 base pairs of nucleosomal DNA at and underneath the +/−7 SHL locations. We hypothesise that this exposed DNA, can be accessible to downstream TFs and can also facilitate the assembly of the transcriptional machinery, as discussed further below.

MMBcore interactions with histones and modelling of a full MMB:nucleosome complex

It has been shown that RbBP4, when bound to LIN9 and LIN37, is able to bind the H3 histone tail9. In our structure, RbBP4 is in proximity of the histone H3 tail (Fig. 1c), indicating that the binding mode of MMB with the nucleosome would allow interactions between RbBP4 and the H3 tail. However, our cryo-EM data do not resolve such an interaction directly. In general, histone tails are not required for MuvBcore:nucleosome complex formation (Supplementary Fig. 3d). Also, we cannot see any effect when adding the H3K4me3 modification, which is present at MuvB target promoters12, on MuvBcore:nucleosome complex formation (Supplementary Fig. 3e).

On the other hand, our cryo-EM maps show density for the conserved LIN9CTD which emerges from the C-terminal end of the LIN9CC domain (Supplementary Figs. 5f, h, 10d, Fig. 1b, c, and Supplementary Movie 1). This domain is composed of a 4-helix bundle where the longest C-terminal helix (α12) extends onto the nucleosome disc until its very C-terminus docks into the nucleosome acidic patch and the neighbour H4 histone (Fig. 5a, b). Consistently, increasing concentrations of the acidic patch binder LANA peptide50, can compete with MuvB for nucleosome binding, even in presence of the CHR element (Fig. 5c).

Moreover, our crosslinking data suggests that the LIN9CTD interacts with the LIN54CTD (Fig. 1a and Supplementary Fig. 6d). We used AlphaFold51 for modelling this interaction (Supplementary Fig. 6d–f) and thereby obtaining a complete model of the full MMB:nucleosome complex (Fig. 5d). This modelling exercise suggests that LIN54CTD may occupy a cavity between LIN9CTD, the LIN9:52CC, and the DNA. This arrangement would place the DNA binding domains of LIN54 in proximity of a CHR element at the SHL −8 of the nucleosome (Figs. 1a and 5d). LIN54sh crosslinks with all the other MuvB subunits (Supplementary Fig. 6e), thereby supporting our modelling that LIN54 is present at the centre of MuvB between TAIL and HEAD modules.

In conclusion, our data suggest that LIN9CTD is a critical component of the MMB complex, which connects MuvB directly to the nucleosome acidic patch and to the CHR via recruiting LIN54.

Discussion

Our cryo-EM structure shows that the MMB complex consists of two main modules, here named MuvBHEAD and MuvBTAIL. These two modules are flexibly linked by the main scaffolding protein of the complex LIN9 (Fig. 1b, c and Supplementary Movie 1). LIN9 orchestrates the assembly of MuvBHEAD via its N-terminal DIRP and tudor domains and the assembly of MuvBTAIL via its C-terminal coiled-coiled domain, which recruits LIN52CC and B-MYBCTH. LIN9 together with RbBP4 recruits LIN37 via its middle domain at the MuvBHEAD. LIN37 is a stabilising factor since it buttresses LIN9 on RbBP4 and also, we show that LIN37 within the MuvB complex is a DNA-binding protein. The LIN37 termini are essential for full nucleosome binding ability of the MuvB complex. Strikingly, these regions become less important for nucleosome binding in presence of B-MYB. In fact, the MMBcore complex can still bind a nucleosome in the absence of LIN37 (Fig. 3d). Our structure shows an explanation for this. B-MYBCTH recruited by LIN9:52CC directly contacts the entry DNA upstream of LIN37 (Figs. 1b, 3b, c and Supplementary Movie 1). This is consistent with prior data showing that B-MYB C-terminus is sufficient for B-MYB-dependent transcriptional activation52,53 and that LIN37 is dispensable for the transcriptional activating functions of MuvB44,54.

Our MMB:nucleosome complex also shows that MMB binds the nucleosome with a 1:1 stoichiometry and initiates chromatin remodelling, which is a functional feature of pioneer TFs. By clamping the entry DNA, MMB lifts 20 base pairs thereby exposing DNA at SHL −7. The DNA following the distortion is not visible in other pioneer TFs complexes structures34,35. In our structure, this region is visible because it is clamped by the MMB complex itself.

How is this remodelling achieved? Such a conformational change requires disruption of interactions between histone H3 with SHL −7 (Fig. 4d). Similarly to other pioneer TFs, in the case of MMB, disruption of histone:DNA interactions may be compensated by the energy of binding between MMB and the nucleosomal DNA. In addition, RbBP4 is heavily negatively charged, and it is positioned in proximity of SHL −6 (Fig. 4b), where the DNA distortion starts. Based on this, an intriguing hypothesis emerges whereby a charge repulsion mechanism between RbBP4 and the DNA at SHL −7 could contribute to DNA path deviation. This could also explain the high mobility of MuvBHEAD in respect to the nucleosome as observed in our cryo-EM data (Supplementary Fig. 4f). Moreover, we observe density coming from LIN9CC N-terminus directed towards MuvBHEAD (Fig. 1c and Supplementary Fig. 5f) This suggests that MMB topologically entraps the entry DNA, thereby contributing to hold the DNA in position.

The conformation of the entry/exit DNA in our MMB:nucleosome complex structure would not be compatible with linker histone H1 binding. Consistently, we show that H1 can compete MMB from the nucleosome (Fig. 4e and 4h, i). This is consistent with the fact that CHR-containing regions have typical features of actively transcribing genes12 and not of heterochromatin regions, which are enriched in H155.

Importantly, LIN54 and the MuvB target CHR sequence are not included in our MMB:nucleosome structure, indicating that the nucleosome-dependent remodelling by the MMB complex is independent of CHR recognition and so the remodelling would be an intrinsic property of the core MMB complex. Furthermore, we observe that the last helix of LIN9 is inserted into the acidic patch of the nucleosome (Fig. 5a, b). This suggests that a direct interaction of the MMB complex with the nucleosome contributes with anchoring and positioning the complex onto the nucleosome. The ability of binding both nucleosomal DNA and the histone octamer could explain why MuvB stabilises nucleosomes9.

Considering previous and our current work, we propose a model of MMB function as a pioneer TF complex (Fig. 5d–f). MuvB disassembled from DREAM at the G1/S phase transition assembles with B-MYB, thereby forming the MMB complex (Fig. 5d, e). The MMB complex is located at CHR DNA elements, in proximity of the TSS and the +1 nucleosome. This nucleosome is remodelled by MMB thereby exposing DNA at −/+7 SHL locations (Fig. 5d, e).

This would facilitate the recruitment of downstream transcription factors (i.e., FOXM1)20, and the assembly of the basal transcriptional machinery at the TSS. MMB has been shown to participate in interactions between the basal promoter and enhancer elements56,57, which underpins the importance of this complex in transcriptional activation of mitotic genes.

In conclusion, our structure-function investigation of the MuvB complex gives insights on complex assembly, interactions, and binding mode with a nucleosome. Our study paves the way for the investigation of a mechanistic model of MuvB function in transcriptional activation and B-MYB mediated oncogenesis.

Methods

Protein expression and purification

The codon optimised coding sequences (CDSs) of the wild-type, truncations, and point-mutants of human MuvB and B-MYB proteins were obtained from Genscript or GeneArt (Thermo Fischer Scientific) and cloned into pFASTBAC-1 or pACEBAC-1 vector using standard methods. Construct boundaries are as follows: LIN9FL(1–542), LIN9MuvB(HEAD)(94–278), LIN9CC(325–450), LIN9CC-CTD(332–542), LIN37FL(1–246), LIN37ΔNTD(92–246), LIN37ΔCTD(1–132), RbBP4FL(1–425), LIN52FL(1–116), LIN54CTD(635–749), LIN54short(515–749), B-MYBFL(1–700).

Point-mutants in LIN37FL: LIN37RRKKRR(R64E/R68E/R73E/R74E/R77E/R78E), LIN37NTD_R(R19E/R37E), LIN37NTD_RK(R19E/R37E/K5A/K7A/K10A/K32A/K46A/K54A). For MuvB complex expression and assembly, Hi-5 cells at a density of 1.5 × 106 cells ml−1 were co-infected with pre-cultures of Sf9 cells each pre-infected with recombinant MuvB baculoviruses. Cells were harvested at a viability of ~85% which typically took approx. 48 h and stored at −80 °C until processing. B-MYB protein was expressed separately in a similar manner to MuvB.

Hi-5 cells expressing MuvB (sub-)complexes (wild-type or mutants) were resuspended in buffer containing 50 mM Tris pH8, 300 mM NaCl, 0.5 mM TCEP, 5% Glycerol, 1 mM EDTA, 0.1 mM PMSF, 2 mM benzamidine, 5 U ml−1 benzonase (Novagen), Complete EDTA-free protease inhibitors (Roche) and lysed by sonication followed by centrifugation. The filtered soluble fraction of the lysate was loaded on a Strep-Tactin Superflow (30060, Qiagen) column for purification using the StrepIIx2 tag on LIN37 or RbBP4 (the latter on the MuvB ΔLIN37 complex). Bound protein was washed with 20 CV with buffer containing 50 mM Tris pH8, 300 mM NaCl, 0.5 mM TCEP, 5% Glycerol, followed by a wash with 4 CV 50 mM Tris pH8, 1000 mM NaCl, 0.5 mM TCEP, 5% Glycerol and subsequently eluted with buffer containing 50 mM Tris pH8, 300 mM NaCl, 0.5 mM TCEP, 5% Glycerol, 2.5 mM d-Desthiobiotin. Strep tag was removed by a 4 °C o/n incubation with 3 C protease. Final polishing was achieved by SEC on a S200 or Superose 6 column (GE Healthcare) in buffer containing 20 mM HEPES pH 8, 150 mM NaCl, 0.5 mM TCEP. Peak fractions were pooled and concentrated up to 5 mg/ml.

B-MYB protein was purified separately as per above protocol. For the MMB complex assembly, freshly prepared MuvB:B-MYB proteins were mixed at a 1:4 ratio and incubated on ice for 1 h followed by SEC on a Superose 6 column in buffer containing 20 mM HEPES pH 8, 100 mM NaCl, 0.2 mM TCEP.

Recombinant H1 with N-terminal Hexahistidine tag and C-terminal StrepIIx2 tag was expressed in Escherichia coli. Protein expression was induced by adding 0.5 mM IPTG when OD600 reached 0.7, and allowed to continue overnight at 18 °C. Cells were lysed by sonication. H1 protein was purified using Strep-Tactin Superflow Plus (QIAGEN) and HiTrap Heparin HP (GE Healthcare) according to standard protocol. Both N-terminal and C-terminal tags were removed by overnight 3C protease digestion. Untagged H1 protein was passed to a S200 16/600 column (GE Healthcare). Pure H1 fractions were concentrated, flash frozen in liquid Nitrogen, and stored at −80 °C for later use.

NCP167 assembly

Recombinant human histones H2A, H3, H4, and Xenopus laevis H2B were overexpressed separately in Escherichia coli, purified from inclusion bodies, and assembled together into a histone octamer as described previously58. Briefly, the inclusion bodies for each histone were resolubilised in Urea-containing buffer individually and further purified by ion exchange column. Denatured histones were mixed at a H2A:H2B:H3:H4 molar ratio of 1.2:1.2:1:1, and dialysed against refolding buffer containing 2 M NaCl, 10 mM Tris-HCl pH 7.5, 1 mM EDTA, and 10 mM β-mercaptoethanol. Refolded histone octamer was purified from aggregates and excess H2A-H2B dimer using gel filtration on a S200 16/600 column (GE Healthcare), run in the same refolding buffer. Pure histone octamer fractions were concentrated, mixed to a final glycerol concentration of 50% (v/v), and stored at −20 °C for later use.

Nucleosomal DNA was prepared using Plasmid Giga Kit (QIAGEN). Plasmid containing four repeats of 167 bp sequence (TAC CCG GGA TAT CGA GAA TCC CGG TGC CGA GGC CGC TCA ATT GGT CGT AGA CAG CTC TAG CAC CGC TTA AAC GCA CGT ACG CGC TGT CCC CCG CGT TTT AAC CGC CAA GGG GAT TAC TCC CTA GTC TCC AGG CAC GTG TCA GAT ATA TAC ATC CGA TAT CCC GGG TA) was transformed into XL10 Gold bacteria. Cells were grown overnight, and plasmid extraction was done according to QIAGEN Plasmid Giga Kit protocol. Extracted plasmid was digested with BstZ17I (NEB) overnight. The 167 bp nucleosomal DNA was purified from vector DNA by ion exchange column. Trace amount of BstZ17I was inactivated by phenol-chloroform extraction.

Nucleosomal DNA containing CHR sequence was prepared by Polymerase Chain Reaction (PCR). The PCR was performed according to standard protocol using Taq DNA Polymerase with ThermoPol® buffer (NEB). Widom 601 sequence was used as a template, and the PCR reaction was scaled up to 20 ml reaction volume. PCR product was purified using HiTrap Q HP anion exchange column (GE Healthcare). Fractions containing pure nucleosomal DNA were pooled, ethanol precipitated, dissolved in water, and finally stored at −20 °C for later use. The sequence of Nucleosomal DNA containing CHR sequence is GAG TTC AAA ATC GAG AAT CCC GGT GCC GAG GCC GCT CAA TTG GTC GTA GAC AGC TCT AGC ACC GCT TAA ACG CAC GTA CGC GCT GTC CCC CGC GTT TTA ACC GCC AAG GGG ATT ACT CCC TAG TCT CCA GGC ACG TGT CAG ATA TAT ACA TCC GAT.

Nucleosome (NCP167 and NCP containing the CHR sequence) reconstitution was performed as described previously58 with minor modifications. Briefly, histone octamer and nucleosomal DNA were mixed at 1.1:1 ratio in a high-salt reconstitution buffer containing 2 M NaCl, 20 mM HEPES pH 7.5, 5 mM DTT, 1 mM EDTA. Sample was transferred into a dialysis button (Hampton Research), and the button was placed inside a beaker containing the same high-salt reconstitution buffer. Low-salt reconstitution buffer containing 20 mM NaCl, 20 mM HEPES pH 7.5, 5 mM DTT, 1 mM EDTA was slowly pumped into the dialysis beaker to gradually reduce the salt concentration and allow NCP formation. Sample was transferred to a fresh low-salt reconstitution buffer after 24 h and allowed to further equilibrate for 4 h. Soluble NCP167 was separated from aggregates by centrifugation, and stored at 4 °C for subsequent use.

The NCP167 with the H3K4me3 modification was purchased from ActiveMotif.

Tailless NCP167 preparation

Reconstituted NCP167 was diluted in Digestion Buffer containing 20 mM HEPES pH 7.5, 75 mM NaCl, 1 mM TCEP to a final concentration of 1.8 μM. TPCK-trypsin immobilised on magnetic beads (Takara) was washed and equilibrated in the same Digestion Buffer, and mixed with NCP167 at 1:1 volume ratio. Digestion was stopped after 20, 40, 60, and 80 min by removing the magnetic bead-coupled trypsin from the solution. All samples were tested on SDS-PAGE to assess the completion of histone tails removal.

MNase protection assay

The NCP was mixed with MMBcore at a concentration of 0.6 and 1.8 μM, respectively and topped up with EMSA buffer containing 20 mM HEPES pH 7.5, 50 mM NaCl, 0.5 mM TCEP to final volume of 10 μL. As a control, a reaction containing NCP alone was also prepared separately. MNase reaction buffer (NEB) was added to each sample at a final concentration of 1×. 2.5 μL of 1000 unit/μl MNase (NEB) was then added to each sample. Reaction was conducted at room temperature. 3 μL were aliquoted from each sample at 30 s, 1 min, and 1.5 min after the MNase addition. Reaction was terminated by adding Proteinase K solution (ThermoFisher Scientific) into each aliquot. Samples were analysed on NovexTM 4–12% TBE gel (ThermoFisher). The gels were stained with SYBR Safe and scanned using Typhoon FLA 9500 imager.

SEC-MALS

Analytical SEC-coupled with multi-angle laser light scattering (MALS) was performed on an Agilent 1260 Infinity II HPLC system. Light scattering and differential refractive index (dRI) profiles of the samples were analysed in-line using the DAWN and Optilab instruments from Wyatt Technologies. Samples of purified MuvB complex at different concentrations (20 μL volume) were applied to a Superose 6 3.2/300 column (GE Healthcare) equilibrated with 20 mM HEPES pH 7.5, 150 mM NaCl, 0.5 mM TCEP at a flow rate of 0.05 ml/min. The data were analysed according to the Zimm light scattering model and a dn/dc of 0.185 mL/g on software ASTRA 7.3.1 (Wyatt).

Mass photometry

Mass photometry experiments were performed on a Refeyn OneMP mass photometer (Refeyn Ltd, Oxford, UK) as described previously59. Microscope coverslips (630-2187, VWR) and silicone CultureWell™ gaskets (GBL103250, Sigma-Aldrich) were cleaned with 100% isopropanol and water and dried with compressed air. The gaskets were placed on top of the cleaned coverslips on the sample stage of the mass photometer as per manufacturer’s instructions.

All measurements were performed at least five times independently, in separate wells, in buffer containing 20 mM HEPES pH 8, 150 mM NaCl, 0.5 mM TCEP. Firstly, 12 μL buffer was pipetted into a gasket-well followed by focal point acquisition using the autofocus functionality in the Refeyn AcquireMP 2.3.1 software. Secondly, 2 μL of 500 nM of MuvB complexes (diluted to this concentration just prior to measurement to avoid complex disassembly) was pipetted in the buffer-containing gasket-well and mixed to observe individual landing events below saturation levels. Mass photometry movies of 6000 frames were recorded from a 10.8 × 10.8 μm instrument field of view. Data were processed using the default pipeline on the Refeyn DiscoverMP 2.3.0 software. Individual particle contrasts from each Video were converted to mass using a contrast-to-mass (C2M) calibration which was performed in the acquisition buffer. Data were plotted as normalised histograms with a bin width of 4.4 and fitted to a Gaussian peak. For the calibration, 2 μL of a 1:100 pre-diluted NativeMark unstained protein standard (LC0725, Thermo Scientific) was added to an acquisition gasket-well; the 66, 146, 480, 1048 kDa masses were used for a standard calibration curve in the DiscoverMP software.

Cross-linking of MuvB-apo for cryo-EM

30 μL of freshly purified MuvB complex at a concentration of 28 μM was incubated with 0.1 μL 25% Glutaraldehyde (G5882, Sigma-Aldrich) for 2 min on ice followed by quenching with 2 μL 500 mM Tris pH8 and immediate SEC on a Superose 6 3.2/300 column (GE Healthcare) in buffer containing 20 mM Tris pH 8, 100 mM NaCl, 0.2 mM TCEP. Eluted fractions were analysed by SDS-PAGE and MuvB monomer-containing fractions were pooled and concentrated to 20 μL and re-injected on the SEC column as described above to remove any higher-order contaminants. Monomeric-containing fraction was used immediately for cryo-EM grid preparation.

Gradient fixation (GraFix)

NCP167 and MMBcore were mixed at 1:2.5 molar ratio and incubated at 4 oC for at least 30 min. GraFix was done as described previously41. A continuous gradient containing 20 mM HEPES pH 7.5, 50 mM NaCl, 0.1 mM TCEP, 10–30% sucrose, 0–0.1% glutaraldehyde was generated using a Gradient Master (Biocomp). Samples were centrifuged using a SW60 Ti ultracentrifuge rotor at 165, 100 × g for 16 h at 4 °C. Samples were manually fractionated by hand, and the crosslinking was terminated by adding Tris-HCl pH 7.5 to a final concentration of 35 mM. GraFix fractions were analysed on a 5% native PAGE, and those containing crosslinked NCP167-MMBcore complex were concentrated, and buffer exchanged into 10 mM Tris-HCl pH 7.5, 50 mM NaCl, 0.1 mM TCEP.

Cryo-EM grid preparation

MuvB-apo complex

2 μL of the cross-linked monomeric MuvB complex was applied to Quantifoil R1.2/1.3 Cu 300 grids which were glow discharged using Easiglow (Pelco) at 15 mA for 1 min. After sample application, the grids were immediately blotted for 5 s at 4 °C and 100% humidity and plunged into liquid ethane using a Vitrobot mark IV (Thermo Fisher).

MMBcore:NCP complex

2 μL of the GraFix-treated sample was applied to Quantifoil R1.2/1.3 Cu 300 grids which were glow discharged using Easiglow (Pelco) at 15 mA for 1 min. The grids were incubated for 5 s and then blotted (blot force 3) for 5 s at 4 °C and 100% humidity and plunged into liquid ethane using a Vitrobot mark IV (Thermo Fisher).

Cryo-EM data collection and processing

MuvB-apo complex

We collected 10,183 movie stacks in EER format on a Glacios (Thermo Fisher) Cryo-TEM 200 kV equipped with a Falcon 4 detector, based at ICR, at a nominal magnification of 240,000, which yielded a pixel size of 0.567 Å per pixel. These movies were live-processed during collection using the cryoSPARC live programme60.

Movie stacks were frame aligned and binned 4 times giving a pixel size of 2.268 Å per pixel. 8820 Images with resolution better than 6 Å and a total motion of 30 pixels (estimated during frame alignment) were selected for further processing. A blob picker was used for initial particle picking during the live processing, and for obtaining initial 2D class averages. Templates generated from the latter, were used for template picking and TOPAZ61 training and picking with cryoSPARC. In total, 1,184,371 particles were picked; duplicated particles were removed by using the “remove duplicates” function implemented in cryoSPARC. This step removed doubly picked particles within 60 Å (shortest dimension of the MuvB particles is ~50 Å). These particles were cleaned further with another 2D classification step (Supplementary Fig. 7). These particles were piped in the “ab initio model” function implemented in cryoSPARC. Three classes were used for allowing sorting of particles in the process. This generated one class containing 250,174 particles with clear domain structures and with resolved helices. 2D classification of this class showed improvement on the signal-to-noise of the class averages. These particles were then imported in RELION-3.1.162 by using pyem and the csparc2star.py script by Daniel Asarnow: Daniel Asarnow, Eugene Palovcak, & Yifan Cheng (2019). asarnow/pyem: UCSF pyem v0.5 (v0.5). Zenodo. https://doi.org/10.5281/zenodo.3576630. Refinement and further map improvement was performed as described in63 and in Supplementary Fig. 7. This yielded a map at the resolution of 3.5 Å.

MMBcore:NCP complex

We collected 22,293 Movie stacks in MRC format on a Titan KRIOS 300 kV cryo-TEM at eBIC (Diamond light source, BI21809-34 and BI21809-35 sessions), equipped with a K3 detector operated in super-resolution bin 2x mode, at a nominal magnification of 165 kx, which yielded a pixel size of 0.513 Å per pixel. Processing of these data was performed similarly to the MuvB-apo complex with variations explained below and in Supplementary Fig. 4 and 5.

2D class-averages show a clear additional density protruding from the nucleosome (Supplementary Fig. 4E). Ab initio reconstruction followed by 3D classification allowed removal of partially disassembled nucleosomes and apo nucleosomes particles from a MMBcore:nucleosome complex reconstruction (Supplementary Fig. 4C). Further 3D classification of this class shows high mobility of a portion of MMBcore in respect to the nucleosome (Supplementary Fig. F). We named this highly mobile portion of the complex MuvBHEAD. Out of four 3D classes, one presented clearer density in both MuvBHEAD and another portion of the MMBcore complex, here called MuvBTAIL, which is embedded in the nucleosome disc (Fig. 1B and Supplementary Fig. 4F). 3D refinements focusing on MuvBHEAD and MuvBTAIL, allowed us to obtain reconstructions at resolutions of 7.5 and 8.7 Å respectively (Supplementary Fig. 5A–D, J). For structural studies on the MuvB:nucleosome interactions we collected additional 29,102 Movie stacks. Particle data coming from the MMBcore:nucleosome class obtained after 2D and 3D classification within this second dataset were merged with the first dataset at the stage indicated in Supplementary Fig. 4C. Additional 3D classification yielded 237,061 particles of MMBcore:nucleosome complex (Supplementary Fig. 5 and J). Further focused 3D refinements were used to obtain MMBcore:DNA, NCP147, NCP:LIN9CTD, and NCP127 maps (Supplementary Fig. 5E–J).

Model building

The PDB 4PC0 was used as initial template for building RbBP4 in the MuvB-apo structure in Coot64. LIN9 DIRP, Tudor domain and LIN37 middle domain and proline-rich loop were build de novo based on the excellent quality of our cryo-EM map. Structural model refinement was performed with PHENIX real-space refinement65 at the resolution of 3.5 Å.

This structure was rigid-body fit in the MuvBHEAD complex map (Supplementary Fig. 5B) where an additional kinked helix belonging to the LIN37CTD (residues 203–246, predicted by AlphaFold51) could be fitted, this was also guided by our XL-MS data (Fig. 3b). PDB 6C48, was fitted into the MuvBTAIL-NCP map (Fig. c and d). The connection of MuvB HEAD and TAIL via the LIN9 loop is modelled for showing the connectivity between these modules. PDB 6PWE was used as initial template to model the NCP-DNA complex.

AlphaFold model of the LIN9CTD:LIN54CTD subcomplex was obtained by submitting LIN9 residues 421–542 and LIN54 residues 635–749 to AlphaFold51. The prediction of this subcomplex has an overall a high score as judged by the predicted local-distance difference test.

Electrophoretic mobility shift assay

NCP or DNA was mixed with MuvB or MMB complex at a 1:0 (input control) or 1:1, 1:2, 1:4 ratio unless otherwise stated. The concentration of NCP or DNA was constant at 0.5 μM. Reactions were topped up with buffer containing 20 mM HEPES pH 8, 50 mM NaCl, 0.5 mM TCEP and incubated on ice for 30 min. After the addition of 5% (v/v) sucrose, samples were resolved by gel electrophoresis using either 5% TBE-Polyacrylamide gel or 1% TBE-agarose gel. The gels were stained with SYBR Safe and scanned using either a Typhoon FLA 9500 imager or a Biorad ChemiDoc system. Quantification was performed using ImageLab (Biorad). Band intensities of unbound either NCP or DNA were normalised relative to the intensity of the NCP or DNA band in absence of the protein complex studied and plotted as a percentage of unbound NCP/DNA fraction. Means ± standard error of the mean of at least three independent experiments are shown.

H1 competition assay

NCP167 was mixed with MMBcore at 0.5 and 1.25 μM concentration, respectively. The mixture was incubated on ice for 15 min. H1 was titrated to the mixture at final concentration of 0.25, 0.5, and 0.75 μM. After another 15 min incubation, sucrose was added to final concentration of 5% (v/v), and samples were analysed on 1% TBE-agarose gel. The gel was run at 100 V for 3.5 h at 4 °C. The gel was stained with SYBR Safe and scanned using Typhoon FLA 9500 imager.

LANA competition assay

NCP containing CHR sequence was mixed with MuvB at 0.5 and 1.5 μM concentration, respectively. LANA peptide was titrated to the mixture at final concentration of 100, 200, and 300 μM. After 15 min incubation, sucrose was added to final concentration of 5% (v/v). Samples were analysed on 5% native PAGE gel. The gel was run at 100 V for 1.5 h at 4 °C. The gel was stained with SYBR Safe and scanned using Typhoon FLA 9500 imager.

Crosslinking mass spectrometry analysis

Similarly to the cross-linking protocol described above for the apo-MuvB, 30 μL of freshly purified MuvB complex at a concentration of 28 μM was incubated with 0.75 μL DSSO (disuccinimidyl sulfoxide) (A33545, Thermo Scientific) pre-dissolved in DMSO at a concentration of 100 mM. Sample was incubated on ice for 10 min followed by quenching with 2 μL 500 mM Tris pH8 and immediate SEC on a Superose 6 3.2/300 column (GE Healthcare) in buffer containing 20 mM Tris pH 8, 100 mM NaCl, 0.2 mM TCEP. Eluted fractions were analysed by SDS-PAGE and appropriate MuvB monomer-containing fractions were pooled and concentrated to 20 μL and re-injected on the SEC column as described above to remove any higher-order contaminants. Monomeric-containing fraction was analysed by mass spectrometry as follows.

After the crosslinking reaction, triethylammonium bicarbonate buffer (TEAB) was added to the sample at a final concentration of 100 mM. Proteins were reduced and alkylated with 5 mM tris-2-carboxyethyl phosphine (TCEP) and 10 mM iodoacetamide (IAA) simultaneously for 60 min in dark and were digested overnight with trypsin at final concentration 50 ng/μL (Pierce). Sample was dried and peptides were fractionated with high-pH Reversed-Phase (RP) chromatography using the XBridge C18 column (2.1 × 150 mm, 3.5 μm, Waters) on a Dionex UltiMate 3000 HPLC system. Mobile phase A was 0.1% v/v ammonium hydroxide and mobile phase B was acetonitrile, 0.1% v/v ammonium hydroxide. The peptides were fractionated at 0.2 mL/min with the following gradient: 5 min at 5% B, up to 12% B in 3 min, for 32 min gradient to 35% B, gradient to 80% B in 5 min, isocratic for 5 min and re-equilibration to 5% B. Fractions were collected every 42 s, SpeedVac dried and orthogonally pooled into 12 samples for MS analysis.

LC-MS analysis was performed on the Dionex UltiMate 3000 UHPLC system coupled with the Orbitrap Lumos Mass Spectrometer (Thermo Scientific). Each peptide fraction was reconstituted in 30 μL 0.1% formic acid and 15 μL were loaded to the Acclaim PepMap 100, 100 μm × 2 cm C18, 5 μm trapping column at 10 μL/min flow rate of 0.1% formic acid loading buffer. Peptides were then subjected to a gradient elution on the Acclaim PepMap (75 μm × 50 cm, 2 μm, 100 Å) C18 capillary column connected to a stainless steel emitter with integrated liquid junction (cat# PSSELJ, MSWIL) on the EASY-Spray source at 45 °C. Mobile phase A was 0.1% formic acid and mobile phase B was 80% acetonitrile, 0.1% formic acid. The gradient separation method at flow rate 300 nL/min was as follows: for 95 min gradient from 5 to 38% B, for 5 min up to 95% B, for 5 min isocratic at 95% B, re-equilibration to 5% B in 5 min, for 10 min isocratic at 5% B. Precursors between 375 and 1600 m/z and charge equal or higher than +3 were selected at 120,000 resolution in the top speed mode in 5 sec and were isolated for CID fragmentation (collision energy 25%) with quadrupole isolation width 1.6 Th and Orbitrap detection with 30,000 resolution. Fragments with targeted mass difference of 31.9721 (DSSO crosslinker) were further subjected to CID fragmentation at the MS3 level with collision energy 35%, iontrap detection, max IT 50 ms, AGC 2 × 104 and MS2 isolation window 2 Th. Two precursor groups were selected with both ions in the pair. Targeted MS precursors were dynamically excluded for further isolation and activation for 30 s with 10 ppm mass tolerance.

Identification of crosslinked peptides was performed in Proteome Discoverer 2.4 (Thermo) with the Xlinkx search engine in the MS2_MS3 mode. Precursor, FTMS and ITMS mass tolerances were 20 ppm, 30 ppm, and 0.5 Da respectively with maximum 2 trypsin missed cleavages allowed. Carbamidomethyl at C and Oxidation at M were selected as static and dynamic modifications respectively. Spectra were searched against a FASTA file containing the sequences of the proteins in the complex as well as a random selection of ~1000 Escherichia coli UniProt entries as an additional negative control of the crosslink matches. Crosslinked peptides were filtered at FDR < 0.01 using the Percolator node and decoy database search.

Interpretation of the crosslinking mass spectrometry dataset for Model building

Given the high flexibility between MuvBHEAD and MuvBTAIL (Supplementary Fig. 4F) the cross-links between these two modules were excluded from the structural interpretation (Supplementary Fig. 6B and Supplementary Data 1).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.