Abstract
The nucleosome is a fundamental unit of chromatin in which about 150 base pairs of DNA are wrapped around a histone octamer. The overlapping di-nucleosome has been proposed as a product of chromatin remodeling around the transcription start site, and previously found as a chromatin unit, in which about 250 base pairs of DNA continuously bind to the histone core composed of a hexamer and an octamer. In the present study, our genome-wide analysis of human cells suggests another higher nucleosome stacking structure, the overlapping tri-nucleosome, which wraps about 300-350 base-pairs of DNA in the region downstream of certain transcription start sites of actively transcribed genes. We determine the cryo-electron microscopy (cryo-EM) structure of the overlapping tri-nucleosome, in which three subnucleosome moieties, hexasome, hexasome, and octasome, are associated by short connecting DNA segments. Small angle X-ray scattering and coarse-grained molecular dynamics simulation analyses reveal that the cryo-EM structure of the overlapping tri-nucleosome may reflect its structure in solution. Our findings suggest that nucleosome stacking structures composed of hexasome and octasome moieties may be formed by nucleosome remodeling factors around transcription start sites for gene regulation.
Similar content being viewed by others
Introduction
Genetic information is encoded in DNA, which is compactly accommodated as chromatin within the nucleus in eukaryotic cells1. In chromatin, four core histone proteins, H2A, H2B, H3, and H4, form a histone octamer, in which two H2A–H2B and H3–H4 heterodimers associate with each other, and wrap 145–147 base pairs (bps) of DNA around the lateral periphery of the histone octamer2. The resulting histone–DNA complex is termed the nucleosome core particle (NCP) and is a fundamental structural unit of chromatin. The NCPs are connected by linker DNA segments and form fibers with a bead-on-a-string appearance3. The bead-on-a-string fiber may be further folded into an unknown higher-order chromatin architecture in the nucleus4.
The DNA wrapped in the nucleosome generally becomes inaccessible to DNA-binding proteins, which function in genomic DNA regulation5,6,7. To overcome the nucleosome barrier during the replication, repair, recombination, and transcription processes, nucleosomes are dissociated from and/or slide along the DNA during the nucleosome remodeling process, which is mediated by a group of enzymes called nucleosome remodelers8,9. In particular, during the transcription initiation process, the nucleosome located at the transcription start site (TSS) of a gene could be removed by the nucleosome remodeling activity for the subsequent assembly of the transcription machinery, such as RNA polymerase II10,11. In this process, the nucleosome remodeling potentially induces the collision of neighboring nucleosomes and forms the overlapping di-nucleosome (OLDN), in which the hexameric nucleosome (hexasome) lacking one H2A–H2B dimer associates with the nucleosome (octasome)12,13,14.
In the crystal structure of the OLDN, ~250 bp DNA is continuously wrapped in three turns around the histone core14. Consistent with the OLDN structure, the OLDN formation protects ~250 bp DNA from digestion by micrococcal nuclease (MNase), which preferentially degrades nucleosome-free DNA regions14. A genome-wide analysis revealed that the protection of 250 bp from MNase is predominantly observed at the downstream regions immediately adjacent to transcription start sites (TSSs) in human cells14. This suggests that the OLDN formation may be mediated by nucleosome remodelers, which function around the TSS for transcription initiation14.
In the present study, we found a higher-order nucleosome stacking structure, possibly corresponding to the overlapping tri-nucleosome (OLTN), which may be formed at the downstream regions of TSSs in the human genome. We then determined the structure of the OLTN, composed of hexasome, hexasome, and octasome, by cryo-electron microscopy (cryo-EM). The cryo-EM structure of the OLTN may reflect its structure in solution as revealed by small-angle X-ray scattering and coarse-grained molecular dynamics simulation analyses.
Results
The overlapping tri-nucleosome may be formed downstream of transcription start sites in the genome
A previous genome-wide MNase analysis revealed that approximately 250-bp, corresponding to the OLDN, are protected just downstream of TSSs in human cells14. This suggested that nucleosome collisions, probably by the nucleosome remodeling activity, may promote the OLDN formation around the TSS regions in the genome14. If the nucleosome remodeling constantly occurs with the OLDN around TSS regions, then another nucleosome could collide with the OLDN and form the OLTN, a further nucleosome stacking structure with three nucleosomes.
To test this possibility, we first performed the MNase protection analysis with chromatin isolated from HeLa cells. The isolated chromatin was extensively digested by MNase until genomic DNA fragments larger than 150 bp were barely detected (Fig. 1a and Supplementary Fig. 1a). Since the extensive MNase digestion generally removes DNA-binding proteins, this step is expected to reduce the possible contamination from DNA fragments protected by DNA-binding proteins other than histones. We then prepared the resulting MNase-treated 200–300 and 300–400 bp genomic DNA fragments, which may correspond to OLDN and OLTN, respectively, by polyacrylamide gel electrophoresis, and performed a next-generation sequencing (NGS) analysis. Consistent with the previous NGS analysis, we confirmed that the 250 bp fragments corresponding to the OLDN were accumulated in the regions downstream of TSSs in HeLa cells (Fig. 1b, c and Supplementary Fig. 1b, c). Interestingly, we found that 300–350 bp DNA fragments, which may correspond to the OLTN, were also accumulated in this region (Fig. 1d, e and Supplementary Fig. 1d–f). This suggested that OLTNs, in addition to OLDNs, may form in the downstream regions of TSSs in human cells.
Cryo-EM visualization of the OLTN
We then tested the OLTN formation in vitro, using a 353 bp DNA fragment containing 125, 103, and 125 bp DNA segments of the Widom 601 nucleosome positioning sequence, for in vitro nucleosome reconstitution (Fig. 2). In the previous OLDN reconstitution, the 250 bp tandem Widom 601 DNA was used as the template DNA, in which the 44 bp connection site region (22 bp of downstream and upstream regions of each Widom 601 DNA), was deleted14. In the present OLTN reconstitution, the 44 bp DNA region of the second connection site was also deleted (Fig. 2a). Purified human histones, H2A, H2B, H3, and H4, were mixed with the 353 bp DNA fragment, and the histone complex was assembled on the DNA by the salt dialysis method15. The reconstituted histone–DNA complex was purified by native polyacrylamide gel electrophoresis (Fig. 2b, c, and Supplementary Fig. 2). The additional bands just after the OLTN reconstitution may be the OLDN (migrating faster than the OLTN), the OLTN with improper additional histone binding (migrating slower than the OLTN), and the OLTN with the different positioning of subnucleosome moieties (Supplementary Fig. 2, Input). The reconstituted OLTN sample was subjected to MNase digestion coupled with NGS analysis. We found that 335–338 bp DNA fragments were protected from the MNase digestion (Fig. 2d). This is consistent with the MNase-seq analysis data, in which 300–350 bp DNA fragments, probably corresponding to the OLTN, are protected from MNase digestion in the genomic DNA (Fig. 1d, e).
The reconstituted OLTN was fractionated by sucrose gradient ultracentrifugation with glutaraldehyde fixation (GraFix) and subjected to cryo-EM data collection with a 300 kV electron microscope. From the 6953 electron micrographs, 1.7 million particles were identified (Supplementary Fig. 3). We then obtained the cryo-EM structure of the OLTN (Fig. 3a), with an overall resolution of 7.6 Å. In the OLTN structure, another hexasome is associated with the OLDN, and consequently, the hexasome (proximal), hexasome (central), and octasome (distal) moieties are aligned in this order (Fig. 3b–d). A 3D map (8.5%) was obtained with a possibly different orientation between the OLDN part and the proximal hexasome portion, but this is uncertain due to the noisy background (Supplementary Fig. 3). We refined each nucleosome moiety, proximal hexasome (5.2 Å), central hexasome (5.9 Å), and distal octasome (4.5 Å), and obtained a composite map (Supplementary Fig. 3b).
To determine whether the OLTN formation occurs with a native DNA sequence, we repeated the OLTN reconstitution experiment with the 350 bp DNA fragment containing the human genome sequence identified as the SMARCC1 and SMARCC2 accumulated region (conservative IDR thresholded ChIP-seq peaks; ENCSR000EDM, ENCSR000EDL). SMARCC1 and SMARCC2 are subunits of a prominent nucleosome remodeler, the mammalian SWI/SNF complex, which may promote the OLDN formation12. We selected the 350 bp DNA fragment with the DNA sequence ~70 bp downstream from the TSS. By 2D class averages, we then confirmed that the OLTN can be formed with this genomic DNA sequence (Supplementary Fig. 4).
The OLTN structure
In the OLTN structure, a 349 bp DNA segment is continuously wrapped in the hexasome–hexasome–octasome core and adopts a “slinky”-like shape (Fig. 4a). In the distal octasome part, 145 bp of DNA contact with the histone octamer (Fig. 4a), while in the proximal and central hexasomes, 97 bp of DNA directly bind to the histone surface (Fig. 4a). The hexasome–hexasome and hexasome–octasome boundaries are connected by 7 and 3 bp DNA segments, respectively (Fig. 4a). The 97 and 145 bp DNA regions wrapped around the histone hexamer and octamer, respectively, are aligned horizontally in the OLTN structure (Fig. 4a). The H2A–H2B dimers are missing at the hexasome–hexasome and hexasome–octasome interfaces (Fig. 4b–d). This may reduce the steric hindrance at the interfaces between hexasome and hexasome or hexasome and octasome, and allow the horizontal DNA alignment in the OLTN (Fig. 4a).
Small angle X-ray scattering and molecular dynamics simulation analyses of the OLTN in solution
To acquire structural insights into the structure of the OLTN in solution, we conducted a small angle X-ray scattering coupled with analytical ultracentrifugation (AUC-SAXS) analysis16,17. In this method, the bona fide SAXS of OLTN can be extracted from the scattering of the sample solution containing OLTN and aggregates. The OLTN was reconstituted with the same 353 bp DNA fragment (Widom 601 sequence) used in the cryo-EM analysis, and the sample was subjected to the SAXS experiment without crosslinking. The SAXS curve of the OLTN is quite similar to that obtained from the cryo-EM structure of the OLTN (Fig. 5a). Furthermore, we constructed a dummy atom model of the OLTN based on the SAXS, for comparison with the cryo-EM structure. It fits very well with the dummy atom model, suggesting that the cryo-EM structure likely represents a major form of the OLTN in solution (Fig. 5b). It should be noted that the SAXS curve of the OLTN slightly deviates from that calculated from the cryo-EM structure (Fig. 5a). This may reflect the structural dynamics of the OLTN in solution. To address this, we next performed a coarse-grained molecular dynamics (CG-MD) simulation analysis, which revealed that the trajectory of the OLTN structures is consistent with the SAXS and cryo-EM analyses (Fig. 5a). The representative structure in the CG-MD trajectory is shown in Fig. 5c. Given the dynamic arrangement of the hexasomal and octasomal portions in the OLTN, the histone–DNA and histone–histone interactions between the subnucleosomal portions may not be stable but transient around the cryo-EM structure.
Discussion
We previously reported the structure of the OLDN, composed of a hexasome and an octasome, and its presence in the regions immediately downstream of TSSs in human cells14. Consistently, the OLDN was reportedly visualized by cryo-electron microscopy in isolated human mitotic chromosomes18. These facts imply that the OLDN may be persistently maintained throughout the cell cycle. In the present study, our genome-wide MNase-seq analysis revealed that OLTNs may also form in the downstream regions of TSSs (Fig. 1). We successfully reconstituted and determined the cryo-EM structure of the OLTN, in which three nucleosomes, hexasome, hexasome, and octasome, are associated with short linker DNA segments (Figs. 3 and 4). It should be noted that, in the current method, the possible existence of the OLDN and OLTN can be deduced in cells; however, their prevalence relative to the normal nucleosome is currently difficult to evaluate. Therefore, the rates of OLDN and OLTN formation relative to the nucleosome may be an important future issue to solve.
Considering its formation in the downstream region of the TSS, the OLTN may be formed by the collision of an additional nucleosome with the OLDN, as a consequence of the nucleosome remodeling activity. As another possibility, the OLDN and OLTN may also be formed through nucleosome transfer during transcription elongation by RNA polymerase II19,20. In this process, the nucleosome located in the DNA region downstream of RNA polymerase II is disassembled and transferred to the upstream transcribed DNA region20. This nucleosome transfer may induce the OLDN and OLTN formation. RNA polymerase II-mediated nucleosome disassembly and reassembly may also function in resolving the OLDN and OLTN into the canonical nucleosome. The assembly and disassembly processes of the OLDN and OLTN remain to be clarified.
The first nucleosome (+1 nucleosome) of the protein-coding regions of genes is reportedly positioned approximately 40 bp downstream from the TSS21,22,23, and the preinitiation complex (PIC) containing RNA polymerase II is bound to the +1 nucleosome24,25. To do so, the nucleosome-depleted region (NDR) must be created by nucleosome remodelers before the PIC assembly around the TSS26. The SWI/SNF nucleosome remodeling complex, which promotes nucleosome sliding, is a possible candidate as a nucleosome remodeler to produce the NDR by its nucleosome sliding activity27,28,29. The nucleosome remodeling may induce nucleosome collision, forming the OLDN and OLTN in the region downstream of the TSS12,13. The cryo-EM structures of the mammalian SWI/SNF family remodelers, the human BAF and PBAF complexes, bound to the nucleosome have been reported30,31. In these structures, the nucleosome is located on the solvent-accessible surface and can associate with another nucleosome particle without serious steric hindrance30,31. Therefore, the OLDN and OLTN could bind to the BAF and PBAF complexes by a mechanism similar to that of the nucleosome, and become substrates for further nucleosome remodeling.
The +1 nucleosome may function as a barrier for transcription elongation32,33,34,35, and the OLDN and OLTN formed at the +1 position may enhance this function of the +1 nucleosome. In the PIC structures, the nucleosomes are quite flexible and largely exposed to the solvent25. The OLDN and OLTN can be accommodated within the PIC without substantial steric clashes. Further studies are awaited to clarify the relationship between OLDN/OLTN formation and transcription status in genes.
In the previous crystal structure of the OLDN, the hexasome–octasome moieties are more compactly associated than those in the OLTN structure (Supplementary Fig. 5). This may be due to restriction of the OLDN dynamics by crystal packing effects, or it may reflect one of the possible arrangements of the hexasome–octasome moieties. The hexasome–octasome arrangement of the OLTN is consistent with the structural range of the OLDN in solution36. These findings suggest that the octasome and hexasome moieties in the OLTN, as well as the OLDN, may have dynamic properties that allow regulation of transcription through the +1 nucleosome downstream of the TSS. To address this fundamental question, studies of OLTN and OLDN transcription by RNA polymerase II are awaited.
Methods
Preparation of DNA fragments for next-generation sequencing analyses
The HeLa nuclei were prepared as previously described14. The HeLa cells (5 × 107) were suspended in 15 ml of washing solution, containing 15 mM Tris–HCl (pH 8.0), 15 mM NaCl, 60 mM KCl, 300 mM sucrose, and a protease inhibitor (Roche). The cell suspension was mixed with 15 ml of detergent buffer, containing 15 mM Tris–HCl (pH 8.0), 15 mM NaCl, 60 mM KCl, 300 mM sucrose, protease inhibitor (Roche), and 1% NP-40, was then added to the cell suspension, and the mixture was rotated at 4 °C for 10 minutes. The HeLa nuclei were collected by centrifugation and resuspended in 100 µL of the washing solution. A large amount of MNase (3 units/µL) and CaCl2 (2 mM) were added to the HeLa nuclei suspension. The MNase treatment was performed at 37 °C for 30 min and then terminated by EDTA (pH 8.0) addition at a final concentration of 50 mM. The resulting reaction mixture was collected by centrifugation, and the supernatant was deproteinated by adding a mixture of proteinase K (0.9 mg/ml, Roche) and 0.5% SDS, followed by phenol/chloroform extraction. The MNase fragments were precipitated by ethanol and electrophoresed in a 4% agarose gel. DNA samples with sizes of 140–150, 200–300, and 300–400 bp were extracted from the agarose gel and used for the NGS analyses.
Next-generation sequencing analyses
The paired-end library of MNase-seq was prepared with a ThruPLEX® DNA-Seq Kit (Clontech, #R400674), and sequencing was performed using an Illumina NovaSeq6000 sequencer (in vivo data) and an Illumina MiSeq sequencer (in vitro data). The reads were mapped to the human genome (GRCh38) using Bowtie237 (version 2.3.5.1) with default parameters. BAM files of in vivo assays were created using SAMtools38 (version 1.9). BAM files were input to plot2DO39, to create heatmaps. To create aggregation plots, the mapped reads with presumable lengths of nucleosomal DNA were extracted from each BAM file using SAMtools38 OLTN: 337 (±5) bp, OLDN: 248 (±5) bp, and mono-nucleosome: 147 (±5) bp. The resulting files of the extracted reads were input to agplus40 to create the aggregation plots.
Reconstitution and purification of the OLTN
The OLTN was prepared with the 353 bp DNA fragment derived from the Widom 601 sequence (OLTN DNA) and a purified histone octamer. The OLTN DNA fragments were tandemly inserted into the pGEM-T Easy vector (Promega), and the plasmid was amplified in DH10alpha cells. The OLTN DNA fragment was then isolated from the vector by EcoRV restriction enzyme (Takara) treatment, and purified by polyethylene glycol precipitation and TSKgel DEAE-5PW (TOSOH) anion exchange column chromatography. The histone octamer was prepared from recombinant human histones H2A, H2B, H3, and H4, as previously described15. In this study, we employed H3.1 as the histone H3. In brief, the four histones were mixed at an equal molar ratio in denaturing buffer, containing 20 mM Tris–HCl buffer (pH 7.5), 7 M guanidine hydrochloride, and 20 mM 2-mercaptoethanol, and the mixture was dialyzed against refolding buffer, containing 10 mM Tris–HCl (pH 7.5), 2 M NaCl, 1 mM EDTA, and 5 mM 2-mercaptoethanol. After the dialysis, the refolded histone octamer was purified by size exclusion column chromatography in the refolding buffer.
The OLTN was reconstituted by the salt dialysis method, as previously described15. The purified histone octamer and the DNA were mixed at a 3:1 molar stoichiometry in reconstitution buffer-high, containing 10 mM Tris–HCl (pH 7.5), 2 M KCl, 1 mM EDTA (pH 8.0), and 1 mM dithiothreitol. The KCl concentration was decreased to 250 mM by continuous buffer exchange with peristaltic pumps. The reconstituted sample was fractionated by non-denaturing gel electrophoresis using a Prep Cell apparatus (Bio-Rad) in the elution buffer, containing 20 mM Tris–HCl (pH 7.5) and 1 mM dithiothreitol. The peak fractions were collected and concentrated by filtration. The concentration of OLTN was determined by the absorbance at 260 nm.
The sequence of the OLTN DNA is as follows: 5´-ATCGAGAATCCCGGTGCCGAGGCCGCTCAATTGGTCGTAGACAGCTCTAGCACCGCTTAAACGCACGTACGCGCTGTCCCCCGCGTTTTAACCGCCAAGGGGATTACTCCCTAGTCTCCAGGCTCGAGCTCAATTGGTCGTAGACAGCTCTAGCACCGCTTAAACGCACGTACGCGCTGTCCCCCGCGTTTTAACCGCCAAGGGGATTACTCCCTAGTCTCCAGGCTCGAGCTCAATTGGTCGTAGACAGCTCTAGCACCGCTTAAACGCACGTACGCGCTGTCCCCCGCGTTTTAACCGCCAAGGGGATTACTCCCTAGTCTCCAGGCACGTGTCAGATATATACATCCGAT-3´.
For the OLTN containing the native DNA sequence, the purified histone octamer and the native DNA (350 bp) were mixed at a 4.8:1 molar stoichiometry in reconstitution buffer-high. The following analyses of the OLTN containing native DNA sequence were performed in the same manner as for the OLTN.
The sequence of the native OLTN DNA (S100A13 gene locus, ENSEMBL_ID: ENSG00000189171) is as follows: 5´-TTTGAACCCAGTCCAATAAAACCTCAAAACCGGTGCACTTTCTACCGTATCCTGAGGCTTCTTTACTTTGGGGCCCTGGTTAGCCTTAGCAGCCTAGTTTTCTACATCCTTCATGCCAGTTGAATGAAACTACTGACATGCTCCACATTCTTCTTTCTCCCATGCTTTTGTTCATTCAGTTACCTCCTCCTAAAATGTCTGCATTTACCCAGATAATCTTCCAATGGAAATCCATGGTTCAAGTGCCACCTCTTCAGGAAAGCCATCTGACTTCAATCAGGTTAATAATCTCTCATAACCCTTTCTGGTACATCTGTTAAGGCTCTGTGTATTTCCTGGAGTTCATGGTT-3´.
MNase treatment of the purified OLTN
The purified OLTN (0.2 µM) was incubated with 25 mU/µL MNase (Takara) at 37 °C for 9 minutes in 296 µL of reaction solution, containing 41 mM Tris–HCl (pH 7.5 and 8.0), 5 mM NaCl, 2.5 mM CaCl2, 5% glycerol, 1 mM 2-mercaptoethanol, and 0.8 mM dithiothreitol. The MNase reaction was terminated by adding an equal volume of ProK solution, containing 200 mM Tris–HCl (pH 8.0), 80 mM EDTA (pH 8.0), 0.5 mg/ml proteinase K (Roche), and 0.25% SDS. The resulting DNA fragments were collected by phenol–chloroform–isoamyl alcohol extraction followed by ethanol precipitation and then electrophoresed in an 8% urea gel. The DNA sample with fragment sizes of 330–350 bp was extracted from the gel by electroelution and used for the NGS analysis.
Preparation of the OLTN for cryo-EM analysis
The OLTN and the OLTN containing the native DNA sequence for the cryo-EM analysis were prepared by the gradient fixation method (GraFix)41. The gradient solution contained 20 mM HEPES–KOH (pH 7.5), 50 mM KOAc, 0.2 µM Zn(OAc)2, 0.1 mM TCEP (pH 8.0), 10–25% (w/v) sucrose, and 0–0.1% (v/v) glutaraldehyde. The reconstituted OLTN was purified by non-denaturing polyacrylamide gel electrophoresis using a Prep Cell apparatus (Bio-Rad), concentrated by filtration, applied to the top of the gradient solution, and centrifuged at 27,000 rpm for 16 h (Beckman Coulter Optima™; SW41 Ti rotor). After centrifugation, aliquots were carefully collected from the top of the gradient. The peak fractions were analyzed by non-denaturing gel electrophoresis and then desalted using a PD-10 column (Cytiva) in a final buffer, containing 20 mM HEPES–KOH (pH 7.5), 50 mM KOAc, and 2 mM TCEP (pH 8.0). The fixed OLTN and the OLTN containing the native DNA sequence were concentrated by filtration and stored on ice.
Cryo-EM data collection
For the OLTN, the Quantifoil grids (R1.2/1.3 200-mesh copper) were treated with ethyl acetate and glow-discharged before use. Aliquots of the OLTN (0.3 mg/ml) were blotted in a Vitrobot Mark IV (Thermo Fisher Scientific) at blot force 5 for 10 s, under 100% humidity at 16 °C, and the grids were immediately plunged into liquid ethane to embed the sample particles into amorphous ice. Data collections were performed on a Krios G3i cryo-transmission electron microscope (Thermo Fisher Scientific) operated at 300 kV. The micrographs were automatically recorded on a K3 BioQuantum (Gatan) direct electron detector, calibrated at a pixel size of 1.07 Å in the electron counting mode, using a slit width of 25 eV and retaining 40 frames with a total dose of 55.1 electron/Å, using the EPU software. For the OLTN containing the native DNA sequence, the Quantifoil grids (R1.2/1.3 200-mesh copper) were treated with ethyl acetate and glow-discharged before use. Aliquots of the OLTN containing the native DNA sequence (1 mg/ml) were blotted in a Vitrobot Mark IV (Thermo Fisher Scientific) at blot force 0 for 6 s, under 100% humidity at 16 °C, and the grids were immediately plunged into liquid ethane to embed the sample particles into amorphous ice. Data collections were performed on a Krios G4 cryo-transmission electron microscope (Thermo Fisher Scientific) operated at 300 kV. The micrographs were automatically recorded on a K3 BioQuantum (Gatan) direct electron detector, calibrated at a pixel size of 1.06 Å in the electron counting mode, using a slit width of 20 eV and retaining 40 frames with a total dose of 62.0 electron/Å, using the EPU software. The defocus range was between 1 and 2.5 µm. The recording parameters are summarized in Table 1.
Image processing
For both the OLTN and the OLTN containing the native DNA sequence, the movie frames were aligned by the MotionCor2 software42, and the contrast transfer functions for each frame were estimated by the CTFFIND4 software43. The following image processing of the OLTN structure was performed using the RELION 3.1 software44. In total, 1,718,300 particles from 6953 electron micrographs were automatically picked using the Laplacian-of-Gaussian filter, with a box size of 300 pixels. Subsequently, the junk particles were removed through one round of 2D classification. The initial 3D model was then generated by de novo modeling for the reference model of the first 3D classification. After two rounds of 3D classification, the selected OLTN structure, in which each proximal hexasome, center hexasome, and octasome were clearly visible, and comprising 148,082 particles, was refined using a global soft mask followed by post-processing. The resolution for the final map was estimated as 7.6 Å based on the Fourier shell correlation criteria of 0.143. To resolve the octasome unit and the two hexasome units, multi-body refinements45 using each mask were performed and followed by the postprocesses. The final resolutions of the proximal hexasome, center hexasome, and octasome were 5.2, 5.9, and 4.5 Å, respectively.
The following image processing of the OLTN containing the native DNA sequence was performed using the RELION 4.0 software46. In total, 1,666,759 particles from 4851 electron micrographs were picked automatically, followed by 2D classification.
Model building and refinement
The structural model was built from segments of the octasome unit and the hexasome unit extracted from the crystal structure of the OLDN (PDB ID: 5GSE)14. The octasome unit and the two hexasome units were fitted into the corresponding cryo-EM densities using rigid-body fitting in UCSF Chimera47. The DNA segments between each unit were connected and some histone residues outside the cryo-EM density were removed manually, using the Coot software48,49. The model was refined by phenix_real_space_refine50 using the histone octamer segment of the nucleosome structure (PDB ID: 5Y0C)51 as the reference model. The DNA sequence of the OLTN was estimated from the sequencing analysis of the MNase fragment of the purified OLTN. The model parameters calculated by the MolProbity software52 and the Q-score53 obtained from wwPDB validation are summarized in Table 1.
Integrated method of analytical ultracentrifugation (AUC) and small-angle X-ray scattering (SAXS): AUC–SAXS
Sample
A 2.9 mg/mL OLTN solution, in buffer containing 20 mM Tris–HCl (pH 8.0), 50 mM NaCl, and 1 mM dithiothreitol, was utilized.
SAXS measurement
SAXS measurement was performed using a NANOPIX (Rigaku, Japan) equipped with a point-focused generator of a Cu-Kα source, MicroMAX-007 HFMR (Rigaku, Japan) (wavelength = 1.54 Å). Scattered X-rays were counted using a HyPix-6000 detector (Rigaku, Japan). The sample-to-detector distances were set at 1330 and 350 mm, covering a q-range of 0.008–0.750 Å−1 (q: magnitude of scattering vector). All measurements were performed at 25 °C. The two-dimensional scattering patterns were converted into one-dimensional scattering profiles with the SAngler software (version 2.1.45)54. After correcting for transmission and subtracting the buffer scattering, we obtained the absolute scattering intensity by using the standard scattering intensity of water (1.632 × 10−2 cm−1)55.
AUC measurement
To investigate the potential presence of aggregated components in the sample solution, AUC measurements were performed using a ProteomeLab XL-I instrument (Beckman Coulter, USA), with Rayleigh interference optics at 25 °C and a rotor speed of 30,000 rpm. The weight fraction distribution as a function of the sedimentation coefficient, normalized to the value at 20 °C in pure water, and the frictional ratio was calculated using the SEDFIT software (version 15.01c)56. The molecular weights of the components were determined based on the sedimentation coefficient and frictional ratio57. From the AUC data, the presence of OLTN and three aggregates in the sample solution was observed as follows: Component 1 (OLTN) {466 kDa, 64.3%}, Component 2 (Aggregate 1) {731 kDa, 13.2%}, Component 3 (Aggregate 2) {882 kDa, 16.4%}, and Component 4 (Aggregate 3) {1150 kDa, 6.1%} {obtained molecular weights, weight fractions}.
AUC–SAXS
AUC–SAXS treatment16,17 was applied to extract the scattering profile of OLTN from the measured SAXS data, including the aggregates. As a result, the gyration radius, determined through Guinier analysis, decreased from 70.1 ± 0.4 Å (non-treated experimental data) to 61.0 ± 0.4 Å (AUC–SAXS-treated data), indicating the true gyration radius of OLTN in solution.
3D-modeling from the SAXS data
Ab initio 3D modeling
An ab initio model of OLTN in solution was built with the dummy atom method, DAMMIF, in the ATSAS software package (version 3.1.3)58, based on the SAXS profile.
Coarse-grained molecular dynamics (CG-MD) modeling
The possible structures of OLTN were explored through CG-MD simulations, using CafeMol59. The OLTN model with full-length histone proteins was prepared in two steps. First, the missing residues in the histone cores were filled in by using the known 3D structure (PDB:5GSE)14. Second, histone tails were generated using PyMOL60. The 3SPN.2 DNA model, AICG2+ model, and a flexible local potential were employed for DNA, histone cores, and histone tails, respectively61,62,63. Charges on the histone cores were defined using RESPAC64. For the histone tails, + 1 charge was assigned to each arginine or lysine bead, and −1 charge was assigned to each aspartic acid or glutamic acid bead. Go-like AICG2+ potentials between histone cores and DNA, electrostatic interactions, and excluded volume effects were defined between DNA and proteins. Debye–Hückel-type electrostatic interactions were employed. Five Langevin dynamics simulations were performed at 300 K with a time unit of 0.2 with CafeMol, for a total of 5,000,000 steps. Since Pepsi-SAXS requires a fully atomistic model to calculate the scattering profile, CG-MD snapshots were reverse-mapped to atomistic models using BBQ, SCWRL4, and DNAbackmap65,66,67.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The MNase sequencing data have been deposited in the Gene Expression Omnibus with the accession (GEO) number GSE224789. The cryo-EM map and the atomic coordinates of the OLTN have been deposited in the Electron Microscopy Data Bank (EMDB) and the Protein Data Bank (PDB), under the accession codes EMD-35448 and PDB ID 8IHL, respectively. All data presented in this study are included in the main text or the supplementary information. Uncropped images are provided in Supplementary Fig. 6. Any additional information is available from the corresponding authors upon request.
References
Wolffe, A. P. Chromatin: Structure and Function 3rd edn (Academic Press, 1998).
Luger, K., Mäder, A. W., Richmond, R. K., Sargent, D. F. & Richmond, T. J. Crystal structure of the nucleosome core particle at 2.8 Å resolution. Nature 389, 251–260 (1997).
Olins, A. L. & Olins, D. E. Spheroid chromatin units (v bodies). Science 183, 330–332 (1974).
Lavelle, C. & Prunell, A. Chromatin polymorphism and the nucleosome superfamily: a genealogy. Cell Cycle 6, 2113–2119 (2007).
Segal, E. et al. A genomic code for nucleosome positioning. Nature 442, 772–778 (2006).
Thurman, R. E. et al. The accessible chromatin landscape of the human genome. Nature 489, 75–82 (2012).
Neph, S. et al. An expansive human regulatory lexicon encoded in transcription factor footprints. Nature 489, 83–90 (2012).
Clapier, C. R. & Cairns, B. R. The biology of chromatin remodeling complexes. Annu. Rev. Biochem. 78, 273–304 (2009).
Mueller-Planitz, F., Klinker, H. & Becker, P. B. Nucleosome sliding mechanisms: new twists in a looped history. Nat. Struct. Mol. Biol. 20, 1026–1032 (2013).
Bai, L. & Morozov, A. V. Gene regulation by nucleosome positioning. Trends Genet. 26, 476–483 (2010).
Clapier, C. R., Iwasa, J., Cairns, B. R. & Peterson, C. L. Mechanisms of action and regulation of ATP-dependent chromatin-remodelling complexes. Nat. Rev. Mol. Cell Biol. 18, 407–422 (2017).
Ulyanova, N. P. & Schnitzler, G. R. Human SWI/SNF generates abundant, structurally altered dinucleosomes on polynucleosomal templates. Mol. Cell. Biol. 25, 11156–11170 (2005).
Engeholm, M. et al. Nucleosomes can invade DNA territories occupied by their neighbors. Nat. Struct. Mol. Biol. 16, 151–158 (2009).
Kato, D. et al. Crystal structure of the overlapping dinucleosome composed of hexasome and octasome. Science 356, 205–208 (2017).
Kujirai, T. et al. Methods for preparing nucleosomes containing histone variants. Methods Mol. Biol. 1832, 3–20 (2018).
Morishima, K. et al. Integral approach to biomacromolecular structure by analytical-ultracentrifugation and small-angle scattering. Commun. Biol. 3, 294 (2020).
Morishima, K., Inoue, R. & Sugiyama, M. Derivation of the small-angle scattering profile of a target biomacromolecule from a profile deteriorated by aggregates. AUC–SAS. J. Appl. Crystallogr. 56, 624–632 (2023).
Beel, A. J., Azubel, M., Matteï, P. J. & Kornberg, R. D. Structure of mitotic chromosomes. Mol. Cell 81, 4369–4376.e4363 (2021).
Filipovski, M., Soffers, J. H. M., Vos, S. M. & Farnung, L. Structural basis of nucleosome retention during transcription elongation. Science 376, 1313–1316 (2022).
Ehara, H., Kujirai, T., Shirouzu, M., Kurumizaka, H. & Sekine, S. I. Structural basis of nucleosome disassembly and reassembly by RNAPII elongation complex with FACT. Science 377, eabp9466 (2022).
Albert, I. et al. Translational and rotational settings of H2A.Z nucleosomes across the Saccharomyces cerevisiae genome. Nature 446, 572–576 (2007).
Schones, D. E. et al. Dynamic regulation of nucleosome positioning in the human genome. Cell 132, 887–898 (2008).
Mavrich, T. N. et al. Nucleosome organization in the Drosophila genome. Nature 453, 358–362 (2008).
Tramantano, M. et al. Constitutive turnover of histone H2A.Z at yeast promoters requires the preinitiation complex. eLife 5, e14243–e14243 (2016).
Chen, X. et al. Structures of +1 nucleosome–bound PIC–Mediator complex. Science 378, 62–68 (2022).
Kornberg, R. D. & Lorch, Y. Primary Role of the Nucleosome. Mol. Cell 79, 371–375 (2020).
Hartley, P. D. & Madhani, H. D. Mechanisms that specify promoter nucleosome location and identity. Cell 137, 445–458 (2009).
Gkikopoulos, T. et al. A role for Snf2-related nucleosome-spacing enzymes in genome-wide nucleosome organization. Science 333, 1758–1760 (2011).
Tolkunov, D. et al. Chromatin remodelers clear nucleosomes from intrinsically unfavorable sites to establish nucleosome-depleted regions at promoters. Mol. Biol. Cell 22, 2106–2118 (2011).
He, S. et al. Structure of nucleosome-bound human BAF complex. Science 367, 875–881 (2020).
Yuan, J., Chen, K., Zhang, W. & Chen, Z. Structure of human chromatin-remodelling PBAF complex bound to a nucleosome. Nature 605, 166–171 (2022).
Yuan, G. C. et al. Molecular biology: genome-scale identification of nucleosome positions in S. cerevisiae. Science 309, 626–630 (2005).
Jin, C. et al. H3.3/H2A.Z double variant-containing nucleosomes mark ‘nucleosome-free regions’ of active promoters and other regulatory regions. Nat. Genet. 41, 941–945 (2009).
Weber, C. M., Ramachandran, S. & Henikoff, S. Nucleosomes are context-specific, H2A.Z-modulated barriers to RNA polymerase. Mol. Cell 53, 819–830 (2014).
Rhee, H. S., Bataille, A. R., Zhang, L. & Pugh, B. F. Subnucleosomal structures and nucleosome asymmetry across a genome. Cell 159, 1377–1388 (2014).
Matsumoto, A. et al. Structural studies of overlapping dinucleosomes in solution. Biophys. J. 118, 2209–2219 (2020).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008 (2021).
Beati, P. & Chereji, R. V. Creating 2D occupancy plots using plot2DO. Methods Mol. Biol. 2117, 93–108 (2020).
Maehara, K. & Ohkawa, Y. agplus: a rapid and flexible tool for aggregation plots. Bioinformatics 31, 3046–3047 (2015).
Kastner, B. et al. GraFix: sample preparation for single-particle electron cryomicroscopy. Nat. Methods 5, 53–55 (2008).
Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017).
Rohou, A. & Grigorieff, N. CTFFIND4: fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 192, 216–221 (2015).
Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. eLife 7, e42166–e42166 (2018).
Nakane, T., Kimanius, D., Lindahl, E. & Scheres, S. H. W. Characterisation of molecular motions in cryo-EM single-particle data by multi-body refinement in RELION. eLife 7 https://doi.org/10.7554/eLife.36861 (2018)
Kimanius, D., Dong, L., Sharov, G., Nakane, T. & Scheres, S. H. W. New tools for automated cryo-EM single-particle analysis in RELION-4.0. Biochem. J. 478, 4169–4185 (2021).
Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. Sect. D: Biol. Crystallogr. 60, 2126–2132 (2004).
Casañal, A., Lohkamp, B. & Emsley, P. Current developments in Coot for macromolecular model building of Electron Cryo-microscopy and Crystallographic Data. Protein Sci. 29, 1069–1078 (2020).
Afonine, P. V. et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr. Sect. D: Struct. Biol. 74, 531–544 (2018).
Arimura, Y. et al. Cancer-associated mutations of histones H2B, H3.1 and H2A.Z.1 affect the structure and stability of the nucleosome. Nucleic Acids Res. 46, 10007–10018 (2018).
Williams, C. J. et al. MolProbity: more and better reference data for improved all-atom structure validation. Protein Sci. 27, 293–315 (2018).
Pintilie, G. et al. Measurement of atom resolvability in cryo-EM maps with Q-scores. Nat. Methods 17, 328–334 (2020).
Shimizu, N. et al. Software development for analysis of small-angle x-ray scattering data. AIP Conf. Proc. 1741 https://doi.org/10.1063/1.4952937 (2016)
Doris, O., Alexander, B. & Otto, G. SAXS experiments on absolute scale with Kratky systems using water as a secondary standard. J. Appl. Crystallogr. 33, 218–225 (2000).
Schuck, P. Size-distribution analysis of macromolecules by sedimentation velocity ultracentrifugation and Lamm equation modeling. Biophys. J. 78, 1606–1619 (2000).
Lebowitz, J., Lewis, M. S. & Schuck, P. Modern analytical ultracentrifugation in protein science: a tutorial review. Protein Sci. 11, 2067–2079 (2002).
Franke, D. & Svergun, D. I. DAMMIF, a program for rapid ab-initio shape determination in small-angle scattering. J. Appl. Crystallogr. 42, 342–346 (2009).
Kenzaki, H. et al. CafeMol: a coarse-grained biomolecular simulator for simulating proteins at work. J. Chem. Theory Comput. 7, 1979–1989 (2011).
Schrödinger, L.L.C. The PyMOL Molecular Graphics System. v. 1.8 (2015).
Hinckley, D. M., Freeman, G. S., Whitmer, J. K. & de Pablo, J. J. An experimentally-informed coarse-grained 3-Site-Per-Nucleotide model of DNA: structure, thermodynamics, and dynamics of hybridization. J. Chem. Phys. 139, 144903 (2013).
Li, W., Wang, W. & Takada, S. Energy landscape views for interplays among folding, binding, and allostery of calmodulin domains. Proc. Natl Acad. Sci. USA 111, 10550–10555 (2014).
Terakawa, T. & Takada, S. Multiscale ensemble modeling of intrinsically disordered proteins: p53 N-terminal domain. Biophys. J. 101, 1450–1458 (2011).
Terakawa, T. & Takada, S. RESPAC: method to determine partial charges in coarse-grained protein model and its application to DNA-binding proteins. J. Chem. Theory Comput. 10, 711–721 (2014).
Gront, D., Kmiecik, S. & Kolinski, A. Backbone building from quadrilaterals: a fast and accurate algorithm for protein backbone reconstruction from alpha carbon coordinates. J. Comput. Chem. 28, 1593–1597 (2007).
Krivov, G. G., Shapovalov, M. V. & Dunbrack, R. L. Jr Improved prediction of protein side-chain conformations with SCWRL4. Proteins 77, 778–795 (2009).
Shimizu, M. & Takada, S. Reconstruction of atomistic structures from coarse-grained models for protein–DNA complexes. J. Chem. Theory Comput. 14, 1682–1694 (2018).
Acknowledgements
We are grateful to Y. Iikura, Y. Takeda, Y. Fukaya, and M. Dacher (University of Tokyo) for their assistance. We thank Dr. M. Kikkawa (University of Tokyo) for cryo-EM data collection. This work was supported in part by JSPS KAKENHI Grant Numbers JP22K06098 [to Y.T.], JP18H05534 [to H.K.], JP20H00449 [to H.K.], JP22J20665 [to T.F.], JP18H05527, JP20H00456 [to Y.O.], Research Support Project for Life Science and Drug Discovery (BINDS) from AMED under Grant Number JP22ama121009 [to H.K.], JP22ama121017j0001 [to Y.O.], JP22ama121002j001 [to M. Kikkawa, University of Tokyo], JST PRESTO Grant Number JPMJPR2026 [to K.Ma.], and JST ERATO Grant Number JPMJER1901 [to H.K.].
Author information
Authors and Affiliations
Contributions
M.N. and Y.K. reconstituted the OLTN, and M.N. H.T., K.N., and Y.T. performed the cryo-EM analysis. M.N., T.F., K.Ma., and Y.O. performed the genome-wide analysis with next-generation sequencing technology. K.Mo., M.Sh., and M. Su. performed the solution structural analysis and MD simulation analysis of the OLTN. H.K. designed the research and drafted the manuscript with M.N. All authors discussed and commented on the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Biology thanks Karolin Luger and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Joanna Timmins, Tobias Goris, and David Favero. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Nishimura, M., Fujii, T., Tanaka, H. et al. Genome-wide mapping and cryo-EM structural analyses of the overlapping tri-nucleosome composed of hexasome-hexasome-octasome moieties. Commun Biol 7, 61 (2024). https://doi.org/10.1038/s42003-023-05694-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s42003-023-05694-1
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.