Predicting scale-dependent chromatin polymer properties from systematic coarse-graining

Kadam, Sangram; Kumari, Kiran; Manivannan, Vinoth; Dutta, Shuvadip; Mitra, Mithun K.; Padinhateeri, Ranjith

doi:10.1038/s41467-023-39907-2

Download PDF

Article
Open access
Published: 11 July 2023

Predicting scale-dependent chromatin polymer properties from systematic coarse-graining

Nature Communications volume 14, Article number: 4108 (2023) Cite this article

2753 Accesses
3 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Simulating chromatin is crucial for predicting genome organization and dynamics. Although coarse-grained bead-spring polymer models are commonly used to describe chromatin, the relevant bead dimensions, elastic properties, and the nature of inter-bead potentials are unknown. Using nucleosome-resolution contact probability (Micro-C) data, we systematically coarse-grain chromatin and predict quantities essential for polymer representation of chromatin. We compute size distributions of chromatin beads for different coarse-graining scales, quantify fluctuations and distributions of bond lengths between neighboring regions, and derive effective spring constant values. Unlike the prevalent notion, our findings argue that coarse-grained chromatin beads must be considered as soft particles that can overlap, and we derive an effective inter-bead soft potential and quantify an overlap parameter. We also compute angle distributions giving insights into intrinsic folding and local bendability of chromatin. While the nucleosome-linker DNA bond angle naturally emerges from our work, we show two populations of local structural states. The bead sizes, bond lengths, and bond angles show different mean behavior at Topologically Associating Domain (TAD) boundaries and TAD interiors. We integrate our findings into a coarse-grained polymer model and provide quantitative estimates of all model parameters, which can serve as a foundational basis for all future coarse-grained chromatin simulations.

Mechanistic modeling of chromatin folding to understand function

Article 08 June 2020

Nucleosome plasticity is a critical element of chromatin liquid–liquid phase separation and multivalent nucleosome interactions

Article Open access 17 May 2021

Loop-extrusion and polymer phase-separation can co-exist at the single-molecule level to shape chromatin folding

Article Open access 13 July 2022

Introduction

The eukaryotic genome is organized in the form of many long chromatin polymer chains, each essentially a string of nucleosomes—DNA wrapped around histone proteins—folded, looped, and condensed into domains of different compaction^1,2,3,4. The spatial and temporal organization of these chains and their internal epigenetic states are crucial in deciding aspects ranging from cellular function to differentiation and development^5,6,7,8.

Chromosomes are typically simulated and studied as coarse-grained(CG) bead-spring polymer chains^9,10. A coarse-grained polymer picture is useful for many reasons: it is nearly impossible to simulate the huge polymer set (~millions of nucleosomes) in its entirety. More importantly, the coarse-grained representation, with effective parameters, can be a powerful tool to understand chromatin organization and dynamics and make useful predictions^{11,12,13,14,15,16,17,18,19,20,21,22,23,24}. However, since we still do not understand the chromatin structure and properties in detail, systematic coarse-graining has been a difficult task. We do not fully know the polymer properties/parameters relevant for simulating coarse-grained chromatin.

Owing to a large body of work over the past few decades, double-stranded DNA has a good coarse-grained description as a semi-flexible polymer^25,26,27. We understand its coarse-graining size, bending stiffness, stretching elasticity, and other relevant parameters^26,27. However, chromatin is a more complex polymer having heterogeneous properties arising from different epigenetic states, amount of different proteins bound, and local folding^28,29. This complexity makes it difficult to accurately compute coarse-grained bead diameter, elastic constants, and other physical properties for a chromatin polymer.

Recent experimental advances have made it possible to understand chromatin structure using biochemical methods like Hi-C, Micro-C^{30,31,32,33,34,35,36,37,38,39,40,41} and imaging methods like SAX, cryo-EM, and super-resolution imaging^{42,43,44,45,46,47,48,49}. The studies so far show that chromatin is organized into different compartments and topologically associated domains (TADs)^2,7,31,32,33. While histone modifications, transcription factors, and chromatin binding proteins greatly affect chromatin folding and make it a highly heterogeneous polymer, how the interplay between these factors decides the compaction and dynamics of chromatin is currently being investigated.

While different experimental methods provided us data to understand chromatin organization^{30,31,32,33,35,36,37,42,43,44,46,47,48,50,51}, theoretical/computational studies have been pivotal in understanding and explaining chromatin characteristics^{11,12,13,14,15,18,19,20,21,22,24,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67}. Models that simulated chromatin at nucleosome resolution primarily investigated how different molecular interactions influenced higher-order organization beyond the 10 nm chromatin^{53,55,56,61,66,68,69,70}. Nearly all models that are employed to understand Hi-C/microscopy data represented chromatin as a bead-spring polymer chain with each bead representing chromatin of length in the range ~1 kb to 1Mb^{12,13,14,15,16,17,20,21,22,23,24}. However, physical dimensions and elastic properties of chromatin are not well understood. What is the diameter of a 1 kb, 10 kb, or 100 kb chromatin bead? What is the magnitude of the spring constant that would represent the thermal fluctuation of chromatin at different length scales? Should chromatin be considered a flexible polymer or a semi-flexible polymer? Does it have an intrinsic curvature/bending stiffness? None of these questions have clear, definitive answers in the literature, currently. While it is well known that chromatin behavior is heterogeneous, depending on the epigenetic state, existing coarse-grained models of chromatin typically assume that the physical dimensions of the beads and elastic properties of the filament are uniform along the polymer, independent of the epigenetic state. How these properties—the size of the beads, stretching elasticity, bending elasticity, etc.—vary along the contour is also unknown. In the current models, the heterogeneity of chromatin is often incorporated into additional intra-chromatin interactions—interaction between two far-away beads^12,24,71. Since one does not know the size of a coarse-grained chromatin bead, it is taken as a fitting parameter—a constant number across the filament—to achieve experimentally measured 3D distance values^{12,15,22,24,72,73}. Supplementary Fig. 1 shows some of the reported values and how scattered they are. We do not understand this variability and what each number means.

One could not do systematic coarse-graining so far because the chromatin conformation capture data was available only with lower resolutions like 100 kb, 10 kb, and up to 1 kb^{30,31,32,33,50}. Obtaining information smaller than the HiC resolution was not possible. Moreover, even at the smallest size scale (~kb), the physical dimension of chromatin—chromatin bead diameter—was an unknown parameter. However, recent experiments have provided us chromatin conformation capture data at near-nucleosome resolution—200 bp resolution, which is essentially a nucleosome plus the linker DNA^34,35,39 (also see Supplementary Fig. 2a). This data enables us to make a fine-grained chromatin model and systematically probe the properties of the coarse-grained chromatin. The advantage here is that the physical size of a 200 bp chromatin is not a completely unknown free parameter; we do have a fair idea about the size of a 200 bp chromatin bead. In this work, we take advantage of this recent fine-grained data, start from the 200 bp Micro-C contact map, and construct chromatin polymer configurations that satisfy the map. From our work, without any arbitrary fitting parameter, the 3D distances and radius of gyration values emerge in a reasonable range comparable to known experiments. Using the 200 bp-chromatin as a fine-grained polymer, we coarse-grain the chromatin systematically. This enables us to predict several quantities essential for anyone simulating a coarse-grained chromatin polymer. We predict (i) the physical sizes of coarse-grained beads of various chromatin length scales, (ii) the overlap between coarse-grained beads and an effective inter-bead soft potential energy, (iii) the value of the spring constant between neighboring beads dictating the fluctuation, and (iv) the distributions of bond angles and dihedral angles giving insights into the stiffness of chromatin. We show that some of the ideas we learned—e.g., soft inter-bead interactions that allow overlap—are crucial for obtaining sensible 3D distances/R_g when coarse-grained models are employed.

Results

Constructing 3D chromatin configurations at near-nucleosome resolution consistent with Micro-C contact map and measured 3D distances

We simulated a fine-grained chromatin polymer made of “nucleosome-linker” (NL) beads, with each bead representing 200bp of chromatin (Fig. 1a, Methods and Supplementary Information (SI)), and generated an ensemble of steady-state chromatin configurations, taking the Micro-C contact probability data (P_ij) of mouse embryonic stem cells (mESCs) as input³⁵. We simulated ten different genomic loci (see Supplementary Table 1) having broad euchromatic or heterochromatic chromatin state characteristics. We computed the ensemble-averaged contact probability for each locus and compared them with the input contact map. The contact maps from the simulations appear visually similar to the Micro-C data (Fig. 1b–d and Supplementary Fig. 3). The bottom panel shows representative snapshots from the simulations. For the Ppm1g locus, the beads are colored based on the domains in the contact map (color-strip at the top of Fig. 1b–d), while for the Gm29683 and Cbx8 loci, the far-away heterochromatic regions interacting with each other are shown in red and blue color (Fig. 1c, d). Even though these are representative snapshots, one can see signatures of domain separation (configurations below Fig. 1b) and interaction among far-away regions (configurations below Fig. 1c, d). The contact probability versus genomic distance plots from simulations and experiments are comparable (Supplementary Fig. 4). We quantified the similarity between the experimental and simulation contact matrix by computing the stratum-adjusted correlation coefficient (SCC)⁷⁴ (see Supplementary Note 1D). The SCC values for most regions are above 0.9, suggesting that the simulations reproduce contact maps well (Supplementary Table 1). Beyond the contact map, we also compared the mean 3D distance from our simulations for the alpha globin region with the available experimental data^12,75 (see Fig. 1e). These results suggest that our simulations have generated an ensemble of configurations with the contact map and 3D distances comparable with experiments.

**Fig. 1: Chromatin configurations are consistent with experiments.**

We now systematically coarse-grain all the above chromatin polymers. We chose n_b consecutive NL beads to form a coarse-grained CG bead (colored big bead in Fig. 2a top right). The coarse-grained polymer consists of N/n_b number of CG beads. We then measured various properties of the coarse-grained polymer such as the size of a CG bead R_g, bond length l_cg, bond angle θ_cg (Fig. 2a bottom), and dihedral angle ϕ_cg (see below). We study how these properties depend on the coarse-graining size n_b and the genomic location. As a control, we have compared our chromatin results with the ideal chain (bead spring polymer with no self-avoiding interaction), the SAW (Self Avoiding Walk; bead spring polymer with Weeks–Chandler–Anderson potential), and a highly packed globule (bead spring polymer with attractive Lennard-Jones potential with ϵ = 1k_BT).

**Fig. 2: Coarse-graining, bead size and its variability along the genome.**

Predicting the size of coarse-grained chromatin beads and its variability along the genome

Since nearly all polymer simulations use coarse-grained beads of various genomic lengths like 1 kb, 10 kb, 100 kb, etc., it is important to understand the physical size (radius) of such a coarse-grained bead. Does bead size depend on the state of the chromatin (heterochromatin or euchromatin) and/or the genomic location of the bead (TAD interior, TAD boundaries)? To answer this, we first computed the radius of gyration (R_g) of the n_b consecutive beads that form a CG bead. Taking a sliding window of n_b beads, we plotted the average radius of gyration (${R}_{g}^{i}$) as a function of genomic location (Fig. 2b and Supplementary Fig. 5a; real units in X₂ and Y₂ axes). For both euchromatic and heterochromatic segments, R_g values vary along the genomic length for the same coarse-graining, representing heterogeneities along the chromatin. In contrast, R_g curves for SAW and packed globule do not vary along the polymer contour (Supplementary Fig. 5b, c). To understand how the locations of the peaks and troughs in the R_g correlate with the contact map, we compared them for a 1 kb coarse-graining scale (Fig. 2c, d and Supplementary Fig. 6). R_g values peak at the boundaries of TAD-like domains and are relatively small within the interior of the domain. That is, coarse-grained beads representing inter-TAD regions will have a larger physical dimension. This is consistent with what is observed in recent experiments⁴².

How big is a typical 1 kb, 5 kb, or 10 kb chromatin? We compared the mean R_g values (averaged over the entire region we considered) for several genomic segments – some of them are euchromatic, and others are heterochromatic regions in terms of dominating histone modification marks (Fig. 2e). Throughout this paper, the chromatin loci with broad euchromatic characteristics are plotted in shades of red, while the loci with broad heterochromatic marks are plotted using shades of blue. SAW polymer data is presented as a “control” having an expected ${R}_{g} \sim {n}_{b}^{0.6}$. Interestingly, even though there is variability among the gene region, they more or less fall in a narrow range. The range of average R_g values for 1 kb, 10 kb, and 80 kb chromatin regions are 27–28 nm, 85−97 nm, and 135−155 nm, respectively (see Fig. 2e inset). The curves for SAW and globule mark the extreme values possible. The R_g predictions for various regions from our simulations are of the same order as what is seen in experiments for the repressed chromatin domains in Drosophila cells⁴⁷ (orange data points Fig. 2e). Although the experimentally known R_g values are for a different cell type, they do indeed fall in the range that we are predicting, suggesting that R_g values from our simulation are reasonable. In Supplementary Fig. 5d, e, we also show the distribution of R_g values.

To independently check the order of magnitude of R_g values, we employed another realistic, detailed model of short chromatin with explicit nucleosomes having entry/exit angles and linker DNA explicitly (Supplementary Fig. 2b and Supplementary Note 1B). We find that the size of a 1 kb chromatin made of 5 nucleosomes in this model is comparable to what we predict using our basic fine-grained model (Supplementary Fig. 2c), suggesting that the fine-grained model we use is reasonable.

Coarse-grained chromatin beads are not hard spheres; they overlap impacting bond length and stiffness

Another important quantity is the distance between the centers of two neighboring coarse-grained beads, defined as the bond length l_cg, depicted in Fig. 2a. In Fig. 3, we present the bond length, its statistics, and the stretching elastic constants derived from it. First, mean l_cg values vary depending on the coarse-graining size and genomic location for all the genomic regions we studied. (Fig. 3a, b and Supplementary Fig. 7a, b). Similar to R_g, the bond length is also high at certain genomic locations like boundaries of TAD-like domains and low in the domain interiors (Fig. 2c, d and Supplementary Fig. 6). This is essentially the same behavior found in recent experiments⁴², which showed that the inter-TAD distances are larger than the intra-TAD distances, consistent with the spatial variation in coarse-grained bead sizes observed in our simulations. Given that a typical chromatin polymer will contain both euchromatic and heterochromatic regions of different compaction, it is instructive to compare extreme sizes of coarse-grained beads to understand the variability one can expect along the genome. For a 5 kb segment (n_b = 25), the mean l_cg values at different genomic locations vary in the range ≈95−120 nm (heterochromatin) to ≈105−150 nm (euchromatin)—compare n_b = 25 in Fig. 3a, b. Interestingly, these values are comparable to the l_cgmeasurements from microscopy experiments that “paint” 5 kb chromatin segments⁴⁷. This also implies that real chromatin will have highly heterogeneous bead dimensions, unlike the prevalent uniform bead size picture. The mean values of bond length, averaged along the polymer contour, are shown as a function of n_b (coarse-graining scale, genome size) for ten different chromatin loci (Fig. 3c). Equivalent coarse-grained bond lengths for SAW and globule are shown as control. Similar to R_g, there is some amount of variability in mean l_cg across different gene regions; however, they fall within the range of 58−67 nm for 1 kb and 114−132 nm for 10 kb.

**Fig. 3: Chromatin as bead-spring chain: bond length, spring constant and overlap.**

How is l_cg related to R_g? Naively one would expect that l_cg ≈ 2R_g. However, this is not the case; chromatin has l_cg < 2R_g because the two nearby polymer segments can “mix”—CG beads can overlap—and have their center of mass locations nearby (Fig. 3d). To quantify the overlap or mixing between the two adjacent coarse-grained beads, we define an overlap parameter ${{{{{{{\mathcal{O}}}}}}}}=(2\langle {R}_{g}\rangle /\langle {l}_{cg}\rangle )$ (see Fig. 3e and Supplementary Note 1D). If the coarse-grained regions are perfectly spherical and non-overlapping (no mixing) ${{{{{{{\mathcal{O}}}}}}}}\le 1$. Imagine two hard spheres of radius R_g connected by a spring. The thermal fluctuations will result in the average inter-bead distance (the equivalent of l_cg here) being slightly larger than 2R_g and ${{{{{{{\mathcal{O}}}}}}}} < 1$. As a control, one can see that for SAW, ${{{{{{{\mathcal{O}}}}}}}} < 1$ and is nearly independent of coarse-graining scale (n_b). For euchromatin and heterochromatin ${{{{{{{\mathcal{O}}}}}}}} > 1$ (i.e., l_cg < 2R_g), implying the mixing of adjacent polymer segments. We also find that ${{{{{{{\mathcal{O}}}}}}}}$ depends on the coarse-graining scale — overlap is high at larger n_b values. Since l_cg ≠ 2R_g, should one consider l_cg as the size (diameter) of an effective coarse-grained bead, or 2R_g? This is a relevant question for coarse-graining; given the fact that the beads can overlap, we propose that l_cg may be considered as the effective diameter of a coarse-grained bead since it is the effective bond length. We also computed ${{{{{{{\mathcal{O}}}}}}}}$ as a function of genomic location (Supplementary Fig. 8a,b). Comparing the overlap with the contact map shows that the overlap at the boundary of TAD-like domains is smaller than the domain interior (Fig. 2c, d and Supplementary Fig. 6). While the above quantity measures the overlap between neighboring regions ("bonded” CG beads), any two chromatin regions residing far away along the polymer contour ("nonbonded” CG beads) can also overlap. To quantify this overlap, we first computed the probability distribution of 3D distance r_ij between any two CG beads, P(r_ij)(Supplementary Fig. 8c). The probability that r_ij < l_cg is a measure of overlap among far away beads (see Supplementary Fig. 8d).

Going beyond the average size, we computed the bond length distribution P(l_cg) that has all the information about the fluctuation and higher moments. As a control, for the ideal polymer chain, P(l_cg) from the simulation matches with the analytical relation proposed by Laso et al.⁷⁶ (Supplementary Fig. 7c). We then plot P(l_cg) for different chromatin loci for different coarse-graining sizes (Fig. 3f and Supplementary Fig. 7d, e). From the distribution, we can derive an effective potential energy $V({l}_{cg})=-{k}_{{{{{{{{\rm{B}}}}}}}}}T\ln P({l}_{cg})$ with which two neighboring beads interact. Even though the distribution is not perfectly Gaussian, a measure of the elastic constant of the interaction can be computed from the inverse of the standard deviation. Hence, we define an effective spring constant between two neighboring CG beads as ${K}_{cg}=\frac{{k}_{B}T}{\langle {l}_{cg}^{2}\rangle -{\langle {l}_{cg}\rangle }^{2}},$ where the angular brackets indicate the average computed using P(l_cg). In Fig. 3g, we plot K_cg for different coarse-graining sizes and various gene loci.

The spring constant is scale dependent—it decreases as the coarse-graining size increases. For most gene regions, the spring constant values appear to saturate at a large coarse-graining scale, unlike the SAW polymer. The K_cg value for large n_b is in the range (0.1–1) k_BT/σ², which is ≈ (1–10) pN/μm. This value is roughly comparable to some of the experimentally measured values from pulling long chromatin under certain in vitro conditions⁷⁷. Note that, in contrast to pulling experiments where external forces can disrupt protein-mediated interactions, our estimate of the spring constant arises purely from thermal fluctuations and is thus expected to be a reliable signature of chromatin flexibility. Moreover, this is for relatively more dynamic MESc; hence the chromatin stretching stiffness obtained here will be less than that from the pulling experiments of the full-length mitotic chromosomes⁷⁸.

The spring constant above is presented in units of k_BT/σ² where σ is the size of a 200 bp NL bead. However, in coarse-grained polymer simulations, one uses K_cg in units of ${k}_{B}T/{l}_{cg}^{2}$. Since l_cg also depends on CG size (n_b), the spring constant has a non-trivial behavior and is presented in Fig. 3h. This gives a very useful range of numbers that can be used in all future coarse-grained simulations as K_cg = 5–10 k_BT$/{l}_{cg}^{2}$ for coarse-grained beads of size 1–20 kb.

Predicting angle distribution and stiffness of coarse-grained chromatin segments

How flexible is a chromatin polymer segment? Do chromatin polymer segments have intrinsic curvature? While we understand the bendability of DNA reasonably well, we know very little about the bending elastic behavior of chromatin. From the large ensemble of structures that we have produced, consistent with nucleosome level Micro-C data, we computed the distribution of the bond angle (θ_cg)—angle between two neighboring bonds connecting three consecutive CG beads (see Fig. 4a top), and the dihedral angle (ϕ_cg)—angle between two neighboring planes formed by three consecutive bond vectors (Fig. 4a bottom).

**Fig. 4: Angle distributions of chromatin segments revealing bendability.**

The bond angle is defined as θ_cg = π − α, where $\alpha={\cos }^{-1}({\hat{l}}_{cg}^{i}\cdot {\hat{l}}_{cg}^{i+1})$, and it can take any value in the range [0, π] (see Supplementary Note 1D). As a control, we computed the angle and its distribution for an ideal chain, and our results match well with the known analytical answer⁷⁶ for P(θ_cg) for different coarse-graining sizes (Supplementary Fig. 9a). Then we computed the distribution of angles P(θ_cg) for chromatin segments in different epigenetic states. As shown in Fig. 4b, for an ideal chain, even with no coarse-graining (n_b = 1), the distribution has a shape given by $P({\theta }_{cg})=\frac{1}{2}\sin ({\theta }_{cg})$^76,79. This is due to the geometric measure, and it implies that when θ_cg is near 90^∘, a large number of configurations are possible (having different azimuthal angles), while there is only one possible configuration for extreme cases if θ_cg = 0^∘ and θ_cg = 180^∘. Hence, to have a better understanding of the system, we also plot the corresponding probability density defined as $\tilde{P}({\theta }_{cg})=P({\theta }_{cg})/\sin ({\theta }_{cg})$ in Fig. 4c. For the ideal chain, with n_b = 1, $P({\theta }_{cg})/\sin ({\theta }_{cg})$ is a flat curve (uniform distribution) reiterating the fact that the ideal chain is unbiased, and all configurations are equally likely. For SAW, the excluded volume would ensure that configurations with θ_cg ≈ 0 are not possible, and there is a natural bias towards extended configurations (θ_cg > 90^∘).

The emergence of preferred inter-nucleosome angle from folded chromatin configurations: Our results here describe angle distribution for different of chromatin loci. For all the chromatin loci we simulated, at the nucleosomal (fine-grained, n_b = 1) resolution, a new peak emerges near θ_cg ≈ 60^∘ (Fig. 4b). The deviation from the ideal chain and SAW emerges due to intra-chromatin interactions. Since we do not impose any preferred angle in the fine-grained model, this population with angles near 60^∘ emerges purely from the packaging, based on the contact probability map. To understand this better, we deconvoluted the P(θ_cg) distribution and represented it as a sum of two Gaussian distributions giving us two populations having mean values near ≈60^∘ and ≈110^∘ (Supplementary Fig. 9b). Comparing the widths of the two populations suggests that the distribution with mean ≈60^∘ is 2–3 times stiffer than the population with mean ≈110^∘. For the highly folded globule, the peak at ≈60^∘ is even more prominent, suggesting that tighter packaging could result in a population with ≈60^∘ angles. In contrast to the prevalent notion of smaller angles around ≈60^∘, our analysis shows that, at least in the case of mESC chromatin, there is a prominent signature of two sub-populations of angles, one highly folded and one extended, for n_b = 1 (Fig. 4b, c).

Next, we examined the angle distribution of a coarse-grained chromatin polymer (Fig. 4d, e). A lesser-discussed fact about polymers (even for the ideal chain) is that, when coarse-graining is performed (n_b > 1), the angle distribution gains a bias (or a shift), with a preference emerging for the larger θ_cg angles (θ_cg > 90^∘) (See refs. ^76,79 and Supplementary Fig. 9a). The SAW polymer has an extra bias towards extended angles as smaller angles are disfavored due to excluded volume effects.

For coarse-gained chromatin, the angle distributions deviate a lot from the ideal chain and SAW, displaying a preferred intrinsic angle around ${\theta }_{cg}^{0}\approx 6{0}^{\circ }$ for a coarse-graining scale of 5 kb (Fig. 4d, e). For chromatin loci, consistent with ideal chain and SAW, coarse-graining initially shifts the angle distribution towards larger angles (see n_b = 5 in Supplementary Fig. 9c–f). However, for larger coarse-graining, long-range intra-chromatin interactions, such as TAD-forming loops, fold chromatin and shift the distribution towards smaller angles (see n_b = 10, 50 in Supplementary Fig. 9c–f).

The effect of coarse-graining and deviation from ideal/SAW chain behavior is visible in $\tilde{P}({\theta }_{cg})$ distribution as well (Fig. 4e). While the direct experimental readout of angles (e.g., via imaging) would yield P(θ_cg), the scaled $\tilde{P}({\theta }_{cg})$ is what would be useful for simulations; one can define an effective bond angle potential $V({\theta }_{cg})=-{k}_{B}T\log \tilde{P}({\theta }_{cg})$⁷⁹. The V(θ_cg) curves for chromatin segments have a well-defined minimum at preferred angles, which depends on the coarse-graining scale, n_b (Supplementary Fig. 10). One can also compute the effective bending “stiffness” of chromatin segments by comparing the inverse of the standard deviation around the local maxima of $\tilde{P}({\theta }_{cg})$ or by equating $V({\theta }_{cg})=\frac{{k}_{b}}{2}(1+cos({\theta }_{cg}-{\theta }_{cg}^{0}))$. We find that bending stiffness is of the order of thermal energy k_b ≈ k_BT for all coarse-graining scales. This suggests that the chromatin polymer is not highly stiff and explores a wide range of angles. This is consistent with the emerging notion that chromatin is highly dynamic^80,81,82,83, and has high cell-to-cell variability.

We examined how the angles vary along the genomic locations. Similar to R_g and l_cg, angles too have heterogeneity along the genome (Fig. 2c, d and Supplementary Fig. 9g, h). A comparison of the average angle for different genomic locations reveals that there are higher angles near TAD-like domain boundaries and lower angles in the interior of the domains. This spatial variation of chromatin properties could be important for understanding and reconstructing chromatin configurations.

Dihedral angle distribution: The distributions of the dihedral angles P(ϕ_cg) for different chromatin/polymer segments for fine-grained (n_b = 1) and coarse-grained (n_b = 25) level are shown in Fig. 4f, g. For the fine-grained model (n_b = 1), the ideal chain (control) has a uniform angle distribution as expected; the SAW polymer has a dip near ϕ = 0 indicating self-avoidance/steric hindrance (Fig. 4f). Even for an ideal chain, the coarse-graining leads to non-trivial changes in the ϕ distribution where a preference for smaller dihedral angles arises. For the fine-grained chromatin and globule, due to high folding, the probability of obtaining smaller angles (ϕ near zero) increases, and larger angles become rarer compared with the SAW polymer. Folding also leads to a peak near ϕ ≈ 60^∘, which is prominent for the globule.

A preference for smaller ϕ values appears on coarse-graining, similar to the Ideal chain. At the same time, the folding via long-range intra-chromatin interactions results in the formation of peaks near ϕ ≈ ± 60^∘. Both of these effects together define the coarse-grained ϕ distributions (Fig. 4g). Similar to the bending angle distribution, the dihedral angle distributions are very broad, implying a weak angle stiffness.

Similar to V(θ_cg), we define an effective dihedral potential $V({\phi }_{cg})=-{k}_{B}T\log P({\phi }_{cg})$ (see Supplementary Fig. 11). Assuming that the distributions of θ_cg and ϕ_cg are independent, we have plotted the heatmap of V(θ_cg, ϕ_cg) = V(θ_cg) + V(ϕ_cg) in the (θ_cg – ϕ_cg) plane (see Fig. 4h–j). Here the color-bar represents the energy V(θ_cg, ϕ_cg) values in k_BT units. For the fine-grained model, low values of θ_cg and ϕ_cg are penalized due to self-avoidance (Fig. 4h). This effect reduces with coarse-graining. For lower coarse-graining, higher θ_cg and intermediate ϕ_cg values are preferred (Fig. 4i), while for higher coarse-graining θ_cg in the range 50^∘–90^∘ and ϕ_cg values close to ±60^∘ are favored (Fig. 4j). This again shows that angle preferences for chromatin are scale-dependent—depending on the coarse-graining scale, the preferred values vary considerably.

Determining optimal soft inter-bead potential and simulating a coarse-grained chromatin

This work systematically estimates the size of coarse-grained beads, their fluctuations, the overlap among the beads due to the mixing of polymer segments, and the distribution of bond and dihedral angles. Here we integrate these quantities to simulate a coarse-grained chromatin polymer and predict 3D size or R_g measurable in microscopy experiments. While the spring constants and bead sizes (l_cg) can be directly used from our results discussed so far, we lack the non-bonded interaction potential that would ensure appropriate compaction. Therefore, we perform an iterative Boltzmann inversion (IBI) to determine the form of an inter-bead soft potential that would achieve the 3D distance distribution consistent with our original fine-grained simulation (see Supplementary Note 1C and Supplementary Fig. 12).

We implemented the iterative Boltzmann inversion method for the Arsg locus that we studied using the fine-grained model. Since the overlap (Fig. 3e) depends on the level of coarse-graining, it is expected that the potential energy would also depend on n_b. Hence, for each n_b, we simulated a coarse-grained bead-spring polymer with N/n_b beads connected by harmonic bonds with equilibrium bond length l_cg and spring constant K_cg taken from Fig. 3c, h. Starting with a flat inter-bead potential energy ${V}_{i=0}^{nb}=0$, at each step of iteration i, we simulated the polymer until equilibrium and updated this potential using the relation:

$${V}_{i+1}^{nb}(r)={V}_{i}^{nb}(r)+\alpha (r)\,{k}_{{{{{{{{\rm{B}}}}}}}}}T\,\ln \left(\frac{{P}_{i}(r)}{{P}_{{{{{{{{\rm{target}}}}}}}}}(r)}\right).$$

(1)

Here P_i(r) is the steady-state distribution of distances between all pairs of non-bonded beads. The distribution was compared with the known distance distribution, a target distribution P_target(r) for the corresponding level of coarse-graining from our fine-grained model. $\alpha (r)=0.2\,{e}^{-{r}^{2}/2}$ is taken as a decaying function to ensure that the resulting potential is short-range. We checked for the convergence of the algorithm by computing the Kullback-Leibler Divergence between the target and CG model distance distributions (see Supplementary Note 1C and Supplementary Fig. 13a). In other words, for each n_b, we have computed a soft potential energy function between CG bead pairs and used it to perform CG polymer simulations that would reproduce 3D distance distribution exactly as we got from our fine-grained model (see Fig. 5a). The resulting potential energy functions V^nb for various n_b values are shown in Fig. 5b. We also fit a functional form to this potential (solid line), such that the softness and depth of the potential can be tuned independently. We use the functional form

$${V}_{{{{{{{{\rm{soft}}}}}}}}}(r)=\left\{\begin{array}{ll}{V}_{0}{\left[1-{\left(\frac{r}{{r}_{m}}\right)}^{{\eta }_{1}}\right]}^{{\eta }_{2}}-\epsilon \quad &r < {r}_{m},\\ \frac{1}{2}\epsilon \left[\cos (\mu {r}^{2}+\nu )-1\right]\quad &{r}_{m}\leqslant r < {r}_{c},\\ 0\quad &r\geqslant {r}_{c}.\end{array}\right.$$

(2)

The first part of the equation (r < r_m) gives the repulsive part of the potential⁸⁴. Here, V₀ controls the height of the potential at r = 0 (see Supplementary Fig. 13b), r_m is the position of minima, and ϵ denotes the depth of the potential. The parameters η₁ and η₂ can be tuned to get the desired softness. The second part of the equation (r_m ⩽ r < r_c) represents the attractive part of the potential^24,85,86. This function has the advantage that it can ensure the continuity and differentiablity at r = r_m and r = r_c by tuning the values of μ and ν such that the value of potential is V_soft(r = r_m) = − ϵ and V_soft(r = r_c) = 0 (see Supplementary Note 1C and Supplementary Table 2). The negative slope of the corresponding potential ${F}^{nb}=\frac{-d{V}^{nb}}{dr}$ is plotted in Fig. 5c. The inverse of the maximum value of the force (1/F_max) can be used as a measure of the softness of CG beads (Fig. 5d).The depth of the potential captures the effective attractive interaction between a pair of beads (Fig. 5e). The important points to note are: (i) the potential is derived from fine-grained model that is consistent with the Micro-C experimental data. (ii) The potential is highly soft – softer than the typically used LJ potential (Supplementary Fig. 13c). (iii) The potential energy and the two important physical parameters of the potential — softness and the attractive interaction strength—are scale-dependent. Different levels of coarse-graining have different softness and interaction strength. This is highly relevant for anyone wanting to simulate chromatin as a coarse-grained bead spring polymer.

**Fig. 5: Soft inter-bead potential energy for coarse-grained chromatin beads.**

Finally, we compare the radius of gyration of the chromatin polymer predicted by our CG simulations with our fine-grained model for various levels of coarse-graining (Fig. 5f). The radius of gyration values match with the fine-grained model. Note that this is equivalent to comparing a coarse-grained model simulation results with microscopy experiments that label DNA/chromatin (e.g., methods that “paint" chromatin⁴⁷) with equivalent resolution. The radius of gyration of both fine-grained and coarse-grained polymers decreases slightly with increasing coarse-graining. This is because the distance of a CG bead from the center of mass of the polymer is smaller than the root mean square distance of the fine-grained beads it replaces (see Supplementary Fig. 13d). This also predicts that the overall R_g value of a long chromatin region (made of many painted small segments) will marginally decrease as one increases the length of the labeled (painted) segment. This decrease is of course less than the size of the painted segment (l_cg). As mentioned elsewhere in this manuscript, we find that the most probable value that we predict for l_cgis comparable with the available data from chromatin microscopy experiments that paint 5 kb segments.

Discussion

This paper addresses a fundamental question in modeling chromatin: what are the properties and parameters of a coarse-grained chromatin polymer, and how do they vary in a scale-dependent manner as we go from the ~10 nm nucleosome scale to hundreds of nanometers gene scale, domain scale or micron-sized chromatin scale? Recent papers have given us a good understanding of the scaling laws, TAD formation, roles of phase separation, loop extrusion, and so on^{72,82,87,88,89,90,91}. However, we do not understand the physical dimension of loci that we consider a “bead” in simulations, how stretchable chromatin loci are (spring constant), angle flexibility (bendability), how soft the inter-bead potentials are, and so on. We do not know how chromatin compaction (R_g), spring constant, bending angle, overlap, etc., depend on the local contact map (e.g., TAD) structures and epigenetic states.

To fill this gap, we used the recently published Micro-C contact map for mESCs and constructed an ensemble of chromatin configurations at 200 bp resolution. These configurations simultaneously satisfy three constraints: (i) they comply with the Micro-C contact probability, (ii) the mean 3D distance values computed from the configurations are comparable with known experiments, and (iii) the size of the 200 bp fine-grained bead (nucleosome + linker) is in a sensible range. We used this set of configurations and systematically coarse-grained them to predict physical properties and parameters relevant to a chromatin polymer bead-spring chain. We have determined the physical dimensions of chromatin loci (bead sizes of chromatin polymer) for ten different mESC gene regions having different epigenetic state characteristics. We have computed the distributions of the inter-bead distances, predicting how stretchable different chromatin loci are and quantifying their spring constants. We have also predicted the bending and dihedral angle fluctuations revealing how bendable chromatin loci are. Our work not only shows the similarity/variability among different loci but also reveals the effect of chromatin heterogeneity along the polymer contour, finding that TAD interior and TAD boundary have different properties and parameters—different CG bead dimensions, average angle values, overlap, etc. Contrary to the prevalent notion, our results show that CG chromatin beads should be modeled as soft particles that can overlap. We then compute the inter-bead soft potential and propose a functional form to quantify the softness. All our predictions reveal how chromatin properties and parameters change in a scale-dependent manner.

The chromatin polymer parameter values that we have predicted—bead sizes, spring constants, angle distributions, overlap/softness, etc.—are essential for anyone wanting to simulate chromatin polymer. We provide a comprehensive prediction of numerical values of all parameters starting with nucleosome resolution data. Moreover, our finding that chromatin polymer parameters depend on the scale one chooses to study is significant. The polymer parameters relevant for 1 kb chromatin are not the same as that for 10 kb or 100 kb chromatin, which is essential to account for in future simulations. We also argue that many of these parameters (like overlap) are crucial for predicting 3D distance accurately. We have determined an effective inter-bead potential via an iterative Boltzmann inversion method. We used all of these CG results to compute the R_g of a chromatin locus. In other words, our claim is: we have computed the relevant parameters for a polymer simulation of mESC chromatin at different scales. Anyone can use our parameters, simulate coarse-grained chromatin satisfying contact probability, and predict average 3D distances reasonably well within the region-to-region variability we show. Our work has biological significance for connecting chromatin structure to function. Many of the biological processes like recombination, DNA breakage/repair, enhancer-activation, and spreading of histone modifications occur at the scale of nucleosomes. The 3D structure we predict at nucleosome resolution is crucial for understanding these functional aspects. Our work connects the coarse-grained picture (100 nm to μm scale experiments having a few kb or Mb resolution) with a nucleosome-resolution picture and will enable Hi-C or Microscopy experiments to extrapolate and predict nucleosome-level structure. This is highly relevant for understanding the biological functions that occur at nucleosome resolution.

While building the fine-grained model, we made minimal assumptions. The primary assumption we made is that all the chromatin details (e.g., inter-nucleosome interaction potential, histone tails, etc.) result in deciding the contact probability; generating conformations that satisfy the contact map would implicitly account for the role of various local chemical and structural details. Since we use the Micro-C data with 200 bp resolution, our model (model-I) cannot study details below this resolution. We also employed a model with linker DNA (model-II) and showed that our model-I results are sensible. Using model II, we also examined how the variability in nucleosome positioning would affect the overall size (R_g) of the folded chromatin. Apart from studying a fixed linker length of 50 bp, we have also performed simulations choosing linker lengths from a Gaussian distribution to incorporate variability. If there are 5 ± 1 nucleosomes in 1 kb chromatin, the mean R_g is roughly the same order of magnitude as we reported (Supplementary Fig. 2d). We have also reported these quantities for different mean linker length values. The difference in mean may represent different chromatin states.

For the ten gene loci we studied, heterochromatic and euchromatic regions have R_g, angles, and other quantities in a comparable range. This could be because (i) our study is for an embryonic stem cell where the chromatin could be more open. (ii) The underlying Micro-C data itself shows that heterochromatic and euchromatic regions have comparable contact probability as a function of genomic distance P(s) (Supplementary Fig. 4e). This can also be consistent with the irregular nature of chromatin organization as indicated by the power law decay of P(s). Recent experiments have also indicated that heterochromatin can be diverse, and euchromatin can get highly folded due to multiple loops, resulting in similar compaction and other physical properties^48,92,93.

Our work predicts the mean values and variability in bead sizes and other physical properties like elasticity and bendability. It has been suggested that the variability in thickness and flexibilities could affect the chromatin properties below 100 kb⁹⁴. This implies that the variability we find may be relevant since many of the enhancers and promoters can be within 100 kb^40,95. However, note that, apart from variability, we predict the average bead size, l_cg, K_cg, etc.; the change in the average value would affect measurable quantities at all length scales.

One of the important results of our work is the inter-bead soft potential and the quantification of overlap. Very high-resolution models or models that used sub-beads to represent a larger CG bead would have some signatures of overlap^21,72. However, most of the current coarse-grained simulation studies use the Lennard-Jones potential for inter-bead interactions, and it quickly goes to infinity with negligible softness. Unlike earlier models^{12,13,21,72,84}, here we derive the functional form of the soft-potential starting with nucleosome-resolution contact map data and quantify the softness in a scale-dependent manner. One of the concerns regarding the soft potentials is that it may allow chain crossing leading to incorrect dynamics. However, recent experiments show that chain crossings are indeed present, and topoisomerase activity is required to remove these crossings and have entanglement-free interphase chromosomes⁹⁶. This implies that more accurate dynamics would require the presence of enzymes like topoisomerase that actively regulate chromosome topology in terms of entanglements. This may be an essential feature necessary to study dynamics in coarse-grained models.

Experimental tests of our predictions: We simulate the fine-grained model (scale ~10−20 nm) and predict quantities at a much larger scale (~100 nm−μm) that can be measured in experiments. Our predictions of the radius of gyration, bond length, and 3D distances can be tested using microscopy experiments, and we have compared some of them in Fig. 1e and Fig. 2e. Combining biochemistry and microscopy, recent studies have proposed methods to “paint” chromatin segments (size ≈5 kb or higher) and trace the chromatin contour. This method allows one to test many polymer predictions, including coarse-grained inter-bead distances and angle fluctuations. Even though the experimental data is not available for the mESC segments that we simulated, we compared our predictions with the available data, and we found that the most probable value that we predict for l_cg is comparable with the measured data⁴⁷. We also find that our prediction of the fluctuation of the angles—width of the angle distribution—is comparable to the experimentally measured values. Such experiments may be performed for the mESC gene regions we simulated to compare with our predictions. Future experiments could also test how these values change as one changes the segment size indicating how bead sizes and bendability would vary with the choice of the coarse-graining scale. Future experiments may also measure spring constants at different scales, either through measuring chromatin segment fluctuations or doing pulling experiments at various scales. All of our predictions can be tested using microscopy, chromatin pulling, and other biophysical experiments.

It must be stated that the whole of our analysis is based on the Micro-C data for the mESCs from Hsieh et al.³⁵. Hence the numbers emerging from this study would represent embryonic stem cell chromatin. In the future, analysis can be further extended to study various other cell types as new data emerge. The future direction is also to understand the role of nucleosome positioning heterogeneity and assembly/disassembly/sliding kinetics. It requires a much more detailed polymer model⁹⁷ and a model to understand how chromatin conformation capture contact maps are influenced by the heterogeneity of nucleosome organization.

Methods

Model-I. Fine-grained chromatin model with 200 bp resolution chromatin: Our basic model is the fine-grained chromatin polymer model with 200 bp resolution, constructed based on the publicly available Micro-C data for mouse embryonic stem cells (mESCs)^35,36. The polymer is made of N spherical beads, having the size of 200 bp chromatin (diameter σ), with nearest-neighbor connectivity via harmonic springs and self-avoiding interaction via the repulsive part of the Lennard-Jones potential (see Fig. 1a, Supplementary Note 1A). Since each bead consists of a nucleosome and 50bp linker DNA, we call the bead a “nucleosome-linker" (NL) bead. To generate an ensemble of configurations consistent with Micro-C data, we connected (brought into proximity) bead pairs i and j with the experimentally observed contact probability P_ij in a two-step process. First, we defined a set of prominent (strong) contacts of the Micro-C contact map (see Supplementary Note 1A)⁷¹. Taking only the prominent contact probability values, we inserted harmonic springs between bead pairs i and j if r_n < P_ij, where r_n is a uniformly distributed random number between 0 and 1. Using this procedure, we generated 1000 independent polymer configurations and equilibrated them using Langevin simulations with LAMMPS⁹⁸. We defined “prominent contacts" as follows⁷¹: Since the contact map depends only on ∣i − j∣ for homogeneous polymers, we took the set of all P_ij values for a given ∣i−j∣ and computed their mean and standard deviation. If P_ij was at least one standard deviation larger than the mean, we considered it as a prominent contact (see Supplementary Note 1A). Prominent contacts are defined for each ∣i − j∣ line in the matrix (line parallel to the diagonal representing all equidistant bead pairs). Bonding prominent contacts ensured that all actively acquired far-away contacts (e.g., contacts via loop extrusion) were present.

In the second step, going beyond the prominent contacts, our aim is to insert contacts in the P_ij fraction of the configurations (out of the 1000 configurations) for each bead pair (i, j). To achieve this, we started with the ensemble of equilibrated configurations from step-1 and inserted harmonic springs between beads i and j in the P_ij fraction of configurations whose 3D distances (r_ij) are the smallest (see SI). This system was then equilibrated using Langevin simulations with LAMMPS. While the first step ensured that the strong contacts formed via events like loop extrusion were established, the second step ensured that all bonds closer in space would have priority in forming protein-mediated contacts.

We have used the minimal fine-grained model that accounts for polymer connectivity, self-avoidance, and contact probability. The assumption here is that all other properties of the fine-grained polymer (like inter-nucleosome interactions and stiffness) lead to the experimentally observed contact probability, which we have ensured. Our model generates all possible polymer configurations such that the experimentally known constraint of the contact map is satisfied.

Size of a 200 bp chromatin bead (σ): Since a 200 bp chromatin bead is bigger than a nucleosome, its size has to be greater than the size of the nucleosome (11 nm)³. As geometrically shown in Supplementary Fig. 2a, since two neighboring nucleosomes are connected via a rigid 50 bp linker DNA, the distance between them can be ≈28 nm. However, two far-away nucleosomes can come as close as 11–12 nm (with histone tails and other bound proteins). Hence, on average, one expects an effective size ≈20 nm. In the Results section, we have shown that when σ = 21 nm, the 3D distances and R_g values match well with experimental data. This is sensible considering the linker length and that a typical nucleosome in vivo will likely be covered by several enzymes/proteins like acetyl/methyl transferases, HMG, HP1, remodelers, etc. This is also consistent with the earlier observation that σ = 25 nm for 250 bp beads⁷².

Since Model-I did not have explicit linker DNA, we also simulated short chromatin with nucleosomes, explicit linker DNA, and entry-exit angles between nucleosomal DNA. In this detailed Model-II, the chromatin polymer has two types of beads—linker DNA bead and nucleosome bead (see Supplementary Note 1B)⁹⁹. In the Results section, we have compared the radius of gyration of chromatin segments from Model-II and the first fine-grained model. This also suggests that our σ = 21nm value is indeed reasonable.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Published Micro-C data³⁵ used in this study is available at the Gene Expression Omnibus (GEO) database with accession number GSE130275. Relevant data generated from this study are included in this article’s Figures, text, and supplementary information. Source data are provided with this paper.

Code availability

All the simulations, analysis and visualization in this study were performed using publicly available software packages or custom codes. LAMMPS (16 March 2018) version was used to perform Langevin Dynamics simulations. VMD version 1.9.3 was used for visualization of 3D polymer configurations and computation of dihedral angles. Custom codes were used for all other analysis. All the codes required to perform the simulations are available in the repository: https://github.com/sangramkadam/chromatin_coarse_graining¹⁰⁰.

References

Alberts, B. Molecular Biology of The Cell, 6th edn (Garland Science, Taylor and Francis Group, New York, 2014).
Bickmore, W. A. The spatial organization of the human genome. Annu. Rev. Genomics Hum. Genet. 14, 67–84 (2013).
Article CAS PubMed Google Scholar
Kornberg, R. D. & Lorch, Y. Twenty-five years of the nucleosome, fundamental particle of the eukaryote chromosome. Cell 98, 285–294 (1999).
Article CAS PubMed Google Scholar
Rowley, M. J. & Corces, V. G. Organizational principles of 3D genome architecture. Nat. Rev. Genet. 19, 1 (2018).
Article Google Scholar
Hug, C. B. & Vaquerizas, J. M. The birth of the 3d genome during early embryonic development. Trends Genet. 34, 903–914 (2018).
Article CAS PubMed Google Scholar
Long, H. K., Prescott, S. L. & Wysocka, J. Ever-changing landscapes: transcriptional enhancers in development and evolution. Cell 167, 1170–1187 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bonev, B. & Cavalli, G. Organization and function of the 3d genome. Nat. Rev. Genet. 17, 661–678 (2016).
Article CAS PubMed Google Scholar
Stephens, A. D., Banigan, E. J. & Marko, J. F. Chromatin’s physical properties shape the nucleus and its functions. Curr. Opin. Cell Biol. 58, 76–84 (2019).
Article CAS PubMed PubMed Central Google Scholar
Oluwadare, O., Highsmith, M. & Cheng, J. An overview of methods for reconstructing 3-d chromosome and genome structures from hi-c data. Biol. Proced. Online 21, 1–20 (2019).
Article Google Scholar
Bianco, S. et al. Computational approaches from polymer physics to investigate chromatin folding. Curr. Opin. Cell Biol. 64, 10–17 (2020).
Article CAS PubMed Google Scholar
Di Stefano, M., Paulsen, J., Jost, D. & Marti-Renom, M. A. 4d nucleome modeling. Curr. Opin. Genet. Dev. 67, 25–32 (2021).
Article PubMed PubMed Central Google Scholar
Giorgetti, L. et al. Predictive polymer modeling reveals coupled fluctuations in chromosome conformation and transcription. Cell 157, 950–963 (2014).
Article CAS PubMed PubMed Central Google Scholar
Di Pierro, M., Zhang, B., Aiden, E. L., Wolynes, P. G. & Onuchic, J. N. Transferable model for chromosome architecture. PNAS 113, 12168–12173 (2016).
Article PubMed PubMed Central ADS Google Scholar
MacPherson, Q., Beltran, B. & Spakowitz, A. J. Bottom–up modeling of chromatin segregation due to epigenetic modifications. PNAS 115, 12739–12744 (2018).
Article CAS PubMed PubMed Central ADS Google Scholar
Brackley, C. A. et al. Predicting the three-dimensional folding of cis-regulatory regions in mammalian genomes using bioinformatic data and polymer models. Genome Biol. 17, 1–16 (2016).
Article Google Scholar
Brackey, C. A., Marenduzzo, D. & Gilbert, N. Mechanistic modeling of chromatin folding to understand function. Nat. Methods 17, 767–775 (2020).
Article CAS PubMed Google Scholar
Bajpai, G., Pavlov, D. A., Lorber, D., Volk, T. & Safran, S. Mesoscale phase separation of chromatin in the nucleus. Elife 10, e63976 (2021).
Article CAS PubMed PubMed Central Google Scholar
Falk, M. et al. Heterochromatin drives compartmentalization of inverted and conventional nuclei. Nature 570, 395–399 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Goloborodko, A., Marko, J. F. & Mirny, L. A. Chromosome compaction by active loop extrusion. Biophys. J. 110, 2162–2168 (2016).
Article CAS PubMed PubMed Central ADS Google Scholar
Ghosh, S. K. & Jost, D. How epigenome drives chromatin folding and dynamics, insights from efficient coarse-grained models of chromosomes. PLoS Comput. Biol. 14, e1006159 (2018).
Article PubMed PubMed Central ADS Google Scholar
Bianco, S. et al. Polymer physics predicts the effects of structural variants on chromatin architecture. Nat. Genet. 50, 662–667 (2018).
Article CAS PubMed Google Scholar
Shi, G., Liu, L., Hyeon, C. & Thirumalai, D. Interphase human chromosome exhibits out of equilibrium glassy dynamics. Nat. Commun. 9, 1–13 (2018).
Article Google Scholar
Ganai, N., Sengupta, S. & Menon, G. I. Chromosome positioning from activity-based segregation. Nucleic Acids Res. 42, 4145–4159 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kumari, K., Duenweg, B., Padinhateeri, R. & Prakash, J. R. Computing 3D chromatin configurations from contact probability maps by inverse brownian dynamics. Biophys. J. 118, 2193–2208 (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Chatenay, D., Cocco, S., Monasson, R., Thieffry, D. & Dalibard, J. Multiple Aspects of DNA and RNA: from Biophysics to Bioinformatics: Lecture Notes of the Les Houches Summer School 2004 (Elsevier, 2005).
Bustamante, C., Bryant, Z. & Smith, S. B. Ten years of tension: single-molecule dna mechanics. Nature 421, 423–427 (2003).
Article PubMed ADS Google Scholar
Marko, J. F. & Cocco, S. The micromechanics of dna. Phys. World 16, 37 (2003).
Article CAS Google Scholar
Mir, M., Bickmore, W., Furlong, E. E. & Narlikar, G. Chromatin topology, condensates and gene regulation: shifting paradigms or just a phase? Development 146, dev182766 (2019).
Article CAS PubMed PubMed Central Google Scholar
Klemm, S. L., Shipony, Z. & Greenleaf, W. J. Chromatin accessibility and the regulatory epigenome. Nat. Rev. Genet. 20, 207–220 (2019).
Article CAS PubMed Google Scholar
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
Article CAS PubMed PubMed Central ADS Google Scholar
Nora, E. P. et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature 485, 381–385 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Rao, S. S. et al. A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping. Cell 159, 1665 – 1680 (2014).
Article PubMed PubMed Central Google Scholar
Hsieh, T.-H. S. et al. Mapping nucleosome resolution chromosome folding in yeast by micro-c. Cell 162, 108–119 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hsieh, T.-H. S. et al. Resolving the 3d landscape of transcription-linked mammalian chromatin folding. Mol. cell 78, 539–553 (2020).
Article CAS PubMed PubMed Central Google Scholar
Krietenstein, N. & Rando, O. J. Mesoscale organization of the chromatin fiber. Curr. Opin. Genet. Dev. 61, 32–36 (2020).
Article CAS PubMed Google Scholar
Jäger, R. et al. Capture hi-c identifies the chromatin interactome of colorectal cancer risk loci. Nat. Commun. 6, 1–9 (2015).
Article Google Scholar
Ramani, V. et al. Mapping 3d genome architecture through in situ dnase hi-c. Nat. Protoc. 11, 2104–2121 (2016).
Article CAS PubMed PubMed Central Google Scholar
Swygert, S. G. et al. Condensin-dependent chromatin compaction represses transcription globally during quiescence. Mol. Cell 73, 533–546 (2019).
Article CAS PubMed Google Scholar
Islam, Z. et al. Active enhancers strengthen insulation by rna-mediated ctcf binding at chromatin domain boundaries. Genome Res. 33, 1–17 (2023).
Article PubMed PubMed Central Google Scholar
Guin, K. et al. Spatial inter-centromeric interactions facilitated the emergence of evolutionary new centromeres. Elife 9, e58556 (2020).
Article CAS PubMed PubMed Central Google Scholar
Szabo, Q. et al. Regulation of single-cell genome organization into tads and chromatin nanodomains. Nat. Genet. 52, 1151–1157 (2020).
Article CAS PubMed PubMed Central Google Scholar
Imai, R. et al. Density imaging of heterochromatin in live cells using orientation-independent-dic microscopy. Mol. Biol. Cell 28, 3349–3359 (2017).
Article CAS PubMed PubMed Central Google Scholar
Maeshima, K., Ide, S. & Babokhov, M. Dynamic chromatin organization without the 30-nm fiber. Curr. Opin. Cell Biol. 58, 95 – 104 (2019).
Article PubMed Google Scholar
Eltsov, M. et al. Nucleosome conformational variability in solution and in interphase nuclei evidenced by cryo-electron microscopy of vitreous sections. Nucleic Acids Res. 46, 9189–9200 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ou, H. D. et al. ChromEMT: visualizing 3D chromatin structure and compaction in interphase and mitotic cells. Science 357, 1–13 (2017).
Article Google Scholar
Boettiger, A. N. et al. Super-resolution imaging reveals distinct chromatin folding for different epigenetic states. Nature 529, 418–422 (2016).
Article CAS PubMed PubMed Central ADS Google Scholar
Shaban, H. A., Barth, R., Recoules, L. & Bystricky, K. Hi-d: nanoscale mapping of nuclear dynamics in single living cells. Genome Biol. 21, 1–21 (2020).
Article Google Scholar
Parmar, J. J., Woringer, M. & Zimmer, C. How the genome folds: the biophysics of four-dimensional chromatin organization. Ann. Rev. Biophys. 48, 231–253 (2019).
Article CAS Google Scholar
Dekker, J., Rippe, K., Dekker, M. & Kleckner, N. Capturing chromosome conformation. Science 295, 1306–1311 (2002).
Article CAS PubMed ADS Google Scholar
Nishino, Y. et al. Human mitotic chromosomes consist predominantly of irregularly folded nucleosome fibres without a 30-nm chromatin structure. EMBO J. 31, 1644–1653 (2012).
Article CAS PubMed PubMed Central Google Scholar
Jost, D., Carrivain, P., Cavalli, G. & Vaillant, C. Modeling epigenome folding: formation and dynamics of topologically associated chromatin domains. Nucleic Acids Res. 42, 9553–9561 (2014).
Article CAS PubMed PubMed Central Google Scholar
Huertas, J., Woods, E. J. & Collepardo-Guevara, R. Multiscale modelling of chromatin organisation: Resolving nucleosomes at near-atomistic resolution inside genes. Curr. Opin. Cell Biol. 75, 102067 (2022).
Article CAS PubMed Google Scholar
Fudenberg, G. et al. Formation of chromosomal domains by loop extrusion. Cell Rep. 15, 2038–2049 (2016).
Article CAS PubMed PubMed Central Google Scholar
Parmar, J. J. & Padinhateeri, R. Nucleosome positioning and chromatin organization. Curr. Opin. Struct. Biol. 64, 111–118 (2020).
Article CAS PubMed Google Scholar
Bascom, G. D., Myers, C. G. & Schlick, T. Mesoscale modeling reveals formation of an epigenetically driven hoxc gene hub. PNAS 116, 4955–4962 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Rosa, A. & Everaers, R. Structure and dynamics of interphase chromosomes. PLoS Comput. Biol. 4, e1000153 (2008).
Article MathSciNet PubMed PubMed Central ADS Google Scholar
Clarkson, C. T. et al. Ctcf-dependent chromatin boundaries formed by asymmetric nucleosome arrays with decreased linker length. Nucleic Acids Res. 47, 11181–11196 (2019).
Article CAS PubMed PubMed Central Google Scholar
Conte, M. et al. Polymer physics indicates chromatin folding variability across single-cells results from state degeneracy in phase separation. Nat. Commun. 11, 3289 (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Qi, Y. & Zhang, B. Predicting three-dimensional genome organization with chromatin states. PLoS Comput. Biol. 15, e1007024 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Bajpai, G. & Padinhateeri, R. Irregular chromatin: packing density, fiber width, and occurrence of heterogeneous clusters. Biophys. J. 118, 207–218 (2020).
Article CAS PubMed ADS Google Scholar
Tjong, H. et al. Population-based 3d genome structure analysis reveals driving forces in spatial genome organization. Proc. Natl Acad. Sci. 113, E1663–E1672 (2016).
Article CAS PubMed PubMed Central Google Scholar
Hua, N. et al. Producing genome structure populations with the dynamic and automated pgs software. Nat. Protoc. 13, 915–926 (2018).
Article CAS PubMed PubMed Central Google Scholar
Tiana, G. et al. Structural fluctuations of the chromatin fiber within topologically associating domains. Biophys. J. 110, 1234–1245 (2016).
Article CAS PubMed PubMed Central ADS Google Scholar
Tortora, M. M., Salari, H. & Jost, D. Chromosome dynamics during interphase: a biophysical perspective. Curr. Opin. Genet. Dev. 61, 37–43 (2020).
Article CAS PubMed Google Scholar
Ohno, M. et al. Sub-nucleosomal genome structure reveals distinct nucleosome folding motifs. Cell 176, 520–534 (2019).
Article CAS PubMed Google Scholar
Guha, S. & Mitra, M. K. Multivalent binding proteins can drive collapse and reswelling of chromatin in confinement. Soft Matter 19, 153–163 (2023).
Article CAS ADS Google Scholar
Collepardo-Guevara, R. & Schlick, T. Chromatin fiber polymorphism triggered by variations of dna linker lengths. Proc. Natl Acad. Sci. 111, 8061–8066 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Farr, S. E., Woods, E. J., Joseph, J. A., Garaizar, A. & Collepardo-Guevara, R. Nucleosome plasticity is a critical element of chromatin liquid–liquid phase separation and multivalent nucleosome interactions. Nat. Commun. 12, 1–17 (2021).
Article Google Scholar
Grigoryev, S. A. Chromatin higher-order folding: a perspective with linker dna angles. Biophys. J. 114, 2290–2297 (2018).
Article CAS PubMed PubMed Central ADS Google Scholar
Kumari, K., Prakash, J. R. & Padinhateeri, R. Heterogeneous interactions and polymer entropy decide organization and dynamics of chromatin domains. Biophys. J. 121, 2794–2812 (2022).
Chiariello, A. M. et al. A dynamic folded hairpin conformation is associated with α-globin activation in erythroid cells. Cell Rep. 30, 2125–2135 (2020).
Article CAS PubMed Google Scholar
Forte, G. et al. Transcription modulates chromatin dynamics and locus configuration sampling. bioRxiv (2021).
Yang, T. et al. Hicrep: assessing the reproducibility of hi-c data using a stratum-adjusted correlation coefficient. Genome Res. 27, 1939–1949 (2017).
Article CAS PubMed PubMed Central Google Scholar
Brown, J. M. et al. A tissue-specific self-interacting chromatin domain forms independently of enhancer-promoter interactions. Nat. Commun. 9, 1–15 (2018).
Article ADS Google Scholar
Laso, M., Öttinger, H. & Suter, U. Bond-length and bond-angle distributions in coarse-grained polymer chains. J. Chem. Phys. 95, 2178–2182 (1991).
Article CAS ADS Google Scholar
Cui, Y. & Bustamante, C. Pulling a single chromatin fiber reveals the forces that maintain its higher-order structure. Proc. Natl Acad. Sci. 97, 127–132 (2000).
Article CAS PubMed PubMed Central ADS Google Scholar
Poirier, M. G. & Marko, J. F. Mitotic chromosomes are chromatin networks without a mechanically contiguous protein scaffold. Proc. Natl Acad. Sci. 99, 15393–15397 (2002).
Article CAS PubMed PubMed Central ADS Google Scholar
Vettorel, T., Besold, G. & Kremer, K. Fluctuating soft-sphere approach to coarse-graining of polymer models. Soft Matter 6, 2282–2292 (2010).
Article CAS ADS Google Scholar
Hansen, A. S., Cattoglio, C., Darzacq, X. & Tjian, R. Recent evidence that tads and chromatin loops are dynamic structures. Nucleus 9, 20–32 (2018).
Article CAS PubMed Google Scholar
Prieto, E. I. & Maeshima, K. Dynamic chromatin organization in the cell. Essays Biochem. 63, 133–145 (2019).
Article CAS PubMed Google Scholar
Gabriele, M. et al. Dynamics of ctcf-and cohesin-mediated chromatin looping revealed by live-cell imaging. Science 376, 496–501 (2022).
Article CAS PubMed PubMed Central ADS Google Scholar
Hansen, A. S., Pustova, I., Cattoglio, C., Tjian, R. & Darzacq, X. Ctcf and cohesin regulate chromatin loop stability with distinct dynamics. elife 6, e25776 (2017).
Article PubMed PubMed Central Google Scholar
Fujishiro, S. & Sasai, M. Generation of dynamic three-dimensional genome structure through phase separation of chromatin. Proc. Natl Acad. Sci. 119, e2109838119 (2022).
Article CAS PubMed PubMed Central Google Scholar
Soddemann, T., Dünweg, B. & Kremer, K. A generic computer model for amphiphilic systems. Eur. Phys. J. E 6, 409–419 (2001).
Article CAS Google Scholar
Santra, A., Kumari, K., Padinhateeri, R., Dünweg, B. & Prakash, J. R. Universality of the collapse transition of sticky polymers. Soft Matter 15, 7876–7887 (2019).
Article CAS PubMed ADS Google Scholar
Bintu, B. et al. Super-resolution chromatin tracing reveals domains and cooperative interactions in single cells. Science 362, eaau1783 (2018).
Article PubMed PubMed Central ADS Google Scholar
Gibson, B. A. et al. Organization of chromatin by intrinsic and regulated phase separation. Cell 179, 470–484 (2019).
Article CAS PubMed PubMed Central Google Scholar
Conte, M. et al. Loop-extrusion and polymer phase-separation can co-exist at the single-molecule level to shape chromatin folding. Nat. Commun. 13, 4070 (2022).
Article CAS PubMed PubMed Central ADS Google Scholar
Racko, D., Benedetti, F., Dorier, J. & Stasiak, A. Are tads supercoiled? Nucleic Acids Res. 47, 521–532 (2019).
Article CAS PubMed Google Scholar
Tang, Z. et al. Ctcf-mediated human 3d genome architecture reveals chromatin topology for transcription. Cell 163, 1611–1627 (2015).
Article CAS PubMed PubMed Central Google Scholar
Buckle, A., Brackley, C. A., Boyle, S., Marenduzzo, D. & Gilbert, N. Polymer simulations of heteromorphic chromatin predict the 3d folding of complex genomic loci. Mol. Cell 72, 786–797 (2018).
Article CAS PubMed PubMed Central Google Scholar
Spracklin, G. et al. Diverse silent chromatin states modulate genome compartmentalization and loop extrusion barriers. Nat. Struct. Mol. Biol. 30, 38–51 (2023).
Florescu, A.-M., Therizols, P. & Rosa, A. Large scale chromosome folding is stable against local changes in chromatin structure. PLoS Comput. Biol. 12, e1004987 (2016).
Article PubMed PubMed Central Google Scholar
Oh, S. et al. Enhancer release and retargeting activates disease-susceptibility genes. Nature 595, 735–740 (2021).
Article CAS PubMed ADS Google Scholar
Hildebrand, E. M. et al. Chromosome decompaction and cohesin direct topoisomerase ii activity to establish and maintain an unentangled interphase genome. bioRxiv 2022–10 (2022).
Lu, W., Onuchic, J. N. & Di Pierro, M. An associative memory hamiltonian model for dna and nucleosomes. PLOS Comput. Biol. 19, e1011013 (2023).
Article CAS PubMed PubMed Central Google Scholar
Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117, 1–19 (1995).
Article CAS MATH ADS Google Scholar
Bajpai, G., Jain, I., Inamdar, M. M., Das, D. & Padinhateeri, R. Binding of dna-bending non-histone proteins destabilizes regular 30-nm chromatin structure. PLoS Comput. Biol. 13, e1005365 (2017).
Article PubMed PubMed Central ADS Google Scholar
Kadam, S. et al. Predicting scale-dependent chromatin polymer properties from systematic coarse-graining. sangramkadam/chromatin_coarse_graining https://doi.org/10.5281/zenodo.8064568 (2023).

Download references

Acknowledgements

R.P. acknowledges useful discussions with Xavier Darzacq, Leonid Mirny, Daniel Jost, Geeta Narlikar, and Marc Marti-Renom. We acknowledge useful discussions with Vladimir Teif, Mayuri Rege, PB Sunil Kumar, Madan Rao, and Gaurav Bajpai. S.K. acknowledges fellowship support from the CSIR, India, and KK acknowledges iPDF support from IIT Bombay. We acknowledge funding from the Department of Biotechnology, India (Grant number: BT/HRD/NBA/39/12/2018-19). We also acknowledge the National Supercomputing Mission (NSM) for providing computing resources of ‘PARAM Brahma’ at IISER Pune, which is implemented by C-DAC and supported by the Ministry of Electronics and Information Technology (MeitY) and Department of Science and Technology (DST), Government of India. R.P. acknowledges support from Sunita Sanghi Centre of Aging and Neurodegenerative Diseases, IIT Bombay.

Author information

Authors and Affiliations

Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Mumbai, 400076, India
Sangram Kadam, Kiran Kumari, Vinoth Manivannan & Ranjith Padinhateeri
Department of Physics, Indian Institute of Technology Bombay, Mumbai, 400076, India
Shuvadip Dutta & Mithun K. Mitra
Sunita Sanghi Centre of Aging and Neurodegenerative Diseases, Indian Institute of Technology Bombay, Mumbai, 400076, India
Ranjith Padinhateeri

Authors

Sangram Kadam
View author publications
You can also search for this author in PubMed Google Scholar
Kiran Kumari
View author publications
You can also search for this author in PubMed Google Scholar
Vinoth Manivannan
View author publications
You can also search for this author in PubMed Google Scholar
Shuvadip Dutta
View author publications
You can also search for this author in PubMed Google Scholar
Mithun K. Mitra
View author publications
You can also search for this author in PubMed Google Scholar
Ranjith Padinhateeri
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.P., S.K., and K.K. conceived the project. S.K., R.P., and K.K. designed the project with inputs from M.K.M. S.K. performed the research with guidance from R.P. and M.K.M. S.K. developed the codes, performed simulations and analyzed the data with input from all authors. V.M. developed codes and simulated the Model-II with inputs from S.K., S.D. and R.P. All authors contributed ideas and participated in the scientific discussion. S.K. and R.P. wrote the initial draft of the paper with inputs from M.K.M. All authors edited and prepared the final version of the paper.

Corresponding authors

Correspondence to Sangram Kadam or Ranjith Padinhateeri.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Dusan Racko and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kadam, S., Kumari, K., Manivannan, V. et al. Predicting scale-dependent chromatin polymer properties from systematic coarse-graining. Nat Commun 14, 4108 (2023). https://doi.org/10.1038/s41467-023-39907-2

Download citation

Received: 08 September 2022
Accepted: 30 June 2023
Published: 11 July 2023
DOI: https://doi.org/10.1038/s41467-023-39907-2

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.