Conformational heterogeneity in human interphase chromosome organization reconciles the FISH and Hi-C paradox

Shi, Guang; Thirumalai, D.

doi:10.1038/s41467-019-11897-0

Download PDF

Article
Open access
Published: 29 August 2019

Conformational heterogeneity in human interphase chromosome organization reconciles the FISH and Hi-C paradox

Nature Communications volume 10, Article number: 3894 (2019) Cite this article

3501 Accesses
25 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Hi-C experiments are used to infer the contact probabilities between loci separated by varying genome lengths. Contact probability should decrease as the spatial distance between two loci increases. However, studies comparing Hi-C and FISH data show that in some cases the distance between one pair of loci, with larger Hi-C readout, is paradoxically larger compared to another pair with a smaller value of the contact probability. Here, we show that the FISH-Hi-C paradox can be resolved using a theory based on a Generalized Rouse Model for Chromosomes (GRMC). The FISH-Hi-C paradox arises because the cell population is highly heterogeneous, which means that a given contact is present in only a fraction of cells. Insights from the GRMC is used to construct a theory, without any adjustable parameters, to extract the distribution of subpopulations from the FISH data, which quantitatively reproduces the Hi-C data. Our results show that heterogeneity is pervasive in genome organization at all length scales, reflecting large cell-to-cell variations.

A maximum-entropy model to predict 3D structural ensembles of chromatin from pairwise distances with applications to interphase chromosomes and structural variants

Article Open access 01 March 2023

Learning the distribution of single-cell chromosome conformations in bacteria reveals emergent order across genomic scales

Article Open access 30 March 2021

Order and stochasticity in the folding of individual Drosophila genomes

Article Open access 04 January 2021

Introduction

Through remarkable Hi-C experiments^1,2,3,4,5,6, based on the Chromosome Conformation Capture (3C) technique⁷, indirect glimpses of how the genome in a number of species is organized is starting to emerge. Because chromosome lengths are extremely large, ranging from tens of million base pairs in yeast to billion base pairs in human cells, they have to fold into highly compact structures in order to be accommodated in the cell nucleus. This requires that loci that are well separated along the one-dimensional genome sequence be close in three-dimensional (3D) space, which is made possible by forming a large number of loops. The high-throughput Hi-C technique and its variants are used to infer the probability of genome-wide contact formation between loci. In order to determine the contact probabilities between various loci in a genome, Hi-C experiments are performed in an ensemble of millions of cells. The readouts of the Hi-C experiment are contact frequencies between a large number of loci from instantaneous snapshots of each cell, which are then used to construct the contact maps (Hi-C maps). The contact map is a matrix (2D representation) in which the elements represent the probability of contact between two loci that are separated by a specified genomic distance. A high contact count between two loci means that they interact with each other more frequently compared with ones with low contact count.

A complementary and potentially a more direct way to determine genome organization is to measure spatial distances between loci using a low-throughput fluorescence in situ hybridization (FISH) technique^8,9. In addition to providing 3D distances in fixed cells, recently developed CRISPR–dCas9 FISH can be used to assay the dynamic behavior of loci in real time^10,11,12. However, due to the current limitation of the number of distinct color probes, this method provides distance distribution information for only a small number of loci.

FISH and Hi-C, which are entirely different experimental techniques, provide data on different aspects of genome organization. As noted in recent reviews^13,14, there are problems associated with each method. It is difficult to reconcile Hi-C and FISH data for the following reasons. In interpreting the Hi-C contact map, one makes the intuitive assumption that loci with high probability contact must also be spatially close. However, it has been demonstrated using Hi-C and FISH data on the same chromosome that high contact frequency does not always imply proximity in space^13,15,16,17. It should be noted that in most cases, the Hi-C and FISH measurements agree very well^8,9,18,19. However, from a purely theoretical perspective, even a single contradiction is intriguing if the experimental errors can be ruled out. An outcome of our theory is that the discordance between FISH and Hi-C data arises because of extensive heterogeneity, which is embodied by the presence of a variety of conformations adopted by chromosomes in each cell. There are a variety of reasons, including differing fixation conditions and presence of two or more subpopulation of cells in which the chromosomes are present in distinct conformations, which could give rise to the discordance between FISH and Hi-C data, as lucidly described recently^13,14. Contact between two loci could be a rare event, not present in all cells, which is captured in a Hi-C experiment by performing an ensemble average. We show using a precisely solvable model that due to the absence of a contact between two specific loci in a number of cells, those with higher contact frequency could be spatially farther on an average than two others with lower contact frequency. In contrast, the probability of contact formation using the FISH method can only be obtained if the tail (small distance) of the distance distribution between locus i and j can be accurately measured. For a variety of reasons, including the size of the probe and the signal strength, this not altogether straightforward using the FISH technique. Thus, in order to combine the data from the two powerful techniques, it is crucial to establish a theoretical basis with potential a practical link, between the contact probability and average spatial distance.

Setting aside the conditions under which FISH and Hi-C are performed (see recommendations for comparing the results from the two techniques with minimum bias which are described elsewhere¹³) insights into the discordance between the two methods, when they occur, can be obtained using polymer physics concepts. Recently, Fudenberg and Imakaev¹⁵ performed polymer simulations using a strong attractive energy between two labeled loci and a tenfold weaker interaction between two other loci that are separated by a similar genomic distance. In addition, they also reported simulations based on the loop extrusion model. Both these types of simulations showed there could be discordance between FISH and Hi-C, which we refer to as the FISH–Hi-C paradox. However, they did not provide any solution to the paradox, which is the principal goal of this work.

In addition, recent single-cell Hi-C^20,21,22 and FISH experiments^8,9,18,19 have revealed that there are substantial cell-to-cell variations on genome organization. However, how to utilize the data reported in these experiments to enhance our understanding of 3D genome structural heterogeneity has not been unexplored. One approach is to create an appropriate polymer model based on Hi-C and imaging data, which would readily allow us to probe the structural variability using simulations^23,24,25,26. Indeed, it has been shown, using Hi-C and FISH data as well as simulations²⁶, that if the conformation of the chromatin fiber is taken to be homogeneous then trends observed in the FISH data could not be predicted. However, using simulations and including two levels of chromatin organization (open and compact) qualitative trends observed in the FISH data could be recovered²⁶.

Here, we first establish a relationship between the contact probability and the mean spatial distance using an analytically solvable Generalized Rouse Chromosome Model (GRMC), which incorporates the presence of CTCF/cohein-mediated loops. The GRMC may be thought of as an ideal chromosome model, very much in the spirit of the Rouse model for polymers, in which conceptual issues such as the origin of the FISH–Hi-C paradox can be rigorously established. We first consider the solvable homogeneous limit, in which contacts are present in all the cells. In this case, precise numerical and analytical results show that there is a simple relation between the contact probability, P, and the ensemble mean 3D distance 〈R〉. However, the unavoidable heterogeneity in the cell populations in Hi-C experiments results in contacts between loci only in a fraction of cells. We first show that a direct consequence of the heterogeneity in both GRMC and chromosomes is that two loci (m and n) that have higher probability (P_mn) of being in contact relative to another two loci (k and l) does not imply a direct spatial correlation, a finding that has already been qualitatively established in previous studies^13,15. In other words, the average spatial distance between m and n (〈R_mn〉) could be larger than 〈R_kl〉, the distance between loci k and l, even if P_mn > P_kl. These results provide a basis for understanding the origin of the FISH–Hi-C paradox.

We develop a fully theoretical approach, which allows us to provide quantitative insights into the extent of heterogeneity in chromosome organization. From our theory, it follows that the resolution of the FISH–Hi-C paradox requires invoking the notion of heterogeneity, which implies multiple populations of chromosomes coexist. By using the concepts that emerge from the study of the GRMC, we demonstrate that the information of cell subpopulations can be extracted by fitting the experimental FISH data using our theory, thus allowing us to calculate the Hi-C contact probabilities from the theoretically calculated cumulative distribution function of spatial distance (CDF)—a quantity that can be measured using FISH and super-resolution imaging methods. Our approach provides a theoretically based method to combine the available FISH and Hi-C data to produce a more refined characterization of the heterogeneous chromosome organization than is possible by using data from just one of the techniques. In other words, sparse data from both the experimental methods can be simultaneously harnessed to predict the 3D organization of chromosomes.

Results

Relating contact probability to mean spatial distance

The exact relationship between P_mn (contact probability between mth and nth locus) and the corresponding mean spatial distance, 〈R_mn〉 for GRMC (see the Methods section for details of the derivation) is,

$$P_{mn} = {\mathrm{erf}}\left( {\frac{{2r_{\mathrm{c}}}}{{\sqrt \pi \langle R_{mn}\rangle }}} \right) - \frac{4}{\pi }\frac{{r_{\mathrm{c}}}}{{\langle R_{mn}\rangle }}e^{ - \frac{{4r_{\mathrm{c}}^2}}{{\pi \left\langle {R_{mn}} \right\rangle ^2}}} \equiv R_0(\langle R_{mn}\rangle ).$$

(1)

The inverse of R₀(〈R_mn〉), the solution to Eq. (1), gives the mean spatial distance 〈R_mn〉 as a function of the contact probability P_mn. Note that m and n are arbitrary locations of any two loci, and thus Eq. (1) is general for any pair of loci.

A couple of conclusions, relevant to the application to the chromosomes, follow from Eq. (1). (i) Note that Eq. (1) is an exact one-to-one relation between the mean distance 〈R_mn〉 and the contact probability P_mn provided r_c is known, and if the contacts are present in all the cells, which is not the case in experiments. For small P_mn, it is easy to show from Eq. (1) that $\langle R_{mn}\rangle \approx r_{\mathrm{c}}P_{mn}^{ - 1/3}$. For the ideal GRMC, this implies that for any m, n, k, l, if P_mn < P_kl then 〈R_mn〉 > 〈R_kl〉, a consequence anticipated on intuitive grounds. (ii) If the value of the contact probability P and the threshold distance r_c are known precisely, then the distribution of the spatial distance can be readily computed by solving Eq. (1) numerically. In Fig. 1b, we show the comparison between theory (Eq. (1)) and simulations (see the Methods section for details). The simulated curves are computed as follows: first collect (P_mn, 〈R_mn〉) for every pair labeled (m, n), where P_mn and 〈R_mn〉 are computed using Eqs. (17) and (18) in the Methods section. The total number of pairs is N(N−1)/2. We then bin the points over the values of P_mn. Finally, the mean value of 〈R_mn〉 for each bin, 〈R〉 = E[〈R_mn〉], is computed where E[⋯] is the binned average, which is computed using $(1/N_i)\mathop {\sum}\limits_{j = 1}^{N_i} {\left\langle {R_{mn}} \right\rangle ^j}$, where N_i is the number of points in the ith bin. The bin size, Δ, is centered at P_mn, spanning P_mn−Δ/2 ≤ P_mn ≤ P_mn + Δ/2. Using this procedure, we find (Fig. 1) that the theory and simulations are in perfect agreement, which validates the theoretical result.

Contact distance r _c affects the inferred value of the spatial distance

However, in practice, the elements P_mn are measured with (unknown) statistical errors, and the value of the contact threshold r_c is only estimated. In the Hi-C experiments, contact probabilities and r_c by implication, are determined by a series of steps that start with cross-linking spatially adjacent loci using formaldehyde, chopping the chromatin into fragments using restriction enzymes, ligating the fragments with biotin, followed by sequence matching using deep-sequencing methods⁵. Because of the inherent stochasticity associated with the overall Hi-C scheme, as well as the unavoidable heterogeneity (only a fraction of cells has a specific contact and the contact could be dynamic) in the cell population, the relationship P_mn and 〈R_mn〉 is not straightforward.

To illustrate how the uncertainty in r_c affects the determination of the spatial distance in GRMC even when the population is homogeneous (all cells have a specific contact), we plot the distributions of distance for r_c = 0.02, 0.03 μm in Fig. 1c. A small change in r_c (from 0.02 to 0.03 μm) completely alters the distance distribution P(R), and hence the mean spatial distance (from ≈0.2 to ≈0.3 μm). For the exactly solvable GRMC, this can be explained by noting that $\langle R_{mn}\rangle \approx r_{\mathrm{c}}P_{mn}^{ - 1/3}$ for small P_mn. Because P_mn appears in the denominator, any uncertainty in r_c is amplified by P_mn, especially when P_mn is small.

Heterogeneity causes paradox between FISH and Hi-C

The expectation that the contact probability should decrease as the mean distance between the loci increases, which is the case in the exactly solvable ideal GRMC (P_mn ≈r_c〈R_mn〉⁻³), is sometimes violated when the experimental data⁶ is analyzed^13,15. The paradox is a consequence of heterogeneity due to the existence of more than one population of cells, which implies that in some fraction of cells, contact between two loci exists while in others it is absent. Each distinct population has its own statistics. For instance, the probability distribution of the spatial distance between the mth and the nth loci, _Pi,mn(r), for one population of cells could be different from another population of cells P_j,mn(r), where i and j are the indices for the two different populations (Fig. 2a). The Hi-C experiments yield only an average value of the contact probability. Let us illustrate the consequence of the inevitable heterogeneous mixture of cell populations by considering the simplest case in which only two distinct populations, one with probability η and the other 1 − η, are present (a generalization is presented below). For instance, in one population of cells, there is a CTCF loop between m and n, and it is absent in the other population. The probability distribution of spatial distance between the mth and the nth loci is a superposition of distributions for each population. Using Eq. (10), the mixed distribution can be written as,

$$P(R_{mn} = r) = \sqrt {\frac{2}{\pi }} \left( {\eta \frac{{r^2}}{{\sigma _{1,mn}^3}}e^{ - \frac{{r^2}}{{2\sigma _{1,mn}^2}}} + (1 - \eta )\frac{{r^2}}{{\sigma _{2,mn}^3}}e^{ - \frac{{r^2}}{{2\sigma _{2,mn}^2}}}} \right),$$

(2)

where σ_1,mn and σ_2,mn are the parameters with different values characterizing the two populations. In the GRMC, σ_1,mn and σ_2,mn are related to the mean spatial distances in the two populations by $\langle R_{1,mn}\rangle = 2\sqrt {2/\pi } \sigma _{1,mn}$ and $\langle R_{2,mn}\rangle = 2\sqrt {2/\pi } \sigma _{2,mn}$. The mean spatial distance is, 〈R_mn〉 = η〈R_1,mn〉 + (1 − η)〈R_2,mn〉, and the contact probability is P_mn = ηP_1,mn + (1 − η)P_2,mn, where P_1,mn and P_2,mn are the contact probabilities for each population, given by Eq. (1), which depends on the values of 〈R_1,mn〉 and 〈R_2,mn〉 as well as r_c.

If the values of 〈R_1,mn〉 and 〈R_2,mn〉 are unknown (as is the case in Hi-C experiments), and only the value of the contact probability between the two loci is provided, one can not uniquely determine the values of the mean spatial distances. This is the origin of the Hi-C and FISH data paradox. In Fig. 2b–e, we show an example of the paradox for a particular set of parameters (η, σ_1,mn, σ_2,mn). Pair #1 has a larger contact probability than pair #2, while also exhibiting a larger mean spatial distance. The GRMC explains in simple terms the origin of the paradox.

To systematically explore the parameter space, we display 〈R_mn〉 and P_mn as heatmaps showing 〈R〉_1,mn versus 〈R〉_2,mn for different values of η (Fig. 3). When there is a single homogenous population (η = 0.0), the mean spatial distance 〈R_mn〉 and contact probability P_mn depend only on the value of 〈R_2,mn〉 (upper panel in Fig. 3). In this case, there is a precise one-to-one mapping between 〈R_mn〉 and P_mn. However, if η ≠ 0 (η = 0.3, lower panel in Fig. 3) then the relation between P_mn and 〈R_mn〉 is complicated. The contour lines for P_mn cross the contour lines of 〈R_mn〉, which implies that for a given value of P_mn, one cannot infer the value of 〈R_mn〉 without knowing the value of η, 〈R_1,mn〉, and 〈R_2,mn〉. For instance, the triangle and circle shown for η = 0.3 in Fig. 3 demonstrate an example of the paradox, in which $\langle R(\blacktriangledown )\rangle ( = 57a) > \langle R( \bullet )\rangle ( = 40a)$ whereas $P(\blacktriangledown )( \approx 7.7 \times 10^{ - 4}) > P( \bullet )( \approx 3.9 \times 10^{ - 4})$.

Extracting cell subpopulation information from FISH data

Can we extract the information about subpopulations from experimental data so that the result from two vastly different techniques can be reconciled? To answer this question, we first generalize our theory for the GRMC to real chromatins. The generalization of Eq. (2) is,

$$P(R_{mn} = r) = \eta P(r|\langle R_{1,mn}\rangle ) + (1 - \eta )P(r|\langle R_{2,mn}\rangle ),$$

(3)

where P(r|〈R_1,mn〉) and P(r|〈R_2,mn〉) are the Redner-des Cloizeaux distribution of distances for polymers^27,28 (Supplementary Note 1 and Supplementary Fig. 1). The distribution P(r|〈R_mn〉) is rigorously known for self-avoiding homopolymer in a good solvent, generalized Rouse model (Eq. (10) in the Methods section), and a semi-flexible polymer^29,30. However, a simple analytic expression for chromosomes is not known. By assuming that the Redner-des Cloizeaux form for P(r|〈R_mn〉) also holds for chromosomes (see Supplementary Eq. (1) for details), we find that g = 1 and δ = 5/4 in Supplementary Eq. (1). These parameters were previously extracted using the experimental data⁸, and the Chromosome Copolymer Model (CCM) for chromosomes²⁴. The value of g is inferred from the scaling relationship between mean spatial distance 〈R〉 and contact probability P, P ~〈R〉^3+g. The value of δ is computed as δ = 1/(1 − ν). ν is inferred from scaling 〈R(s)〉 ~s^ν, where s is the genomic distance.

The integral of Eq. (3) up to R, which is the cumulative distribution function CDF(R), can be used to fit the FISH data. Thus, the probability of contact formation can be computed as, ${\int_0^{r_{\mathrm{c}}}} P (r|\langle R\rangle ){\mathrm{d}}r$, where r_c is the contact threshold. Using the data in ref. ⁶, the CDF(R) for two pairs of loci are shown in Fig. 4a. By fitting the two experimentally measured curves to the theoretical prediction (see Supplementary Note 2), we obtain η ≈0.42 for peak4-loop and η ≈0.97 for peak3-control. The parameters obtained can then be used to compute the contact probability. Since the Hi-C experiments measure the number of contact events instead of contact probability and the value of r_c is unknown, we compare the relative contact frequency, which is computed as _Pi/〈P〉, where _Pi is the contact probability computed using the model or the contact number measured in Hi-C for the ith pair and 〈P〉 is the mean value for all the pairs considered. First, we fit all the eight CDF(R) curves in ref. ⁶. the excellent agreement between theory and experiments is vividly illustrated in Supplementary Fig. 2 and also manifested by the Kolmogorov–Smirnov statistics (Supplementary Note 5 and Supplementary Table 1). Second, we calculate their corresponding relative contact frequency (Fig. 4b). Comparison of the theoretical calculations with Hi-C measurements shows excellent agreement (Fig. 4b) with the Pearson correlation coefficient being 0.87. The contact probability is computed using r_c = 10 nm. Note that any value of r_c ≤ 10 nm gives similar results (Supplementary Fig. 3). The goodness of fits using different sets of g and δ is summarized in Supplementary Table 2. The set of g = 0 and δ = 2 gives equivalent good fits as the set of g = 1 and δ = 5/4. It is also important to note that fitting the FISH data with the assumption that cell population is homogeneous leads to unphysical values of g and δ and the Kolmogorov–Smirnov statistics are inferior (see Supplementary Note 4, Supplementary Fig. 5 and Supplementary Table 3).

Interestingly, the values of 〈R₁〉 obtained from fitting the four CTCF/cohesin-mediated loops (peak(1, 2, 3, 4)-loop) are all about 0.25−0.35 μm (R_{1,peak1–loop} ≈0.24 μm, R_{1,peak2–loop} ≈0.33 μm, R_{1,peak3–loop} ≈0.35 μm, R_{1,peak4–loop} ≈0.30 μm) regardless of their genomic separation (see Supplementary Table 1), suggesting that the mechanism of looping between CTCF motifs are similar with a mean spatial distance ≈0.3 μm. The physically reasonable value of 〈R_mn〉 ≈0.3 νm for all peak–loop pairs shows that these CTCF-mediated contacts describe molecular interactions between loci that are separated by a few hundred kilo base pairs. It has been shown that these contacts, referred to as peaks⁶ are significantly closer in space than others that are separated by similar genomic distance. The peak–loop contacts correspond to chromatin loops with the loci in the peaks being the anchor points between a specific loop. In sharp contrast, the distances between peaki and control (i goes from 1 to 4), which are greater than the distances between peak loci, vary ranging from ≈0.47 to ≈0.67 μm (see Supplementary Table 1). It is likely that these contacts are more dynamic because they are not be anchored by CTCF-binding proteins.

Massive heterogeneity in chromosome organization

In a recent study¹⁹, which combined Hi-C and high-throughput optical imaging to map contacts within single chromosomes in human fibroblasts, revealed massive heterogeneity. Such extensive existence of a large number of conformations, leading to multiple or nearly continuous distribution of subpopulations, was much greater than previously anticipated. Although, the results in Fig. 4 quantitatively reveal heterogeneity associated with CTCF loops by considering only two dominant subpopulations, the most recent experiment requires a generalization of the theory. In principle, our theory also applies to interactions of any nature, not only the CTCF loops. In doing so, it may be more reasonable to assume a continuous distribution of subpopulations, P(〈R〉), (see Supplementary Notes 6 and 7, and Supplementary Fig. 6 for generalization) instead of two discrete subpopulations, 〈R₁〉 and 〈R₂〉, which of course is much simpler and may suffice in many cases as the results in Fig. 4 illustrate. As a proof of concept of our theory, we solve P(〈R〉) for the eight pairs of contacts analyzed in the previous section. The results are shown in Supplementary Fig. 7. In all cases, P(〈R〉)s are found to be multimodal. For peak1/2/3/4-control and peak3-loop, P(〈R〉) yield peaks located at positions very close to 〈R₁〉 and 〈R₂〉 shown in Supplementary Table 1, justifying the effectiveness of the theory. To show that our theory has a broader range of applicability, we use the FISH data from the recent study¹⁹, which reports spatial distance measurements for 212 pairs of loci. P(〈R〉) is solved for each of a total of 212 pairs of loci. To illustrate our results, we compare in Fig. 5 the predicted CDF(r) and the experimentally measured CDF(r), as well as the P(〈R〉) obtained by fitting for six pairs of loci as examples in Fig. 5. The results show substantial variations in 〈R〉, manifested by the multiple peaks and wide spread variations in P(〈R〉). Remarkably, the calculated CDF(r) (without any adjustable parameters) and the measured CDF(r) are in excellent agreement for the six loci pairs, which were arbitrarily chosen for illustration purposes. The residual errors between the two, shown as insets in Fig. 5, are extremely small.

In Fig. 6a, we show the normalized distributions P(〈R〉/μ(〈R〉)) for each of the 212 pairs of loci (see Supplementary Fig. 8 for each pair as a separate figure). We expect that P(〈R〉/μ(〈R〉)) should be narrowly distributed around value 1 if there is only one population. However, many P(〈R〉/μ(〈R〉)) show multiple peaks with large variations. To further quantify the extent of heterogeneity, we calculate the coefficient of variation, CV = σ(〈R〉)/μ(〈R〉), where σ(〈R〉) and μ(〈R〉) are the standard deviation and the mean of 〈R〉, respectively. If there is only one population associated with 〈R〉, CV should have a value of around zero. Figure 6b shows the histogram of CV for all 212 pairs of loci. The CV values are widely distributed, suggesting that 3D structural heterogeneity is common and is associated with many pairs of loci rather than a few. Thus, the analyses of experimental data are not possible without taking heterogeneity into account. The theory presented here is sufficiently general and simple that it can be used to calculate the measurable quantities readily.

The role of loop extrusion in chromosome heterogeneity

What is the origin of heterogeneity in the individual cell populations? There are two possibilities. The first one is static heterogeneity: each subpopulation explores a distinct region of the genomic folding landscape (GFL) (Fig. 7a). The second is the dynamic heterogeneity. Each cell explores a local minimum of the GFL before transiting to another local minimum (Fig. 7b). The only assumption in the application of our theory to genome organization is that there must be more than one population of cells, which does not violate the observation that the Hi-C experiment report only the average contact probability over millions of cells. Dynamic looping would be an example of the dynamic heterogeneity where the CTCF/cohein-mediated loops are formed and broken dynamically on a fast time scale compared with the lifetime of a cell. Such a picture is supported by recent single-cell molecule experiments^31,32. The average residence time of CTCF/cohesin complex is shown to be in the range of a few to tens of minutes, which is much smaller compared with the time scale of the cell cycle (15–30 h). Loop extrusion model^33,34,35 is another possible origin of dynamic heterogeneity. In the loop extrusion model, it is thought that cohesins extrude loops along the chromosome fiber, which could detach stochastically. At any given time, there would be many subpopulations, each characterized by a distinct set of loops in the chromosome. Indeed, our analyses of the most recent high-throughput optical imaging data lend credence to the notion that multiple subpopulations in chromosomes arise because of massive dynamic heterogeneity. Our theory also gives an indirect theoretical justification for the work in ref. ¹⁵, in which the authors found the loop extrusion model could lead to the [P_mn, 〈R_mn〉] paradox.

Single-cell temporal information is necessary to determine whether the loops are static or dynamic or a combination of the two (Fig. 7c). Hence, the combination of the dynamic FISH technique such as CRISPR–dCas9 FISH and single-cell Hi-C would be crucial for us to fully understand the organization of genomes. Our theory provides a theoretically rigorous method based on polymer physics to connect the results from measurements using the two vastly different techniques.

Discussion

From polymer physics for single chains it follows that in a homogeneous system, the contact probability and mean 3D distances are linked, resulting in a power-law relation connecting the two quantities that can be measured using Hi-C and FISH techniques. However, the one-to-one mapping does not hold in Hi-C experiments because of the presence of a mixture of distinct cell subpopulations each characterized by its own statistics leads to heterogeneity, which in turn gives rise to the [P_mn, 〈R_mn〉] paradox. We show that the theory based on precisely solvable GRMC could be used to solve the paradox in practice. The theory can be readily used to analyze data from experiments, provided the FISH and Hi-C experiments are done under similar conditions⁶. The central result of the theory in Eq. (3) can be used to analyze the available sparse FISH data. We show that the fraction of cell subpopulations (η in Eq. (3)) and the generalization derived in Supplementary Note 6 can be extracted by fitting the FISH data using our theory. From Eq. (3), we calculate the Hi-C contact probabilities, thus establishing that the theory resolves the [P_mn, 〈R_mn〉] paradox.

In this work, we confine ourselves to two-point interactions, which allows us to consider one pair of loci at a time. However, recent experiments probing multipoint interactions have suggested that formations of loops are likely to be cooperative^9,36, such that the formation of one loop could facilitate the formation of a nearby loop. Such cooperative loop formation was previously shown in an entirely different context involving the folding of proteins directed by disulfide bond formation³⁷. It can be shown within our framework that the formation of one loop can certainly increase the probability of formation of another loop. The theoretical basis for this finding is given in the Supplementary Note 8.

The reconciliation of the FISH and Hi-C data using polymer physics concepts is the first key step in integrating the data from these experimental techniques to construct the 3D structures of chromosomes. The work described here provides a theoretical basis for accomplishing this important task. Finally, our results suggest that heterogeneity in contact formation is an intrinsic property of genome organization, and hence the acquisition of single-cell experimental data is crucial to further our understanding of both the dynamics and the heterogeneous structural organization of chromosomes.

Methods

Generalized rouse model for chromosome

In order to derive an approximate relationship connecting contact probabilities between loci and the three-dimensional distances, we use a variant of the random loop model^38,39. We first consider a minimal cross-linked phantom chain model, which incorporates the presence of CTCF/cohein-mediated loops⁶. The model, originally introduced for describing physical gels³⁸, and more recently used for chromosome dynamics in a number of insightful studies^23,39, could be viewed as a Generalized Rouse Chromosome Model (GRMC)^40,41. The cross-links modeling the CTCF/cohein-mediated loops here are not random. Their locations are predetermined by the Hi-C data⁶.

The equation of motion for the GRMC is⁴²

$$\xi \frac{{{\mathrm{d}}{\mathbf{R}}}}{{{\mathrm{d}}t}} = {\mathbf{AR}} + {\mathbf{F}},$$

(4)

where ξ is the friction coefficient, R = [r₁, r₂, …, r_N]^T with r_i being the position of the ith locus. The vector F = [f₁, f₂, …, f_N]^T (T is the transpose), where f_i is the Gaussian random force acting on the ith locus, characterized by 〈f_n(t)〉 = 0 and 〈f_nα(t)f_mβ(t′)〉 = 2ξk_BTδ_nmδ_αβδ(t − t′); A is the N × N connectivity matrix, embedding the information of chain connectivity and the location of the loops connecting two loci (Fig. 1a)

$$A_{mn} = \left\{ {\begin{array}{*{20}{l}} { - 2\kappa - |\Sigma _m|\omega ,} \hfill & {{\mathrm{if}}\,m = n \ne 1\;{\mathrm{or}}\;N} \hfill \\ { - \kappa - |\Sigma _m|\omega ,} \hfill & {{\mathrm{if}}\,m = n = 1{\mathrm{ or }}N} \hfill \\ {\kappa ,} \hfill & {{\mathrm{if}}\,|m - n| = 1} \hfill \\ {\omega ,} \hfill & {{\mathrm{if}}\,|m - n| > 1,{\mathrm{and}}\,{\mathrm{connected}}\,{\mathrm{in}}\,\Sigma } \hfill \\ {0,} \hfill & {{\mathrm{if}}\,{\mathrm{otherwise}}} \hfill \end{array}} \right.$$

(5)

where Σ is the set of indices representing the loci pairs specifying the CTCF facilitated loop anchors, and |Σ_m| is the number of loops connected to the mth locus. The spring constant κ enforces chain connectivity, and ω is the associated spring constant for a CTCF pair. Note that the GRMC model does not account for excluded volume interactions, which in the modeling of chromatin is often justified by noting that topoisomerases enable chain crossing. Our purpose is to use GRMC to first illustrate concretely the challenges in going from the measured average contact map to spatial organization, precisely. More importantly, using the insights from the study of the GRMC, we provide a solution to the FISH–Hi-C paradox.

Since A in Eq. (5) is a real symmetric matrix, it can be diagonalized using the orthonormal matrix V

$${\mathbf{VAV}}^T = {\mathbf{\Lambda}} = {\mathrm{diag}}(\lambda _0,\lambda _1,...,\lambda _{N - 1}),$$

(6)

where λ₀, λ₁, …, λ_N−1 are the eigenvalues of A. By defining X = VR and using R = V^TX and VV^T = I, we obtain the equations of motion of the normal coordinates X

$$\xi \frac{{\mathrm{d}}{\mathbf{X}}}{\mathrm{d}}t = {\mathbf{\Lambda}}{\mathbf{X}} + {\mathbf{f}}.$$

(7)

Because Λ is a diagonal matrix, the normal coordinates of the GRMC X_p are decoupled. Using the normal modes, X, the physical quantities associated with the polymer can be readily calculated. Therefore, for GRMC with a predetermined set of CTCF/cohein-mediated loops, we can solve for the eigenvalues of the connectivity matrix A, and the orthonormal matrix V numerically, and thus calculate the contact probability and spatial distance precisely.

Relation between contact probability and mean spatial distance

The vector between the positions of the mth and the nth loci may be written as

$${\mathbf{R}}_m - {\mathbf{R}}_n = \mathop {\sum}\limits_{p = 0}^{N - 1} {(V_{pm} - V_{pn})} {\mathbf{X}}_p,$$

(8)

where V_pm and V_pn are the elements of orthonormal matrix V. The equilibrium solution of Eq. (7) yields, $\mathop {{\lim }}\nolimits_{t \to \infty } X_{p,\alpha }(t)\sim {\cal{N}}(0, - \frac{{k_{\mathrm{B}}T}}{{\lambda _p}})$, where α = x, y, z, ${\cal{N}}$ is Gaussian distribution. Therefore

$$\mathop {{\lim }}\limits_{t \to \infty } R_{mn,\alpha }(t)\sim {\cal{N}}(0, - \mathop {\sum}\limits_{p = 0}^{N - 1} {(V_{pm} - V_{pn})^2} \frac{{k_{\mathrm{B}}T}}{{\lambda _p}}) \equiv {\cal{N}}(0,\sigma _{mn,\alpha }^2).$$

(9)

where $\sigma _{mn,\alpha } = - \mathop {\sum}\limits_{p = 0}^{N - 1} {(V_{pm} - V_{pn})^2} (k_{\mathrm{B}}T/\lambda _p)$. Since the model is isotropic, it follows that $\sigma _{mn,x}^2 = \sigma _{mn,y}^2 = \sigma _{mn,z}^2 \equiv \sigma _{mn}^2$. The mean distance 〈R_mn〉 is related to σ_mn through $\langle R_{mn}\rangle = 2\sqrt {2/\pi } \sigma _{mn}$. The distribution of the distance between the mth and the nth loci, $\mathop {{\lim }}\nolimits_{t \to \infty } |{\mathbf{R}}_{mn}(t)| = \mathop {{\lim }}\nolimits_{t \to \infty } \sqrt {\mathop {\sum}\limits_\alpha {R_{mn,\alpha }^2} (t)}$ is a non-central chi distribution (we will neglect the notation $\mathop {{\lim }}\nolimits_{t \to \infty }$ from now on)

$$P(R_{mn} = r) = \sqrt {\frac{2}{\pi }} \frac{1}{{\sigma _{mn}}}e^{ - r^2/(2\sigma _{mn}^2)}\frac{{r^2}}{{\sigma _{mn}^2}}.$$

(10)

The contact probability P_mn, for a given threshold r_c (contact exists if r ≤ r_c), computed using Eq. (10) yields

$$\begin{array}{c}P_{mn} = {\int_0^{r_{\mathrm{c}}}} {\mathrm{d}} r\sqrt {\frac{2}{\pi }} \frac{1}{{\sigma _{mn}}}e^{ - r^2/(2\sigma _{mn}^2)}\frac{{r^2}}{{\sigma _{mn}^2}}\\ = {\mathrm{Erf}}\left( {\frac{{r_{\mathrm{c}}}}{{\sqrt 2 \sigma _{mn}}}} \right) - \sqrt {\frac{2}{\pi }} e^{ - \frac{{r_{\mathrm{c}}^2}}{{2\sigma _{mn}^2}}}\frac{{r_{\mathrm{c}}}}{{\sigma _{mn}}}.\end{array}$$

(11)

The mean spatial distance 〈R_mn〉 is given by

$$\langle R_{mn}\rangle = {\int_0^\infty} {\mathrm{d}} rr\sqrt {\frac{2}{\pi }} \frac{1}{{\sigma _{mn}}}e^{ - r^2/(2\sigma _{mn}^2)}\frac{{r^2}}{{\sigma _{mn}^2}} = 2\sqrt {\frac{2}{\pi }} \sigma _{mn}.$$

(12)

Using Eqs. (11) and (12), the desired relation between P_mn and 〈R_mn〉 becomes

$$P_{mn} = {\mathrm{erf}}\left( {\frac{{2r_{\mathrm{c}}}}{{\sqrt \pi \langle R_{mn}\rangle }}} \right) - \frac{4}{\pi }\frac{{r_{\mathrm{c}}}}{{\langle R_{mn}\rangle }}e^{ - \frac{{4r_{\mathrm{c}}^2}}{{\pi \left\langle {R_{mn}} \right\rangle ^2}}} \equiv R_0(\langle R_{mn}\rangle ).$$

(13)

Eq. (13) is identical with Eq. (1) in the main text.

Simulations

The energy function for the GRMC is

$$U({\mathbf{r}}_1,...,{\mathbf{r}}_N) = \mathop {\sum}\limits_{i = 1}^{N - 1} {U_i^{\mathrm{S}}} + \mathop {\sum}\limits_{\{ p,q\} } {U_{\{ p,q\} }^{\mathrm{L}}} .$$

(14)

For the bonded stretch potential, $U_i^{\mathrm{S}}$, we use

$$U_i^{\mathrm{S}} = \frac{\kappa }{2}(|{\mathbf{r}}_{i + 1} - {\mathbf{r}}_i| - a)^2,$$

(15)

where a is the equilibrium bond length. The interaction between the loop anchors is also modeled using a harmonic potential

$$U_{\{ p,q\} }^{\mathrm{L}} = \frac{\omega }{2}(|{\mathbf{r}}_p - {\mathbf{r}}_q| - a)^2,$$

(16)

where the spring constant is associated with the CTCF facilitated loops, and {p, q} represents the indices of the loop anchors, which are taken from the Hi-C data⁶ (Supplementary Note 3). We simulate the chromosome segment from 146 to 158 Mbps of Chromosome 5. Each monomer represents 1200 bps, resulting the total number of coarse-grained loci N = 10,000.

In order to accelerate conformational sampling, we perform Langevin Dynamics simulations at low friction⁴³. We simulate each trajectory for 10⁸ time steps, and save the snapshots every 10,000 time steps. We generate ten independent trajectories, which are sufficient to obtain reliable statistics (Supplementary Fig. 4).

Data analyses

The contact probability between the mth and nth loci in the simulation is calculated using

$$P_{mn} = \frac{1}{{TM}}\mathop {\sum}\limits_{a = 1}^M {\mathop {\sum}\limits_{t = 1}^T \Theta } (r_{\mathrm{c}} - |{\mathbf{r}}_m^{(a)}(t) - {\mathbf{r}}_n^{(a)}(t)|),$$

(17)

where Θ(⋅) is the Heaviside step function, r_c is the threshold distance for determining the contacts, the summation is over the snapshots along the trajectory, and the total M number of independent trajectories, and T is the number of snapshots for a single trajectory. The mean spatial distance between the ith and the jth loci in the simulation is calculated using

$$\langle R_{mn}\rangle = \frac{1}{{TM}}\mathop {\sum}\limits_{a = 1}^M {\mathop {\sum}\limits_{t = 1}^T {|{\mathbf{r}}_m^{(a)}(t) - {\mathbf{r}}_n^{(a)}(t)|} } .$$

(18)

The objective is to go from P_mn to 〈R_mn〉, and to determine, if in doing so, we get reasonable results. Because these quantities can be computed precisely in the GRMC, the [P_mn, 〈R_mn〉] relationship can be tested, which allows us to obtain the needed cues to solve the FISH–Hi-C paradox.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All relevant data supporting the findings of this study are available within the article and its Supplementary Information files or upon requests from the corresponding author. The Hi-C and FISH experimental data used in this study are publicly available from GEO database under accession number GSE63525 and from 4DN portal at https://data.4dnucleome.org/publications/80007b23-7748-4492-9e49-c38400acbe60/. The processed data are available upon request from the authors.

Code availability

The polymer simulations are performed using LAMMPS Molecular Dynamics Simulation software⁴⁴, which is an open-source code available at http://lammps.sandia.gov. The codes used to analyze data in the present study are deposited to Github repository https://github.com/anyuzx/chromosome-heterogeneity-analysis.

References

Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
Article ADS CAS Google Scholar
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Article ADS CAS Google Scholar
Sexton, T. et al. Three-dimensional folding and functional organization principles of the drosophila genome. Cell 148, 458–472 (2012).
Article CAS Google Scholar
Jin, F. et al. A high-resolution map of the three-dimensional chromatin interactome in human cells. Nature 503, 290–294 (2013).
Article ADS CAS Google Scholar
Dekker, J., Marti-Renom, M. A. & Mirny, L. A. Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data. Nat. Rev. Genet. 14, 390–403 (2013).
Article CAS Google Scholar
Rao, S. S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
Article CAS Google Scholar
Dekker, J., Rippe, K., Dekker, M. & Kleckner, N. Capturing chromosome conformation. Science 295, 1306–1311 (2002).
Article ADS CAS Google Scholar
Wang, S. et al. Spatial organization of chromatin domains and compartments in single chromosomes. Science 353, 598–602 (2016).
Article ADS CAS Google Scholar
Bintu, B. et al. Super-resolution chromatin tracing reveals domains and cooperative interactions in single cells. Science 362, eaau1783 (2018).
Article ADS Google Scholar
Chen, B. et al. Dynamic imaging of genomic loci in living human cells by an optimized crispr/cas system. Cell 155, 1479–1491 (2013).
Article CAS Google Scholar
Ma, H. et al. Multicolor CRISPR labeling of chromosomal loci in human cells. Proc. Natl Acad. Sci. USA 112, 3002–3007 (2015).
Article ADS CAS Google Scholar
Ma, H. et al. Multiplexed labeling of genomic loci with dCas9 and engineered sgRNAs using CRISPRainbow. Nat. Biotechnol. 34, 528–530 (2016).
Article CAS Google Scholar
Giorgetti, L. & Heard, E. Closing the loop: 3C versus DNA FISH. Genome Biol. 17, 215 (2016).
Article Google Scholar
Fraser, J., Williamson, I., Bickmore, W. A. & Dostie, J. An overview of genome organization and how we got there: from FISH to Hi-C. Microbiol. Mol. Biol. R. 79, 347–372 (2015).
Article Google Scholar
Fudenberg, G. & Imakaev, M. FISH-ing for captured contacts: towards reconciling FISH and 3C. Nat. Methods 14, 673–678 (2017).
Article CAS Google Scholar
Bickmore, W. A. & van Steensel, B. Genome architecture: domain organization of interphase chromosomes. Cell 152, 1270–1284 (2013).
Article CAS Google Scholar
Williamson, I. et al. Spatial genome organization: contrasting views from chromosome conformation capture and fluorescence in situ hybridization. Gene. Dev. 28, 2778–2791 (2014).
Article Google Scholar
Szabo, Q. et al. TADs are 3D structural units of higher-order chromosome organization in Drosophila. Sci. Adv. 4, eaar8082 (2018).
Article Google Scholar
Finn, E. H. et al. Extensive heterogeneity and intrinsic variation in spatial genome organization. Cell 176, 1502–1515 (2019).
Article CAS Google Scholar
Stevens, T. J. et al. 3D structures of individual mammalian genomes studied by single-cell Hi-C. Nature 544, 59–64 (2017).
Article ADS CAS Google Scholar
Flyamer, I. M. et al. Single-nucleus hi-c reveals unique chromatin reorganization at oocyte-to-zygote transition. Nature 544, 110–114 (2017).
Article ADS CAS Google Scholar
Tan, L., Xing, D., Chang, C.-H., Li, H. & Xie, X. S. Three-dimensional genome structures of single diploid human cells. Science 361, 924–928 (2018).
Article ADS CAS Google Scholar
Shukron, O. & Holcman, D. Transient chromatin properties revealed by polymer models and stochastic simulations constructed from Chromosomal Capture data. PLoS Comput. Biol. 13, e1005469 (2017).
Article ADS Google Scholar
Shi, G., Liu, L., Hyeon, C. & Thirumalai, D. Interphase human chromosome exhibits out of equilibrium glassy dynamics. Nat. Commun. 9, 3161 (2018).
Article ADS Google Scholar
Liu, L., Shi, G., Thirumalai, D. & Hyeon, C. Chain organization of human interphase chromosome determines the spatiotemporal dynamics of chromatin loci. PLoS Comput. Biol. 14, e1006617 (2018).
Article ADS Google Scholar
Buckle, A., Brackley, C. A., Boyle, S., Marenduzzo, D. & Gilbert, N. Polymer simulations of heteromorphic chromatin predict the 3d folding of complex genomic loci. Mol. Cell 72, 786–797.e11 (2018).
Article CAS Google Scholar
Jannink, G. & Des Cloizeaux, J. Polymers in solution. J. Phys. Condens. Mat. 2, 1–24 (1990).
Article ADS CAS Google Scholar
Redner, S. Distribution functions in the interior of polymer chains. J. Phys. A 13, 3525–3541 (1980).
Article ADS CAS Google Scholar
Hyeon, C. & Thirumalai, D. Kinetics of interior loop formation in semiflexible chains. J. Chem. Phys. 124, 104905 (2006).
Article ADS Google Scholar
Wilhelm, J. & Frey, E. Radial distribution function of semiflexible polymers. Phys. Rev. Lett. 77, 2581–2584 (1996).
Article ADS CAS Google Scholar
Hansen, A. S., Pustova, I., Cattoglio, C., Tjian, R. & Darzacq, X. CTCF and cohesin regulate chromatin loop stability with distinct dynamics. eLife 6, e25776 (2017).
Article Google Scholar
Hansen, A. S., Cattoglio, C., Darzacq, X. & Tjian, R. Recent evidence that TADs and chromatin loops are dynamic structures. Nucleus 9, 20–32 (2017).
Article Google Scholar
Alipour, E. & Marko, J. F. Self-organization of domain structures by DNA-loop-extruding enzymes. Nucleic Acids Res. 40, 11202–11212 (2012).
Article CAS Google Scholar
Sanborn, A. L. et al. Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc. Natl Acad. Sci. USA 112, E6456–E6465 (2015).
Article CAS Google Scholar
Fudenberg, G. et al. Formation of chromosomal domains by loop extrusion. Cell Rep. 15, 2038–2049 (2016).
Article CAS Google Scholar
Quinodoz, S. A. et al. Higher-order inter-chromosomal hubs shape 3D genome organization in the nucleus. Cell 174, 744–757 (2018).
Article CAS Google Scholar
Camacho, C. J. & Thirumalai, D. Theoretical predictions of folding pathways using the proximity rule with applications to BPTI. Proc. Natl Acad. Sci. USA 92, 1277–1281 (1995).
Article ADS CAS Google Scholar
Bryngelson, J. & Thirumalai, D. Internal constraints induce localization in an isolated polymer molecule. Phys. Rev. Lett. 76, 542–545 (1996).
Article ADS Google Scholar
Shukron, O. & Holcman, D. Statistics of randomly cross-linked polymer models to interpret chromatin conformation capture data. Phys. Rev. E 96, 012503 (2017).
Article ADS CAS Google Scholar
Hyeon, C., Morrison, G. & Thirumalai, D. Force-dependent hopping rates of RNA hairpins can be estimated from accurate measurement of the folding landscapes. Proc. Natl Acad. Sci. USA 105, 9604–9609 (2008).
Article ADS CAS Google Scholar
O’Brien, E. P., Morrison, G., Brooks, B. R. & Thirumalai, D. How accurate are polymer models in the analysis of Förster resonance energy transfer experiments on proteins? J. Chem. Phys. 130, 124903 (2009).
Article ADS Google Scholar
Doi, M. & Edwards, S. F. The Theory of Polymer Dynamics, Vol. 73 (Oxford University Press, 1988).
Honeycutt, J. & Thirumalai, D. The nature of folded states of globular proteins. Biopolymers 32, 695–709 (1992).
Article CAS Google Scholar
Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117, 1–19 (1995).
Article ADS CAS Google Scholar

Download references

Acknowledgements

We are grateful to the National Science Foundation (CHE 19-00093) and the Collie-Welch Regents Chair (F-0019) for supporting this work.

Author information

Authors and Affiliations

Biophysics Program, Institute for Physical Science and Technology, University of Maryland, College Park, MD, 20742, USA
Guang Shi
Department of Chemistry, University of Texas at Austin, Austin, TX, 78712, USA
D. Thirumalai

Authors

Guang Shi
View author publications
You can also search for this author in PubMed Google Scholar
D. Thirumalai
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

G.S. and D.T. designed and performed the research, G.S. and D.T. analyzed the data, G.S. and D.T. wrote the paper.

Corresponding author

Correspondence to D. Thirumalai.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information: Nature Communications thanks Bogdan Bintu and other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shi, G., Thirumalai, D. Conformational heterogeneity in human interphase chromosome organization reconciles the FISH and Hi-C paradox. Nat Commun 10, 3894 (2019). https://doi.org/10.1038/s41467-019-11897-0

Download citation

Received: 01 October 2018
Accepted: 06 August 2019
Published: 29 August 2019
DOI: https://doi.org/10.1038/s41467-019-11897-0

This article is cited by

A maximum-entropy model to predict 3D structural ensembles of chromatin from pairwise distances with applications to interphase chromosomes and structural variants
- Guang Shi
- D. Thirumalai
Nature Communications (2023)
Heterogeneous fluid-like movements of chromatin and their implications to transcription
- S. S. Ashwin
- Kazuhiro Maeshima
- Masaki Sasai
Biophysical Reviews (2020)
CTCF modulates allele-specific sub-TAD organization and imprinted gene activity at the mouse Dlk1-Dio3 and Igf2-H19 domains
- David Llères
- Benoît Moindrot
- Daan Noordermeer
Genome Biology (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.