Precise measurements of chromatin diffusion dynamics by modeling using Gaussian processes

Oliveira, Guilherme M.; Oravecz, Attila; Kobi, Dominique; Maroquenne, Manon; Bystricky, Kerstin; Sexton, Tom; Molina, Nacho

doi:10.1038/s41467-021-26466-7

Published: 26 October 2021

Nature Communications volume 12, Article number: 6184 (2021)

Abstract

The spatiotemporal organization of chromatin influences many nuclear processes, from chromosome segregation to transcriptional regulation. To gain a deeper understanding of these processes, it is essential to go beyond static views of chromosome structure and accurately characterize the diffusion properties of chromatin. We present GP-FBM, a computational framework based on Gaussian processes and fractional Brownian motion that extracts diffusion properties from stochastic trajectories of labeled chromatin loci. GP-FBM uses the higher-order temporal correlations present in the data and therefore outperforms existing methods. Furthermore, GP-FBM can interpolate incomplete trajectories and account for substrate movement when two or more particles are present. Using our method, we show that average chromatin diffusion properties are surprisingly similar in interphase and mitosis in mouse embryonic stem cells. We also observe surprising heterogeneity in local chromatin dynamics, which correlates with potential regulatory activity. Finally, we present GP-Tool, a user-friendly graphical interface that facilitates use of GP-FBM by the research community.

Introduction

The spatiotemporal organization of chromatin plays a crucial role in several nuclear processes: from cell division, where chromatin is compacted to facilitate chromosome segregation during mitosis, to gene regulation, where precise control of transcription correlates with specific long-range chromatin contacts¹. Chromosome conformation capture techniques and imaging approaches have revealed fundamental structural features of chromatin at different resolutions. Of special interest are topological associated domains (TADs) which are characterized by an increased frequency of interactions between genomic loci within the same domain with reduced interactions across domains^2,3. Remarkably, it has been shown that TAD organization can influence regulation of transcription⁴ and that TADs are dismantled during mitosis where chromatin density dramatically increases^5,6. However, much less is known about the diffusion properties of chromatin and how they depend on the genomic context. For instance, it is not clear whether or how transcriptional activation affects chromatin mobility, with previous studies giving seemingly conflicting results^7,8. Insights into the dynamic properties of chromatin motion are required to understand how gene regulatory elements communicate within the nuclear space⁹.

The simplest model to describe the diffusion of microscopic systems is Brownian motion, whereby movements are caused by random collisions of small particles within the system^10,11. However, as a bulky polymer interacting with itself and the nuclear environment^12,13,14, chromatin displays sub-diffusive behavior, more constrained than classical Brownian motion for short periods of time^15,16,17. Therefore, the mean squared displacement (MSD) of chromatin is expected to follow this relationship with time: MSD ∝ D_αt^α. Two parameters thus describe the diffusion properties of chromatin: the apparent diffusion coefficient D_α, indicating the “speed” of motion, and the anomalous coefficient α, which for sub-diffusive behavior is <1 indicating greater constraint of movement. The traditional method used to estimate the diffusion parameters is based on calculating the MSD over time from measured trajectories and fitting the above theoretical expression to the data. More sophisticated methods based on particle displacement use higher-order moments¹⁸ or probability density functions^19,20,21 (henceforth referred to as displacement distribution-based methods or DDB) to obtain more accurate estimations of the diffusion parameters. However, these methods do not use all the information contained in the trajectories as higher-order temporal correlations are discarded. Furthermore, errors due to measurement noise cannot easily be included into the analysis, and it is not possible to recover missing data points due to misdetection or occlusions.
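As a point of comparison, the traditional MSD approach described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the benchmark implementation used in this study: the function names are hypothetical, the MSD is time-averaged over a single trajectory, and the prefactor convention 〈r²〉 = 2nD_αt^α is taken from the kernel definition given in Results.

```python
import numpy as np

def time_averaged_msd(track, max_lag):
    """Time-averaged MSD of one trajectory (frames x dimensions) for lags 1..max_lag."""
    track = np.asarray(track, dtype=float)
    lags = np.arange(1, max_lag + 1)
    msd = np.array([
        np.mean(np.sum((track[lag:] - track[:-lag]) ** 2, axis=1))
        for lag in lags
    ])
    return lags, msd

def fit_power_law(lags, msd, dt, n_dims):
    """Least-squares fit of log(MSD) = log(2 n D_alpha) + alpha * log(t).

    Assumes the convention MSD = 2 n D_alpha t^alpha; with another prefactor
    only the returned D_alpha changes, not alpha.
    """
    t = np.asarray(lags, dtype=float) * dt
    alpha, intercept = np.polyfit(np.log(t), np.log(msd), 1)
    d_alpha = np.exp(intercept) / (2.0 * n_dims)
    return d_alpha, alpha
```

Because each lag is reduced to a single averaged number before fitting, correlations between successive displacements never enter the estimate, which is the limitation the text points out.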

We propose GP-FBM, a computational method based on Gaussian Processes (GP)^22,23 and fractional Brownian motion (FBM)^24,25,26, which improves and extends the concepts presented in^27,28. Importantly, GP provides a consistent probabilistic framework that considers entire trajectories and thus utilizes all the available information. Trials on simulated data demonstrate a greater precision of GP-FBM in measuring diffusion parameters over MSD and DDB. Furthermore, as it is applied directly on trajectories, GP-FBM naturally takes into account localization errors and occlusions without the need to establish a fitted MSD curve or displacement distributions. We further extend this model to account for external sources of movement (e.g. displacement of whole nuclei or chromosomes) using underlying correlations between multiple trajectories, without the need to further develop substrate motion models and experiments for calibration²⁹. Finally, we applied GP-FBM to two experimental systems to study chromatin diffusion properties in different contexts. First, we characterized chromatin dynamics in interphase and mitosis using tagged arrays inserted at random genomic locations in mouse embryonic stem (ES) cells. Although chromatin density increases by a factor of three during mitosis³⁰, our results surprisingly indicate that there are no significant differences on average in the apparent diffusion or anomalous coefficients. Second, to compare the diffusion properties of different specific genomic regions, we performed double-labeling and live tracking experiments around the HoxA locus in mouse ES cells before and after induction of the genes with retinoic acid. We discover that, instead of having homogeneous diffusion properties across euchromatin, genomic loci significantly differ in both their apparent diffusion and anomalous coefficients. In some cases, altered chromatin diffusion properties correlate with underlying functions such as gene regulation or CTCF binding. 
The methods we have developed are integrated into a user-friendly package, GP-Tool, for use in the scientific community. Chromatin mobility has been overlooked in previous studies of genome functions, and we anticipate that GP-FBM will facilitate research in that area.

Results

Modeling diffusion dynamics with GP-FBM

Traditional methods to analyze particle diffusion dynamics rely on particle displacements calculated between two frames at different time intervals; hence, information on how precisely the particle moves between the two points is not considered. This has important drawbacks: higher-order temporal correlations within the trajectories are discarded; errors due to frame-dependent measurement noise cannot be easily included in the analysis; and missing data points due to misdetection or occlusions are ignored and cannot be recovered by inference. To address these problems, we built a consistent probabilistic framework based on Gaussian Processes (GP)^22,23. Briefly, a GP is defined as a collection of random variables such that every finite subset of them follows a multivariate normal distribution, which is fully determined by its mean and kernel functions μ(t) and $\Sigma(t,t')$. We assume that a stochastic diffusion trajectory x(t) of a given chromatin locus can be modeled as a Gaussian process with the following fractional Brownian kernel^24,25,26:

$$\Sigma_{D_\alpha,\alpha}(t,t') = D_\alpha\left(|t|^{\alpha} + |t'|^{\alpha} - |t-t'|^{\alpha}\right),$$

(1)

where D_α is the apparent diffusion coefficient and α is the anomalous coefficient defined in the range 0 < α < 2. This kernel produces a generalized Brownian motion with a mean squared displacement 〈r²〉 = 2nD_αt^α, where n corresponds to the number of degrees of freedom. Notice that the traditional Brownian dynamics is recovered with α = 1. Then the probability of observing a discrete trajectory r = {r_i} measured at a set of times t = {t_i} is given by the multivariate Gaussian distribution,

$$\mathcal{N}(\boldsymbol{r} \mid D_\alpha, \alpha, \boldsymbol{\mu}) \propto \exp\left[-\frac{1}{2}(\boldsymbol{r}-\boldsymbol{\mu})^{T}\,\Sigma^{-1}(\boldsymbol{r}-\boldsymbol{\mu})\right],$$

(2)

where the covariance matrix is defined as $\Sigma_{ij} \equiv \Sigma_{D_\alpha,\alpha}(t_i,t_j)$ and we take a constant μ without loss of generality. Furthermore, we can easily incorporate localization errors by adding the diagonal term $\sigma_i^2\delta_{ij}$ to the covariance matrix Σ, which assumes that errors are uncorrelated and normally distributed with standard deviations σ = {σ_i} (see Methods). Ultimately, given a trajectory r and the localization errors σ, the likelihood (2) can be used to estimate the diffusion parameters D_α and α, either via optimization or by sampling with the Metropolis-Hastings algorithm^23,31.
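Equations (1) and (2) translate almost directly into code. The sketch below is illustrative only and is not taken from GP-Tool: the function names are hypothetical, a single coordinate is treated (independent coordinates would simply have their log-likelihoods summed), times are assumed to be measured from an origin such that every t_i > 0, μ is treated as a known constant, and a coarse grid search stands in for the optimization or Metropolis-Hastings sampling used in the study.

```python
import numpy as np

def fbm_kernel(t, d_alpha, alpha):
    """FBM covariance of Eq. (1) evaluated on all pairs of times in t."""
    t = np.asarray(t, dtype=float)
    ti, tj = t[:, None], t[None, :]
    return d_alpha * (np.abs(ti) ** alpha + np.abs(tj) ** alpha
                      - np.abs(ti - tj) ** alpha)

def log_likelihood(r, t, d_alpha, alpha, sigma=0.0, mu=0.0):
    """Normalized log of Eq. (2) for one coordinate of one trajectory.

    sigma is a scalar or one standard deviation per time point; its square
    is added to the diagonal of the covariance, as described in the text.
    """
    r = np.asarray(r, dtype=float)
    t = np.asarray(t, dtype=float)
    noise = np.broadcast_to(np.asarray(sigma, dtype=float) ** 2, t.shape)
    cov = fbm_kernel(t, d_alpha, alpha) + np.diag(noise)
    chol = np.linalg.cholesky(cov)         # cov = chol @ chol.T
    z = np.linalg.solve(chol, r - mu)      # whitened residuals
    return (-0.5 * z @ z - np.log(np.diag(chol)).sum()
            - 0.5 * t.size * np.log(2.0 * np.pi))

def grid_search(r, t, sigma, d_grid, a_grid):
    """Crude maximum-likelihood estimate over a parameter grid (illustration only)."""
    scores = [(log_likelihood(r, t, d, a, sigma), d, a)
              for d in d_grid for a in a_grid]
    _, d_best, a_best = max(scores)
    return d_best, a_best
```

Missing frames need no special treatment in this formulation: the times t only have to list the frames that were actually observed, so occluded points are simply left out of r and t.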

We first test the performance of the GP-FBM method on synthetic trajectories simulated from a FBM model with a given time step (dt). To mimic the measurement noise observed in real trajectories, we introduce the localization error σ and the occlusion rate o. An example of a simulated trajectory and the effect of the measurement noise is shown in Fig. 1a. We then obtain a posterior distribution over the parameters D_α and α given the trajectory and the localization errors by combining the likelihood (2) with flat priors over all parameters, which can be sampled using Markov Chain Monte Carlo (MCMC) (see Fig. 1b and Methods). Interestingly, once the diffusion parameters are estimated, the power of the GP framework can be used to infer the most probable trajectory of the particle by removing measurement noise and predicting the particle position where occlusions or misdetections occurred (Fig. 1a). To systematically evaluate the performance of our method to infer diffusion parameters, we generated 50,000 synthetic trajectories using uniformly distributed random values of D_α and α in the range 0 < D_α < 1.5 and 0 < α < 2. For generality, we also sample the simulation time step (dt), localization error (σ) and occlusion ratio (o) from a uniform random distribution in the respective ranges: 0.1 < dt < 1.0, 0.001 < σ < 0.25 and 0 < o < 0.8. We compared the results obtained using our approach on the simulated data with the traditional MSD and DDB methods. GP-FBM clearly outperforms both methods, producing smaller relative errors of parameter estimation overall (Fig. 1c) and across different parameter ranges (Supplementary Figs. 1 and 2). Unlike GP-FBM, both MSD and DDB methods require trajectories to be split into individual displacements, thus neglecting higher-order temporal correlations that the trajectories may contain. Therefore, the GP-FBM method can optimally infer diffusion parameters from single trajectories, using all the information contained in the data and thus achieving greater precision.
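The simulation and trajectory-recovery steps described in this section can be sketched as follows. This is a minimal example built on standard Gaussian-process conditioning, not the authors' code: all function names are hypothetical, a single coordinate is simulated, the path is drawn by Cholesky factorization of the FBM covariance (one of several possible samplers), and the diffusion parameters are assumed to be already known when interpolating.

```python
import numpy as np

def fbm_cov(t_a, t_b, d_alpha, alpha):
    """FBM kernel of Eq. (1) between two sets of times."""
    ta = np.asarray(t_a, dtype=float)[:, None]
    tb = np.asarray(t_b, dtype=float)[None, :]
    return d_alpha * (np.abs(ta) ** alpha + np.abs(tb) ** alpha
                      - np.abs(ta - tb) ** alpha)

def simulate_track(n_frames, dt, d_alpha, alpha, sigma, occlusion, rng):
    """Draw a 1D FBM path, add Gaussian localization noise and drop frames."""
    t = dt * np.arange(1, n_frames + 1)
    cov = fbm_cov(t, t, d_alpha, alpha) + 1e-12 * np.eye(n_frames)  # jitter
    true_path = np.linalg.cholesky(cov) @ rng.standard_normal(n_frames)
    noisy = true_path + sigma * rng.standard_normal(n_frames)
    seen = rng.random(n_frames) >= occlusion   # False marks an occluded frame
    return t, true_path, noisy, seen

def gp_interpolate(t_obs, r_obs, t_new, d_alpha, alpha, sigma=0.0, mu=0.0):
    """Posterior mean and standard deviation of the path at times t_new."""
    t_obs = np.asarray(t_obs, dtype=float)
    r_obs = np.asarray(r_obs, dtype=float)
    k_oo = fbm_cov(t_obs, t_obs, d_alpha, alpha) + sigma ** 2 * np.eye(t_obs.size)
    k_no = fbm_cov(t_new, t_obs, d_alpha, alpha)
    k_nn = fbm_cov(t_new, t_new, d_alpha, alpha)
    mean = mu + k_no @ np.linalg.solve(k_oo, r_obs - mu)
    cov = k_nn - k_no @ np.linalg.solve(k_oo, k_no.T)
    return mean, np.sqrt(np.clip(np.diag(cov), 0.0, None))

# Example: 100 frames, 10% occlusion, then denoise and fill the gaps.
rng = np.random.default_rng(0)
t, true_path, noisy, seen = simulate_track(100, 0.5, 0.2, 0.5, 0.05, 0.1, rng)
mean, std = gp_interpolate(t[seen], noisy[seen], t, 0.2, 0.5, sigma=0.05)
```

For α = 1 and no noise, the posterior mean between two observed points reduces to linear interpolation (the Brownian bridge), which gives a quick sanity check of the conditioning step.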

**Fig. 1: GP-FBM outperforms existing methods on simulated data.**

Accounting for substrate movement with GP-FBM

Often a particle may be subject to secondary movement that is entangled with its diffusion dynamics. In chromatin dynamics, this movement is frequently associated with the substrate in which the particle is diffusing, such as cell displacement, membrane fluctuations or chromatin reallocation, as well as technical considerations such as thermal drift and undesired media flow. If overlooked, this may result in over-estimation of the diffusive properties. However, when two or more particles are measured in the same context, this substrate movement can be accounted for by analyzing the cross-correlation introduced between the particle trajectories. To that end, we developed a covariance model that takes advantage of the GP-FBM framework to quantify substrate movement and handle the cross-correlation that it may introduce into the movement of all particles (see Methods). In the case of two particles, we obtain the probability distribution,

$$\rho(\boldsymbol{r}_1, \boldsymbol{r}_2 \mid \boldsymbol{\alpha}, \boldsymbol{D}_{\boldsymbol{\alpha}}) \propto \exp\left\{-\frac{1}{2}\left(\begin{array}{c}\boldsymbol{r}_1\\ \boldsymbol{r}_2\end{array}\right)^{T}\left(\begin{array}{cc}\Sigma_1+\Sigma_R & \Sigma_R\\ \Sigma_R & \Sigma_2+\Sigma_R\end{array}\right)^{-1}\left(\begin{array}{c}\boldsymbol{r}_1\\ \boldsymbol{r}_2\end{array}\right)\right\},$$

(3)

where Σ₁, Σ₂ and Σ_R are the FBM covariance matrices for the two particles and the substrate, respectively, with diffusion parameters D_α = {D_α,1, D_α,2, D_α,R} and α = {α₁, α₂, α_R}. This method can easily be extended to a higher number of particles, limited in practice only by the required computational power, even though most of the correction is already achieved with two particles (Supplementary Fig. 3). In this study, we restrict our analysis to five particles per cell.
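The block structure of Eq. (3) can be made concrete with a short sketch. The code below is an illustration under assumptions rather than the published implementation: the function names are hypothetical, both particles are taken to be observed at the same times, one coordinate is treated, the trajectories are assumed to be already mean-subtracted, and a single scalar localization error is used.

```python
import numpy as np

def fbm_kernel(t, d_alpha, alpha):
    """FBM covariance of Eq. (1) on all pairs of times in t."""
    t = np.asarray(t, dtype=float)
    ti, tj = t[:, None], t[None, :]
    return d_alpha * (np.abs(ti) ** alpha + np.abs(tj) ** alpha
                      - np.abs(ti - tj) ** alpha)

def joint_covariance(t, d, a):
    """Block covariance of Eq. (3); d and a hold (particle 1, particle 2, substrate)."""
    s1, s2, s_r = (fbm_kernel(t, d[k], a[k]) for k in range(3))
    return np.block([[s1 + s_r, s_r],
                     [s_r, s2 + s_r]])

def joint_log_likelihood(r1, r2, t, d, a, sigma=0.0):
    """Normalized log of Eq. (3) for two mean-subtracted 1D trajectories."""
    stacked = np.concatenate([r1, r2]).astype(float)
    cov = joint_covariance(t, d, a) + sigma ** 2 * np.eye(stacked.size)
    _, logdet = np.linalg.slogdet(cov)
    quad = stacked @ np.linalg.solve(cov, stacked)
    return -0.5 * (quad + logdet + stacked.size * np.log(2.0 * np.pi))
```

The off-diagonal blocks Σ_R carry the cross-correlation imprinted by the shared substrate. One consequence is that the difference r₁ − r₂ has covariance Σ₁ + Σ₂, so the substrate term cancels exactly in relative coordinates, whereas the joint model additionally recovers the substrate parameters themselves.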

To demonstrate the utility of this approach, we generated 2000 synthetic trajectories as before, but now including substrate movement, generating vectors r_i as a combination of substrate displacement R and the actual particle displacement a_i (Fig. 2a). Simulations are generated with 10% occlusion rate and localization error, which are values commonly found in our experiments. As expected, D_α and α tend to be overestimated if substrate movement is not considered; however, the parameters are more precisely determined when the substrate correction is incorporated into the model (Fig. 2b,c). The method is also able to estimate the dynamic properties of the substrate and, albeit with less precision, the substrate movement itself (see Fig. 2d, Supplementary Fig. 4 and Methods). Furthermore, we tested the performance of the method depending on the number of tracked particles subjected to the same substrate movement. Precision increases with the use of more particles, but the bulk of the error is already removed with only two particles (see Supplementary Fig. 3). Finally, we showed that GP-FBM outperforms the DDB method even when the substrate movement is taken into account (see Methods and Supplementary Fig. 5). In conclusion, GP-FBM can remove the substrate movement from the analysis, which is demonstrably important for precise measurement of diffusion parameters. Other approaches can be applied to estimate cell movements and correct trajectories^32,33, but GP-FBM has the advantage of being able to derive this information directly from the trajectories themselves, provided that two or more particles are tracked per cell. Consequently, we are able to automatically characterize the substrate movement and remove it from the analysis without the need for extra image processing steps, thanks to the cross-correlation that this external movement imprints on the particle diffusion dynamics.

**Fig. 2: GP-FBM can correct for substrate movement to improve estimation of diffusion parameters.**

Analyzing chromatin dynamics in interphase and mitosis

Due to chromosome compaction and condensation, chromatin density increases by a factor of three during mitosis³⁰. Although the structure of mitotic chromatin has been intensively studied^5,34,35, it is unknown if or how the higher density and rearrangement of chromatin fibers affects chromatin diffusion properties. To measure chromatin dynamics in interphase and mitosis, we used a mouse ES cell line carrying approximately 20 TetO arrays of 7 kb length inserted at random genomic locations³⁶. GFP::TetR is stably expressed in these cells, where it binds to the TetO arrays for the simultaneous visualization of several chromatin loci in each cell. We performed confocal live-imaging and distinguished interphase and mitotic cells by DNA staining using Hoechst 33342, recording images at 4 frames per second for 75 s. To increase the number of mitotic cells, we also performed live-imaging experiments on cells arrested in prometaphase with nocodazole (see Fig. 3a and Methods). We tracked spots using ICY³⁷ and enhanced particle localization precision by fitting a 2D Gaussian function to the signal of the tracked spots (see Methods and Supplementary Fig. 6). Before applying the GP-FBM probabilistic framework, we first determined whether the measured stochastic trajectories present, to a certain approximation, self-similar Gaussian distributed displacements and a FBM velocity autocorrelation function (see Methods). Interestingly, that seems to be the case for chromatin movements at the time scale of this study, hence GP-FBM is an appropriate approach for the analysis (Supplementary Figs. 7 and 8).
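One of the consistency checks mentioned above, the comparison of the measured velocity autocorrelation with the FBM prediction, can be illustrated with a short sketch. The exact procedure is the one described in Methods and is not reproduced here; the function names are hypothetical, velocities are taken as one-frame displacements of a single coordinate, and lags are expressed in frames. The closed form follows from Eq. (1): for one-frame displacements, the normalized autocorrelation at lag k is [(k+1)^α + |k−1|^α − 2k^α]/2.

```python
import numpy as np

def fbm_vacf(lags, alpha):
    """Normalized velocity autocorrelation of FBM for one-frame displacements."""
    k = np.abs(np.asarray(lags, dtype=float))
    return 0.5 * ((k + 1.0) ** alpha + np.abs(k - 1.0) ** alpha - 2.0 * k ** alpha)

def empirical_vacf(x, max_lag):
    """Normalized autocorrelation of one-frame displacements of a 1D track.

    The mean displacement is not subtracted, in line with the zero-mean
    increments of FBM; drift would have to be removed beforehand.
    """
    v = np.diff(np.asarray(x, dtype=float))
    c = np.array([np.mean(v[: v.size - k] * v[k:]) for k in range(max_lag + 1)])
    return c / c[0]
```

For sub-diffusive motion (α < 1) the predicted value at lag 1 is negative, reflecting the tendency of a constrained locus to reverse direction, and for α = 1 all non-zero lags vanish, as expected for ordinary Brownian motion.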

**Fig. 3: Average chromatin dynamics are similar in interphase and mitosis, but are highly variable across loci.**

Comparing the performance of GP-FBM with and without substrate movement correction, it was apparent that actively dividing mitotic cells had greater substrate movement (presumably due to coordinated alignment and movement of chromosomes by the mitotic spindle), but that appreciable correction was required for precise chromatin dynamics measurements in all conditions (Fig. 3b). Surprisingly, we observed no significant differences in the mean apparent diffusion or the mean anomalous coefficients between interphase and mitotic chromosomes (p ≥ 0.05), suggesting that condensation may not necessarily affect the average local diffusion dynamics of chromatin (Fig. 3c and d). We observed a small but significant increase in the anomalous coefficient of mitotic-arrested cells compared to interphase, which might be related to the effect that nocodazole has on microtubule formation and thus mitotic chromosome stability³⁸.

Interestingly, we obtained a wide range of estimated D_α and α coefficients indicating a remarkable spot-to-spot variability in their diffusion dynamics, even when correcting for substrate movement. This variability could partially be caused by differences in the state of the analyzed cells (inter-cell variability) leading to different overall chromatin dynamics. Alternatively, differences in the chromatin context of the genomic loci could lead to specific diffusion dynamics (intra-cell variability). Applying the law of total variance (see Methods), we quantified the contribution of inter-cell vs intra-cell variability (Fig. 3e and f). Strikingly, as much as 75% of the variability in D_α and 65% in α could be explained by differences within the same cells. This estimate is even higher when substrate movement is taken into account, especially in the case of mitotic cells, when mouse ES cells tend to detach from their colonies, becoming more prone to movement. In contrast, the nocodazole arrested cells are allowed to sediment onto the glass surface, thus are less mobile during imaging. Together, this suggests that different genomic loci may have characteristic local diffusion properties due to their specific chromatin or nuclear context.
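The variance decomposition used here rests on the law of total variance, Var(X) = E[Var(X | cell)] + Var(E[X | cell]), where the first term is the intra-cell contribution and the second the inter-cell contribution. The sketch below shows one simple way to compute the two terms from per-spot estimates; it is an assumption-laden illustration (hypothetical function name, population variances, cells weighted by their number of spots) and may differ from the estimator described in Methods.

```python
import numpy as np

def variance_decomposition(values, cell_ids):
    """Split the variance of per-spot estimates into intra- and inter-cell parts.

    values   : one estimate (e.g. D_alpha or alpha) per tracked spot
    cell_ids : label of the cell each spot belongs to
    Returns (intra, inter); their sum equals the population variance of values.
    """
    values = np.asarray(values, dtype=float)
    cell_ids = np.asarray(cell_ids)
    grand_mean = values.mean()
    intra = 0.0
    inter = 0.0
    for cell in np.unique(cell_ids):
        x = values[cell_ids == cell]
        weight = x.size / values.size           # fraction of spots in this cell
        intra += weight * x.var()               # E[Var(X | cell)]
        inter += weight * (x.mean() - grand_mean) ** 2   # Var(E[X | cell])
    return intra, inter
```

The intra-cell fraction is then `intra / (intra + inter)`, the quantity that corresponds to the percentages quoted in the text.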

Distinguishing locus- and cell-specific diffusion properties

Except for a tendency for chromatin mobility to be reduced at centromeric or telomeric locations in yeast³⁹, little is known about how different genomic contexts may affect dynamics of the underlying chromatin. Further, previous studies give conflicting views on whether transcriptional activation can increase local confinement of a gene (as observed in the same cell before and after estrogen stimulation⁷) and/or increase gene mobility (as observed comparing cells before and after differentiation⁸). To compare the diffusion properties of different specific genomic regions, we performed double-labeling and live tracking experiments with the ANCHOR system⁴⁰ around the HoxA locus in mouse ES cells before and after induction of Hox genes with retinoic acid. We engineered the ANCH1 and ANCH3⁷ labels into different locations within the same allele to generate two ES lines with equidistant probes assessing inter-TAD (T1-T2) or intra-TAD (T2-T3) associations (Fig. 4a, b) and imaged at 2 frames per second for 2 min. As may be expected, the average inter-probe distance was higher for the inter-TAD than intra-TAD combination, but with large heterogeneity in the distance distributions (Fig. 4c;⁴¹). Interestingly, Hox gene induction had no effect on intra-TAD distances within the neighboring domain, but increased inter-TAD distances, supporting the idea of general TAD reinforcement as cell differentiation is induced⁴². As tests of Gaussianity and velocity autocorrelation again verified approximation of chromatin dynamics to FBM (see Methods and Supplementary Figs. 7 and 8), we performed GP-FBM for the three loci and found that, in undifferentiated ES cells, although they have equivalent apparent diffusion rates, region T1 is significantly more constrained than T2 or T3 (Fig. 4d, e). 
Closer inspection of the ES (and differentiated neuronal precursor cell) epigenomic profiles around these regions showed that T1 is close (<15 kb) to a putative active enhancer of Halr1 (Supplementary Fig. 9). This gene encodes the long non-coding RNA Haunt, whose specific expression in ES cells is linked to suppression of the HoxA genes⁴³. Active histone modifications around T1, compared to the silent T2 and T3 regions, correlate with a greater constraint of the chromatin, in line with a previous study of an estrogen-induced gene⁷ and predictive polymer models¹⁴. Hox gene induction by retinoic acid had no significant effect on the diffusive rate of T1 but did reduce locus constraint (Fig. 4d, e). In contrast, the region T2, which lacks any known epigenomic or regulatory features, had increases in D_α and α, perhaps indicative of general chromatin remodeling caused by the onset of differentiation. Curiously, T3 became more constrained on retinoic acid treatment, with a concomitant increase in mobility. This region contains sites bound by the architectural protein CTCF, whose binding is either lost or reduced on differentiation to neuronal precursors (Supplementary Fig. 9). CTCF is proposed to form a roadblock for cohesin-mediated loop extrusion processes^44,45, and this may be expected to play out in alterations to local chromatin dynamics, although this has been largely unexplored. Overall, these results show previously unappreciated locus-specific variation in chromatin diffusive properties, which in some cases correlates with underlying histone modifications or CTCF binding.

**Fig. 4: Chromatin dynamics of three chromatin loci within the HoxA genomic region.**

GP-Tool allows user-friendly application of GP-FBM

To facilitate use of GP-FBM by the community, we developed a freely available graphical user interface called GP-Tool (Fig. 5; github.com/guilmont). GP-Tool contains four plugins: movie, alignment, trajectories and g-process. The movie plugin allows the user to open TIFF files, display basic ImageJ and OME metadata, define colormaps for each channel and manually correct for contrast. The alignment plugin runs the algorithm described in Methods to digitally correct chromatic aberration and possible camera alignment issues. Alternatively, the user can manually modify each of the parameters. Finally, the g-process plugin allows the user to infer optimal values for the apparent diffusion and anomalous coefficients for several cells in the same movie whilst correcting for substrate movement if two or more particles are selected. It is also possible to use a Metropolis-Hastings sampler to obtain the posterior probability distribution associated with each of these parameters. Once the analysis is complete, the results can be saved as JSON files, and tables can be exported in CSV format. Both formats are easily parsed in all major programming languages, such as C/C++, Python and R. GP-Tool also provides shared libraries and C/C++ examples for batch processing of multiple movies. Complete documentation of the software can be found in the aforementioned GitHub account and in the Supplementary Materials.

**Fig. 5: GP-Tool: A graphical user interface to apply GP-FBM on microscopy movies.**

Discussion

We developed GP-FBM, a Bayesian framework that combines the inference power of Gaussian processes with fractional Brownian motion, a flexible model to describe Gaussian-like diffusion dynamics. Importantly, chromatin loci show Gaussian and self-similar displacement distributions, indicating that FBM is an adequate approximation to assessing chromatin movement, at least for the time scales over which experiments are commonly performed (from a few up to hundreds of seconds). Notice that for longer time scales a crossover has been observed between different diffusion regimes and our model would have to be modified to incorporate this behavior¹⁶. Note also that a myriad of other biological systems have non-Gaussian dynamics, so would not be suitably analyzed by GP-FBM^20,21,46,47.

GP-FBM treats stochastic trajectories as a whole without preprocessing or extracting limited statistics from them. Therefore, this approach optimally utilizes all the information contained in the data by incorporating higher-order temporal correlations into the analysis. In addition, the Gaussian process framework allows easy integration of spot-dependent localization errors, which translate into a consistent weighting of time points depending on the precision with which the spot position is determined. Furthermore, missing data due to spot misdetection or occlusion do not hinder the analysis; on the contrary, GP-FBM can be used to probabilistically assign spot positions for any given time point. Finally, when two or more particles are tracked in a similar context, GP-FBM uses possible cross-correlations between trajectories to characterize substrate movement and, therefore, remove it from the analysis. A number of other methods have been developed over recent years to better characterize diffusion of particles, employing variations of MSD^18,48, probability density functions for particle displacements^19,20,21, Bayesian inference^27,28 or even machine learning approaches^49,50. However, to our knowledge, these methods are either not readily applicable to experimental data, require extra experiments to precisely measure complementary parameters to determine background movement, require large amounts of varying training sets to account for different shapes/types of input data, and/or are not robust to mislocalization and occlusion events that are commonplace in imaging experiments. Benchmarking against MSD and DDB, GP-FBM shows improved results over all combinations and ranges of tested parameters. GP-FBM is thus a precise and robust tool. Provided that the model assumptions are fulfilled, this increase in accuracy can be crucial to study changes in diffusion dynamic properties in different conditions.

We applied GP-FBM to two ES cell systems and observed a large variability in chromatin dynamics when comparing individual cells and comparing different loci. A fraction of this variability can be explained by differences across cells, especially in interphase cells, indicating that cell state (cell cycle or metabolism) may globally influence chromatin dynamics. However, the majority of the observed variability is related to differences across loci. Unexpectedly, chromatin exhibits similar average diffusion dynamics in interphase and mitosis despite a large difference in chromatin density. This result may be related to recent findings showing that mitotic chromatin is not as inaccessible and inert as previously thought. Indeed, several studies have shown that mitotic chromatin is bound by transcription factors^51,52 and some genes are even transcribed during mitosis⁵³. In contrast, different genomic loci can have striking differences from average chromatin dynamic properties, which in some cases correlate with underlying functional chromatin marks. It has been previously proposed that chromatin mobility is affected directly by transcription, although results were seemingly conflicting^7,8. More widespread application of GP-FBM to labeled transcribed loci and other specific regulatory elements, such as enhancers or TAD borders, will likely uncover more interesting functional links between genome function and the dynamics of its component chromatin.

Finally, we present GP-Tool, a graphical user interface that helps to perform GP-FBM analysis on microscopy movies with only a few mouse clicks. Importantly, this tool and the GP-FBM framework can be applied to study not only chromatin dynamics but potentially any labeled particle that can be tracked over time, provided that Gaussianity and other assumptions of FBM are met. Alternatively, the FBM kernel used in this study can potentially be replaced by alternative kernels that may better describe dynamics of other systems. We thus anticipate that GP-FBM and GP-Tool will greatly facilitate the analysis of diffusion dynamics in biology.

Methods

Cell lines, culture, and treatments

Transgenic TetO ES line

The mouse ES cell line was kindly provided by Dr. Luca Giorgetti. It is derived from an X0 clone of the PGKT2 subclone of the feeder-independent PGK12.1 mouse ESC line which was engineered by co-transfection with pBROAD3-TetR-ICP22NLS-eGFP and pcDNA3.1Hygro to stably express the TetR-eGFP recombinant protein after random integration and hygromycin selection (250 μg/ml) as described in ref. ^36,54. The piggyBac transposon system was then used to generate cells with 20–25 stable random integrations of a 150 TetO binding site array as described in³⁶. Cells were cultured on 0.1% gelatin-coated culture plates in DMEM (4.5 g/l glucose) supplemented with GLUTAMAX-I, 15% fetal calf serum (ES cell culture tested), 0.1 mM beta-mercaptoethanol, 1,500 U/ml leukemia inhibitory factor (LIF; produced in house), and 0.1 mM non-essential amino acids in 5% CO₂ at 37 °C. Mitotic arrest was performed by treating the cells for 5 h with 100 ng/ml Nocodazole (Sigma, M1404-2MG). This cell line is available upon reasonable request.

Transgenic ANCHOR ES lines

J1 mouse ES cells were grown on gamma-irradiated mouse embryonic fibroblast cells under standard conditions (4.5 g/L glucose-DMEM, 15% FCS, 0.1 mM non-essential amino acids, 0.1 mM beta-mercaptoethanol, 1 mM glutamine, 500 U/mL LIF, gentamicin), then passaged onto feeder-free 0.2% gelatin-coated plates for at least two passages to remove feeder cells before subsequent transfections. The two (“inter-TAD” and “intra-TAD”) ANCHOR transgenic lines were generated by sequential CRISPR/Cas9-mediated knock-in experiments in the following manner. First, flanking homology arms (mm9 chr6: 52,320,061-52,321,144, and chr6: 52,321,145-52,322,244) were introduced by PCR amplification and Gibson assembly into a vector containing ANCH1 sequence⁴⁰. This vector (1 μg) was co-transfected with 3 μg of a vector containing Cas9-GFP, a puromycin resistance marker, and the scaffold to transcribe the sgRNA specific to the T2 insertion site (CGGCGCGCACTTAACACCAA; vector generated by the IGBMC Molecular Biology platform) in 1 million cells with Lipofectamine-2000. Two days after transfection, the cells were cultured for 24 h with 3 μg/ml puromycin, then 48 h with 1 μg/ml puromycin to enrich for transfected cells, before sorting individual GFP-positive cells onto feeders to amplify individual clones. Clones with the correct sequence were screened by PCR and sequencing, then the CRISPR knock-in process was repeated to insert the ANCH3 sequence⁷ into either the T1 site (“inter-TAD” line; homology arms at chr6: 52,013,471-52,014,370 and chr6: 52,014,371-52,015,270; gRNA sequence AATCGAGCTCACGCCATTAG) or the T3 site (“intra-TAD” line; homology arms at chr6: 52,622,955-52,623,855 and chr6: 52,623,856-52,624,755; gRNA sequence TATGCTGAGGCGTGTCGCAA). Final clones were verified for maintained pluripotency by qRT-PCR to assess Oct4, Nanog (e.g. Supplementary Fig. 9), and Sox2 expression. 
Subsequent microscopy experiments (see below) confirmed heterozygous incorporation of the ANCH sequences (detection of one specific spot per ANCH sequence per cell) within the same allele (two spots were always in close proximity). This cell line is available upon reasonable request.

OR transfection

A total of 150,000 cells were plated off feeder cells onto laminin-511-coated 35 mm glass-bottom Petri dishes two days prior to imaging, and transfected with 3 μg OR1-EGFP and 3 μg OR3-IRFP plasmids using Lipofectamine-2000. The vectors are available from NeoVirTech (contact@neovirtech.com) and were modified from the original source by changing the C-terminal fluorescent protein sequence, introducing a Kozak sequence before the translation start site and replacing the CMV promoter with EF-1α. After two days, the medium was changed to remove dead cells, before passing directly to microscopy.

Hox induction

ES cells were passaged without feeders and cultured on laminin-511 for two days without LIF, then for a subsequent three days without LIF and with the addition of 5 μM retinoic acid. One day after the addition of retinoic acid, the cells were transfected with the OR1-EGFP and OR3-IRFP plasmids as described above.

Microscopy

Live cell imaging of TetO ES cells

35 mm glass-bottom dishes (Ibidi 81158) were coated with 10 μg/ml fibronectin human plasma (Sigma, F2006-1MG) in PBS for 45 min at room temperature. A total of 3–5 × 10⁵ cells were seeded one day before imaging, then the medium was replaced by phenol-red-free medium containing 500 ng/ml Hoechst 33342 (Invitrogen, H3570). Cells arrested in mitosis were collected on the day of imaging by “shake-off”, incubated with 0.25% Trypsin-1 mM EDTA (Invitrogen, 25200-072) for 1 min at 37 °C and washed, and placed on fibronectin-coated glass-bottom dishes in phenol-red-free medium containing 100 ng/ml Nocodazole and 500 ng/ml Hoechst 33342. Confocal live-cell imaging was performed on a Nikon Eclipse Ti-E inverted widefield microscope (Perfect Focus System) equipped with a CSU-X1 confocal scanner unit and an Evolve back-illuminated EMCCD camera (Photometrics). Images were recorded using 100 × HC Plan APO oil immersion objective (Leica, NA 1.4). Intensities were set to 10% for the 405 nm and 30% or 50% for the 491 nm lasers, with exposure times of 100 ms and 50 ms or 25 ms, respectively. 5 z-stacks with 0.5 μm distances were recorded for each channel. 301 time-lapse images were recorded only in the 491 channel.

Live cell imaging of ANCHOR ES cells

Imaging experiments were performed on an inverted Nikon Eclipse Ti microscope equipped with a PFS (perfect focus system), a Yokogawa CSU-X1 confocal spinning disk unit, two sCMOS Photometrics Prime 95B cameras for simultaneous dual acquisition to provide 95% quantum efficiency at 11 μm × 11 μm pixels and a Leica 100× oil objective (HC PL APO 1.4 oil immersion). We excited EGFP and IRFP with a 491 nm (100 mW) and a 635 nm laser (> 28 mW), respectively. We detected green and far-red fluorescence with an emission filter using a 525/50 nm and a 708/75 nm detection window, respectively. A thermostated heater (Tokai Hit Stage Top Incubator) allowed for heating at 37 °C, humidity, and CO₂ control (5%). Time-lapse analysis of GFP and IRFP foci was performed in 2D, acquiring 241 time points at a 0.5 s time interval. The system was controlled using Metamorph 7.10 software. Each time-lapse was concatenated into a single TIFF file.

RT-qPCR

RNA was extracted from cells using the Nucleospin RNA extraction kit (Macherey-Nagel), then cDNA was prepared with SuperScript IV (Invitrogen), following the manufacturer’s instructions and using random hexanucleotides as primers. The cDNA was quantified by qPCR on a LC480 LightCycler (Roche), using the QuantiTect SYBR Green PCR kit (Qiagen). Amplification was normalized to GAPDH. Primer sequences are given in Supplementary Table 1.

Image pre-processing

Spot detection and tracking

Spot detection and tracking for all movies was performed with ICY, an image analysis software³⁷. Localization precision was then enhanced by assuming that the spots have the shape of a 2D Gaussian function as follows,

$${S}_{x,y}={I}_{o}\,\exp \left\{-\frac{1}{2}{\left(\begin{array}{l}x-{\mu }_{x}\\ y-{\mu }_{y}\end{array}\right)}^{T}{\left[\begin{array}{ll}{L}_{x}^{2}&\theta {L}_{x}{L}_{y}\\ \theta {L}_{x}{L}_{y}&{L}_{y}^{2}\\ \end{array}\right]}^{-1}\left(\begin{array}{l}x-{\mu }_{x}\\ y-{\mu }_{y}\end{array}\right)\right\}+{B}_{G}.$$

(4)

with μ_i representing the center of mass of the spot, L_i its size in directions x and y, −1 < θ < 1 a possible rotation, while B_G and I_o are background and spot signal, respectively. We optimize the spot localization using the Nelder-Mead simplex method⁵⁵ and estimate the localization error using the Metropolis-Hastings algorithm^23,31. This method is implemented in GP-Tool and runs automatically when trajectories are loaded. For more information see Supplementary Figs. 6 and 10.
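As an illustration of equation (4), the spot model can be evaluated on a pixel grid as in the following minimal Python sketch. The function name `gaussian_spot`, its argument names and the numerical values are hypothetical and are not taken from GP-Tool; the fitting step itself is not shown.

```python
import numpy as np

def gaussian_spot(x, y, I0, mu, L, theta, bg):
    """Evaluate the 2D Gaussian spot model of equation (4) at pixel coordinates x, y."""
    Lx, Ly = L
    # Shape matrix from equation (4); theta couples the x and y widths
    shape = np.array([[Lx**2, theta * Lx * Ly],
                      [theta * Lx * Ly, Ly**2]])
    prec = np.linalg.inv(shape)
    dx, dy = x - mu[0], y - mu[1]
    quad = prec[0, 0] * dx**2 + 2.0 * prec[0, 1] * dx * dy + prec[1, 1] * dy**2
    return I0 * np.exp(-0.5 * quad) + bg

# Illustrative values only: a 15 x 15 pixel window with a spot centred at x = 7, y = 6
yy, xx = np.mgrid[0:15, 0:15]
spot = gaussian_spot(xx, yy, I0=100.0, mu=(7.0, 6.0), L=(1.5, 2.0), theta=0.3, bg=10.0)
```

In a fit, the residual between `spot` and the measured window would be minimized over `I0`, `mu`, `L`, `theta` and `bg`.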

Multi-channel alignment correction

For the ANCHOR ES cell line experiments, we used a spinning disk microscope setup with 2 cameras, i.e., one per channel. Even though these cameras were aligned manually using fluorescent beads, we could still observe non-negligible differences between images captured in both cameras. Furthermore, even in rare situations when both cameras were properly aligned, we could observe effects of chromatic aberrations towards the edges of the image due to the different wavelengths used. To correct for such problems, we performed digital post-alignment using a generic set of affine transformations including translation, rotation and scaling as defined in

$${{\Omega }}=\left(\begin{array}{lll}{s}_{x}&0&(1-{s}_{x})W/2\\ 0&{s}_{y}&(1-{s}_{y})H/2\\ 0&0&1\\ \end{array}\right)\left(\begin{array}{lll}1&0&{d}_{x}+{c}_{x}\\ 0&1&{d}_{y}+{c}_{y}\\ 0&0&1\\ \end{array}\right)\left(\begin{array}{lll}\cos (\theta )&\sin (\theta )&0\\ -\sin (\theta )&\cos (\theta )&0\\ 0&0&1\\ \end{array}\right)\left(\begin{array}{lll}1&0&-{c}_{x}\\ 0&1&-{c}_{y}\\ 0&0&1\\ \end{array}\right),$$

(5)

where s_i accounts for scaling in directions x and y, d_i accounts for translation in both directions and θ is the angle of rotation between both channels in relation to point c_i.
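A minimal sketch of how the matrix in equation (5) could be assembled, assuming homogeneous pixel coordinates (x, y, 1). The function `alignment_matrix` and its argument names are illustrative and not part of the published code.

```python
import numpy as np

def alignment_matrix(sx, sy, dx, dy, theta, cx, cy, W, H):
    """Compose scaling, translation and rotation about (cx, cy) as in equation (5)."""
    scale = np.array([[sx, 0.0, (1.0 - sx) * W / 2.0],
                      [0.0, sy, (1.0 - sy) * H / 2.0],
                      [0.0, 0.0, 1.0]])
    shift = np.array([[1.0, 0.0, dx + cx],
                      [0.0, 1.0, dy + cy],
                      [0.0, 0.0, 1.0]])
    rot = np.array([[np.cos(theta), np.sin(theta), 0.0],
                    [-np.sin(theta), np.cos(theta), 0.0],
                    [0.0, 0.0, 1.0]])
    recentre = np.array([[1.0, 0.0, -cx],
                         [0.0, 1.0, -cy],
                         [0.0, 0.0, 1.0]])
    return scale @ shift @ rot @ recentre

# With unit scaling, no translation and no rotation the transformation is the identity
omega_identity = alignment_matrix(1.0, 1.0, 0.0, 0.0, 0.0, cx=64.0, cy=64.0, W=128, H=128)
# A pure translation by (2, -3) pixels
omega_shift = alignment_matrix(1.0, 1.0, 2.0, -3.0, 0.0, cx=64.0, cy=64.0, W=128, H=128)
moved = omega_shift @ np.array([5.0, 5.0, 1.0])
```

The rightmost factor moves the rotation centre to the origin and the translation factor moves it back, so a pure rotation leaves the point c unchanged.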

To infer optimal parameters for correction, we used 5 frames from all the movies recorded in the session and maximized the following likelihood using the Nelder-Mead simplex method⁵⁵

$${{{{{{\mathrm{log}}}}}}}\,P\propto -\frac{WH}{2}{{{{{{\mathrm{log}}}}}}}\,\left\{\mathop{\sum}\limits_{k,l}{\left[{I}_{2}(k,l| {{\Omega }})-{I}_{1}(k,l| {\mathbb{1}})\right]}^{2}\right\},$$

(6)

where W and H correspond to width and height of images and I_r(k, l∣A) is the value of pixel (k,l) in channel r given transformation A. Here, ${\mathbb{1}}$ represents the identity matrix. Supplementary Fig. 11 shows examples of misaligned images and how the alignment improves greatly after applying our algorithm.
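The score in equation (6) can be sketched as follows. This example assumes nearest-neighbour resampling and assumes that Ω maps channel-2 coordinates onto channel-1 coordinates; both choices, the helper names and the synthetic images are assumptions of this sketch, and the published implementation may resample differently.

```python
import numpy as np

def warp_nearest(img, omega):
    """Resample img so that output pixel (x, y) takes the value found at omega^-1 (x, y)."""
    H, W = img.shape
    yy, xx = np.mgrid[0:H, 0:W]
    coords = np.stack([xx.ravel(), yy.ravel(), np.ones(H * W)])
    src = np.linalg.inv(omega) @ coords
    sx = np.clip(np.rint(src[0]).astype(int), 0, W - 1)
    sy = np.clip(np.rint(src[1]).astype(int), 0, H - 1)
    return img[sy, sx].reshape(H, W)

def alignment_log_likelihood(img1, img2, omega):
    """Equation (6): -(W H / 2) log of the summed squared pixel differences."""
    H, W = img1.shape
    resid = warp_nearest(img2, omega) - img1
    return -0.5 * W * H * np.log(np.sum(resid**2))

# Synthetic test images: channel 2 is channel 1 displaced by 2 pixels in x, plus noise
rng = np.random.default_rng(0)
yy, xx = np.mgrid[0:32, 0:32]
img1 = np.exp(-((xx - 16)**2 + (yy - 16)**2) / 8.0)
img2 = np.roll(img1, 2, axis=1) + 0.01 * rng.normal(size=img1.shape)

shift_back = np.array([[1.0, 0.0, -2.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])
ll_identity = alignment_log_likelihood(img1, img2, np.eye(3))
ll_shifted = alignment_log_likelihood(img1, img2, shift_back)
```

An optimizer would vary the parameters of Ω to maximize this score; here only two candidate transformations are compared.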

Derivation of GP-FBM models

Fractional Brownian motion

The covariance function of FBM can easily be derived from the assumption of two basic properties⁵⁶: stationary increments B(t) − B(s) ∝ B(t − s) and a power-law variance, $\left\langle B{(t)}^{2}\right\rangle \propto | t{| }^{\alpha }$. Then, the off-diagonal terms of the covariance function can be determined as follows:

$$\begin{aligned}\left\langle B_{\alpha}(t)B_{\alpha}(s)\right\rangle &\propto \frac{1}{2}\left\langle [B_{\alpha}(s)-B_{\alpha}(s)+B_{\alpha}(t)]B_{\alpha}(s)+B_{\alpha}(t)[B_{\alpha}(t)-B_{\alpha}(t)+B_{\alpha}(s)]\right\rangle \\ &=\frac{1}{2}\left\langle B_{\alpha}(s)^{2}+B_{\alpha}(t-s)B_{\alpha}(s)-B_{\alpha}(t)B_{\alpha}(t-s)+B_{\alpha}(t)^{2}\right\rangle \\ &=\frac{1}{2}\left\langle B_{\alpha}(t)^{2}+B_{\alpha}(s)^{2}+B_{\alpha}(t-s)\left(B_{\alpha}(s)-B_{\alpha}(t)\right)\right\rangle \\ &=\frac{1}{2}\left\langle B_{\alpha}(t)^{2}+B_{\alpha}(s)^{2}-B_{\alpha}(t-s)^{2}\right\rangle \\ &=\frac{1}{2}\left(|t|^{\alpha}+|s|^{\alpha}-|t-s|^{\alpha}\right).\end{aligned}$$

(7)

Finally, the apparent diffusion coefficient D_α is introduced as a proportionality factor to re-scale mobility, leading to the final kernel as presented in the main text:

$${{{\Sigma }}}_{{D}_{\alpha },\alpha }(t,s)=2{D}_{\alpha }\left\langle {B}_{\alpha }(t){B}_{\alpha }(s)\right\rangle .$$

(8)
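Combining equations (7) and (8) gives the kernel D_α(|t|^α + |s|^α − |t − s|^α). A minimal sketch of the corresponding covariance matrix is given below; the function name `fbm_covariance` is illustrative and not taken from the published libraries.

```python
import numpy as np

def fbm_covariance(times, D, alpha):
    """FBM kernel of equations (7)-(8): D * (|t|^a + |s|^a - |t - s|^a)."""
    t = np.asarray(times, dtype=float)
    tt, ss = np.meshgrid(t, t, indexing="ij")
    return D * (np.abs(tt)**alpha + np.abs(ss)**alpha - np.abs(tt - ss)**alpha)

# For alpha = 1 the kernel reduces to 2 D min(t, s), i.e. ordinary Brownian motion
cov = fbm_covariance([1.0, 2.0, 3.0], D=0.5, alpha=1.0)
```

The diagonal equals 2 D_α t^α, the variance of each coordinate at time t.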

We can also calculate the velocity autocorrelation function for the FBM model²⁶, which can be easily calculated from experimental trajectories using

$${C}_{\nu }^{(\epsilon )}(\tau )=\frac{1}{{\epsilon }^{2}}\left\langle (x(\tau +\epsilon )-x(\tau ))(x(\epsilon )-x(0))\right\rangle ,$$

(9)

where velocity is defined as $\nu (\tau )={\epsilon }^{-1}\left[x(\tau +\epsilon )-x(\tau )\right]$. Using that, the theoretical curve for velocity autocorrelation function for FBM is calculated to be

$$\frac{{C}_{\nu }^{(\epsilon )}(\tau )}{{C}_{\nu }^{(\epsilon )}(0)}=\frac{{(\tau +\epsilon )}^{\alpha }-2{\tau }^{\alpha }+| \tau -\epsilon {| }^{\alpha }}{2{\epsilon }^{\alpha }}.$$

(10)
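The theoretical curve of equation (10) and an empirical estimate in the spirit of equation (9) can be sketched as follows. The empirical version averages over all starting times of a single trajectory, which is an assumption of this example, and the function names are illustrative.

```python
import numpy as np

def vacf_theory(tau, eps, alpha):
    """Normalized velocity autocorrelation of FBM, equation (10)."""
    tau = np.asarray(tau, dtype=float)
    num = (tau + eps)**alpha - 2.0 * tau**alpha + np.abs(tau - eps)**alpha
    return num / (2.0 * eps**alpha)

def vacf_empirical(x, lag, eps):
    """Time-averaged velocity autocorrelation; lag and eps are given in frames."""
    v = (x[eps:] - x[:-eps]) / eps       # velocities as defined below equation (9)
    n = len(v) - lag
    return np.mean(v[:n] * v[lag:lag + n])

c0 = vacf_theory(0.0, 1.0, 0.5)          # equals 1 by construction
c1 = vacf_theory(1.0, 1.0, 0.5)          # negative for subdiffusion (alpha < 1)
c_brownian = vacf_theory(3.0, 1.0, 1.0)  # vanishes for alpha = 1 when tau >= eps
```

Dividing `vacf_empirical(x, lag, eps)` by `vacf_empirical(x, 0, eps)` gives the quantity that is compared with `vacf_theory`.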

To show that FBM is a viable approximation for the dynamics displayed by chromatin in the time range of our experimental measurements, we verified that displacements are self-similar Gaussian distributed with aforementioned covariance matrix and that its velocity autocorrelation agrees with theoretical predictions (Supplementary Figs. 7 and 8).

Bayesian inference of diffusion parameters

The GP provides the probability of observing a trajectory r given D_α and α. Then, we applied Bayes' theorem²³ to obtain the posterior distribution over the diffusion parameters given the measured trajectory:

$$P({D}_{\alpha },\alpha ,{{{{{{{\boldsymbol{\mu }}}}}}}}| {{{{{{{\boldsymbol{r}}}}}}}})=\frac{P({{{{{{{\boldsymbol{r}}}}}}}}| {D}_{\alpha },\alpha ,{{{{{{{\boldsymbol{\mu }}}}}}}})\,P({D}_{\alpha },\alpha ,{{{{{{{\boldsymbol{\mu }}}}}}}})}{\int P({{{{{{{\boldsymbol{r}}}}}}}}| {D}_{\alpha },\alpha ,{{{{{{{\boldsymbol{\mu }}}}}}}})\,P({D}_{\alpha },\alpha ,{{{{{{{\boldsymbol{\mu }}}}}}}})\,d{D}_{\alpha }\,d\alpha \,d{{{{{{{\boldsymbol{\mu }}}}}}}}},$$

(11)

where P(D_α, α, μ) represents the prior distribution of the model parameters. Assuming a flat prior on μ, D_α and α, the log-posterior can be expressed as

$${{{{{{\mathrm{log}}}}}}}\,(P({D}_{\alpha },\alpha ,{{{{{{{\boldsymbol{\mu }}}}}}}}| {{{{{{{\boldsymbol{r}}}}}}}}))\propto -\frac{1}{2}{({{{{{{{\boldsymbol{r}}}}}}}}-{{{{{{{\boldsymbol{\mu }}}}}}}})}^{T}{{{\Sigma }}}_{{D}_{\alpha },\alpha }^{-1}({{{{{{{\boldsymbol{r}}}}}}}}-{{{{{{{\boldsymbol{\mu }}}}}}}})-\frac{1}{2}{{{{{{\mathrm{log}}}}}}}\,| {{{\Sigma }}}_{{D}_{\alpha },\alpha }| -\frac{N}{2}{{{{{{\mathrm{log}}}}}}}\,(2\pi ),$$

(12)

where N represents the number of points measured and ∣ ⋅ ∣ is the determinant function. To obtain maximum posterior estimates, we optimized (12) using the Nelder-Mead simplex method⁵⁵. In addition, we used the Metropolis-Hastings (MH) method^23,31 to sample the posterior probability distribution in order to calculate confidence intervals for our estimations. Note that, thanks to this Bayesian approach, available prior knowledge of the diffusion parameters can easily be incorporated into the analysis. For more information regarding the MH sampler, see Supplementary Fig. 12.
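Equation (12) can be evaluated stably with a Cholesky factorization, as in the following sketch for a single coordinate. To keep the example dependency-free, a coarse grid search stands in for the Nelder-Mead optimizer, which is a deliberate simplification; all names, the simulated trajectory and the grid values are assumptions of this example.

```python
import numpy as np

def fbm_covariance(times, D, alpha):
    t = np.asarray(times, dtype=float)
    tt, ss = np.meshgrid(t, t, indexing="ij")
    return D * (np.abs(tt)**alpha + np.abs(ss)**alpha - np.abs(tt - ss)**alpha)

def log_posterior(r, times, D, alpha, mu=0.0):
    """Log-posterior of equation (12) under flat priors (times must be > 0)."""
    chol = np.linalg.cholesky(fbm_covariance(times, D, alpha))
    z = np.linalg.solve(chol, r - mu)               # z @ z equals the quadratic form
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))   # log |Sigma|
    return -0.5 * z @ z - 0.5 * log_det - 0.5 * len(r) * np.log(2.0 * np.pi)

# Simulated one-dimensional trajectory with known parameters (illustrative values)
rng = np.random.default_rng(1)
times = 0.5 * np.arange(1, 61)
true_D, true_alpha = 0.5, 0.6
r = np.linalg.cholesky(fbm_covariance(times, true_D, true_alpha)) @ rng.normal(size=times.size)

# Coarse grid search standing in for the simplex optimizer
D_grid = np.linspace(0.1, 1.0, 10)
alpha_grid = np.linspace(0.2, 1.6, 8)
scores = np.array([[log_posterior(r, times, D, a) for a in alpha_grid] for D in D_grid])
i, j = np.unravel_index(np.argmax(scores), scores.shape)
D_map, alpha_map = D_grid[i], alpha_grid[j]
```

For two-dimensional trajectories with independent coordinates, the log-posteriors of x and y would simply be added.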

Incorporation of substrate movement in the GP-FBM framework

In the main text, we introduced an extended GP-FBM model to deal with external sources of movement. Here, we present the derivation for two particles subject to a common substrate movement; however, it can be extended to an arbitrary number of particles using the marginalization rule of multi-variate Gaussian distributions²³. The key idea is to assume that the movement of the particles with respect to the substrate as well as the movement of the substrate itself can be described by independent fractional Brownian motions. Therefore, the probability of observing the particle trajectories a₁ and a₂ with respect to a given frame of reference R that moves with the substrate is:

$$\rho ({{{{{{{{\boldsymbol{a}}}}}}}}}_{{{{{{{{\boldsymbol{1}}}}}}}}},{{{{{{{{\boldsymbol{a}}}}}}}}}_{{{{{{{{\boldsymbol{2}}}}}}}}},{{{{{{{\boldsymbol{R}}}}}}}}| {{{{{{{\boldsymbol{\alpha }}}}}}}},{{{{{{{{\boldsymbol{D}}}}}}}}}_{{{{{{{{\boldsymbol{\alpha }}}}}}}}})\propto \exp \left(-\frac{1}{2}{{{{{{{{\boldsymbol{a}}}}}}}}}_{{{{{{{{\boldsymbol{1}}}}}}}}}^{T}{{{\Sigma }}}_{1}^{-1}{{{{{{{{\boldsymbol{a}}}}}}}}}_{{{{{{{{\boldsymbol{1}}}}}}}}}-\frac{1}{2}{{{{{{{{\boldsymbol{a}}}}}}}}}_{{{{{{{{\boldsymbol{2}}}}}}}}}^{T}{{{\Sigma }}}_{2}^{-1}{{{{{{{{\boldsymbol{a}}}}}}}}}_{{{{{{{{\boldsymbol{2}}}}}}}}}-\frac{1}{2}{{{{{{{{\boldsymbol{R}}}}}}}}}^{T}{{{\Sigma }}}_{R}^{-1}{{{{{{{\boldsymbol{R}}}}}}}}\right)$$

(13)

where Σ₁, Σ₂ and Σ_R are FBM covariance matrices that are fully characterized given the diffusion parameters D_α = {D_α,1, D_α,2, D_α,R} and α = {α₁, α₂, α_R}.

Next, to obtain the probability distribution over the trajectories r₁ and r₂, we applied the change of coordinates r_i = a_i + R (see scheme in Fig. 2a) leading to the matrix expression,

$$\rho ({{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{1}}}}}}}}},{{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{2}}}}}}}}},{{{{{{{\boldsymbol{R}}}}}}}}| {{{{{{{\boldsymbol{\alpha }}}}}}}},{{{{{{{{\boldsymbol{D}}}}}}}}}_{{{{{{{{\boldsymbol{\alpha }}}}}}}}})\propto \exp \left(-\frac{1}{2}{\left(\begin{array}{l}{{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{1}}}}}}}}}\\ {{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{2}}}}}}}}}\\ {{{{{{{\boldsymbol{R}}}}}}}}\\ \end{array}\right)}^{T}\left(\begin{array}{lll}{{{\Sigma }}}_{1}^{-1}&0&-{{{\Sigma }}}_{1}^{-1}\\ 0&{{{\Sigma }}}_{2}^{-1}&-{{{\Sigma }}}_{2}^{-1}\\ -{{{\Sigma }}}_{1}^{-1}&-{{{\Sigma }}}_{2}^{-1}&{{{\Sigma }}}_{1}^{-1}+{{{\Sigma }}}_{2}^{-1}+{{{\Sigma }}}_{R}^{-1}\\ \end{array}\right)\left(\begin{array}{l}{{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{1}}}}}}}}}\\ {{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{2}}}}}}}}}\\ {{{{{{{\boldsymbol{R}}}}}}}}\\ \end{array}\right)\right),$$

(14)

Then, to marginalize the unobserved trajectory of the moving frame R, we need to calculate the inverse of the block matrix in equation (14). To do so, we use results from ref. ⁵⁷ on inverting a 2 × 2 block matrix such as

$${{\Lambda }}=\left(\begin{array}{ll}A&B\\ C&D\\ \end{array}\right)$$

(15)

according to the following result

$${{{\Lambda }}}^{-1}=\left(\begin{array}{ll}{A}^{-1}+{A}^{-1}B{(D-C{A}^{-1}B)}^{-1}C{A}^{-1}&-{A}^{-1}B{(D-C{A}^{-1}B)}^{-1}\\ -{(D-C{A}^{-1}B)}^{-1}C{A}^{-1}&{(D-C{A}^{-1}B)}^{-1}\end{array}\right).$$

(16)

Taking $A=\left[({{{\Sigma }}}_{1}^{-1},\,0);\,(0,\,{{{\Sigma }}}_{2}^{-1})\right]$, $B=-\left[{{{\Sigma }}}_{1}^{-1};\,{{{\Sigma }}}_{2}^{-1}\right]$, C = B^T and $D={{{\Sigma }}}_{1}^{-1}+{{{\Sigma }}}_{2}^{-1}+{{{\Sigma }}}_{R}^{-1}$, it is easily shown that the top-left corner of Λ⁻¹ is given by $\left[({{{\Sigma }}}_{1}+{{{\Sigma }}}_{R},\,{{{\Sigma }}}_{R});\,({{{\Sigma }}}_{R},\,{{{\Sigma }}}_{2}+{{{\Sigma }}}_{R})\right]$. Using this result, we can marginalize R in equation (14) giving the expression,

$$\rho ({{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{1}}}}}}}}},{{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{2}}}}}}}}}| {{{{{{{\boldsymbol{\alpha }}}}}}}},{{{{{{{{\boldsymbol{D}}}}}}}}}_{{{{{{{{\boldsymbol{\alpha }}}}}}}}})\propto {\left|\begin{array}{ll}{{{\Sigma }}}_{1}+{{{\Sigma }}}_{R}&{{{\Sigma }}}_{R}\\ {{{\Sigma }}}_{R}&{{{\Sigma }}}_{2}+{{{\Sigma }}}_{R}\\ \end{array}\right|}^{-1}\exp \left(-\frac{1}{2}{\left(\begin{array}{l}{{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{1}}}}}}}}}\\ {{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{2}}}}}}}}}\\ \end{array}\right)}^{T}{\left(\begin{array}{ll}{{{\Sigma }}}_{1}+{{{\Sigma }}}_{R}&{{{\Sigma }}}_{R}\\ {{{\Sigma }}}_{R}&{{{\Sigma }}}_{2}+{{{\Sigma }}}_{R}\\ \end{array}\right)}^{-1}\left(\begin{array}{l}{{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{1}}}}}}}}}\\ {{{{{{{{\boldsymbol{r}}}}}}}}}_{{{{{{{{\boldsymbol{2}}}}}}}}}\\ \end{array}\right)\right).$$

(17)

This result shows that the substrate movement induces a correlation between the particle trajectories, as the off-diagonal elements of the block matrix in equation (17) are non-zero. In addition, the covariance matrix of the substrate movement also appears in the diagonal terms, increasing the overall variance of the particles and their total movement. Consequently, if this correction is ignored, the diffusion parameters are over-estimated.
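The block-inverse step can be checked numerically. The sketch below builds the precision matrix of equation (14) for a short trajectory, inverts it and compares the top-left block with the covariance matrix of equation (17). Function names and parameter values are illustrative assumptions.

```python
import numpy as np

def fbm_covariance(times, D, alpha):
    t = np.asarray(times, dtype=float)
    tt, ss = np.meshgrid(t, t, indexing="ij")
    return D * (np.abs(tt)**alpha + np.abs(ss)**alpha - np.abs(tt - ss)**alpha)

def marginal_covariance(S1, S2, SR):
    """Covariance of (r1, r2) after marginalizing R, as in equation (17)."""
    return np.block([[S1 + SR, SR], [SR, S2 + SR]])

def joint_precision(S1, S2, SR):
    """Precision matrix of (r1, r2, R) from equation (14)."""
    P1, P2, PR = (np.linalg.inv(S) for S in (S1, S2, SR))
    Z = np.zeros_like(P1)
    return np.block([[P1, Z, -P1], [Z, P2, -P2], [-P1, -P2, P1 + P2 + PR]])

times = 0.5 * np.arange(1, 6)
S1 = fbm_covariance(times, 0.4, 0.5)   # particle 1
S2 = fbm_covariance(times, 0.8, 0.7)   # particle 2
SR = fbm_covariance(times, 0.2, 1.0)   # substrate
n = len(times)
top_left = np.linalg.inv(joint_precision(S1, S2, SR))[:2 * n, :2 * n]
matches = np.allclose(top_left, marginal_covariance(S1, S2, SR))
```

Because the off-diagonal block is Σ_R itself, a model that ignores the substrate would attribute this shared motion to each particle.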

Inference of substrate movement

As before, the diffusion parameters of the particles as well as the substrate can be estimated using equation (17) and Bayesian inference. We can also estimate how the substrate moves. For that, we can calculate the conditional distribution of R given the particle trajectories as

$$\left\langle {\boldsymbol{R}}\right\rangle ={\left({\Sigma }_{1}^{-1}+{\Sigma }_{2}^{-1}+{\Sigma }_{R}^{-1}\right)}^{-1}\left({\Sigma }_{1}^{-1}\,\,{\Sigma }_{2}^{-1}\right)\left(\begin{array}{l}{\boldsymbol{r}}_{1}-{\boldsymbol{\mu }}_{1}\\ {\boldsymbol{r}}_{2}-{\boldsymbol{\mu }}_{2}\end{array}\right)$$

(18)

with co-variance matrix given by ${{{\Sigma }}}_{\left\langle {{{{{{{\boldsymbol{R}}}}}}}}\right\rangle }={\left({{{\Sigma }}}_{1}^{-1}+{{{\Sigma }}}_{2}^{-1}+{{{\Sigma }}}_{R}^{-1}\right)}^{-1}$.

Unfortunately, we could not find an analytical solution for this problem. Nonetheless, we can solve it numerically. In Supplementary Fig. 4.a we show an example of the estimated movement for the substrate with one standard deviation compared to the real simulated trajectory for a system with two particles. In Supplementary Fig. 4.b-c we display the overall accuracy when working with two or more particles.
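For fixed diffusion parameters, the conditional mean and covariance of R follow from standard Gaussian conditioning on the precision matrix of equation (14), whose off-diagonal blocks are −Σ₁⁻¹ and −Σ₂⁻¹. This gives a mean of (Σ₁⁻¹ + Σ₂⁻¹ + Σ_R⁻¹)⁻¹(Σ₁⁻¹r₁ + Σ₂⁻¹r₂) for centred trajectories. The sketch below is a hypothetical illustration with made-up trajectories, not the published implementation.

```python
import numpy as np

def fbm_covariance(times, D, alpha):
    t = np.asarray(times, dtype=float)
    tt, ss = np.meshgrid(t, t, indexing="ij")
    return D * (np.abs(tt)**alpha + np.abs(ss)**alpha - np.abs(tt - ss)**alpha)

def substrate_posterior(r1, r2, S1, S2, SR):
    """Conditional mean and covariance of R given centred trajectories r1 and r2."""
    P1, P2, PR = (np.linalg.inv(S) for S in (S1, S2, SR))
    cov = np.linalg.inv(P1 + P2 + PR)
    mean = cov @ (P1 @ r1 + P2 @ r2)
    return mean, cov

times = 0.5 * np.arange(1, 6)
S1 = fbm_covariance(times, 0.4, 0.5)
S2 = fbm_covariance(times, 0.8, 0.7)
SR = fbm_covariance(times, 0.2, 1.0)
r1 = np.array([0.1, 0.3, 0.2, 0.5, 0.4])   # illustrative, already centred (mu subtracted)
r2 = np.array([0.0, 0.2, 0.3, 0.3, 0.6])
mean_R, cov_R = substrate_posterior(r1, r2, S1, S2, SR)
```

The square roots of the diagonal of `cov_R` give the one-standard-deviation band around the estimated substrate trajectory.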

Benchmarking GP-FBM

Simulated trajectories

We simulated single trajectories with 250 time points using the aforementioned Gaussian process with FBM kernel. To keep the benchmark as general and unbiased as possible, we uniformly sampled values for our parameters in the ranges 0.01 < D_α < 1.5, 0.01 < α < 1.9, 0.1 < dt < 1.0 and 0.001 < σ < 0.25. To benchmark a system of N particles affected by substrate movement, we generated N+1 trajectories and added the last one, representing the substrate, to all the others. Finally, between 0% and 80% of the points, a fraction drawn from a uniform distribution, were removed from each trajectory to simulate experimental occlusions.
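A minimal sketch of such a simulation for one coordinate, assuming that trajectories are drawn through a Cholesky factor of the FBM covariance and that the localization error σ enters as additive Gaussian noise. Both points, together with all names and parameter values, are assumptions of this example.

```python
import numpy as np

def fbm_covariance(times, D, alpha):
    t = np.asarray(times, dtype=float)
    tt, ss = np.meshgrid(t, t, indexing="ij")
    return D * (np.abs(tt)**alpha + np.abs(ss)**alpha - np.abs(tt - ss)**alpha)

def simulate_fbm(times, D, alpha, rng):
    """Draw one FBM trajectory as chol(Sigma) @ standard normal noise."""
    chol = np.linalg.cholesky(fbm_covariance(times, D, alpha))
    return chol @ rng.normal(size=len(times))

def simulate_tracks(n_particles, times, D, alpha, D_sub, alpha_sub, sigma, rng):
    substrate = simulate_fbm(times, D_sub, alpha_sub, rng)
    tracks = np.empty((n_particles, len(times)))
    for k in range(n_particles):
        track = simulate_fbm(times, D, alpha, rng) + substrate   # shared substrate motion
        track += sigma * rng.normal(size=len(times))             # localization error
        n_drop = int(rng.uniform(0.0, 0.8) * len(times))         # remove 0% to 80% of points
        track[rng.choice(len(times), size=n_drop, replace=False)] = np.nan
        tracks[k] = track
    return tracks, substrate

rng = np.random.default_rng(42)
times = 0.5 * np.arange(1, 251)        # 250 time points, dt = 0.5
tracks, substrate = simulate_tracks(2, times, D=0.3, alpha=0.5,
                                    D_sub=0.1, alpha_sub=1.0, sigma=0.05, rng=rng)
```

Removed points are marked as NaN so that the time stamps of the remaining points stay aligned across particles.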

Mean squared displacement (MSD) implementation

To calculate D_α and α for single trajectories, we estimate an MSD using a sliding window method. This method is mathematically defined as follows

$$\left\langle {r}_{n}^{2}\right\rangle =\frac{1}{N-n}\mathop{\sum }\limits_{i=1}^{N-n}{\left({{{{{{{{\boldsymbol{r}}}}}}}}}_{i+n}-{{{{{{{{\boldsymbol{r}}}}}}}}}_{i}\right)}^{2},$$

(19)

for a trajectory with N points and step interval n. Due to implicit correlations present in single trajectories, we use only the initial 10% of step intervals. To improve accuracy, we also estimate an average localization error σ. Finally, this experimental curve is approximated by the theoretical mean squared displacement equation

$$\left\langle {r}^{2}\right\rangle =4{D}_{\alpha }\,{t}^{\alpha }+2{\sigma }^{2},$$

(20)

from which diffusion parameters are inferred using linear regression. For more information, see ref. ⁵⁸.
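Equations (19) and (20) can be sketched as follows. The regression is done here in log-log space after subtracting the localization term 2σ², which is one possible reading of "linear regression" and an assumption of this example, as are the function names.

```python
import numpy as np

def time_averaged_msd(track, max_lag):
    """Sliding-window MSD of equation (19) for a (N, 2) trajectory, lags 1..max_lag."""
    return np.array([np.mean(np.sum((track[n:] - track[:-n])**2, axis=1))
                     for n in range(1, max_lag + 1)])

def fit_msd(msd, dt, sigma):
    """Fit log(MSD - 2 sigma^2) = log(4 D) + alpha log(t); requires MSD > 2 sigma^2."""
    lags = dt * np.arange(1, len(msd) + 1)
    alpha, intercept = np.polyfit(np.log(lags), np.log(msd - 2.0 * sigma**2), 1)
    return np.exp(intercept) / 4.0, alpha

# Deterministic check of equation (19): a particle moving one unit per frame along x
line = np.column_stack([np.arange(4.0), np.zeros(4)])
msd_line = time_averaged_msd(line, max_lag=2)

# Check of the fit on a noise-free theoretical curve, equation (20)
D_true, alpha_true, sigma, dt = 0.3, 0.7, 0.05, 0.5
t = dt * np.arange(1, 26)
D_fit, alpha_fit = fit_msd(4.0 * D_true * t**alpha_true + 2.0 * sigma**2, dt, sigma)
```

For a trajectory with N points, `max_lag` would be set to roughly N // 10, following the 10% rule stated above.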

Displacement distribution based (DDB) implementation

The theoretical expression for the displacement distribution is obtained as a solution of the Fokker-Planck equation with localization error σ. In polar coordinates it takes the form

$$\rho (r,\theta | {D}_{\alpha },\alpha ,t,\sigma )\,drd\theta =\frac{r}{2\pi (2{D}_{\alpha }\,{t}^{\alpha }+{\sigma }^{2})}{e}^{-\frac{{r}^{2}}{4{D}_{\alpha }{t}^{\alpha }+2{\sigma }^{2}}}drd\theta .$$

(21)

To calculate experimental distributions for single cells, we resort to a sliding window method similar to the one presented for the MSD. In this case, however, we calculate normalized histograms of all the absolute displacement values. As before, we calculate an average localization error σ to improve accuracy and use only histograms calculated for the initial 10 step intervals. With these measurements, we optimize the equation above for D_α and α using a Bayesian approach with non-informative priors for both parameters.
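Integrating equation (21) over θ gives a radial density r/s² · exp(−r²/2s²) with s² = 2D_α t^α + σ². The following sketch evaluates this density and checks numerically that it is normalized and that its second moment reproduces equation (20). Names and parameter values are illustrative assumptions.

```python
import numpy as np

def displacement_density(r, t, D, alpha, sigma):
    """Radial displacement density: equation (21) integrated over theta."""
    s2 = 2.0 * D * t**alpha + sigma**2           # variance per coordinate
    return r / s2 * np.exp(-r**2 / (2.0 * s2))

def ddb_log_likelihood(displacements, t, D, alpha, sigma):
    """Sum of log-densities for positive absolute displacements at lag time t."""
    return np.sum(np.log(displacement_density(displacements, t, D, alpha, sigma)))

D, alpha, sigma, t = 0.3, 0.6, 0.05, 1.5
s2 = 2.0 * D * t**alpha + sigma**2
r = np.linspace(0.0, 12.0 * np.sqrt(s2), 20001)
dr = r[1] - r[0]
pdf = displacement_density(r, t, D, alpha, sigma)
total_probability = np.sum(pdf) * dr             # should be close to 1
second_moment = np.sum(r**2 * pdf) * dr          # should be close to 4 D t^alpha + 2 sigma^2
```

In a fit, `ddb_log_likelihood` would be summed over the lag times used and maximized over D_α and α.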

Law of total variance

The law of total variance is used to determine how much of the measured variance comes from within or across samples. Starting off from the law of total expectation:

$$\left\langle x\right\rangle =\int dy\,\left\langle x| y\right\rangle \rho (y)=\left\langle \left\langle x| y\right\rangle \right\rangle ,$$

(22)

we can calculate

$$\left\langle {x}^{2}\right\rangle =\left\langle \,{{\mbox{var}}}\,\left(x| y\right)+{\left\langle x| y\right\rangle }^{2}\right\rangle .$$

(23)

Subtracting ${\left\langle \left\langle x| y\right\rangle \right\rangle }^{2}$ from both sides

$$\left\langle {x}^{2}\right\rangle -{\left\langle x\right\rangle }^{2}=\left\langle \,{{\mbox{var}}}\,\left(x| y\right)\right\rangle +\left\langle {\left\langle x| y\right\rangle }^{2}\right\rangle -{\left\langle \left\langle x| y\right\rangle \right\rangle }^{2}.$$

(24)

Upon algebraic manipulation, we obtain the final result

$$\,{{\mbox{var}}}\,\left(x\right)=\left\langle \,{{\mbox{var}}}\,\left(x| y\right)\right\rangle +\,{{\mbox{var}}}\,\left(\left\langle x| y\right\rangle \right),$$

(25)

which states that the total measured variance in x is the sum of two contributions: the average, over samples y, of the variance of x within each sample, and the variance across samples of the conditional mean $\left\langle x| y\right\rangle$.
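Equation (25) can be checked numerically on synthetic data. In the sketch below, three equally sized groups play the role of the samples y; the numbers are made up for illustration and are not data from this study.

```python
import numpy as np

rng = np.random.default_rng(0)
# Three synthetic samples y of equal size, each with its own mean and spread
samples = np.array([rng.normal(loc=m, scale=s, size=200)
                    for m, s in [(0.0, 1.0), (2.0, 0.5), (5.0, 2.0)]])

total_var = samples.var()                # var(x) over the pooled data
within = samples.var(axis=1).mean()      # <var(x|y)>: average variance within samples
between = samples.mean(axis=1).var()     # var(<x|y>): variance of the sample means
```

With equally sized samples and population variances (NumPy's default `ddof=0`), `total_var` equals `within + between` up to rounding.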

Statistical analysis and reproducibility

Data for interphase and mitotic cells are from two independent experiments, while data for Nocodazole-treated cells are from one experiment. ANCHOR data were accumulated from 11 independent experiments.

To compare diffusive properties or inter-probe distances across different loci or conditions (Figs. 3c, d and 4b–d), we performed Wilcoxon rank sum tests. For inter-probe distances, the distributions of the median distances for each movie were used. For Fig. 4c, d, where fifteen pairwise comparisons are possible, the p-values were corrected for multiple testing with the Benjamini-Hochberg method, and differences were considered statistically significant for p-values below 0.05.
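The Benjamini-Hochberg adjustment can be sketched as follows. The function name and the example p-values are illustrative and are not results from this study.

```python
import numpy as np

def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    p = np.asarray(pvalues, dtype=float)
    m = p.size
    order = np.argsort(p)
    scaled = p[order] * m / np.arange(1, m + 1)          # p_(k) * m / k
    # Enforce monotonicity from the largest p-value downwards
    adjusted_sorted = np.minimum.accumulate(scaled[::-1])[::-1]
    adjusted = np.empty(m)
    adjusted[order] = np.clip(adjusted_sorted, 0.0, 1.0)
    return adjusted

adjusted = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
significant = adjusted < 0.05
```

For fifteen pairwise comparisons, the same function would be applied to the fifteen rank-sum p-values at once.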

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data that support this study are available from the corresponding authors upon reasonable request. Microscopy data with analysis files that support the findings of this study have been deposited in Zenodo and can be accessed with https://doi.org/10.5281/zenodo.5359893⁵⁹, https://doi.org/10.5281/zenodo.5360028⁶⁰ and https://doi.org/10.5281/zenodo.5361054⁶¹. The source data are provided with this paper.

ES Hi-C sequence data from ref. ⁴² were taken from Gene Expression Omnibus (GSE96107 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE96107]), mapped to mm9 and normalized with FAN-C⁶². The normalized submatrix (chr6:51500000-53000000) was then extracted for visualization. ES and neuronal precursor cell H3K27ac and CTCF ChIP-seq data were taken as bigWig files from Gene Expression Omnibus (GSE96107 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE96107] for all except ES H3K27ac, taken from GSE49847 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE49847]) and visualized in R using the package rtracklayer. Source data are provided with this paper.

Code availability

C++ libraries, batch templates and graphical user interface are available at https://github.com/guilmont⁶³.

References

Sexton, T. & Cavalli, G. The role of chromosome domains in shaping the functional genome. Cell 160, 1049–1059 (2015).
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Sexton, T. et al. Three-dimensional folding and functional organization principles of the drosophila genome. Cell 148, 458–472 (2012).
Lupiáñez, D. G. et al. Disruptions of topological chromatin domains cause pathogenic rewiring of gene-enhancer interactions. Cell 161, 1012–1025 (2015).
Naumova, N. et al. Organization of the mitotic chromosome. Science 342, 948–953 (2013).
Zhang, H. et al. Chromatin structure dynamics during the mitosis-to-G1 phase transition. Nature 576, 158–162 (2019).
Germier, T. et al. Real-time imaging of a single gene reveals transcription-initiated local confinement. Biophys. J. 113, 1383–1394 (2017).
Gu, B. et al. Transcription-coupled changes in nuclear mobility of mammalian cis-regulatory elements. Science 359, 1050–1055 (2018).
Li, J. et al. Single-gene imaging links genome topology, promoter-enhancer communication and transcription control. Nat. Struct. Mol. Biol. 27, 1032–1040 (2020).
Mirny, L. et al. How a protein searches for its site on DNA: The mechanism of facilitated diffusion. J. Phys. A: Math Theor. (2009).
Givaty, O. and Levy, Y. Protein sliding along DNA: dynamics and structural characterization. J. Mol. Biol. (2009).
Weber, S. C., Theriot, J. A. & Spakowitz, A. J. Subdiffusive motion of a polymer composed of subdiffusive monomers. Phys. Rev. E 82, 011913 (2010).
Amitai, A. & Holcman, D. Polymer model with long-range interactions: analysis and applications to the chromatin structure. Phys. Rev. E 88, 052604 (2013).
Shinkai, S. et al. Phi-c: deciphering hi-c data into polymer dynamics. NAR genomics Bioinforma. 2, lqaa020 (2020).
Tortora, M. M. C., Salari, H. & Jost, D. Chromosome dynamics during interphase: a biophysical perspective. Curr. Opin. Genet. Dev. 61, 37 – 43 (2020). Genome Architecture and Expression.
Bronstein, I. et al. Transient anomalous diffusion of telomeres in the nucleus of mammalian cells. Phys. Rev. Lett. 103, 018102 (2009).
De Gennes, P. G. Dynamics of entangled polymer solutions. i. the rouse model. Macromolecules 9, 587–593 (1976).
Ferrari, R., Manfroi, A. J. & Young, W. R. Strongly and weakly self-similar diffusion. Phys. D: Nonlinear Phenom. 154, 111–137 (2001).
Hansen, A. S. et al. Robust model-based analysis of single-particle tracking experiments with Spot-On. eLife 7, e33125 (2018).
Chakraborty, I. & Roichman, Y. Disorder-induced fickian, yet non-gaussian diffusion in heterogeneous media. Phys. Rev. Res. 2, 022020 (2020).
Lampo, T. J., Stylianidou, S., Backlund, M. P., Wiggins, P. A. & Spakowitz, A. J. Cytoplasmic RNA-protein particles exhibit non-gaussian subdiffusive behavior. Biophysical J. 112, 532–542 (2017).
Rasmussen, C. E. and Williams, C. K. I. Gaussian processes for machine learning. Adaptive computation and machine learning. MIT Press, (2006).
Murphy, K. P. Machine learning: a probabilistic perspective. MIT press, (2012).
Jeon, J.-H. & Metzler, R. Fractional Brownian motion and motion governed by the fractional Langevin equation in confined geometries. Phys. Rev. E 81, 021103 (2010).
Höfling, F. and Franosch, T. Anomalous transport in the crowded world of biological cells. Reports on Progress in Physics 76(4), (2013).
Burov, S., Jeon, J.-H., Metzler, R. & Barkai, E. Single particle tracking in systems showing anomalous diffusion: the role of weak ergodicity breaking. Phys. Chem. Chem. Phys. 13, 1800–1812 (2011).
Krog, J., Jacobsen, L. H., Lund, F. W., Wüstner, D. & Lomholt, M. A. Bayesian model selection with fractional brownian motion. J. Stat. Mech.: Theory Exp. 2018, 093501 (2018).
Thapa, S., Lomholt, M. A., Krog, J., Cherstvy, A. G. & Metzler, R. Bayesian analysis of single-particle tracking data using the nested-sampling algorithm: maximum-likelihood model selection applied to stochastic-diffusivity data. Phys. Chem. Chem. Phys. 20, 29018–29037 (2018).
Vestergaard, C. L., Blainey, P. C. & Flyvbjerg, H. Optimal estimation of diffusion coefficients from single-particle trajectories. Phys. Rev. E 89, 022726 (2014).
Vagnarelli, P. Mitotic chromosome condensation in vertebrates. Exp. Cell Res. 318, 1435–1441 (2012). Experimental Cell Research Special Review Issue: Chromosome Biology, 2012.
Andrieu, C., De Freitas, N., Doucet, A. & Jordan, M. I. An introduction to MCMC for machine learning. Mach. Learn. 50, 5–43 (2003).
Sun, D., Roth, S. & Black, M. J. A quantitative analysis of current practices in optical flow estimation and the principles behind them. Int. J. Comput. Vis. 106, 115–137 (2014).
Vestergaard, C. L. Optimizing experimental parameters for tracking of diffusing particles. Phys. Rev. E 94, 1–17 (2016).
Ou, H. D., Phan, S., Deerinck, T. J., Thor, A., Ellisman, M. H. & O’Shea, C. C. Chromemt: Visualizing 3d chromatin structure and compaction in interphase and mitotic cells. Science 357(6349) (2017).
Gibcus, J. H. et al. A pathway for mitotic chromosome formation. Science 359(6376) (2018).
Redolfi, J. et al. Damc reveals principles of chromatin folding in vivo without crosslinking and ligation. Nat. Struct. Mol. Biol. 26, 471–480 (2019).
de Chaumont, F. et al. Icy: an open bioimage informatics platform for extended reproducible research. Nat. Methods 9, 690–696 (2012).
Amitai, A., Seeber, A., Gasser, S. M. & Holcman, D. Visualization of chromatin decompaction and break site extrusion as predicted by statistical polymer modeling of single-locus trajectories. Cell Rep. 18, 1200–1214 (2017).
Heun, P., Laroche, T., Shimada, K., Furrer, P. & Gasser, S. M. Chromosome dynamics in the yeast interphase nucleus. Science 294, 2181–2186 (2001).
Saad, H. et al. Dna dynamics during early double-strand break processing revealed by non-intrusive imaging of living cells. PLoS Genet. 10, e1004187 (2014).
Finn, E. H. et al. Extensive heterogeneity and intrinsic variation in spatial genome organization. Cell 176, 1502–1515 (2019).
Bonev, B. et al. Multiscale 3d genome rewiring during mouse neural development. Cell 171, 557–572 (2017).
Yin, Y. et al. Opposing roles for the lncrna haunt and its genomic locus in regulating hoxa gene activation during embryonic stem cell differentiation. Cell Stem Cell 16, 504–516 (2015).
Sanborn, A. L. et al. Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc. Natl Acad. Sci. USA 112, E6456–E6465 (2015).
Fudenberg, G. et al. Formation of chromosomal domains by loop extrusion. Cell Rep. 15, 2038–2049 (2016).
Metzler, R. Gaussianity fair: the riddle of anomalous yet non-gaussian diffusion. Biophys. J. 112, 413 (2017).
Metzler, R., Jeon, J.-H. & Cherstvy, A. G. Non-Brownian diffusion in lipid membranes: experiments and simulations. Biochim. Biophys. Acta Biomembr. 1858, 2451–2467 (2016).
Renner, M., Wang, L., Levi, S., Hennekinne, L. & Triller, A. A simple and powerful analysis of lateral subdiffusion using single particle tracking. Biophys. J. 113, 2452–2463 (2017).
Granik, N. et al. Single-particle diffusion characterization by deep learning. Biophys. J. 117, 185–192 (2019).
Munoz-Gil, G., Garcia-March, M. A., Manzo, C., Martín-Guerrero, J. D. & Lewenstein, M. Single trajectory characterization via machine learning. N. J. Phys. 22, 013010 (2020).
Deluz, C. et al. A role for mitotic bookmarking of SOX2 in pluripotency and differentiation. Genes Dev. (2016).
Festuccia, N. et al. Mitotic binding of Esrrb marks key regulatory regions of the pluripotency network. Nat. Cell Biol. 18, 1139–1148 (2016).
Palozola, K. C. et al. Mitotic transcription and waves of gene reactivation during mitotic exit. Science 358, 119–122 (2017).
Masui, O. et al. Live-cell chromosome dynamics and outcome of X chromosome pairing events during ES cell differentiation. Cell 145, 447–458 (2011).
Nelder, J. A. & Mead, R. A simplex method for function minimization. Computer J. 7, 308–313 (1965).
Mandelbrot, B. B. & Van Ness, J. W. Fractional Brownian motions, fractional noises and applications. SIAM Rev. 10, 422–437 (1968).
Lu, T.-T. & Shiou, S.-H. Inverses of 2 × 2 block matrices. Computers Math. Appl. 43, 119–129 (2002).
Michalet, X. Mean square displacement analysis of single-particle trajectories with localization error: Brownian motion in an isotropic medium. Phys. Rev. E 82, 041914 (2010).
Oliveira, G. M. et al. Precise measurements of chromatin diffusion dynamics by modeling using Gaussian processes (Inter-Mito) [Data set]. Zenodo https://doi.org/10.5281/zenodo.5359893 (2021).
Oliveira, G. M. et al. Precise measurements of chromatin diffusion dynamics by modeling using Gaussian processes (T1T2 Anchor) [Data set]. Zenodo https://doi.org/10.5281/zenodo.5360028 (2021).
Oliveira, G. M. et al. Precise measurements of chromatin diffusion dynamics by modeling using Gaussian processes (T2T3 Anchor) [Data set]. Zenodo https://doi.org/10.5281/zenodo.5361054 (2021).
Kruse, K., Hug, C. B. & Vaquerizas, J. M. FAN-C: a feature-rich framework for the analysis and visualisation of chromosome conformation capture data. Genome Biol. 21, 1–19 (2020).
Oliveira, G. M. et al. Precise measurements of chromatin diffusion dynamics by modeling using Gaussian processes (Code) [Data set]. Zenodo https://doi.org/10.5281/zenodo.5503470 (2021).


Acknowledgements

We thank Luca Giorgetti for providing the TetO ES line and for critical reading of the manuscript. This work was made possible by funding from LabEx INRT (ANR-10-LABX-0030-INRT, a French State fund managed by the Agence Nationale de la Recherche under the frame program Investissements d’Avenir ANR-10-IDEX-0002-02), CNRS «Osez l’interdisciplinarité !», ERC (Starting Grant 678624 - CHROMTOPOLOGY) and ATIP-Avenir. The microscopy was performed at the Imaging Center of the IGBMC.

Author information

Authors and Affiliations

Institute of Genetics and Molecular and Cellular Biology (IGBMC) CNRS UMR7104, INSERM U1258, University of Strasbourg, Illkirch, France
Guilherme M. Oliveira, Attila Oravecz, Dominique Kobi, Manon Maroquenne, Tom Sexton & Nacho Molina
Molecular Cellular and Developmental Biology unit (MCD), Centre de Biologie Integrative (CBI) UPS, CNRS, Toulouse, France
Kerstin Bystricky

Authors

Guilherme M. Oliveira
Attila Oravecz
Dominique Kobi
Manon Maroquenne
Kerstin Bystricky
Tom Sexton
Nacho Molina

Contributions

G.M.O. developed the mathematical models, algorithms, and GP-Tool, and performed the overall data analysis. D.K. generated the ANCHOR cell lines. A.O., D.K., and M.M. performed experiments and validations. K.B. provided ANCHOR constructs and consulted on ANCHOR experiments. T.S. and N.M. conceived, designed, and supervised the study. G.M.O., T.S., and N.M. wrote the manuscript with input from all other authors.

Corresponding authors

Correspondence to Guilherme M. Oliveira, Tom Sexton or Nacho Molina.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Ralf Metzler and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Oliveira, G.M., Oravecz, A., Kobi, D. et al. Precise measurements of chromatin diffusion dynamics by modeling using Gaussian processes. Nat Commun 12, 6184 (2021). https://doi.org/10.1038/s41467-021-26466-7

Received: 21 March 2021
Accepted: 07 October 2021
Published: 26 October 2021
DOI: https://doi.org/10.1038/s41467-021-26466-7
