Abstract
Condensation of hundreds of mega-base-pair-long human chromosomes in a small nuclear volume is a spectacular biological phenomenon. This process is driven by the formation of chromosome loops. The ATP consuming motor, condensin, interacts with chromatin segments to actively extrude loops. Motivated by real-time imaging of loop extrusion (LE), we created an analytically solvable model, predicting the LE velocity and step size distribution as a function of external load. The theory fits the available experimental data quantitatively, and suggests that condensin must undergo a large conformational change, induced by ATP binding, bringing distant parts of the motor to proximity. Simulations using a simple model confirm that the motor transitions between an open and a closed state in order to extrude loops by a scrunching mechanism, similar to that proposed in DNA bubble formation during bacterial transcription. Changes in the orientation of the motor domains are transmitted over ~50 nm, connecting the motor head and the hinge, thus providing an allosteric basis for LE.
Similar content being viewed by others
Introduction
How chromosomes are structurally organized in the tight space of the nucleus is a long-standing problem in biology. Remarkably, these information-carrying polymers in humans with more than 100 million base pairs, are densely packed in the 5−10 μm cell nucleus1,2. In order to accomplish this herculean feat, nature has evolved a family of structural maintenance of chromosomes (SMC) complexes3,4 (bacterial SMC, cohesin, and condensin) to enable large scale compaction of chromosomes in both prokaryotic and eukaryotic systems. Compaction is thought to occur by an active generation of a large array of loops, which are envisioned to form by extrusion of the genomic material5,6,7 driven by ATP-consuming motors. The SMC complexes have been identified as a major component of the loop extrusion (LE) process3,4.
Of interest here is condensin, whose motor activity8, results in active extrusion of loops in an ATP-dependent manner9. Let us first describe the architecture of condensin, shown schematically in Fig. 1. Condensin is a ring-shaped dimeric motor, containing a pair of SMC proteins (Smc2 and Smc4). Both Smc2 and Smc4, which have coiled-coil (CC) structures, are connected at the hinge domain. The ATP binding domains are in the motor heads4,10. There are kinks roughly in the middle of the CCs10. The relative flexibility in the elbow region (located near the kinks) could be the key to the conformational transitions in the CC that are powered by ATP binding and hydrolysis4,11.
Previous studies using simulations6,12,13, which were built on the pioneering insights by Nasmyth5, suggested that multiple condensins translocate along the chromosome extruding loops of increasing length. In this mechanism, the two condensin heads move away from each other extruding loops in a symmetric manner. Cooperative action of many condensins14 might be necessary to account for the ~(1000−10,000) fold compaction of human chromosomes15. The only other theoretical study that predicts LE velocity as a function of an external load16 is based on a four-state stochastic kinetic model, with minimally twenty parameters, for the catalytic cycle of the condensin that is coupled to loop extrusion16. In sharp contrast, by focusing on the motor activity of condensin through ATP-driven allosteric changes in the enzyme, our theory and simulations support "scrunching" as a plausible mechanism for loop extrusion. Scrunching is reminiscent of the proposal made over a decade ago in the context of the first stage in bacterial transcription that results in bubble formation in promoter DNA17, which was quantitatively affirmed using molecular simulations18. Recently, the scrunching mechanism was proposed to explain loop extrusion19, which is fully supported by theory and simulations presented here.
We were inspired by the real-time imaging of LE in λ-DNA by a single condensin9, which functions by extruding loops asymmetrically. To describe the experimental outcomes quantitatively, we created a simple analytically solvable theory, with two parameters, that produces excellent agreement with experiments for the LE velocity as a function of external load. We also quantitatively reproduce the distribution of LE length per cycle measured using magnetic tweezer experiments20. The theory and simulations show that for LE to occur there has to be an ATP-powered allosteric transition in condensin involving a large conformational change that brings distant parts (head and the hinge in Fig. 1) of the motor to proximity. We predict that, on an average, the distance between the head and the hinge decreases by conformational ~(22−26) nm per catalytic cycle. These values are in remarkably close to the experimentally inferred values9. Simulations using a simple model, with and without DNA, lend support to our findings. Our work strongly suggests that the conformational transitions are driven by a scrunching mechanism in which the motor is relatively stationary, but DNA is reeled in by an allosteric mechanism.
Results
Model description
In order to develop a model applicable to condensin (and cohesin), we assume that condensin is attached to two loci (A and B) on the DNA (Fig. 1; right panel). Although, we do not explicitly describe the nature of the attachment points our model is based on the idea of scrunching motion, where two distant ends of condensin move closer upon conformational change, triggered by ATP binding. For example, the green and blue sphere may be mapped onto motor heads and hinge, respectively. The structure of condensin-DNA complex in the LE active form is currently unavailable. However, cryo-EM structures for the related cohesin-DNA complex21 reveal that DNA is tightly gripped by the two heads of cohesin and the subunits (NIPBL and RAD21). When the results of structural studies are integrated with the observation that the hinge domain of the SMC complexes binds to DNA22,23,24, we conclude that both condensin and cohesin must use a similar mechanism to engage with DNA. The head domains in these motors interact with the DNA segment that is in proximity whereas DNA binds only transiently to the hinge. We constructed the model in Fig. 1 based in part on these findings.
In state 1, the spatial distance between the condensin attachment points is, R1, and the genomic length between A and B is L1. Due to the polymeric nature of the DNA, the captured length L1 could exceed R1. However, R1 cannot be greater than the overall dimension of the SMC motor, which is on the order of ~50 nm. Once a segment in the DNA is captured, condensin undergoes a conformational change driven most likely by ATP binding20, shrinking the distance from R1 to R2 (where R2 < R1). As a result, the captured genomic length between A and B reduces to L2 (state 2). Consequently, the loop grows by L1 − L2. The step size of condensin is ΔR = R1 − R2, and extrusion length per step is ΔL = L1 − L2. After the extrusion is completed one end of condensin (blue circle in Fig. 1; right panel) is released from the genome segment and starts the DNA capturing process again, likely mediated by diffusion leading to the next LE cycle.
Theory for the captured length (L) of DNA
In order to derive an expression for the loop extrusion velocity, we first estimate the loop length of DNA, L, captured by condensin when the attachment points are spatially separated by R. We show that on the length scale of the size of condensin (~50 nm), it is reasonable to approximate L ≈ R. To calculate the LE velocity it is necessary to estimate the total work done to extrude DNA with and without an external load. Based on these considerations, we derive an expression for the LE velocity, given by \({k}_{0}\exp (-f{{\Delta }}R/{k}_{{{{{{\rm{B}}}}}}}T){{\Delta }}R\), where k0 is the rate of mechanical step in the absence of the external load (f), kB is Boltzmann constant, and T is temperature.
Distribution of captured DNA length without tension, P(L∣R)
We examined the possibility that the loop extrusion length per step can be considerably larger than the size of condensin8,9,10,25 by calculating, P(L∣R), the conditional probability for realizing the contour length L for a given end-to-end distance, R. We calculated P(R∣L) using a mean-field theory that is an excellent approximation26 to the exact but complicated expression27. The expression for P(L∣R), which has the same form as P(R∣L), up to a normalization constant, is given by (Sec.III in SI)
where t{L} = 3L/2lp, lp is the persistence length of the polymer, and \(N\{L\}=\frac{4{\alpha }^{3/2}{e}^{\alpha }}{{\pi }^{3/2}(4+12{\alpha }^{-1}+15{\alpha }^{-2})}\) with α{L} = 3t/4. In Eq. (1), A is a normalization constant that does not depend on L where integration range for L is from R to ∞. The distribution P(L∣R), which scales as L−3/2 for large L, has a heavy tail and does not have a well defined mean (see Fig. 2a for the plots of P(L∣R) for different R). The existence of long-tail in the distribution P(L∣R) already suggests that condensin, in principle, could capture DNA segment much larger than its size ~50 nm. However, this can only happen with lower probability compared to a more probable scenario where R ≈ L near the position of the peak in P(L∣R) (see Fig. 2a). Therefore, we evaluated the location of the peak (Lpeak) in P(L∣R), and solved the resulting equation numerically. The dependence of Lpeak on R, which is almost linear (Fig. 2a), is well fit using \({L}_{{{{{{\rm{peak}}}}}}}=R\exp (aR)\) with a = 0.003 nm−1 for R < 60 nm. Thus, with negligible corrections, we used the approximation L ≈ R on the length scales corresponding to the size of condensin or the DNA persistence length. Indeed, the location of the largest probability is at L ≈ lp ≈ R (lp is the persistence length of polymer), which is similar to what was found for proteins28 as well. The presence of f would stretch the DNA, in turn decrease the length of DNA that condensin captures, further justifying the assumption (L ≈ R). Therefore, we conclude that R1 ≈ L1 and R2 ≈ L2 (note that \({R}_{2} \; < \; {R}_{1}\;\lesssim\; {l}_{{{{{{\rm{p}}}}}}}^{{{{{{\rm{DNA}}}}}}}\); \({l}_{{{{{{\rm{p}}}}}}}^{{{{{{\rm{DNA}}}}}}}\) is the persistence length of DNA), and that LE of DNA loop that is much larger than the size of condensin is less likely. Thus, we expect that the extrusion length of DNA is nearly equal to the step size of condensin, ΔR ≈ ΔL.
Force-dependent distribution, P(L∣R, f), of the captured DNA length
Following the steps described above, we write the end-to-end distribution of semi-flexible polymer under tension f, P(R, f∣L), as \(P(R,f| L)=BP(R| L){e}^{{{{{{\rm{fR}}}}}}/{{{{{{\rm{k}}}}}}}_{{{{{{\rm{B}}}}}}}{{{{{\rm{T}}}}}}}\), where B is normalization constant. Thus, P(L∣R, f) is obtained using,
where C is a normalization constant that does not depend on L. The constant B{L} for P(R, f∣L), which carries the L dependence, prevents us from deriving an analytically tractable expression for P(L∣R, f). We find that, for a sufficiently stiff polymer, (R/lp ≲ 1), P(L∣R, f) is well approximated by,
where f0 = 1/7 pN, t{L} = 3L/2lp, lp is the persistence length of the polymer, and \({N}^{2}\{L\}={\left(\frac{4{\alpha }^{3/2}{e}^{\alpha }}{{\pi }^{3/2}(4\,+\,12{\alpha }^{-1}\,+\,15{\alpha }^{-2})}\right)}^{2}\) with α{L} = 3t/4. The constant D does not depend on L in the integration range. Note that on the scale of condensin DNA is relatively rigid (\(R\;\lesssim\; {l}_{{{{{{\rm{p}}}}}}}^{{{{{{\rm{DNA}}}}}}} \sim 50\ {{{{{{{\rm{nm}}}}}}}}\)). Probability Eqs. 2 and 3 for different values of f in Fig. 2 show that the agreement between PA(L∣R) and P(L∣R, f) is good. Therefore, in what follows we use Eq. (3). The distributions for f > 0 in Fig. 2 show that the position of the peak for P(L∣R, f) do not change over the range of f of interest. However, the height of the peak increases as f increases accompanied by the shrinking of the tail for large L.
Condensin converts chemical energy into mechanical work for LE
Just like other motors, condensin hydrolyzes ATP, generating μ ≈ 20 kBT chemical energy that is converted into mechanical work, which in this case results in extrusion of a DNA loop9. To derive an expression for LE velocity, we calculated the thermodynamic work required for LE. The required work W modulates the rate of the mechanical process by the exponential factor \(\exp (-W/{k}_{{{{{{\rm{B}}}}}}}T)\). In our model, W has two contributions. The first is the work needed to extrude the DNA at f = 0 (W0). Condensin extrudes the loop by decreasing the spatial distance between the attachment points from R = R1 to R = R2 (Fig. 1). The associated genomic length of DNA that has to be deformed is LΣ = L0 + L1. The second contribution is Wf, which comes by applying an external load. Condensin resists f up to a threshold value9, which may be thought of as the stall force. The mechanical work done when condensin takes a step, ΔR = R1 − R2, is Wf = fΔR.
We calculated W as the free energy change needed to bring a semi-flexible polymer with contour length LΣ, from the end-to-end distance R1 to R2. It can be estimated using the relation, \(W\approx -{k}_{{{{{{\rm{B}}}}}}}T\;{{{{{{\mathrm{log}}}}}}}\,\left(P({R}_{2},f| {L}_{{{\Sigma }}})\right)+{k}_{{{{{{\rm{B}}}}}}}T\;{{{{{{\mathrm{log}}}}}}}\,\left(P({R}_{1},f| {L}_{{{\Sigma }}})\right)\), where P(R,f∣L) is given by \(P(R,f| L)=BP(R| L){e}^{fR/{k}_{{{{{{\rm{B}}}}}}}T}\) where B is a normalization constant. Although P(R,f∣L) is a distribution, implying that there is a distribution for W, for illustrative purposes, we plot W in Fig. 3 for a fixed R1 = 40 nm and R2 = 14 nm corresponding to ΔR = 26 nm as estimated using our theory to experiment in later section. It is evident that condensin has to overcome the highest bending penalty in the first step of extrusion, and subsequently W is essentially a constant at large LΣ. Note that when f = 0 (red line in Fig. 3), W = W0 because the Wf term vanishes. If R1 = 40 nm, which is approximately the size of the condensin motor, we estimate that condensin pays 5kBT to initiate the extrusion process without tension (red line in Fig. 3).
Once the energetic costs for LE are known, we can calculate the LE velocity as a function of an external load applied to condensin. From energy conservation, we obtain the equality, nμ = W0 + Wf + Q, where n is the number of ATP molecules consumed per mechanical step, μ is the energy released by ATP hydrolysis, and Q is the heat dissipated during the extrusion process. The maximum force is obtained at equilibrium when the equality nμ = W0 + Wf{fmax} holds. If we denote the rate of mechanical transition as k+ and reverse rate as k−, fluctuation theorem29,30,31 together with conservation of energy gives the following relation:
Note that condensin operates in non-equilibrium by transitioning from state 1 to 2, which requires input of energy. The load dependent form for the above equation may be written as,
where \({k}_{0}={k}^{-}{e}^{(n\mu -{W}_{0})/{k}_{{{{{{\rm{B}}}}}}}T}\) is the rate of the mechanical transition at 0 load. Thus, with ΔR being the extruded length per reaction cycle, the velocity of LE, Ω, may be written as,
It is worth stating that k0 is the chemical energy dependent term, which includes μ, depends on the nucleotide concentration. In order to obtain the ATP dependence, we assume the Michaelis–Menten form for k0, \(\frac{{\hat{k}}_{0}\left[T\right]}{\left[T\right]+{K}_{{{{{{\rm{M}}}}}}}}\), where [T] is the concentration of ATP, \({\hat{k}}_{0}\) is the maximum rate at saturating ATP concentration, and KM is the Michaelis–Menten constant (see Fig. 4b). In principle, KM should be determined from the measurement of LE velocity as a function of [T]. Here, we use KM = 0.4mM obtained as Michaelis–Menten constant for ATP hydrolysis rate8, assuming that ATP hydrolysis rate is the rate-limiting step in the loop extrusion process.
In order to calculate Ω as a function of the relative DNA extension, x, we use the expression32,33,
The dimensionless variable, x, is the f-dependent relative extension. In the Ganji et al.9 experiment x is measured, and the f-dependence of Ω is obtained by expressing x in terms of f using a numerical procedure.
Analysis of experimental data
Loop extrusion velocity
We used Eq. (6) to fit the experimentally measured LE velocity as a function of DNA extension9. The two fitting parameters are ΔR, and k0, the average step size for condensin, and the extrusion rate at f = 0, respectively. Excellent fit of theory to experiments, especially considering the dispersion in the data, gives k0 = 20 s−1 and ΔR = 26 nm. This indicates that condensin undergoes a conformational change that brings the head and the hinge to within ΔR ~ 26 nm (~76 bps), during each extrusion cycle. This prediction is remarkably close to the value measured in the recent AFM experiment ~22 nm19, and is further supported by our simulations (see below). We note that k0 = 20 s−1 is roughly ten times greater than the bulk hydrolysis rate estimated from ensemble experiments8,9. A plausible reason for the apparent discrepancy, already provided in the experimental studies8,9, is that bulk hydrolysis rate could underestimate the true hydrolysis rate due to the presence of inactive condensins. Thus, the estimated rate of k0 = 2 s−1 should be viewed as a lower bound9. Another possible reason for the discrepancy may be due to methods used to estimate k0 in previous experiments8,9. It is clear that additional experiments are needed to obtain better estimates of the hydrolysis rate, which is almost all theories is seldom estimated. In Fig. 4a right panel we compare the dependence of Ω on f obtained in a previous kinetic model that has in excess of 20 parameters16. In contrast to our theory, even the shape of the LE velocity for Ω versus f does not agree with experiment. In addition, there is a major discrepancy (factor of 2–3) between the predicted and the measured values of Ω at low force.
LE length distribution
Recently Ryu et al.20 measured the distribution of LE length per step using magnetic tweezers. We calculated the LE distribution using Eq. (1) and Eq. (3). Of interest here is the distribution for L1 − L2 where L1 is the captured length of DNA in open shape (see Fig. 1; right panel). We use the length of condensin in the open state as R1 = 40 nm (roughly the peak in the distance between the head and the hinge in the O shape in the wild type condensin), and assume that ΔR = 26 nm during a single catalytic cycle, as theoretically calculated in the previous section. This gives the length of the closed state, R2 = 14 nm. Previous experiments19 reported significant fluctuations in the size of the open and closed states, leading to the standard deviation for the change between these two states to be Δ = 13 nm. For simplicity, we include the standard deviation for the conformational change in the open state R1 ± Δ and keep R2 fixed. This is justified because as R decreases (R2 < R1), not only does the peak in P(L∣R) and PA(L∣R,f) moves to small L but also there is a decrease in the fluctuation (width of the distribution is smaller) (see Fig. 2). This suggests that the variance of the extrusion length owing to R2 is negligible. We assume that the distribution of R1 is a Gaussian centered at 40 nm with a standard deviation of 13 nm. It is reasonable to approximate L2 ≈ R2 since \({R}_{2}=14\ {{{{{{{\rm{nm}}}}}}}} \; < \; {l}_{{{{{{\rm{p}}}}}}}^{{{{{{\rm{DNA}}}}}}}=50\ {{{{{{{\rm{nm}}}}}}}}\). With these parameters in hand, the captured DNA length, Eq. 1 and 3, directly lead to an expression for the distributions for LE length per step. The probability for the LE length per step (ΔL) is, P(ΔL) = P(L + R2∣R1) for f = 0 and P(ΔL∣f) = PA(L + R2∣R1,f) for f > 0.
We compare in Fig. 4 the theoretically calculated distribution for f = 0 pN from Eq. (1) and f = 0.4 pN from Eq. (3) with experiments. The distribution for f = 0 pN cannot be measured using magnetic tweezers but provides insights into the range of LE length that condensin could take at f = 0. Remarkably, the calculated distribution is in excellent agreement with the experimental data20.
Plausible conformational change of condensin in LE process
Simulations without DNA
Next we tested whether the predicted value of ΔR ~ 26 nm, which is in fair agreement with the experiment, is reasonable using simulations of a simple model. Because the ATPase domains are located at the heads of condensin, it is natural to assume that the head domain undergoes conformational transitions upon ATP binding and/or hydrolysis. Images of the CCs of the yeast condensin (Smc2–Smc4) using liquid atomic force microscopy (AFM) show they could adopt a few distinct shapes19,34. Based on these experiments, we hypothesize that the conformational changes initiated at the head domain result in changes in the angle at the junction connecting the motor head to the CC that propagates through the whole condensin via the CC by an allosteric mechanism35. The open (O-shaped in ref. 19), with the hinge that is ≈40 nm away from the motor domain, and the closed (B-shape19) in which the hinge domain is in proximity to the motor domain are the two relevant allosteric states for LE19,34. To capture the reaction cycle (O → B → O), we model the CCs as kinked semi-flexible polymers (two moderately stiff segments connected by a flexible elbow), generalizing a similar description of stepping of Myosin V on actin36. By altering the angle between the two heads the allosteric transition between the open (O-shaped) and closed (B-shaped) states could be simulated (SI contains the details).
We tracked the head-hinge distance (R) changes during the transition from the open (R = R1) to the closed state (R = R2) in order to calculate the distribution of ΔRs = R1 − R2. The sample trajectory in Fig. 5a, monitoring the conformational transition between the two states, shows that ΔRs changes by ~22 nm for the persistence length of condensin (\({l}_{{{{{{\rm{p}}}}}}}^{{{{{{\rm{CC}}}}}}}\); see Sec. II in SI for the detail) \({l}_{{{{{{\rm{p}}}}}}}^{{{{{{\rm{CC}}}}}}}=24\ {{{{{{{\rm{nm}}}}}}}}\), which roughly coincides with the value extracted by fitting the theory to the experimental data. Higher (smaller) values of ΔRs may be obtained using larger (smaller) values of \({l}_{{{{{{\rm{p}}}}}}}^{{{{{{\rm{CC}}}}}}}\) (Supplementary Fig. 3 in SI). The distributions, P(R), calculated from multiple trajectories (Fig. 5b) are broad, suggestive of high degree of conformational heterogeneity in the structural transition between the open and closed states. The large dispersions found in the simulations is in surprisingly excellent agreement with experiments19, which report that the distance between the peaks is 22 ± 13 nm. We find that the corresponding value is 22 ± 9 nm. The uncertainty is calculated using standard deviation in the distributions. Overall the simulations not only provide insight into the physical basis of the theory but also lend support to recent single molecule experiments19.
Simulations with DNA
The purpose of the simulations discussed in the previous section was to assess whether the allosteric mechanism produces a structural rationale for the value of ΔR ~ 26 nm extracted from the theory. We also created a simple model of condensin with DNA to give additional insights into the DNA-capture mechanism, which is directly related to the extrusion length of DNA per step. We assume that the capture length of DNA by condensin, L, is governed by diffusion of the hinge domain, and that the L is solely determined by the semi-flexible polymer nature of DNA. We attached one end of the DNA to the heads of condensin. The other end of DNA diffuses freely during the simulations. We define the DNA "capture" event by the distance between a DNA segment and the condensin hinge, with a cut-off length of 4 nm: if the distance is less than 4 nm we assume that condensin captures the DNA segment. Captured DNA length is the contour length of DNA held between the heads and the hinge. We used a coarse-grained bead-spring model for DNA26,37. Each bead represents ten base-pairs, which implies that the bead size is σDNA = 3.4 nm. The chain has N = 100 beads or 1000 base-pairs. The simulation model for DNA is in the SI (Sec. I). The simulations with DNA explain an interesting aspect of the DNA capture process. In contrast to well-studied molecular motors that take a step that is nearly constant (conventional kinesin and myosin V) and walks on rigid linear track (microtubule and actin filament, respectively), condensin could in principle capture variable length of DNA during each catalytic cycle because it is a flexible polymer unlike microtubule. Figure 5d shows that there is a finite probability that L exceeds the position of the peak by a considerable amount, as predicted in Fig. 2a. This implies that the L1 can be as large as (60–100) nm (~(180–290)bps), which allows for condensin to extrude substantial length of DNA in each catalytic cycle (see the distributions P(ΔL) in Fig. 4).
Our results show that theory and simulations for the LE velocity [Eq. (6)] predicts the extent of the conformational change of condensin during the LE process fairly accurately (~26 nm in theory and ~22 nm in the experiment19 and simulations) and gives the distributions of LE length per cycle in good agreement with experiment. Although our theory is in good agreement with experiments for load-dependent LE velocity and distribution of step sizes, the calculated persistence length (24 nm) of the SMC coils is much higher than the value (~3.8 nm) estimated from analyses of AFM images in combination with simulations using the worm-like chain model34. It is possible that due to interaction between condensin and other proteins (Brn1 for example) could constrain the head movement, and thus stiffen the coiled-coil. However, the discrepancy between theoretical predictions and simulations is too large to be explained by such effects. Despite performing many simulations using a variety of polymer models, the origin of this discrepancy is unclear. Simulations with flexible coiled-coil do not reproduce the measurements of quantities such as ΔR, that are directly monitored in single molecule experiments19 (see Fig. S3). We believe that additional experiments and simulations based on higher resolution structures in different nucleotide states are required to close the gap.
Discussion
Connection to experiments
Two of the most insightful experimental studies19,20 have reported the mean LE velocity and step size distribution as a function of f. Because these are direct single molecule measurements that have caught the motor in the act of LE, the results are unambiguous, requiring little or no interpretation. Minimally any viable theory must account for these measurements as quantitatively as possible, using only a small number of physically meaningful parameters. To our knowledge, our theory is currently the only one that reproduces the experimental observations accurately with just two parameters. The only other theory16 that reported LE velocity as a function of force, not only has a large number of parameters but also it cannot be used to calculate the step size distribution.
We first showed that the calculated LE velocity could be fit to experimental data in order to extract the hydrolysis rate and the mean step size. To provide a physical interpretation of the theoretically predicted mean step size, we performed polymer based simulations using a simple model of the CCs. The model for the simulations was based on the AFM images19, which showed that, during loop extrusion, there is a transition between the O shape (head and the hinge are far apart) to the B shape where they are closer. Remarkably, our simulations capture the distributions of the head-hinge distances in the O and B states well without any fitting parameters. The mean distance between the peaks (ΔRs ≈ 22 nm) in the simulations is in excellent agreement with measured value (see Fig. 2c in ref. 19). It is worth emphasizing that our theoretical fit to experiment yielded ΔR ~ 26 nm, which is also in very good agreement with ~22 nm measured in the high speed AFM imaging experiment19. Thus, both experiments and simulations support the mechanism that repeated O → B shape transitions result in extrusion of the DNA loops.
Relation to a previous study
Recently, a four state chemical kinetic model16, similar to the ones used to interpret experiments on stepping of myosin and kinesin motors on polar tracks (actin and microtubule)31,38, was introduced in order to calculate the f-dependent LE velocity and the loop size. The agreement between the predicted dependence on LE velocity and loop size as a function of f is not satisfactory (see Fig. 4a). Apart from the very large number of parameters (about twenty one in the simplified version of the theory16), our two parameter theory differs from the previous study in other important ways. (1) The model16 is apparently based on the rod-like (or I shape) X-ray structure of the prokaryotic CC of the SMC dimer10, which was pieced together by joining several segments of the CC. However, the theory itself does not incorporate any structural information but is based on a number of rates connecting the four assumed states in the reaction cycle of the motor, and energetics associated with the isolated DNA. (2) Because the previous purely kinetic model16 does not explicitly consider the structure of condensin, it implies that an allosteric communication between the hinge and the head—an integral part of our theory and observed in AFM experiments34, is not even considered for the LE mechanism16. The lack of conformational changes in response to ATP-binding implies that the substantial decrease in the head-hinge distance by about ~22 nm observed in AFM imaging experiments cannot be explained, as was noted previously19. (3) In the picture underlying the DNA capture model16 (referred to as the DNA pumping model elsewhere19), the distance between the head and the hinge changes very little, if at all. Such a scenario is explicitly ruled out in an experimental study by Ryu et. al.19 in part because they seldom observe the I shape in the holocomplex by itself or in association with DNA. For this reason, we believe that the mechanism proposed in the recent simulation study39 is unlikely to be viable. Rather, it is the O ↔ B transition that drives the loop extrusion process, as found in experiments19, and affirmed here using our simulations.
Structural basis for LE
The paucity of structures for condensin and cohesin in distinct nucleotide bound states makes it difficult to interpret experiments, theory and simulations in molecular terms. The situation is further exacerbated because even the biochemical reaction cycle of condensin (or the related motor cohesin) has not been determined. A recent 8.1 Å structure of the yeast condensin holocomplex in both the apo non-engaged state (the one in which the regulatory element, YCS4, bring the heads in proximity), and the apo-bridged state in which the heads interact with each other show a sharp turn in the elbow region, resembling an inverted letter J in the representation in Fig. 1 in a recent study40. The functional importance of the inverted J state is unclear because when condensin is active (extruding loops in an ATP-dependent manner) only the O-shaped and B-shaped structures are apparently observed19. Furthermore, the structure of the related motor, cohesin, with DNA shows that the heads are bound to DNA with the hinge is in proximity21, which is inconceivable if the CCs adopt only the I shape. For this reason, we compared our simulations directly with experiments that have measured the distance changes between the head and the hinge during the active LE process19.
The partially resolved structure of the ATP-bound state shows a large opening of the CC near the heads40, suggestive of an allosterically driven conformational change. In contrast, the structure of only the prokaryotic SMC coiled-coil at 3.2 Å resolution, which was created by piecing together several fragments in the CC, showed that it adopts the I shape. As noted elsewhere19, the I-shaped structure is almost never observed in yeast condensin during its function. The paucity of structures prevents any meaningful inclusion of structural details in theory and simulations. It is for this reason, we resorted to comparisons to AFM imaging data and results from single molecule magnetic tweezer experiments, which have caught the yeast condensin as it executes its function, in validating our theory. After all it is the function that matters. In the SI (Sec. VI), we performed structural alignment and normal mode analysis using the partially available cryo-EM structures40 in order to capture the possible conformational transition in the apo state that is poised to transition to the LE active state. Even using only partially resolved structures, the normal mode analyses show that ATP binding induces a substantial opening in the CC region that interacts with the head domains. This preliminary analysis does suggest that for loop extrusion to occur there has to be an allosteric mechanism that brings the head and hinge of the motor close to each other spatially.
Scrunching versus translocation
Using a combination of simulations based on a simple model and theory, we have proposed that LE occurs by a scrunching mechanism. The crux of the scrunching mechanism is that the motor heads, once bound to the DNA, are relatively stationary. Extrusion of the loop occurs by the change in the distance between the head and the hinge by about ~22 nm. This conformational change is likely driven by ATP binding to condensin20. As a result, the head reels in the DNA, with the mean length that could be as large as ~100 nm, although the most probable value is ≈(25−40) nm depending on the external load (Fig. 4c, d). Recent experiments have suggested that LE occurs by a scrunching mechanism20, although it was (as stated earlier) proposed in the context of DNA bubble formation17, which is the initial stage in bacterial transcription. The near quantitative agreement with experiments for load-dependent LE velocity, and step size distribution shows that the theory and the mechanism are self-consistent.
In contrast, the other mechanism is based on the picture that condensin must translocate along the DNA in order to extrude loops15. Elsewhere9 it is argued that the translocation observed in the experiment is an artifact due to salt and buffer conditions, thus casting doubt on the motor translocating along DNA.
Directionality of LE
Directionality is a vital characteristic of biological molecular machines with SMC proteins being no exception. In our model (Fig. 1), the unidirectional LE arises during two stages. One is the DNA capture process and the other is the actual active loop extrusion (State 1 to State 2 in Fig. 1). Directional loop extrusion emerges as a result of binding of the SMCs and the associated subunits to DNA via anisotropic interactions. Indeed, the structure of cohesin21 suggests that the subunit (STAG1) interacts with the hinge of cohesin by interactions that are anisotropic. This implies that the directional LE could be set by the very act of binding of the condensin motor heads to DNA, which poises the hinge to preferentially interact with DNA downstream of the motor head, we show schematically as point A in Fig. 1. This in turn ensures that the probability of capture of DNA resulting in ΔL < 0 is minimized.
In addition, in our theory there is an asymmetry in the expressions for k+ and k− in Eq. (4) arising from the free energy, μ, due to the ATP hydrolysis. This process leads to a decrease in the head-hinge distance, as seem in the liquid AFM images of condensin in action. However, we believe that, with small probability, ΔL < 0 could arise because of the slippage of DNA from the extruded DNA by the strong resistive force on the motor20. This situation is reminiscent of the slippage of kinesin on microtubule under resistive load41. In the absence of any opposing force, the probability that ΔL < 0 is likely to be small.
Brownian ratchet and power-stroke
Whether SMC proteins employ Brownian ratchet mechanism42 or power stroke43 is an important question that has to be answered to fully elucidate the molecular mechanism of LE. A recent study proposed Brownian ratchet model for cohesin44. An analogy to the well-studied motor conventional kinesin-1 (Kin1), which walks towards the plus end of the stiff microtubule, is useful. It could be argued that Kin1 makes use of both power stroke and biased diffusion. Previous studies45,46 showed that power stoke (neck-linker docking) propels the trailing head of Kin1 only by ≈ 5–6 nm forward, which creates a strong bias to the next binding site. The rest of the step (≈6–8 nm) is completed by diffusion. Therefore, it is possible that both power stroke (sets the directionality) and Brownian ratchet are not mutually exclusive. Both the mechanisms could play a role in the LE process as well. Further structural, experimental, and computational studies are required to resolve fully the interplay between these mechanisms in LE.
Conclusion
We conclude with a few additional remarks. (1) We focused only on one-sided loop extrusion (asymmetric process) scenario for a single condensin, as demonstrated in the in vitro experiment9. Whether symmetric LE could occur when more than one condensin loads onto DNA producing Z-loop structures14, and if the LE mechanism depends on the species47 is yet to be settled. Similar issues likely exist in loop extrusion mediated by cohesins48,49. We believe that our work, which only relies on the polymer characteristics of DNA and on an allosteric (action at a distance) mechanism for loop extrusion, provides a framework for theoretical investigation of LE, accounting for different scenarios. (2) The f dependence of LE velocity allows us to estimate the time scale for compacting the whole genome. In particular, if the loop extrusion velocity at f = 0 is taken to be ~1 kbp/s, we can calculate the LE time using the following assumptions. The number of condensin I and condensin II that are likely bound to DNA is ~3000 and ~500, respectively50. Therefore, the loops in the entire chromosome1 (~250 Mbps) could be extruded in a few minutes with the motors operating independently. The assumption that the motors operate independently is reasonable because the linear density (number of motors per genomic base pair) of the bound motors is low. A similar estimate has been made for loop extrusion time by cohesin in the G1 phase of HeLa cells49. These times are faster than the time needed to complete mitosis (~an hour)51. (3) Finally, if LE occurs by scrunching, as gleaned from simulations, and advocated through experimental studies19, it would imply that the location of the motor is relatively fixed on the DNA and the loop is extruded by ATP-driven shape transitions in the coiled-coils.
Methods
Analysis of experimental data
For a given L, we sampled a Gaussian distribution for R1 and generated 500 R1 values. This procedure yields P(L∣R1), allowing us to calculate the mean R1. We repeated this procedure for different L values. A similar procedure was use when f ≠ 0. The step size distribution, P(ΔL), is \(P({{\Delta }}L)={P}{\left(\right.}L+{R}_{2}| {R}_{1}\left.\right)\) where \({P}{\left(\right.}L+{R}_{2}| {R}_{1}\left.\right)\) is given in Eq. (1). Similarly, for f ≠ 0, the step size distribution is P(ΔL∣f) = PA(L + R2∣R1,f) where PA(L + R2∣R1,f) is given in Eq. (3). The generated discrete distributions are smoothed by box moving average. The Mathematica notebook containing the demonstration of the main results is available, see “Code availability” section. The notebook contains: LE velocity (Ω), Ω as a function of ATP concentration, P(L∣R), P(L∣R,f), P(ΔL), and P(ΔL∣f).
Experimental data are digitized using Webplotdigitizer52. All the analysis for the experimental LE data is conducted using Wolfram Mathematica 1253.
MD simulations
In order to study the O–B transition, we performed coarse-grained MD simulations as detailed in the supplementary information. The integration timestep was set to ΔtL = 0.01τL, where \({\tau }_{{{{{{\rm{L}}}}}}}=0.4\sqrt{m{\sigma }^{2}/{k}_{{{{{{\rm{B}}}}}}}T}\). For simplicity, we set mass m = 1, σ = 1 and kBT = 1. We simulated 20,000 steps with Condensin in either B or O conformation. We repeated these simulations 50 times to ensure that the calculations have converged. The simulations were performed using OpenMM ver754. The initialization script was written in-house in python 3.7 and has been uploaded to Github (see “Code availability” section). The Hamiltonian used for the simulation and parameters are given in the SI. We used MDTraj 1.9.555 for the analysis of the trajectories. We visualized the trajectories using VMD 1.9.356. For sequence and structural alignment we used MultiSeq57.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The data that support this study are available from the corresponding author upon reasonable request. The trajectory data from MD simulation generated have been deposited at https://bit.ly/3yfFRKg. The digitized experimental data9,20,58 are available at https://github.com/kibidanngo/Condensin-Analysis_Experiments. In the SI, we used crystal structure for condensin, 6YVU and 6YVD, for normal mode analysis. The PDB structures are provided with this paper40. Source data are provided with this paper.
Code availability
Molecular dynamics code for condensin simulations is deposited to Github: https://github.com/biofizzatreya/Condensin. Mathematica notebook for our theoretical results to analyze LE experiments is available https://github.com/kibidanngo/Condensin-Analysis_Experiments.
References
Flemming, W. Zellsubstanz, kern und zelltheilung (Vogel, 1882).
Alberts, B. et al. Essential Cell Biology (Garland Science, 2013).
Hagstrom, K. A. & Meyer, B. J. Condensin and cohesin: more than chromosome compactor and glue. Nat. Rev. Genet. 4, 520–534 (2003).
Yatskevich, S., Rhodes, J. & Nasmyth, K. Organization of chromosomal DNA by SMC complexes. Annu. Rev. Genet. 53, 445–482 (2019).
Nasmyth, K. Disseminating the genome: joining, resolving, and separating sister chromatids during mitosis and meiosis. Annu. Rev. Genet. 35, 673–745 (2001).
Fudenberg, G. et al. Formation of chromosomal domains by loop extrusion. Cell Rep. 15, 2038–2049 (2016).
Sanborn, A. L. et al. Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc. Natl Acad. Sci. 112, E6456–E6465 (2015).
Terakawa, T. et al. The condensin complex is a mechanochemical motor that translocates along DNA. Science 358, 672–676 (2017).
Ganji, M. et al. Real-time imaging of DNA loop extrusion by condensin. Science 360, 102–105 (2018).
Diebold-Durand, M.-L. et al. Structure of full-length SMC and rearrangements required for chromosome organization. Mol. Cell 67, 334–347 (2017).
Buermann, F. et al. A folded conformation of MukBEF and cohesin. Nat. Struct. Mol. Biol. 26, 227 (2019).
Alipour, E. & Marko, J. F. Self-organization of domain structures by DNA-loop-extruding enzymes. Nucleic Acids Res. 40, 11202–11212 (2012).
Goloborodko, A., Imakaev, M. V., Marko, J. F. & Mirny, L. Compaction and segregation of sister chromatids via active loop extrusion. Elife 5, e14864 (2016).
Kim, E., Kerssemakers, J., Shaltiel, I., Haering, C. & Dekker, C. DNA-loop extruding condensin complexes can traverse one another. Biophys. J. 118, 380a (2020).
Banigan, E. J. & Mirny, L. A. Limits of chromosome compaction by loop-extruding motors. Phys. Rev. X 9, 031007 (2019).
Marko, J. F., De Los Rios, P., Barducci, A. & Gruber, S. DNA-segment-capture model for loop extrusion by structural maintenance of chromosome (SMC) protein complexes. Nucleic Acids Res. 47, 6956–6972 (2019).
Kapanidis, A. N. et al. Initial transcription by RNA polymerase proceeds through a DNA-scrunching mechanism. Science 314, 1144–1147 (2006).
Chen, J., Darst, S. A. & Thirumalai, D. Promoter melting triggered by bacterial RNA polymerase occurs in three steps. Proc. Natl Acad. Sci. 107, 12523–12528 (2010).
Ryu, J. -K. et al. The condensin holocomplex cycles dynamically between open and collapsed states. Nat. Struct. Mol. Biol. 27, 1–8 (2020).
Ryu, J.-K., Rah, S.-H., Janissen, R., Kerssemakers, J. W. & Dekker, C. Resolving the step size in condensin-driven DNA loop extrusion identifies ATP binding as the step-generating process. Available at SSRN 3728949 (2020).
Shi, Z., Gao, H., Bai, X.-c. & Yu, H. Cryo-em structure of the human cohesin-NIPBL-DNA complex. Science 368, 1454–1459 (2020).
Chiu, A., Revenkova, E. & Jessberger, R. Dna interaction and dimerization of eukaryotic smc hinge domains. J. Biol. Chem. 279, 26233–26242 (2004).
Griese, J. J., Witte, G. & Hopfner, K.-P. Structure and dna binding activity of the mouse condensin hinge domain highlight common and diverse features of smc proteins. Nucleic Acids Res. 38, 3454–3465 (2010).
Alt, A. et al. Specialized interfaces of smc5/6 control hinge stability and dna association. Nat. Commun. 8, 1–14 (2017).
Lawrimore, J., Friedman, B., Doshi, A. & Bloom, K. Cold Spring Harbor Symposia on Quantitative Biology, Vol. 82, 101–109 (Cold Spring Harbor Laboratory Press, 2017).
Hyeon, C. & Thirumalai, D. Kinetics of interior loop formation in semiflexible chains. J. Chem. Phys. 124, 104905 (2006).
Wilhelm, J. & Frey, E. Radial distribution function of semiflexible polymers. Phys. Rev. Lett. 77, 2581 (1996).
Thirumalai, D. Time scales for the formation of the most probable tertiary contacts in proteins with applications to cytochrome c. J. Phys. Chem. B 103, 608–610 (1999).
Seifert, U. Fluctuation theorem for a single enzym or molecular motor. EPL 70, 36 (2005).
Seifert, U. Stochastic thermodynamics, fluctuation theorems and molecular machines. Rep. Prog. Phys. 75, 126001 (2012).
Mugnai, M. L., Hyeon, C., Hinczewski, M. & Thirumalai, D. Theoretical perspectives on biological machines. Rev. Mod. Phys. 92, 025001 (2020).
Marko, J. F. & Siggia, E. D. Stretching DNA. Macromolecules 28, 8759–8770 (1995).
Rubinstein, M. et al. Polymer Physics, Vol. 23 (Oxford University Press, 2003).
Eeftens, J. M. et al. Condensin Smc2-Smc4 dimers are flexible and dynamic. Cell Rep. 14, 1813–1818 (2016).
Muir, K. W., Li, Y., Weis, F. & Panne, D. The structure of the cohesin ATPase elucidates the mechanism of SMC–kleisin ring opening. Nat. Struct. Mol. Biol. 27, 233–239 (2020).
Hinczewski, M., Tehver, R. & Thirumalai, D. Design principles governing the motility of myosin V. Proc. Natl Acad. Sci. 110, E4059–E4068 (2013).
Dey, A. & Reddy, G. Toroidal condensates by semiflexible polymer chains: Insights into nucleation, growth and packing defects. J. Phys. Chem. B 121, 9291–9301 (2017).
Kolomeisky, A. B. & Fisher, M. E. Molecular motors: a theorist’s perspective. Annu. Rev. Phys. Chem. 58, 675–695 (2007).
Nomidis, S. K., Carlon, E., Gruber, S. & Marko, J. F. DNA tension-modulated translocation and loop extrusion by SMC complexes revealed by molecular dynamics simulations. Preprint at bioRxiv (2021).
Lee, B.-G. et al. Cryo-em structures of holo condensin reveal a subunit flip-flop mechanism. Nat. Struct. Mol. Biol. 27, 743–751 (2020).
Sudhakar, S. et al. Germanium nanospheres for ultraresolution picotensiometry of kinesin motors. Science 371, eabd9944 (2021).
Astumian, R. D. Thermodynamics and kinetics of a brownian motor. Science 276, 917–922 (1997).
Sindelar, C. V. et al. Two conformations in the human kinesin power stroke defined by x-ray crystallography and epr spectroscopy. Nat. Struct. Biol. 9, 844–848 (2002).
Higashi, T. L., Tang, M., Pobegalov, G., Uhlmann, F. & Molodtsov, M. A brownian ratchet model for dna loop extrusion by the cohesin complex. eLife 10, e67530 https://doi.org/10.7554/eLife.67530 (2021).
Zhang, Z., Goldtzvik, Y. & Thirumalai, D. Parsing the roles of neck-linker docking and tethered head diffusion in the stepping dynamics of kinesin. Proc. Natl Acad. Sci. 114, E9838–E9845 (2017).
Zhang, Z. & Thirumalai, D. Dissecting the kinematics of the kinesin step. Structure 20, 628–640 (2012).
Kong, M. et al. Human condensin I and II drive extensive ATP-dependent compaction of nucleosome-bound DNA. Mol. Cell 79, 99–114 (2020).
Kim, Y., Shi, Z., Zhang, H., Finkelstein, I. J. & Yu, H. Human cohesin compacts DNA by loop extrusion. Science 366, 1345–1349 (2019).
Davidson, I. F. et al. DNA loop extrusion by human cohesin. Science 366, 1338–1345 (2019).
Walther, N. et al. A quantitative map of human condensins provides new insights into mitotic chromosome architecture. J. Cell Biol. 217, 2309–2328 (2018).
Gibcus, J. H. et al. A pathway for mitotic chromosome formation. Science 359, eaao6135 (2018).
Rohatgi, A. Webplotdigitizer: Version 4.5 (2021).
Inc., W. R. Mathematica, Version 12.3.1. Champaign, IL, 2021.
Eastman, P. et al. Openmm 7: rapid development of high performance algorithms for molecular dynamics. PLoS Comput. Biol. 13, e1005659 (2017).
McGibbon, R. T. et al. Mdtraj: a modern open library for the analysis of molecular dynamics trajectories. Biophys. J. 109, 1528 – 1532 (2015).
Humphrey, W., Dalke, A. & Schulten, K. Vmd: visual molecular dynamics. J. Mol. Graph. 14, 33–38 (1996).
Roberts, E., Eargle, J., Wright, D. & Luthey-Schulten, Z. Multiseq: unifying sequence and structure data for evolutionary analysis. BMC Bioinform. 7, 382 (2006).
Strick, T. R., Kawaguchi, T. & Hirano, T. Real-time detection of single-molecule DNA compaction by condensin I. Curr. Biol. 14, 874–880 (2004).
Acknowledgements
We thank Rasika Harshey, Changbong Hyeon, Mauro Mugnai, and Johannes Stigler for useful comments and discussions. We are grateful to John Marko for clarifying some aspects of his model. This work was supported by NSF (CHE 19-00093), NIH (GM - 107703) and the Welch Foundation Grant F-0019 through the Collie-Welch chair.
Author information
Authors and Affiliations
Contributions
R.T. and D.T. conceived the theory, R.T., A.D., G.S. and D.T. performed the calculations and simulations, the experiment(s), R.T., A.D., G. S. and D.T. analyzed the results. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Takaki, R., Dey, A., Shi, G. et al. Theory and simulations of condensin mediated loop extrusion in DNA. Nat Commun 12, 5865 (2021). https://doi.org/10.1038/s41467-021-26167-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-021-26167-1
This article is cited by
-
Single cohesin molecules generate force by two distinct mechanisms
Nature Communications (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.