The in vivo measurement of replication fork velocity and pausing by lag-time analysis

Huang, Dean; Johnson, Anna E.; Sim, Brandon S.; Lo, Teresa W.; Merrikh, Houra; Wiggins, Paul A.

doi:10.1038/s41467-023-37456-2

Download PDF

Article
Open access
Published: 30 March 2023

The in vivo measurement of replication fork velocity and pausing by lag-time analysis

Nature Communications volume 14, Article number: 1762 (2023) Cite this article

2704 Accesses
5 Citations
21 Altmetric
Metrics details

Subjects

Abstract

An important step towards understanding the mechanistic basis of the central dogma is the quantitative characterization of the dynamics of nucleic-acid-bound molecular motors in the context of the living cell. To capture these dynamics, we develop lag-time analysis, a method for measuring in vivo dynamics. Using this approach, we provide quantitative locus-specific measurements of fork velocity, in units of kilobases per second, as well as replisome pause durations, some with the precision of seconds. The measured fork velocity is observed to be both locus and time dependent, even in wild-type cells. In this work, we quantitatively characterize known phenomena, detect brief, locus-specific pauses at ribosomal DNA loci in wild-type cells, and observe temporal fork velocity oscillations in three highly-divergent bacterial species.

Genome-wide mapping of individual replication fork velocities using nanopore sequencing

Article Open access 08 June 2022

High-throughput analysis of single human cells reveals the complex nature of DNA replication timing control

Article Open access 03 May 2022

Monitoring genome-wide replication fork directionality by Okazaki fragment sequencing in mammalian cells

Article 13 January 2021

Introduction

At a single-molecule scale, all cellular processes are both highly stochastic as well as subject to a crowded cellular environment where they typically compete with a large number of potentially antagonistic processes that share the same substrate^1,2. In spite of these challenges, essential processes must be robust at a cellular scale to facilitate efficient cellular proliferation. Understanding how these processes are regulated to achieve robustness remains an important and outstanding biological question^{3,4,5,6,7,8,9}. However, a central challenge in investigating these questions is the quantitative characterization of the activity of enzymes in the context of the living cell. For instance, although single-molecule assays can resolve the pausing of molecular motors on nucleic-acid substrates in the context of in vitro measurements^10,11, performing analogous measurements in the physiologically-relevant environment of the cell poses a severe challenge to the existing methodologies¹².

In this paper, we develop an approach, lag-time analysis, that facilitates the quantitative characterization of dynamics, with resolution of seconds, in the context of the living cell. The approach exploits exponential growth as the stopwatch to capture dynamics in exponentially proliferating cellular cultures¹³ and unlike competing approaches, it can circumvent the difficulties and potential artifacts introduced by cell synchronization¹⁴ or fluorescent labeling. Lag-time analysis exploits the same data as marker-frequency analysis, but it directly measures the locus-specific fork velocity, in units of kilobases per second, and the duration of replisome pauses in seconds. Lag-time analysis facilitates detailed comparisons to be made, not just between different loci in a single cell, but between wild-type and mutant cells as well as between bacterial species. Unlike a recent competing analysis, no detailed stochastic models or simulations are employed¹⁵. We apply this approach to analyze three model bacterial systems: Bacillus subtilis, Vibrio cholerae, and Escherichia coli. In B. subtilis, we analyze transcription-induced replication antagonism which is the main determinant of replisome dynamics in a set of mutants with retrograde (reverse-oriented) fork motion. An analysis of V. cholerae provides evidence that fork number is an important determinant of fork velocity, but also provides clear evidence that fork velocity is time dependent. To explore this time-dependence, we analyze the fork velocity in E. coli which provides strong evidence for temporally oscillating fork velocity, consistent with a recent report¹⁵. Finally, we demonstrate that these oscillations are observed in all three organisms. In summary, the observed phenomena demonstrate the central importance of characterizing central dogma processes in the context of the living cell, where their activity is regulated and modulated by the cellular environment.

Results

The bacterial cell cycle

The bacterial cell cycle is divided into three periods^16,17: The B period is analogous to the G₁ phase of the eukaryotic cell cycle, corresponding to the period between cell birth and replication initiation. The C period is analogous to the S phase (and early M phase) in which the genome is replicated and simultaneously and sequentially segregated¹⁸. The D period is analogous to a combination of phases G₂ and late-M, corresponding to a period of time between replication termination and cell division, including the process of septation (i.e., cytokinesis).

The demographics of cell-cycle periods of exponentially growing bacterial cells were first quantitatively modeled by Cooper and Helmstetter in an influential paper¹⁹ and then refined by multiple authors^20,21,22. In the Methods Section, we generalize these models to demonstrate that marker-frequency analysis quantitatively measures the cell-cycle replication dynamics. The key results are summarized below.

Lag-time analysis

Our strategy will be to use exponential growth as the stopwatch with which we resolve cell-cycle dynamics. In short, cells with greater cell-cycle progression (i.e., age) are depleted in the population, equivalent to an independent, exponentially proliferating species that lags newborn cells by a time equal to its age¹³ (see Fig. 1 for a schematic illustration of the approach). Lag-time analysis is the measurement of this time lag. In principle, this approach can be applied to characterize the dynamics of any biological molecules or complexes; however, for concreteness, we will focus on replication dynamics. This process is of great biological interest and next-generation sequencing provides a powerful tool for digital, as well as genome-wide, quantitation of the number of genomic loci.

In marker-frequency experiments, the number of each sequence N(ℓ) in a steady-state, asynchronously growing population is determined by mapping next-generation-sequencing reads to the reference genome. This marker frequency can be reinterpreted as a measurement of the lag time τ(ℓ):

$$\tau (\ell )=\frac{1}{{k}_{G}}\ln \frac{{N}_{0}}{N(\ell )},$$

(1)

where N(ℓ) is the observed number of the locus at genomic position ℓ, N₀ is the observed number of the origin in the culture, and k_G is the growth rate. This relation can be understood as a consequence of the exponential growth law¹³.

In a deterministically timed model, the measured lag time would be equal to the replication time relative to initiation. In reality, the timing of all processes in the cell cycle is stochastic. We previously showed that the measured lag time is related to the distribution of durations in single cells by the exponential mean¹³:

$${\tau }_{i}\equiv -\frac{1}{{k}_{G}}\ln {{\mathbb{E}}}_{t}\exp (-{k}_{G}t),$$

(2)

where ${{\mathbb{E}}}_{t}$ is the expectation over stochastic time t with distribution t ~ p_i( ⋅ ).

Determination of replisome-pause durations

Replisome-pause durations or the lag time difference between the replication of any two loci can be computed using the difference of lag times between the two loci:

$${{\Delta }}{\tau }_{ij}\equiv {\tau }_{j}-{\tau }_{i}=\frac{1}{{k}_{G}}\ln \frac{N({\ell }_{i})}{N({\ell }_{j})}.$$

(3)

We emphasize that the observed difference in lag time is the exponential mean of the stochastic time difference, which has important consequences for slow processes.

Determination of the fork velocity

For fast processes, like single-nucleotide incorporation, the exponential mean leads to a negligible correction (see Methods); therefore, the fork velocity has a simple interpretation: it is the slope of the genomic position versus lag-time curve:

$$v(\ell )\equiv \frac{{{{{{{{\rm{d}}}}}}}}\ell }{{{{{{{{\rm{d}}}}}}}}\tau }=\frac{{k}_{G}}{\alpha (\ell )},$$

(4)

or equivalently it is the ratio of the growth rate to the log-slope:

$$\alpha (\ell )\equiv -\frac{{{{{{{{\rm{d}}}}}}}}}{{{{{{{{\rm{d\ell }}}}}}}}}\ln N(\ell ),$$

(5)

which can be directly determined from the marker frequency.

Lag-time analysis reveals V. cholerae replication dynamics

To explore the application of lag-time analysis to characterize replication dynamics, we begin our analysis in the bacterial model system Vibrio cholerae, which harbors two chromosomes: Chromosome 1 (Chr1) is 2.9 Mb and Chromosome 2 (Chr2) is 1.1 Mb. The origin of Chr1, oriC1, fires first and roughly the first half of replication is completed before the replication-initiator-RctB-binding-site crtS is replicated, triggering Chr2 initiation at oriC2^23,24,25. Chr1 and Chr2 then replicate concurrently for the rest of the C period (see Fig. 2a).

**Fig. 2: Replication fork dynamics in *V. cholerae*.**

To demonstrate the power of lag-time analysis, we compute the marker frequency, lag time, and fork velocities. To measure pause times and replication velocities, we generate a piecewise linear model with a resolution set by the Akaike Information Criterion (AIC). The AIC-optimal model for fast growth (in LB) had 39 knots, spaced by 100 kb, generating 38 measurements of locus velocity across the two chromosomes. The replication dynamics for growth in LB is shown in Fig. 2. For tabulated velocities, see Supplementary Data 1.

The measurement of the duration of fast processes

We focus first on the duration of time between crtS replication and the initiation of oriC2. Fluorescence microscopy imaging reveals that this wait time is very short²⁶, but it is very difficult to quantify since, the precise timing of the replication of the crtS sequence is difficult to determine by fluorescence imaging; however, this is a natural application for lag-time analysis. To measure the difference in lag time between crtS replication and oriC2 replication, we use Equation (3) to compute the replication time difference from the relative copy numbers. For this analysis, we generate a piecewise linear model with knots at the crtS and oriC2 loci. The measured lag time is

$${{\Delta }}{\tau }_{{{{{{{{\rm{pause}}}}}}}}}=3.5\pm 0.1\,{{\mbox{min}}}\,,$$

(6)

a pause duration which is clearly resolved in the lag-time plot shown in Fig. 2e.

The fork velocity is locus dependent

It is qualitatively clear from the fork-distance-versus-time plot (Fig. 2e) that the fork velocity is locus dependent, since the trajectory is not straight. To test this question statistically, we compare the 39-knot model to the null hypothesis (constant fork velocity), which is rejected with a p-value of p ≪ 10⁻³⁰ and therefore the data cannot be described by a single fork velocity (see Table 1). The resulting velocity profiles are shown in Fig. 2c, f.

Table 1 Fork number and velocities under different growth conditions

Full size table

Bilateral symmetry supports a time-dependent mechanism

Our understanding of the replication process motivated two general classes of mechanisms: (i) time-dependent and (ii) locus-dependent mechanisms. Time-dependent mechanisms, like a dNTP-limited replication rate, affect all forks uniformly and therefore loci equidistant from the origin should have identical fork velocities:

$$v(\ell )=v(-\ell ),$$

(7)

where ℓ is the genetic position relative to the origin. In contrast, in a locus-dependent mechanism, like replication-conflict-induced slowdowns, the slow regions are expected to be randomly distributed over the chromosome. In this scenario we expect to see no bilateral symmetry between arms (see Methods).

A bilateral symmetry between the arms is clearly evident in the data (the mirror symmetry about the origin in Fig. 2b, c and is manifest in the lag-time analysis as the coincidence between the left and right arm trajectories and velocities in Fig. 2e, f. To quantitate this symmetry, we divide the variance of the fork velocity into symmetric and antisymmetric contributions (see Supplementary Method 4). A time-dependent mechanism would generate a f_S = 100% symmetric variance, whereas a locus-dependent mechanism would be expected to generate equal symmetric and antisymmetric variance contributions (f_S = 50%). V. cholerae Chr1 and Chr2 have f_S = 76% symmetry, consistent with a time-dependent mechanism playing a dominant but not exclusive role in determining the fork velocity (see Table 1).

The replisome pauses briefly at rDNA in B. subtilis

To explore the possibility that locus-dependent mechanisms could play a dominant role in determining the fork velocity profile, we next characterized the fork dynamics in the context of replication conflicts, where the antagonism between active transcription and replication, have been reported to stall the replisome by a locus-specific mechanism^9,27. In B. subtilis, there are seven highly transcribed rDNA loci on the right arm and only a single locus of the left arm. Consistent with the notion of rDNA-induced pausing, the ter locus is positioned asymmetrically on the genome, at 172° rather than 180°, leading the right arm of the chromosome to be shorter than the left arm (see Fig. 3a). In spite of the difference in length, both arms terminate roughly synchronously, implying that the average fork velocity is lower on the right arm, consistent with putative fork pausing at the rDNA loci. Are these conflict-induced pauses present in wild-type cells where the replication and transcription are co-directional? We have previously reported evidence based on single-molecule imaging that they are¹², but there is as of yet no other unambiguous supporting evidence.

**Fig. 3: *B. subtilis* fork dynamics and transcriptional conflicts.**

To detect putative short pauses at the rDNA loci in wild-type B. subtilis, a low-noise dataset was essential. We therefore examined a number of different datasets, including our own, to search for a dataset with the lowest statistical and systematic noise. A marker-frequency dataset for a nearly wild-type strain growing on minimal media was identified for which the noise level was extremely low (see Supplementary Method 3B). The lag-time analysis is shown in Fig. 3b. Replication pauses should result in discrete steps in the lag time; however, no clearly defined steps are visible in the lag-time plot. The pauses are either absent or too small to be clearly visible without statistical analysis.

To achieve optimal statistical resolution, we used the AIC model-selection framework^28,29 on four competing hypotheses: In Model 1, fork velocities are constant and equal on both arms with no pauses. In Model 2, fork velocities are constant but unequal on the left and right arms with no pauses. In Model 3, fork velocities are constant and equal on the left and right arms with equal-duration pauses at each rDNA locus. In Model 4, fork velocities are constant and unequal on the left and right arms with equal-duration pauses at each rDNA locus. AIC selected Model 3 (equal arm velocities with rDNA pauses) and a pause duration of:

$${{\Delta }}{\tau }_{{{{{{{{\rm{pause}}}}}}}}}=17\pm 8\,{{\mbox{s}}}\,,$$

(8)

is observed. The pause models were strongly supported over the non-pause models (ΔAIC₂₃ = 4.3 and ΔAIC₄₃ = 9.4). Therefore, statistical analysis supports the existence of short slowdowns (i.e., pauses) at the rDNA, even if these features are not directly observable without statistical analysis. In higher-noise datasets, the statistical inference was ambiguous.

Strong, head-on conflicts lead to long pauses

Although we have just demonstrated that endogenous co-directional conflicts are detected statistically, they do not lead to a clear unambiguous signature. In contrast, strong, exogenous head-on conflicts in which the replisome and transcriptional machinery move in opposite directions can lead to particularly potent conflicts and even cell death^{3,4,5,6,7,8,9,30}. The ability to engineer conflicts at specific loci facilitates the use of lag-time analysis for measuring the duration of the replication pauses.

To measure the pause durations due to head-on conflicts, we analyze the marker frequency from a strain, rrnIHG(inv), generated by Srivatsan and coworkers with three rDNA genes (rrnIHG) inverted so that they are transcribed in the head-on orientation. Marker-frequency datasets were reported for this strain in two growth conditions: minimal supplemented with casamino acids, in which the strain grows at an intermediate growth rate, and unsupplemented minimal media, in which the strain grows at a slow growth rate³¹. (Mutant cells cannot proliferate in rich media, presumably because the transcription conflicts are so severe³¹.) In both slow and intermediate growth conditions, a clearly resolved step at the head-on locus is observed in the marker-frequency and lag-time analysis (Fig. 3b), exactly analogous to the simulated pause (see Methods).

To determine the pause durations in the two growth conditions, we again consider a model with an unknown pause duration (at the inverted rDNA locus) and constant but unequal fork velocities on the left and right arms. The observed lag-time pauses are

$${{\Delta }}{\tau }_{{{{{{{{\rm{pause}}}}}}}}}=\left\{\begin{array}{l}3.3\pm 0.7\,{{\mbox{min (slow)}}}\hfill\,\quad \\ 9.7\pm 0.9\,{{\mbox{min (intermediate)}}} \end{array}\right.,$$

(9)

for the slow and intermediate growth rates, respectively.

Although lag-time analysis reports a precise pause duration, it is important to remember that the observed lag time corresponds to the exponential mean of the stochastic state lifetime, Equation (2), including cells that arrest and therefore never complete the replication process. Equation (18) accounts for the pause generated by this arrested cell fraction. Srivatsan and coworkers report that 10% of the cells are arrested in intermediate growth, which accounts for 8.3 min of the lag time, leaving an estimated pause time of Δτ_pause = 1.4 ± 0.9 min for non-arrested cells, which is roughly consistent with the pause time observed in slow growth conditions.

Slow retrograde replication in B. subtilis

Are all conflict-induced slowdowns consistent with long pauses at a small number of rDNA loci? Wang et al. have previously engineered a head-on strain, 257°::oriC, with less severe conflicts by moving oriC down the left arm of the chromosome to 257°³² (see Fig. 3a). The resulting strain has a very short left arm and a very long right arm, the first third of which is replicated in the retrograde (i.e., reverse to wild-type) orientation. This retrograde region contains only a single rDNA locus. All other regions are replicated in the antegrade (i.e., endogenous) orientation.

Consistent with the analysis of Wang et al., we position knots to divide the chromosome into three regions with three distinct slopes: an early antegrade region A1 (the short left arm) with log-slope α_A1 = 0.34 ± 0.01 Mb⁻¹, a retrograde region R with log-slope α_R = 0.63 ± 0.01 Mb⁻¹ and a late antegrade region A2 with log-slope α_A2 = 0.26 ± 0.01 Mb⁻¹, that replicates after the left arm terminates (see Fig. 3c). Due to the higher percentage of head-on genes in the R region compared with the A1 region, the conflict model predicts more rapid replication in region A1 versus R. Consistent with this prediction, the ratio of replication velocities is:

$${v}_{A1}/{v}_{R}=1.84\pm 0.4,$$

(10)

revealing a strong replication-direction dependence. The slope appears relatively constant, consistent with a model of uniformly-distributed slow regions rather than a small number of long pauses as observed in the reversal of the rDNA locus rrnIHG. Our quantitative analysis is consistent with the interpretation of Wang et al.³².

Rapid late replication due to genomic asymmetry

This dataset has a striking feature that is not emphasized in previous reports. Late antegrade fork velocity is faster than early antegrade velocity:

$${v}_{A2}/{v}_{A1}=1.29\pm 0.05.$$

(11)

Although this effect is weaker than the replication-direction dependence discussed above, Equation (10), its size is still comparable. An analogous late-time speedup is seen in two other ectopic origin strains (see the Supplementary Figs. 8 and 10).

One potential hypothesis is that a locus-dependent mechanism slows the fork in the A1 region relative to the A2 region; however, no velocity difference is evident in these regions in the wild-type cells (Fig. 3b). Alternatively, one could hypothesize that there is some form of communication between forks that leads to a slowdown in region A1 due to the slowdown in region R; however, no coincident slowdown is observed in rrnIHG(inv) at a position opposite the rrnIHG locus, inconsistent with this hypothesis. Another possible hypothesis is that late-time replication is always rapid; however, no significant speedup is observed in either wild-type B. subtilis (Fig. 3a) or V. cholerae cells at the end of the replication process (Fig. 3b and Fig. 2e). However, there is one extremely important difference between 257°::oriC and the wild-type strains: Due to the asymmetric positioning of the origin and replication traps at the terminus (Fig. 3a), there is only a single active replication fork as the A2 region is replicated. We therefore hypothesize that the fork velocity is inversely related to active fork number.

Fork number determines velocities in V. cholerae

To explore the effects of changes in the fork number on fork velocity, it is convenient to return to V. cholerae. In slow growth conditions, the cells start the C period with a pair of replication forks, for which the fork-number model predicts faster fork velocity, and finish the replication cycle with two pairs of forks, predicting slower fork velocity.

Although the structure of the velocity profile is more complex than predicted by the fork-number model alone, the observed fork velocity is broadly consistent with its predictions. If a mean fork velocity is computed before and after oriC2 initiates, the ratio is:

$${v}_{{{{{{{{\rm{before}}}}}}}}}/{v}_{{{{{{{{\rm{after}}}}}}}}}=1.46\pm 0.02,$$

(12)

which is quantitatively consistent with the hypothesis that more forks lead to a slowdown in replication and the size of the effect is comparable to what is observed in B. subtilis, Equation (11).

A mutant V. cholerae strain has been constructed that facilitates a non-trivial test of the fork-number model: In the monochromosomal strain MCH1, Chr2 is recombined into Chr1 at the terminus of Chr1, resulting in a single monochromosome (Chr 1–2) (see Fig. 4a). Both the wild-type and MCH1 strains have essentially identical sequence content, implying the locus-dependent model would predict identical replication velocities; however, all replication in MCH1 occurs with only a single set of forks whereas the wild-type strain replicates the latter half of the C period with two pairs of forks, one pair on each chromosome.

**Fig. 4: Reducing fork number increases fork velocity.**

The measured fork velocities are shown in Fig. 4b and support the fork-number model: MCH1 replicates the sequences after crtS at roughly 1.6 times the fork velocity of the wild-type cells, consistent with the fork-number model. Alternatively, we can consider the same quantitation of fork velocity we considered above: The ratio of fork velocities of loci replicated before crtS to those replicated afterwards:

$${v}_{{{{{{{{\rm{before}}}}}}}}}/{v}_{{{{{{{{\rm{after}}}}}}}}}=1.11\pm 0.03,$$

(13)

therefore only a very small slowdown is observed after crtS is replicated in MCH1, even though exactly the same sequences are replicated, again consistent with the fork-number model.

The fork velocity oscillates in E. coli

Although experiments in V. cholerae clearly support the fork-number model, there is significant variability that cannot be explained by this model alone. Are time-dependent variations in fork velocity also observed in organisms that replicate a single chromosome? To answer this question, we worked in the gram-negative model bacterium Escherichia coli, which harbors a single 4.6 Mb chromosome. A large collection of marker-frequency datasets have already been generated for both rapid and slow growth conditions by the Rudolph lab³³. As with the B. subtilis marker-frequency datasets, we selected those that had the lowest statistical and systematic noise (see the Supplementary Methods 3B).

The fork velocities are shown in Fig. 5. As before, statistically significant variation is observed in the fork velocity as a function of position (see Table 1 and Supplementary Method 6). As discussed above in the context of V. cholerae, we had initially hypothesized that this variation might be a consequence of rDNA position or some other locus-dependent mechanism; however, there are three arguments against this hypothesis: (i) The slow-velocity regions are not coincident with rDNA locations (Fig. 5a) or relative GC content (Supplementary Fig. 1). (ii) Consistent with the time-dependent model, 84% (and 59%) of the observed variation in the fork velocity is symmetric for fast (and slow) growth. (iii) We would expect that a locus-dependent model would predict slow regions that are consistent between fast and slow growth, which is not observed (see the purple arrows in Fig. 5a). We therefore conclude that the dominant mechanism for determining the fork velocity is a time-dependent mechanism, consistent with our observations for V. cholerae.

**Fig. 5: Observed oscillations are consistent with a temporal mechanism.**

Lag-time analysis is particularly informative with respect to the mechanism of variation in the fork velocity: Although there is no alignment in the velocity with respect to locus position (Fig. 5a), there is clear alignment of the fork velocity variation with respect to lag time (Fig. 5b), not only between the left and right arms of the chromosome, but between slow and fast growth conditions. The oscillations do not align with respect to locus position (Fig. 5a) since the difference in average fork velocity leads the slow and fast temporal periods to correspond to different locus positions under slow and fast growth conditions.

Fork-velocity oscillations are observed in three organisms

Temporal oscillations in the fork velocity are an unexpected phenomenon. Are these features a systematic error with a single dataset? First we note that these oscillations are present in two E. coli growth conditions (LB and minimal). This phenomenon would be on sounder footing if similar oscillations are observed in two evolutionarily distant species: the gram-negative V. cholerae and gram-positive B. subtilis. If this phenomenon is observed, to what extent are the oscillations of similar character (e.g., phase, amplitude, and period)?

We compared the lag-time-dependent fork velocity for all three species. In B. subtilis, we have already discussed a rDNA-induced pausing on the right arm, which could complicate the interpretation of the data. We therefore consider the fork velocity on just the left arm. For E. coli and V. cholerae, we compute the average velocity as a function of lag time between the two arms. Since the different organisms and growth conditions have different mean fork velocities, we compare the fork velocity relative to the overall mean. The results are shown in Fig. 6 and Table 2.

Table 2 Velocity oscillation characteristics for different bacterial species and growth conditions

Full size table

All three organisms show oscillations with the same qualitative features: Each fork velocity has roughly the same phase: The velocity begins high, before decaying. The relative amplitudes, roughly 0.5 peak-to-peak, are all comparable with the largest-amplitude oscillations observed in V. cholerae and the smallest in E. coli. When the relative velocities are compared, it is striking how much consistency there is between growth conditions in E. coli and B. subtilis. Finally, the period of oscillation is comparable but distinct in all three organisms, ranging from 10 to 15 min. The oscillation characteristics are summarized in a table in Fig. 6 and Table 2.

Discussion

The focus of this paper is on the development of lag-time analysis, which uses exponential growth as the timer to characterize replication dynamics. Previous marker-frequency analyses have often reported a log-slope (e.g., refs. ^32,34), which is closely related to the fork velocity. What new insights does the measurement of the fork velocity offer over this closely related approach? The fork velocity approach has two important advantages: (i) The first advantage is a conceptual one. The underlying quantity of interest is velocity (or rate per base pair). This is the quantity that is measured in vitro and is relevant in a mechanistic model. In contrast, the log-slope is an emergent quantity that is only relevant in the context of exponential growth. (ii) The second advantage is concrete: Although log-slope measurements allow ratiometric comparisons between fork velocity at different loci in the same dataset, they cannot be used to make comparisons across datasets. Any comparison of the log-slope between cells with different growth rate (e.g., due to changes in growth conditions, mutations, species, etc.) are meaningless. For instance, the log-slopes of the wild-type and MCH1 V. cholerae strains are very different even though the changes in the fork velocity are small. Our wide-ranging comparisons between growth conditions, mutants, and organisms demonstrate the power of reporting fork velocity over the log-slope.

Although our focus has been on replication in bacterial cells, an important question is to what extent our approach could be adapted to eukaryotic cells. First, we emphasize that the lag-time analysis is directly applicable without modification to the eukaryotic context. As such, the timing of the replication of loci can be analyzed; however, since the S phase is typically a smaller fraction of the cell cycle and the genomes of eukaryotic cells are larger, deeper sequencing will be required to achieve the same resolution we demonstrate in the context of bacterial cells. One significant potential refinement to this approach is the use of cell sorting (sort-seq) to enrich for replicating cells which can greatly increase the signal-to-noise ratio^35,36; however, this approach appears to lead to significant flattening near early-firing origins, as we have observed in other contexts (Supplementary Method 3B), and therefore increasing sequencing depth is probably the most promising approach for eukaryotic systems when quantitative characterization is a priority (see Methods Equations (22) and (23) for an estimate of resolution).

Although lag-time analysis can easily be extended to the eukaryotic context, the measurement of the fork velocity will require some care. A critical assumption in our analysis is that replication forks move unidirectionally at any particular locus, i.e., it can be either rightward or leftward moving but not both (see Supplementary Method 3I). Fork traps prevent this bidirectionality in many bacterial cells. For loci in the terminus region, although the replication timing can be determined with high precision, the bidirectionality of the fork movement prevents the measurement of fork velocities in these regions. This is a more important limitation in eukaryotic cells where the number of origins is much greater; however, if regions of the chromosome can be found where fork movement is unidirectional, e.g., sufficiently close to early-firing origins, fork velocity measurements could be made in eukaryotic cells. For instance, these conditions appear to be met for a significant fraction of the Saccharomyces cerevisiae genome³⁶. With significant increases in sequencing depth, we expect analogous replication phenomenology, including pausing and locus- and time-dependent fork velocities, will be observed in eukaryotic systems using lag-time analysis.

As we prepared this manuscript, we became aware of a competing group which also uses marker-frequency analysis to test a specific hypothesis: the fork velocity is oscillatory in E. coli¹⁵, consistent with our own observations. Although our reports share some conclusions, this competing approach requires detailed models for the cell cycle and the fork velocity, along with explicit stochastic simulations. We demonstrate an approach to measure fork velocities independent of model assumptions or detailed hypotheses for the fork velocity, without the need for numerical simulation and complete with the ability to perform an explicit and tractable error analysis.

Although our initial investigations were dependent on explicit numerical simulations of stochastic models, the use of lag-time analysis not only circumvents the need to perform these numerical simulations, but demonstrates that stochastic models are equivalent to deterministic models as well as providing a framework to understand the effects of stochasticity on the growth of populations through the use of the exponential mean, Equation (2)¹³. This significant simplification will make lag-time analysis both widely applicable as well as accessible to other investigators who lack specialized analytical skills and modeling expertise.

Our measurements of the replication velocity reveal that there are multiple important determinants that result in complex velocity profiles. Previous work had already demonstrated that increases (or decreases) in dNTP pool levels lead to concomitant decreases (or increases) in the C period duration, consistent with a dNTP-limited model of the replication velocity^37,38,39,40. Our data are broadly consistent with these previous results, but in a subcellular context: (i) The fork-number model, in which fork velocities decrease as the number of active forks increase, is clearly consistent with a mechanism in which the nucleotide pool levels, although highly regulated⁴¹, cannot completely compensate for the increased incorporation rate associated with multiple forks. (ii) The observation of the fork velocity oscillations is also consistent with an analogous failure of the regulatory response to compensate, this time temporally. The initial fall in the fork velocity is consistent with a model in which dNTP levels initially fall as replication initiates and nucleotides begin to be incorporated into the genome. Reduction in the dNTP levels causes a regulatory response to increase dNTP synthesis by ribonucleotide reductase⁴¹, but the finite response time of the network could lead to dynamic overshoot in the regulatory feedback, leading to oscillations⁴². Ref. ¹⁵ has also argued that this oscillating-dNTP-level model would lead to time-dependent oscillations in the mutation rate which are consistent with the origin-mirror-symmetric distribution of the mutation observed in E. coli. However, this interesting phenomenon and this hypothesized mechanism will require further investigation.

A key clue to the potential significance of the fork-velocity oscillations comes from their observation, not only in E. coli, but also in B. subtilis and V. cholerae, three highly divergent species, as well as their observation under multiple growth conditions. Although it has long been assumed that homeostatic regulation keeps key cellular metabolites in a relatively narrow range, our observations, as well as the recent reports of oscillations in other key nucleotides in bacteria (e.g., ATP in E. coli⁴³), suggest that key metabolites are in fact subject to significant temporal oscillations even in the context of steady-state log-phase growth. These observations, if their ubiquity is supported by future work, may require a significant revision of our understanding of the metabolic environment of the cell.

Retrograde fork motion, where the fork moves in the opposite direction from wild-type cells, lead to the largest changes in fork velocity observed. To what extent is the observed slowdown a consequence of a few long-duration pauses versus a region-wide slowdown? In regions which exclude the rDNA, the effect appears well distributed. However, it is important to note that the genomic resolution of lag-time analysis is still much too low to resolve individual transcriptional units. We anticipate that with increased sequencing depth as well as improvements in sample preparation, this approach could detect genomic structure in the fork velocity at the resolution of individual transcriptional units. Although we did analyze a number of mutants with retrograde fork movement in V. cholerae and E. coli (analysis not shown), the competing effect of increased fork number as well as the genomic instability of these strains made these experiments difficult to interpret quantitatively, since fork number and direction were both affected in these strains^32,44. We concluded qualitatively that retrograde replication direction appears not to play as large a role in these gram-negative bacteria as it does in gram-positive B. subtilits, consistent with previous evidence^{31,32,45,46,47}. However, we expect lag-time analysis could be used to characterize even small effects of the retrograde fork orientation in more-carefully engineered strains, analogous to those that we analyzed in the context of B. subtilis^31,32.

Previous reports^31,32, including our own^{12,48,49,50,51}, had reported long-duration replication-conflict induced pauses, especially in mutant strains where the orientation of rDNA³¹ or other highly transcribed genes⁴⁸ are inverted to give rise to a head-on conflict between replication and transcription. The contribution of lag-time analysis in this context is multifold: First, we provide a quantitative number in the context of the very-short-duration pauses for co-directional transcription in wild-type cells. This analysis supports a long-standing hypothesis that the right arm of the B. subtilis chromosome is shorter than the left arm to compensate for pausing at the rDNA loci that arm predominately located on this arm.

We also report quantitative measurements for the longer pauses that results from head-on conflicts in mutants where highly transcribed genes are inverted. Our analysis gives us the ability to quantitatively differentiate the contributions of fork pausing and arrest in the analysis of the marker frequency, which was previously impossible. Our measurement of a timescale of minutes is consistent with our previous in vivo single-molecule measurements in which we report transcription-dependent disassembly of the core replisome¹². Could the observed fork-velocity oscillations be misinterpreted as pauses? The observed lag-time offset between the two arms (e.g., Fig. 3b) is not predicted by fork-velocity oscillations.

In this paper, we introduce a method for quantitively characterizing cellular dynamics by lag-time analysis. Although more broadly applicable, we focus our analysis on the characterization of replication dynamics using next-generation sequencing to quantitate DNA locus copy number genome-wide. The approach has the ability to make precise, even at the resolution of seconds, measurements of time differences and pause durations, as well as the ability to quantitatively measure fork velocities in vivo in physiological units of kb s⁻¹, at genomic resolutions of roughly 100 kb. Importantly, unlike marker-frequency analysis, our approach allows direct quantitative comparisons to be made between growth conditions, mutant strains, and even different organisms. The resulting measurements of replication dynamics reveal complex phenomenology, including temporal oscillations in the fork velocity as well as evidence for multiple mechanisms that determine the fork velocity. The lag-time analysis has great potential for application beyond bacterial systems as well as the potential to significantly increase in resolution and sensitivity as sequencing depth and sample preparation improve.

Methods

Strains used in this study

Detailed information about the strains used in this study are included in Supplementary Table 1.

Introduction to marker-frequency analysis

Our focus will be on marker-frequency analysis, which measures the total number of a genetic locus in an asynchronous population. The model was generalized to predict the marker frequency N(ℓ) of a locus a genomic distance ℓ away from the origin^20,21,22:

$$N(\ell )={N}_{0}\,{{{{{{{{\rm{e}}}}}}}}}^{-\alpha|\ell|},$$

(14)

where N₀ is the number of origins, which grows exponentially in time with the rate of mass doubling of the culture, k_G. Since the origin is replicated first, the number of origins is always largest compared to the numbers of other loci. Quantitatively, the copy number is predicted to decay exponentially with log-slope:

$$\alpha=-\frac{{{{{{{{\rm{d}}}}}}}}}{{{{{{{{\rm{d\ell }}}}}}}}}\ln N(\ell )={k}_{G}/v,$$

(15)

where k_G is the population growth rate and v is the fork velocity, typically expressed in units of kilobases per second. To derive this result, two critical assumptions were made: (i) the timing of the cell cycle is deterministic and (ii) the fork velocity is constant^19,20.

Initially, our naïve expectation was that the interplay between the significant stochasticity of the cell-cycle timing with the asynchronicity of the culture would prevent marker-frequency analysis from being used as a quantitative tool for characterizing cell-cycle dynamics. For instance, significant stochasticity is observed in the duration of the B period⁵² (i.e., the duration of time between cell birth and the initiation of replication). Does this stochasticity lead to a failure of the log-slope relation, Equation (15)?

Stochastic simulations support the log-slope relation

To explore the role of stochasticity and a locus-dependent fork velocity in shaping the marker frequency, we simulated the cell cycle using a stochastic simulation. Our aim was not to perform a simulation whose mechanistic details were correct, but rather to study how strong violations of the Cooper-Helmstetter assumptions, in particular how stochasticity, as strong or stronger than that observed, influenced the observed marker frequency and the log-slope relation, Equation (15). In short, we used a Gillespie simulation⁵³ where the B period duration and the lifetime of replisome nucleotide incorporation steps are exponentially distributed, and we added regions of the genome where the incorporation rate was fast as well as a single slow step on one arm. See Fig. 7a and Supplementary Notes 1–6 for a detailed description of the model, as well as movies of the marker frequency approaching steady-state growth, starting from a single-cell progenitor (Supplementary Movies 1 and 2).

To our initial surprise, the stochasticity of the model had no effect on the predicted log-slope of the locus copy number (see Fig. 7b). In spite of the stochastic duration of the B period and the locus-dependence, the marker frequency still decays exponentially with the same decay length locally, i.e.,:

$$\alpha (\ell )\equiv -\frac{{{{{{{{\rm{d}}}}}}}}}{{{{{{{{\rm{d\ell }}}}}}}}}\ln N(\ell )={k}_{G}/v(\ell ),$$

(16)

where k_G was the empirically determined growth rate in the simulation and v(ℓ) was the local fork velocity at the locus with position ℓ.

We hypothesized that this result might be a special case of choosing an exponential lifetime distribution, since this is consistent with a stochastic realization of chemical kinetics. To test this hypothesis, we simulated several different distributions, including a uniform distribution, for the duration of the B period and the stepping lifetime for the replisome (as well as simulating multifork replication). In each case, the local log-slope relation held, Equation (16), even as the growth rate and fork velocities changed with the changes in the underlying simulated growth dynamics. We therefore hypothesized that Equation (16) was a universal law of cell-cycle dynamics and independent of Cooper and Helmstetter’s original assumptions.

The exponential-mean duration

Motivated by this empirical evidence, we exactly computed the population demography in a class of stochastically timed cell models¹³. In short, we showed that there is an exact correspondence between these stochastically timed models and deterministically timed models in exponential growth. The relationship between the corresponding deterministic lifetime τ_i of a state i and the underlying distribution p_i in the stochastic model is the exponential mean, Equation (2)¹³. The exponential mean biases the mean towards short times, the growth rate k_G determines the strength of this bias, and the biological mechanism for this bias is due to the enrichment of young cells relative to old cells in an exponentially growing culture¹³.

To understand the consequences of this result, we consider two special cases of this exponential mean. For processes with lifetimes short compared to the doubling time, Equation (2), can be Taylor expanded to show that the exponential mean is:

$$\tau \, \approx \, {\mu }_{t}-\frac{1}{2}{k}_{G}{\sigma }_{t}^{2}+...,$$

(17)

the regular arithmetic mean μ_t with a leading-order correction proportional to the product of the growth rate and variance ${\sigma }_{t}^{2}$. In the context of single-nucleotide incorporation, this correction is on order one-part-in-a-million and therefore can be ignored. As a consequence, Equation (16), corresponding to the transitions between states with short-lifetimes, is unaffected by the stochasticity, exactly as we observed in our simulations.

Another important case to consider is the strong disorder limit, in which a small fraction of the population ϵ stochastically arrests, i.e., with lifetime ∞, while the other individuals have exponential-mean lifetime τ₀. Using the definition in Equation (2), it is straightforward to show that the deterministic lifetime is:

$$\tau={\tau }_{0}-T{\log }_{2}(1-\epsilon )\approx {\tau }_{0}+\frac{\epsilon }{\ln 2}T,$$

(18)

where T is the population doubling time and the second equality is an approximation for small ϵ. The exponential-mean duration is extended by the arrest, but remains finite. Therefore, an arrest of a subpopulation is indistinguishable from a longer duration pause in an exponentially proliferating population (see ref. ¹³).

Marker-frequency demography

For a stochastic model with locus-dependent fork velocity, we showed that Equations (14) and (15) generalize to

$$N(\ell )={N}_{0}\,{{{{{{{{\rm{e}}}}}}}}}^{-{k}_{G}\tau (\ell )},$$

(19)

where we will call τ(ℓ) the lag time of a locus at position ℓ, which is equal to the sum of the differential lag times for each sequential step:

$${\tau }_{j}=\mathop{\sum }\limits_{i=0}^{j-1}\delta {\tau }_{i},$$

(20)

where δτ_i is the differential lag time for state i or the exponential mean of the state lifetime¹³. In the continuum limit, it is more convenient to represent this sum as an integral:

$$\tau ({\ell }_{i})=\int\nolimits_{0}^{{\ell }_{i}}\,\,\,\,{{{{{{{\rm{d}}}}}}}}\ell \frac{1}{v(\ell )},$$

(21)

where the fork velocity is defined: v(ℓ_i) ≡ 1 bp/δτ_i. To demonstrate that the generalized stochastic model predicts the log-slope relation, Equation (16), the log-slope can be derived by substituting Equation (21) into Equation (19), as was observed in the stochastic simulations, demonstrating the universality of Equation (4). We note that Wang and coworkers had previously derived an equivalent expression using the deterministic framework of the Cooper-Helmstetter model in the Material and Methods Section of ref. ³¹.

Stochasticity has a minimal effect on the marker frequency

We initially had hypothesized that stochasticity should affect the marker frequency. As explained above, it is the rapidity of base incorporation that explains why stochasticity is dispensable in this context. The same argument does not apply to the B period which is comparable to the duration of the cell cycle. However, for the marker frequency, it is lag-time differences between the replication times of loci that is determinative, and therefore the lag time of the B period cancels from these lag-time differences. Although it is mostly irrelevant for understanding wild-type cell dynamics, stochasticity and an arrested subpopulation will play an important role in one phenomenon we analyze: replication-conflict induced pauses.

Time resolution

Due to the large number of reads achievable in next-generation sequencing, the time resolution will be high in carefully designed analyses. The number of reads is subject to counting or Poisson noise. It is therefore straightforward to estimate the experimental uncertainty in the lag time due to finite read number:

$${\sigma }_{{\tau }_{j}}={k}_{G}^{-1}\frac{1}{\sqrt{{N}_{j}}}=1\,{{{{{{{\rm{s}}}}}}}}\cdot {\left(\frac{6\times 1{0}^{6}}{{N}_{j}}\right)}^{1/2},$$

(22)

where we have used a read number inspired by the replication-conflict pausing example. This estimate suggests that under standard conditions, time measurements with an uncertainty of seconds are possible using this approach.

Fork-velocity resolution

To compute the slope in Equation (4), the log-marker-frequency is fit to a piecewise linear function with equal spacing between knots (see Fig. 7b). There is an important tradeoff between genomic resolution (i.e., the genomic distance between knots) and fork velocity precision (i.e., the uncertainty in velocity measurement): Increasing the genomic distance between knots reduces the genomic resolution but also reduces the uncertainty in the velocity measurement. We therefore consider a series of models with increasing genomic resolution and use the Akaike Information Criterion (AIC) to select the optimal model^28,29 (see Supplementary Methods 3J). This approach balances the desire to resolve features by increasing the genomic resolution with the loss of velocity precision.

Given a knot spacing, it is straightforward to estimate the relative error:

$$\frac{{\sigma }_{v}}{v}=\sqrt{\frac{2}{n\,{({{\Delta }}\ell )}^{3}}}\frac{v}{{k}_{G}}\, \approx \,0.1\cdot {\left(\frac{1.5}{n}\right)}^{1/2}{\left(\frac{100\,{{{{{{{\rm{kb}}}}}}}}}{{{\Delta }}\ell }\right)}^{3/2},$$

(23)

where n is the read depth in reads per base and Δℓ is the spacing between knots in basepairs. Therefore, for a canonical next-generation-sequencing experiment, we can expect to achieve roughly 10% error in the fork velocity for 100 kb genomic resolution. Note that in our error analysis, we have included only the error from cell number N, not the error from the uncertainty in the cell-cycle duration, which covaries between loci in a particular experiment.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The sequencing datasets generated during the current study are available from the NCBI Sequence Read Archive with the BioProject accession code PRJNA919081. The data from Galli et al.³⁴ and Midgley-Smith et al.⁵⁴ are both available from the European Nucleotide Archive (ENA), with the accession codes PRJEB28538 and PRJEB25595, respectively. The digitized data from Wang et al.³² and Srivatsan et al.³¹ are available in the Source Data file. More detailed information about data availability is provided in Supplementary Table 2. Source data are provided with this paper.

Code availability

MATLAB scripts written for this study are available on the GitHub repository and on reasonable request.

References

Phillips, R., Kondev, J., Theriot, J. & Orme, N. Physical Biology of the Cell (Garland Science, 2013).
Tinoco, I. Jr & Gonzalez, R. L. Jr Biological mechanisms, one molecule at a time. Genes Dev. 25, 1205–1231 (2011).
Article CAS PubMed PubMed Central Google Scholar
Elías-Arnanz, M. & Salas, M. Resolution of head-on collisions between the transcription machinery and bacteriophage phi29 DNA polymerase is dependent on RNA polymerase translocation. EMBO J. 18, 5675–5682 (1999).
Article PubMed PubMed Central Google Scholar
Deshpande, A. M. & Newlon, C. S. DNA replication fork pause sites dependent on transcription. Science 272, 1030–1033 (1996).
Article ADS CAS PubMed Google Scholar
Liu, B. & Alberts, B. M. Head-on collision between a DNA replication apparatus and RNA polymerase transcription complex. Science 267, 1131–1137 (1995).
Article ADS CAS PubMed Google Scholar
Rocha, E. P. C. The organization of the bacterial genome. Annu. Rev. Genet. 42, 211–233 (2008).
Article CAS PubMed Google Scholar
Mirkin, E. V. & Mirkin, S. M. Replication fork stalling at natural impediments. Microbiol. Mol. Biol. Rev. 71, 13–35 (2007).
Article CAS PubMed PubMed Central Google Scholar
Yao, N. Y. & O’Donnell, M. Replisome structure and conformational dynamics underlie fork progression past obstacles. Curr. Opin. Cell Biol. 21, 336–343 (2009).
Article CAS PubMed PubMed Central Google Scholar
Pomerantz, R. T. & O’Donnell, M. The replisome uses mRNA as a primer after colliding with RNA polymerase. Nature 456, 762–766 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Adelman, K. & Lis, J. T. Promoter-proximal pausing of RNA polymerase II: emerging roles in metazoans. Nat. Rev. Genet. 13, 720–731 (2012).
Article CAS PubMed PubMed Central Google Scholar
Davenport, R. J., Wuite, G. J., Landick, R. & Bustamante, C. Single-molecule study of transcriptional pausing and arrest by E. coli RNA polymerase. Science 287, 2497–2500 (2000).
Article ADS CAS PubMed Google Scholar
Mangiameli, S. M., Merrikh, C. N., Wiggins, P. A. & Merrikh, H. Transcription leads to pervasive replisome instability in bacteria. Elife 6, e19848 (2017).
Article PubMed PubMed Central Google Scholar
Huang, D., Lo, T., Merrikh, H. & Wiggins, P. A. Characterizing stochastic cell-cycle dynamics in exponential growth. Phys. Rev. E 105, 014420 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Bates, D. et al. The Escherichia coli baby cell column: a novel cell synchronization method provides new insight into the bacterial cell cycle. Mol. Microbiol. 57, 380–391 (2005).
Article CAS PubMed PubMed Central Google Scholar
Bhat, D., Hauf, S., Plessy, C., Yokobayashi, Y. & Pigolotti, S. Speed variations of bacterial replisomes. Elife 11, e75884 (2022).
Article CAS PubMed PubMed Central Google Scholar
Wang, J. D. & Levin, P. A. Metabolism, cell growth and the bacterial cell cycle. Nat. Rev. Microbiol. 7, 822–827 (2009).
Article CAS PubMed PubMed Central Google Scholar
Willis, L. & Huang, K. C. Sizing up the bacterial cell cycle. Nat. Rev. Microbiol. 15, 606–620 (2017).
Article CAS PubMed Google Scholar
Reyes-Lamothe, R. & Sherratt, D. J. The bacterial cell cycle, chromosome inheritance and cell growth. Nat. Rev. Microbiol. 17, 467–478 (2019).
Article CAS PubMed Google Scholar
Cooper, S. & Helmstetter, C. E. Chromosome replication and the division cycle of Escherichia coli B/r. J. Mol. Biol. 31, 519–540 (1968).
Article CAS PubMed Google Scholar
Bremer, H. & Churchward, G. An examination of the Cooper-Helmstetter theory of DNA replication in bacteria and its underlying assumptions. J. Theor. Biol. 69, 645–654 (1977).
Article ADS CAS PubMed Google Scholar
Bird, R. E., Louarn, J., Martuscelli, J. & Caro, L. Origin and sequence of chromosome replication in Escherichia coli. J. Mol. Biol. 70, 549–566 (1972).
Article CAS PubMed Google Scholar
Pritchard, R. H., Chandler, M. G. & Collins, J. Independence of F replication and chromosome replication in Escherichia coli. Mol. Gen. Genet. 138, 143–155 (1975).
Article CAS PubMed Google Scholar
Val, M.-E., Soler-Bistué, A., Bland, M. J. & Mazel, D. Management of multipartite genomes: the Vibrio cholerae model. Curr. Opin. Microbiol. 22, 120–126 (2014).
Article CAS PubMed Google Scholar
Srivastava, P., Fekete, R. A. & Chattoraj, D. K. Segregation of the replication terminus of the two Vibrio cholerae chromosomes. J. Bacteriol. 188, 1060–1070 (2006).
Article CAS PubMed PubMed Central Google Scholar
Rasmussen, T., Jensen, R. B. & Skovgaard, O. The two chromosomes of Vibrio cholerae are initiated at different time points in the cell cycle. EMBO J. 26, 3124–3131 (2007).
Article CAS PubMed PubMed Central Google Scholar
Val, M.-E. et al. A checkpoint control orchestrates the replication of the two chromosomes of Vibrio cholerae. Sci. Adv. 2, e1501914 (2016).
Article ADS PubMed PubMed Central Google Scholar
French, S. Consequences of replication fork movement through transcription units in vivo. Science 258, 1362–1365 (1992).
Article ADS CAS PubMed Google Scholar
Burnham, K. P. & Anderson, D. R. Model Selection and Multimodel Inference, 2nd. edn. (Springer-Verlag New York, Inc., 1998).
Akaike, H. In 2nd International Symposium of Information Theory (eds. N., P. B. & Csaki, E.) 267–281 (Akademiai Kiado, Budapest., 1973).
Dutta, D., Shatalin, K., Epshtein, V., Gottesman, M. E. & Nudler, E. Linking RNA polymerase backtracking to genome instability in E. coli. Cell 146, 533–543 (2011).
Article CAS PubMed PubMed Central Google Scholar
Srivatsan, A., Tehranchi, A., MacAlpine, D. M. & Wang, J. D. Co-orientation of replication and transcription preserves genome integrity. PLoS Genet. 6, e1000810 (2010).
Article PubMed PubMed Central Google Scholar
Wang, J. D., Berkmen, M. B. & Grossman, A. D. Genome-wide coorientation of replication and transcription reduces adverse effects on replication in Bacillus subtilis. Proc. Natl Acad. Sci. USA 104, 5608–5613 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Rudolph, C. J., Upton, A. L., Stockum, A., Nieduszynski, C. A. & Lloyd, R. G. Avoiding chromosome pathology when replication forks collide. Nature 500, 608–611 (2013).
Article ADS CAS PubMed Google Scholar
Galli, E. et al. Replication termination without a replication fork trap. Sci. Rep. 9, 8315 (2019).
Article ADS PubMed PubMed Central Google Scholar
Batrakou, D. G., Müller, C. A., Wilson, R. H. C. & Nieduszynski, C. A. DNA copy-number measurement of genome replication dynamics by high-throughput sequencing: the sort-seq, sync-seq and MFA-seq family. Nat. Protoc. 15, 1255–1284 (2020).
Article CAS PubMed Google Scholar
Müller, C. A. et al. The dynamics of genome replication using deep sequencing. Nucleic Acids Res. 42, e3 (2014).
Article PubMed Google Scholar
Churchward, G. & Bremer, H. Determination of deoxyribonucleic acid replication time in exponentially growing Escherichia coli B/r. J. Bacteriol. 130, 1206–1213 (1977).
Article CAS PubMed PubMed Central Google Scholar
Zaritsky, A. & Pritchard, R. H. Changes in cell size and shape associated with changes in the replication time of the chromosome of Escherichia coli. J. Bacteriol. 114, 824–837 (1973).
Article CAS PubMed PubMed Central Google Scholar
Odsbu, I., Morigen & Skarstad, K. A reduction in ribonucleotide reductase activity slows down the chromosome replication fork but does not change its localization. PLoS ONE 4, e7617 (2009).
Article ADS PubMed PubMed Central Google Scholar
Zhu, M. et al. Manipulating the bacterial cell cycle and cell size by titrating the expression of ribonucleotide reductase. mBio 8, e01741–17 (2017).
Article CAS PubMed PubMed Central Google Scholar
Gon, S. et al. A novel regulatory mechanism couples deoxyribonucleotide synthesis and DNA replication in Escherichia coli. EMBO J. 25, 1137–1147 (2006).
Article CAS PubMed PubMed Central Google Scholar
Arfken, G. B. & Weber, H. J. Mathematical Methods for Physicists, 4th edn. (Academic Press, 1995).
Lin, W.-H. & Jacobs-Wagner, C. Connecting single-cell ATP dynamics to overflow metabolism, cell growth, and the cell cycle in Escherichia coli. Curr. Biol. 32, 3911–3924.e4 (2022).
Article CAS PubMed Google Scholar
Dimude, J. U. et al. Origins left, right, and centre: increasing the number of initiation sites in the Escherichia coli chromosome. Genes (Basel) 9, 376 (2018).
Article PubMed Google Scholar
Mirkin, E. V. & Mirkin, S. M. Mechanisms of transcription-replication collisions in bacteria. Mol. Cell Biol. 25, 888–895 (2005).
Article CAS PubMed PubMed Central Google Scholar
Mirkin, E. V., Castro Roa, D., Nudler, E. & Mirkin, S. M. Transcription regulatory elements are punctuation marks for DNA replication. Proc. Natl Acad. Sci. USA 103, 7276–7281 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Lang, K. S. & Merrikh, H. The clash of macromolecular titans: replication-transcription conflicts in bacteria. Annu. Rev. Microbiol. 72, 71–88 (2018).
Article CAS PubMed PubMed Central Google Scholar
Lang, K. S. et al. Replication-transcription conflicts generate R-loops that orchestrate bacterial stress survival and pathogenesis. Cell 170, 787–799.e18 (2017).
Article CAS PubMed PubMed Central Google Scholar
Merrikh, H., Machón, C., Grainger, W. H., Grossman, A. D. & Soultanas, P. Co-directional replication-transcription conflicts lead to replication restart. Nature 470, 554–557 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Merrikh, C. N., Brewer, B. J. & Merrikh, H. The B. subtilis accessory helicase PcrA facilitates DNA replication through transcription units. PLoS Genet. 11, e1005289 (2015).
Article PubMed PubMed Central Google Scholar
Million-Weaver, S. et al. An underlying mechanism for the increased mutagenesis of lagging-strand genes in Bacillus subtilis. Proc. Natl Acad. Sci. USA 112, E1096–E1105 (2015).
Article CAS PubMed PubMed Central Google Scholar
Si, F. et al. Mechanistic origin of cell-size control and homeostasis in bacteria. Curr. Biol. 29, 1760–1770.e7 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gillespie, D. T. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81, 2340–2361 (1977).
Article CAS Google Scholar
Midgley-Smith, S. L. et al. Chromosomal over-replication in Escherichia coli recG cells is triggered by replication fork fusion and amplified if replichore symmetry is disturbed. Nucleic Acids Res. 46, 7701–7715 (2018).
Article CAS PubMed PubMed Central Google Scholar
Val, M.-E., Skovgaard, O., Ducos-Galand, M., Bland, M. J. & Mazel, D. Genome engineering in Vibrio cholerae: a feasible approach to address biological issues. PLoS Genet. 8, e1002472 (2012).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to thank B. Traxler, A. Nourmohammad, J. Mougous, and J. Mittler for many useful conversations. We would like to thank P. Levin, J. Wang, L. Simmons, and S. Pigolotti for advice on our manuscript. We thank S. B. Peterson and A. Schaefer for help with V. cholerae. We would like to thank J. Wang, C. Possoz, F.-X. Barre, and C. Rudolph for detailed conversations about their data. This work was supported by NIH grant R01-GM128191, which was awarded to P.A.W. and H.M.

Author information

Authors and Affiliations

Department of Physics, University of Washington, Seattle, WA, 98195, USA
Dean Huang, Brandon S. Sim, Teresa W. Lo & Paul A. Wiggins
Department of Biochemistry, Vanderbilt University, Nashville, TN, 37205, USA
Anna E. Johnson & Houra Merrikh
Department of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
Anna E. Johnson & Houra Merrikh
Department of Bioengineering, University of Washington, Seattle, WA, 98195, USA
Paul A. Wiggins
Department of Microbiology, University of Washington, Seattle, WA, 98195, USA
Paul A. Wiggins

Authors

Dean Huang
View author publications
You can also search for this author in PubMed Google Scholar
Anna E. Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Brandon S. Sim
View author publications
You can also search for this author in PubMed Google Scholar
Teresa W. Lo
View author publications
You can also search for this author in PubMed Google Scholar
Houra Merrikh
View author publications
You can also search for this author in PubMed Google Scholar
Paul A. Wiggins
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.H., A.E.J., T.W.L., H.M., and P.A.W. designed the experiments. D.H., A.E.J., and B.S.S. assembled input data and ran experiments. D.H. and P.A.W. developed the theory, wrote code, ran the model, and analyzed output data. D.H., A.E.J., H.M., and P.A.W. wrote the manuscript.

Corresponding authors

Correspondence to Houra Merrikh or Paul A. Wiggins.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Sergei Mirkin and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Movie 1

Supplementary Movie 2

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Huang, D., Johnson, A.E., Sim, B.S. et al. The in vivo measurement of replication fork velocity and pausing by lag-time analysis. Nat Commun 14, 1762 (2023). https://doi.org/10.1038/s41467-023-37456-2

Download citation

Received: 06 October 2022
Accepted: 17 March 2023
Published: 30 March 2023
DOI: https://doi.org/10.1038/s41467-023-37456-2

This article is cited by

Chromosome organization shapes replisome dynamics in Caulobacter crescentus
- Chen Zhang
- Asha Mary Joseph
- Suliana Manley
Nature Communications (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.