Homeostasis of protein and mRNA concentrations in growing cells

Lin, Jie; Amir, Ariel

doi:10.1038/s41467-018-06714-z

Download PDF

Article
Open access
Published: 29 October 2018

Homeostasis of protein and mRNA concentrations in growing cells

Nature Communications volume 9, Article number: 4496 (2018) Cite this article

15k Accesses
104 Citations
38 Altmetric
Metrics details

Subjects

Abstract

Many experiments show that the numbers of mRNA and protein are proportional to the cell volume in growing cells. However, models of stochastic gene expression often assume constant transcription rate per gene and constant translation rate per mRNA, which are incompatible with these experiments. Here, we construct a minimal gene expression model to fill this gap. Assuming ribosomes and RNA polymerases are limiting in gene expression, we show that the numbers of proteins and mRNAs both grow exponentially during the cell cycle and that the concentrations of all mRNAs and proteins achieve cellular homeostasis; the competition between genes for the RNA polymerases makes the transcription rate independent of the genome number. Furthermore, by extending the model to situations in which DNA (mRNA) can be saturated by RNA polymerases (ribosomes) and becomes limiting, we predict a transition from exponential to linear growth of cell volume as the protein-to-DNA ratio increases.

Contributions to the ‘noise floor’ in gene expression in a population of dividing cells

Article Open access 11 August 2020

Environment-specificity and universality of the microbial growth law

Article Open access 31 August 2022

Exploring the effect of network topology, mRNA and protein dynamics on gene regulatory network stability

Article Open access 08 January 2021

Introduction

Despite the noisy nature of gene expression^1,2,3,4,5,6, various aspects of single cell dynamics, such as volume growth, are effectively deterministic. Recent single-cell measurements show that the growth of cell volume is often exponential. These include bacteria^7,8,9,10, archaea¹¹, budding yeast^{10,12,13,14,15} and mammalian cells^10,16. Moreover, the mRNA and protein numbers are often proportional to the cell volume throughout the cell cycle: the homeostasis of mRNA concentration and protein concentration is maintained in an exponentially growing cell volume with variable genome copy number^{17,18,19,20,21,22}. The exponential growths of mRNA and protein number indicate dynamical transcription and translation rates proportional to the cell volume, rather than the genome copy number. However, current gene expression models often assume constant transcription rate per gene and constant translation rate per mRNA (constant rate model)^1,5,23,24,25. Assuming a finite degradation rate of mRNAs and non-degradable proteins, these models lead to a constant mRNA number proportional to the gene copy number and linear growth of protein number^26,27,28, incompatible with the proportionality of mRNA and protein number to the exponentially growing cell volume.

Since the cell volume, protein copy number and mRNA copy number grow exponentially throughout the cell cycle, one may expect a sufficient condition to achieve a constant concentration is to let them grow with the same exponential growth rate. However, mathematical analysis suggests this is insufficient. Let us consider the logarithm of protein concentration c, which can be written as ln(c) = ln(p) − ln(V). Here p is the protein number and V is the cell volume. If one assumes the protein number and the cell volume grow exponentially but independently, with time-dependent exponential growth rates λ_p(t) and λ_v(t) respectively, the time derivative of the logarithm of concentration then obeys d ln(c)/dt ~ λ_p(t) − λ_v(t). Even when the time-averaged growth rates of protein number and cell volume are equal, $\langle {\lambda _p(t)}\rangle = \langle {\lambda _v(t)}\rangle$, any fluctuations in the difference between them will accumulate and lead to a random walk behavior of the logarithm of concentration. The homeostasis of protein and mRNA concentrations implies that there must be a regulatory mechanism in place to prevent the accumulation of noise over time.

The main goal of this work is to identify such a mechanism by developing a coarse-grained model taking into account cell volume growth explicitly. Specially, we only consider continuously proliferating cells and do not take account of non-growing cells, e.g., bacterial cells in stationary phase²⁹. The ubiquity of homeostasis suggests that the global machinery of gene expression, RNA polymerases (RNAPs) and ribosomes, should play a central role within the model. Based on the assumption that the number of ribosomes is the limiting factor in translation, we find that the exponential growth of cell volume and protein number originates from the auto-catalytic nature of ribosomes^30,31,32,33. The fact that ribosomes make all proteins ensures that the protein concentrations do not diverge. Based on the assumption that the number of RNAP is the limiting factor in transcription, we find that the mRNA number also grows exponentially and the mRNA concentration is independent of the genome copy number because of the competition between genes for this global resource^18,19,20. We also study the effects of genome replication. Due to the heterogeneous timing of gene replication, the transcription rate of one gene has a cell cycle dependence. Within our model, it doubles immediately after the gene is replicated and decreases gradually as other genes are replicated. Nevertheless, we find that this leads to a small effect on protein levels. Finally, we extend our model to more general situations in which an excess of RNAP (ribosome) leads to the saturation of DNA (mRNA). We propose a phase diagram of gene expression and cellular growth controlled by the protein-to-DNA ratio. We predict a transition from exponential growth to linear growth of cell volume as the protein-to-DNA ratio passes a threshold.

Results

Model of stochastic gene expression

In constant rate models, the transcription rate per gene and the translation rate per mRNA are constant^1,5,24 (Fig. 1a). Constant rate models predict a constant mRNA number proportional to the gene copy number and independent of the cell volume. However, experimental observations on plant and mammalian cells have revealed a proportionality between mRNA number and cell volume for cells with a constant genome copy number^18,19,20. Moreover, even comparing the cells before and after the genome replication (S phase), the proportionality coefficient between mRNA and cell volume does not exhibit any obvious change. In contrast, a constant transcription rate per gene would predict a doubled transcription rate after the replication of the whole genome, leading to a higher mRNA concentration. In one class of constant rate models^26,27,34, a deterministic exponential growth of cell volume is explicitly considered. The resulting perturbation on the concentrations due to genome replication is suppressed in the long lifetime limit, but still significant for short lifetime molecules, e.g., mRNA (see Fig. 1 in ref.²⁷).

Considering translation, various experiments have shown that the number of ribosomes is the limiting factor rather than the number of mRNAs. The most direct evidence is the growth law: the growth rate of cells is proportional to the fraction of ribosomal proteins in the total proteome (with a constant factor depending on the growth condition)³⁵ both for bacterial cells^30,31,36 and budding yeast cells³². This means a constant fraction of ribosomes are actively translating mRNAs. These results suggest that in general cells are below the saturation limit in which there are too many ribosomes that the mRNAs can bind. We will therefore assume the biological situation in which mRNAs in the cell compete for the limiting resource of actively translating ribosomes, therefore the translation rate of one type of mRNA is proportional to the number of active ribosomes times its fraction in the total pool of mRNAs.

Considering transcription, experiments have shown that mutants of fission yeasts altered in cell size regulated global transcription to maintain similar transcription rates per cell volume regardless of the cellular DNA content. The changes in total transcription correlated with coordinated changes in gene occupancy by RNA polymerases³⁷. These results suggest that the number of RNAPs may be the limiting factor in transcription rather than the gene number, and similar evidence has been shown for bacterial cells³⁸ and mammalian cells³⁹. However, in the same experiments on fission yeast³⁷, it has also been found that in cell-cycle-arrested mutants, total transcription rates stopped increasing as the cell volume exceeded a certain value, which suggested DNA became limiting for transcription at low DNA concentration. This result suggests that an excess of RNAPs may lead the gene number to become the limiting factor in certain conditions. In this section, we will focus on the scenario that both RNAP and ribosome are limiting in gene expression, which we denote as Phase 1. In this phase, we will show that the mRNA number and the protein number are proportional to the cell volume and grow exponentially. In a later section (A unified phase diagram of gene expression and cellular growth), we will consider a more general model in which the limiting nature of RNAPs and ribosomes may break down and the dynamics of mRNA and protein number is different.

To address the limiting nature of RNAP, we define an effective gene copy number g_i for each gene to account for its copy number and the binding strength of its promoter, which determines its ability to compete for RNAPs. The transcription rate for one specific gene i is proportional to the fraction of RNAPs that are working on its gene(s), $\phi _i = g_i{\mathrm{/}}\mathop {\sum}\nolimits_j {\kern 1pt} g_j$, which we denote as the gene allocation fraction. Gene regulation is thus coarse-grained into the gene allocation fraction ϕ_i. The transcription rate is independent of the genome copy number since a change in the genome number leaves the allocation fraction of one gene invariant, a conclusion which is consistent with a number of experimental results on various organisms^18,19,20,37.

In fact, explicit gene regulation can also be included in our model (Methods), with a time-dependent g_i. In such scenarios, g_i may be a function of protein concentrations (for instance, the action of transcription factors modifies the transcription rate). Such models will lead to more complex dynamics of mRNA and protein concentrations. However, since we are interested in the global behavior of gene expression and cell volume growth, we do not focus on these complex regulations in this manuscript. Our conclusions regarding the exponential growth of mRNA and protein number for constitutively expressed genes and the exponential growth of cell volume on the global level are not affected by the dynamics of gene expression of particular genes.

In the following, m, p, r, n represents the numbers of mRNAs, proteins, ribosomes and RNA polymerases, respectively. Proteins (p) also include RNAPs (n) and ribosomes (r)³⁰. We consider the degradation of mRNA with degradation time τ for all genes. The protein number decreases only through cell divisions (though adding a finite degradation rate for proteins does not affect our results). The stochastic dynamics of gene expression within Phase 1 of our model are summarized in the following sets of equations and Fig. 1b,

$$m_i\mathop{\longrightarrow}\limits^{{k_0\left( {g_i{\mathrm{/}}\mathop {\sum}\limits_j {\kern 1pt} g_j} \right)n}}m_i + 1,$$

(1)

$$m_i\mathop{\longrightarrow}\limits^{{m_i/\tau }}m_i - 1,$$

(2)

$$p_i\mathop{\longrightarrow}\limits^{{k_t\left( {m_i{\mathrm{/}}\mathop {\sum}\limits_j {\kern 1pt} m_j} \right)f_ar}}p_i + 1.$$

(3)

Here k₀, k_t are constants, characterizing the transcription (translation) rate of a single RNAP (ribosome). f_a is the fraction of active ribosomes, which we assume to be constant in a given nutrient environment^30,32. We note that nonspecifically bound RNAPs have been reported in bacteria^40,41. We will discuss their effect later. For simplicity, we first assume the values of ϕ_i do not change in time. This can be formally thought of as corresponding to an instantaneous replication of the genome. In reality, a finite duration of DNA replication and the varying time of replication initiation for different genes lead to ϕ_i’s that change during the DNA replication. We later analyze a more complete version of the model which includes these gene dosage effects, but we first consider the simplified scenario of constant ϕ_i that will capture the essential features of the problem.

We assume the cell volume is approximately proportional to the total protein mass, i.e., $V \propto M = \mathop {\sum}\nolimits_j {\kern 1pt} p_j$, which is a good approximation for bacteria^42,43 and mammalian cells¹⁷. To simplify the following formulas, we consider each protein has the same mass and set the cell density as 1.

Due to the fast degradation of mRNA compared with the cell cycle duration^44,45, the mRNA number can be well approximated as being in steady state. We can express the ensemble-averaged number of mRNA of gene i as

$$\left\langle {m_i(t)} \right\rangle = k_0\phi _i\left\langle {n(t)} \right\rangle \tau .$$

(4)

Equation (3) then leads to the time-dependence of average ribosome number, $d\left\langle r \right\rangle {\mathrm{/}}dt = k_tf_a\phi _r\left\langle r \right\rangle$, reproducing the auto-catalytic nature of ribosome production and the growth rate

$$\mu = k_tf_a\phi _r,$$

(5)

determined by the relative abundance of active ribosomes in the proteome^30,32.

Similarly, the number of protein i grows as $d\left\langle {p_i} \right\rangle {\mathrm{/}}dt = k_tf_a\phi _i\left\langle r \right\rangle$. As the cell grows and divides, the dynamics becomes insensitive to the initial conditions, so the protein number will grow exponentially as well²¹. The ratio between the averages of two protein numbers in the steady state is set by the ratio of their production rate, therefore $\langle {p_i} \rangle {\mathrm{/}}\langle {p_j} \rangle = \phi _i{\mathrm{/}}\phi _j$. The average number of mRNA traces the number of RNA polymerases according to Eq. (4), and therefore also grows exponentially. Throughout the cell cycle we have

$$\left\langle {m_i(t)} \right\rangle = m_b(i){\mathrm{exp}}\left( {\mu t} \right),$$

(6a)

$$\left\langle {p_i(t)} \right\rangle = p_b(i){\mathrm{exp}}\left( {\mu t} \right),$$

(6b)

where m_b(i) (p_b(i)) is the number of mRNA (protein) of gene i at cell birth.

We denote the concentrations of mRNA and protein of gene i as $c^{m}_{i}$ = m_i/V and c_i = p_i/V respectively. According to Eqs. (1)–(3), the deterministic equations of the above variables become (see details in Methods)

$$\frac{{dc_i}}{{dt}} \approx \mu \left( {\phi _i - c_i} \right).$$

(7a)

$$\frac{{dc^m_i}}{{dt}} \approx \frac{1}{\tau }\left( {k_0\phi _i\phi _n\tau - c^m_i} \right).$$

(7b)

A fixed point exists for the dynamics of c_i and $c^{m}_{i}$, namely $c_i=\phi_i$ and $c^m_i=k_0 \phi_i\phi_n \tau$. This fixed point is stable due to the global nature of RNAPs and ribosomes: any noises arising from the copy number of RNAPs (ribosomes) equally affect all mRNAs (proteins), and therefore leave the relative fraction of one type of mRNA (protein) in the total pool of mRNAs (proteins) invariant. The average concentrations of mRNA and protein of gene i become $\left\langle {c_i} \right\rangle = \phi _i$, and $\left\langle {c^m_i} \right\rangle = k_0\tau \phi _i \phi _n$. The results are independent of the cell volume and genome copy number agreeing with experimental data on various organisms^18,19,20,22.

We take cell division explicitly into account and, for concreteness, use the “adder” model for cell division by considering an initiator protein I. The initiator protein accumulates from cell birth, triggers the cell division once it reaches the division threshold I_c and is then destroyed (or “reset”, e.g., after initiation of DNA replication in bacteria, the ATP-bound DnaA is dephosphorylated to the ADP-bound form)^46,47,48. During a division event, we assume proteins and mRNAs are divided between the two daughter cells following a binomial distribution⁴⁹. The initiator protein sets the scale of absolute protein number, and the average number of proteins produced in one cell cycle is equal to Δ(i) = I_cϕ_i/ϕ_I⁴⁷. Since the protein number grows twofold during one cell cycle, the average protein number of gene i at cell birth is p_b(i) = I_cϕ_i/ϕ_I and the corresponding average mRNA number at cell birth is m_b(i) = k₀I_cτϕ_iϕ_n/ϕ_I. We remark that the exact molecular mechanism of cell division does not affect our results.

We corroborate the above analytical calculations with numerical simulations. These will also capture the stochastic fluctuations in gene expression levels, which are not included in the previous analysis. Due to the short lifetime of mRNAs, the production of proteins can be approximated by instantaneous bursts²⁴. We introduce the burst size parameter b₀ as the average number of proteins made per burst, $b_0$ = $k_tf_a\langle {r(t)} \rangle {\mathrm{/}}\langle {\mathop {\sum}\nolimits_j {\kern 1pt} m_j} \rangle \times \tau$ ≈ $k_tf_a\phi _r{\mathrm{/}}\left( {k_0\phi _n} \right)$, independent of the cell volume. ϕ_i for N = 200 proteins are uniformly sampled in logarithmic space, with the sum over ϕ_i (including ribosome and RNAP) constraint to be precisely one. We choose the parameters to be biologically relevant for bacteria: the doubling time T = ln(2)/μ = 150 min, r_b = 10⁴, n_b = 10³, b₀ = 0.8, I_c = 20, ϕ_r = 0.2, f_a = 0.7 and τ = 3.5 min, see other numerical details in Methods. Our conclusions are independent of the specific choice of parameters.

In Fig. 2a, we show the typical trajectories from our simulations of cell volume, protein number and mRNA number for the same gene over multiple generations. To verify the exponential growth of protein and mRNA, we average the protein and mRNA numbers given a fixed relative phase in the cell cycle progression, which is normalized by the generation time and changes from 0 to 1. The averaged values of protein and mRNA numbers (circles) are well predicted by exponential growth, Eqs. (6a) and (6b) (black lines) without any fitting parameters, as shown in Fig. 2b with 3 single trajectories in the background. We also simulate a regulated gene with a time-dependent gene copy number and obtain qualitatively similar results (Methods, Supplementary Fig. 1).

The corresponding trajectories of protein and mRNA concentrations are shown in Fig. 2c, with bounded fluctuations around the predicted averaged values (black lines). In contrast, if the protein number and cell volume grow exponentially but independently, the ratio between them will diverge as the effects of noise accumulate, exhibiting a random walk behavior (Fig. 2d). Considering the cell cycle dependence of mRNA number and the homeostasis of protein concentration throughout the cell cycle, the experimental observation in Escherichia coli showing negligible correlations between mRNA number and protein concentration⁵⁰ is consistent with our model, and not contradictory to the strong correlation of mRNA concentration and protein concentration⁵¹.

Within our model, we may also study the protein number dynamics: how does the protein number at cell division correlate with that at cell birth? We find that the correlations follow an “adder” (i.e. the number of new proteins added is uncorrelated with the number at birth), as shown in Fig. 2e. While this has been quantified in various organisms with respect to cell volume^{8,9,11,52,53,54}, checking correlations between protein content at cell birth and division has received significantly less attention^55,56. Related to this, we study the auto-correlation function of protein concentration in time. We find that the auto-correlation function is approximately exponential, with a correlation time bounded from below by the doubling time (Supplementary Fig. 2). Both of these results provide experimentally testable predictions.

Effects of finite duration of gene replication

So far, we considered a constant ϕ_i throughout the cell cycle assuming an instantaneous replication of the genome. In this section, we relax this condition and study the effects of finite DNA replication time. We consider the bacterial model of DNA replication, specifically, E. coli, for which the mechanism of DNA replication is well characterized⁵⁷. The duration of DNA replication is constant, and defined as the C period. The corresponding cell division follows after an approximately constant duration known as the D period. Details of the DNA replication model are in the Methods. In Fig. 3a, we show the time trajectories of the gene allocation fraction, mRNA concentration and protein concentration of one gene for a doubling time of T = 30 min with C + D = 70 min. The DNA replication introduces a cell cycle dependent modulation of ϕ_i. The abrupt increase of ϕ_i corresponds to the replication of the specific gene i (Fig. 3a) ϕ_i → 2ϕ_i. However, as other genes are replicated, the relative fraction of gene i in the total genome decreases. This modulation propagates to the mRNA concentration which essentially tracks the dynamics of ϕ_i due to its short lifetime. The modulation of mRNA concentration affects the protein concentration as well, yet with a much smaller amplitude. These results can be tested experimentally by monitoring the DNA replication process and mRNA concentration simultaneously.

Noise in gene expression can be classified as intrinsic and extrinsic noise⁵⁸. While intrinsic noise is due to the stochastic nature of the chemical reactions involved in gene expression, extrinsic noise is believed to be due to the fluctuations of external conditions and common to a subset of proteins. Experiments have revealed a global extrinsic noise that affects all protein concentrations in the genome^50,59,60. Because all genes are subjected to the finite duration of DNA replication, it is tempting to attribute the finite duration of DNA replication as one of the main sources of global extrinsic noise³⁴. Within our model in the previous section (constant ϕ_i’s throughout the cell cycle), there is no global extrinsic noise (Supplementary Fig. 3). A global extrinsic noise may emerge after we introduce the time-dependent ϕ_i due to DNA replication. However, we find that the coefficient of variation (CV, the ratio between standard deviation and mean) of the most highly expressed proteins is only about 0.02 within the growing cell model (Fig. 3b), much smaller than that found in experiments^50,59. We note that a small extrinsic noise due to gene replication is also observed in constant rate models^26,27. Moreover, recent experiments and modeling have suggested that a significant part of the extrinsic noise of mRNA expression level can be attributed to the fluctuations of RNAP copy number²⁸. Within our model, RNAP level fluctuations will lead to extrinsic noise in mRNA concentrations.

A unified phase diagram of gene expression and cellular growth

Experimental observations on E. coli³⁰ and budding yeast³² support our assumption that ribosomes are limiting for translation. Experimental observations on plant and mammalian cells^18,19,20 and fission yeast³⁷ are also consistent with our assumption that RNA polymerase is limiting for transcription. However, as we discussed in the first section, in the same experiments on fission yeast³⁷ DNA became limiting for transcription at low DNA concentration. Therefore, we cannot exclude the possibility that in some cases because RNAPs are too abundant, DNA becomes the limiting resource for transcription rather than the number of RNAPs. Similarly, when ribosomes are too abundant relative to the transcript number, the limiting factor for translation becomes the transcript number rather than ribosome number.

In this section, we generalize our model by assuming that each gene has an upper bound on the number of RNAPs (n_s) than can simultaneously work on it. A possible extreme case is that the gene is fully loaded with RNAPs, on which RNAPs are only constrained by steric hindrance. The same assumption is made for mRNA with an upper bound of ribosomes (r_s) that can work on it simultaneously. We remark that the exact mechanism of DNA and mRNA saturation is beyond our coarse-grained model. If the number of RNAP (ribosome) is above the upper bound, the transcription (translation) rate is limited by the gene (mRNA) number, in a similar fashion to the constant rate models.

We define the protein-to-DNA ratio (PTD ratio) as the sum of protein numbers divided by the sum of effective gene numbers,

$$\gamma = \mathop {\sum}\limits_i {\kern 1pt} p_i{\mathrm{/}}\mathop {\sum}\limits_i {\kern 1pt} g_i.$$

(8)

As the PTD ratio becomes larger, e.g., due to a sufficiently large cell volume with a fixed number of gene, the number of RNAPs (ribosomes) will exceed the maximum load the total genes (mRNAs) can hold. We have discussed thoroughly Phase 1 (neither DNA nor mRNA is saturated) earlier and we summarize our predictions on the transition from Phase 1 to other phases in the following.

Phase 2: In phase 2, the limiting factor in transcription becomes the gene copy number and the transcription rate is proportional to the gene copy number (Fig. 4b). The threshold PTD ratio for the transition from Phase 1 to Phase 2 is (Methods),

$$\gamma _1 = \frac{{n_s}}{{\phi _n}}.$$

(9)

Here n_s is the upper bound of the number of RNAPs that can work on one gene and ϕ_n is the gene allocation fraction of RNAP. Because mRNA is not saturated, the protein number and the cell volume grow exponentially with the same growth rate as Phase 1, Eq. (5), and the homeostasis of protein concentration is still valid. However, because the production rate of mRNA is now proportional to the gene copy number, the mRNA concentration is not constant anymore as the cell volume grows (Methods). In Phase 2, even though the transcription rate doubles after the genome is replicated, the translation rate is proportional to the relative fraction of mRNA in the total pool of mRNAs and therefore still independent of the genome copy number. The average protein concentration is equal to the gene allocation fraction ($\left\langle {c_i} \right\rangle = \phi _i$). Recent proposed theoretical models of gene expression are consistent with this phase⁶¹. In terms of transcription, our model in Phase 2 is equivalent to constant rate models and we have confirmed that for both bacteria and mammalian cells, the typical lifetime of mRNA is short enough compared with the doubling time to distinguish Phase 1 and Phase 2 (Supplementary Fig. 4).

Phase 3: As the cell volume becomes larger, mRNA may get saturated as well. The limiting factor for translation is now the mRNA copy number (Fig. 4c). The threshold PTD ratio for the transition from Phase 2 to Phase 3 is (Methods)

$$\gamma _2 = \frac{{k_0\tau r_sn_s}}{{\phi _r}}.$$

(10)

Here r_s is the upper bound of the number of ribosomes that can work on one mRNA. In this phase, the translation rate is proportional to the mRNA number and the protein number grows linearly as $\dot p_i = k_tk_0g_i\tau n_sr_s$, with a linear growth rate proportional to the gene number. Therefore, within the assumption that the cell volume is determined by the total protein number, the cell volume grows linearly as well with the linear growth rate proportional to the total gene number,

$$\mu _l = k_tk_0\tau n_sr_s\mathop {\sum}\limits_i {\kern 1pt} g_i,$$

(11)

and therefore proportional to the genome copy number. As in Phase 2, the mRNA concentration decreases as the cell volume grows, however, the protein concentration is still constant with the average protein concentration equal to the gene allocation fraction ($\left\langle {c_i} \right\rangle = \phi _i$, Methods). In Phase 3, even though the cell volume grows linearly, the population still grows exponentially with a population growth rate. However, there is no general relation between the ribosomal fraction in the proteome and the population growth rate, in contrast to the growth law in Phase 1 and 2. We summarize the predicted phase diagram of cellular growth in Fig. 4a.

To gain some sense regarding the parameters associated with our proposed phase diagram, we estimate the PTD ratio of E. coli. Considering the typical cell volume of E. coli as 1 μm³, the protein density as 3 × 10⁶proteins/μm³ and the total number of protein-coding genes in E. coli as 4000⁶², we estimate the protein-to-DNA ratio for E. coli as γ ~ 1000. Estimates of the two threshold values of PTD ratios (see Methods) suggest that γ₁ ~ 1500 and γ₂ ~ 20,000.

These estimates suggest that wild-type E. coli cells are found in Phase 1, but close to Phase 2. We remark that the actual threshold values of PTD ratio for the transitions between different growth phases may be affected by other factors, e.g., the heterogeneous size of genes, but we propose that the general scenario of the transition from Phase 1 to Phase 3 as the protein-to-DNA ratio increases should be generally applicable. As the PTD ratio increases, we predict a transition from exponential growth to linear growth for protein number and cell volume (Supplementary Fig. 5). We propose future experiments to study the potential transition from exponential to linear growth of cell volume, for example using filamentous E. coli where cell division and gene replication are inhibited. Similar experiments can also be done for larger cells, e.g., mammalian cells, in which the transition from exponential growth to linear growth of cell volume may be easier to achieve. Preliminary results from experiments measuring the growth of cell mass of mammalian cells⁶³ and yeast cells⁶⁴ indeed show a crossover from exponential growth to linear growth when the cell mass is above a threshold value, consistent with our prediction.

It has been shown in bacteria that there are excess RNAPs nonspecifically bound to DNA^40,41. In the Methods, we consider a modified model taking into account the partitioning of RNAPs to free RNAPs, elongating RNAPs, promoter-bound RNAPs and nonspecifically bound RNAPs. The transcription rate is determined by the concentration of free RNAPs through Michaelis-Menten kinetics^40,65. We find that our conclusions remain intact with an approximately constant fraction of actively transcribing RNAPs in the total RNAPs for Phase 1 (Supplementary Fig. 6). The effect of nonspecifically bound RNAPs is therefore to renormalize the transcription constant k₀ in Phase 1 (Eq. (1)) by a constant factor. The transition from Phase 1 to Phase 2 is qualitatively unaffected (Supplementary Fig. 7) and the threshold PTD ratio γ₁ (Eq. (9)) from Phase 1 to Phase 2 is changed by a constant factor (Methods). We note that alternative mechanisms of gene saturation can occur upon introducing the different classes of RNAPs, through the saturation of free RNAPs and the Michaelis-Menten kinetics (Methods).

Discussion

In this work, we propose a coarse-grained model of stochastic gene expression incorporating cell volume growth and cell division. In the first part, we consider the biological scenario that RNAPs are limiting for transcription and ribosomes are limiting for translation. In other words, neither DNA nor mRNA is saturated. We find that the limiting nature of ribosomes in the translation process leads to the exponential growth of protein numbers. The limiting nature of RNA polymerase and its exponential growth lead to the exponential growth of mRNA numbers. Homeostasis of protein concentrations originates from the fact that ribosomes make all proteins. Homeostasis of mRNA concentration comes from the resulting bounded concentration of RNAPs. Our model is consistent with the constancy of mRNA and protein concentration as the genome copy number varies since the transcription rate depends on the relative fraction of genes in the genome rather than its absolute number²².

During DNA replication, we find that the gene allocation fraction ϕ_i for one specific gene doubles after the gene is replicated but decreases afterwards since other genes are replicated as well and compete for RNAPs. This prediction can be tested by monitoring the mRNA concentration and the copy number of one gene throughout the cell cycle. Furthermore, we extend our model to more general cases in which DNA and mRNA can be saturated by an excess of RNAP and ribosome. We find three possible phases of cellular growth as the protein-to-DNA ratio γ increases. A transition from exponential growth to linear growth of protein number and cell volume is predicted. In the future, it will be interesting to study the interplay between the global interactions which are the focus of this work and local interactions between genes. Our model provides an alternative model to constant rate models to study genetic networks, which would be advantageous when cell cycle effects are important.

Methods

Derivation of protein and mRNA concentrations

We define the fraction of mRNA i in the total mRNA pool as $f_i = m_i{\mathrm{/}}\mathop {\sum}\nolimits_j {\kern 1pt} m_j$, and the concentration of mRNA and protein of gene i as $c^{m}_{i}$ = m_i/V, c_i = p_i/V. We denote the RNAP and ribosome concentrations as c_n and c_r, respectively. According to Eqs. (1)–(3), the deterministic equations of the above variables then become

$$\frac{{df_i}}{{dt}} = \frac{{k_0n}}{{\mathop {\sum}_j {\kern 1pt} m_j}}\left( {\phi _i - f_i} \right)$$

(12)

$$\frac{{dc_i}}{{dt}} = k_tc_rf_a\left( {f_i - c_i} \right) \approx \mu \left( {f_i - c_i} \right).$$

(13)

$$\frac{{dc^m_i}}{{dt}} = \frac{1}{\tau }\left( {k_0\phi _ic_n\tau - \left( {1 + \mu \tau } \right)c^m_i} \right).$$

(14)

Using the condition that mRNA degradation time is much smaller than the doubling time ($\mu \tau \ll 1$), we find the fixed points for the dynamics of f_i, c_i, and $c^{m}_{i}$. These are, $f_i=c_i=\phi_i$ and $c^m_i=k_0\phi_i c_n\tau$. Replacing f_i by ϕ_i and c_n by ϕ_n, we obtain the approximate version of the above equations, Eqs. (7a) and (7b).

Simulations of independent growth model

In the growth model corresponding to Fig. 2d, we assume the protein number and cell volume grow exponentially and independently,

$$\frac{{dp}}{{dt}} = \left( {1 + \xi _p(t)} \right)p$$

(15)

$$\frac{{dV}}{{dt}} = \left( {1 + \xi _V(t)} \right)V.$$

(16)

Here, ξ_p(t), ξ_V(t) are white noise terms, with the auto-correlation function, $\left\langle {\xi _{p,V}(0)\xi _{p,V}(t)} \right\rangle$ = $A_{p,V}\delta (t)$. In Fig. 2d of the main text, we choose A_p = A_V = 1.

Simulations of growing cell model

We simulated Eqs. (1)–(3), fixing r_b, n_b, b₀, ϕ_r, f_a, I_c, τ as well as the growth rate μ. Other parameters are inferred given the above values, e.g., ϕ_n = n_bϕ_r/r₀, k_t = μ/(ϕ_rf_a), k₀ = k_tf_ar_b/(b₀n_b). We fix the time step δt so that the probability for one event to happen during a time step is smaller than 0.1. We track one of the daughter cells after cell division.

Gene dosage effects

In reality, the gene allocation fraction ϕ_i changes during the cell cycle due to the finite duration of DNA replication. In this section we introduce the modified version of the gene expression model incorporating DNA replication. Although our model is general, we focus on DNA replication in bacteria for concreteness, specifically E. coli where this process is very well characterized. We expect our conclusions to be generally valid. Furthermore, we refine our model for cell division, assuming that the initiator protein triggers the initiation of DNA replication rather than cell division, with the threshold I_c proportional to the number of origins of replication^57,66 (the number of which doubles at each initiation). We assume that the cell division takes place a fixed time C + D after initiation of the DNA replication, where C, D are respectively the time for DNA replication and the time between the completion of DNA replication and cell division. The number of origins reduce by half at each cell division. Other details are the same as in the main text. Each gene doubles its copy number during the C period, and we choose this gene replication time to be randomly and uniformly distributed across all genes. When a gene i replicates,

$$\phi _i \to 2\phi _i$$

(17)

$$\phi _j \to \frac{{\phi _j}}{{\mathop {\sum}_k {\kern 1pt} \phi _k}},$$

(18)

where the second equation accounts for the normalization of the gene allocation fraction. We choose the experimentally reported C and D and cell doubling time from ref.⁵⁷. In Fig. 3a, we simulate the model by tracking one daughter cells. In Fig. 3b, we track all the cells in an exponentially growing population, which starts from 100 cells to 5000 cells.

Simulations of gene activation

We generalize the constitutive expressed genes considered in the main text to include a single regulated gene by considering a random telegraph process of the effective gene copy number¹,

$$g_{i0}\begin{array}{*{20}{c}} {k_g^ - } \\ \rightleftharpoons \\ {k_g^ + \left( {c_{TF}} \right)} \end{array}0.$$

(19)

Here the gene deactivation rate $k_g^ -$ is constant, and the activation rate is set by the concentration of transcription factor through positive regulation, $k_g^ + = k_{g0}c_{TF}$. Here, k_g0 is constant. When gene i is active, the corresponding gene allocation fraction follows $\phi _i = g_{i0}{\mathrm{/}}\mathop {\sum}\nolimits_j {\kern 1pt} g_j$, and when it becomes deactivated ϕ_i = 0. Note that here we only consider one regulated gene i, but the changing gene allocation of gene i also affects other genes’ allocation fraction. We simulate the model in Phase 1, and the deactivation of gene i increases other genes’ allocation fraction as ϕ_j → ϕ_j/(1 − ϕ_i), with $\phi _i = g_{i0}{\mathrm{/}}\mathop {\sum}\nolimits_j {\kern 1pt} g_j$.

Simulated trajectories of gene allocation fraction, mRNA number, protein number and cell volume are shown in Supplementary Fig. 1.

General model of gene expression

We consider the generalized equation of mRNA number, Eq. (1) in the deterministic limit as

$$\dot m_i = \left\{ {\begin{array}{*{20}{l}} {k_0\phi _in - m_i{\mathrm{/}}\tau ,} \hfill & {{\mathrm{if}}} \hfill & {n < n_c,} \hfill \\ {k_0g_in_s - m_i{\mathrm{/}}\tau ,} \hfill & {{\mathrm{if}}} \hfill & {n \ge n_c.} \hfill \end{array}} \right.$$

(20)

Here n_c is the threshold number of RNAPs above which DNA starts to be saturated, in which case the transcription rate becomes proportional to the effective gene copy number g_i and independent of the RNAP number. For one gene, the maximum load of RNAP that it can hold is g_in_s, where n_s is the maximum number of RNAPs that a single copy of constitutively expressed gene (g_i = 1) can hold. n_c can be computed as

$$\phi _in_c = g_in_s \Rightarrow n_c = \mathop {\sum}\limits_i {\kern 1pt} g_in_s.$$

(21)

We also generalize the growth of protein number from Eq. (3) to

$$\dot p_i = \left\{ {\begin{array}{*{20}{l}} {k_t\frac{{m_i}}{{\mathop {\sum}_j {\kern 1pt} m_j}}r,} \hfill & {{\mathrm{if}}} \hfill & {r < r_c} \hfill \\ {k_tm_ir_s,} \hfill & {{\mathrm{if}}} \hfill & {r \ge r_c.} \hfill \end{array}} \right.$$

(22)

Here r_c is the maximum number of ribosomes above which mRNA starts to be saturated. We drop the fraction of actively working ribosomes since it is often a constant depending on the growth condition³⁰. r_s is the maximum number of ribosomes one mRNA can hold. We can calculate r_c as

$$\frac{{m_i}}{{\mathop {\sum}_j {\kern 1pt} m_j}}r_c = m_ir_s \Rightarrow r_c = \left\{ {\begin{array}{*{20}{l}} {k_0\tau nr_s,} \hfill & {{\mathrm{if}}} \hfill & {n < n_c} \hfill \\ {k_0\tau n_cr_s,} \hfill & {{\mathrm{if}}} \hfill & {n \ge n_c} \hfill \end{array}} \right.$$

(23)

Given Eqs. (20) and (22), we obtain four possible phases: (i) n < n_c, r < r_c, (ii) n > n_c, r < r_c, (iii) n > n_c, r > r_c, and (iv) n < n_c, r > r_c. Given a fixed value of ϕ_r and ϕ_n, either (ii) or (iv) is possible. Realization of (ii) requires that $n > \mathop {\sum}\nolimits_i {\kern 1pt} g_in_s$ and $r < k_0\tau r_s\mathop {\sum}\nolimits_i {\kern 1pt} g_in_s$, therefore

$$\frac{{\phi _n}}{{\phi _r}} > \frac{1}{{k_0\tau r_s}}.$$

(24)

In cases where Eq. (24) breaks down, a finite fraction of ribosomes are not utilized. This requires a large fraction of genes in the genome making ribosomes that cannot translate because mRNAs are saturated. Since ribosomes are typically more expensive to make than other proteins^30,33, we assume the biological scenario, Eq. (24) will be satisfied. From Eq. (21) and using $n{\mathrm{/}}\mathop {\sum}\nolimits_i {\kern 1pt} p_i = \phi _n$, we obtain the threshold PTD ratio for the transition from Phase 1 to Phase 2,

$$\gamma _1 = \frac{{n_s}}{{\phi _n}}.$$

(25)

In Phase 2, the average mRNA concentration becomes

$$\left\langle {c^m_i} \right\rangle = \frac{{k_0g_in_s\tau }}{V} = \frac{{k_0\phi _in_s\tau \mathop {\sum}\limits_i {\kern 1pt} g_i}}{V} = \frac{{k_0\phi _in_s\tau }}{\gamma },$$

(26)

which is inversely proportional to the protein-to-DNA ratio.

From Eq. (23) and using $r{\mathrm{/}}\mathop {\sum}\nolimits_i {\kern 1pt} p_i = \phi _r$, we obtain the transition PTD ratio from Phase 2 to Phase 3 as,

$$\gamma _2 = \frac{{k_0\tau r_sn_s}}{{\phi _r}}.$$

(27)

In Phase 3, the mRNA concentration is the same as Phase 2. Because the cell volume is the sum of all proteins, the protein concentration is the same as Phase 2 and Phase 1, $\left\langle {c_i} \right\rangle = g_i{\mathrm{/}}\mathop {\sum}\nolimits_i {\kern 1pt} g_i = \phi _i$.

Estimation of the threshold protein-to-DNA ratios for E. coli

We approximate the upper bound of RNAP number working on a single gene as roughly equal to the number of RNAPs on a typical gene (~10³ base pairs) when half of the gene is occupied. The linear size of RNAP is about 5 nm, and the length of one base pair is about 0.3 nm, leading to the estimate n_s ~ 30. A similar calculation for the upper bound of ribosome on a single mRNA leads to r_s ~ 10 since ribosome’s linear size is about 3 times larger than RNAP⁶².

We take ϕ_r ≈ 0.2 according to the ref.³⁰, and estimate the gene allocation fraction of RNAP to be ϕ_n ~ 0.02 since the number of RNAPs in E. coli is roughly 10% of the number of ribosomes⁶². We estimate the life time of mRNA as 5 min⁶².

We estimate the transcription rate of one RNAP by considering two potential limiting steps in transcription and taking the slower one. First, assuming the initiation of transcription is diffusion limited, we could estimate the time scale for one RNAP to bind the transcription site as Δt ~ 1 μm²/(0.2 μm²/s) ~ 5 s using the measured diffusion constant of RNAP^41,67. Second, we could also estimate the elongation time as the typical length of gene divided by the elongation rate of RNAP, Δt ~ 1000 nt/50(nt/s) ~ 20 s⁶². Taking the slower time scale from the above two calculations, we estimate k₀ ≈ 0.05 s⁻¹. Finally, we compute γ₁ and γ₂ using the above estimated parameters, and obtain γ₁ ~ 1500, γ₂ ~ 20,000.

Effect of nonspecifically bound RNAPs

Previous studies on bacteria have shown that there are excess RNAPs bound nonspecifically to the genome and modeled their kinetics^40,41. In this section, we consider a modified model to take into account nonspecifically bound RNAPs. For our purpose, we consider a simplified model with four classes of RNAPs, namely, (i) elongating RNAPs, n_e (ii) RNAPs bound to a promoter, n_p (iii) RNAPs nonspecifically bound to DNA, n_ns (iv) free RNAPs, n_free. We assume a Michaelis-Menten relation for the number of promoter-bound RNAPs and nonspecifically bound RNAPs^40,65,

$$n_p = G\frac{{c_{free}}}{{c_{free} + K_s}},$$

(28)

$$n_{ns} = Gg_{ns}\frac{{c_{free}}}{{c_{free} + K_{ns}}}.$$

(29)

Here c_free is the concentration of free RNAPs and K_s, K_ns are the Michaelis constants. G is the total number of genes and g_ns is the number of nonspecific binding sites per gene. Note that c_free = c_nF_free, with c_n the concentration of total RNAPs and F_free the fraction of free RNAPs in the total RNAP pool. For simplicity, we assume one promoter for each gene.

The number of elongating RNAPs is then given by:

$$n_e = n_pk_{cat}\tau _0 = n_p{\mathrm{\Lambda }},$$

(30)

where k_cat is the transition rate from promoter-bound RNAPs to elongating RNAPs, τ₀ is the time for transcribing a gene, and Λ = k_catτ₀ is the ratio between the number of elongating RNAPs and promoter-bound RNAPs⁴⁰. We consider parameter regimes motivated by typical biological scenarios, in particular (1) the number of sites for nonspecific binding is much larger than the number of promoters, (2) nonspecific binding of RNAPs to DNA is much weaker than the specific binding of RNAPs to promoters, (3) the number of promoter-bound RNAPs is small compared with elongating RNAPs.

Using n = n_e + n_p + n_ns + n_free, we can find a self-consistent equation for F_free,

$$c_n\gamma \left( {1 - F_{free}} \right) = (1 + {\mathrm{\Lambda }})\frac{{c_nF_{free}}}{{c_nF_{free} + K_s}} + g_{ns}\frac{{c_nF_{free}}}{{c_nF_{free} + K_{ns}}}.$$

(31)

Here γ is the protein-to-DNA (PTD) ratio, i.e., the ratio between the total number of proteins (equivalent to cell volume V within our model) and the total gene number, γ = V/G. We can use Eq. (31) to compute the fraction of free RNAPs given a PTD ratio and use Eqs. (28)–(30) to compute the fraction of elongating RNAPs, F_e and nonspecifically bound RNAPs, F_ns.

Since the left side of Eq. (31) monotonically decreases from c_nγ to 0 as F_free increases from 0 to 1 and the right side of Eq. (31) monotonically increases as F_free increases, we find that as the protein-to-DNA ratio increases, F_free increases. Previous studies have shown that F_free ≈ 0.1 for bacteria^40,41 and support the assumption that c_free is smaller or comparable to K_s^40,65 (note that nonspecific binding is characterized by K_ns larger than K_s). In the following, we first assume a small F_free and $c_{free} \ll K_s$, and show this leads to a behavior qualitatively equivalent to Phase 1 of the main text, albeit with a renormalization of the transcription constant k₀. We also later discuss the situations when these conditions break down, and show that they lead to behavior consistent with Phase 2. The transcription rate for one specific gene with an effective gene copy number g_i can be written as

$$TR_i = \frac{1}{{\tau _0}}g_i\frac{{n_e}}{G} = \frac{1}{{\tau _0}}\phi _inF_e$$

(32)

Here g_i/G = ϕ_i is the gene allocation fraction of gene i, and F_e is the fraction of elongating RNAPs in the total RNAP pool. In the limit of a small F_free and $c_{free} \ll K_s$, F_e becomes a constant:

$$F_e \approx \frac{{n_e}}{{n_e + n_p + n_{ns}}} = \frac{{\mathrm{\Lambda }}}{{1 + {\mathrm{\Lambda }} + g_{ns}K_s{\mathrm{/}}K_{ns}}}.$$

(33)

Therefore, the transcription rate corresponds to Phase 1 of our model, $TR_i = \tilde k_0 \times \phi _i \times n$ with $\tilde k_0 = k_0F_e$.

In the limit that all RNAPs are actively transcribing and no nonspecifically bound RNAPs, ${\mathrm{\Lambda }} \gg 1$ and g_ns = 0, we go back to the situation that all RNAPS are working with $\tilde k_0 = 1{\mathrm{/}}\tau _0 = k_0$. Therefore we conclude that the introduction of nonspecifically bound RNAPs does not affect our model qualitatively, and its effect is to renormalize the transcription constant k₀ in Eq. (1) by a constant factor, $\tilde k_0 = k_0F_e$.

We simulate a single lineage of growing cells using the full model (with partitioning of RNAPs and gene replication). We set the parameters as: Λ = 50, g_ns = 1000, k₀ = 1 min⁻¹, G = 2000 (before gene duplication), K_s = 0.02〈c〉, K_ns = 0.8〈c〉, where 〈c〉 is the total protein concentration within the cell which we set to 1 throughout the paper. Our conclusions are independent of the specific values of parameters. The gene allocation fractions are the same as the main text and the average RNAP concentration during the cell cycle 〈c_n〉 = ϕ_n = 0.02. The fractions of elongating RNAPs and nonspecifically bound RNAPs are approximately constant with a small cell cycle modulation (the coefficient of variation is of the order of 0.01), consistent with the above results since F_free is small (Supplementary Fig. 6a). We also find a linear scaling between mRNA number and cell volume, consistent with Phase 1 of our model (Supplementary Fig. 6b).

We next consider the transition from Phase 1 (RNAP limiting) to Phase 2 (gene limiting). Assuming the saturation of genes is due to the steric hindrance of elongating RNAPs with a minimum distance between two successive RNAPs, we can find the threshold PTD ratio from Phase 1 to Phase 2. Since the fraction of elongating RNAPs is constant, we can compute the threshold PTD ratio using nF_e/G = n_s (where n_s is the upper bound of the number of RNAPs that can work on one gene simultaneously), and obtain

$$\tilde \gamma _1 = \frac{{n_s}}{{\phi _n}}\frac{{1 + {\mathrm{\Lambda }} + g_{ns}K_s{\mathrm{/}}K_{ns}}}{{\mathrm{\Lambda }}}.$$

(34)

We find that the introduction of nonspecifically bound RNAPs does not affect our main results qualitatively and it changes the threshold PTD ratio from Phase 1 to Phase 2, Eq. (9), by a constant factor, which is the inverse of the fraction of elongating RNAPs.

In Supplementary Fig. 7, we show the transition from Phase 1 to Phase 2 in the proposed experiment in which cell division is blocked, in the presence of nonspecific binding. At each point in time we solve the self-consistent equation of the partitioning of RNAPs, Eq. (31). The plot shows both the deterministic dynamics of average mRNA number (black lines) as well as the results of the stochastic simulations (red/blue solid lines). We compare the dynamics for two cells with different fixed genome sizes. Initially, the two cells are in Phase 1 and therefore have identical mRNA number proportional to cell volume. When they exceed their respective threshold PTD ratio, the mRNA number begins to saturate and becomes limited by the gene number, therefore the cell with a twice larger genome size has twice more mRNAs (Phase 2).

So far we assumed that F_free is small and c_free is smaller than the Michaelis constant. In the following we relax these assumptions. We find that the introduction of the different RNAP classes introduces alternative mechanisms for gene saturation, that lead to behavior consistent with Phase 2.

We first relax the assumption that $F_{free} \ll 1$ (assuming that $c_{free} \ll K_s$ still holds). Because the transcription rate for a specific gene is

$$TR_i = \frac{1}{{\tau _0}}g_i{\mathrm{\Lambda }}\frac{{c_nF_{free}}}{{K_s}},$$

(35)

when F_free is comparable to 1, the transcription rate will be saturated as well and proportional to the gene copy number g_i. From Eq. (31), we find that $F_{free}$ ≈ $\gamma {\mathrm{/}}\left( {{\textstyle{{1 + {\mathrm{\Lambda }}} \over {K_s}}} + {\textstyle{{g_{ns}} \over {K_{ns}}}} + \gamma } \right)$, therefore the threshold PTD ratio for F_free to be comparable to 1 is

$$\gamma _{1,F} = \frac{{1 + {\mathrm{\Lambda }}}}{{K_s}} + \frac{{g_{ns}}}{{K_{ns}}}.$$

(36)

Second, we can relax the assumption that $c_{free} \ll K_s$ (assuming $F_{free} \ll 1$ still holds). When c_free ≈ K_s, the transcription rate can be saturated as well. Using $F_{free}$ ≈ $\gamma {\mathrm{/}}\left( {{\textstyle{{1 + {\mathrm{\Lambda }}} \over {K_s}}} + {\textstyle{{g_{ns}} \over {K_{ns}}}}} \right)$, we can obtain the corresponding threshold PTD ratio as

$$\gamma _{1,M} = \frac{{1 + {\mathrm{\Lambda }} + g_{ns}K_s{\mathrm{/}}K_{ns}}}{{\phi _n}}$$

(37)

We remark that the particular mechanism which drives the cell to Phase 2 (gene limiting) is the one with the smallest threshold. Comparison between Eqs. (34), (36) and (37) shows that when n_s/ϕ_n < Λ/K_s and n_s < Λ, genes get saturated due to steric hindrance.

Reference⁶⁸ shows that for wild type E. coli in fast growth conditions the mRNA levels in the cell do not change when the DNA amount is lowered. Within our model this is consistent with Phase 1, and inconsistent with Phase 2, thus suggesting that $F_{free} \ll 1$ and $c_{free} \ll K_s$ as discussed earlier.

Data availability

The data that support the findings of this study are available from the authors on request.

References

Paulsson, J. Models of stochastic gene expression. Phys. Life Rev. 2, 157–175 (2005).
Article ADS Google Scholar
Kærn, M., Elston, T. C., Blake, W. J. & Collins, J. J. Stochasticity in gene expression: from theories to phenotypes. Nat. Rev. Genet. 6, 451–464 (2005).
Article Google Scholar
Pedraza, J. M. & van Oudenaarden, A. Noise propagation in gene networks. Science 307, 1965–1969 (2005).
Article ADS CAS Google Scholar
Raj, A. & van Oudenaarden, A. Nature, nurture, or chance: stochastic gene expression and its consequences. Cell 135, 216–226 (2008).
Article CAS Google Scholar
Thattai, M. & Van Oudenaarden, A. Intrinsic noise in gene regulatory networks. Proc. Natl Acad. Sci. 98, 8614–8619 (2001).
Article ADS CAS Google Scholar
Sanchez, A. & Golding, I. Genetic determinants and cellular constraints in noisy gene expression. Science 342, 1188–1193 (2013).
Article ADS CAS Google Scholar
Wang, P. et al. Robust growth of escherichia coli. Curr. Biol. 20, 1099–1103 (2010).
Article CAS Google Scholar
Campos, M. et al. A constant size extension drives bacterial cell size homeostasis. Cell 159, 1433–1446 (2014).
Article CAS Google Scholar
Taheri-Araghi, S. et al. Cell-size control and homeostasis in bacteria. Curr. Biol. 25, 385–391 (2015).
Article CAS Google Scholar
Cermak, N. et al. High-throughput measurement of single-cell growth rates using serial microfluidic mass sensor arrays. Nat. Biotechnol. 34, 1052–1059 (2016).
Article CAS Google Scholar
Eun, Y.-J. et al. Archaeal cells share common size control with bacteria despite noisier growth and division. Nat. Microbiol. 3, 148 (2018).
Article CAS Google Scholar
Elliott, S. & McLaughlin, C. Rate of macromolecular synthesis through the cell cycle of the yeast saccharomyces cerevisiae. Proc. Natl Acad. Sci. 75, 4384–4388 (1978).
Article ADS CAS Google Scholar
Cooper, S. Distinguishing between linear and exponential cell growth during the division cycle: single-cell studies, cell-culture studies, and the object of cell-cycle research. Theor. Biol. Med. Model. 3, 10 (2006).
Article Google Scholar
Di Talia, S., Skotheim, J. M., Bean, J. M., Siggia, E. D. & Cross, F. R. The effects of molecular noise and size control on variability in the budding yeast cell cycle. Nature 448, 947–951 (2007).
Article ADS Google Scholar
Godin, M. et al. Using buoyant mass to measure the growth of single cells. Nat. Methods 7, 387–390 (2010).
Article CAS Google Scholar
Sinclair, W. K. & Ross, D. W. Modes of growth in mammalian cells. Biophys. J. 9, 1056–1070 (1969).
Article ADS CAS Google Scholar
Crissman, H. A. & Steinkamp, J. A. Rapid, simultaneous measurement of dna, protein, and cell volume in single cells from large mammalian cell populations. J. Cell. Biol. 59, 766 (1973).
Article CAS Google Scholar
Kempe, H., Schwabe, A., Crémazy, F., Verschure, P. J. & Bruggeman, F. J. The volumes and transcript counts of single cells reveal concentration homeostasis and capture biological noise. Mol. Biol. Cell. 26, 797–804 (2015).
Article Google Scholar
Padovan-Merhar, O. et al. Single mammalian cells compensate for differences in cellular volume and dna copy number through independent global transcriptional mechanisms. Mol. Cell 58, 339–352 (2015).
Article CAS Google Scholar
Ietswaart, R., Rosa, S., Wu, Z., Dean, C. & Howard, M. Cell-size-dependent transcription of flc and its antisense long non-coding rna coolair explain cell-to-cell expression variation. Cell Syst. 4, 622–635.e9 (2017).
Article CAS Google Scholar
Brenner, N. et al. Single-cell protein dynamics reproduce universal fluctuations in cell populations. Eur. Phys. J. E 38, 1–9 (2015).
Article CAS Google Scholar
Zheng, X.-y & O’Shea, E. K. Cyanobacteria maintain constant protein concentration despite genome copy-number variation. Cell Rep. 19, 497–504 (2017).
Article CAS Google Scholar
Friedman, N., Cai, L. & Xie, X. S. Linking stochastic dynamics to population distribution: an analytical framework of gene expression. Phys. Rev. Lett. 97, 168302 (2006).
Article ADS Google Scholar
Shahrezaei, V. & Swain, P. S. Analytical distributions for stochastic gene expression. Proc. Natl Acad. Sci. 105, 17256–17261 (2008).
Article ADS CAS Google Scholar
Cai, L., Friedman, N. & Xie, X. S. Stochastic protein expression in individual cells at the single molecule level. Nature 440, 358–362 (2006).
Article ADS CAS Google Scholar
Marathe, R., Bierbaum, V., Gomez, D. & Klumpp, S. Deterministic and stochastic descriptions of gene expression dynamics. J. Stat. Phys. 148, 608–627 (2012).
Article ADS MathSciNet Google Scholar
Bierbaum, V. & Klumpp, S. Impact of the cell division cycle on gene circuits. Phys. Biol. 12, 066003 (2015).
Article ADS Google Scholar
Jones, D. L., Brewster, R. C. & Phillips, R. Promoter architecture dictates cell-to-cell variability in gene expression. Science 346, 1533–1536 (2014).
Article ADS CAS Google Scholar
Kolter, R., Siegele, D. A. & Tormo, A. The stationary phase of the bacterial life cycle. Annu. Rev. Microbiol. 47, 855–874 (1993).
Article CAS Google Scholar
Scott, M., Gunderson, C. W., Mateescu, E. M., Zhang, Z. & Hwa, T. Interdependence of cell growth and gene expression: origins and consequences. Science 330, 1099 (2010).
Article ADS CAS Google Scholar
Klumpp, S., Scott, M., Pedersen, S. & Hwa, T. Molecular crowding limits translation and cell growth. Proc. Natl Acad. Sci., 110, 16754–16759 (2013).
Article ADS CAS Google Scholar
Metzl-Raz, E. et al. Principles of cellular resource allocation revealed by condition-dependent proteome profiling. eLife 6, e28034 (2017).
Article Google Scholar
Reuveni, S., Ehrenberg, M. & Paulsson, J. Ribosomes are optimized for autocatalytic production. Nature 547, 293–297 (2017).
Article ADS CAS Google Scholar
Cole, J. A. & Luthey-Schulten, Z. Careful accounting of extrinsic noise in protein expression reveals correlations among its sources. Phys. Rev. E 95, 062418 (2017).
Article ADS Google Scholar
Neidhardt, F. C., Ingraham, J. L. & Schaechter, M. Physiology of the bacterial cell: a molecular approach 20 (Sinauer, Sunderland, 1990).
Google Scholar
Hui, S. et al. Quantitative proteomic analysis reveals a simple strategy of global resource allocation in bacteria. Mol. Syst. Biol. 11, 784 (2015).
Article Google Scholar
Zhurinsky, J. et al. A coordinated global control over cellular transcription. Curr. Biol. 20, 2010–2015 (2010).
Article CAS Google Scholar
Chandler, M. & Pritchard, R. The effect of gene concentration and relative gene dosage on gene output inescherichia coli. Mol. General. Genet. MGG 138, 127–141 (1975).
Article CAS Google Scholar
Schmidt, E. E. & Schibler, U. Cell size regulation, a mechanism that controls cellular rna accumulation: consequences on regulation of the ubiquitous transcription factors oct1 and nf-y and the liver-enriched transcription factor dbp. J. Cell. Biol. 128, 467–483 (1995).
Article CAS Google Scholar
Klumpp, S. & Hwa, T. Growth-rate-dependent partitioning of rna polymerases in bacteria. Proc. Natl Acad. Sci. 105, 20245–20250 (2008).
Article ADS CAS Google Scholar
Bakshi, S., Dalrymple, R. M., Li, W., Choi, H. & Weisshaar, J. C. Partitioning of rna polymerase activity in live escherichia coli from analysis of single-molecule diffusive trajectories. Biophys. J. 105, 2676–2686 (2013).
Article ADS CAS Google Scholar
Kubitschek, H. E., Baldwin, W. W., Schroeter, S. J. & Graetzer, R. Independence of buoyant cell density and growth rate in escherichia coli. J. Bacteriol. 158, 296–299 (1984).
CAS PubMed PubMed Central Google Scholar
Basan, M. et al. Inflating bacterial cells by increased protein synthesis. Mol. Syst. Biol. 11, 836 (2015).
Article Google Scholar
Bernstein, J. A., Khodursky, A. B., Lin, P.-H., Lin-Chao, S. & Cohen, S. N. Global analysis of mrna decay and abundance in escherichia coli at single-gene resolution using two-color fluorescent dna microarrays. Proc. Natl Acad. Sci. 99, 9697–9702 (2002).
Article ADS CAS Google Scholar
Wang, Y. et al. Precision and functional specificity in mrna decay. Proc. Natl Acad. Sci. 99, 5860–5865 (2002).
Article ADS CAS Google Scholar
Sompayrac, L. & Maaløe, O. Autorepressor model for control of dna replication. Nature 241, 133–135 (1973).
Article ADS CAS Google Scholar
Ghusinga, K. R., Vargas-Garcia, C. A. & Singh, A. A mechanistic stochastic framework for regulating bacterial cell division. Sci. Rep. 6, 30229 (2016).
Article ADS CAS Google Scholar
Bertaux, F., Von Kügelgen, J., Marguerat, S. & Shahrezaei, V. A unified coarse-grained theory of bacterial physiology explains the relationship between cell size, growth rate and proteome composition under various growth limitations. Preprint at https://www.biorxiv.org/content/early/2016/10/03/078998078998 (2016).
Golding, I., Paulsson, J., Zawilski, S. M. & Cox, E. C. Real-time kinetics of gene activity in individual bacteria. Cell 123, 1025–1036 (2005).
Article CAS Google Scholar
Taniguchi, Y. et al. Quantifying E. coli proteome and transcriptome with single-molecule sensitivity in single cells. Science 329, 533–538 (2010).
Article ADS CAS Google Scholar
Hilfinger, A., Norman, T. M. & Paulsson, J. Exploiting natural fluctuations to identify kinetic mechanisms in sparsely characterized systems. Cell Syst. 2, 251–259 (2016).
Article CAS Google Scholar
Amir, A. Cell size regulation in bacteria. Phys. Rev. Lett. 112, 208102 (2014).
Article ADS Google Scholar
Soifer, I., Robert, L. & Amir, A. Single-cell analysis of growth in budding yeast and bacteria reveals a common size regulation strategy. Curr. Biol. 8, 356–361 (2016).
Article Google Scholar
Ho, P.-Y., Lin, J. & Amir, A. Modeling cell size regulation: From single-cell-level statistics to molecular mechanisms and population-level effects. Annu. Rev. Biophys. 47, null (2018). PMID: 29517919.
Article Google Scholar
Tanouchi, Y. et al. A noisy linear map underlies oscillations in cell size and gene expression in bacteria. Nature 523, 357–360 (2015).
Article ADS CAS Google Scholar
Brenner, N. et al. Universal protein distributions in a model of cell growth and division. Phys. Rev. E 92, 042713 (2015).
Article ADS Google Scholar
Wallden, M., Fange, D., Lundius, E. G., Baltekin, Ö. & Elf, J. The synchronization of replication and division cycles in individual E. coli cells. Cell 166, 729–739 (2016).
Article CAS Google Scholar
Elowitz, M. B., Levine, A. J., Siggia, E. D. & Swain, P. S. Stochastic gene expression in a single cell. Science 297, 1183–1186 (2002).
Article ADS CAS Google Scholar
Bar-Even, A. et al. Noise in protein expression scales with natural protein abundance. Nat. Genet. 38, 636–643 (2006).
Article CAS Google Scholar
Dar, R. D., Razooky, B. S., Weinberger, L. S., Cox, C. D. & Simpson, M. L. The low noise limit in gene expression. PLoS ONE 10, 1–21 (2015).
Article Google Scholar
Thomas, P., Terradot, G., Danos, V. & Weisse, A. Stochasticity of cellular growth: sources, propagation and consequences. Preprint at https://www.biorxiv.org/content/early/2018/02/18/267658 (2018).
Milo, R. & Phillips, R. Cell biology by the numbers (Garland Science, New York, 2015).
Liu, X., Oh, S. & Kirschner, M. Personal Communications. (2018).
Neurohr, G & Amon, A. Personal Communications. (2018).
Patrick, M., Dennis, P. P., Ehrenberg, M. & Bremer, H. Free rna polymerase in escherichia coli. Biochimie 119, 80–91 (2015).
Article CAS Google Scholar
Ho, P.-Y. & Amir, A. Simultaneous regulation of cell size and chromosome replication in bacteria. Front. Microbiol. 6, 662 (2015).
Article Google Scholar
Bratton, B. P., Mooney, R. A. & Weisshaar, J. C. Spatial distribution and diffusive motion of rna polymerase in live escherichia coli. J. Bacteriol. 193, 5138–5146 (2011).
Article CAS Google Scholar
Churchward, G., Bremer, H. & Young, R. Transcription in bacteria at different dna concentrations. J. Bacteriol. 150, 572–581 (1982).
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Naama Barkai, Ido Golding, Andreas Hilfinger, Po-Yi Ho, Meriem El Karoui, Andrew Murray, Johan Paulsson, Leonardo A. Sepúlveda, and Sven van Teeffelen for useful discussions related to this work. AA thanks the A.P. Sloan foundation, the Milton Fund, the Volkswagen Foundation and Harvard Dean’s Competitive Fund for Promising Scholarship for their support. JL was supported by the George F. Carrier fellowship and the National Science Foundation through the Harvard Materials Research Science and Engineering Center (DMR-1420570).

Author information

Authors and Affiliations

John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, 02138, USA
Jie Lin & Ariel Amir

Authors

Jie Lin
View author publications
You can also search for this author in PubMed Google Scholar
Ariel Amir
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors conceived the work, carried out the work, and jointly wrote the manuscript.

Corresponding author

Correspondence to Ariel Amir.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lin, J., Amir, A. Homeostasis of protein and mRNA concentrations in growing cells. Nat Commun 9, 4496 (2018). https://doi.org/10.1038/s41467-018-06714-z

Download citation

Received: 23 January 2018
Accepted: 17 September 2018
Published: 29 October 2018
DOI: https://doi.org/10.1038/s41467-018-06714-z

This article is cited by

Transcription–replication interactions reveal bacterial genome regulation
- Andrew W. Pountain
- Peien Jiang
- Itai Yanai
Nature (2024)
A coarse-grained bacterial cell model for resource-aware analysis and design of synthetic gene circuits
- Kirill Sechkar
- Harrison Steel
- Guy-Bart Stan
Nature Communications (2024)
Microeukaryote metabolism across the western North Atlantic Ocean revealed through autonomous underwater profiling
- Natalie R. Cohen
- Arianna I. Krinos
- Mak A. Saito
Nature Communications (2024)
Genome dilution by cell growth drives starvation-like proteome remodeling in mammalian and yeast cells
- Michael C. Lanz
- Shuyuan Zhang
- Jan M. Skotheim
Nature Structural & Molecular Biology (2024)
Navigating the intricacies of RT-qPCR data analysis in gene expression studies
- Linah Wetthasinghe
- Hien Fuh Ng
- Way Seah Lee
Functional & Integrative Genomics (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.