Negative autoregulation, where a transcription factor regulates its own expression by preventing transcription, is commonly used to suppress fluctuations in gene expression. Recent single molecule in vivo imaging has shown that it takes significant time for a transcription factor molecule to bind its chromosomal binding site. Given the slow association kinetics, transcription factor mediated feedback cannot at the same time be fast and strong. Here we show that with a limited association rate follows an optimal transcription factor binding strength where noise is maximally suppressed. At the optimal binding strength the binding site is free a fixed fraction of the time independent of the transcription factor concentration. One consequence is that high-copy number transcription factors should bind weakly to their operators, which is observed for transcription factors in Escherichia coli. The results demonstrate that a binding site’s strength may be uncorrelated to its functional importance.
The number of molecules in a biological cell is always fluctuating because of the probabilistic nature of chemical reactions1,2. Negative feedback is therefore commonly used to maintain homeostasis and to reduce the fluctuations in intracellular processes such as gene expression3,4,5. For example, nearly half of the ~300 transcription factors (TFs) in Escherichia coli make use of negative feedback on their own expression6,7. However, the chemical noise that makes regulation necessary also limits the accuracy of feedback regulation8. For this reason, we need to consider the physical limitations in how feedback systems can be implemented by chemical components when analysing optimal strategies for suppressing noise in gene expression9.
Recent single molecule in vivo measurements show that it takes much longer time for a TF molecule to find its chromosomal operator site than what is expected from in vitro measurements. The discrepancy is due to slow intracellular diffusion, significant binding to non-specific DNA and that chromosomal DNA is crowded by other proteins10. As a consequence, it takes about 4 min for a single LacI repressor molecule to find and bind a single lac operator in the E. coli chromosome11. The slow association kinetics, as quantified by a low association rate constant ka, implies that the dissociation rate from the operator has to be low (low kd) to accomplish strong feedback with a high binding constant K=ka/kd. However, kd also sets the time scale for how often the feedback system samples the cellular concentration that should be regulated. We should therefore ask if transcriptional noise is better suppressed with fast or strong negative feedback.
Modelling slow TF binding
The consequences of limited TF-binding rates can be modelled by the basic steps in expressing a TF gene and the TF binding to the DNA12,13,14,15,16,17,18,19, illustrated in Fig. 1. In summary, an active gene is transcribed into mRNA, the mRNA is translated into a repressor protein several times or degraded, and the repressor binds the operator site mediating the negative feedback or is degraded. When the operator is bound the gene is inactive until the repressor dissociates. We now ask what dissociation rate leads to optimal noise suppression when the average expression level is maintained constant by compensating stronger repression with higher transcription rate. By posing the optimization problem in this way, we implicitly assume that the purpose of the negative feedback system is to maintain a certain concentration of the regulated protein, and that the parameters that evolution can work with are the strength of the promoter and the strength of repressor binding.
It is straightforward to perform stochastic simulations corresponding to the reaction system in Fig. 1 (ref. 20). The results of such simulations are displayed in Supplementary Fig. S1 and will serve as reference point for the mathematical analysis. The fluctuations are quantified using the variance, σ2, to mean, <x>, ratio VMR=σ2/<x>. As we fix the average number of molecules <x> for varying feedback, finding an optimal kd in terms of minimizing the VMR is thus equivalent to optimize noise in terms of the coefficient of variation σ2/<x>2 or the s.d. over the mean σ/<x>. In Fig. 2a, we show that the VMR has a minimum with respect to the dissociation rate kd. Higher dissociation rates imply weak binding, insensitive regulation and small noise suppression. Lower dissociation rates imply that mRNAs have to be made in noisy bursts to maintain the average protein expression level despite rare dissociation events. This inevitable property of negative transcriptional autoregulation is lost in the quasi steady state (QSS) assumption where it is assumed that the TF binding equilibrates rapidly in its binding to operator sites. The QSS assumption leads to the misconception that stronger binding, compensated by a strong promoter to maintain constant copy number, results in more sensitive regulation and better noise suppression. The reason for the error is that the slow TF kinetics that follows with strong feedback is in conflict with the QSS assumption. In Supplementary Fig. S2, the discrepancy for strong feedback is shown, an effect that is previously demonstrated in comparisons with simulations16.
Optimal binding strength
To understand how the noise relates to the binding strength, we need to solve the chemical master equation21 for the well-stirred system corresponding to the reaction network in Fig. 1. The molecules are considered to be well-mixed in the cell since the search time of finding the binding site is of the order of minutes and proteins diffuse across the cell on the order of seconds. Possible effects of diffusion noise are neglected22. We derive an expression for the variance in the number of repressor molecules, see Methods section and Supplementary Note 1, where a key step is to approximate third order moments in terms of lower order moments. In Fig. 2a we display the analytical solution for the VMR, solid lines and simulation results for the complete system. As a reference, we display the VMR of unregulated protein synthesis (dashed line), which is approached in the limit of no feedback and given by13,14
where b is the average number of proteins made per mRNA molecule. Noise is suppressed below the unregulated level, VMR0, when the dissociation rate is higher than kd,min.
where cp is the rate of protein turnover, which can be a combination of dilution and degradation, mp is the mean number of repressor protein molecules and cpmp is therefore equal to the average rate of protein synthesis. See Supplementary Note 2 for details. When several proteins are made per mRNA (b»1), Equation 2 implies that the repressor has to dissociate at a rate corresponding to the protein synthesis rate divided by the number of proteins made per mRNA. This result makes sense, as the mRNAs otherwise would have to be made in bursts. The denominator is 1+b, and not b, as it is sufficient that the repressor dissociates at the rate of protein synthesis, cpmp, to maintain the uncorrelated protein synthesis events, even if b<1. High-copy number E. coli TF exists in numbers up to several thousands23,24,25. If we consider as an example a high-copy number TF to be made at a rate of 1,000 molecules per generation and with b=8, the median for E. coli26, the TF needs to dissociate more than 100 times per generation or negative feedback adds more noise than the unregulated situation.
In order to gain noise suppression by the regulation, the transcription events should not only be uniformly distributed in time, which is the case for unregulated synthesis, but respond to deviations from the average value. For this reason, the operator binding state has to be given some time to equilibrate to free protein concentration between expression events. We find that the optimal dissociation rate is
where ka is the association rate for the repressor to a single operator site. See Supplementary Note 2 for details. In Fig. 2b, we show that equation 3 describes the optimal dissociation rate for all relevant parameters by comparing the equation with the optimal dissociation rate obtained from fully stochastic simulations of growing cells. The result can also be described in terms of the fraction of time the operator should be free, that is,
where the search time of the TF, τs=1/ka, and the protein turnover time, τG=1/cp. For example, if the search time is 4 min, which is close to the physical limit11, the generation time is 22 min and b=8, the operator should be free 17% of the time independent of the number of regulated protein molecules. A highly abundant autorepressor TF should therefore bind weakly to its operator. With the previous example of 1,000 molecules, Equation 3 gives that a TF should (roughly) be bound on average one second before dissociating for optimal noise suppression.
Global TF-binding sequence statistics
In E. coli, the expression of 51% of all genes are under control of seven highly expressed global transcriptional regulators (CRP, FNR, IHF, FIS, ArcA, NarL and Lrp). Among these all but ArcA are employing negative feedback on their own synthesis6,7. However, all the six TFs bind weakly to the operators mediating the feedback compared with their binding to other specific sites (Fig. 3), where the binding strengths are estimated using the heterology index27,28. The heterology index is for a given TF-binding site an estimation of the total reduction of binding free energy compared with if each base pair would be occupied by the most representative base pair in a set of recognition sites for the TF. The observed weak autorepressor binding sites cannot be explained from the need of having a high expression of the TFs, as the same expression level could be achieved by a combination of stronger promoters and stronger feedback. Instead, the results from our calculations demonstrate another explanation; the binding sites need to be weak to keep them free a significant fraction of the time despite the high concentration of these TFs.
Recently Lestas et al.8 derived a fundamental limit for noise suppression in a feedback control system. When applied to the case where the regulated quantity itself is mediating the feedback, there is no information loss between making the species and regulating its production and the noise can therefore theoretically be suppressed to zero. Such feedback would be implemented by having infinite synthesis rate when the copy number is below the set point and zero synthesis when the copy number is at the set point or higher. However, in practice the physical implementation of the control systems implies that gene expression cannot be controlled this accurately. For example, although the repressor is bound to the operator, the copy number can fall below the set point without the control system being able to respond. In order to determine and respond to the copy number deviations, the TF has to dissociate such that the rebinding time can be used as a measure of the free concentration. The rebinding time will, however, be exponentially distributed and information is lost by making a decision based on a single-rebinding event. To estimate the concentration accurately in the system, the repressor has to bind and dissociate several times for each synthesis event such that the average occupancy at the time scale of the gene expression accurately reflects the concentration in the cells. Related constraints for optimal information transmission in gene regulatory systems where a signal molecule is regulated by upstream regulators have previously been analysed29 and how feedback may improve the fidelity in transmission of time-varying signals30.
In summary, we have derived the optimal dissociation rate for the case of a single auto-regulatory repressor protein and in the Supplementary Fig. S2 we demonstrate that the same principles apply for cooperative repression schemes. Based on these results, we conclude that there is no reason to consider weak-binding sites as unimportant. On the contrary, for regulatory fidelity by abundant TFs it is critical that binding is weak.
Solving the moment equations
The first and second moments at stationary conditions have been calculated from the chemical master equation by performing two moment closure steps, where third order moments are approximated with lower order moments (see Supplementary Note 1). After the approximation steps are performed, the second moment Mpp of the protein copy number is shown to satisfy the cubic equation
The cubic equation can be solved using Cardano’s method and the second moment Mpp will depend upon the mean protein number mp, the protein turnover rate cp, the number of proteins translated per mRNA b, the binding association rate ka and the dissociation rate kd. The VMR of the protein copy number can be calculated from the second moment as
Finding the optimal dissociation rate
The dissociation rate kd,opt that minimize the VMR can be obtained by differentiating the cubic equation (5) with respect to kd and setting the derivative of Mpp to zero (see Supplementary Note 2).
How to cite this article: Grönlund, A. et al. Transcription factor binding kinetics constrain noise suppression via negative feedback. Nat. Commun. 4:1864 doi: 10.1038/ncomms2867 (2013).
Berg, O. G. . A model for the statistical fluctuations of protein numbers in a microbial population. J. Theor. Biol. 71, 587–603 (1978).
Paulsson, J. . Summing up the noise in gene networks. Nature 427, 415–418 (2004).
Becskei, A. & Serrano, L. . Engineering stability in gene networks by autoregulation. Nature 405, 590–593 (2000).
Savageau, M. A. . Comparison of classical and autogenous systems of regulation in inducible operons. Nature 252, 546–549 (1974).
Dublanche, Y., Michalodimitrakis, K., Kümmerer, N., Foglierini, M. & Serrano, L. . Noise in transcription negative feedback loops: simulation and experimental analysis. Mol. Syst. Biol. 2, 41 (2006).
Martinez-Antonio, A. & Collado-Vides, J. . Identifying global regulators in transcriptional regulatory networks in bacteria. Curr. Opin. Microbiol. 6, 482–489 (2003).
Gama-Castro, S. et al. RegulonDB version 7.0: transcriptional regulation of Escherichia coli K12 integrated within genetic sensory response units (gensor units). Nucleic Acids Res. 39, D98–D105 (2011).
Lestas, I., Vinnicombe, G. & Paulsson, J. . Fundamental limits on the suppression of molecular fluctuations. Nature 467, 174–178 (2010).
Grönlund, A., Lötstedt, P. & Elf, J. . Delay-induced anomalous fluctuations in intracellular regulation. Nat. Commun. 2, 419 (2011).
Li, G. -W., Berg, O. G. & Elf, J. . Effects of macromolecular crowding and DNA looping on gene regulation kinetics. Nat. Phys. 5, 294–297 (2009).
Hammar, P., Leroy, P., Mahmutovic, A., Marklund, E. G., Berg, O. G. & Elf, J. . The lac repressor displays facilitated diffusion in living cells. Science 336, 1595–1598 (2012).
Thattai, M. & van Oudenaarden, A. . Intrinsic noise in gene regulatory networks. Proc. Natl Acad. Sci. USA 98, 8614–8619 (2001).
Ozbudak, E. M., Thattai, M., Kurtser, I., Grossman, A. D. & van Oudenaarden, A. . Regulation of noise in the expression of a single gene. Nat. Genet. 31, 69–73 (2002).
Paulsson, J. . Models of stochastic gene expression. Phys. Life Rev. 2, 157–175 (2005).
Singh, A. & Hespanha, J. P. . Optimal feedback strength for noise suppression in autoregulatory gene networks. Biophys. J. 96, 4013–4023 (2009).
Stekel, D. J. & Jenkins, D. J. . Strong negative self regulation of prokaryotic transcription factors increases the intrinsic noise of protein expression. BMC Syst. Biol. 2, 6 (2008).
Tkačik, G., Gregor, T. & Bialek, W. . The role of input noise in transcriptional regulation. PLoS One 3, e2774 (2008).
Walczak, A. M., Onuchic, J. N. & Wolynes, P. G. . Absolute rate theories of epigenetic stability. Proc. Natl Acad. Sci. USA 102, 18926–18931 (2005).
Kim, K. -Y. & Wang, J. . Potential energy landscape and robustness of a gene regulatory network: toggle switch. PLoS Comput. Biol. 3, e60 (2007).
Lu, T., Volfson, D., Tsimring, L. & Hasty, J. . Cellular growth and division in the Gillespie algorithm. Syst. Biol. 1, 121–128 (2004).
van Kampen, N. G. . Stochastic Processes in Physics and Chemistry, 5th edn Elsevier: Amsterdam, (2004).
Bialek, W. & Setayeshgar, S. . Physical limits to biochemical signaling. Proc. Natl Acad. Sci. USA 102, 10040–10045 (2005).
Cai, S. J. & Inouye, M. . EnvZ-OmpR interaction and osmoregulation in Escherichia coli. J. Biol. Chem. 277, 24155–24161 (2002).
Overgaard, M., Borch, J. & Gerdes, K. . RelB and RelE of Escherichia coli form a tight complex that represses transcription via the ribbon–helix–helix motif in RelB. J. Mol. Biol. 394, 183–196 (2009).
Ali Azam, T., Iwata, A., Nishimura, A., Ueda, S. & Ishihama, A. . Growth phase-dependent variation in protein composition of the escherichia coli nucleoid. J. Bacteriol. 181, 6361–6370 (1999).
Taniguichi, Y. et al. Quantifying E. coli proteome and transcriptome with single-molecule sensitivity in single cells. Science 329, 533 (2010).
Berg, O. G. & von Hippel, P. H. . Selection of DNA binding sites by regulatory proteins: II. The binding specificity of cyclic AMP receptor protein to recognition sites. J. Mol. Biol. 200, 709–723 (1988).
Berg, O. G. . Selection of DNA binding sites by regulatory proteins: the LexA protein and the arginine repressor use different strategies for functional specificity. Nucleic Acids Res. 16, 5089–5105 (1988).
Tkacik, G., Walczak, A. M. & Bialek, W. . Optimizing information flow in small genetic networks. III. A self-interacting gene. Phys. Rev. E 85, 041903 (2012).
de Ronde, W. H., Tostevin, F. & ten Wolde, P. R. . The effect of feedback on the fidelity of information transmission of time-varying signals. Phys. Rev. E 82, 031914 (2010).
This work was supported by the European Research Council (203083), the Swedish Foundation for Strategic Research, the Swedish Research Council, Göran Gustafssons stiftelse, eSSENCE, and the Knut and Alice Wallenberg Foundation.
The authors declare no competing financial interests.
About this article
Cite this article
Grönlund, A., Lötstedt, P. & Elf, J. Transcription factor binding kinetics constrain noise suppression via negative feedback. Nat Commun 4, 1864 (2013). https://doi.org/10.1038/ncomms2867
Nature Reviews Genetics (2020)
Bulletin of Mathematical Biology (2018)
Journal of Mathematical Biology (2018)
Nature Ecology & Evolution (2017)
Scientific Reports (2017)