A minimal model for gene expression dynamics of bacterial type II toxin–antitoxin systems

Kosmidis, Kosmas; Hütt, Marc-Thorsten

doi:10.1038/s41598-021-98570-z

Article
Open access
Published: 30 September 2021

Scientific Reports volume 11, Article number: 19516 (2021)

Abstract

Toxin–antitoxin (TA) modules are part of most bacteria’s regulatory machinery for stress responses and general aspects of their physiology. Due to the interplay of a long-lived toxin with a short-lived antitoxin, TA modules have also become systems of interest for mathematical modelling. Here we resort to previous modelling efforts and extract from these a minimal model of type II TA system dynamics on a timescale of hours, which can be used to describe time courses derived from gene expression data of TA pairs. We show that this model provides a good quantitative description of TA dynamics for the 11 TA pairs under investigation here, while simpler models do not. Our study brings together aspects of Biophysics with its focus on mathematical modelling and Computational Systems Biology with its focus on the quantitative interpretation of ’omics’ data. This mechanistic model serves as a generic transformation of time course information into kinetic parameters. The resulting parameter vector can, in turn, be mechanistically interpreted. We expect that TA pairs with similar mechanisms are characterized by similar vectors of kinetic parameters, allowing us to hypothesize on the mode of action for TA pairs still under discussion.

Introduction

The vast majority of free-living bacteria contain a number of toxin–antitoxin (TA) gene pairs^1,2,3,4. The toxin products target key cellular functions, inhibiting cell growth and eventually leading to cell death, while the corresponding antitoxin neutralizes the toxin’s effect, thus forming a TA system whose accurately regulated expression is vital to the survival of the cell⁵. These TA systems are currently classified into six groups (types I, II, III, IV, V, VI)² according to the mechanism used by the antitoxin to neutralize the toxin. Types I–III are considered to be well-established TA systems^3,6,7,8,9, while types IV–VI consist of newly discovered types^10,11,12,13,14. Type II TA systems are the largest and best-studied TA system class. Type II antitoxins are proteins. They typically have two domains, one that binds DNA and a second that binds and inhibits the activity of the cognate protein toxin^2,3,9. The presence of TA systems is considered to be associated with persistence, i.e. the multidrug tolerance of bacteria, which compromises the effectiveness of antibiotics against many pathogenic bacteria¹⁵. It is believed^4,15,16 that when antibiotics are applied, a small sub-population of bacteria, called persisters, enters a dormant, non-dividing state and is thus protected from being killed. Experiments have shown a connection between persister formation and the competition between a toxin and its antitoxin inside an E. coli cell. Toxins inhibit cell growth, and most antibiotics target the cell during the growth phase. Cells entering this persistent state seem to be immune to antibiotics, but this immunity differs from the one obtained through advantageous mutations that result in antibiotic resistance, since it is neither permanent nor inherited¹⁷. Knowledge about TA systems in bacteria is still accumulating¹⁸. This is true for the discovery of new TA modules¹⁹, their classification^5,20, their functional roles^21,22,23,24 as well as their detailed molecular mechanisms²⁵. Very recently, for example, it was discovered that the type II TA system PrpT-PrpA of the Pseudoalteromonas rubra plasmid directly controls plasmid replication. It seems that the antitoxin PrpA binds to the iterons in the origin of replication (Ori), interfering with the binding of RepB to the Ori and thus preventing overreplication of the plasmid²⁶.

In E. coli, there are more than ten well-characterized type II TA systems¹. These include relE-relB, yafQ-dinJ, yoeB-yefM, hipA-hipB, yafO-yafN, hicA-hicB, higB-higA, ypjF-yfjZ, mqsR-mqsA, ymcE-gnsA and ydaT-ydaS^10,27,28,29,30,31,32,33,34,35,36,37. The genomic location of each of these TA systems is indicated in Fig. 1. It is of considerable practical importance to understand the dynamics of TA systems, and several plausible models for TA dynamics and persister formation have been proposed (see, for example,^38,39,40 and references therein). It is also important that the predictions of proposed models are compared with the high-throughput data that are now available. In this paper, we present a minimal model for the description of type II TA dynamics in E. coli. The basic characteristics of the minimal model are that it assumes (a) regulation of the toxin and antitoxin production rates by negative feedback through DNA binding of the TA complex and (b) toxin-induced growth rate modulation. The model’s predictions are compared to the RNA-Seq gene expression data published in⁴¹ (see Results and Discussion).

TA dynamics have been of interest to mathematical modelling for a long time. So far, the focus of research has been on the basic dynamical properties of TA modules^39,40,42,43 and the synchronization of multiple TA modules in response to environmental stimuli (e.g.,⁴⁴), rather than on agreement with high-throughput data. For high-throughput data, in particular gene expression patterns, the dominant avenue of research has been to compare these patterns with large-scale regulatory networks or classes of regulatory mechanisms. In the case of bacterial gene regulation, successes include understanding and experimentally confirming the role of small regulatory devices like feedforward loops^45,46, the discovery of an interplay between the regulatory network and chromosomal structure^47,48,49,50, and the organization of gene expression along the axis from the origin (OriC) to the terminus (Ter) of replication⁵⁰.

TA systems are often embedded in an intricate network of regulatory processes⁵ and are part of functional regulatory modules⁵¹. There is evidence of collective behaviors arising from the interplay between TA systems. Models of coupled TA systems have, for example, been studied in²¹ and in⁴⁴. Simple ordinary differential equation (ODE) models of type II TA systems have been formulated in²¹, with an emphasis on coupled systems and the spontaneous switching occurring in stochastic dynamics; in⁴⁰, where the conditional cooperativity of the RelBE system and its response to environmental stimuli (e.g., nutritional stress) have been studied; in⁵², which contains a simplified system capable of excitable dynamics; and in³⁹ and⁴⁴, with a focus on bistability. For type I TA systems, a mathematical model has been developed in⁵³, offering insight into the time scales involved.

Here we study the long-term dynamics of TA pairs in time-resolved RNA-Seq data for E. coli. Our question is whether the dynamics of all TA pairs in the data can be described by the same model, or whether qualitatively different models have to be assumed for the different TA modules.

Methods

Figure 2 shows a schematic of the basic characteristics of the minimal model of type II TA gene expression. Toxin T and antitoxin A are expressed by neighbouring genes. It is known^1,39 that toxins are more stable than antitoxins; the latter therefore have to be constantly expressed in order to neutralize the toxin effects. The toxin and antitoxin form a complex AT which inhibits toxin and antitoxin production. More complex TA interactions (such as conditional cooperativity^39,40 or cooperation between multiple TA systems¹⁷) are not included in the minimal model. Moreover, the presence of toxin has an inhibitory effect on cell growth. This last feature turns out to be an essential characteristic of an acceptable minimal model.

We denote the concentration of the antitoxin A with the variable $y_{1}$, that of the toxin T with $y_{2}$ and, finally, the concentration of the TA complex AT with $y_{3}$. The system of ordinary differential equations (ODEs) that describes the system is:

$$\begin{aligned} \frac{d y_1}{dt}&= \frac{k'_{1}}{\left( 1 + \frac{y_{3}}{s'_{1}}\right) \left( b'_{m} y_{2} + 1 \right) } - d_{1} y_{1} + d_{3} y_{3} - k_{3} y_{1} y_{2} \end{aligned}$$

(1)

$$\begin{aligned} \frac{d y_{2}}{dt}&= \frac{k'_{2}}{\left( 1 + \frac{y_{3}}{s'_{2}}\right) \left( b'_{m} y_{2} + 1\right) } - \frac{d_{2} y_{2}}{b'_{c} y_{2} + 1} + d_{3} y_{3} - k_{3} y_{1} y_{2} \end{aligned}$$

(2)

$$\begin{aligned} \frac{d y_{3}}{dt}&= - d_{3} y_{3} + k_{3} y_{1} y_{2} . \end{aligned}$$

(3)

Equation (3) is a standard chemical kinetics equation. We assume that the production rate of the complex $y_3$ is proportional to the product of the concentrations $y_1$ and $y_2$, hence the term $k_{3} y_{1} y_{2}$, where $k_3$ is the respective rate constant. We also assume that the complex decays into its constituents A and T with a rate constant $d_3$. To be precise, the rate constants $d_1, d_2, d_3$ are considered to be a sum of two terms due to (a) protein degradation (specific destruction by specialized proteins in the cell) and (b) dilution (the reduction in concentration due to the increase of cell volume during growth)⁵⁴. This is the standard way of dealing with cell growth in the mathematical modelling of bacterial gene expression and is adequate in steady-state models. However, in the context of this work, since the abundance of free toxin can directly affect the growth rate (and thus dilution), dilution cannot be properly characterized by a fixed number. Thus, the above model and, for that matter, all other models in the scientific literature that we are aware of do not fully consider the effect of bacterial growth.

The inhibitory action of the AT complex is modelled through the inclusion of negative feedback terms such as $k'_{1}/\left( 1 + \frac{y_{3}}{s'_{1}}\right)$ in Eq. (1). The presence of toxin T in the cell reduces all protein production and decreases protein dilution by decreasing cell growth. Thus, the toxin concentration has an inhibitory impact on the production rates of toxin and antitoxin and on the cellular growth rate. We introduce an inhibition factor $1/(b'_{m} y_{2} + 1 )$ in Eqs. (1)–(2). The parameter $b'_m$ represents the reduction of protein expression due to the presence of toxin molecules. We also assume that growth inhibition influences the toxin degradation rate, and we introduce a factor $(b'_{c} y_{2} + 1)$ that modulates the toxin degradation rate in Eq. (2), while we assume that the degradation rate of the free antitoxin remains the same. This is in agreement with a recent finding from⁵⁵ that, although free antitoxin is readily degraded in vivo, antitoxin bound to toxin is protected from proteolysis, preventing release of active toxin.

However, Eqs. (1)–(3), if one includes the unknown initial conditions for the quantities $y_1, y_2, y_3$ at $t=0$, contain 13 adjustable parameters. Our aim is to estimate the model parameters using experimental RNA-Seq data obtained from⁴¹. These experimental data (10 data points for each toxin antitoxin pair) would render such an estimation problematic, since such a model is structurally unidentifiable⁵⁶.

In order to reduce the number of adjustable parameters we rescale the unobserved variable $y_3$ by setting $y_3 = (k'_2/d_3)z_3$ and rescale the variables $y_1, y_2$ by the same factor $\beta = k'_2$, i.e. by setting $y_1 = k'_2 z_1$ and $y_2 = k'_2 z_2$. Thus, we arrive at a system of ODEs for the rescaled variables $z_1, z_2, z_3$ which is:

$$\begin{aligned} \frac{d z_1}{dt}&= - d_{1} z_{1} + \frac{k_{1}}{\left( 1 + \frac{z_{3}}{s_{1}}\right) \left( b_{m} z_{2} + 1\right) } - k_{2} z_{1} z_{2} + z_{3} \end{aligned}$$

(4)

$$\begin{aligned} \frac{d z_{2}}{dt}&= - \frac{d_{2} z_{2}}{b_{c} z_{2} + 1} - k_{2} z_{1} z_{2} + z_{3} + \frac{1}{\left( 1 + \frac{z_{3}}{s_{2}}\right) \left( b_{m} z_{2} + 1\right) } \end{aligned}$$

(5)

$$\begin{aligned} \frac{d z_{3}}{dt}&= - d_{3} \left( - k_{2} z_{1} z_{2} + z_{3}\right) , \end{aligned}$$

(6)

where the new kinetic constants are related to those in Eqs. (1)–(3) by the relations $k_1 = k'_1/k'_2, s_1 = d_3 s'_1/k'_2, b_m = b'_m k'_2, k_2 = k'_2 k_3, s_2 = d_3 s'_2/k'_2, b_c = b'_c k'_2$. Moreover, we assume that $z_1$ and $z_2$ at time $t = 0$ are equal to zero and allow the unobserved complex concentration $z_3(0)$ to be equal to a constant $c_0$, which is determined by fitting the solution of Eqs. (4)–(6) to the data. Henceforth, we will refer to the above model (Eqs. (4)–(6)) as the Z-model. The model is essentially a rescaled version of the model proposed in^39,40 with the additional assumption that the antitoxin bound to toxin is protected from proteolysis.
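As an illustration of how Eqs. (4)–(6) can be integrated numerically, the following is a minimal sketch using `scipy.integrate.solve_ivp`. It is not the authors' code: the function names, the choice of the LSODA integrator, the time grid and all parameter values are assumptions made only so that the example runs.

```python
import numpy as np
from scipy.integrate import solve_ivp

PARAM_NAMES = ("d1", "d2", "d3", "k1", "k2", "s1", "s2", "bm", "bc")


def z_model_rhs(t, z, d1, d2, d3, k1, k2, s1, s2, bm, bc):
    """Right-hand side of Eqs. (4)-(6) for z = (z1, z2, z3)."""
    z1, z2, z3 = z
    expression_damping = bm * z2 + 1.0   # toxin-dependent reduction of production
    net_binding = k2 * z1 * z2 - z3      # complex formation minus release
    dz1 = k1 / ((1.0 + z3 / s1) * expression_damping) - d1 * z1 - net_binding
    dz2 = (1.0 / ((1.0 + z3 / s2) * expression_damping)
           - d2 * z2 / (bc * z2 + 1.0) - net_binding)
    dz3 = d3 * net_binding
    return [dz1, dz2, dz3]


def solve_z_model(params, c0, t_eval):
    """Integrate from z1(0) = z2(0) = 0 and z3(0) = c0, as assumed in the text."""
    args = tuple(params[name] for name in PARAM_NAMES)
    sol = solve_ivp(z_model_rhs, (t_eval[0], t_eval[-1]), [0.0, 0.0, c0],
                    t_eval=t_eval, args=args, method="LSODA")
    return sol.y  # rows: z1(t), z2(t), z3(t)


# Placeholder parameter values, chosen only so the example runs (not fitted values).
example_params = dict(d1=1.0, d2=0.1, d3=0.5, k1=2.0, k2=1.0,
                      s1=1.0, s2=1.0, bm=0.5, bc=0.5)
t_grid = np.linspace(0.0, 10.0, 10)
z1, z2, z3 = solve_z_model(example_params, c0=0.1, t_eval=t_grid)
```

Together with $c_0$, the nine entries of `PARAM_NAMES` give the ten adjustable parameters of the Z-model.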

Our numerical investigations have shown that the Z-model (Eqs. (4)–(6)) is the simplest model able to represent the complete set of the experimental data that we have at our disposal with reasonable accuracy. Omission of any of the above basic ingredients of the model (e.g. setting $b_m$ and $b_c$ equal to zero) leads to plausible models, which may adequately describe the time evolution of the concentrations of some TA pairs, but fail to describe the expression of the entire set. The Z-model and its variants that we examine in this manuscript are deterministic models. We will not deal with the important topic of investigating a stochastic variant of the Z-model through a Monte Carlo approach based on the Gillespie algorithm. Our modelling decision is based on the fact that the RNA-Seq data that we use to fit the model parameters are not single-cell sequencing data. As one can see in the detailed description of the experimental data used in this study, each RNA-Seq “read” represents multi-cell averages on a timescale of hours. For single-cell RNA-Seq experiments, a stochastic modelling approach would be more appropriate, although admittedly much more difficult. There is, however, important progress in the direction of using stochastic models and inferring parameter values from noisy data, see for example⁵⁷. Bulk RNA-Seq data have clear limitations regarding such mechanistic interpretations. When technology advances (see, e.g.,⁵⁸ for an important step in this direction) and time-resolved single-cell experiments are readily available, we envision that repeating our analysis could provide further valuable insights. In this case, however, it is known that, on a single-cell level, mRNA and protein concentrations do not correlate well⁵⁹. Repeating our analysis on a single-cell level would then require time-resolved proteomics data.

For our analysis we used experimental RNA-Seq data obtained from⁴¹ (GEO accession number: GSE65244). The RNA-Seq data used here are for the wild-type (wt) strain and were obtained after culture growth in rich medium during the stationary phase. The system of Eqs. (4)–(6) was solved numerically with custom code written in Python using the scipy module⁶⁰. Fitting of the numerical solutions of the ODEs was performed as part of the code using the Nelder-Mead minimization algorithm as implemented in scipy. Since the task of performing fits for all TA pairs and all model variants is quite demanding, the code was parallelized using the dask.distributed Python module. All numerical simulations were performed on a workstation equipped with 2 Intel Xeon Gold 6140 processors (72 CPU cores in total).

Results

Figure 3 shows the concentrations of toxin and antitoxin for 11 known TA pairs of E. coli as a function of time. Symbols represent experimental RNA-Seq data obtained from⁴¹ (GEO accession number: GSE65244). The list is exhaustive, meaning that it includes all the TA pairs for which there are experimental measurements in the dataset. All data have been rescaled, i.e. multiplied by the same constant $c=10^{5}$, in order to avoid numerical errors during the fitting process. Lines are the numerical solutions of the ODE system, Eqs. (4)–(6). The kinetic constants of the system were estimated so that the weighted sum of the squared differences between the experimental data and the model predictions is minimized. We calculate weighted least squares because we have to fit two different experimental curves simultaneously whose y-axis values may differ considerably. Thus, we first calculate the mean value of each curve and then the weighted sum of the squared differences. Otherwise, curves with low mean values would be practically ignored during the fitting process. The lines therefore represent the “best” fit of the model to the data. We observe a very good agreement between the model predictions and the experimental data. As mentioned above, we assume that $z_1$ and $z_2$ at time $t = 0$ are equal to zero. This is a rather harsh, and possibly unrealistic, condition to impose. If more data points were available, the more natural and appropriate choice would be to use the RNA-Seq measurements of the earliest available timepoint as our initial conditions. This is indeed the approach we took in our analysis in Appendix B (Supplementary Materials). We should point out, however, that the same initial condition is imposed on all TA pairs and that there is no indication that the TA systems exhibit chaotic dynamics, which is known to be rare in chemical systems and to require rather special conditions. We therefore have no reason to expect sensitivity of the dynamics to the initial conditions and do not believe that our choice affects the accuracy of the model. An additional analysis in Appendix B, where a different choice of initial conditions has been adopted, i.e. the average concentration across all measurements, seems to support such a claim.
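The weighting described above can be sketched as follows. The exact weights are not stated in the text; this sketch assumes that each curve's squared residuals are divided by the square of that curve's mean, which is one common way to keep a low-valued curve from being ignored. The function name `weighted_sse` is hypothetical.

```python
import numpy as np


def weighted_sse(obs_antitoxin, obs_toxin, pred_antitoxin, pred_toxin):
    """Sum of squared residuals, each curve weighted by 1 / mean(observed)^2.

    The 1/mean^2 weighting is an assumption of this sketch, not a detail
    taken from the text.
    """
    obs_a = np.asarray(obs_antitoxin, dtype=float)
    obs_t = np.asarray(obs_toxin, dtype=float)
    pred_a = np.asarray(pred_antitoxin, dtype=float)
    pred_t = np.asarray(pred_toxin, dtype=float)
    w_a = 1.0 / np.mean(obs_a) ** 2
    w_t = 1.0 / np.mean(obs_t) ** 2
    return w_a * np.sum((obs_a - pred_a) ** 2) + w_t * np.sum((obs_t - pred_t) ** 2)


# Two made-up curves whose scales differ by a factor of 100, each predicted
# with a 10% relative error: both contribute equally to the objective.
high = weighted_sse([100.0, 100.0], [100.0, 100.0], [110.0, 110.0], [100.0, 100.0])
low = weighted_sse([1.0, 1.0], [1.0, 1.0], [1.0, 1.0], [1.1, 1.1])
```

With unweighted least squares, the same 10% error on the low-valued curve would contribute 10,000 times less than on the high-valued one.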

Figure 4 shows a box plot of the model parameters estimated from the best fit of the ODE system, Eqs. (4)–(6), to the RNA-Seq data. Each box shows the “dispersion” of eleven values, one per TA pair. We observe a wide distribution of parameter values across the different TA pairs. This is rather common in biological systems, where the kinetic constants of various metabolic reactions can differ by several orders of magnitude. Therefore, the same underlying differential equations lead to quite different dynamics precisely due to the broad range of the kinetic constants. In Appendix A we include a detailed discussion of the estimated covariances and standard deviations of the fitting parameters (see also the attached files in the Supplementary Materials).

Figure 5 shows in a log-linear plot the toxin, antitoxin and TA complex concentrations as a function of time for the 11 known TA pairs of E. coli. Solid lines show the result $z_1(t)$ of the numerical solution of the ODE system, Eqs. (4)–(6), for the antitoxin. Dashed lines show the corresponding variable $z_2(t)$ for the toxin. Dotted lines show the corresponding variable $z_3(t)$ for the TA complex. We observe a variety of different dynamics, but interestingly enough, in all cases the complex concentration $z_3$ seems to be lower than that of both the toxin and the antitoxin. For the majority of cases the antitoxin concentration is higher than that of the toxin. There are, however, exceptions, namely the relB-relE, mqsR-mqsA and ymcE-gnsA pairs. The ydaT-ydaS pair also exhibits higher toxin expression for most of the observation time, and only at the final stage does the toxin level drop below that of the antitoxin. It is also quite intriguing that the Z-model predicts expression states where the toxin is consistently much higher than the antitoxin (e.g. ymcE-gnsA) without resorting to the mechanism of conditional cooperativity^2,39, although it is well established that certain TA pairs (e.g. the relB-relE pair) exhibit conditional cooperativity and such effects are not accounted for in the Z-model.

Next, we are interested in examining simpler versions of the proposed model and assessing their ability to describe the experimental data. We compare the Z-model to 7 simpler variants (i.e. with fewer adjustable parameters), which we obtain from Eqs. (4)–(6) by imposing constraints on some of the constants, i.e. by fixing their numerical value or by setting them numerically equal to other constants. We describe these simpler variants below:

Model “s1=s2” is obtained by forcing the constants $s_1$ and $s_2$ to have the same numerical value.
Model “s1=s2 no bm” is obtained by forcing the constants $s_1$ and $s_2$ to have the same numerical value and by dropping the $b_m$ constant, i.e. setting $b_m = 0$.
Model “s1=s2 no bc” is obtained by forcing the constants $s_1$ and $s_2$ to have the same numerical value and by setting $b_c = 0$.
Model “s1=s2 no bm bc” is obtained by forcing the constants $s_1$ and $s_2$ to have the same numerical value and by setting both $b_m =0$ and $b_c = 0$.
Model “s1!=s2 no bm” is obtained by setting $b_m = 0$. Note that now constants $s_1$ and $s_2$ are allowed to have different numerical values.
Model “s1!=s2 no bc” is obtained by setting $b_c = 0$.
Model “no s1 s2 bm bc” is the simplest variant and is obtained from the Z-model ODEs by setting $s_1=1, s_2=1, b_m =0, b_c = 0$.
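The seven variants listed above can be encoded as constraints on the full parameter vector. The sketch below is one possible bookkeeping scheme, not the authors' implementation; the names `FULL`, `VARIANTS`, `free_names` and `expand` are hypothetical, and the free-parameter counts it yields assume that every unconstrained constant, including $c_0$, remains adjustable in each variant.

```python
FULL = ("d1", "d2", "d3", "k1", "k2", "s1", "s2", "bm", "bc", "c0")

# "fixed": constants pinned to a number; "tied": constant copied from another one.
VARIANTS = {
    "full": {},
    "s1=s2": {"tied": {"s2": "s1"}},
    "s1=s2 no bm": {"tied": {"s2": "s1"}, "fixed": {"bm": 0.0}},
    "s1=s2 no bc": {"tied": {"s2": "s1"}, "fixed": {"bc": 0.0}},
    "s1=s2 no bm bc": {"tied": {"s2": "s1"}, "fixed": {"bm": 0.0, "bc": 0.0}},
    "s1!=s2 no bm": {"fixed": {"bm": 0.0}},
    "s1!=s2 no bc": {"fixed": {"bc": 0.0}},
    "no s1 s2 bm bc": {"fixed": {"s1": 1.0, "s2": 1.0, "bm": 0.0, "bc": 0.0}},
}


def free_names(variant):
    """Names of the constants that remain adjustable in a variant."""
    spec = VARIANTS[variant]
    pinned = set(spec.get("fixed", {})) | set(spec.get("tied", {}))
    return [name for name in FULL if name not in pinned]


def expand(variant, free_values):
    """Map the reduced parameter vector of a variant to the full 10-vector."""
    spec = VARIANTS[variant]
    names = free_names(variant)
    if len(free_values) != len(names):
        raise ValueError(f"expected {len(names)} values, got {len(free_values)}")
    full = dict(zip(names, free_values))
    full.update(spec.get("fixed", {}))
    for target, source in spec.get("tied", {}).items():
        full[target] = full[source]
    return [full[name] for name in FULL]


n_free = {name: len(free_names(name)) for name in VARIANTS}
```

An optimizer can then work on the reduced vector only, while the same ODE right-hand side is reused for every variant through `expand`.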

Models, where the parameter $b_m$ is identically zero, do not take into account the reduction of protein expression due to the existence of toxin, while variants, where the parameter $b_c$ is identically zero, ignore the effect of growth inhibition. Figure 6 shows the minimum values of the objective function (i.e. the sum of weighted squared differences between model predictions and the experimental data) for all TA pairs and for the 7 model variants described above. The objective function values depend on the values of the experimental data which differ considerably between different TA pairs, thus the noticeable difference in the y-axis scales of Fig. 6.

The objective function of the Full Z-model is always lower than that of the variants, as expected. We should also mention that the algorithms (basinhopping in combination with a local Nelder-Mead algorithm) used for the minimization of the objective function are guaranteed to find local, not global, minima. Although we have performed a rather extensive search of the parameter space, there is always the chance that there are sets of parameters that will lead to lower values of the objective function than those reported here. We see that there are TA pairs for which simpler variants are capable of fitting the data with results comparable to those of the Full Z-model. However, the Full Z-model is the appropriate choice if one wants to describe the expression of the entire set of TA pairs.
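The combination of basin hopping with a local Nelder-Mead search mentioned above can be sketched with `scipy.optimize.basinhopping`. The objective below is a hypothetical multimodal stand-in, not the fitting objective of the paper, and the starting point and number of iterations are arbitrary choices made for illustration.

```python
import numpy as np
from scipy.optimize import basinhopping


def toy_objective(x):
    """Hypothetical multimodal function with its global minimum (value 0) at x = 0."""
    x = np.asarray(x, dtype=float)
    return float(np.sum(x ** 2) + 2.0 * np.sum(1.0 - np.cos(3.0 * x)))


x_start = np.array([2.0, -1.5])

# Each basin-hopping step perturbs the current point at random and then runs
# a local Nelder-Mead minimization from the perturbed point.
result = basinhopping(toy_objective, x_start, niter=50,
                      minimizer_kwargs={"method": "Nelder-Mead"})
best_x, best_value = result.x, result.fun
```

Because the perturbations are random, repeated runs may end in different local minima, which is consistent with the caveat above that lower objective values than those found cannot be ruled out.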

Since we want to compare models with different numbers of parameters, it might be plausible to examine two widely used model selection criteria, the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC) for the Full Z-model and its seven variants. These are calculated as follows:

$$\begin{aligned} \mathrm{AIC}&= N \ln \left( \chi ^{2} / N\right) + 2 N_{v} \\ \mathrm{BIC}&= N \ln \left( \chi ^{2} / N\right) + \ln (N)\, N_{v} , \end{aligned}$$

(7)

where $\chi ^2$ stands for the sum of the squares of the residuals (i.e., the objective function discussed above), N is the number of data points (common to all model variants) and $N_{v}$ is the number of adjustable parameters of each model. $N_{v}$ is different for each variant. The full Z-model has the highest value, i.e. $N_{v} = 10$. The most appropriate model is considered to be the one with the lowest AIC or BIC value, since both criteria penalize a large $N_{v}$ and reward a low objective function. Generally, the Bayesian information criterion is considered the more conservative of the two statistics. Figure 7 shows the AIC and BIC for the “collective” description of the TA gene expression set, i.e. when we describe the complete set of TA pairs with $N = 10 \times 11 = 110$ data points and $\chi ^2$ is the sum of the objective functions of all the TA pairs.
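Eq. (7) translates directly into code. The following sketch uses hypothetical function names and a made-up $\chi^2$ value purely to show the arithmetic; only $N = 110$ and $N_v = 10$ are taken from the text.

```python
import math


def aic(chi2, n_points, n_params):
    """Akaike Information Criterion as defined in Eq. (7)."""
    return n_points * math.log(chi2 / n_points) + 2 * n_params


def bic(chi2, n_points, n_params):
    """Bayesian Information Criterion as defined in Eq. (7)."""
    return n_points * math.log(chi2 / n_points) + math.log(n_points) * n_params


# chi2 = 11.0 is a made-up value used only for illustration.
example_aic = aic(chi2=11.0, n_points=110, n_params=10)
example_bic = bic(chi2=11.0, n_points=110, n_params=10)
```

The two criteria share the first term and differ only in the penalty per parameter: 2 for the AIC and $\ln N$ for the BIC. Since $\ln 110 \approx 4.7 > 2$, the BIC penalizes each additional parameter more strongly for this data set.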

Finally, it is helpful to compare the values of the constants that we obtained from the minimal ODE model for the different TA pairs. To this end we may view them as a “vector” characterizing the TA pair and we use an unsupervised learning method, namely Principal Component Analysis (PCA), a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components⁶¹. PCA is routinely applied directly to experimental measurements for the purpose of dimensionality reduction. Using PCA to interpret the parameters of a deterministic ODE model, however, constitutes a novel approach, which has recently been used to interpret the parameters of a fractal kinetics SI model of Covid-19 spreading⁶². Figure 8 shows a plot of the two largest PCA components.
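The projection onto principal components can be sketched with a singular value decomposition. The matrix below is synthetic (random numbers standing in for 11 parameter vectors of length 10), and the sketch only centres the columns; whether the parameters were additionally scaled or log-transformed before the PCA is not specified in the text.

```python
import numpy as np


def pca_scores(data, n_components=2):
    """Project the rows of `data` onto the leading principal components."""
    centred = np.asarray(data, dtype=float)
    centred = centred - centred.mean(axis=0)
    # Rows of vt are the principal axes; singular values are sorted descending.
    u, singular_values, vt = np.linalg.svd(centred, full_matrices=False)
    scores = u[:, :n_components] * singular_values[:n_components]
    explained_ratio = singular_values ** 2 / np.sum(singular_values ** 2)
    return scores, explained_ratio[:n_components]


# Synthetic stand-in for the fitted parameter vectors (11 TA pairs x 10 constants).
rng = np.random.default_rng(0)
synthetic_params = rng.lognormal(size=(11, 10))
scores, explained_ratio = pca_scores(synthetic_params)
```

Each row of `scores` gives the (PC1, PC2) coordinates of one parameter vector, i.e. one point in a plot of the two largest components.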

Typically in a PCA plot we try to identify clusters and perceive them as an indication of similar underlying causal behavior. To avoid subjectivity in cluster identification, we applied a clustering algorithm, namely DBSCAN with parameter $eps = 0.8$⁶³. For DBSCAN the number of clusters is not predefined but decided by the algorithm. Here, the clustering algorithm has identified one cluster of 7 TA pairs, namely dinJ-yafQ, relB-relE, yafN-yafO, higA-higB, hipB-hipA, hicB-hicA, and mqsA-mqsR, which form a large central cluster, and four outliers, i.e. the three pairs yefM-yoeB, ydaT-ydaS, and ymcE-gnsA, which have a negative PC2 component, and yfjZ-ypjF, with relatively large PC1 and PC2 values.
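To make the clustering step concrete, here is a compact, self-contained sketch of the DBSCAN idea in NumPy. It is not the implementation used in the study. Only `eps = 0.8` comes from the text; the value `min_samples = 5` and the convention that a point counts as its own neighbour are assumptions, and the input points are synthetic.

```python
from collections import deque

import numpy as np


def dbscan(points, eps=0.8, min_samples=5):
    """Return one cluster label per point; -1 marks outliers (noise)."""
    pts = np.asarray(points, dtype=float)
    n = len(pts)
    dist = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
    neighbours = [np.flatnonzero(dist[i] <= eps) for i in range(n)]
    # A core point has at least min_samples points (itself included) within eps.
    is_core = np.array([len(nb) >= min_samples for nb in neighbours])
    labels = np.full(n, -1)
    cluster = 0
    for i in range(n):
        if labels[i] != -1 or not is_core[i]:
            continue
        labels[i] = cluster
        queue = deque([i])
        while queue:
            j = queue.popleft()
            if not is_core[j]:
                continue  # border point: joins the cluster but does not grow it
            for k in neighbours[j]:
                if labels[k] == -1:
                    labels[k] = cluster
                    queue.append(k)
        cluster += 1
    return labels


# Synthetic 2D points: a tight central group plus three isolated points.
synthetic_points = [(0.0, 0.0), (0.2, 0.0), (0.0, 0.2), (-0.2, 0.0), (0.0, -0.2),
                    (0.1, 0.1), (3.0, 3.0), (-3.0, 2.0), (2.0, -3.0)]
labels = dbscan(synthetic_points)
```

The number of clusters is never passed in; it follows from `eps` and `min_samples`, which is why the outcome should be checked for sensitivity to both.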

In Table 1 we summarize this distinction between a main cluster and several outliers, together with the associated functional classification of the TA pairs. This distinction can serve as a starting point for comparing this statistical result with the wealth of biological information available for each of these TA modules. For the TA module hipB-hipA, for example, the mode of action has been debated in recent years^64,65, but is still not clear¹. The similarity of its estimated parameters to those of higA-higB, hicB-hicA and other members of the main cluster may be seen as evidence for a functional classification of this TA system as an RNA interferase and may guide further attempts at functional elucidation, in particular a better understanding of superfamilies of type II TA systems⁶⁶.

Table 1 Identified clusters and their functional classification.

Appendix B (Supplementary Materials) contains the results for another time-resolved gene expression data set, namely the data from⁷² which are available at GEO (accession number: GSE131992).

In Appendix C (Supplementary Materials), we present in tabular form the biological information relevant to the members of the clusters identified in Fig. 8, as obtained from The Universal Protein Resource (UniProt), a comprehensive resource for protein sequence and annotation data (https://www.uniprot.org).

Conclusions

We have proposed a minimal model that is able to capture the dynamics of TA systems in E. coli and agrees reasonably well with experimental high-throughput RNA-Seq data. We find that a minimal acceptable model of TA regulation should at least include a negative feedback loop through TA complex formation and the effect of toxin-induced growth modulation. Despite the obvious over-simplifications of the model (e.g. we study each TA pair in isolation and do not account for the influence of the remaining toxin proteins on cell growth), the model is able to replicate a variety of experimental curves.

With the availability of more time-resolved high-quality gene expression data, the description of time courses of systemic components with the help of simple mathematical models can provide an important instrument for the interpretation of such high-throughput data and thus bridge the gap between Theoretical Biology, Statistical Physics and Systems Biology⁷³.

Data availability

The datasets analysed and the custom code used during the current study are available from the corresponding author on reasonable request. They are also available for direct download from Zenodo at https://doi.org/10.5281/zenodo.5162947.

References

Yamaguchi, Y. & Inouye, M. Regulation of growth and death in Escherichia coli by toxin–antitoxin systems. Nat. Rev. Microbiol. 9, 779–790 (2011).
Page, R. & Peti, W. Toxin-antitoxin systems in bacterial growth arrest and persistence. Nat. Chem. Biol. 12, 208–214 (2016).
Pandey, D. P. & Gerdes, K. Toxin-antitoxin loci are highly abundant in free-living but lost from host-associated prokaryotes. Nucleic Acids Res. 33, 966–976 (2005).
Balaban, N. Q., Merrin, J., Chait, R., Kowalik, L. & Leibler, S. Bacterial persistence as a phenotypic switch. Science 305, 1622–1625 (2004).
Harms, A., Brodersen, D. E., Mitarai, N. & Gerdes, K. Toxins, targets, and triggers: An overview of toxin-antitoxin biology. Mol. Cell 70, 768–784 (2018).
Thisted, T., Sørensen, N., Wagner, E. & Gerdes, K. Mechanism of post-segregational killing: Sok antisense RNA interacts with Hok mRNA via its $5^\prime$-end single-stranded leader and competes with the $3^\prime$-end of Hok mRNA for binding to the mok translational initiation region. EMBO J. 13, 1960–1968 (1994).
Gerdes, K., Nielsen, A., Thorsted, P. & Wagner, E. G. H. Mechanism of killer gene activation. Antisense RNA-dependent RNase III cleavage ensures rapid turn-over of the stable hok, srnB and pndA effector messenger RNAs. J. Mol. Biol. 226, 637–649 (1992).
Brantl, S. & Jahn, N. sRNAs in bacterial type I and type III toxin–antitoxin systems. FEMS Microbiol. Rev. 39, 413–427 (2015).
Pedersen, K. & Gerdes, K. Multiple hok genes on the chromosome of Escherichia coli. Mol. Microbiol. 32, 1090–1102 (1999).
Brown, J. M. & Shaw, K. J. A novel family of Escherichia coli toxin–antitoxin gene pairs. J. Bacteriol. 185, 6600–6608 (2003).
Masuda, H., Tan, Q., Awano, N., Wu, K. P. & Inouye, M. YeeU enhances the bundling of cytoskeletal polymers of MreB and FtsZ, antagonizing the CbtA (YeeV) toxicity in Escherichia coli. Mol. Microbiol. 84, 979–989 (2012).
Wang, X. et al. A new type V toxin–antitoxin system where mRNA for toxin GhoT is cleaved by antitoxin GhoS. Nat. Chem. Biol. 8, 855 (2012).
Wang, X. et al. Type II toxin/antitoxin MqsR/MqsA controls type V toxin/antitoxin GhoT/GhoS. Environ. Microbiol. 15, 1734–1744 (2013).
Aakre, C. D., Phung, T. N., Huang, D. & Laub, M. T. A bacterial toxin inhibits DNA replication elongation through a direct interaction with the $\beta$ sliding clamp. Mol. Cell 52, 617–628 (2013).
Balaban, N. Q. et al. Definitions and guidelines for research on antibiotic persistence. Nat. Rev. Microbiol. 17, 441–448 (2019).
Sneppen, K., Micheelsen, M. A. & Dodd, I. B. Ultrasensitive gene regulation by positive feedback loops in nucleosome modification. Mol. Syst. Biol. (2008).
Fasani, R. A. & Savageau, M. A. Molecular mechanisms of multiple toxin-antitoxin systems are coordinated to govern the persister phenotype. Proc. Natl. Acad. Sci. 110, E2528–E2537 (2013).
Fraikin, N., Goormaghtigh, F. & Van Melderen, L. Type II toxin–antitoxin systems: Evolution and revolutions. J. Bacteriol. 202 (2020).
Leplae, R. et al. Diversity of bacterial type II toxin–antitoxin systems: A comprehensive search and functional analysis of novel families. Nucleic Acids Res. 39, 5513–5525 (2011).
Ghafourian, S., Raftari, M., Sadeghifard, N. & Sekawi, Z. Toxin–antitoxin systems: Classification, biological function and application in biotechnology. Curr. Issues Mol. Biol. 16, 9–14 (2014).
Fasani, R. A. & Savageau, M. A. Unrelated toxin–antitoxin systems cooperate to induce persistence. J. R. Soc. Interface 12, 20150130 (2015).
Gerdes, K. Hypothesis: Type I toxin–antitoxin genes enter the persistence field—A feedback mechanism explaining membrane homoeostasis. Philos. Trans. R. Soc. B Biol. Sci. 371, 20160189 (2016).
Kedzierska, B. & Hayes, F. Emerging roles of toxin-antitoxin modules in bacterial pathogenesis. Molecules 21, 790 (2016).
Massey, S. E. & Mishra, B. Origin of biomolecular games: Deception and molecular evolution. J. R. Soc. Interface 15, 20180429 (2018).
Ruangprasert, A. et al. Mechanisms of toxin inhibition and transcriptional repression by Escherichia coli DinJ-YafQ. J. Biol. Chem. 289, 20559–20569 (2014).
Ni, S. et al. Conjugative plasmid-encoded toxin-antitoxin system PrpT/PrpA directly controls plasmid copy number. Proc. Natl. Acad. Sci. 118(4), e2011577118 (2021).
Yamaguchi, Y., Park, J.-H. & Inouye, M. MqsR, a crucial regulator for quorum sensing and biofilm formation, is a GCU-specific mRNA interferase in Escherichia coli. J. Biol. Chem. 284(42), 28746–28753 (2009).
Takagi, H. et al. Crystal structure of archaeal toxin–antitoxin RelE-RelB complex with implications for toxin activity and antitoxin effects. Nat. Struct. Mol. Biol. 12, 327 (2005).
Zhang, Y., Zhu, L., Zhang, J. & Inouye, M. Characterization of ChpBK, an mRNA interferase from Escherichia coli. J. Biol. Chem. 280, 26080–26088 (2005).
Motiejūnaitė, R., Armalytė, J., Markuckas, A. & Sužiedėlienė, E. Escherichia coli dinJ-yafQ genes act as a toxin–antitoxin module. FEMS Microbiol. Lett. 268, 112–119 (2007).
Prysak, M. H. et al. Bacterial toxin YafQ is an endoribonuclease that associates with the ribosome and blocks translation elongation through sequence-specific and frame-dependent mRNA cleavage. Mol. Microbiol. 71, 1071–1087 (2009).
Kamada, K. & Hanaoka, F. Conformational change in the catalytic site of the ribonuclease YoeB toxin by YefM antitoxin. Mol. Cell 19, 497–509 (2005).
Zhang, Y. & Inouye, M. The inhibitory mechanism of protein synthesis by YoeB, an Escherichia coli toxin. J. Biol. Chem. 284, 6627–6638 (2009).
Keren, I., Shah, D., Spoering, A., Kaldalu, N. & Lewis, K. Specialized persister cells and the mechanism of multidrug tolerance in Escherichia coli. J. Bacteriol. 186, 8172–8180 (2004).
Korch, S. B., Henderson, T. A. & Hill, T. M. Characterization of the hipA7 allele of Escherichia coli and evidence that high persistence is governed by (p)ppGpp synthesis. Mol. Microbiol. 50, 1199–1213 (2003).
Zhang, Y., Yamaguchi, Y. & Inouye, M. Characterization of YafO, an Escherichia coli toxin. J. Biol. Chem. 284, 25522–25531 (2009).
Brown, B. L. et al. Three dimensional structure of the MqsR:MqsA complex: A novel TA pair comprised of a toxin homologous to RelE and an antitoxin with unique properties. PLoS Pathog. 5, e1000706 (2009).
Vandervelde, A., Loris, R., Danckaert, J. & Gelens, L. Computational methods to model persistence. In Methods in Molecular Biology Vol. 1333 (eds Michiels, J. & Fauvart, M.) 207–240 (Springer, 2016).
Cataudella, I., Sneppen, K., Gerdes, K. & Mitarai, N. Conditional cooperativity of toxin–antitoxin regulation can mediate bistability between growth and dormancy. PLoS Comput. Biol. 9, e1003174 (2013).
Cataudella, I., Trusina, A., Sneppen, K., Gerdes, K. & Mitarai, N. Conditional cooperativity in toxin–antitoxin regulation prevents random toxin activation and promotes fast translational recovery. Nucleic Acids Res. 40, 6424–6434 (2012).
Beber, M. E., Sobetzko, P., Muskhelishvili, G. & Hütt, M. T. Interplay of digital and analog control in time-resolved gene expression profiles. EPJ Nonlinear Biomed. Phys. 4, 8 (2016).
Gelens, L., Hill, L., Vandervelde, A., Danckaert, J. & Loris, R. A general model for toxin–antitoxin module dynamics can explain persister cell formation in E. coli. PLoS Comput. Biol. 9, e1003190 (2013).
Nikolic, N. et al. Autoregulation of mazEF expression underlies growth heterogeneity in bacterial populations. Nucleic Acids Res. 46, 2918–2931 (2018).
Tian, C., Semsey, S. & Mitarai, N. Synchronized switching of multiple toxin–antitoxin modules by (p)ppGpp fluctuation. Nucleic Acids Res. 45, 8180–8189 (2017).
Shen-Orr, S. S., Milo, R., Mangan, S. & Alon, U. Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet. 31, 64 (2002).
Alon, U. Network motifs: Theory and experimental approaches. Nat. Rev. Genet. 8, 450–461 (2007).
Marr, C., Geertz, M., Hütt, M. T. & Muskhelishvili, G. Dissecting the logical types of network control in gene expression profiles. BMC Syst. Biol. 2, 18 (2008).
Travers, A., Muskhelishvili, G. & Thompson, J. DNA information: From digital code to analogue structure. Philos. Trans. R. Soc. A 370, 2960–2986 (2012).
Sonnenschein, N., Geertz, M., Muskhelishvili, G. & Hütt, M. T. Analog regulation of metabolic demand. BMC Syst. Biol. 5, 40 (2011).
Kosmidis, K., Jablonski, K. P., Muskhelishvili, G. & Hütt, M. T. Chromosomal origin of replication coordinates logically distinct types of bacterial genetic regulation. NPJ Syst. Biol. Appl. 6, 1–9 (2020).
Fang, X. et al. Global transcriptional regulatory network for Escherichia coli robustly connects gene expression to transcription factor activities. Proc. Natl. Acad. Sci. 114, 10286–10291 (2017).
Vet, S., Vandervelde, A. & Gelens, L. Excitable dynamics through toxin-induced mRNA cleavage in bacteria. PLoS ONE 14 (2019).
Himeoka, Y. & Mitarai, N. Modeling slow-processing of toxin messenger RNAs in type-I toxin–antitoxin systems: Post-segregational killing and noise filtering. Phys. Biol. 16, 026001 (2019).
Alon, U. An Introduction to Systems Biology: Design Principles of Biological Circuits (CRC Press, 2019).
LeRoux, M., Culviner, P. H., Liu, Y. J., Littlehale, M. L. & Laub, M. T. Stress can induce transcription of toxin–antitoxin systems without activating toxin. Mol. Cell (2020).
DiStefano, J. III. Dynamic Systems Biology Modeling and Simulation (Academic Press, 2015).
Cao, Z. & Grima, R. Accuracy of parameter estimation for auto-regulatory transcriptional feedback loops from noisy data. J. R. Soc. Interface 16, 20180967 (2019).
Blattman, S. B., Jiang, W., Oikonomou, P. & Tavazoie, S. Prokaryotic single-cell RNA sequencing by in situ combinatorial indexing. Nat. Microbiol. 5(10), 1192–1201 (2020).
Taniguchi, Y. et al. Quantifying E. coli proteome and transcriptome with single-molecule sensitivity in single cells. Science 329, 533–538 (2010).
Virtanen, P. et al. SciPy 1.0: Fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
Jolliffe, I. Principal Component Analysis 2nd edn. (Springer, 2002).
Kosmidis, K. & Macheras, P. A fractal kinetics SI model can explain the dynamics of COVID-19 epidemics. PLoS ONE 15, e0237304 (2020).
Schubert, E., Sander, J., Ester, M., Kriegel, H. P. & Xu, X. DBSCAN revisited, revisited: Why and how you should (still) use DBSCAN. ACM TODS 42, 1–21 (2017).
Germain, E., Castro-Roa, D., Zenkin, N. & Gerdes, K. Molecular mechanism of bacterial persistence by HipA. Mol. Cell 52, 248–254 (2013).
Hansen, S., Vulić, M., Min, J., Yen, T. J., Schumacher, M. A., Brennan, R. G. & Lewis, K. Regulation of the Escherichia coli HipBA toxin–antitoxin system by proteolysis. PLoS ONE 7 (2012).
Guglielmini, J. & Van Melderen, L. Bacterial toxin–antitoxin systems: Translation inhibitors everywhere. Mobile Genet. Elem. 1, 283–306 (2011).
Armalytė, J., Jurėnaitė, M., Beinoravičiūtė, G., Teišerskas, J. & Sužiedėlienė, E. Characterization of Escherichia coli dinJ-yafQ toxin–antitoxin system using insights from mutagenesis data. J. Bacteriol. 194, 1523–1532 (2012).
Gerdes, K. Type II Toxin-Antitoxins Loci: The relBE Family 69–92 (Springer, 2013).
Hurley, J. M. & Woychik, N. A. Bacterial toxin HigB associates with ribosomes and mediates translation-dependent mRNA cleavage at A-rich sites. J. Biol. Chem. 284, 18605–18613 (2009).
Unterholzner, S. J., Poppenberger, B. & Rozhon, W. Toxin–antitoxin systems: Biology, identification, and application. Mobile Genet. Elem. 3, e26219 (2013).
Wei, Y., Zhan, L., Gao, Z., Privé, G. G. & Dong, Y. Crystal structure of GnsA from Escherichia coli. Biochem. Biophys. Res. Commun. 462(1), 1–7 (2015).
Lempp, M. et al. Systematic identification of metabolites controlling gene expression in E. coli. Nat. Commun. 10, 1–9 (2019).
Schureck, M. A. et al. Structural basis of transcriptional regulation by the HigA antitoxin. Mol. Microbiol. 111, 1449–1462 (2019).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Physics Department, Aristotle University of Thessaloniki, 54124, Thessaloníki, Greece
Kosmas Kosmidis
Department of Life Sciences and Chemistry, Jacobs University Bremen, 28759, Bremen, Germany
Marc-Thorsten Hütt

Authors

Kosmas Kosmidis
Marc-Thorsten Hütt

Contributions

K.K. and M.-T.H. designed the research, K.K. performed the research, and K.K. and M.-T.H. analyzed the results and wrote the manuscript.

Corresponding author

Correspondence to Kosmas Kosmidis.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Kosmidis, K., Hütt, MT. A minimal model for gene expression dynamics of bacterial type II toxin–antitoxin systems. Sci Rep 11, 19516 (2021). https://doi.org/10.1038/s41598-021-98570-z

Received: 22 June 2021
Accepted: 07 September 2021
Published: 30 September 2021
DOI: https://doi.org/10.1038/s41598-021-98570-z
