Introduction

The ribosome is at the core of any (known) self-replicating organism. In a process called translation, ribosomes read the instructions from messenger ribonucleic acids (mRNAs) to synthesize the corresponding proteins, including ribosomal proteins (rPs). This autocatalytic nature of ribosomes ultimately limits the doubling time of a cell to the period it takes a ribosome to synthesize itself1,2. In Escherichia coli this would be 6 min, assuming that the ribosome consists of a 55-protein complex of approximately 7400 amino acids (AAs) that is translated at a speed of 21 AA/sec3. In fact, even in growth-optimized E. coli, that doubling limit remains far from being reached4. Nonetheless, it has been proposed that ribosomes, not only in E. coli, have been subjected to strong selective pressure to minimize their own duplication time in order to speed up the production of all other proteins5. With this principle in mind, Reuveni et al.5 explain why ribosomes have many rPs of similar length.

Ribosomes are structures that have developed over time by adding ribosomal ribonucleic acid (rRNA) and rP around a central core6. This core is considered to be a leftover from ancient translation systems that evolved alongside the genetic code. Different types of ribosomes have evolved in bacteria, archaea, and eukaryotes, but their overall structures are similar within each kingdom7. For example, the mass of ribosomes in prokaryotes is made up of approximately 63% rRNA and 37% rPs8,9, whereas eukaryotic ribosomes have an equal mass distribution of rRNA and rPs5,10,11. Thus, the question arises whether there is an evolutionary advantage in having such a high RNA content.

It has been suggested that the ribosome composition can be understood as a competition for resources between rRNA synthesis and rP synthesis12,13. In particular, Kostinski and Reuveni12 derived two upper bounds on growth rate resulting from two autocatalytic loops, one for rP production, and one for RNA polymerase (RNAP) and rRNA production. By analyzing allocation data from E. coli, they concluded that maximum growth rate occurs at the current ribosome composition of 2/3 RNA and 1/3 protein. However, the specific processes that limit the two autocatalytic processes remained elusive.

Here we aim to provide a mechanistic understanding of these processes. We set up a small (coarse-grained) model of a self-replicating cell and perform resource balance analysis (RBA)14,15. In particular, we vary ribosome composition and ribosome allocations (fractions of ribosomes allocated to the synthesis of different proteins) and maximize growth rate. For simplicity, we assume a constant ribosome mass.

We focus on the primary role of ribosomal proteins, which is stabilizing rRNA (by preventing its degradation or misfolding). Ribosome protein content might also affect other parameters, such as translation rate. Proteins are generally better catalysts than RNA16, but the ribosome’s catalytic core is formed by rRNA17 and operates at a relatively slow catalytic rate compared to typical enzymes. This suggests that there is little evolutionary pressure to increase the catalytic rate. Furthermore, ribosomes with the lowest protein content, like the E. coli ribosome, exhibit the highest translation rates18,19,20. Therefore, we do not consider the impact on translation rate in this study.

We find that the costs of stabilizing rRNA strongly influence the optimal ribosome composition. If we neglect rRNA turnover, our predictions suggest the presence of RNA-only ribosomes (in contrast to experimental evidence). Taking RNA degradation into account increases its biosynthesis costs, and maximum growth occurs for a mixed (RNA+protein) ribosome.

Results

We introduce a (coarse-grained) mechanistic model of a self-fabricating cell and investigate optimal ribosome composition using RBA. That is, we maximize growth rate under several sets of constraints. We validate the model by predicting RNAP fluxes (rRNA synthesis fluxes), and RNA to protein ratios at different growth rates. Ultimately, we predict maximum growth rate at different ribosome compositions.

A small model of a self-fabricating cell

We consider the small (coarse-grained) model of a self-fabricating cell depicted in Fig. 1. The cell imports a carbon source (C) and has two types of metabolic enzymes, one synthesizing amino acids (AA) from the carbon source and the other one synthesizing nucleotides (NT) from the carbon source and amino acids. RNA polymerase (RNAP) uses nucleotides to form the ribosomal RNA (rRNA), while the ribosome (R) uses amino acids to synthesize all proteins, including the importer (IC), the metabolic enzymes (EAA, ENT), the RNA polymerase and optionally a ribonuclease (RNase), the ribosomal assembly factors (AF), and the ribosomal proteins (rP). Finally, the assembly factors build the ribosome from ribosomal RNA and protein. In a base model, we neglect RNA degradation, whereas in an extended model we consider the enzyme (RNase) that breaks down RNA into nucleotides. We now provide a more formal definition of the two models.

Fig. 1: A small model of a self-fabricating cell.
figure 1

a The cell imports a carbon source (C) and has two types of metabolic enzymes synthesizing amino acids (AA) from the carbon source and nucleotides (NT) from the carbon source and amino acids. The RNA polymerase (RNAP) uses nucleotides to form the ribosomal RNA (rRNA), and the ribosome (R) uses amino acids to synthesize the importer (IC), the metabolic enzymes (EAA, ENT), the ribosomal assembly factors (AF), and the ribosomal protein (rP). Finally, the assembly factors build the ribosome from ribosomal RNA and protein. The processes above constitute the base model. In the extended model, RNase degrades ribosomal RNA (and is synthesized by the ribosome). The additional processes are shown in red. Created with BioRender.com. b The resulting stoichiometric matrix and the corresponding flux vector. Here, s is used for protein synthesis reactions (and w for the corresponding fluxes), and r is used for all other reactions (and v for the corresponding fluxes). Additional columns and rows for the extended model are shown in red.

Given the stoichiometric matrix N and the vector of molar masses ω, the dynamic model of cellular growth relates growth rate μ, the vector of (metabolite, RNA, protein, and ribosome) concentrations c, and the vector of fluxes (v for enzymatic reactions and w for protein synthesis) according to

$$\frac{{{{{{{{\rm{d}}}}}}}}{{{{{{{\boldsymbol{c}}}}}}}}}{{{{{{{{\rm{d}}}}}}}}t}={{{{{{{\boldsymbol{N}}}}}}}}\left(\begin{array}{c}{{{{{{{\boldsymbol{v}}}}}}}}\\ {{{{{{{\boldsymbol{w}}}}}}}}\end{array}\right)-\mu \,{{{{{{{\boldsymbol{c}}}}}}}}\qquad \,{{\mbox{and}}}\,\qquad {{{{{{{{\boldsymbol{\omega }}}}}}}}}^{T}{{{{{{{\boldsymbol{c}}}}}}}}=1.$$

At steady state, growth rate μ and concentrations c are determined by the fluxes v and w,

$${{{{{{{\boldsymbol{N}}}}}}}}\left(\begin{array}{c}{{{{{{{\boldsymbol{v}}}}}}}}\\ {{{{{{{\boldsymbol{w}}}}}}}}\end{array}\right)=\mu \,{{{{{{{\boldsymbol{c}}}}}}}}\ge {{{{{{{\mathbf{0}}}}}}}}\qquad \,{{\mbox{and}}}\,\qquad \mu ={{{{{{{{\boldsymbol{\omega }}}}}}}}}^{T}{{{{{{{\boldsymbol{N}}}}}}}}\left(\begin{array}{c}{{{{{{{\boldsymbol{v}}}}}}}}\\ {{{{{{{\boldsymbol{w}}}}}}}}\end{array}\right).$$
(1)

To take limited cellular resources into account, we consider capacity constraints for the enzymatic fluxes v, including transcription (and optionally RNA degradation),

$${v}_{i}\le {k}_{i}^{{\mathsf{cat}}}{c}_{i},\quad i\in \{{\mathsf{IC}},{\mathsf{EAA}},{\mathsf{ENT}},{\mathsf{AF}}\},$$
(2a)
$${n}_{{\mathsf{rRNA}}}{v}_{{\mathsf{RNAP}}}\le {\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}{c}_{{\mathsf{RNAP}}},$$
(2b)
$$({n}_{{\mathsf{rRNA}}}{v}_{{\mathsf{RNase}}}\le {k}^{{{\rm{deg}}}}_{{\mathsf{RNase}}}{c}_{{\mathsf{RNase}}}).$$
(2c)

Further, we consider the ribosome capacity constraint for the protein synthesis fluxes w,

$$ \mathop{\sum }\limits_{i\in {\mathsf{Proteins}}}{n}_{i}{w}_{i}\le \,{\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}{c}_{{\mathsf{R}}},\quad \\ {\mathsf{Proteins}}=\{{\mathsf{IC}},{\mathsf{EAA}},{\mathsf{ENT}},{\mathsf{RNAP}},({\mathsf{RNase}}),{\mathsf{AF}},{\mathsf{rP}}\}.$$
(2d)

Here, nrRNA is the number of nucleotides in rRNA, ni is the number of amino acids in protein i, \({k}_{i}^{{\mathsf{cat}}}\) is the corresponding enzyme turnover rate, and \({\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}={k}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}{f}_{{\mathsf{RNAP}}}^{{\mathsf{act}}}\) and \({\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}={k}_{{\mathsf{R}}}^{{\mathsf{el}}}{f}_{{\mathsf{R}}}^{{\mathsf{act}}}\) are the effective transcription and translation elongation rates, respectively. \({f}_{{\mathsf{RNAP}}}^{{\mathsf{act}}}\) is the fraction of actively transcribing RNAPs, and \({f}_{{\mathsf{R}}}^{{\mathsf{act}}}\) is the fraction of actively translating ribosomes. As mentioned above, RNase is synthesized optionally and hence put in brackets. By defining the ribosome allocations,

$${\phi }_{i}^{{\mathsf{R}}}=\frac{{n}_{i}{w}_{i}}{{\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}{c}_{{\mathsf{R}}}},\quad i\in {\mathsf{Proteins}},$$
(3)

that is, the fraction of ribosomes translating a certain protein i, constraint (2d) can be written as

$$\mathop{\sum}\limits_{i\in {\mathsf{Proteins}}}{\phi }_{i}^{{\mathsf{R}}}\le 1.$$

We refer to the model given by Equations (1) and (2a, b, d) as the base RBA model. Equations (1) and (2), including (2c), define the extended RBA model which additionally considers RNA degradation.

Throughout our study, we consider a fixed molar ribosome mass ωR, but variable rRNA and protein content,

$${\omega }_{{\mathsf{R}}}={n}_{{\mathsf{rRNA}}}\,{\omega }_{{\mathsf{NT}}}+{n}_{{{{{{{{\rm{rP}}}}}}}}}\,{\omega }_{{\mathsf{AA}}},$$

and we study the influence of ribosome composition on the cell’s maximum growth rate, under the constraints specified above. Here, ωNT and ωAA are the molar masses of nucleotides and amino acids, respectively. For convenience, we define the ribosomal protein (mass) fraction

$${x}_{{{{{{{{\rm{rP}}}}}}}}}={n}_{{{{{{{{\rm{rP}}}}}}}}}\,\frac{{\omega }_{{\mathsf{AA}}}}{{\omega }_{{\mathsf{R}}}},$$
(4)

and express nrRNA and nrP by xrP,

$${n}_{{\mathsf{rRNA}}}=(1-{x}_{{{{{{{{\rm{rP}}}}}}}}})\,\frac{{\omega }_{{\mathsf{R}}}}{{\omega }_{{\mathsf{NT}}}}\quad \,{{\mbox{and}}}\,\quad {n}_{{{{{{{{\rm{rP}}}}}}}}}={x}_{{{{{{{{\rm{rP}}}}}}}}}\,\frac{{\omega }_{{\mathsf{R}}}}{{\omega }_{{\mathsf{AA}}}}.$$

In our analysis, we vary ribosomal protein fraction and maximize growth rate under given constraints. Modeling details can be found in section Methods/subsection Model, the stoichiometric, capacity, and (dry) mass constraints are summarized in Table 1, and the parameter values, variables and the units are given in Supplementary Table 1. We used parameters from E. coli grown in six different media. Three of them are rich media (Gly+AA, Glc+AA, LB) where amino acids (and nucleotides) are provided so cells only have to express the corresponding transporters instead of the synthesis pathways. In our model, the enzymes ENT and EAA represent lumped pathways for glycolysis and nucleotide/amino acid synthesis, and we only consider one type of transporter. Therefore, to model the changing nutrient quality of the different media (inspired by Scott et al.21), we assume that turnover numbers of EAA and ENT increase with growth rate.

Table 1 Constraints used in the extended and base models (with and without RNA degradation), see Fig. 1.

Base model recovers linear correlation of RNA to protein ratio with growth rate

With parameters for E. coli in different media (listed in Supplementary Table 1) and the experimentally observed ribosome composition (xrP = 36%), the base model correctly recovers the well-known linear dependence of the RNA to protein ratio and growth rate21, see Fig. 2a, but not the offset at zero growth rate, since our model does not contain any non-growth associated processes and we assume constant translation elongation rate \({k}_{{\mathsf{R}}}^{{\mathsf{el}}}\) as in Scott et al.21. At low growth rate, \({k}_{{\mathsf{R}}}^{{\mathsf{el}}}\) decreases, most likely because of the lower availability of the required substrates20,22. Interestingly, when we use variable \({k}_{{\mathsf{R}}}^{{\mathsf{el}}}\), we observe a nonzero offset (Supplementary Fig. 1).

Fig. 2: Validation of the base model.
figure 2

a The model predicts a linear relationship between RNA to protein ratio and growth rate. The points represent the predicted maximum growth rates in six experimental conditions (Supplementary Table 1). The line is a linear fit. b Alternative RNAP fluxes at different non-optimal growth rates in glucose minimal medium. These are obtained by varying the growth rate step by step from zero to maximum and enumerating all solutions (elementary growth vectors as defined in Müller et al.23) for each growth rate. The grey and blue lines are the alternative solutions. The blue line corresponds to solutions, where rRNA and ribosomes do not accumulate (constraints rRNA and cap R in Table 1 are limiting). Light green diamonds are experimental data from Bremer and Dennis20, black triangles are the data from Bremer and Dennis20 corrected for rRNA degradation28. Data was converted to mmol g−1 with E. coli dry masses from Bremer and Dennis20 and the number of nucleotides in rRNA (nrRNA) at xrP = 36%.

To further test the model, we predict RNAP fluxes (vRNAP) at various non-optimal growth rates in glucose minimal medium. In particular, we compute alternative solutions to the system of (in)equalities (1) and (2). (Technically, these solutions are elementary growth vectors as defined in Müller et al.23). We observe three lines (Fig. 2b). Two lines (in gray) correspond to solutions where either ribosomes or rRNA accumulate in excess of what is needed to support growth. In other words, constraints (2d) and (2b) (rows cap R and rRNA in Table 1) are not limiting. With increasing growth rate, the excess of rRNA and ribosome decreases, reaching zero at the maximum growth rate. The third line (in blue) corresponds to no accumulation of ribosomes or rRNA. In particular, the RNAP flux exactly matches the demand. At maximum growth rate, all lines converge to one optimal value.

For higher growth rates, experimental data are best fit by the line without accumulation of ribosomes or rRNA. In fact, the accumulation of free rRNA in a cell is biologically not realistic as it is bound by rPs already during transcription24. Furthermore, if rRNA is expressed in excess of rPs, it is rapidly degraded25. For the growth rates studied here (0.4–2.1 h−1), the fraction of inactive ribosomes stays roughly constant at 15% to 20%3,20,22. In our model, we have already incorporated this fraction using the effective translation elongation rate (\({\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}={k}_{{\mathsf{R}}}^{{\mathsf{el}}}{f}_{{\mathsf{R}}}^{{\mathsf{act}}}\)). (However, below the growth rate of ≈0.5 h−1, the fraction of active ribosomes rapidly decreases22). Therefore, the disagreement between experimental and simulated data at lower growth rates is probably caused by neglecting other types of RNA. Indeed, RNAP allocation to the synthesis of different types of RNA changes with growth rate3,20.

Base model predicts maximal growth for RNA-only ribosomes

We study the dependence of maximum growth rate on the ribosomal protein fraction using the base model described above. We find that, for realistic parameters from E. coli (Supplementary Table 1), rRNA synthesis is cheaper than protein synthesis for all tested growth conditions (see Fig. 3a). Thus, according to our base model, ribosomes should consist of rRNA only. Indeed, it has been suggested that higher growth rates could be achieved if ribosomes were to consist only of rRNA5.

Fig. 3: Maximum growth rate and ribosome allocations as functions of ribosomal protein fraction xrP for the base model.
figure 3

a Maximum growth rate for E. coli in six different conditions (see Supplementary Table 1). b Ribosome allocations \({\phi }_{i}^{{\mathsf{R}}}\) as defined in Eqn. (3), for glucose minimal medium (Glc).

If we (hypothetically) adjust the parameters to make rRNA synthesis more expensive than protein synthesis (e.g. by decreasing \({\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}\) or increasing \({\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\)), then maximum growth rate is achieved for a protein-only ribosome (Supplementary Fig. 2). By a symbolic analysis, we can rigorously prove that maximum growth rate is generically attained at an exclusive ribosome composition, either at xrP = 0% or xrP = 100%, regardless of the parameters (see section Methods/subsection Symbolic analysis of growth rate maximization).

To conclude, RBA with standard capacity constraints does not explain mixed (RNA+ protein) ribosomes. Thus, additional constraints are needed.

rRNA instability leads to maximal growth for mixed ribosomes

As one potential explanation, we hypothesize that the different stabilities of rPs and rRNA affect the composition of the ribosome. While proteins are known to be highly stable26, rRNA is susceptible to degradation by RNases, which are ubiquitous in cells27. Even at maximum growth, about 10% of rRNA is still degraded, and, thus, cannot be incorporated into the ribosome27,28. Furthermore, rRNA can easily misfold, rendering it inactive and prone to degradation24,29.

To account for rRNA degradation, we introduce an RNase enzyme that breaks down rRNA into individual nucleotides (NT), via the reaction

$${r}_{{\mathsf{RNase}}}:\,{\mathsf{rRNA}}\mathop{\to }\limits^{{\mathsf{RNase}}}{n}_{{\mathsf{rRNA}}}\cdot {\mathsf{NT}},$$

see Fig. 1. Since RNases are essential for quality control, we assume some minimum activity and add a minimum degradation rate,

$${v}_{{\mathsf{RNase}}}\ge {k}^{{{\rm{deg}}}}(1-{x}_{{{{{{{{\rm{rP}}}}}}}}})\,{c}_{{\mathsf{R}}},$$
(5)

to the list of constraints (row min deg in Table 1). In the simplest case, this rate is directly proportional (with a constant \({k}^{{{\rm{deg}}}}\)) to the rRNA concentration. The latter is given by the fraction of rRNA in the ribosome concentration, since there is no free rRNA in the cell3. Additionally, \({k}^{{{\rm{deg}}}}={k}^{{{\rm{deg}}}}({x}_{{{{\rm{rP}}}}})\) can be a (monotonically decreasing) function of xrP,

$${k}^{{{\rm{deg}}}}({x}_{{{{{{{{\rm{rP}}}}}}}}})={k}_{{{\mbox{max}}}}^{{{\rm{deg}}}}\left(1-\frac{{x}_{{{{{{{{\rm{rP}}}}}}}}}^{n}}{{K}^{n}+{x}_{{{{{{{{\rm{rP}}}}}}}}}^{n}}\right)$$
(6)

modeling the cooperative protection of rRNA by proteins, where n is Hill-factor and K is half-saturation constant. As for the other enzymes, we add a capacity constraint for the RNase to account for its cost,

$${n}_{{\mathsf{rRNA}}}\,{v}_{{\mathsf{RNase}}}\le {k}_{{\mathsf{RNase}}}^{{{\rm{deg}}}}\,{c}_{{\mathsf{RNase}}},$$
(7)

where we use \({k}_{{\mathsf{RNase}}}^{{{\rm{deg}}}}=88\,{{{{{{{\rm{NT}}}}}}}}\,{{{{{{{{\rm{s}}}}}}}}}^{-1}\) of an enzyme called RNase R30. The base RBA model together with RNA degradation, RNase synthesis, and constraints (5) and (7) constitutes the extended RBA model.

Taking rRNA degradation into account leads to maximum growth rates at mixed (RNA+protein) ribosome compositions (Fig. 4). As it turns out, the assumption of a constant \({k}^{{{\rm{deg}}}}\) in constraint (5) leads to a very shallow optimum (Fig. 4a). To account for the stabilizing influence of rPs on the folded structure, we introduce the non-linear (Hill-type) degradation term (Equation (6) with half-saturation K = 0.2 and Hill-factors n = 2 or n = 6), leading to a pronounced optimum, see Fig. 4b and c.

Fig. 4: Extended model. Accounting for RNA degradation leads to a mixed (RNA+protein) ribosome composition.
figure 4

ac Maximum growth rate of E. coli in six different conditions (see Supplementary Table 1) for three types of rRNA degradation. a No cooperativity of rPs. b Weak cooperativity of rPs. c Strong cooperativity of rPs. df Ribosome allocations in glucose minimal medium (Glc) for three types of rRNA degradation. At low protein fractions, rRNA degradation is high, and RNAP (light green) takes up a large amount of cellular resources.

In the following, we investigate how the optimal ribosome composition depends on growth conditions.

First, we study growth on glucose minimal medium and adjust \({k}_{{{\mbox{max }}}}^{{{\rm{deg}}}}\) such that the optimal ribosome composition matches the experimentally observed value of xrP = 0.36 for E. coli. We validate the model for the three types of degradation, and we correctly predict the linear dependence of the RNA to protein ratio on growth rate (Supplementary Fig. 3). However, RNAP flux predictions are only realistic when assuming strong cooperativity (n = 6). For the other two cases, rRNA degradation in the optimum is too high which leads to overestimated RNAP fluxes (Supplementary Fig. 4).

Second, we predict maximum growth rate as a function of the ribosomal protein fraction in six different growth media. We find that the more proteins cooperate, the less the optimal ribosome composition depends on the growth conditions, see Fig. 4a–c.

Using variable or constant \({k}_{{\mathsf{R}}}^{{\mathsf{el}}}\) has no impact on the predicted optimal ribosome composition. As in the base model, variable \({k}_{{\mathsf{R}}}^{{\mathsf{el}}}\) leads to predicted non-zero offset of RNA/protein ratio at zero growth rate (Supplementary Fig. 5).

Third, to further understand these results, we plot ribosome allocations for glucose minimal medium, see Fig. 4d–f. Interestingly, at low xrP, a large fraction of ribosomes is allocated to the production of RNAP, whereas with increasing xrP, this ribosome allocation rapidly drops. In the case of the highest cooperativity, allocations at the optimal xrP are comparable to the base RBA model (without RNA degradation), compare Fig. 4f with Fig. 3b.

Finally, we qualitatively predict that the fraction of degraded rRNA decreases with growth rate (Fig. 5), which is in agreement with experimental observations28. This effect gets stronger (and closer to experimental data) with higher rP cooperativity. The quantitative disagreement between the experimental and predicted values is probably due to the simplicity of our model. For example, it does not include other types of RNA or regulatory processes, both of which influence RNAP activity. If we consider RNAP allocation to rRNA (\({\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}={k}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}\,{f}_{{\mathsf{RNAP}}}^{{\mathsf{act}}}\,{\phi }_{{\mathsf{rRNA}}}^{{\mathsf{RNAP}}}\), where \({\phi }_{{\mathsf{rRNA}}}^{{\mathsf{RNAP}}}\) is the fraction of RNAP allocated to the synthesis of rRNA), the results get closer to the experimental data (Supplementary Fig. 6).

Fig. 5: Fraction of degraded rRNA at different growth rates.
figure 5

The extended model recapitulates the experimentally observed decrease in the fraction of degraded RNA with increasing growth rate. The circles are the predicted ratios of RNAse fluxes to RNAP fluxes at different conditions. The triangles represent experimental data from Gausing28, extracted from the original plot with WebPlotDigitizer66. Panels (ac) represent three types of rRNA degradation. a No cooperativity of rPs. b Weak cooperativity of rPs. c Strong cooperativity of rPs.

Based on these results, we conclude that accounting for RNA degradation and cooperative binding of rP can explain the mixed ribosome composition.

Extreme conditions increase the optimal protein fraction in (archaeal) ribosomes

As a straightforward extension, we explore whether the current model can be adapted to predict the ribosome composition of other organisms. For example, archaeal ribosomes contain 36–50% protein31, eukaryotic ribosomes 42–50% protein5,10,11, and mitochondrial ribosomes 51–89% protein32. We ask whether this variability can be explained by efficient resource allocation.

It has been hypothesized that the extra archaeal/eukaryotic ribosomal proteins primarily serve to stabilize the ribosomes33. This may be particularly important for archaea because they commonly live in extreme conditions, such as high temperatures or low pH, which may lead to higher (misfolding and) degradation of RNA. To mitigate this, archaea might need a higher protein content compared to bacteria. It has been shown that the initial steps in ribosome assembly of the thermophilic archaeon Sufolobus solfataricus do not require high temperature and likely involve core proteins that are also present in bacteria. However, completing the assembly requires high temperature, suggesting that these proteins have evolved to cope with such extreme conditions34,35.

We model this process by increasing \({k}_{{{\mbox{max}}}}^{{{\rm{deg}}}}\) which leads to a higher predicted protein content of the ribosome (Fig. 6). Similarly to E. coli, the higher the cooperativity, the lower the sensitivity of the optimum to the other parameters. Moreover, when using parameters from Thermococcus (see Supplementary Table 1) we observe an increase in ribosomal protein content, in accordance with experimental evidence31, and predict a decrease in growth rate.

Fig. 6: The model can be adjusted to predict archaeal protein-rich ribosome composition.
figure 6

The model was adapted to archaea by increasing \({k}_{\max }^{\circ}\) two-fold. The remaining parameters were either kept the same as in E. coli (red solid line), or parameters from Thermococcus (molecular masses of R and RNAP, transcription and translation rates, see Supplementary Table 1) were used (red dashed line). Panels (ac) represent three types of rRNA degradation. a No cooperativity of rPs. b Weak cooperativity of rPs. c Strong cooperativity of rPs.

Discussion

The ribosome is a central player in cellular self-fabrication, placing an upper bound on growth rate. To grow faster, a cell needs more ribosomes which, in turn, requires even more ribosomes to produce themselves. While most catalysts and molecular machines within a cell are proteins, ribosomes stand out by having a substantial (mass) fraction of rRNA, playing a catalytic role. The mass fraction of rPs varies across kingdoms, ranging from approximately 36% in prokaryotes9 to around 50% in eukaryotes10, and even higher in eukaryotic mitochondria, reaching up to 89% in Trypanosoma brucei32,36. This prompts the question: what factors determine the ratio of RNA to protein in ribosomes?

The analysis of our base model (without RNA degradation) suggests that RNA-only ribosomes maximize growth rate (Fig. 3). This results from the lower cost of rRNA synthesis compared to rP synthesis. It remains true even when one accounts for the synthesis of inactive RNAP and enzymes required for nucleotide synthesis5, which suggests that the costs of rRNA synthesis and associated processes are underestimated in the base model.

In order to explain a mixed (RNA+protein) ribosome, we consider rRNA degradation in our extended model, thereby increasing the costs for RNA synthesis. While rRNA that is already integrated into a ribosome is stable, nascent RNA may be susceptible to degradation27. Indeed, it has been experimentally observed that even at maximum growth rate, 10% of newly synthesized rRNA is degraded28, and the degradation rate increases if ribosome assembly is delayed27. Furthermore, when rRNA is overexpressed in excess of rPs, it is rapidly degraded25. Due to the extremely high rates at which rRNA is synthesized, errors become inevitable, necessitating the action of quality control enzymes such as polynucleotide phosphorylase (PNPase) and RNase R to ensure ribosome integrity37. The absence of the RNases results in the accumulation of rRNA fragments, ultimately leading to cell death27,38. In contrast, protein turnover is negligible20, and most ribosomal proteins can exist without rRNA and can be reused39,40. Therefore, we do not consider protein degradation in our model.

In our resource balance approach, decreasing the RNA content of the ribosome saves resources by reducing RNA turnover. At the same time, protein synthesis costs increase, leading to a mixed (RNA+protein) ribosome at maximum growth rate.

We include RNA degradation in two scenarios. (a) RNA is degraded at a rate proportional to its concentration, or (b) RNA degradation rate decreases non-linearly with ribosomal protein content, since proteins cooperatively protect RNA from degradation29,41,42. Both versions of the model predict an optimal mixed (RNA+protein) ribosome. However, without considering cooperative protein binding, optimal ribosome compositions depend on growth conditions. Notably, the higher the cooperativity, the closer the predicted RNAP fluxes and the fraction of degraded rRNA are to experimental data. Yet, more experimental data is needed to decide whether ribosome composition in E. coli remains truly independent of growth conditions when the bacterium is evolutionarily adapted to a single environment. Based on these results and available experimental evidence for cooperative protein binding42, we conclude that scenario (b) is more likely.

Our simple model lumps ribosome assembly and RNA degardation and hence allows multiple explanations for the precise mechanism. On the one hand, proteins may stabilize RNA either by blocking the access of RNases to RNA or by preventing misfolding. Intuitively, this could be explained by the fact that RNA molecules are long, and in order to protect them from misfolding and degradation, a certain critical amount of proteins is needed. Folding intermediates can get trapped in misfolded states and are subsequently degraded as a part of quality control. Proteins may help RNA to avoid these kinetic traps24,29,43,44. On the other hand, proteins may increase the rate of ribosome assembly and thereby reduce the number of ribosome intermediates (pre-R in Fig. 7). Indeed, it was observed that rRNA can fold to near-native conformation45,46. Yet, this process is slower than the protein-supported one, especially for long molecules24,47.

Fig. 7: Potential mechanisms by which ribosomal proteins affect the biosynthesis of ribosomes.
figure 7

Ribosomal proteins may decrease the degradation rate or misfolding of ribosome assembly intermediates (pre-R). Alternatively, they may increase the assembly rate. Created with BioRender.com.

Throughout the manuscript, we make use of two simplifications:

  • As in Kostinski and Reuveni12, we consider ribosomes with different compositions, but equal mass. RNA enzymes, known as ribozymes, are generally smaller than proteins and require only a few nucleotides for catalytic activity48. However, such small ribozymes are also inefficient. Increasing their size often improves turnover number, but may impede folding16,47,49. Therefore, we consider the case of a large, hard-to-fold, but catalytically efficient RNA-only ribosome.

  • We do not consider the effect of protein content on catalytic rates of the ribosomes. Proteins are generally more efficient catalysts than ribozymes16, yet rRNA is still present in the peptidyl transferase center17, and translation rate does not increase in ribosomes from different species which have higher protein content18,19. Furthermore, despite the modest catalytic rate of peptide bond formation, it does not appear to be the rate-limiting step. Given the size of the substrate molecules (mRNA), diffusion may be the limiting factor16,50. Therefore, we assume that enhancing ribosome catalytic rate is not the main reason for the addition of proteins. However, it is possible that proteins stabilize the ribosome structure and thereby indirectly ensure efficient peptide bond formation16.

In future versions of the model, these assumptions can be relaxed. Furthermore, incorporating other types of RNA (mRNA, tRNA) and energy metabolism, or even constructing a genome-scale RBA model51, will likely lead to more quantitative predictions of fluxes and growth rate. A strong indication of this is that including a variable RNAP allocation into the model leads to quantitatively better predictions (see Supplementary Fig. 6). Therefore, in the future, we aim to model RNAP allocation mechanistically. This will involve for example adding other RNA species (mRNA, tRNA), and considering non-specifically bound RNAP which is a considerable fraction of RNAP52.

In our current model, we approximate the cellular rRNA concentration using the ribosome’s rRNA content, see Equation (5). However, rRNA degradation likely begins already during transcription and ribosome assembly. This aspect is not captured in standard RBA as concentrations of all non-catalysts approach zero at maximum growth rates.

In the future, we aim to use growth balance analysis53,54. Growth balance analysis allows the integration of nonlinear kinetics, depending not only on catalyst concentrations but also on substrate concentrations. This will enable us to model RNA degradation based on the concentrations of free rRNA or assembly intermediates. While this shift may alter quantitative predictions, such as RNA degradation fluxes and estimates of \({k}_{{{\mbox{max }}}}^{{{\rm{deg}}}}\), the fundamental conclusions drawn from the model are expected to remain unchanged.

To better model protein-rich organisms such as archaea, the model could be expanded by including the temperature dependence of rRNA degradation and assembly in more detail. Apart from \({k}_{{{\mbox{max }}}}^{{{\rm{deg}}}}\), other parameters (e.g. K or n in the Hill function) might change too to capture the effects of extreme conditions. Furthermore, the effects of other extreme conditions (such as pH and osmolarity), and the reasons for the variability of archaeal ribosome composition could also be investigated55,56. However, the predictions of our current model are in agreement with the naive expectation that more proteins are required to keep ribosomes stable in harsh conditions. More experimental data is needed to model the archaeal ribosomes realistically.

Additionally, some extremophilic organisms such as the bacteria Chloroflexus aurantiacus or Fervidobacterium islandicum, exhibit ribosomes with lower protein content (approximately 40%) compared to extremophilic archaea (50%). It has been suggested that protein-rich ribosomes can be traced back to the oldest phylogenetic lineages, with some ribosomal proteins being lost over time31,57. Organisms with lower protein content in their ribosomes may have evolved alternative strategies to thrive in extreme conditions. Examples of such strategies include the presence of specific rRNA sequence variants or base modifications, as recently discussed by Nissley et al.58.

Moreover, certain archaeal species, such as those from Methanobacteriales or Halobacteriales, have transitioned to milder environmental conditions and subsequently shed unnecessary ribosomal proteins31,57.

To gain a comprehensive understanding of ribosome evolution in response to changing conditions, a thorough phylogenetic analysis is warranted. This analysis should be complemented by measurements of growth rate, translation rate, RNA degradation rate, among other parameters, to delineate the order of protein loss or gain, and the emergence of sequence variations and base modifications.

The protein content of eukaryotic ribosomes in the cytoplasm (approximately 50%) is higher than in bacteria10. This is consistent with the lower growth rates seen in eukaryotes like yeast and mammalian cells. Mitochondrial ribosomes show an even higher protein content, ranging from approximately 50% to 89%32. This may be advantageous since rP are not made directly in mitochondria but are imported for free from the cytoplasm59. Indeed, when we allow a free import of rP in our model, we observe that the optimum moves towards a protein-rich ribosome (Fig. 8). However, to accurately model eukaryotic ribosomes, it is essential to include the synthesis of both cytoplasmic and mitochondrial ribosomes, several different types of RNAPs, transport between nucleus and cytoplasm, and the dynamic interaction between host cells and mitochondria. While the cytoplasm provides ribosomal proteins for mitochondria, mitochondria synthesize enzymes of oxidative phosphorylation and provide ATP back to the host cell.

Fig. 8: The model can be adjusted to predict mitochondrial protein-rich ribosome composition.
figure 8

All parameters used for the simulation of mitochondria are the same as for E. coli in glucose minimal media, except a fraction of rPs can be imported for free from the cytoplasm and does not have to be synthesized. For simplicity, we assumed that 1/3 of rP are imported. (In reality, almost all rP are imported, but mitochondria make additional proteins to provide energy for the whole cell). Panels (ac) represent three types of rRNA degradation. a No cooperativity of rPs. b Weak cooperativity of rPs. c Strong cooperativity of rPs.

We hypothesize that eukaryotic cells can afford a higher protein content in their cytoplasmic and mitochondrial ribosomes without affecting the growth rate, and thereby gain additional functionalities that might provide a fitness advantage. Ribosomal proteins participate in translation processes, for example, binding of translation factors, release of tRNA, and translocation. They may also affect the fidelity of translation60. Furthermore, they play roles in various cellular processes such as cell proliferation, apoptosis, DNA repair, cell migration, and others33. These additional functions might have conferred evolutionary fitness advantages. Nevertheless, the primary role of ribosomal proteins seems to be stabilization and folding of rRNA33,60.

There are still many open questions about ribosome biogenesis and evolution. Our model could guide future experiments. There are a few studies that assessed the effect of individual rP deletions in E. coli, for example, mutation in S10 increased RNA degradation61, and mutation in L6 led to disrupted ribosomal assembly62. A systematic knock-out screen of all ribosomal proteins could be done (as in63), complemented with quantification of RNA degradation and misfolding. In case of extremophilic organisms with protein-rich ribosomes, temperature sensitivity could also be assessed. We would expect that deletion of the extra proteins would cause growth defects only at high temperatures. Furthermore, after removal of proteins from archaeal protein-rich ribosomes, laboratory evolution could be performed to see whether growth rate increases beyond wild-type.

Comprehensive datasets, akin to the work of Bremer and Dennis in 2008 for E. coli, should be generated for non-standard organisms by measuring various parameters such as transcription and translation rates, ribosome and RNAP activities, and other relevant factors.

Finally, phylogenetic analysis or ribosome evolution across different species and environments could be done.

Formal comparison with Kostinski and Reuveni (2020)

Our analysis is motivated by the previous work of Kostinski and Reuveni12, who understand ribosome composition as a competition between two autocatalytic loops. One loop is responsible for synthesizing rRNA, while the other loop is responsible for rP synthesis, both competing for limited resources. These loops and their constraints, namely, the stoichiometric constraints for rRNA and rP and the capacity constraint for RNAP, are contained in our more detailed RBA model, see Table 1. In addition to these three conditions, Kostinski and Reuveni12 make two more assumptions: they fix the ribosome allocations \({\phi }_{{\mathsf{rP}}}^{{\mathsf{R}}}\) and \({\phi }_{{\mathsf{RNAP}}}^{{\mathsf{R}}}\) for the synthesis of rP and RNAP, defined in Eqn. (3).

The resulting upper limits on growth rate can be derived easily by considering the synthesis of rRNA and rP, separately.

  • (rP) The stoichiometric constraint for rP is given by vAFwrP, see Table 1. Together with the definition of the corresponding ribosome allocation \({\phi }_{{\mathsf{rP}}}^{{\mathsf{R}}}=\frac{\mu \,{n}_{{\mathsf{rP}}}\,{w}_{{\mathsf{rP}}}}{{\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\,{v}_{{\mathsf{AF}}}}\), this yields

    $$\mu \le \frac{{\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\,{\phi }_{{\mathsf{rP}}}^{{\mathsf{R}}}}{{n}_{{\mathsf{rP}}}}.$$
    (8a)
  • (rRNA) The stoichiometric constraint for rRNA and the capacity constraint for RNAP are given by vAFvRNAP and \(\mu \,{n}_{{\mathsf{rRNA}}}\,{v}_{{\mathsf{RNAP}}}\le {\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}\,{w}_{{\mathsf{RNAP}}}\), see Table 1. By multiplication, they imply \(\mu \,{n}_{{\mathsf{rRNA}}}\,{v}_{{\mathsf{AF}}}\le {\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}\,{w}_{{\mathsf{RNAP}}}\). Together with the definition of the ribosome allocation \({\phi }_{{\mathsf{RNAP}}}^{{\mathsf{R}}}=\frac{\mu \,{n}_{{\mathsf{RNAP}}}\,{w}_{{\mathsf{RNAP}}}}{{\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\,{v}_{{\mathsf{AF}}}}\) for the synthesis of RNAP, this yields

    $${\mu }^{2}\le \frac{{\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\,{\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}\,{\phi }_{{\mathsf{RNAP}}}^{{\mathsf{R}}}}{{n}_{{\mathsf{rRNA}}}\,{n}_{{\mathsf{RNAP}}}}.$$
    (8b)

These upper bounds (8) are Eqns. (2) and (5) in Kostinski and Reuveni12, after inserting the effective transcription and translation elongation rate constants \({\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}={k}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}\,{f}_{{\mathsf{RNAP}}}^{{\mathsf{act}}}\,{\phi }_{{\mathsf{rRNA}}}^{{\mathsf{RNAP}}}\) and \({\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}={k}_{{\mathsf{R}}}^{{\mathsf{el}}}\,{f}_{{\mathsf{R}}}^{{\mathsf{act}}}\), respectively. Here, \({\phi }_{{\mathsf{rRNA}}}^{{\mathsf{RNAP}}}\) denotes the fraction of RNAP transcribing rRNA (which we assume to equal one in the rest of this work).

Using Eqn. (4), the two upper bounds (8) can be written as functions of the rP fraction xrP, namely as

$$\mu \le {\gamma }_{{\mathsf{rP}}}\frac{{\phi }_{{\mathsf{rP}}}^{{\mathsf{R}}}}{{x}_{{{{{{{{\rm{rP}}}}}}}}}}\quad \,{{\mbox{and}}}\,\quad \mu \le {\gamma }_{{\mathsf{RNAP}}}\sqrt{\frac{{\phi }_{{\mathsf{RNAP}}}^{{\mathsf{R}}}}{1-{x}_{{{{{{{{\rm{rP}}}}}}}}}}}$$
(9)

with constants γrP, γRNAP > 0. For fixed ribosome allocations \({\phi }_{{\mathsf{rP}}}^{{\mathsf{R}}}\) and \({\phi }_{{\mathsf{RNAP}}}^{{\mathsf{R}}}\), the two curves necessarily intersect at some \(0\, < \,{x}_{{{{{{{{\rm{rP}}}}}}}}}^{* }\, < \,1\), and \(\mu ({x}_{{{{{{{{\rm{rP}}}}}}}}}^{* })\) is the maximum growth rate allowed by the constraints considered above, namely the stoichiometric constraints for rP and rRNA, and the RNAP capacity constraint.

Kostinski and Reuveni12 interpret Eqns. (9) as a trade-off between rRNA and rP production. This effect arises because they fix the ribosome allocations. In particular, Kostinski and Reuveni12 fix \({\phi }_{{\mathsf{rP}}}^{{\mathsf{R}}}\) and \({\phi }_{{\mathsf{RNAP}}}^{{\mathsf{R}}}\) to experimental values for E. coli (in multiple growth conditions), and find that maximum growth rate occurs close to the current rP fraction (xrP = 36%), and the resulting μ(xrP) is close to the experimental value. If we use their parameters (see Supplementary Table 1), we can exactly reproduce their results (see Supplementary Fig. 7). Our base model provides an explanation for the protein investment costs, giving a proper mechanistic interpretation to the argument presented by Kostinski and Reuveni12. Moreover, it is closer to an evolutionary scenario, where a cell can adjust both ribosome composition xrP and ribosome allocations ϕR. However, the base model predicts an optimal ribosome that is RNA-only (for realistic parameters), see Fig. 3a. This is possible because the ribosome allocations are adjusted according to demand. The ribosome allocations corresponding to varying ribosomal protein fraction are illustrated in Fig. 3b. Only the extended model with RNA degradation predicts a mixed (RNA+protein) ribosome at maximum growth rate.

Methods

Our analysis is based on the small model of a self-replicating cell depicted in Fig. 1 and described below. Constraints are listed in Table 1 and parameters in Supplementary Table 1. For an introduction to resource allocation in next-generation models of cellular growth, including the definition of elementary growth vectors, see Müller et al.23. Elementary growth vectors were enumerated using the package efmtool 0.2.064 in Python 3.8.13.

Figures 1a and 7 were created with BioRender.com and the remaining figures with R version 4.1.2.

Model details

We consider the small model of a self-fabricating cell depicted in Fig. 1 which contains metabolic reactions and macromolecular synthesis reactions. To take into account the limitation of cellular resources, we use three types of capacity constraints: enzyme capacity constraints limit the rate of metabolic reactions, the RNAP capacity constraint limits transcription rate, and the ribosome capacity constraint limits the synthesis rates of all proteins (including the ribosomal proteins).

The cell takes up a carbon source (C) via the reaction

$${r}_{{\mathsf{IC}}}:\,\mathop{\to }\limits^{{\mathsf{IC}}}{\mathsf{C}},$$

catalyzed by the importer IC, and forms amino acids (AA), nucleotides (NT), and ribosomal rRNA (rRNA) via

$${r}_{{\mathsf{EAA}}}: \,{n}_{{\mathsf{AA}}}\cdot {\mathsf{C}}\mathop{\to }\limits^{{\mathsf{EAA}}}{\mathsf{AA}},\\ \quad {r}_{{\mathsf{ENT}}}: \,{n}_{{\mathsf{NT}}}\cdot {\mathsf{C}}+{\mathsf{AA}}\mathop{\to }\limits^{{\mathsf{ENT}}}{\mathsf{NT}},\quad \\ {r}_{{\mathsf{RNAP}}}: \,{n}_{{\mathsf{rRNA}}}\cdot {\mathsf{NT}}\mathop{\to }\limits^{{\mathsf{RNAP}}}{\mathsf{rRNA}},$$

catalyzed by the enzymes EAA, ENT, and the RNA polymerase (RNAP). Ultimately, the ribosome R is built from rRNA and ribosomal protein (rP) via

$${r}_{{\mathsf{AF}}}:\,{\mathsf{rRNA}}+{\mathsf{rP}}\mathop{\to }\limits^{{\mathsf{AF}}}{\mathsf{R}},$$

catalyzed by the assembly factors AF. The processes above are part of the base model. In an extended model, ribosomal RNA degrades via

$${r}_{{\mathsf{RNase}}}:\,{\mathsf{rRNA}}\mathop{\to }\limits^{{\mathsf{RNase}}}{n}_{{\mathsf{rRNA}}}\cdot {\mathsf{NT}},$$

catalyzed by the RNase. Finally, we consider the synthesis of all proteins (enzymes and ribosomal protein) via the reactions

$$ {s}_{i}: \,{n}_{i} \cdot {\mathsf{AA}}\mathop{\to }\limits^{{\mathsf{R}}}i,\quad \\ i\in {\mathsf{Proteins}}= \{{\mathsf{IC}},{\mathsf{EAA}},{\mathsf{ENT}},{\mathsf{RNAP}},({\mathsf{RNase}}),{\mathsf{AF}},{\mathsf{rP}}\},$$

catalyzed by the ribosome.

The resulting stoichiometric matrix and the corresponding flux vector are displayed in Fig. 1b, and parameter values are given in Supplementary Table 1. In fact, the stoichiometric matrix can be partitioned into two submatrices,

$${{{{{{{\boldsymbol{N}}}}}}}}=\left(\begin{array}{c}{{{{{{{{\boldsymbol{N}}}}}}}}}_{{\mathsf{Met}}}\\ {{{{{{{{\boldsymbol{N}}}}}}}}}_{{\mathsf{Cat}}}\end{array}\right),$$

corresponding to the metabolites Met = {C, AA, NT, rRNA, rP} and the catalysts Cat = Enz {R} including the enzymes Enz = {IC, EAA, ENT, RNAP, (RNase), AF} and the ribosome. The (total) flux vector, vtot, can be partitioned into two subvectors,

$${{{{{{{{\boldsymbol{v}}}}}}}}}_{{{{{{{{\rm{tot}}}}}}}}}=\left(\begin{array}{c}{{{{{{{\boldsymbol{v}}}}}}}}\\ {{{{{{{\boldsymbol{w}}}}}}}}\end{array}\right),$$

corresponding to the enzymatic reactions r and the protein synthesis reactions s.

In general, comprehensive models of cellular growth lead to linear (in-)equality systems for the fluxes, and concentrations are determined by fluxes, see Box 1. In the example, we distinguish enzymatic reactions r and protein synthesis reactions s (with corresponding fluxes v and w), and further metabolites Met and catalysts Cat, see above. Explicitly, we study the inequality system

$$\left(\begin{array}{c}{{{{{{{{\boldsymbol{N}}}}}}}}}_{{\mathsf{Met}}}\\ {{{{{{{{\boldsymbol{N}}}}}}}}}_{{\mathsf{Cat}}}\end{array}\right)\left(\begin{array}{c}{{{{{{{\boldsymbol{v}}}}}}}}\\ {{{{{{{\boldsymbol{w}}}}}}}}\end{array}\right)\ge {{{{{{{\mathbf{0}}}}}}}},\quad {{{{{{{\boldsymbol{v}}}}}}}}\ge {{{{{{{\mathbf{0}}}}}}}},\,{{{{{{{\boldsymbol{w}}}}}}}}\ge {{{{{{{\mathbf{0}}}}}}}},\quad \,{{\mbox{and}}}\,\quad \mu ={\omega }_{{\mathsf{C}}}{v}_{{\mathsf{IC}}},$$

since rIC is the only exchange reaction.

In fact, only \({{{{{{{{\boldsymbol{N}}}}}}}}}_{{\mathsf{Met}}}\left(\begin{array}{c}{{{{{{{\boldsymbol{v}}}}}}}}\\ {{{{{{{\boldsymbol{w}}}}}}}}\end{array}\right)\ge {{{{{{{\mathbf{0}}}}}}}}\) yields non-trivial constraints, since \({{{{{{{{\boldsymbol{N}}}}}}}}}_{{\mathsf{Cat}}}\left(\begin{array}{c}{{{{{{{\boldsymbol{v}}}}}}}}\\ {{{{{{{\boldsymbol{w}}}}}}}}\end{array}\right)\ge {{{{{{{\mathbf{0}}}}}}}}\) yields wi ≥ 0 for iEnz and vAF ≥ 0, already included in v0,  w0. However, NCat determines the catalyst concentrations via \(\mu \,{{{{{{{{\boldsymbol{c}}}}}}}}}_{{\mathsf{Cat}}}={{{{{{{{\boldsymbol{N}}}}}}}}}_{{\mathsf{Cat}}}\left(\begin{array}{c}{{{{{{{\boldsymbol{v}}}}}}}}\\ {{{{{{{\boldsymbol{w}}}}}}}}\end{array}\right)\) or, explicitly,

$${c}_{i}={w}_{i}/\mu ,\quad i\in {\mathsf{Enz}},\qquad \,{{\mbox{and}}}\,\qquad {c}_{{\mathsf{R}}}={v}_{{\mathsf{AF}}}/\mu .$$
(10)

(Recall that the ribosome is formed by the assembly factors).

Now, catalyst concentrations are used to formulate capacity constraints (for importer, metabolic enzymes, and assembly factors),

$${v}_{i}\le {k}_{i}^{{\mathsf{cat}}}\,{c}_{i},\quad i\in \{{\mathsf{IC}},{\mathsf{EAA}},{\mathsf{ENT}},{\mathsf{AF}}\}\subset {\mathsf{Enz}},$$
(11a)

where \({k}_{i}^{{\mathsf{cat}}}\) are the corresponding enzyme turnover numbers. The capacity constraints for the RNA polymerase, (optionally the RNase), and the ribosome are given by

$${n}_{{\mathsf{rRNA}}}\,{v}_{{\mathsf{RNAP}}}\le {\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}\,{c}_{{\mathsf{RNAP}}},\qquad ({n}_{{\mathsf{rRNA}}}\,{v}_{{\mathsf{RNase}}}\le {k}_{{\mathsf{RNase}}}^{{{\rm{deg}}}}\,{c}_{{\mathsf{RNase}}}),$$
(11b)

and

$$\mathop{\sum}\limits_{i\in {\mathsf{Proteins}}}{n}_{i}\,{w}_{i}\le {\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\,{c}_{{\mathsf{R}}},$$
(11c)

respectively. Here, nrRNA is the number of nucleotides in rRNA, and ni is the number of amino acids in protein i, cf. the stoichiometric coefficients in Fig. 1b. Further, \({\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}={k}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}{f}_{{\mathsf{RNAP}}}^{{\mathsf{act}}}\) and \({\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}={k}_{{\mathsf{R}}}^{{\mathsf{el}}}{f}_{{\mathsf{R}}}^{{\mathsf{act}}}\) are the effective transcription and translation elongation rate constants, respectively, and \({k}_{{\mathsf{RNase}}}^{{{\rm{deg}}}}\) is the RNA degradation rate constant.

Finally, catalyst concentrations are expressed by corresponding fluxes in all capacity constraints (11) via Eqns. (10). The stoichiometric, capacity, and (dry) mass constraints described so far are summarized in Table 1, and the parameter values are given in Supplementary Table 1.

In particular, after using (10), the ribosome capacity constraint (11c) takes the form

$$\mathop{\sum}\limits_{i\in {\mathsf{Proteins}}}\mu \,{n}_{i}\,{w}_{i}\le {\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\,{v}_{{\mathsf{AF}}},$$

which suggests the definition of ribosome allocations (ribosome fractions translating certain proteins),

$${\phi }_{i}^{{\mathsf{R}}}=\frac{\mu \,{n}_{i}\,{w}_{i}}{{\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\,{v}_{{\mathsf{AF}}}},\quad i\in {\mathsf{Proteins}}.$$

Obviously, \({\sum }_{i\in {\mathsf{Proteins}}}{\phi }_{i}^{{\mathsf{R}}}\le 1\). Instead of varying the protein synthesis fluxes w, one may vary vAF (the ribosome synthesis flux) and the (vector of) ribosome allocations ϕR.

Throughout this work, we consider a fixed ribosome mass, but variable ribosomal RNA and protein content,

$${\omega }_{{\mathsf{R}}}={n}_{{\mathsf{rRNA}}}\,{\omega }_{{\mathsf{NT}}}+{n}_{{{{{{{{\rm{rP}}}}}}}}}\,{\omega }_{{\mathsf{AA}}},$$

where nrRNA and nrP are the numbers of nucleotides and amino acids in rRNA and rP, respectively. We define the ribosomal protein (mass) fraction

$${x}_{{{{{{{{\rm{rP}}}}}}}}}={n}_{{{{{{{{\rm{rP}}}}}}}}}\,\frac{{\omega }_{{\mathsf{AA}}}}{{\omega }_{{\mathsf{R}}}},$$

and express nrRNA and nrP by xrP,

$${n}_{{\mathsf{rRNA}}}=(1-{x}_{{{{{{{{\rm{rP}}}}}}}}})\,\frac{{\omega }_{{\mathsf{R}}}}{{\omega }_{{\mathsf{NT}}}}\quad \,{{\mbox{and}}}\,\quad {n}_{{{{{{{{\rm{rP}}}}}}}}}={x}_{{{{{{{{\rm{rP}}}}}}}}}\,\frac{{\omega }_{{\mathsf{R}}}}{{\omega }_{{\mathsf{AA}}}}.$$

For variable ribosomal protein fraction xrP (from 0 to 100%), we maximize growth rate (by varying fluxes under the given constraints).

Converting constraints from equations/inequalities into matrix form

For convenience, we describe how to obtain the constraints in Table 1 from the equations/inequalities given in the subsection Model details above.

The capacity constraints for enzymes have the form

$${v}_{i}\le {k}_{i}^{{\mathsf{cat}}}\,{c}_{i},\quad i\in \{{\mathsf{IC}},{\mathsf{EAA}},{\mathsf{ENT}},{\mathsf{AF}}\}.$$

Concentrations ci can be expressed by synthesis fluxes wi via ci = wi/μ, and hence

$${v}_{i}\le {k}_{i}^{{\mathsf{cat}}}\,{w}_{i}/\mu .$$

Multiplying by μ and bringing all terms to one side, we get

$$-\mu {v}_{i}+{k}_{i}^{{\mathsf{cat}}}\,{w}_{i}\ge 0$$

or, in vector form,

$$\left(-\mu \,\,{k}_{i}^{{\mathsf{cat}}}\right)\left(\begin{array}{c}{v}_{i}\\ {w}_{i}\end{array}\right)\ge 0.$$

The row vector is the non-zero part of the row in Table 1 that specifies the capacity constraint cap i, for i {IC, EAA, ENT, AF}.

For the RNAP and RNase capacity constraints (cap RNAP and cap RNase, respectively), we further have \({k}_{i}^{{\mathsf{cat}}}={\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}/{n}_{{\mathsf{rRNA}}}\) and \({k}_{i}^{{\mathsf{cat}}}={k}_{{\mathsf{RNase}}}^{{{\rm{deg}}}}/{n}_{{\mathsf{rRNA}}}\). Hence,

$$\left(\begin{array}{rc}-\mu {n}_{{\mathsf{rRNA}}}&{\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}\end{array}\right)\left(\begin{array}{c}{v}_{{\mathsf{RNAP}}}\\ {w}_{{\mathsf{RNAP}}}\end{array}\right)\ge 0$$

and

$$\left(\begin{array}{rc}-\mu {n}_{{\mathsf{rRNA}}}&{k}_{{\mathsf{RNase}}}^{{{\rm{deg}}}}\end{array}\right)\left(\begin{array}{c}{v}_{{\mathsf{RNase}}}\\ {w}_{{\mathsf{RNase}}}\end{array}\right)\ge 0.$$

Finally, for the ribosome capacity constraint, \({\sum }_{i\in {\mathsf{Proteins}}}{n}_{i}\,{w}_{i}\le {\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\,{c}_{{\mathsf{R}}}\), ribosome concentration cR can be expressed by assembly flux vAF via cR = vAF/μ. Ultimately,

$${\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\,{v}_{{\mathsf{AF}}}-\mathop{\sum}\limits_{i\in {\mathsf{Proteins}}}\mu \,{n}_{i}\,{w}_{i}\ge 0.$$

After expanding the sum, this yields the row in Table 1 that specifies cap R.

Minimum degradation rate is enforced by

$${v}_{{\mathsf{RNase}}}\ge {k}^{{{\rm{deg}}}}(1-{x}_{{{{{{{{\rm{rP}}}}}}}}})\,{c}_{{\mathsf{R}}}.$$

As before, ribosome concentration is expressed by assembly flux,

$${v}_{{\mathsf{RNase}}}\ge {k}^{{{\rm{deg}}}}(1-{x}_{{{{{{{{\rm{rP}}}}}}}}})\,{v}_{{\mathsf{AF}}}/\mu ,$$

which can be written as in Table 1,

$$\left(\begin{array}{rc}\mu &-{k}^{{{\rm{deg}}}}(1-{x}_{{{{{{{{\rm{rP}}}}}}}}})\end{array}\right)\left(\begin{array}{c}{v}_{{\mathsf{RNase}}}\\ {v}_{{\mathsf{AF}}}\end{array}\right)\ge 0.$$

As stated in Box 2, growth rate is determined by the exchange fluxes. Indeed, in the dry mass constraint ωTc = 1, concentrations c can be expressed by fluxes vtot via Nvtot = μc and hence ωTNvtot = μ. Since internal reactions are mass-balanced, only exchange reactions contribute to growth,

$${{{{{{{{\boldsymbol{\omega }}}}}}}}}^{T}{{{{{{{{\boldsymbol{N}}}}}}}}}_{{\mathsf{exc}}}{{{{{{{{\boldsymbol{v}}}}}}}}}_{{\mathsf{exc}}}=\mu .$$

In our small model, the only exchange reaction is carbon import and hence

$${\omega }_{{\mathsf{C}}}{v}_{{\mathsf{IC}}}=\mu .$$

Symbolic analysis of growth rate maximization

In order to confirm our numerical results, we also perform a symbolic analysis of growth rate maximization.

The base model involves five stoichiometric constraints (for the species C, AA, NT, RNAP, rP), six capacity constraints (for the reactions catalyzed by IC, EAA, ENT, RNAP, AF, R), and one (dry) mass constraint, cf. Table 1 (without the columns and rows indicated in the caption). They define a linear equality and inequality system with 12 constraints (either ≥ or = ) for 11 fluxes and 1 right-hand side.

We apply the determinant method introduced in Box 2 to the resulting matrix \(B\in {{\mathbb{R}}}^{12\times 12}\), and we find

$$0= \det B \\ \sim \left(\left[\frac{{n}_{{\mathsf{IC}}}}{{k}_{{\mathsf{IC}}}^{{\mathsf{cat}}}}\frac{{\omega }_{{\mathsf{NT}}}}{{\omega }_{{\mathsf{G}}}}+\frac{{n}_{{\mathsf{EAA}}}}{{k}_{{\mathsf{EAA}}}^{{\mathsf{cat}}}}+\frac{{n}_{{\mathsf{ENT}}}}{{k}_{{\mathsf{ENT}}}^{{\mathsf{cat}}}}+\frac{{n}_{{\mathsf{RNAP}}}}{{\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}}\right]{n}_{{\mathsf{rRNA}}}+\frac{{n}_{{\mathsf{AF}}}}{{k}_{{\mathsf{AF}}}^{{\mathsf{cat}}}}\right){\mu }^{2}\\ +\left({\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\left[\frac{{n}_{{\mathsf{IC}}}}{{k}_{{\mathsf{IC}}}^{{\mathsf{cat}}}}\frac{{\omega }_{{\mathsf{AA}}}}{{\omega }_{{\mathsf{G}}}}+\frac{{n}_{{\mathsf{EAA}}}}{{k}_{{\mathsf{EAA}}}^{{\mathsf{cat}}}}\right]+{n}_{{{{{{\rm{rP}}}}}}}\right)\mu -{\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}.$$

Using ribosomal protein fraction and rescaling time,

$${n}_{{\mathsf{rRNA}}}=(1-{x}_{{{{{{{{\rm{rP}}}}}}}}})\frac{{\omega }_{{\mathsf{R}}}}{{\omega }_{{\mathsf{NT}}}},\quad {n}_{{{{{{{{\rm{rP}}}}}}}}}={x}_{{{{{{{{\rm{rP}}}}}}}}}\frac{{\omega }_{{\mathsf{R}}}}{{\omega }_{{\mathsf{AA}}}},\quad \,{{\mbox{and}}}\,\quad \hat{\mu }=\frac{\mu }{{\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}}\frac{{\omega }_{{\mathsf{R}}}}{{\omega }_{{\mathsf{AA}}}},$$

we obtain a quadratic equation for maximum growth rate,

$$0=\left(\alpha +\beta (1-{x}_{{{{{{{{\rm{rP}}}}}}}}})\right){\hat{\mu }}^{2}+\left(\gamma +{x}_{{{{{{{{\rm{rP}}}}}}}}}\right)\hat{\mu }-1$$
(12)

with

$$ \alpha ={\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\frac{{n}_{{\mathsf{AF}}}}{{k}_{{\mathsf{AF}}}^{{\mathsf{cat}}}}{\left(\frac{{\omega }_{{\mathsf{AA}}}}{{\omega }_{{\mathsf{R}}}}\right)}^{2}\\ \beta ={\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\left[\frac{{n}_{{\mathsf{IC}}}}{{k}_{{\mathsf{IC}}}^{{\mathsf{cat}}}}\frac{{\omega }_{{\mathsf{AA}}}}{{\omega }_{{\mathsf{G}}}}+\left(\frac{{n}_{{\mathsf{EAA}}}}{{k}_{{\mathsf{EAA}}}^{{\mathsf{cat}}}}+\frac{{n}_{{\mathsf{ENT}}}}{{k}_{{\mathsf{ENT}}}^{{\mathsf{cat}}}}+\frac{{n}_{{\mathsf{RNAP}}}}{{\bar{k}}_{{\mathsf{RNAP}}}^{{\mathsf{el}}}}\right)\frac{{\omega }_{{\mathsf{AA}}}}{{\omega }_{{\mathsf{NT}}}}\right]\frac{{\omega }_{{\mathsf{AA}}}}{{\omega }_{{\mathsf{R}}}}\\ \gamma ={\bar{k}}_{{\mathsf{R}}}^{{\mathsf{el}}}\left[\frac{{n}_{{\mathsf{IC}}}}{{k}_{{\mathsf{IC}}}^{{\mathsf{cat}}}}\frac{{\omega }_{{\mathsf{AA}}}}{{\omega }_{{\mathsf{G}}}}+\frac{{n}_{{\mathsf{EAA}}}}{{k}_{{\mathsf{EAA}}}^{{\mathsf{cat}}}}\right]\frac{{\omega }_{{\mathsf{AA}}}}{{\omega }_{{\mathsf{R}}}}.$$

For fixed xrP [0, 1], the quadratic equation (12) has one positive solution \(\hat{\mu }({x}_{{{{{{{{\rm{rP}}}}}}}}})\). To show that it is monotone in xrP, we differentiate (12) and set \({{{{{{{\rm{d}}}}}}}}\hat{\mu }/{{{{{{{\rm{d}}}}}}}}{x}_{{{{{{{{\rm{rP}}}}}}}}}=0\). We get

$$0=-\beta {\hat{\mu }}^{2}+\hat{\mu }$$

which has the positive solution \(\hat{\mu }=1/\beta\). Insertion into (12) yields

$$0=(\alpha +\beta ){\left(\frac{1}{\beta }\right)}^{2}+\gamma \frac{1}{\beta }-1=:\varepsilon ,$$

which does not depend on xrP. In fact, if ε = 0, then \(\hat{\mu }\) is constant. Otherwise, \(\hat{\mu }\) is strictly monotone in xrP (decreasing if ε > 0 and increasing if ε < 0).

For realistic parameters, \(\hat{\mu }\) is decreasing (and \(\hat{\mu }\, < \,1/\beta\)).

Approximation

For realistic parameters, αβ ≤ 1, and for all xrP [0, 1], we may set α = 0 in the quadratic equation (12): For xrP → 0, obviously α + (1 − xrP)β → α + β ≈ β. For xrP → 1, the crucial quantity \(4(\alpha +(1-{x}_{{{{{{{{\rm{rP}}}}}}}}})\beta )/{(\gamma +{x}_{{{{{{{{\rm{rP}}}}}}}}})}^{2}\to 4\alpha /{(1+\gamma )}^{2}\ll 1\), and the quadratic term can be neglected.

Numerical growth rate maximization

We fix growth rate and solve the system of equations (1) and (2) using efmtool 0.2.064 in Python 3.8.13. We use bisection search to find the highest growth rate that still enables a feasible solution.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.