Constrained proteome allocation affects coexistence in models of competitive microbial communities

Pacciani-Mori, Leonardo; Suweis, Samir; Maritan, Amos; Giometto, Andrea

doi:10.1038/s41396-020-00863-0

Download PDF

Article
Open access
Published: 11 January 2021

Constrained proteome allocation affects coexistence in models of competitive microbial communities

The ISME Journal volume 15, pages 1458–1477 (2021)Cite this article

3633 Accesses
6 Citations
19 Altmetric
Metrics details

Subjects

Abstract

Microbial communities are ubiquitous and play crucial roles in many natural processes. Despite their importance for the environment, industry and human health, there are still many aspects of microbial community dynamics that we do not understand quantitatively. Recent experiments have shown that the structure and composition of microbial communities are intertwined with the metabolism of the species that inhabit them, suggesting that properties at the intracellular level such as the allocation of cellular proteomic resources must be taken into account when describing microbial communities with a population dynamics approach. In this work, we reconsider one of the theoretical frameworks most commonly used to model population dynamics in competitive ecosystems, MacArthur’s consumer-resource model, in light of experimental evidence showing how proteome allocation affects microbial growth. This new framework allows us to describe community dynamics at an intermediate level of complexity between classical consumer-resource models and biochemical models of microbial metabolism, accounting for temporally-varying proteome allocation subject to constraints on growth and protein synthesis in the presence of multiple resources, while preserving analytical insight into the dynamics of the system. We first show with a simple experiment that proteome allocation needs to be accounted for to properly understand the dynamics of even the simplest microbial community, i.e. two bacterial strains competing for one common resource. Then, we study our consumer-proteome-resource model analytically and numerically to determine the conditions that allow multiple species to coexist in systems with arbitrary numbers of species and resources.

Pulsed, continuous or somewhere in between? Resource dynamics matter in the optimisation of microbial communities

Article Open access 24 January 2023

Polarization of microbial communities between competitive and cooperative metabolism

Article 04 January 2021

Resource competition predicts assembly of gut bacterial communities in vitro

Article 14 March 2024

Introduction

Microbes are among the most abundant life forms on Earth in terms of biomass [1]. They are found in almost every habitat of our planet, and continue to surprise us with their ability to survive in places that were thought to be inhospitable and barren. For example, microbial communities have been found in the deep terrestrial subsurface [2, 3], and it has been estimated that the first five kilometers beneath the Earth’s surface could be habitable for them [4]. Because of their ubiquity, microbial communities play fundamental roles in countless natural processes of vital importance, from the digestion and overall health of their host organism [5] to the regulation of bio-geochemical cycles [6, 7]. Despite their importance, however, we still know very little about the fundamental mechanisms that regulate microbial communities, partly because we are only able to grow in the lab a very small fraction of all the microbes found in nature [8], and partly because microbial communities are complex, non-linear systems [9] whose dynamics is difficult to predict. For these reasons, scientists from many disciplines have long been fascinated by the challenging theoretical questions posed by the study of microbial communities’ structure and dynamics, and serious efforts are being made to understand how competition [10,11,12] and metabolic interactions [13, 14] allow such systems to maintain the very high levels of biodiversity found in nature.

Recent experimental studies have shown that the structure and composition of microbial communities are tightly linked to the metabolism of the species that inhabit them [15, 16] (e.g., communities with different taxonomic compositions can nevertheless exhibit the same metabolic functional structure [17, 18]). We can therefore speculate that the ways with which microbes uptake and use different resources for growth and proliferation can affect the dynamics of an entire community. Resource uptake is constrained by the other functions that cells must perform to grow and proliferate, and the balance between such functions is governed by the allocation of the internal resources of the cell (e.g., the proteome, the set of proteins expressed by a cell) to different tasks. It is therefore important to understand how microbial community dynamics is influenced by the proteome allocation of its members, and new insights in this direction might help us make more powerful predictions of how microbial communities assemble and evolve [19, 20]. However, accounting for the dynamics of metabolism and gene expression of each species in a microbial community explicitly (e.g., via community flux balance analysis [21]) can be very challenging, and the large dimensionality of the mathematical models that attempt to do so poses limits to our understanding of the dynamics of microbial communities and of the fundamental properties that affect species coexistence.

Scott et al. [22] showed that, despite the complexity of bacterial metabolism, there are simple relationships that link the fraction of the proteome allocated for nutrient uptake and protein synthesis to the growth rate of bacteria grown in isolation, and that reducing these fractions by forcing cells to express a useless protein reduces their growth rate. Such relationships are very powerful because they describe how bacterial growth is influenced by proteome allocation and gene expression without requiring an explicit representation of the underlying molecular mechanisms. These relationships, which were also based on earlier observations by Schaechter et al. [23] on how the ribosomal component of the proteome of a microbial species scales with the growth rate, have recently been applied in many different contexts [24] and were instrumental in improving our knowledge of microbial metabolism, both experimentally [25] and computationally [26]. However, as the experiments by Scott et al. [22] were performed with single-species populations in exponential phase, it is still an open question if their approach can also be used to describe the population dynamics of different interacting microbial species competing for multiple resources.

In this work, we fill this gap by linking the results by Scott et al. [22] to one of the most widely adopted theoretical frameworks for modeling competitive ecosystems, MacArthur’s consumer-resource model [27,28,29], and use it to describe the dynamics of microbial species competing for one or more resources. MacArthur’s model describes how the population abundances of N_S species competing for a common pool of N_R resources change over time, and has been used in several recent studies [10,11,12, 30,31,32] to understand under which conditions multiple species can coexist while competing for few resources. These studies, however, did not account for the fact that proteome allocation constraints limit the rates at which microbes can uptake different resources, which, as shown here, affects the conditions that lead to the coexistence of multiple microbial species in competitive communities. We show that generalizing Scott et al.’s proteome-growth relationships and including them into a consumer-resource framework allows us to build a community dynamics model where all parameters can in principle be measured experimentally and have a precise biological interpretation. This “Consumer-Proteome-Resource” (CPR) model describes community dynamics at an intermediate level of complexity between classical consumer-resource models and biochemical models of microbial metabolism [21]. By adopting such an intermediate level of complexity and realism, we can take into account the dynamics of gene expression and microbial metabolism, while preserving analytical insights on the microbial community dynamics and identifying the key intracellular properties affecting species coexistence.

There have been attempts in the past at deriving models to describe the dynamics and/or structure of microbial communities by incorporating some insight into the metabolism of their species and the molecular aspects of their growth. One of the earliest and most notable efforts in this direction was performed by Droop [33], who developed a model that describes microalgal growth by taking into account intracellular quotas of the (single) supplied resource. In more recent times, the problem has been addressed by applying Flux Balance Analysis to genome-scale models in order to reveal how metabolic fluxes can influence community dynamics [34, 35]. This approach, however, leads to models that are extremely complicated and strongly dependent on the identity of the species in the community, since they require detailed knowledge of metabolic networks with hundreds of different reactions for every species, as well as the metabolic interactions among the members of the community. More recently, it has been shown that introducing some information on the metabolism of microbial species in models of community dynamics (without all the details that a Flux Balance Analysis model requires) can provide us with useful insights on the properties of the community [36, 37]. Our work sits conceptually in this latter context, but unlike what has already been done in this direction does not make assumptions on the metabolism of the species and relies on quantities (like the proteome fractions) that can be measured directly.

In the next section, we describe the CPR model for a general number of species/strains and resources. First, we review the proteome allocation framework of Scott et al. [22] and discuss how we generalize it to multiple resources. Second, we review the fundamental structure of consumer-resource models. Third, we construct our consumer-resource model which incorporates proteome allocation. Then, we consider the simplest implementation of an experimental microbial community, i.e. two Escherichia coli strains competing for glucose as the only carbon source, to illustrate that it is necessary to account for proteome allocation in consumer-resource models to describe the dynamics (and the conditions for coexistence) of even the simplest microbial community. The experiment described here constitutes a proof of the concept that one needs to account for proteome allocation dynamics when adopting consumer-resource theory to describe competitive microbial communities. Finally, we study (both analytically and numerically) the CPR model for communities composed of arbitrary numbers of species and resources to identify the conditions allowing the coexistence of multiple species in the community. A discussion section and some future perspectives conclude this work.

Results

Microbial proteome allocation

The phenomenological framework proposed by Scott et al. [22] prescribes that the proteome of a single microbial species growing on a single resource can be minimally divided into three sectors: one dedicated to nutrient uptake and metabolism (the “P-sector”), one dedicated to ribosomal proteins responsible for biomass production and growth (the “R-sector”), and a third one dedicated to housekeeping functions (the “Q-sector”), which was shown to be incompressible [22]. Naming φ^P, φ^R and φ^Q the proteome fractions corresponding to these sectors, we must have φ^P + φ^R + φ^Q = 1 (since all proteome fractions must sum to one), and Scott et al. have shown that φ^P and φ^R are linear functions of the species’ growth rate g, i.e:

$$\varphi ^P = \frac{\rho }{{\bar \kappa ^n\left( c \right)}}g,$$

(1a)

$$\varphi ^R = \frac{\rho }{{\kappa ^t}}g + \varphi ^0.$$

(1b)

Here ρ is a conversion factor (equal to the ratio between the total mass of the ribosomal proteins and the total RNA mass of the cells) and $\bar \kappa ^n\left( c \right) = \kappa ^n \cdot r\left( c \right)$, where r(c) = c/(K + c) is the Monod function which encapsulates the dependence on the resource concentration c. Most of our results do not actually depend on the exact functional form of r(c), as long as r(c) is a monotonically increasing function that saturates for large values of c (see Materials and Methods). K is the half-saturation constant of the resource and κⁿ is the “nutritional capacity” [22] of the (only) limiting resource. This parameter measures how much protein biomass is produced per unit ribosomal mass per unit time, and therefore depends on how much energy the resource contains and how efficiently the microbial species can metabolize it (see Supplementary Information and [22] for a molecular interpretation of κⁿ). The parameter κ^t is the “translational capacity” [22] of the microbial species, measuring how much protein biomass is produced per unit ribosomal mass per unit time; it is, therefore, a measure of how fast the microbial species expresses its genome to synthesize proteins. Finally, φ⁰ is the incompressible core of φ^R, representing the fact that ribosomal proteins are present in the cells also when microbes are not growing. All these parameters involve the ribosomal mass of the microbial species because the measurements by Scott et al. [22] were done by assaying the RNA/protein ratio in exponentially growing Escherichia coli.

Scott et al.’s results apply to microbes growing on a single resource. We generalize their framework to a system with multiple species and resources as shown in Fig. 1a: indicating with $\varphi _{\sigma i}^P$ the proteome fraction allocated by species σ to the uptake and metabolization of resource i, the total proteome fraction allocated by species σ to nutrient uptake and metabolism is given by $\varphi _\sigma ^P = \mathop {\sum}\nolimits_{i = 1}^{N_R} {\varphi _{\sigma i}^P}$. To ensure that the sum of all the proteome fractions is equal to one we must have:

$$\varphi _\sigma ^Q + \varphi _\sigma ^R + \mathop {\sum}\limits_{i = 1}^{N_R} {\varphi _{\sigma i}^P} = 1.$$

(2)

**Fig. 1: Assumptions of the CPR model.**

This constraint represents the finiteness of a species’ proteome, i.e. the fact that each species in a community has a limited proteomic budget that can be spent for all the necessary biological functions: for example, if more proteins need to be produced for metabolizing complex substrates (i.e., if the nutrient fraction $\varphi _\sigma ^P$ increases), then a smaller part of the proteome will be available for biomass production (i.e., the ribosomal fraction $\varphi _\sigma ^R$ decreases). In order to achieve optimal growth, microbial species must balance this trade-off [22].

Consumer-resource models

In Fig. 1b we show a schematic representation of the “classic” consumer-resource model. Within this framework, a community is a set of N_S species that can only uptake some (or all) of the N_R available resources. Species’ growth rates are determined by the types and the amount of resources they uptake, and are also regulated by a “maintenance cost”, representing the fact that species need to uptake a minimum amount of resources in order to survive. The resources, on the other hand, can be thought of as substrates that are supplied to the system with given (constant) rates s_i, and they are uptaken by species in the community. Overall, the model describes explicitly the dynamics of both species and resources through equations with the following structure:

$$\dot m_\sigma = m_\sigma \left( {g_\sigma - q_\sigma } \right) \qquad \quad \sigma = 1, \ldots ,N_S,$$

(3a)

$$\dot c_i = s_i - \mathop {\sum }\limits_{\sigma = 1}^{N_S} J_{\sigma i}m_\sigma \qquad \quad i = 1, \ldots ,N_R,$$

(3b)

where m_σ is the biomass density of species σ and g_σ is its growth rate. The parameter q_σ is a maintenance cost, due to the fact that each species requires a minimum amount of energy per unit time to survive without growing. Finally, c_i is the density of resource i, s_i is the (constant) resource supply rate, and J_σi is the rate at which species σ uptakes resource i per unit biomass. The ways in which species uptake the available substrates are encoded in J_σi with parameters that in the literature are called “metabolic strategies” or “resource preferences”. In particular, consumer-resource models are generally setup so that J_σi ∝ α_σi, with ${\vec{ \alpha}}_\sigma = \left( {\alpha _{\sigma 1}, \ldots ,\alpha _{\sigma N_R}} \right)$ the metabolic strategy (or resource preference) of species σ. Therefore, in the consumer-resource framework the interactions between species are indirect and mediated by the abundance of resources and the species’ resource preferences. Other types of direct inter-specific interactions (like cross-feeding through the exchange of metabolic byproducts), though undoubtedly important in natural microbial ecosystems, are not addressed in this work.

The consumer-proteome-resource model

Here, we incorporate proteome allocation constraints into consumer-resource models and show that proteome fractions allocated to the uptake of different resources must vary with time as resource concentrations vary. Figure 1c depicts schematically the assumptions underlying the CPR model. Each species σ uptakes resource i with a rate J_σi that is proportional to the proteome fraction $\varphi _{\sigma i}^P$. Then, resource i accounts for a growth term $g_\sigma ^{\left( i \right)}$ proportionally to the uptake rate J_σi. For our purposes, we assume that all resources in the system are substitutable, so that they can be used interchangeably and we can write the total growth rate g_σ of a given species as the sum of all the terms $g_\sigma ^{\left( i \right)}$. This assumption is consistent with previous works [38, 39] that considered the proteome allocation introduced by Scott et al. [22] in systems with two substitutable resources. Eventually, we obtain the following mathematical model (see Materials and Methods for the detailed derivation):

$$\dot m_\sigma = m_\sigma \left[ {\mathop {\sum}\limits_{i = 1}^{N_R} {\frac{{\kappa _i^n}}{{\rho _\sigma }}} r_i\left( {c_i} \right)\varphi _{\sigma i} - q_\sigma } \right],$$

(4a)

$$\dot c_i = s_i - \xi _ir_i\left( {c_i} \right)\mathop {\sum}\limits_{\sigma = 1}^{N_S} {m_\sigma } \varphi _{\sigma i},$$

(4b)

$$\mathop {\sum}\limits_{i = 1}^{N_R} {\varphi _{\sigma i}} \left[ {1 + \frac{{\kappa _i^n}}{{\kappa _\sigma ^t}}r_i\left( {c_i} \right)} \right] = {\Phi}_\sigma ,$$

(4c)

where we have written $\varphi _{\sigma i} = \varphi _{\sigma i}^P$ for simplicity. The parameter ξ_i can be interpreted as the maximum catalytic rate of the enzyme used to metabolize resource i, and Φ_σ is the total proteome fraction allocated by species σ for metabolism and biomass synthesis, which is fixed as shown by Scott et al. [22]. These equations have the traditional structure of a consumer-resource model given by Eqs. (3a) and (3b), but with the added merit of describing population dynamics using parameters and variables that have a precise biological meaning at the intracellular scale of the system and that can in principle be measured experimentally [22]. For a species growing on a single resource, the parameters that are most easily measured experimentally are the per-biomass resource uptake rate ξr(c)φ_σ and the yield (expressed as biomass per grams of resource), which in our framework is given by Y = κⁿ/ρξ (see Supplementary Information).

Notice that the metabolic strategies in our framework correspond to the proteome fractions φ_σi. If we interpreted the φ_σi as fixed parameters, the CPR model would be placed within the field of classic substitutable consumer-resource theory. However, we show below that the proteome fractions φ_σi are actually dynamical variables that vary according to the concentration of resources, and thus the CPR model constitutes a generalization of classic consumer-resource theory with substitutable resources, based on experimental evidence of microbial proteome allocation and growth. In the CPR model the proteome fractions are subject to the constraint encoded by Eq. (4c), which derives from the proteome finiteness given by Eq. (2). The expression of this constraint is significantly different from other ones that have been studied in the consumer-resource framework [40]. Posfai et al. [31], for example, considered a classic consumer-resource model with fixed metabolic strategies, and a metabolic constraint that in our notation would read $\mathop {\sum}\nolimits_{i = 1}^{N_R} {\varphi _{\sigma i}} = \Phi$, where the sum does not depend on the resource concentrations through r_i(c_i), and it is assumed that Φ_σ = Φ for all σ (i.e., the value of Φ_σ is exactly the same for all species). Such a model, however, cannot reproduce the fact that microbial species vary their metabolic strategies with time according to the concentration of resources, and the constraint $\mathop {\sum}\nolimits_{i = 1}^{N_R} {\varphi _{\sigma i}} = \Phi$ does not account for the fact that, as a species invests more resources in nutrient uptake and metabolization (the φ_σi) to achieve a higher growth rate, such an investment must be balanced by an increased investment in ribosomal proteins (the $\varphi _\sigma ^R$), both of which are constrained by the finiteness of the proteome.

The proteome finiteness constraint, as encoded by Eq. (4c), yields one important consequence that has important repercussions on the properties the CPR model. In particular, it implies that the proteome fractions φ_σi cannot be fixed parameters, but must change as the resources’ concentrations c_i change, and therefore they must be dynamical variables. This can be easily seen by considering a system with only one resource, for which Eq. (4c) reads

$$\varphi _\sigma \left[ {1 + \frac{{\kappa ^n}}{{\kappa _\sigma ^t}}r\left( c \right)} \right] = {{\Phi}}_\sigma ,$$

(5)

and thus the φ_σ must change as functions of the resource concentration:

$$\varphi _\sigma = \frac{{{{\Phi }}_\sigma }}{{1 + \frac{{\kappa ^n}}{{\kappa _\sigma ^t}}r\left( c \right)}}.$$

(6)

In particular, φ_σ must decrease as the resource concentration c increases (recall that r(c) is a monotonically increasing function). This occurs because if, for example, the available resource becomes scarce, cells will need to produce more catabolic proteins to meet their energy requirements. In the presence of multiple resources, the proteome finiteness constraint of Eq. (4c) implies that if the concentration of one resource c_j decreases, then either φ_σj or some of the φ_σi with i ≠ j must increase to satisfy the constraint, since Φ_σ is constant. Thus, it is necessary to introduce some form of dynamics on the proteome fractions that each species allocates for nutrient uptake and metabolization. This observation should not come as a surprise, given that microbes are known to adapt their proteome allocation and metabolic strategies according to which resources are available. Our approach is to require that all φ_σi evolve dynamically with a characteristic timescale to maximize the instantaneous growth rate of species σ in an adaptive process, while ensuring that the proteome finiteness constraint is satisfied at all times. The model equations and the mathematical details are discussed in the Materials and Methods.

Experimental example of the influence of proteome allocation on population dynamics

Traditional consumer-resource models do not account explicitly for proteome allocation to different tasks and assume that metabolic strategies are fixed with time. Here, we show experimentally that it is necessary to take into account proteome allocation within consumer-resource models to reproduce the dynamics of even the simplest competitive community, i.e. two species competing for one common resource. We competed experimentally two strains of E. coli grown in a liquid minimal medium with glucose as the sole carbon source, transferring a fraction of the community to fresh medium daily and measuring the relative abundance of the two strains at each transfer (see Materials and Methods). The two strains had the same genetic background and expressed constitutively from their genome two different fluorescent proteins, which allowed us to measure their relative abundance via flow cytometry. We introduced in strain σ = 1 a plasmid containing a Red Fluorescent Protein (RFP) whose expression could be controlled by adding to the medium Isopropyl β-D-1-thiogalactopyranoside (IPTG, a molecular mimic of allolactose that cannot be metabolized by E. coli). Thus, by varying the concentration of IPTG in the medium we could vary the proteome allocation of strain 1 by forcing it to produce a useless protein. We performed competition experiments at different concentrations of IPTG, measured the fluorescent protein production rates at these concentrations, and computed the selective advantage of strain 1 over strain 2, a measure for the difference in reproductive fitness between the two strains defined as:

$$S: = \frac{d}{{dt}}{\mathrm{ln}}\frac{f}{{1 - f}},$$

(7)

where f is the relative abundance (or frequency) of strain 1, i.e. f = m₁/(m₁ + m₂). The experiment is sketched in Fig. 2.

**Fig. 2: Schematic representation of the experiment.**

Figure 3a (magenta data points) shows that the selective advantage S decreased linearly with the production rate of the IPTG-inducible RFP of strain 1 over a broad range or RFP production rates (the mean cell’s fluorescence measured after 8 h at 105 μM IPTG is 22 times higher than at 0 μM IPTG, Fig. 3d), which are proportional to φ_iRFP. In the absence of IPTG and at low concentrations of it, strain 1 outcompeted strain 2 (S > 0). At an IPTG concentration of ~30 μM, the two strains coexisted by maintaining a stable relative fraction for the duration of the experiment. At IPTG concentrations larger than 30 μM, strain 1 was outcompeted by strain 2 (i.e., S < 0). This experiment illustrates that, in the presence of the same concentration of a single resource, manipulating the proteome allocation of one of the two strains results in different outcomes for their competition dynamics. Consumer-resource theory, which neglects proteome allocation dynamics, would not be able to predict competition dynamics in these settings.

Figure 3 also shows the results of a second experiment performed with two different strains (cyan data points). These strains had different fluorescent protein combinations with respect to strains 1 and 2 (see Materials and Methods and Fig. S.1): strain 3 expressed constitutively a red fluorescent protein (mKate2Hyb) and carried a plasmid with an IPTG-inducible yellow fluorescent protein (Venus YFP), while strain 4 expressed constitutively the yellow fluorescent protein mVenus (see Materials and Methods). Also in these independent sets of experiments, the selective advantage decreased linearly as the protein production rate was increased over a broad range (the mean cell’s fluorescence measured after 8 h at 105 μM IPTG was 16 times higher than at 0 μM IPTG, Fig. 3e). In this case, strain 3 always outcompeted strain 4, even at high concentrations of IPTG. This may be explained by the fact that the two proteins expressed by strains 1 and 3 have a different fitness cost (see Supplementary Information for more details).

It is natural to ask whether the CPR model can reproduce the results of our experiment. Applying the CPR framework to such a simple community, using assumptions consistent with our experimental settings (e.g., the fact that the strains are grown in medium-rich conditions, and that they share the same genetic background), leads to the prediction that the selective advantage S of strain 1 over strain 2 is given by (see Materials and Methods):

$$S = \frac{d}{{dt}}{\mathrm{ln}}\frac{f}{{1 - f}} \propto {\Phi}_1 - {\Phi}_2.$$

(8)

The same result could be obtained by assuming that the findings of Scott et al. [22] on how the exponential growth rate in isolation depends on proteome allocation can be applied to our experiment, in which cells were grown in co-culture dilution experiments and were not always in exponential phase. According to Eq. (8), the ratio between the relative abundances of the two strains decreases or grows exponentially with time, depending on the sign of Φ₁ − Φ₂, which then sets the outcome of competition: for example, if Φ₂ > Φ₁ (i.e., strain 2 allocates a larger fraction of its proteome to metabolism and biomass production than strain 1) then S < 0 and strain 2 outcompetes strain 1. Coexistence between the two strains is possible uniquely when Φ₁ = Φ₂ and thus S = 0. The system, therefore, exhibits two regimes where only one of the two strains survives (competitive exclusion), separated by the coexistence point Φ₁ = Φ₂. Equation (8) thus connects a well known concept of population genetics, the selective advantage in exponentially growing populations, with the differential proteome allocation Φ₁ − Φ₂ between microbial strains.

In our experiment, we forced strain 1 to produce a useless RFP at different rates depending on the IPTG concentration. Indicating with φ_iRFP the fraction of proteome allocated by strain 1 to the synthesis of the IPTG-inducible RFP (proportional to the fluorescent protein production rate), the proteome fraction allocated for nutrient uptake and growth is given by ${\Phi}_1 = {\Phi}_1^{\left( 0 \right)} - \varphi _{iRFP}$ (with ${\Phi}_1 = {\Phi}_1^{\left( 0 \right)}$ in the absence of IPTG). Thus, the selective advantage S is predicted to decay linearly with φ_iRFP as S = α − β · φ_iRFP with α and β positive constants (see the Materials and Methods section for all details and the explicit expression of S in this case). This prediction is thus consistent with the experimental observation of a linear decrease of S with the fluorescent protein production rate.

Coexistence of multiple species in the consumer-proteome-resource model

We now analyze the CPR model in the general case of multiple species and multiple resources both analytically and numerically, to provide some insights into the conditions required for the coexistence of all species in the community. Specifically, we look for stationary solutions where all species have non-null biomass densities. Doing so yields two necessary conditions for the coexistence of all species (see the Materials and Methods for all detailed expressions and computations). The first condition, which holds when there are more species than resources in the system (N_S > N_R), is that the maintenance cost q_σ of species σ must be proportional to the total proteome fraction allocated for metabolism and growth, i.e. q_σ ∝ Φ_σ, with a species-dependent proportionality constant. This requirement is biologically reasonable, since allocating a larger fraction of the proteome to such functions requires additional energy to synthesize the necessary proteins. The condition is also required for all species to coexist if there are fewer species than resources (N_S ≤ N_R) and all proteome fractions at stationarity $\varphi _{\sigma i}^ \ast$ are larger than zero. If, instead, there are fewer species than resources (N_S ≤ N_R) and some proteome fractions at stationarity are equal to zero, it is possible to find particular solutions for which all species coexist, without requiring q_σ ∝ Φ_σ. This happens, for example, when N_S ≤ N_R and the vectors ${\vec{\varphi}}_\sigma ^ \ast = \left( {\varphi _{\sigma 1}^ \ast , \ldots ,\varphi _{\sigma N_R}^ \ast } \right)$ are non-overlapping (i.e., ${\vec{\varphi}}_\sigma ^ \ast \cdot {\vec{\varphi}}_\rho ^ \ast = 0$ for σ ≠ ρ), which means that each species uses resources that are not used by other species. Further details can be found in the Materials and Methods.

The second condition, which holds in all the scenarios discussed in the previous paragraph, can be interpreted as follows using a graphical representation introduced by Posfai et al. [31] (see Materials and Methods for all the mathematical details). A system with N_R resources can be represented on an (N_R − 1)–dimensional simplex, where each vertex corresponds to one of the available resources; considering for example the case N_R = 3, the system can be represented on a triangle (i.e., a bi-dimensional simplex) as shown in Fig. 4. On this simplex one can draw the vectors $\vec {\hat s}$ and $\vec {\hat \varphi } _\sigma ^ \ast$, whose components are appropriately rescaled versions of (respectively) the resource supply rates s_i and the stationary proteome fractions $\varphi _{\sigma i}^ \ast$ (see Materials and Methods). The second condition for species coexistence prescribes, therefore, that $\vec {\hat s}$ must belong to the convex hull of the vectors $\vec {\hat \varphi } _\sigma ^ \ast$, as shown in Fig. 4.

**Fig. 4: Graphical representation of the second condition necessary for coexistence.**

Notice that, differently from similar results of earlier investigations of consumer-resource models [31], this condition involves the stationary proteome fractions $\varphi _{\sigma i}^ \ast$, and thus the community has the opportunity to coexist even if the rescaled resource supply rate vector is not within the convex hull of the proteome fractions at the start of the temporal evolution.

Because the CPR model is highly non-linear, it is impossible to predict a priori the values of the stationary fractions $\varphi _{\sigma i}^ \ast$ once all the other parameters are set. However, it is possible to understand how the various parameters affect the dynamics of the system by exploring different regions of the parameter space. The dynamics of the system, in fact, will depend on how the proteome fractions φ_σi evolve, and therefore the dynamics of the system will inevitably be influenced by some of the model parameters. In this sense the relevant parameters are the ratios $\gamma _{\sigma i} = \kappa _i^n{\mathrm{/}}\kappa _\sigma ^t$ between the nutritional and translational capacities, and the characteristic timescales τ_σ of the adaptive process that maximizes the growth rate g_σ in the dynamics of φ_σi (see the Materials and Methods for details). The timescales τ_σ measure how fast the dynamics of the proteome fractions φ_σi vary: the smaller τ_σ is, the faster species σ can switch between different resources. Biologically speaking, this parameter can be thought of as a measure of how fast the regulatory mechanisms of a microbial species can respond to changes in the availability of resources.

The first regime that we explored is $\tau _\sigma \gg 1$ and γ_σi ~ 0. In this regime, the adaptive process that regulates the dynamics of the proteome fractions φ_σi is very slow (i.e., species respond very slowly to changes in resource abundance) and the nutritional capacity is much smaller than the translational capacity, which happens for example when species are grown in very low-quality nutrients. In this case, the model predicts that the stationary values $\hat \varphi _{\sigma i}^ \ast$ of the rescaled proteome fractions allocated by the species to nutrient uptake and metabolization change negligibly, and therefore all species survive only if the rescaled nutrient supply rate vector $\vec {\hat s}$ lies in the convex hull of the rescaled initial proteome fractions $\vec {\hat \varphi } _\sigma$, as shown in Fig. 5.

**Fig. 5: Temporal evolution of the CPR model when $\tau _\sigma \gg 1$ and γ_σi ~ 0.1.**

The second regime we explored is $\tau _\sigma \gg 1$ and $\gamma _{\sigma i} {\gtrsim} 1$. In this case, the dynamics of $\hat \varphi _{\sigma i}$ allows the proteome fractions to move inside the simplex. Therefore, the system can reach stationary states where all species coexist even if $\vec {\hat s}$ is not necessarily close to the convex hull of the initial $\vec {\hat \varphi } _{\sigma}$. On the other hand, we observed that if $\vec {\hat s}$ is too far away from the convex hull of the initial $\vec {\hat \varphi } _{\sigma}$ there might still be extinctions. However, if $\vec {\hat s}$ lies at an intermediate distance between these two cases, the system can reach diverse stationary states only if the resource supply rates s_i are sufficiently large. For example, multiplying each resource supply rate by a factor x > 1, i.e. s_i → xs_i (this rescaling leaves $\hat s_i$ unchanged, see Materials and Methods), we observe a transition between two different states of the system for increasing values of x: when x ~ 1, only a few species survive, whereas for larger values of x the stationary biomass densities $m_\sigma ^ \ast$ of the other species increase until all of them coexist. Figure 6 shows an example of such transition. This phenomenon occurs only when $\vec {\hat s}$ lies in specific areas of the simplex, whose shape and position can be determined numerically, but depend on the particular values of the model parameters used. In this same regime, if γ_σi assume increasingly large values (which happens for example, if the species are grown in nutrients with increasingly higher quality) coexistence will be possible even if $\vec {\hat s}$ lies at increasingly large distances from the convex hull of the initial $\vec {\hat \varphi } _{\sigma}$.

**Fig. 6: Species coexistence as a function of the rescaled resource supply rate $x{\vec{s}}$ (with x > 1).**

Finally, the last regime we explored is $\tau_\sigma \lesssim 1$, i.e. the adaptive process maximizing species’ growth rates is fast. In this case, the smaller the timescales τ_σ are, the faster the proteome fractions φ_σi will reach their stationary values, and coexistence will always be possible independently of the initial values of the proteome fractions φ_σi and of the resource supply rates s_i. However, as the τ_σ grow, fewer and fewer species will be able to coexist. This can be seen by multiplying τ_σ by a factor y > 1: Fig. 7 shows how the species’ stationary biomasses change as y increases, and we can see that as species adaptation becomes slower (i.e., for larger y), fewer and fewer species survive in the community.

**Fig. 7: Coexistence in the CPR model is affected by the values of the adaptation timescales τ_σ.**

The results of this section can be summed up as follows. If metabolic adaptation is slow, i.e. if the characteristic relaxation times τ_σ of the proteome fractions ${\vec{\varphi}}_\sigma$ are large (or in other words, if the species shift slowly between different resources), coexistence will be favored if the system contains high-quality nutrients (i.e., the γ_σi have larger values). If the system contains low-quality nutrients, coexistence will be possible only if the resources are supplied in particular ratios that depend on the species’ proteome allocation. In particular, coexistence will be possible if the rescaled nutrient supply rate vector $\vec {\hat s}$ lies inside the convex hull of the rescaled proteome fractions $\vec {\hat \varphi } _\sigma$. On the other hand, fast metabolic adaptation (i.e., small values of τ_σ) always favor coexistence.

Discussion

Motivated by our experiment that shows how varying proteome allocation can have strong effects on the dynamics of even a very simple microbial community, we have formulated a consumer-resource model that generalizes and incorporates the phenomenological laws discovered by Scott et al. [22]. In this way, we have bridged microbial growth with proteome allocation constraints in competitive communities, and we have investigated the conditions that lead to species coexistence in the presence of multiple resources.

This CPR model describes the population dynamics of a purely competitive microbial community, i.e. an ensemble of species that compete directly for the same pool of resources. The main contribution of this work is introducing a physiological, experimentally-validated constraint on the amount of resources that cells can devote to growth and metabolism in consumer-resource models (i.e., Eq. (15c)). The introduction of this constraint makes it necessary to introduce some dynamics on the proteome fractions allocated for nutrient uptake and metabolization, and we have done so using an adaptive approach that assumes that microbial species are evolutionary well adapted to their environment. This work differs (both in scope and approach) from previous ones that involve adaptation on some species’ internal variables [41], and in particular differs from previous works involving the consumer-resource framework [31, 40] that considered phenomenological constraints that were not based on direct experimental measurements, nor on an interpretation of such constraints as arising from the finiteness of the proteome. Introducing the right constraint in such models is particularly important, because the exact conditions that allow species coexistence depend on the specific form of the constraint (see Materials and Methods). A further discussion on the differences between the CPR model and previous ones can be found in the Supplementary Information.

We have then shown that the CPR model predicts that high levels of biodiversity can be achieved only if two conditions apply. The first condition is that the maintenance cost must be proportional to the total proteome fraction allocated by the species to metabolism and growth, i.e. q_σ ∝ Φ_σ. The second condition can be interpreted graphically as described in the Results section, and summarized as follows: (i) if the timescales τ_σ over which the species shift between different resources are large (i.e., $\tau _\sigma \gg 1$) and if the quality of the resources is low, coexistence will be possible only if the resource supply rates have particular values (i.e., the rescaled nutrient supply rate vector $\vec {\hat s}$ belongs to the convex hull of $\vec {\hat \varphi } _\sigma$); (ii) if again $\tau _\sigma \gg 1$, but the resources are of higher quality, coexistence is possible (in some cases the magnitude of the resource supply rates must be large enough), and if the resources’ quality is higher, coexistence is favored; and (iii) coexistence is favored for smaller values of the timescales τ_σ. From the biological point of view, these points can be interpreted as follows: (i) if the species switch slowly between different resources and the quality of the resources is low, coexistence will be possible only if the resources are supplied with particular ratios (which depend on the proteome allocation of all the species); (ii) if again the species switch slowly between different resources, coexistence will be favored if the resources have higher quality; (iii) fast metabolic adaptation (i.e., the species can switch quickly between different resources) favors coexistence. Our approach, therefore, makes it possible to quantify precisely in what ways the internal cellular dynamics make coexistence possible in a broad range of environmental contexts.

The dynamics of microbial communities has traditionally been studied at the ecological level by using models of population dynamics describing how the population abundances of different species in the community change over time as the result of competition for resources. While this approach is undoubtedly useful and effective, it often cannot describe the system at a level of detail necessary to make predictions from measurable quantities. In fact, it is becoming increasingly clear that the structure and dynamics of microbial communities are affected by the metabolic activity of the species that comprise them [15,16,17,18]. As shown here, mathematical models of community dynamics that take explicitly into account how different species allocate their proteome to regulate nutrient uptake can provide new insights into the link between the ecological properties of microbial communities, i.e. population dynamics and species coexistence, and their intracellular ones, i.e. metabolism and gene expression [20].

Direct competition for resources is only one of the many known interactions that can take place between microbial species: exchange of metabolic byproducts [14], production of toxins [13] and environmental conditioning [42] are only a few of the ways in which we know microbes interact within a community. Each of these processes provide both growth benefits and proteomic costs to microbial species, and can in principle be included in our framework by appropriately taking into account how they affect proteome allocation and species fitness. With our framework it would therefore be possible to make quantitative predictions involving such phenomena, and testing them against experimental data.

Materials and methods

The consumer-proteome-resource equations

The derivation of the CPR models equations starts from Eqs. (3a) and (3b). To write these equations explicitly, we introduce the following assumptions: (i) the uptake rate J_σi is proportional to the proteome fraction $\varphi _{\sigma i} = \varphi _{\sigma i}^P$ allocated by species σ for the uptake and metabolization of resource i and (ii) each resource contributes to the growth of species σ through a term $g_\sigma ^{\left( i \right)}$ proportional to the uptake rate J_σi, so that the total growth rate g_σ of species σ can be written as the sum of all the terms $g_\sigma ^{\left( i \right)}$. Specifically, we rewrite Eq. (1a) as:

$$\varphi _{\sigma i}^P = \frac{{\rho _\sigma }}{{\bar \kappa _i^n\left( {c_i} \right)}}g_\sigma ^{\left( i \right)},$$

(9)

where ρ is considered to be species-dependent, $\bar \kappa _i^n\left( {c_i} \right) = \kappa _i^n \cdot r_i\left( {c_i} \right)$ (with $r_i\left( {c_i} \right) = c_i/\left( {K_i + c_i} \right)$), and $g_\sigma ^{\left( i \right)}$ is the contribution to the growth rate of species σ due to the uptake of resource i, i.e.:

$$g_\sigma = \mathop {\sum }\limits_{i = 1}^{N_R} g_\sigma ^{\left( i \right)},$$

(10)

and we generalize Eqs. (1a) and (1b) to:

$$g_\sigma = \mathop {\sum }\limits_{i = 1}^{N_R} \frac{{\bar \kappa _i^n\left( {c_i} \right)}}{{\rho _\sigma }}\varphi _{\sigma i}^P,$$

(11a)

$$\varphi _\sigma ^R = \frac{{\rho _\sigma }}{{\kappa _\sigma ^t}}g_\sigma + \varphi _\sigma ^0.$$

(11b)

Equation (10) implies that the N_R resources are substitutable (e.g., different carbon sources), otherwise, their contribution to the growth rate may satisfy a different equation (e.g., their contributions may be multiplicative rather than additive). We can use Eq. (11a) to write Eq. (11b) in terms of the fractions $\varphi _{\sigma i}$. By doing so we get that the normalization condition given by Eq. (2) reads:

$$\mathop {\sum}\limits_{i = 1}^{N_R} {\varphi _{\sigma i}} \left[ {1 + \frac{{\bar \kappa _i^n\left( {c_i} \right)}}{{\kappa _\sigma ^t}}} \right] = 1 - \varphi _\sigma ^Q - \varphi _\sigma ^0: = {\Phi}_\sigma ,$$

(12)

where we have written φ_σi instead of $\varphi _{\sigma i}^P$ for simplicity and Φ_σ is the total proteome fraction that species σ allocates to metabolism and biomass synthesis.

We generalize the results of Scott et al. to the case of multiple resources by assuming that the uptake rate J_σi of resource i per unit biomass is proportional to φ_σi, i.e.:

$$J_{\sigma i} = \xi _ir_i\left( {c_i} \right)\varphi _{\sigma i},$$

(13)

where the proportionality constant ξ_i can be interpreted biologically as the maximum catalytic rate of the enzyme used to metabolize resource i (see Supplementary Information). By comparing Eqs. (13) and (9) we can see that the contribution to the growth rate of species σ due to the uptake of resource i is proportional to its uptake rate, i.e. $g_\sigma ^{\left( i \right)} = \chi _{\sigma i}J_{\sigma i}$ with

$$\chi _{\sigma i}\xi _i = \frac{{\kappa _i^n}}{{\rho _\sigma }}.$$

(14)

With the considerations above, we obtain the final equations of the CPR model:

$$\dot m_\sigma = m_\sigma \left[ {\mathop {\sum}\limits_{i = 1}^{N_R} {\eta _{\sigma i}r} _i\left( {c_i} \right)\varphi _{\sigma i} - q_\sigma } \right],$$

(15a)

$$\dot c_i = s_i - \xi _ir_i\left( {c_i} \right)\mathop {\sum}\limits_{\sigma = 1}^{N_S} {m_\sigma \varphi _{\sigma i}} ,$$

(15b)

$$\mathop {\sum}\limits_{i = 1}^{N_R} {\varphi _{\sigma i}} \left[ {1 + \gamma _{\sigma i}r_i\left( {c_i} \right)} \right] = {\Phi}_\sigma ,$$

(15c)

where we have written explicitly $\bar \kappa _i^n\left( {c_i} \right) = \kappa _i^nr_i\left( {c_i} \right)$ with $r_i\left( {c_i} \right) = c_i/\left( {K_i + c_i} \right)$, and we have defined $\eta _{\sigma i}: = \kappa _i^n/\rho _\sigma$ and $\gamma _{\sigma i}: = \kappa _i^n/\kappa _\sigma ^t$ to simplify the notation. Regardless of the particular form of r(c) chosen, for our purposes we only need to assume that r(c) is a monotonically increasing function of c, and that ${\mathrm{lim}}_{c \to 0}r\left( c \right)/c = 1/K$ and ${\mathrm{lim}}_{c \to \infty }r\left( c \right) = 1$.

The constraint in Eq. (15c) is the explicit expression of Eq. (2) in our framework, and can be interpreted geometrically: considering species σ, the N_R-dimensional vector ${\vec{\varphi}}_\sigma = \left( {\varphi _{\sigma 1}, \ldots ,\varphi _{\sigma N_R}} \right)$ belongs to a hyperplane whose normal vector $\hat n_\sigma$ has components $1 + \gamma _{\sigma i}r_i\left( {c_i} \right)$. This means that as the system evolves, the components of $\hat n_\sigma$ vary with time and therefore the hyperplane to which ${\vec{\varphi}}_\sigma$ belongs moves in the N_R-dimensional space. This is also the reason why the proteome fractions φ_σi must be dynamical variables: the coefficients 1 + γ_σir_i(c_i) in Eq. (15c) are not fixed, but change with time depending on the system’s dynamics through r_i(c_i). This implies that for the constraint to be satisfied at all times, the proteome fractions φ_σi cannot be fixed but must be, in turn, dynamical variables: an increase (decrease) of 1 + γ_σir_i(c_i) must be balanced by a decrease (increase) of some of the φ_σi. This constraint reflects the well known fact that microbes can vary their enzyme synthesis with time and switch between nutrients according to environmental conditions [40, 43,44,45].

Dynamics of the proteome fractions φ _σi

We call ${\vec{c}} = \left( {c_1, \ldots ,c_{N_R}} \right)$ the vector of resource concentrations and define

$$F_\sigma \left( {{\vec{\varphi}}_\sigma ,{\vec{c}}} \right): = \mathop {\sum}\limits_{i = 1}^{N_R} {\varphi _{\sigma i}} \left[ {1 + \gamma _{\sigma i}r_i\left( {c_i} \right)} \right] - {\Phi}_\sigma$$

(16)

so that the constraint given by Eq. (15c) can be written more simply as $F_\sigma \left( {{\vec{\varphi}}_\sigma ,{\vec{c}}} \right) = 0$. Since this constraint must hold at every instant, any equation for ${\vec{\varphi}}_\sigma$ must satisfy

$$\dot F_\sigma \left( {\vec{\varphi}}_\sigma ,{\vec{c}} \right) \equiv \dot {\vec{\varphi}}_\sigma \cdot {\vec{\nabla}}_\varphi F_\sigma + \dot {\vec{c}} \cdot {\vec{\nabla}}_cF_\sigma = 0,$$

(17)

where ${\vec{\nabla}}_\varphi$ and ${\vec{\nabla}}_c$ are, respectively, the gradients taken with respect to the components of ${\vec{\varphi}}_\sigma$ and ${\vec{c}}$. The “minimal” equation for φ_σi, i.e. the simplest one (in the sense that it does not introduce extra terms orthogonal to ${\vec{\nabla}}_\varphi F_\sigma$, which would lead to a proliferation of new parameters) that satisfies Eq. (17) is therefore:

$$\dot {\vec{\varphi}}_\sigma = - \frac{{\vec{\nabla} _\varphi F_\sigma }}{{\left( {\vec{\nabla}_\varphi F_\sigma } \right)^2}}\vec{c} \cdot \vec{\nabla}_cF_\sigma ,$$

(18)

where, however, we are not taking into account the fact that with such an equation some of the φ_σi might become negative with time (see Supplementary Information for detailed computations on how this can be taken into account).

Microbes are able to switch between nutrients when cultured in mediums containing more than one resource [43]. For this reason, we can implement an adaptive approach [40] and ask that ${\vec{\varphi}}_\sigma$ evolves in time so that the growth rate g_σ of species σ is maximized respecting the constraint $F_\sigma \left( {{\vec{\varphi}}_\sigma ,\vec{c}} \right) = 0$, i.e. Equation (15c) is satisfied. In this case the evolution equation for ${\vec{\varphi}}_\sigma$ becomes:

$$\dot {\vec{\varphi}}_\sigma = \frac{1}{{\tau _\sigma }}\vec{\nabla}_\varphi g_\sigma - \frac{{\vec{\nabla}_\varphi F_\sigma }}{{\left( {\vec{\nabla}_\varphi F_\sigma } \right)^2}}\left( {\frac{1}{{\tau _\sigma }}\vec{\nabla} _\varphi g_\sigma \cdot \vec{\nabla}_\varphi F_\sigma + \dot{\vec{c}} \cdot \vec{\nabla}_cF_\sigma } \right),$$

(19)

where we have introduced τ_σ, the characteristic timescale over which ${\vec{\varphi}}_\sigma$ changes [40] (detailed computations are shown). We can recover Eq. (18) from Eq. (19) by sending τ_σ to infinity. Geometrically, Eq. (18) represents the case in which ${\vec{\varphi}}_\sigma$ is dragged along by the hyperplane to which it belongs, as the hyperplane moves because of Eq. (15c). On the other hand, according to Eq. (19) (with small enough values of τ_σ) the ${\vec{\varphi}}_\sigma$ are free to move on the hyperplane to find the maximum instantaneous growth rate compatible with the constraint given by Eq. (15c).

In this work we have used a generalization of Eq. (19) that ensures $\varphi _{\sigma i}\left( t \right) \ge 0\forall t$, and varied the values of τ_σ when needed (see Supplementary Information for details).

The introduction of this dynamics on the proteome fractions φ_σi in consumer-resource models allows our model to reproduce phenomena that classic consumer-resource theory cannot describe, like diauxic shifts (see Fig. S.2).

Conditions for coexistence

Evaluating Eqs. (15a)–(15c) at stationarity we obtain:

$$\mathop {\sum}\limits_{i = 1}^{N_R} {\eta _{\sigma i}r_i^ \ast \varphi _{\sigma i}^ \ast } = q_\sigma ,$$

(20a)

$$s_i = \xi _ir_i^ \ast \mathop {\sum}\limits_{\sigma = 1}^{N_S} {m_\sigma ^ \ast \varphi _{\sigma i}^ \ast } ,$$

(20b)

$$\mathop {\sum}\limits_{i = 1}^{N_R} {\varphi _{\sigma i}^ \ast } \left( {1 + \gamma _{\sigma i}r_i^ \ast } \right) = {\Phi}_\sigma ,$$

(20c)

where we are denoting with the symbol “*” the quantities computed at stationarity, and we have assumed m_σ ≠ 0. If we now assume $\varphi _{\sigma i}^ \ast \,\ne\, 0$ for all i and all species, it is easily seen by substitution that a possible solution for $r_i^ \ast$ in Eqs. (20a) and (20c) is

$$r_i^ \ast = \left[ {\kappa _i^n\left( {\frac{{{\Phi}_\sigma }}{{\rho _\sigma q_\sigma }} - \frac{1}{{\kappa _\sigma ^t}}} \right)} \right]^{ - 1}.$$

(21)

Under our assumption (i.e., $\varphi _{\sigma i}^ \ast \,\ne\, 0$ for all i, for all species), and if N_S > N_R (i.e., the number of species is larger than the number of resources) this solution is acceptable only if its right-hand side is independent of σ, i.e. if

$$\frac{{{\Phi}_\sigma }}{{\rho _\sigma q_\sigma }} - \frac{1}{{\kappa _\sigma ^t}} = \Theta ,$$

(22)

with Θ some given constant independent of σ. Using Eqs. (21) and (22) in Eqs. (20c) or (20a) we get

$$\mathop {\sum}\limits_{i = 1}^{N_R} {\varphi _{\sigma i}^ \ast } = \frac{{{\Phi}_\sigma }}{{1 + \frac{1}{{\Theta \kappa _\sigma ^t}}}}.$$

(23)

From Eq. (21) we have:

$$r_i^ \ast = \frac{1}{{\kappa _i^n\Theta }} \quad \Rightarrow \quad c_i^ \ast = \frac{{K_i}}{{\kappa _i^n\Theta - 1}},$$

(24)

and since we need $r_i^ \ast \,<\, 1$ (or equivalently $c_i^ \ast \,> \, 0$), we need $\Theta \,> \, {\mathrm{max}}_i1/\kappa _i^n$. Therefore, Eq. (22) can be rewritten as

$$q_\sigma = \frac{{{\Phi}_\sigma }}{{\rho _\sigma \left( {\Theta + 1/\kappa _\sigma ^t} \right)}},$$

(25)

which is the explicit expression of the relationship between q_σ and Φ_σ. Equation (23) is a consequence of the system’s constraint in Eq. (20c), which is Eq. (15c) computed at stationarity. Therefore, the expression of the maintenance cost given in Eq. (25) is a consequence of the constraint introduced in the CPR model.

Notice, again, that this holds under the assumption that $\varphi _{\sigma i}^{\ast} \,\ne\, 0$ for all i and σ, and N_S > N_R. If we remove these assumptions, then it is possible to find solutions with N_S ≤ N_R where Eq. (22) does not hold. For example, if the species’ stationary proteome fractions ${\vec{\varphi}}_\sigma ^ \ast$ are non-overlapping (i.e., ${\vec{\varphi}}_\sigma ^ \ast \cdot {\vec{\varphi}}_\rho ^ \ast = 0$ when σ ≠ ρ), then $r_i^ \ast$ as given in Eq. (21) can be a valid solution without requiring Eq. (22). Consider as an example the particular case N_S = N_R = 3 and $\varphi _{\sigma i}^ \ast \propto \delta _{\sigma i}$ (where δ is Kronecker’s delta), i.e. a system with three species where each one uptakes only one resource, and no two species uptake the same resource. It is easy to imagine that the three species should be able to coexist, since their niches (defined in this context as the set of resources used for sustenance) do not overlap. This is indeed the case, given that a solution for $r_i^ \ast$ in Eqs. (20a) and (20c) is given by:

$$r_i^ \ast = \left[ {\kappa _i^n\left( {\frac{{{\Phi}_i}}{{\rho _iq_i}} - \frac{1}{{\kappa _i^t}}} \right)} \right]^{ - 1},$$

(26)

where we have identified each species index σ with the only resource i it consumes, and we don’t need to require Eq. (22) to hold for this solution to be feasible. This will be of course true even for systems where the species and/or resource labels are permutated (e.g., species 1 uptakes resource 2, species 2 uptakes resource 3 and species 3 uptakes resource 1, instead of species 1 uptaking resource 1, species 2 uptaking resource 2, and species 3 uptaking resource 3). This will be true even when N_R > N_S, as long as the vectors ${\vec{\varphi}}_\sigma ^ \ast$ are still non-overlapping and the inverse of r_i is written as the product of $\kappa _i^n$ and ${\Phi}_\sigma /\left( {\rho _\sigma q_\sigma } \right) - 1/\kappa _\sigma ^t$ where σ is the (only) species uptaking that resource. If one of the resources, e.g. resource j, is not uptaken by any species one has $\dot c_j = s_j$, i.e. c_j will grow linearly indefinitely. On the other hand, if N_S > N_R then Eq. (22) is necessary in order to have feasible solutions, even if we remove the assumption that $\varphi _{\sigma i}^{\ast} \,=\, 0$ for all species and resources.

Going back to Eqs. (20a)–(20c), if we now define:

$$\hat s_i = \frac{{s_i\kappa _i^n{\mathrm{/}}\xi _i}}{{\mathop {\sum}\nolimits_{j = 1}^{N_R} {s_j\kappa _j^n} {\mathrm{/}}\xi _j}}$$

(27a)

$$\hat \varphi _{\sigma i}^ \ast = \frac{{\varphi _{\sigma i}^ \ast }}{{\mathop {\sum}\nolimits_{j = 1}^{N_R} {\varphi _{\sigma j}^ \ast } }},$$

(27b)

$$z_\sigma = \frac{{m_\sigma ^ \ast \rho _\sigma q_\sigma }}{{\mathop {\sum}\nolimits_\lambda {m_\lambda ^ \ast } \rho _\lambda q_\lambda }},$$

(27c)

(so that z_σ are positive coefficients that sum to one), and Eq. (20b) can be rewritten as

$$\hat s_i = \mathop {\sum}\limits_{\sigma = 1}^{N_S} {z_\sigma \hat \varphi _{\sigma i}}$$

(28)

(see Supplementary Information for the detailed computations). Since $\mathop {\sum}\nolimits_i {\hat s_i} = \mathop {\sum}\nolimits_i {\hat \varphi _{\sigma i}^ \ast } = 1$, the vectors $\vec {\hat s}$ and $\vec {\hat \varphi } _\sigma ^ \ast$ belong to an (N_R − 1)–dimensional simplex. Furthermore, since z_σ are positive coefficients that sum to one, Eq. (28) means that $\vec {\hat s}$ belongs to the convex hull of the vectors $\vec {\hat \varphi } _\sigma ^ \ast$. Since Eq. (28) derives from requiring that all species have non-null stationary biomasses, we can see how this is the other condition necessary for coexistence.

At first glance, the result in Eq. (28) looks similar to what has been observed in consumer-resource model with metabolic trade-offs by Posfai et al. [31]. However, our result has an important difference with respect to that model: Eq. (28) depends in fact on the (rescaled) value of φ_σi at stationarity. In the CPR model, therefore, the proteome fractions φ_σi vary over time to satisfy Eq. (28), i.e. to include $\vec {\hat s}$ in the convex hull of the vectors $\vec {\hat \varphi } _\sigma ^ \ast$, unlike in Posfai et al. [31] where metabolic strategies (which in our framework correspond to the φ_σi) are fixed and thus coexistence is only possible if $\vec {\hat s}$ is within the convex hull of the φ_σi from the very start.

If we now suppose that $\tau _\sigma \gg 1$, so that we can use Eq. (18) for the dynamics of φ_σi, observing that the i-th component of the gradients $\vec{\nabla}_\varphi F_\sigma$ and $\vec{\nabla}_cF_\sigma$ are

$$\left( {\vec{\nabla}_\varphi F_\sigma } \right)_i \equiv \frac{{\partial F_\sigma }}{{\partial \varphi _{\sigma i}}} = 1 + \gamma _{\sigma i}r_i\left( {c_i} \right)$$

(29a)

and

$$\left( {\vec{\nabla}_cF_\sigma } \right)_i \equiv \frac{{\partial F_\sigma }}{{\partial c_i}} \propto \gamma _{\sigma i},$$

(29b)

we find that if γ_σi ~ 0 then $\dot {\vec{\varphi}}_\sigma \sim 0$ and therefore $\varphi _{\sigma i}^ \ast \sim \varphi _{\sigma i}\left( {t = 0} \right)$. In other words, if the γ_σi are small, the proteome fractions φ_σi at stationarity will be close to their initial values. Therefore in this case, with good approximation, Eq. (28) gives the condition for all species to coexist, i.e. $\vec {\hat s}$ must be inside the convex hull of $\hat \varphi _{\sigma i} = \varphi _{\sigma i}\left( 0 \right)/\mathop {\sum}\nolimits_j {\varphi _{\sigma j}} \left( 0 \right)$. If $\gamma_{\sigma i}\gtrsim1$ as discussed in the Results section, on the other hand, coexistence will be possible if the components of $\dot {\vec{\varphi}}_\sigma$ are not too small for a sufficiently long period of time so as to allow them to reach values satisfying Eq. (28) and thus for the species to coexist. This can be obtained by using large supply rates s_i so that r_i(c_i) ~ 1 for a sufficiently long time, as discussed in the Results. Finally, if the ratios γ_σi have larger values the proteome fractions φ_σi will be able to move more quickly.

Strains used in the experiment

The Escherichia coli strains used in our experiment have the same genetic background MG1655. The strains used in the experiments were constructed starting from the ancestor strain 0Y (expressing constitutively the yellow fluorescent protein mVenus from the genome, with genotype attTN7::pRNA1_mVenus) or the ancestor strain 0R (expressing constitutively the red fluorescent protein mKate2Hyb from the genome, with genotype attTN7::pRpsL_mKate2Hyb).

Strain 1 was obtained by transforming strain 0Y with the plasmid pR (see Table S.1), which contains the ampicillin resistance cassette, the red fluorescent protein mCherry under the control of the trc promoter, a hybrid of the trp and lac promoters, and the lac repressor, lacI. The expression of mCherry could thus be induced by adding IPTG, which binds to the repressor encoded by lacI allowing the expression of genes promoted by the trc promoter (here, mCherry). Because IPTG cannot be metabolized by E. coli, its concentration remains constant during our experiment and is unaltered by bacterial growth.

Strain 2 was obtained by transforming strain 0R with the plasmid pAMP (see Table S.1), which was obtained by removing the inducible red fluorescent protein mCherry from plasmid pR using traditional cloning.

Strain 3 was obtained by transforming strain 0R with plasmid pY (see Table S.1), which is identical to plasmid pR, except for the fluorescent protein induced by the trc promoter, which is Venus YFP instead of mCherry.

Strain 4 was obtained transforming strain 0Y with plasmid pAMP.

Because all strains had the ampicillin resistance cassette in the plasmids used to transform them, we performed the experiments by adding ampicillin to the medium to prevent contamination and plasmid loss.

Figures S.3–S.14 show the results of fitness assays performed with all the strains used in our experiments and the ancestor strains.

Experimental protocol

The competition assays were performed as follows:

1.
The strains were cultured overnight from a stock culture in M63 medium with 1% w/v glucose, and ampicillin. Then, the strains were mixed to perform competition assays aiming for 50:50 relative frequencies.
2.
The mixtures were inoculated in a 96-well plate containing M63 medium with 1% w/v glucose and ampicillin at eight different IPTG concentrations: 0, 15, 30, 45, 60, 75, 90, 105 μM (six technical replicates per concentration).
3.
The well plate was covered with a porous rayon film that allowed gas exchange and was cultured for 24 h at 30 °C on a microplate shaker set at 1050 rpm.
4.
After 24 h, the plate was reinoculated in a new 96-well plate with fresh medium (with the appropriate concentrations of IPTG in each well) with a dilution factor of 100. The new plate was cultured for another cycle at 30 °C for 24 h with constant shaking at 1050 rpm, while the old one was diluted with a dilution factor of 2000 to be analyzed at the flow cytometer.

IPTG calibration and computation of the normalized protein production rate

We measured how the fluorescence intensity of individual cells, a proxy for the total amount of fluorescent protein produced, varied as a function of the IPTG concentration. To do so, we inoculated strains 1 and 3 in a 96-well plate containing M63 minimal medium with ampicillin, 1% w/v glucose and the same IPTG concentrations used in our experimental protocol (six technical replicates per concentration, per strain). The plate was incubated at 30 °C for 8 h with constant shaking at 1050 rpm. At times t = 4 h and t = 8 h after inoculation we measured at the flow cytometer the mean fluorescence intensity of cells due to the induced fluorescent proteins at the various concentrations of IPTG (Fig. 3d, e). From these data, we estimated the normalized fluorescent protein production rate as follows.

We call k(C_I) the rate at which the fluorescence of the inducible protein increases when cells are exposed to a concentration C_I of IPTG, and we call d_FP the fluorescent protein degradation rate. The fluorescent intensity I of a cell (due to the production of the IPTG-inducible fluorescent protein) in between two successive cell divisions thus satisfies $dI/dt = k\left( {C_I} \right) - d_{FP}I$. At a cell division event, the fluorescent intensity of a cell is reduced by a factor 2. Indicating with I₀ the cell’s fluorescent intensity at the first measurement time (t = 4 h), it can be shown (see Supplementary Information) that according to this model the cell’s fluorescent intensity changes with time as:

$$I\left( t \right) = 2I_0e^{ - gt\left( {1 + \frac{{d_{FP}}}{g}} \right)} + \frac{{k\left( {C_I} \right)}}{{d_{FP}}}\left[ {1 - e^{ - gt\left( {1 + \frac{{d_{FP}}}{g}} \right)}} \right]\\ \left( {1 - \frac{1}{{2^{1 + \frac{{d_{FP}}}{g}} - 1}}} \right),$$

(30)

where g is the cell’s growth rate. Fluorescent proteins have small degradation rates compared to the cellular growth rate, so assuming $d_{FP} \ll g$ we can approximate Eq. (30) as:

$$I\left( t \right) = 2I_0e^{ - gt} + 2{\mathrm{ln}}\left( 2 \right)\frac{{k\left( {C_I} \right)}}{g}\left( {1 - e^{ - gt}} \right).$$

(31)

We used Eq. (31) and the data in Fig. 3d, e to compute the quantity k(C_I). Because the absolute value of k(C_I) depends on the arbitrary units returned by the flow cytometer (the intensity I is measured as a cell’s pulse area at the flow cytometer), we normalized the values of k(C_I) dividing them by the mean fluorescent intensity 〈I〉 of cells measured in the absence of IPTG at the first measurement in the calibration experiment (see Fig. 3d, e). Such a normalization affects only the absolute value of such rates, and not their relative magnitude. This also means that the normalized production rates shown for the two experiments in Fig. 3 cannot be compared directly.

The normalized k(C_I)/〈I〉 are the protein production rates of strains 1 and 3 (with dimensions 1/time) reported in Fig. 3.

The growth curves and the growth rates of strains 1 and 3 for the different IPTG concentrations used in our experiments are shown in Figs. S.15–S.18.

Estimation of the selection coefficient S

To first approximation, we can use the results of Scott et al. [22] on the dependence of the exponential growth rate of E. coli strains grown in isolation in rich medium (which in our notation corresponds to r(c) = 1) to estimate the outcome of our competition experiment, and in particular to estimate the dependence of the selection coefficient on the Φ_σ. From Eq. (11b) the growth rate of species σ is given by:

$$g_\sigma = \frac{{\kappa _\sigma ^t}}{{\rho _\sigma }}\left( {\varphi _\sigma ^R - \varphi _\sigma ^0} \right).$$

(32)

Using also Eq. (11a) with N_R = 1 and the definition of Φ_σ from Eq. (12), we can rewrite this as:

$$g_\sigma = \frac{{\kappa _\sigma ^t}}{{\rho _\sigma }}\left( {{\Phi}_\sigma - \frac{{\rho _\sigma }}{{\kappa ^n}}g_\sigma } \right),$$

(33)

which is easily rearranged into:

$$g_\sigma = \frac{{\kappa _\sigma ^t\kappa ^n}}{{\rho _\sigma \left( {\kappa _\sigma ^t + \kappa ^n} \right)}}{\Phi}_\sigma$$

(34)

(see also Eq. (S23) in [22, Online Supporting Material]). Therefore, if we assume $\kappa _1^t = \kappa _2^t$ and ρ₁ = ρ₂ (which can happen, for example, if the two populations are different strains of the same microbial species with similar genetic backgrounds) and $m_\sigma \left( t \right) = m_\sigma \left( 0 \right){\mathrm{exp}}\left( {g_\sigma t} \right)$ (which is a good approximation for populations growing in batch cultures with nutrient-rich medium), the selection coefficient is given by:

$$S = \frac{d}{{dt}}{\mathrm{ln}}\frac{f}{{1 - f}} = \frac{{\dot m_1}}{{m_1}} - \frac{{\dot m_2}}{{m_2}} = \frac{{\kappa ^t\kappa ^n}}{{\rho \left( {\kappa ^t + \kappa ^n} \right)}}\left( {{\Phi}_1 - {\Phi}_2} \right),$$

(35)

where f = m₁/(m₁ + m₂) and 1 − f = m₂/(m₁+ m₂) are the relative abundances (or “frequencies”) of strain 1 and strain 2, respectively. The time series of the values of ln(f/1 − f) for the two experiments are shown in Figs. S.19 and S.20.

If we now apply the CPR model, i.e. Eqs. (15a)–(15c), to the case of two populations and one resource, we obtain:

$$\dot m_\sigma = m_\sigma \left( {\eta _\sigma r\left( c \right)\varphi _\sigma - q_\sigma } \right) \qquad \quad \sigma = 1,2,$$

(36a)

$$\dot c = s - \xi r\left( c \right)\left( {m_1\varphi _1 + m_2\varphi _2} \right),$$

(36b)

$$\varphi _\sigma \left( {1 + \gamma _\sigma r\left( c \right)} \right) = {\Phi}_\sigma \qquad \quad \sigma = 1,2,$$

(36c)

where $\eta _\sigma = \kappa ^n{\mathrm{/}}\rho _\sigma$, $\gamma _\sigma = \kappa ^n{\mathrm{/}}\kappa _\sigma ^t$, and now Eq. (36c) gives the explicit expression of the (only) proteome fraction φ_σ as a function of the resource concentration. Because the ancestors of our two strains (i.e., strains 0Y and 0R) have the same genetic background (see, for example, Figs. S.3 and S.4), we set η₁ = η₂ = η, q₁ = q₂ = q and γ₁ = γ₂ = γ in Eqs. (36a)–(36c). Notice that, instead, Φ₁ ≠ Φ₂ because the proteome allocation of strain 1 could be varied experimentally and because the plasmids introduced in the ancestor strains have different maintenance costs. Furthermore, note that assuming η₁ = η₂ is equivalent to assuming ρ₁ = ρ₂, and on the other hand γ₁ = γ₂ is equivalent to $\kappa _1^t = \kappa _2^t$. Given that cells in the experiment are grown in nutrient-rich conditions, we assume that the maintenance cost is negligible, i.e. q ≃ 0. Furthermore, because most of the dynamics (i.e., the relative change in abundance of the two strains) occurs in the early phases of growth when glucose is abundant, we assume that r(c) ≈ 1 at all times so that we can neglect Eq. (36b) and we are left with:

$$\dot m_\sigma = m_\sigma \frac{\eta }{{1 + \gamma }}{\Phi}_\sigma \sigma = 1,2.$$

(37)

Notice again that this expression, and in particular the fact that the growth rate of species σ is proportional to Φ_σ, is a consequence of the constraint in Eq. (36c). We, therefore, have that the expression of the selective advantage S in this case is:

$$S = \frac{d}{{dt}}{\mathrm{ln}}\frac{f}{{1 - f}} = \frac{{\dot m_1}}{{m_1}} - \frac{{\dot m_2}}{{m_2}} = \frac{\eta }{{1 + \gamma }}\left( {{\Phi}_1 - {\Phi}_2} \right).$$

(38)

From the definitions of η = κⁿ/ρ and γ = κⁿ/κ^t it is immediate to see that the coefficient in Eq. (38) is the same as the one in Eq. (35).

With our framework, however, we can show that this result continues to be true even when we remove the assumption that r(c) = 1 at all times. In our experiment, for example, it was not true that glucose was always abundant throughout the experiment, since the density of the cells saturated well before the following re-inoculation in fresh medium was made (i.e., 24 h). In fact, the typical growth rate of the strains, estimated from growth curves measured in the same experimental conditions used for the competition assays, is 0.3 1/h. The competition assays started from a cellular density of ~8 · 10⁶ cells/mL, thus if growth was exponential the density after 24 h would have been be ~1.4 · 10¹⁰, which is much higher than the typical density (~10⁹ cells/mL) that E. coli cells reach at saturation. With a growth rate of 0.3 1/h, the time needed to reach a cellular density that is hundredfold the initial one (and therefore the time needed to reach saturation after a re-inoculation) is ~15.4 h.

A model better suited to describe the population dynamics of the two strains in our experiment would be as follows. The temporal dynamics of biomass and glucose concentration between two consecutive dilutions satisfies:

$$\dot m_\sigma = m_\sigma \eta _\sigma r\left( c \right)\varphi _\sigma \qquad \quad \sigma = 1,2$$

(39a)

$$\dot c = - r\left( c \right)\left( {m_1\eta _1\varphi _1 + m_2\eta _2\varphi _2} \right),$$

(39b)

where c(t) is the concentration of glucose at time t and r(c) = c/(c + K) is Monod’s function. This model is somewhat similar to a classic consumer-resource model, with the difference that there is no mortality term in Eq. (39a): between two consecutive dilutions, the biomass m_σ of strain σ will grow as long as there is glucose available, and because r(0) = 0 the strains will stop growing (i.e. they will enter the stationary phase) once glucose runs out.

We now make the following approximation: we assume that, after every reinoculation, glucose is initially abundant (i.e. r ~ 1) and that the transition of r(c) from 1 to 0 as c decreases is abrupt, which happens if K is sufficiently small. In other words, we assume that K is sufficiently small so that r(c) ≈ 1 until a given time T (the instant at which glucose is completely depleted), when r(c) abruptly goes to zero (i.e., r(c(t)) ~ H(T − t) with H the Heaviside’s step function). This means that after a reinoculation m_σ will grow exponentially for a time interval of length T, after which it will stop until the next dilution. If we still set η₁ = η₂ = η and γ₁ = γ₂ = γ and call D the dilution factor between reinoculations, we have that the biomass $m_\sigma ^{\left( N \right)}$ of strain σ at the N-th dilution is ($m_\sigma ^{\left( 0 \right)}$ being the biomass at the initial inoculation):

$$m_\sigma ^{\left( 1 \right)} = m_\sigma ^{\left( 0 \right)}{\mathrm{exp}}\left( {\frac{\eta }{{1 + \gamma }}{\Phi}_\sigma T} \right),$$

(40a)

$$m_\sigma ^{\left( 2 \right)} = \frac{{m_\sigma ^{\left( 1 \right)}}}{D}{\mathrm{exp}}\left( {\frac{\eta }{{1 + \gamma }}{\Phi}_\sigma T} \right) = \frac{{m_\sigma ^{\left( 0 \right)}}}{D}{\mathrm{exp}}\left( {2\frac{\eta }{{1 + \gamma }}{\Phi}_\sigma T} \right),$$

(40b)

$$m_\sigma ^{\left( 3 \right)} = \frac{{m_\sigma ^{\left( 2 \right)}}}{D}{\mathrm{exp}}\left( {\frac{\eta }{{1 + \gamma }}{\Phi}_\sigma T} \right) = \frac{{m_\sigma ^{\left( 0 \right)}}}{D}{\mathrm{exp}}\left( {3\frac{\eta }{{1 + \gamma }}{\Phi}_\sigma T} \right),$$

(40c)

$$\hskip 110pt\vdots$$

$$m_\sigma ^{\left( N \right)} = \frac{{m_\sigma ^{\left( 0 \right)}}}{{D^{N - 1}}}{\mathrm{exp}}\left( {N\frac{\eta }{{1 + \gamma }}{\Phi}_\sigma T} \right).$$

(40d)

Therefore, if we call f^(N) the relative abundance of strain 1 at the N-th dilution, we have:

$${\mathrm{ln}}\frac{{f^{\left( N \right)}}}{{\left( {1 - f} \right)^{\left( N \right)}}} = {\mathrm{ln}}\frac{{m_1^{\left( 0 \right)}}}{{m_2^{\left( 0 \right)}}} + NT\frac{\eta }{{1 + \gamma }}\left( {{\Phi}_1 - {\Phi}_2} \right),$$

(41)

which gives the same expression for the selection coefficients after deriving with respect to the time NT.

Comments on the experimental selection coefficient S

Figure 3a shows that strain 1 has a fitness advantage over strain 2 in the absence of IPTG, since S > 0 at low protein production rates, even though the only significant difference between the two strains is that strain 1 carries an extra copy of lacI and the inducible fluorescent protein mCherry (see “Strains used in the experiment”); in our theoretical framework, such an advantage implies that ${\Phi}_1^{\left( 0 \right)} - {\Phi}_2 \,> \, 0$. This may be explained by the observation that expressing lacI is beneficial for E. coli strains growing on glucose because it represses expression of the lac operon. Stoebel et al. [46] have in fact found that cells with the genomic copy of lacI show some residual lacA activity when grown in glucose, and estimated the cost of expressing lacA as 1.85% per generation [46], which may be alleviated in the presence of an extra copy of lacI. See also Supplementary Information for a more detailed discussion. Using our data, it is possible to estimate the ratio Φ₁/Φ₂ at different protein production rates (Fig. 3c). This ratio is approximately ${\Phi}_1^{\left( 0 \right)}/{\Phi}_2 \approx 1.02$ for low protein production rates and then decays linearly up to Φ₁/Φ₁ ≈ 0.98.

In the first set of experiments (magenta points in Fig. 3), and to a lesser degree in the second set of experiments (cyan points), the data points at the lowest production rate (i.e., at 0 μM IPTG) appear to deviate from the linear trend, and so the fits in Fig. 3a–c were calculated by excluding those data points (including them in the fit doesn’t affect the results, see Fig. S.21). The flow cytometry data suggest that the average fluorescent intensity of strain 1 from the induced RFP decreased over the course of the experiment at 0 μM IPTG, which may partly explain the deviation of the first magenta point in Fig. 3a from the linear trend via a reduction in protein production rate throughout the experiment at 0 μM IPTG. Another factor that may cause deviations from a linear trend is an increased gene-expression heterogeneity between cells in the absence of IPTG, a well-known property of the lac operon whose constituent parts we have used in our genetic constructs [47], which might confer heterogeneous growth rates to different cells in the population. Note that the normalized protein production rates of the two sets of experiments (magenta and cyan data points) are not directly comparable.

Evaluation of the ratios Φ₁/Φ₂ and Φ₃/Φ₄

Consider the competition assay with strains 1 and 2 (the results are the same also for the competition assay between strains 3 and 4, after all subscripts are appropriately changed). For a given IPTG concentration C_I, from Eq. (37) the growth rate of strain 1 is $g_1\left[ {k\left( {C_I} \right)} \right] = {\Phi}_1\left[ {k\left( {C_I} \right)} \right] \cdot \eta r\left( c \right)/\left( {1 + \gamma r\left( c \right)} \right)$ (where we have inserted explicitly the dependence on r(c), and k(C_I) is the protein production rate induced by C_I). On the other hand, the expression of the selective advantage for general values of r(c) is:

$$S = \frac{{\eta \cdot r\left( c \right)}}{{1 + \gamma \cdot r\left( c \right)}}\left( {{\Phi}_1 - {\Phi}_2} \right)$$

(42)

(in fact, if we only remove the assumption that r(c) ≈ 1, from Eq. (36a) we have $\dot m_\sigma /m_\sigma = \eta _\sigma r\left( c \right)\varphi _\sigma$ with σ = 1, 2 and the definition of S leads to this equation). Dividing S in Eq. (42) by g₁, for any value of r(c) we obtain:

$$\frac{{S\left[ {k\left( {C_I} \right)} \right]}}{{g_1\left[ {k\left( {C_I} \right)} \right]}} = \frac{{{\Phi}_1\left[ {k\left( {C_I} \right)} \right] - {\Phi}_2\left[ {k\left( {C_I} \right)} \right]}}{{{\Phi}_1\left[ {k\left( {C_I} \right)} \right]}},$$

(43)

which is easily rearranged into:

$$\frac{{\Phi}_1}{{{\Phi}_2}}\left[ {k\left( {C_I} \right)} \right] = \frac{1}{{1 - S\left[ {k\left( {C_I} \right)} \right]/g_1\left[ {k\left( {C_I} \right)} \right]}}$$

(44)

(which are the values plotted in Fig. 3c). Notice again that this result does not depend on the assumption that r(c) = 1 at all times, i.e. Eq. (44) is valid for any value of r(c).

Proteome fraction allocated to the inducible RFP

Because we did not measure RNA/protein ratios in our experiments, we can only estimate the values of κ^t and κⁿ by taking them from the literature for E. coli strains grown at 30 °C in conditions similar to our experiments. Rosset et al. [48, 49] measured the RNA/protein ratio of several E. coli strains grown at 30 °C in M63 medium. Using their data and the relationship [22] r = r₀ + g/κ^t, where r is the RNA/protein ratio and r₀ a constant, we can estimate the translational capacity as κ^t = 3.0 ± 0.5 μg protein/μg RNA · 1/h (mean ± SD). An estimate for the nutritional capacity κⁿ, instead, can be obtained via the equation [22] $g = g_{{\mathrm{max}}}\kappa ^n/\left( {\kappa ^n + \kappa ^t} \right)$, where g_max is the maximum growth rate obtainable by our strain at a given temperature (for us, 30 °C), when nutrients are abundant. Van Derlinden and Van Impe [50] report a maximum growth rate g_max ≈ 1.2 1/h for E. coli MG1655 grown at 30 °C in rich medium with glucose (no error estimate was reported). Solving for κⁿ and using the growth rate value g measured for strain 1 in the absence of IPTG, we find κⁿ = 1.2 ± 0.2 μg protein/μg RNA · 1/h. These values allow us to estimate $\gamma = \kappa ^n/\kappa ^t = 0.4 \pm 0.1$ and $\eta = \kappa ^n/\rho = 1.57 \pm 0.07$ 1/h using the value for ρ = 0.76 μg protein/μg RNA · 1/h reported in Scott et al. [22]. With these estimations, from the expression of the selective advantage in Eq. (42) we have that a 1% difference in proteome allocation for metabolism and growth between the two strains (i.e., Φ₁ − Φ₂ = 1%) leads to S ≈ 1.1 · 10⁻². Finally, with these calculations we can estimate the maximum percentage of proteome maxφ_iRFP and maxφ_iYFP allocated at full expression to the production of, respectively, the inducible red and yellow proteins in our two experiments. In particular, for the first experiment we have ${\mathrm{max}}\varphi _{iRFP} = {\Phi}_1^{\left( 0 \right)} - {\Phi}_2 - \left( {1 + \gamma } \right)/\eta \cdot S_{105} = \left( {1 + \gamma } \right)/\eta \cdot \left( {S_0 - S_{105}} \right) \approx 1.1\%$ (where S₀ and S₁₀₅ are, respectively, the mean selection coefficients in the 0 μM and 105 μM IPTG treatments). For the experiment involving strains 3 and 4, using the same procedure we find maxφ_iYFP ≈ 0.4%. Of course, given that we had to rely on measurements taken from the literature, these should be regarded as only rough estimates.

Data availability

The raw flow cytometry data studied in this work and the software used to analyze it are all available at the following GitHub repository: https://github.com/LeonardoPaccianiMori/CPR-model-experiment-data-analysis.

References

Bar-On YM, Phillips R, Milo R. The biomass distribution on earth. Proc Natl Acad Sci USA. 2018;115:6506–11.
Article CAS PubMed Google Scholar
Colman DR, Poudel S, Stamps BW, Boyd ES, Spear JR. The deep, hot biosphere: Twenty-five years of retrospection. Proc Natl Acad Sci USA. 2017;114:6895–903.
Article CAS PubMed Google Scholar
Puente-Sánchez F, Arce-Rodrı́guez A, Oggerin M, Garcı́a-Villadangos M, Moreno-Paz M, Blanco Y, et al. Viable cyanobacteria in the deep continental subsurface. Proc Natl Acad Sci USA. 2018;115::10702–7.
Article Google Scholar
Gold T. The deep, hot biosphere. Proc Natl Acad Sci USA. 1992;89:6045–9.
Article CAS PubMed Google Scholar
Sekirov I, Russell SL, Antunes LCM, Finlay BB. Gut microbiota in health and disease. Physiol Rev. 2010;90:859–904.
Article CAS PubMed Google Scholar
Singh BK, Bardgett RD, Smith P, Reay DS. Microorganisms and climate change: terrestrial feedbacks and mitigation options. Nat Rev Microbiol. 2010;8:779–90.
Article CAS PubMed Google Scholar
Cavicchioli R, Ripple WJ, Timmis KN, Azam F, Bakken LR, Baylis M, et al. Scientists’ warning to humanity: microorganisms and climate change. Nat Rev Microbiol. 2019;17:569–86.
Article CAS PubMed PubMed Central Google Scholar
Stewart EJ. Growing unculturable bacteria. J Bacteriol. 2012;194:4151–60.
Article CAS PubMed PubMed Central Google Scholar
Gonze D, Coyte KZ, Lahti L, Faust K. Microbial communities as dynamical systems. Current Opin Microbiol. 2018;44:41–9.
Article Google Scholar
Tikhonov M, Monasson R. Collective phase in resource competition in a highly diverse ecosystem. Phys Rev Lett. 2017;118:1–5.
Article Google Scholar
Butler S, O’Dwyer JP. Stability criteria for complex microbial communities. Nat Commun. 2018;9:2970.
Article PubMed PubMed Central Google Scholar
Landmann S, Engel A. Systems of random linear equations and the phase transition in MacArthur’s resource-competition model. EPL 2018;124;18004.
Article Google Scholar
Niehaus L, Boland I, Liu M, Chen K, Fu D, Henckel C, et al. Microbial coexistence through chemical-mediated interactions. Nat Commun. 2019;10:2052.
Article PubMed PubMed Central Google Scholar
Marsland R III, Cui W, Goldford J, Sanchez A, Korolev K, Mehta P. Available energy fluxes drive a transition in the diversity, stability, and functional structure of microbial communities. PLOS Comput Biol. 2019;15:1–18.
Article Google Scholar
Rivett DW, Bell T. Abundance determines the functional role of bacterial phylotypes in complex communities. Nat Microbiol. 2018;3:767–72.
Article CAS PubMed PubMed Central Google Scholar
Enke TN, Datta MS, Schwartzman J, Cermak N, Schmitz D, Barrere J, et al. Modular assembly of polysaccharide-degrading marine microbial communities. Curr Biol. 2019;29:1–8.
Article Google Scholar
Zelezniak A, Andrejev S, Ponomarova O, Mende DR, Bork P, Patil KR. Metabolic dependencies drive species co-occurrence in diverse microbial communities. Proc Natl Acad Sci USA. 2015;112:201522642.
Article Google Scholar
Louca S, Jacques SMS, Pires APF, Leal JS, Srivastava DS, Parfrey LW, et al. High taxonomic variability despite stable functional structure across microbial communities. Nat Ecol Evol. 2016;1:0015.
Article Google Scholar
Basan M. Resource allocation and metabolism: the search for governing principles. Curr Opin Microbiol. 2018;45:77–83.
Article PubMed Google Scholar
Bajic D, Sanchez A. The ecology and evolution of microbial metabolic strategies. Curr Opin Biotechnol. 2020;62:123–8.
Article CAS PubMed Google Scholar
Budinich M, Bourdon J, Larhlimi A, Eveillard D. A multi-objective constraint-based approach for modeling genome-scale microbial ecosystems. PLOS One. 2017;12:1–22.
Article Google Scholar
Scott M, Gunderson CW, Mateescu EM, Zhang Z, Hwa T. Interdependence of cell growth and gene expression. Science. 2010;330:1099–1102.
Article CAS PubMed Google Scholar
Schaechter M, Maaløe O, Kjeldgaard NO. Dependency on medium and temperature of cell size and chemical composition during balanced growth of salmonella typhimurium. Microbiology. 1958;19:592–606.
CAS Google Scholar
Scott M, Hwa T. Bacterial growth laws and their applications. Curr Opin Biotechnol. 2011;22:559–65.
Article CAS PubMed PubMed Central Google Scholar
Basan M, Hui S, Okano H, Zhang Z, Shen Y, Williamson JR, et al. Overflow metabolism in Escherichia coli results from efficient proteome allocation. Nature. 2015;528:99–104.
Article CAS PubMed PubMed Central Google Scholar
Mori M, Hwa T, Martin OC, De Martino A, Marinari E. Constrained allocation flux balance analysis. PLOS Comput Biol. 2016;12:1–24.
Article Google Scholar
MacArthur R. Species packing, and what competition minimizes. Proc Natl Acad Sci USA. 1969;64:1369–71.
Article Google Scholar
MacArthur R. Species packing and competitive equilibrium for many species. Theor Popul Biol. 1970;1:1–11.
Article CAS PubMed Google Scholar
Chesson P. MacArthur’s consumer-resource model. Theor Popul Biol. 1990;37:26–38.
Article Google Scholar
Tikhonov M. Community-level cohesion without cooperation. eLife. 2016;5:e15747.
Article PubMed PubMed Central Google Scholar
Posfai A, Taillefumier T, Wingreen NS. Metabolic trade-offs promote diversity in a model ecosystem. Phys Rev Lett. 2017;118:28103.
Article Google Scholar
Advani M, Bunin G, Mehta P. Statistical physics of community ecology: a cavity solution to MacArthur’s consumer resource model. J Stat Mech. 2018;2018:033406.
Article PubMed PubMed Central Google Scholar
Droop MR. The nutrient status of algal cells in continuous culture. J Marine Biol Assoc UK. 1974;54:825–55.
Article CAS Google Scholar
Khandelwal RA, Olivier BG, Röling WFM, Teusink B, Bruggeman FJ. Community flux balance analysis for microbial consortia at balanced growth. PLOS One. 2013;8:1–10.
Article Google Scholar
Embree M, Liu JK, Al-Bassam MM, Zengler K. Networks of energetic and metabolic interactions define dynamics in microbial communities. Proc Natl Acad Sci USA. 2015;112:15450–5.
Article CAS PubMed Google Scholar
Liao C, Wang T, Maslov S, Xavier JB. Modeling microbial cross-feeding at intermediate scale portrays community dynamics and species coexistence. PLOS Comput Biol. 2020;16:1–23.
Article CAS Google Scholar
Muscarella ME, O’Dwyer JP. Species dynamics and interactions via metabolically informed consumer-resource models. Theor Ecol 2020;13:503–18.
Article Google Scholar
Hermsen R, Okano H, You C, Werner N, Hwa T. A growth-rate composition formula for the growth of E. coli on co-utilized carbon substrates. Mol Syst Biol. 2015;11:801.
Article PubMed PubMed Central Google Scholar
Erickson DW, Schink SJ, Patsalo V, Williamson JR, Gerland U, Hwa T. A global resource allocation strategy governs growth transition kinetics of Escherichia coli. Nature. 2017;551:119–23.
Article PubMed PubMed Central Google Scholar
Pacciani-Mori L, Giometto A, Suweis S, Maritan A. Dynamic metabolic adaptation can promote species coexistence in competitive communities. PLOS Comput Biol. 2020;16:1–18.
Article Google Scholar
Taillefumier T, Posfai A, Meir Y, Wingreen NS. Microbial consortia at steady supply. eLife. 2017;6:e22644.
Article PubMed PubMed Central Google Scholar
Ratzke C, Gore J. Modifying and reacting to the environmental pH can drive bacterial interactions. PLoS Biol. 2018;16:e2004248.
Article PubMed PubMed Central Google Scholar
Monod J. The growth of bacterial cultures. Ann Rev Microbiol. 1949;3:371–94.
Article CAS Google Scholar
Stülke J, Hillen W. Carbon catabolite repression in bacteria. Curr Opin Microbiol. 1999;2:195–201.
Article PubMed Google Scholar
Görke B, Stülke J. Carbon catabolite repression in bacteria: many ways to make the most out of nutrients. Nat Rev Microbiol. 2008;6:613–24.
Article PubMed Google Scholar
Stoebel DM, Dean AM, Dykhuizen DE. The cost of expression of escherichia coli lac operon proteins is in the process, not in the products. Genetics. 2008;178:1653–60.
Article CAS PubMed PubMed Central Google Scholar
Elowitz MB, Levine AJ, Siggia ED, Swain PS. Stochastic gene expression in a single cell. Science. 2002;297:1183–6.
Article CAS PubMed Google Scholar
Rosset R, Monier R, Julien J. RNA composition of escherichia coli as a function of growth rate. Biochem Biophys Res Commun. 1964;15:329–33.
Article CAS Google Scholar
Rosset R, Julien J, Monier R. Ribonucleic acid composition of bacteria as a function of growth rate. J Mol Biol. 1966;18:308–20.
Article CAS PubMed Google Scholar
Van Derlinden E, Van, Impe JF. Modeling growth rates as a function of temperature: Model performance evaluation with focus on the suboptimal temperature range. Int J Food Microbiol. 2012;158:73–8.
Article PubMed Google Scholar

Download references

Acknowledgements

We thank David R Nelson and Andrew W Murray for hosting LP-M during the experiment and the initial development of the model and for insightful comments and suggestions. We thank Daniel Eaton for providing the ancestor bacterial strains used in the experiment. AM and LP-M acknowledge Fondazione Cariparo for funding. SS acknowledges the University of Padua for STARS ReACT grant. AG acknowledges support from the Swiss National Science Foundation, Projects P2ELP2_168498, P400PB_180823 and P400PB_180823/2.

Author information

These authors contributed equally: Amos Maritan, Andrea Giometto

Authors and Affiliations

Dipartimento di Fisica e Astronomia “Galileo Galilei”, Università degli Studi di Padova, Via Francesco Marzolo 8, 35131, Padova, Italy
Leonardo Pacciani-Mori, Samir Suweis & Amos Maritan
Department of Physics, Harvard University, 17 Oxford St, Cambridge, MA, 02138, USA
Leonardo Pacciani-Mori
School of Civil and Environmental Engineering, Cornell University, 220 Hollister Dr, Ithaca, NY, 14853, USA
Andrea Giometto

Authors

Leonardo Pacciani-Mori
View author publications
You can also search for this author in PubMed Google Scholar
Samir Suweis
View author publications
You can also search for this author in PubMed Google Scholar
Amos Maritan
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Giometto
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Leonardo Pacciani-Mori or Andrea Giometto.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pacciani-Mori, L., Suweis, S., Maritan, A. et al. Constrained proteome allocation affects coexistence in models of competitive microbial communities. ISME J 15, 1458–1477 (2021). https://doi.org/10.1038/s41396-020-00863-0

Download citation

Received: 11 June 2020
Revised: 19 November 2020
Accepted: 30 November 2020
Published: 11 January 2021
Issue Date: May 2021
DOI: https://doi.org/10.1038/s41396-020-00863-0

This article is cited by

Discovery of an antitumor compound from xenorhabdus stockiae HN_xs01
- Xiyin Huang
- Qiong Tang
- Shengbiao Hu
World Journal of Microbiology and Biotechnology (2024)
Ecological modelling approaches for predicting emergent properties in microbial communities
- Naomi Iris van den Berg
- Daniel Machado
- Kiran R. Patil
Nature Ecology & Evolution (2022)