Environmental flexibility does not explain metabolic robustness

Libiseller-Egger, Julian; Coltman, Benjamin Luke; Gerstl, Matthias P.; Zanghellini, Jürgen

doi:10.1038/s41540-020-00155-5

Download PDF

Article
Open access
Published: 27 November 2020

Environmental flexibility does not explain metabolic robustness

npj Systems Biology and Applications volume 6, Article number: 39 (2020) Cite this article

1888 Accesses
2 Citations
8 Altmetric
Metrics details

Subjects

Abstract

Cells show remarkable resilience against genetic and environmental perturbations. However, its evolutionary origin remains obscure. In order to leverage methods of systems biology for examining cellular robustness, a computationally accessible way of quantification is needed. Here, we present an unbiased metric of structural robustness in genome-scale metabolic models based on concepts prevalent in reliability engineering and fault analysis. The probability of failure (PoF) is defined as the (weighted) portion of all possible combinations of loss-of-function mutations that disable network functionality. It can be exactly determined if all essential reactions, synthetic lethal pairs of reactions, synthetic lethal triplets of reactions etc. are known. In theory, these minimal cut sets (MCSs) can be calculated for any network, but for large models the problem remains computationally intractable. Herein, we demonstrate that even at the genome scale only the lowest-cardinality MCSs are required to efficiently approximate the PoF with reasonable accuracy. Building on an improved theoretical understanding, we analysed the robustness of 489 E. coli, Shigella, Salmonella, and fungal genome-scale metabolic models (GSMMs). In contrast to the popular “congruence theory”, which explains the origin of genetic robustness as a byproduct of selection for environmental flexibility, we found no correlation between network robustness and the diversity of growth-supporting environments. On the contrary, our analysis indicates that amino acid synthesis rather than carbon metabolism dominates metabolic robustness.

Reconstructing organisms in silico: genome-scale models and their emerging applications

Article 21 September 2020

Local flux coordination and global gene expression regulation in metabolic modeling

Article Open access 14 September 2023

A structural property for reduction of biochemical networks

Article Open access 31 August 2021

Introduction

“Robustness” is a system’s intrinsic ability to maintain functionality under perturbation. Due to the generality of this definition, a consensus on an exact quantitative metric, especially in a biological context, has yet to emerge^1,2,3,4. Aside from biology, robustness and reliability are key design goals in many fields of engineering and particularly paramount in the development of safety-critical systems⁵. Thus, some of the theoretical groundwork relevant in these areas can be borrowed for a systems biology approach^1,2.

Common basic reliability measures consider only the interaction of binary-state components, i.e. components that are either fully functional or fully dysfunctional. In such an analysis, a network is deemed functional if there exists at least one path from the input node(s) to the output node(s). Some early efforts to quantify structural robustness in metabolic networks follow the same principle. They correlate robustness with the number of unique, minimal pathways⁶ that provide a given function (in the context of metabolism usually cell growth)^7,8,9.

Consistent with the characterisation of “functionality” as cell growth, “dysfunctionality” (i.e. the lack of growth) is caused by lethal combinations of reaction knockouts, which could arise via loss-of-function (LOF) mutations disabling vital enzymatic activity. Similarly to how all paths leading from input nodes to output nodes can be determined computationally¹⁰, the same can be done for all possible ways of blocking these paths (i.e. lethal combinations of reaction deletions)¹¹. Analogously, knowledge of these knockout combinations, termed minimal cut sets (MCSs)¹², allows for computing the network’s PoF, which—in mathematical terms—is the complement of robustness¹³.

As we have shown previously, the great advantage in using the PoF as a surrogate for robustness lies in the fact that only the lowest cardinality-MCSs are sufficient to approximate it with negligible error¹³. As efficient algorithms for finding low-cardinality MCSs are available^14,15, this allows for the examination of robustness in GSMMs.

Here, we present fundamental improvements—both in theory and software implementation—of our PoF-calculating algorithm¹³. Provided with a list of low-cardinality MCSs and running on a personal computer, the program is now capable of determining the PoF of a GSMM at reasonable accuracy in under a minute, whereas hours of high-performance computing time were required in the past. With the improved tool at hand we computed the PoF across a large panel of Enterobacteriaceae as well as 30 fungal species and show that their structural metabolic robustness does not correlate with their ability to thrive in nutritionally diverse environments.

Box 1 List of symbols

m Number of minimal cut sets (MCSs) disabling growth in a metabolic network

m_i Number of MCSs with cardinalities up to i

d_m Cardinality up to which all MCSs have been calculated (i.e. for d_m = 3 all m₃ MCSs with up to three reaction knockouts are known)

d₀ Cardinality up to which all supersets of all known MCSs are considered by pof2.0

f_d Failure frequency at d random reaction deletions (i.e. the chance of a metabolic network not being able to grow given loss-of-function (LOF) mutations in d random reactions)

${f}_{d}^{{d}_{m}}$ Approximated failure frequency at d random reaction deletions computed with all MCSs up to cardinality d_m

F Probability of failure (PoF) (probability of a metabolic network acquiring a lethal combination of LOF mutations)

${\widetilde{F}}_{{d}_{0}}^{{d}_{m}}$ PoF estimate calculated by pof2.0 given all MCSs up to cardinality d_m and considering all possible unions of MCSs up to cardinality d₀

Results

Terminology and definitions

The PoF, F, of a metabolic network with r reactions is defined as the probability that a given number of random LOF mutations disables growth (i.e. is lethal). It can be expressed as the probability-weighted average of the failure frequency

$${f}_{d}=\frac{{\mathrm{number}}\; {\mathrm{of}}\; {\mathrm{lethal}}\; {\mathrm{combinations}}\; {\mathrm{of}}\; {d}\; {\mathrm{deletions}}}{{\mathrm{number}}\; {\mathrm{of}}\; {\mathrm{all}}\; {\mathrm{combinations}}\; {\mathrm{of}}\; {d}\; {\mathrm{deletions}}}$$

(1)

for LOF mutations in exactly d enzymes (reactions) being lethal. This gives

$$F:= \mathop{\sum }\limits_{d = 1}^{r}{w}_{d}{f}_{d}$$

(2)

with the discrete binomial distribution

$${w}_{d}=\left({r}\atop{d}\right){p}^{d}{(1-p)}^{r-d},$$

(3)

which—for a constant reaction-level mutation rate p—describes the probability that d LOF mutations occur.

Equation (1) can be expressed in a computationally more approachable way using combinations of MCSs¹³. In this context, an MCS is a minimal set of reaction deletions that suppresses growth¹². Essential reactions, synthetic lethal pairs or synthetic lethal triplets are examples of MCSs with cardinalities one, two, or three, respectively. In the Supplementary Notes we expand on ref. ¹³ and show that, if all MCSs of a metabolic network are known, Eq. (2) can be formulated as

$$F=\mathop{\sum }\limits_{{{\emptyset}}\ne {\mathcal{J}}\subseteq \{1,...,m\}}{(-1)}^{| {\mathcal{J}}| -1}\,{p}^{| {{\mathscr{M}}}_{{\mathcal{J}}}| },$$

(4)

where ${{\mathscr{M}}}_{{\mathcal{J}}}$ denotes the union ${\cup }_{j\in {\mathcal{J}}}{{\mathscr{M}}}_{j}$ of one or more MCSs with the (multi)-index ${\mathcal{J}}$ running over the power set of the indices {1, . . ., m} of all m MCSs. In other words, this means that every possible combination of one or more MCSs is a summand in Eq. (4). In Supplementary Table 1, we provide an evaluation of Eq. (4) for the toy network in Fig. 1a.

Note that according to Eq. (4), F solely depends on the total number of MCSs, m, and the cardinalities of the members of their power set (i.e. on how the MCSs overlap with each other), but not on the number of reactions in the network. Thus, the PoF is independent of network size.

PoF by example

A toy network composed of five irreversible and one reversible reaction is depicted in Fig. 1a and produces biomass via r₆. Any combination of LOF mutations encompassing the essential reaction r₆, one of the synthetic lethal pairs {r₁, r₂}, {r₄, r₅}, or one of the synthetic lethal triplets {r₁, r₃, r₅}, {r₂, r₃, r₄} is lethal. The failure frequency, f_d (i.e. the probability that a given number of d LOF mutations results in cell death), is illustrated in Fig. 1b. In general, the failure frequency for any fixed d is dominated by single and double LOF mutations (see the hatched bars in Fig. 1b), while the contribution of higher order combinations is relatively small. For instance, in Fig. 1b all f_d—except for f₃—can be computed exactly with the knowledge of the single and double LOF mutations alone as all f_d with d > 3 are already determined to be 1 by their respective contributions.

For a constant mutation rate p = 0.1 the likelihood that one, two, or more LOF mutations occur is binomially distributed, and the resulting expected value of the probability of a lethal set of mutations (i.e. the PoF) is p + 2p² − 7p⁴ + 7p⁵ − 2p⁶ = 0.11936 (see Supplementary Tables 1 and 2 for details). Note that it is independent of the network’s size.

Estimating F in large models

Equation (4) determines the PoF exactly as long as all m MCSs are known and the sum is evaluated over all their possible combinations (i.e. the full power set with 2^m − 1 members). For GSMMs both requirements cannot be met as the enumeration of all high-cardinality MCSs in a large network is computationally intractable and—even if it was not—the number of possible combinations would soon exceed the number of atoms in the universe. Thus, approximations are indispensable.

It is possible to systematically construct the first $m_i$ MCSs of lowest cardinality¹⁵, which have the greatest impact on the PoF. Hence, F can still be reliably estimated, even when only a relatively small number of MCS are known¹³. This addresses the first issue. However, even if the number of included MCSs is reduced from m to $m_i$, the exponential explosion of the summands still remains critical. In the simplest, least accurate case, we consider only the essential reactions, i.e. all m₁ MCSs with $| {{\mathscr{M}}}_{j}| =1$, and Eq. (4) simplifies to

$$F\approx {\widetilde{F}}_{r}^{1}=1-{(1-p)}^{{m}_{1}}\approx {m}_{1}p$$

(5)

(for derivation, see Supplementary Notes). When all MCSs up to a certain cardinality are known, we call this cardinality d_m (in this case d_m = 1). The number of essential reactions, m₁, times the mutation rate, p, is in fact the leading term in the sum of Eq. (4). To improve accuracy, we resort to Eq. (2) and, rather than summing over all r, truncate the sum to d → d₀ so that

$$F\approx {\widetilde{F}}_{{d}_{0}}^{{d}_{m}}=\mathop{\sum }\limits_{d = 1}^{{d}_{0}}{w}_{d}{f}_{d}^{{d}_{m}}.$$

(6)

Here, the superscript d_m indicates that ${\widetilde{F}}_{{d}_{0}}$ and f_d are computed with all MCSs up to cardinality d_m. However, we show in the Supplementary Notes that the resulting error can be further reduced.

Figure 2 serves to illustrate the maximum error ${\varepsilon }_{\max }$ for ${\widetilde{F}}_{10}^{3}$ in the E. coli GSMMi JO1366 (d_m = 3; all MCSs with cardinalities <4 are known). For d ≤ d_m = 3 all ${f}_{d}^{3}$ are exact. MCSs with cardinalities >3 have not been calculated and their contributions are unknown. Therefore, in the worst case all cut sets of cardinality 4 could be lethal leading to ${f}_{d}^{3}=1$ for d ≥ 4. The resulting maximum error is represented by the hatched bars in the inset of Fig. 2. It can be calculated via

$${\varepsilon }_{\max }=1-{\widetilde{F}}_{{d}_{0}}^{{d}_{m}}-\mathop{\sum }\limits_{d = 0}^{{d}_{m}}{w}_{d}\left(1-{f}_{d}^{{d}_{m}}\right).$$

(7)

For iJO1366 in Fig. 2 the maximum absolute and relative errors peak around p = 3 × 10⁻³ and quickly approach zero for p < 0.5 × 10⁻³. Note that the sum in Eq. (7) is independent of d₀.

**Fig. 2: Maximum error of $\widetilde{F}$.**

Implementation

In our previous work¹³, we approximated the (total) PoF by computing ${{f}}_{{d}}^{{d}_{m}}$ only for d ≤ d₀ = d_m and formulated a parallelised algorithm to carry out the truncated sum. Here, we present pof2.0 that includes further algorithmic improvements and allows us to quickly evaluate ${\widetilde{F}}_{{d}_{0}}^{{d}_{m}}$ for larger d₀. This has two implications: For a given number of MCSs F can be predicted more precisely and to achieve a certain accuracy the number of required MCSs can be reduced.

At the core of our method lies the fact that any lethal cut set must be a superset of at least one MCS. In order to estimate ${\widetilde{F}}_{{d}_{0}}^{{d}_{m}}$, $| {{\mathscr{M}}}_{{\mathcal{J}}}|$ (the number of knocked out reactions) as well as $| {\mathcal{J}}|$ (the number of MCSs in the cut set) need to be found for every possible combination that can be formed from the network’s known MCSs (i.e. their power set). The program achieves this by a recursive procedure, the basis of which has been outlined in ref. ¹³. In short, we first sort the list of MCSs by cardinality. Then, for every element in the sorted list, we iterate over all MCSs that appear earlier in the list and form the respective unions with $| {\mathcal{J}}| =2$. If a union with $| {{\mathscr{M}}}_{{\mathcal{J}}}| \le {d}_{0}$ is not a subset of a previously encountered cut set, we again iterate over all elements in the list up to the earliest MCS already present in that union to form all cut sets with $| {\mathcal{J}}| =3$ and so on. $| {{\mathscr{M}}}_{{\mathcal{J}}}|$ and $| {\mathcal{J}}|$ of every lethal combination are recorded to be evaluated once recursion has finished. The actual implementation employs a few shortcuts that considerably reduce the number of recursions required.

Moreover, in metabolic pathway analysis, it is common practice to compress networks before analysis¹⁶. This reduces the number of metabolites and reactions without losing information and is necessary for efficient utilisation of computational resources. Naturally, compressed networks also contain drastically fewer MCSs. pof2.0 is capable of handling linearly compressed networks, which entails a major leap in performance. The code is available at www.github.com/julibeg/PoF.

PoF computes within seconds

To assess the computational efficiency of pof2.0, we analysed its performance as a function of the number of processed MCSs for E. coli’s central carbon metabolism model (CCMM)¹⁷ and its GSMM iJO1366¹⁸ (Supplementary Fig. 1). In both cases, we observed a considerable reduction in run-time cost per additionally processed MCS compared to the previous implementation¹³. For instance, after processing ~2000 MCSs of the CCMM, we reached a ~100-fold speedup, the majority of which can be attributed to network compression. Extrapolating run-times to ~796,070 MCSs (d_m = 15) gives an estimate of ~280 h for the previous implementation while pof2.0 took less than a second. Due to these advances, the bulk of the computational burden in estimating structural robustness now clearly lies at MCS enumeration.

The preceding sections established the theoretical foundation and computational feasibility of our approach to quantify structural robustness. Next, we capitalise on the reduced run-time of pof2.0 by analysing hundreds of GSMMs in order to showcase the utility of our approach and examine the nature of metabolic robustness. Unless stated otherwise, all PoF approximations reported below were calculated with d_m = 3, d₀ = 10, and p = 10⁻⁴.

PoF negatively correlates with carbon source molar mass

To explore the influence of varying carbon sources on the robustness of growth in E. coli we computed the PoF of iJO1366¹⁸ growing on minimal medium under aerobic conditions for all its 181 single, growth-supporting carbon sources. As shown in Fig. 3, the PoF decreased with increasing molar mass of the carbon source (Pearson: r = −0.30, p = 3 × 10⁻⁵; Spearman: r = −0.48, p = 8 × 10⁻¹²).

**Fig. 3: PoF vs. carbon source molar mass.**

Figure 3 features two pronounced spikes (marked “1” and “2”). In spike 1 the PoF increases with the molar mass of the respective carbon sources, which are all medium- and long-chain fatty acids. While in reality the same enzymes process fatty acids of different lengths during beta-oxidation, these steps are represented as distinct reactions in metabolic models. Thus, contrary to in vivo, fatty acids are catabolised by increasingly longer linear reaction chains in silico. Spike 2, on the other hand, is comprised of cases where a dramatic increase in molar mass has hardly any effect on the PoF. Responsible are oligomaltoses (i.e. starch oligomers), most of which reach central metabolism via the same number of reactions in silico. Without the spikes, correlation increases to: Pearson r = −0.57, p = 5.4 × 10⁻¹⁶; Spearman r = −0.58, p = 1.3 × 10⁻¹⁶.

We further investigated the impact of the carbon source’s molecular composition on the PoF, which revealed a hierarchy based on chemical composition. On average, “pure” carbon sources (i.e. consisting only of C, H, and—possibly—O atoms) supported less robust growth than substrates containing either nitrogen or phosphate or both; see Supplementary Fig. 2.

Growth on “elementally richer” substrates being more robust can be explained by higher redundancy. For instance, when growing on an amino acid, otherwise lethal mutations in ammonium transport and assimilation pathways would become irrelevant. As an example consider the 298 MCSs of cardinality 1 in the joint set of MCSs for aerobic growth on either glucose or glutamate, out of which 285 were identical for both. Only eight and five were specific to glucose and glutamate, respectively, which is reflected in a lower PoF for the latter. Three out of the eight essential reactions unique to growth on glucose are involved in ammonium transport. This illustrates how F is linked to topological differences in metabolic networks.

The origin of biological robustness

The evolutionary origin of genetic robustness—despite its ubiquity^1,19—remains strongly debated^20,21. Approaches to explaining its emergence can be roughly split into three categories: congruent, adaptive and intrinsic²¹. Contrary to the adaptive hypothesis, which states that genetic robustness is selected for directly^{22,23,24,25,26,27,28}, the congruent hypothesis postulates that it rather arises as a byproduct of selection for environmental resilience^29,30,31. The intrinsic hypothesis, on the other hand, negates any evolutionary impact and argues that robustness is a fundamental property of complex systems and networks^32,33,34. However, due to its multi-factorial nature, it is more likely that no single mechanism is responsible for the development of biological robustness. The actual origin may rather comprise a combination of the hypotheses introduced above.

To test the impact of the congruence mechanism (“environmental robustness breeds genetic robustness”), we speculated that organisms capable of growing in many different conditions (robust towards changes in the chemical environment) would exhibit low PoFs (robust towards LOF mutations). Accordingly, structural robustness in metabolic networks should correlate with the number of substrates enabling growth. To evaluate this idea, we calculated the PoF and counted the aerobic, growth-supporting carbon sources for GSMMs of 53 E. coli and Shigella³⁵, 406 Salmonella³⁶, as well as 30 fungal³⁷ strains.

Robustness correlates with the diversity of growth media

For the E. coli and Shigella models the PoF was calculated twice—once with regard to growth on glucose minimal medium and once for minimal medium with all growth-supporting carbon sources enabled. Being allowed to grow on all carbon sources the networks showed considerably higher robustness compared to growth on glucose alone (Supplementary Fig. 3). In fact, when testing iJO1366, the average of F decreased monotonically with the number of carbon sources available (see Supplementary Fig. 4). This confirms the naïve expectation that fatal reaction deletions are more strongly buffered in richer media. However, the magnitude of the two PoFs for a given model were correlated (Pearson r = 0.77, p = 1 × 10⁻¹¹; Spearman r = 0.60, p = 3 × 10⁻⁶).

Similarly, we found growth on minimal glucose medium to be considerably more robust than optimal growth on the same medium (Supplementary Fig. 3a). In fact, the distribution of the PoF across the models was quite consistent for the different conditions with the exception of Shigella boydii Sb227. This strain showed the greatest robustness among all E. coli and Shigella models at glucose-optimal growth while ranking last when all carbon sources were enabled (Supplementary Fig. 3b).

PoF does not correlate with carbon utilisation capabilities

Figure 4 shows the PoF across all E. coli, Shigella, Salmonella, and fungal models versus the number of carbon sources that allow the respective networks to grow. Among the E. coli and Shigella strains, it ranged from 2.78% (E. coli IAI39) to 2.95% (S. boydii CDC 3083-94) with an average of 2.87%. For Salmonella and the fungi, it was considerably lower with averages of 2.62% and 1.04%, respectively. Within the respective groups there was no correlation between F and the number of carbon sources. On the contrary, most E. coli models sat on a straight horizontal line from 167 to 186 different carbon sources with the PoF being practically identical at 2.8685%. Similarly, out of 406 Salmonella models, 213 had essentially the same $\widetilde{F}$ of 2.6059% and grow on 143–168 carbon sources. Finally, the fungi showed the highest amount of variability; yet again no correlation with the number of carbon sources.

**Fig. 4: Genetic robustness vs. environmental robustness.**

The observation that many strains have (almost) identical PoF values, although differing substantially in carbon utilisation capability, suggests that additional carbon sources mostly only add extra uptake reactions to the network rather than increasing connectivity. For example, all E. coli models with ${\widetilde{F}}_{10}^{3}\approx 2.8685 \%$ feature the same set of essential reactions, albeit some can grow on close to 20 more carbon sources compared to others. The linear fit in Fig. 5 corroborates this assumption as the number of reactions increases by ~3.28 for every additional carbon source. These three reactions usually consist of an exchange reaction for importing the carbon source across the model boundary and two transport reactions across the outer and inner membrane. Therefore, roughly only every third carbon source adds one non-transport reaction to the model leaving little room for increasing metabolic connectivity.

To further examine this hypothesis we computed the size of E. coli’s “shared reactome”, i.e. the set of reactions present in all E. coli GSMMs. Under the investigated conditions (aerobic growth on glucose minimal medium), it was 1336/1774 = 75%. Considering only the E. coli strains with virtually equal PoF, it increased to 1447/1772 = 82%, indicating that the PoF is a good predictor for network overlap. Similarly, a pairwise comparison across all E. coli and Shigella models revealed that the difference in PoF correlates with the pairwise network dissimilarity, 1 − σ_i,j, of any two networks M_i and M_j (see Fig. 6). Here, dissimilarity was defined as the complement of the pairwise network similarity

$${\sigma }_{i,j}=\frac{{r}_{{M}_{i}\cap {M}_{j}}^{2}}{{r}_{{M}_{i}}{r}_{{M}_{j}}},$$

(8)

where ${r}_{{M}_{i}},{r}_{{M}_{j}}$, and ${r}_{{M}_{i}\cap {M}_{j}}$ denote the number of reactions in the models M_i, M_j, and of those found in both, respectively.

**Fig. 6: Pairwise difference in the PoF across all *E. coli* and *Shigella* GSMMs vs. network dissimilarity.**

In addition to greater robustness, the Salmonella models also displayed reduced nutritional flexibility; being able to grow on only 164 carbon sources on average as opposed to E. coli’s 179. Moreover, their relative shared reactome was slightly larger at 1274/1628 = 78%.

With only 1383/2153 = 64% of reactions belonging to the shared reactome, the fungal models showed larger diversity. This is not surprising, since the group of fungal models featured 30 species from different genera as opposed to multiple strains/species from the same genus for the bacterial models.

In spite of decreased nutritional flexibility, all fungal species showed considerably larger robustness as opposed to the bacterial models discussed above. However, it appears that this observation is an artefact of the different model reconstruction procedures rather than a genuine biological effect. The fungal models’ biomass consisted of 44 components, while the bacterial biomass reactions were considerably more detailed (51 and 68 components for Salmonella and E. coli/Shigella, respectively), including additional vitamins and metal ions. Consequently, the growth medium was more complex for the bacterial models (containing 18 components) as opposed to the fungal models with only five components (see Supplementary Table 3). Moreover, the more complex the biomass composition, the more likely it is affected by LOF mutations. In fact, when using the fungal biomass reaction as objective in iJO1366, the PoF decreased from 2.89 to 1.53% (Supplementary Fig. 5). Thus, as the biomass composition is different for models of the respective reconstruction groups (fungal, E. coli/Shigella, and Salmonella), we conclude that comparisons within groups are appropriate, but not across groups.

When comparing pathogenic and non-pathogenic models within the same reconstruction group, no differences in the PoF have been found (Supplementary Fig. 6). However, given the small number of purely pathogenic strains/species and the lacking genetic diversity among them, general trends cannot be inferred.

Essential reactions dominate metabolic robustness

For the models assessed here, a comparison with Eq. (5) (right panel in Fig. 4) reveals that even at the relatively high p = 10⁻⁴ the number of MCSs with cardinalities two and three was not large enough to contribute noticeably to ${\widetilde{F}}_{10}^{3}$. This observation—under the assumption that it can be generalised to most bacterial models growing on minimal medium—suggests that a reasonably accurate approximation for the structural robustness of these networks could be obtained solely from the number of essential reactions, m₁, which is easy to compute.

Figure 6 further highlights the minuscule contribution of the MCSs with $| {{\mathscr{M}}}_j| \in \{2,3\}$ as all pairwise PoF-differences between two models with the same number of essential reactions are smaller than 10⁻⁶. Similarly, the PoF differences among pairs of models with the same number of essential reactions as well as synthetic lethal pairs (m_1,i = m_1,j and m_2,i = m_2,j) lie between 10⁻¹² and 10⁻¹⁰ and are mostly determined by differences in the number of synthetic lethal triplets Δm₃ (Supplementary Fig. 7).

As F is almost exclusively determined by m₁, we identified the essential reactions shared by all E. coli and Shigella models and the corresponding KEGG³⁸ pathways (Fig. 7). Interestingly, (central) carbon metabolism only hosts a small number of essential reactions and thus constitutes a minor contribution to the PoF. Instead, amino acid synthesis appears to be buffered by redundancy to a lesser extent.

**Fig. 7: Essential reactions grouped by pathway.**

Discussion

Robustness is a key feature of biological systems. Yet, its exact quantification remains challenging. In a previous work, we have modelled the robustness of metabolic networks by counting the frequency of the binary network response (as measured by growth or no growth under steady-state conditions) upon random reaction deletions¹³. Similar mathematical strategies are commonly used in engineering to evaluate the reliability of safety-critical structures, for example in aircraft³⁹. Here, we expanded on this approach and presented a comprehensive theoretical description which we validated by confirming the buffering capabilities of diverse growth media^40,41.

Capitalising on reduced run-times for GSMMs, we were able to evaluate the PoF across 459 members of the family Enterobacteriaceae (amongst them commensal and pathogenic E. coli, Shigella, as well as Salmonella strains) and 30 species of fungi growing on glucose minimal medium under aerobic conditions. Although the closely related E. coli and Salmonella strains, whose lineages separated 140 million years ago⁴², share a large fraction of their genetic material^43,44, we found the Salmonella models to be considerably more robust. In fact, Salmonella serovars were shown to have fewer essential genes (relative to the genome size) than E. coli MG1655^45,46,47, which is also reflected in the corresponding metabolic models. This indicates higher redundancy and supports our in silico analysis of the respective GSMMs. However, in this work we have not undertaken any phylogenetic analyses and thus cannot discuss evolutionary trajectories that could explain these differences.

Compared to the bacterial models the fungal species analysed here showed drastically higher robustness. However, it should be stressed that, if GSMMs that have been generated via different reconstruction pipelines are compared, caution has to be exercised as differences in the reconstructions process add to the observed PoF. For instance, the biomass composition is more detailed in the Enterobacteriaceae models as they require metal ions in their minimal in silico medium to support growth, whereas the fungal models do not. This prohibits a direct comparison between the respective groups. In general, we observed an increase in the PoF with the complexity of the biomass formulation. This is consistent with earlier reports that the size of essential networks scales with the number of biomass components⁴⁸. However, to turn this drawback into a benefit, the PoF could be used as a quality indicator. For instance, we were able to track the smaller PoF of E. coli IAI39 compared to the average PoF of the E. coli strains back to a lumped reconstruction of folate biosynthesis.

Within the groups of models that share the same reconstruction pipeline (E. coli/Shigella, Salmonella, and fungi), our findings do not support a connection between nutritional flexibility and cellular robustness. In line with this result, the diversity of genetic capabilities observed in Salmonella was shown to not be indicative of strain-specific lifestyle³⁶. Here we find that this is true not only for Enterobacteriaceae but also for fungi. Although the strain-specific portions of the pan-reactome are largely associated with altered carbon metabolism^35,36, our analysis reveals that this does not lead to increased connectivity (Fig. 5)—and thereby structural robustness—of the respective network. This reflects the notion that metabolism is organised in a bow tie structure with a large number of diverse nutrients being “fanned into” a tightly knotted core network from where just a few key metabolites “fan-out” again to produce all building blocks and macromolecular compounds that make up a cell^49,50. The observation that on average the network size of E. coli and Shigella GSMMs increased by only three reactions for every additional carbon source (Fig. 5)—a number that is in good agreement with the previously derived two reactions per carbon source for minimal metabolic models⁴⁸—also corroborates this interpretation.

The congruent hypothesis has recently also been challenged by Ho and Zhang²⁰. Their analysis was based on the reallocation of growth-optimal flux distributions in a single GSMM of E. coli MG1655 (computed via minimisation of metabolic adjustments⁵¹) upon genetic and environmental perturbations²⁰. Due to the use of an optimisation principle, their analysis was inherently biased, while our approach is more general in the sense that it takes the complete network structure (i.e. all possible flux re-routings) into account⁵². It could be argued that robustness is most relevant in the fittest states. However, our results show that structural robustness impacts all levels of growth-optimality in a similar way (Supplementary Fig. 3).

Classically, an unbiased characterisation of metabolic networks is achieved in terms of minimal functional units that are able to characterise all feasible steady-state flux distributions^6,53,54,55. Alternatively, a network can be equally well described by failure modes, called MCSs¹¹. This is mirrored by the fact that, according to Eq. (4), the PoF is solely determined by the MCSs and independent of the network’s size. Thus, it is a true feature of a network’s structural topology.

In principle, F can be computed exactly. However, in practice it suffers from the combinatorial explosion of the number of MCSs in GSMMs and the exponential explosion of the number of summands in Eq. (4) (i.e. the power set of the MCSs). Nonetheless, evaluating the PoF remains feasible as the (relatively few) low-cardinality MCSs are most dominant¹³. In fact, for many models growing on minimal media the number of essential reactions already suffices. This is due to the fact that it is far more likely that a randomly selected set of d deletions contains e.g. one or more essential reaction(s) than a complete MCS of cardinality d. Hence, high-cardinality MCSs are not as relevant and can be neglected—provided d₀ is sufficiently large and p is sufficiently small (Fig. 2).

The probability of LOF mutations in a given number of reactions follows the binomial distribution, consequently depending on the total number of reactions in the model as well as the per-reaction mutation rate p. As we reported solely comparative analyses here, we focused on relative differences in F and chose p to be 10⁻⁴ per reaction. In order to correctly calculate absolute values, a more accurate, strain-specific estimate for p would be required. However, quantifying deleterious mutation rates proves challenging as LOF mutations with strong effects are usually cleared from populations quickly and thereby evading experimental detection. Kibota and Lynch⁵⁶ give a lower bound of 2 × 10⁻⁴ deleterious mutations per genome and generation in E. coli, while estimates for the general mutation rate are in the range of 5 × 10⁻³ to 10⁻⁴ per genome and generation^57,58,59,60. Hence, the rate for LOF mutations of a given gene (or reaction) can be confidently assumed to be very low.

This has implications for the computation of F, as fewer MCSs are required to give precise estimates in case of smaller mutation rates. For instance, in a network with 3000 reactions out of which 300 are essential the maximum relative error of the analytical PoF estimate from the essential reactions alone would be around 1% with p = 10⁻⁶ (see Supplementary Fig. 8c). Thus, in such a case performing recursion would not be necessary.

In fact, for all models analysed here the number of essential reactions dominated $\widetilde{F}$, enabling accurate predictions with Eq. (5) even at the unrealistically high p = 10⁻⁴ (see Fig. 4b). Thus, genetic robustness could simply be estimated given the number of essential reactions or genes.

Due to the binary nature of LOF mutations underlying the concept of the PoF, it can only measure structural redundancy in metabolic networks by analysing their topology in the steady state. Alternative forms of resilience or robustness, for instance absolute concentration robustness⁶¹ or regulatory feedback mechanisms, are not addressed. Moreover, due to its focus on the reaction level, the PoF disregards some genetic information (e.g. duplicated genes coding for the same enzyme). However, we have demonstrated previously that PoF analyses at the level of genes and at the level of reactions yield qualitatively similar results¹³. Incidentally, recent advances in software for the enumeration of genetic MCSs^62,63 allow for extending the scope of our method to the gene level. Additionally, we have shown here that the PoF is independent of the network’s size and mostly determined by the number of essential reactions (or genes, for that matter). Thus, given knowledge of the essential genes⁶⁴ (and potentially the synthetic lethal pairs, which have been determined for some model organisms like yeast⁶⁵), Eqs. (4) and (5) enable us to evaluate the PoF on the genome level without the need for a metabolic reconstruction altogether. When relying only on the essential genes for PoF estimation, the theory simplifies greatly. Therefore, in future work the theoretical framework presented here could be expanded such that ways for incorporating gene lengths in the analysis could easily be devised, which would further improve the accuracy and reliability of our approach for quantifying genetic robustness.

Methods

Model acquisition and preprocessing

Fifty-four GSMMs of E. coli and Shigella strains³⁵ were downloaded from the BIGG Models database, version 1.6^66,67. Additionally, 408 Salmonella GSMMs³⁶ (accession ID: MODEL1807280001) and 56 GSMMs of fungal species³⁷ (accession IDs: MODEL16042800{00-55}) were retrieved from BioModels⁶⁸. The minimal media can be found in Supplementary Table 3. Native model sizes were ~2600 reactions for the bacterial and ~6700 reactions for the fungal species. To remove any artefacts that might have emerged in the automated model-reconstruction process, all models were made consistent with respect to growth on glucose. This means that every reaction unable to carry any flux when growing on the minimal medium (as determined by flux variability analysis⁶⁹) was removed from the respective model. This reduced model sizes to ~1650 reactions for E. coli/ Shigella, ~1550 reactions for Salmonella, and ~1750 reactions for the fungi. Note that in terms of the PoF analysis, “consistent” and “native” models are equivalent as F is independent of the number of reactions and the set of MCSs is the same for both. However, the models were made consistent nonetheless as it is commonly considered good practice and also reduces the computational load on MCS enumeration. Additionally, the E. coli and Shigella batch was also made consistent with regard to optimal growth on glucose, removing every reaction not participating in flux patterns providing the highest possible growth rate. Thereby, model sizes were reduced to ~500 reactions. To avoid introducing any bias in the process of making the models consistent, glucose levels in the media were adjusted on a per-model basis beforehand so that all models would achieve the same growth rate. Moreover, for reasons stated below, the default upper and lower bounds on reaction fluxes were relaxed to ±10,000. The 54 E. coli and Shigella models contained ten auxotrophs, one of which was capable of growing on its supplementary medium component as sole carbon source. It was therefore removed from the dataset. Salmonella featured 27 auxotrophs, two of which were able to grow on their supplements alone and were subsequently excluded from the analysis. The supplements required by the other auxotrophs were added to their respective media and can be found in Supplementary Tables 4 and 5. Out of the 56 fungal models, 26 were unable to grow in the specified conditions and thus dropped. The remaining 30 species (Supplementary Table 6) underwent additional preprocessing steps in order to harmonise reaction bounds prior to making the models consistent. All preprocessing was done in COBRApy v.0.17.0⁷⁰.

Computation of MCSs and PoF

To enumerate the models’ MCSs, the relevant information required was extracted from the respective SBML⁷¹ or JSON files and reformatted accordingly. The data was then subjected to

Linear compression with an in-house Perl script. The target (i.e. biomass generation) reaction was exempt from compression.
Transforming the MILP problem inferred from the compressed model into its dual form, again by an in-house Perl script.
Solving the dual problem for MCSs with cardinalities up to $d_m$ with an in-house implementation of the algorithm described in ref. ¹⁵.

The compressed MCSs were then used for PoF evaluation. For all models, the LOF mutation rate p was set to 10⁻⁴ per reaction.

Single carbon source comparison for E. coli

Every carbon source available in iJO1366 was added separately to the minimal medium (Supplementary Table 3) except for glucose and the corresponding PoF with $d_m =3$ and d₀ = 10 was determined. To reduce the computational burden in the MCS-enumeration step, the respective model was made consistent for every substrate. Again, in order to prevent introducing bias, the carbon source availability in the medium was tuned to allow for the same growth rate as glucose. This was infeasible for some carbon sources with very low yields due to internal model constraints (i.e. flux bounds). Therefore, these constraints were relaxed to ±10,000 in all models.

Benchmarking PoF-implementations

For E. coli’s CCMM¹⁷ and GSMM¹⁸, low-cardinality MCSs were enumerated up to $d_m =15$ and $d_m=3$, respectively. Then, for a range of m ≤ ${m_{d_{m}}}$, the first m compressed MCSs were extracted and evaluated for all d₀ ≤ 15 (CCMM) and d₀ ≤ 8 (GSMM). Additionally, the MCSs were decompressed and evaluated again in the uncompressed form with both pof2.0 and the old implementation available at https://github.com/mpgerstl/networkRobustnessToolbox. Computation was performed on an Intel^® Xeon^® CPU E5-2650 v3 @ 2.30 GHz, and run-times were recorded using Bash’s⁷² built-in time command.

Computation of the PoF

pof2.0 was written in C++11⁷³. The implementation relies on Boost⁷⁴ for calculating binomial coefficients and uses Luigi Pertoldi’s progress bar⁷⁵. The code is available at www.github.com/julibeg/PoF.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All models used for analysis are publicly available^35,36,37. E. coli and Shigella GSMMs can be obtained from the BIGG database^66,67 (http://bigg.ucsd.edu) using the model IDs in Supplementary Data File 1. Salmonella GSMMs are available on BioModels⁷⁶ (https://www.ebi.ac.uk/biomodels) with the accession ID MODEL1807280001. Their individual IDs are listed in Supplementary Data File 2. Fungal models are available on BioModels as well. The respective accession IDs can be found in Supplementary Table 6.

Code availability

Source files for both, old and new, implementations of the PoF calculator are available at https://github.com/mpgerstl/networkRobustnessToolbox and www.github.com/julibeg/PoF, respectively.

References

Kitano, H. Biological robustness. Nat. Rev. Genet. 5, 826–837 (2004).
Article CAS PubMed Google Scholar
Kitano, H. Towards a theory of biological robustness. Mol. Syst. Biol. 3, 137 (2007).
Article PubMed PubMed Central Google Scholar
Larhlimi, A., Blachon, S., Selbig, J. & Nikoloski, Z. Robustness of metabolic networks: a review of existing definitions. Biosystems 106, 1–8 (2011).
Article PubMed Google Scholar
Stelling, J., Sauer, U., Szallasi, Z., DoyleIII, F. J. & Doyle, J. Robustness of cellular functions. Cell 118, 675–685 (2004).
Article CAS PubMed Google Scholar
Rausand, M. Reliability of Safety-Critical Systems: Theory and Applications (Wiley Blackwell, 2014).
Schuster, S., Fell, D. A. & Dandekar, T. A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks. Nat. Biotech. 18, 326–332 (2000).
Article CAS Google Scholar
Behre, J., Wilhelm, T., vonKamp, A., Ruppin, E. & Schuster, S. Structural robustness of metabolic networks with respect to multiple knockouts. J. Theor. Biol. 252, 433–441 (2008).
Article CAS PubMed Google Scholar
Min, Y. et al. Pathway knockout and redundancy in metabolic networks. J. Theor. Biol. 270, 63–69 (2011).
Article CAS PubMed Google Scholar
Wilhelm, T., Behre, J. & Schuster, S. Analysis of structural robustness of metabolic networks. Syst. Biol., IEE Proc. 1, 114–120 (2004).
Article CAS Google Scholar
Zanghellini, J., Ruckerbauer, D. E., Hanscho, M. & Jungreuthmayer, C. Elementary flux modes in a nutshell: properties, calculation and applications. Biotechnol. J. 8, 1009–1016 (2013).
Article CAS PubMed Google Scholar
Ballerstein, K., Kamp, A. V., Klamt, S. & Haus, U.-U. Minimal cut sets in a metabolic network are elementary modes in a dual network. Bioinformatics 28, 381–387 (2012).
Article CAS PubMed Google Scholar
Klamt, S. & Gilles, E. D. Minimal cut sets in biochemical reaction networks. Bioinformatics 20, 226–234 (2004).
Article CAS PubMed Google Scholar
Gerstl, M. P., Klamt, S., Jungreuthmayer, C. & Zanghellini, J. Exact quantification of cellular robustness in genome-scale metabolic networks. Bioinformatics 32, 730–737 (2016).
Article CAS PubMed Google Scholar
De Figueiredo, L. F. et al. Computing the shortest elementary flux modes in genome-scale metabolic networks. Bioinformatics 25, 3158–3165 (2009).
Article PubMed Google Scholar
vonKamp, A. & Klamt, S. Enumeration of smallest intervention strategies in genome-scale metabolic networks. PLoS Comput. Biol. 10, e1003378 (2014).
Article Google Scholar
Gagneur, J. & Klamt, S. Computation of elementary modes: a unifying framework and the new binary approach. BMC Bioinform. 5, 175 (2004).
Article Google Scholar
Orth, J. D., Fleming, R. M. T. & Palsson, B.O. Reconstruction and use of microbial metabolic networks: the core Escherichia coli metabolic model as an educational guide. EcoSal Plus 4, 1–47 (2010).
Article Google Scholar
Orth, J. D. et al. A comprehensive genome-scale reconstruction of Escherichia coli metabolism—2011. Mol. Syst. Biol. 7, 535 (2011).
Article PubMed PubMed Central Google Scholar
Félix, M.-A. & Barkoulas, M. Pervasive robustness in biological systems. Nat. Rev. Genet. 16, 483–496 (2015).
Article PubMed Google Scholar
Ho, W.-C. & Zhang, J. Adaptive genetic robustness of Escherichia coli metabolic fluxes. Mol. Biol. Evol. 33, 1164–1176 (2016).
Article CAS PubMed PubMed Central Google Scholar
Visser, J. A. G. M. D. et al. Perspective: Evolution and detection of genetic robustness. Evolution 57, 1959–1972 (2003).
PubMed Google Scholar
Rendel, J. M. Canalization of the scute phenotype of drosophila. Evolution 13, 425–439 (1959).
Article Google Scholar
Sanjuán, R., Cuevas, J. M., Furió, V., Holmes, E. C. & Moya, A. Selection for robustness in mutagenized RNA viruses. PLoS Genet. 3, e93 (2007).
Article PubMed PubMed Central Google Scholar
Schmalhausen, I. I. Factors of Evolution: The Theory of Stabilizing Selection (Blakiston, Oxford, 1949).
Waddington, C. H. Canalization of development and the inheritance of acquired characters. Nature 150, 563–565 (1942).
Article Google Scholar
Waddington, C. H. The Strategy of the Genes: A Discussion of Some Aspects of Theoretical Biology (Allen & Unwin, London, 1957).
Wilke, C. Adaptive evolution on neutral networks. Bull. Math. Biol. 63, 715–730 (2001).
Article CAS PubMed Google Scholar
Wilke, C. O. & Adami, C. Evolution of mutational robustness. Mutat. Res. 522, 3–11 (2003).
Article CAS PubMed Google Scholar
Ancel, L. W. & Fontana, W. Plasticity, evolvability, and modularity in RNA. J. Exp. Zool. 288, 242–283 (2000).
Article CAS PubMed Google Scholar
Meiklejohn, C. D. & Hartl, D. L. A single mode of canalization. Trends Ecol. Evol. 17, 468–473 (2002).
Article Google Scholar
Wagner, G. P., Booth, G. & Bagheri-Chaichian, H. A population genetic theory of canalization. Evolution 51, 329–347 (1997).
Article PubMed Google Scholar
Bergman, A. & Siegal, M. L. Evolutionary capacitance as a general feature of complex gene networks. Nature 424, 549–552 (2003).
Article CAS PubMed Google Scholar
Kitami, T. & Nadeau, J. H. Biochemical networking contributes more to genetic buffering in human and mouse metabolic pathways than does gene duplication. Nat. Genet. 32, 191–194 (2002).
Article CAS PubMed Google Scholar
Siegal, M. L. & Bergman, A. Waddington’s canalization revisited: developmental stability and evolution. Proc. Natl Acad. Sci. 99, 10528–10532 (2002).
Article CAS PubMed PubMed Central Google Scholar
Monk, J. M. et al. Genome-scale metabolic reconstructions of multiple Escherichia coli strains highlight strain-specific adaptations to nutritional environments. Proc. Natl Acad. Sci. USA 110, 20338–20343 (2013).
Article CAS PubMed PubMed Central Google Scholar
Seif, Y. et al. Genome-scale metabolic reconstructions of multiple salmonella strains reveal serovar-specific metabolic traits. Nat. Commun. 9, 1–12 (2018).
Article CAS Google Scholar
Castillo, S. et al. Whole-genome metabolic model of Trichoderma reesei built by comparative reconstruction. Biotechnol. Biofuels 9, 252 (2016).
Article PubMed PubMed Central Google Scholar
Kanehisa, M. & KEGG, G. S. Kyoto encyclopedia of genes and genomes. Nucleic Acid Res. 28, 27–30 (2000).
Article CAS PubMed PubMed Central Google Scholar
Schäfer, L., García, S. & Srithammavanh, V. Simplification of inclusion-exclusion on intersections of unions with application to network systems reliability. Reliab. Eng. Syst. Saf. 173, 23–33 (2018).
Article Google Scholar
Hanscho, M. et al. Nutritional requirements of the BY series of Saccharomyces cerevisiae strains for optimum growth. FEMS Yeast Res. 12, 796–808 (2012).
Article CAS PubMed Google Scholar
Winzeler, E. A. et al. Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. Science 285, 901–906 (1999).
Article CAS PubMed Google Scholar
Ochman, H. & Wilson, A. C. Evolution in bacteria: evidence for a universal substitution rate in cellular genomes. J. Mol. Evol. 26, 74–86 (1987).
Article CAS PubMed Google Scholar
McClelland, M. et al. Complete genome sequence of Salmonella enterica serovar Typhimurium LT2. Nature 413, 852–856 (2001).
Article CAS PubMed Google Scholar
Meysman, P., Sánchez-Rodríguez, A., Fu, Q., Marchal, K. & Engelen, K. Expression divergence between Escherichia coli and Salmonella enterica serovar Typhimurium reflects their lifestyles. Mol. Biol. Evol. 30, 1302–1314 (2013).
Article CAS PubMed PubMed Central Google Scholar
Barquist, L. et al. A comparison of dense transposon insertion libraries in the Salmonella serovars Typhi and Typhimurium. Nucleic Acids Res. 41, 4549–4564 (2013).
Article CAS PubMed PubMed Central Google Scholar
Knuth, K., Niesalla, H., Hueck, C. J. & Fuchs, T. M. Large-scale identification of essential Salmonella genes by trapping lethal insertions. Mol. Microbiol. 51, 1729–1744 (2004).
Article CAS PubMed Google Scholar
Kong, X. et al. ePath: an online database towards comprehensive essential gene annotation for prokaryotes. Sci. Rep. 9, 1–11 (2019).
Article Google Scholar
Bilgin, T. & Wagner, A. Design constraints on a synthetic metabolism. PLoS ONE 7, e39903 (2012).
Article CAS PubMed PubMed Central Google Scholar
Csete, M. & Doyle, J. Bow ties, metabolism and disease. Trends Biotechnol. 22, 446–450 (2004).
Article CAS PubMed Google Scholar
Friedlander, T., Mayo, A. E., Tlusty, T. & Alon, U. Evolution of bow-tie architectures in biology. Comput. Biol. 11, e1000202 (2015).
Google Scholar
Segrè, D., Vitkup, D. & Church, G. M. Analysis of optimality in natural and perturbed metabolic networks. Proc. Natl Acad. Sci. USA 99, 15112–15117 (2002).
Article PubMed PubMed Central Google Scholar
Lewis, N. E., Nagarajan, H. & Palsson, B. O. Constraining the metabolic genotype-phenotype relationship using a phylogeny of in silico methods. Nat. Rev. Microbiol. 10, 291–305 (2012).
Article CAS PubMed PubMed Central Google Scholar
Klamt, S., Müller, S., Regensburger, G. & Zanghellini, J. A mathematical framework for yield (vs. rate) optimization in constraint-based modeling and applications in metabolic engineering. Metab. Eng. 47, 153–169 (2018).
Article CAS PubMed PubMed Central Google Scholar
Klamt, S. et al. From elementary flux modes to elementary flux vectors: metabolic pathway analysis with arbitrary linear flux constraints. PLoS Comput. Biol. 13, e1005409 (2017).
Article PubMed PubMed Central Google Scholar
Schuster, S. & Hilgetag, C. On elementary flux modes in biochemical reaction systems at steady state. J. Biol. Syst. 2, 165–182 (1994).
Article Google Scholar
Kibota, T. T. & Lynch, M. Estimate of the genomic mutation rate deleterious to overall fitness in e. coli. Nature 381, 694–696 (1996).
Article CAS PubMed Google Scholar
Barrick, J. E. et al. Genome evolution and adaptation in a long-term experiment with escherichia coli. Nature 461, 1243–1247 (2009).
Article CAS PubMed Google Scholar
Drake, J. W. A constant rate of spontaneous mutation in dna-based microbes. Proc. Natl Acad. Sci. USA 88, 7160–7164 (1991).
Article CAS PubMed PubMed Central Google Scholar
Wielgoss, S. et al. Mutation rate inferred from synonymous substitutions in a long-term evolution experiment with escherichia coli. G3 1, 183–186 (2011).
Article CAS PubMed PubMed Central Google Scholar
Williams, A. B. Spontaneous mutation rates come into focus in Escherichia coli. DNA Repair 24, 73–79 (2014).
Article CAS PubMed Google Scholar
Shinar, G. & Feinberg, M. Structural sources of robustness in biochemical reaction networks. Science 327, 1389–1391 (2010).
Article CAS PubMed Google Scholar
Apaolaza, I. N., Valcarcel, L. V. & Planes, F. J. gMCS: fast computation of genetic minimal cut sets in large networks. Bioinformatics 35, 535–537 (2019).
Article CAS PubMed Google Scholar
Schneider, P., Kamp, A. V. & Klamt, S. An extended and generalized framework for the calculation of metabolic intervention strategies based on minimal cut sets. PLoS Comput. Biol. 16, e1008110 (2020).
Article CAS PubMed PubMed Central Google Scholar
Luo, H., Lin, Y., Gao, F., Zhang, C.-T. & Zhang, R. DEG 10, an update of the database of essential genes that includes both protein-coding genes and noncoding genomic elements. Nucleic Acids Res. 42, D574–D580 (2014).
Article CAS PubMed Google Scholar
Costanzo, M. et al. The genetic landscape of a cell. Science 327, 425–431 (2010).
Article CAS PubMed PubMed Central Google Scholar
King, Z. A. et al. BiGG models: a platform for integrating, standardizing and sharing genome-scale models. Nucleic Acids Res. 44, D515–D522 (2016).
Article CAS PubMed Google Scholar
Norsigian, C. J. et al. BiGG Models 2020: multi-strain genome-scale models and expansion across the phylogenetic tree. Nucleic Acids Res. 48, D402–D406 (2020).
CAS PubMed Google Scholar
Chelliah, V. et al. BioModels: ten-year anniversary. Nucleic Acids Res. 43, D542–D548 (2015).
Article CAS PubMed Google Scholar
Mahadevan, R. & Schilling, C. The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab. Eng. 5, 264–276 (2003).
Article CAS PubMed Google Scholar
Ebrahim, A., Lerman, J. A., Palsson, B. Ø. & Hyduke, D. R. COBRApy: COnstraints-Based Reconstruction and Analysis for Python. BMC Syst. Biol. 7, 74 (2013).
Article PubMed PubMed Central Google Scholar
Olivier, B. G. & Bergmann, F. T. SBML Level 3 Package: Flux Balance Constraints version 2. J. Integr. Bioinform. 15, 1–38 (2018).
Article Google Scholar
Free Software Foundation. GNU bash (4.2.26)[unix shell program]. https://www.gnu.org/software/bash/ (2011).
ISO. ISO IEC 14882:2011 Information Technology—Programming Languages—C++, 3rd edn (ISO, 2011).
Schäling, B. The Boost C++ Libraries (XML Press, 2011).
Pertoldi, L. progressbar. https://github.com/gipert/progressbar (2019).
Malik-Sheriff, R.S. et al. BioModels—15 years of sharing computational models in life science. Nucleic Acids Res. 48, D407–D415 (2019).
PubMed Central Google Scholar

Download references

Acknowledgements

This work has been supported by acib. The COMET centre: acib—Next Generation Bioproduction is funded by BMK, BMDW, SFG, Standortagentur Tirol, Government of Lower Austria and Vienna Business Agency in the framework of COMET—Competence Centres for Excellent Technologies. The COMET-Funding Program is managed by the Austrian Research Promotion Agency FFG.

Author information

Julian Libiseller-Egger
Present address: Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, London, UK

Authors and Affiliations

Austrian Centre of Industrial Biotechnology, 1190, Vienna, Austria
Julian Libiseller-Egger, Benjamin Luke Coltman, Matthias P. Gerstl & Jürgen Zanghellini
University of Natural Resources and Life Sciences, 1190, Vienna, Austria
Julian Libiseller-Egger
Department of Biotechnology, University of Natural Resources and Life Sciences, 1190, Vienna, Austria
Benjamin Luke Coltman
Department of Analytical Chemistry, University of Vienna, 1090, Vienna, Austria
Jürgen Zanghellini

Authors

Julian Libiseller-Egger
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Luke Coltman
View author publications
You can also search for this author in PubMed Google Scholar
Matthias P. Gerstl
View author publications
You can also search for this author in PubMed Google Scholar
Jürgen Zanghellini
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualisation: J.L.-E., J.Z. and M.P.G. Methodology and formal analysis: J.L.-E. and J.Z. Data acquisition and curation: B.L.C. and J.L.-E. Software implementation: J.L.-E. Visualisation: J.L.-E. and J.Z. Writing—original draft: J.L.-E. and J.Z. Writing—review and editing: all. Funding acquisition: J.Z.

Corresponding author

Correspondence to Jürgen Zanghellini.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary material

Reporting Summary

Supplementary Data File 1

Supplementary Data File 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Libiseller-Egger, J., Coltman, B.L., Gerstl, M.P. et al. Environmental flexibility does not explain metabolic robustness. npj Syst Biol Appl 6, 39 (2020). https://doi.org/10.1038/s41540-020-00155-5

Download citation

Received: 27 February 2020
Accepted: 07 October 2020
Published: 27 November 2020
DOI: https://doi.org/10.1038/s41540-020-00155-5