The principles governing cellular metabolic operation are poorly understood. Because diverse organisms show similar metabolic flux patterns, we hypothesized that a fundamental thermodynamic constraint might shape cellular metabolism. Here, we develop a constraint-based model for Saccharomyces cerevisiae with a comprehensive description of biochemical thermodynamics including a Gibbs energy balance. Non-linear regression analyses of quantitative metabolome and physiology data reveal the existence of an upper rate limit for cellular Gibbs energy dissipation. By applying this limit in flux balance analyses with growth maximization as the objective function, our model correctly predicts the physiology and intracellular metabolic fluxes for different glucose uptake rates as well as the maximal growth rate. We find that cells arrange their intracellular metabolic fluxes in such a way that, with increasing glucose uptake rates, they can accomplish optimal growth rates but stay below the critical rate limit on Gibbs energy dissipation. Once all possibilities for intracellular flux redistribution are exhausted, cells reach their maximal growth rate. This principle also holds for Escherichia coli and different carbon sources. Our work proposes that metabolic reaction stoichiometry, a limit on the cellular Gibbs energy dissipation rate, and the objective of growth maximization shape metabolism across organisms and conditions.
Key questions in metabolic research are how and why cells organize their metabolism, or their fluxes through the metabolic network, in a particular manner. Such understanding is highly relevant from a fundamental point of view, but also should enable computational methods for metabolic-flux prediction, which are important in biomedicine and biotechnology.
The archetypal question in this context is why many prokaryotic and eukaryotic cells often use an inefficient fermentative metabolism, even under aerobic conditions. Many explanations have been offered for this, including the economics of enzyme production1,2, a ‘make-accumulate-consume’ strategy3, intracellular crowding4, limited nutrient transport capacity5, and adjustments to growth-dependent requirements6,7. Recently, the integration of proteome allocation constraints in metabolic models has led to predictions in good agreement with experimental data8,9. However, respiration and aerobic fermentation occur in many organisms, including bacteria4, fungi3, mammals6,7 and plants10, with fermentation occurring at high glucose uptake rates (GURs) and respiration at low GURs7,11. This led us to wonder whether a fundamental thermodynamic principle governs metabolism, on top of which specific protein allocation constraints evolved. Specifically, we hypothesized that the rate at which cells, as open systems operating far from equilibrium12, can dissipate Gibbs energy to the extracellular environment13 may be limited and that such a limit may constrain metabolic fluxes.
Here, using a constraint-based thermodynamic model of S. cerevisiae and non-linear regression analysis of quantitative metabolome and physiology data, we identified an upper limit for the cellular Gibbs energy dissipation rate. When we used this rate limit in flux balance analyses (FBAs) with growth maximization as the objective function, we generated correct predictions of metabolic phenotypes under diverse conditions. As we found the same principle to hold in E. coli, our work suggests that growth maximization under the constraint of an upper rate limit on Gibbs energy dissipation must have been the general governing principle in shaping metabolism and its regulation. Furthermore, our work provides an important contribution to current predictive metabolic modelling for fundamental biology, biomedicine and biotechnology.
Development of a combined thermodynamic–stoichiometric model
To test our hypothesis that cellular metabolism is limited by a certain critical rate of Gibbs energy dissipation, we used the yeast S. cerevisiae as a model and aimed to estimate cellular Gibbs energy dissipation rates from experimental data using regression analysis (Fig. 1). Specifically, we formulated a combined thermodynamic and stoichiometric metabolic network model, describing cellular metabolic operation through the variables metabolic flux (reaction rate), v, and metabolite concentration, c. The basis of this model is a stoichiometric metabolic network model14 (Supplementary Methods 1.1 and Supplementary Note 1) that describes 241 metabolic processes (that is, chemical conversions and metabolite transport, MET) of primary metabolism and their mitochondrial or cytosolic localization with mass balances for 156 metabolites (Tables 1–5 in Supplementary Data 1) as well as with pH-dependent proton and charge balances (Tables 6 and 7 in Supplementary Data 1). The boundary of the system was defined around the extracellular space and the exchange of matter with the environment was accomplished through 15 exchange processes (EXG) (compare Fig. 1).
To this model, we added a Gibbs energy balance stating that the sum of the Gibbs energy dissipation rates of the individual metabolic processes (that is, the total cellular rate of Gibbs energy dissipation, gdiss) must equal the sum of the rates at which Gibbs energy is exchanged with the environment (Supplementary Methods 1.2). We defined the rate of Gibbs energy dissipation of a metabolic process as the product of the metabolic flux of the process and its Gibbs energy. The Gibbs energy of a metabolic process, in turn, was made a function of the substrate and product concentrations, the standard Gibbs energy of the reaction and/or the Gibbs energy of the metabolite’s transmembrane transport15. We transformed the standard Gibbs energies of the reaction to correspond to the respective compartmental pH values16 (Supplementary Methods 1.3). Finally, for each metabolic process, we added the second law of thermodynamics stating that the Gibbs energy dissipation rate must be negative for a metabolic process carrying flux (Supplementary Methods 1.4). All metabolic processes in the model were considered reversible.
Limit on the rate of cellular Gibbs energy dissipation
To determine gdiss values at different growth conditions, we analysed experimental data with regression analysis using the developed model (Supplementary Fig. 1 and Supplementary Methods 2.1). Specifically, we used physiological data (that is, growth rates, metabolite uptake and excretion rates) and metabolome data of S. cerevisiae obtained from eight different glucose-limited chemostat cultures17. In these cultures, metabolic operation ranged from respiration at low GURs to aerobic fermentation with ethanol production at high GURs. As Gibbs energies estimated with the component contribution method18 contained uncertainties, and Gibbs energies were not available for all metabolic reactions, we included the available standard Gibbs energies of reaction together with their respective uncertainties as experimental data in the regression.
To enforce one common set of standard Gibbs energies of reaction across all experimental conditions with the same thermodynamic reference state (that is, obeying the first law of thermodynamics, which we enforced by applying the loop law19,20), we performed one large regression across all conditions. In this large-scale multi-step non-linear regression, we estimated for each condition its condition-dependent variables (that is, fluxes and metabolite concentrations), and for all conditions together, a set of condition-independent standard Gibbs energies of reaction with minimal distance to the experimental data.
To prevent overfitting, we employed a parametric bootstrap approach (Supplementary Fig. 2a). The regression and a subsequent variability analysis of the solution space provided us with physiological ranges for the intracellular metabolite concentration and for the Gibbs energies of reaction (that is, the lowest and highest possible values across all experimental conditions, reflecting the physiological bounds of metabolic operation), which we used to refine the scope of the model (Supplementary Methods 2.2 and Tables 8 and 9 in Supplementary Data 1).
First, we found that the model with its thermodynamic and stoichiometric constraints could be well fitted to all data sets (Supplementary Fig. 2b–d), demonstrating that the developed model can describe the broad range of underlying metabolic operations, ranging from fully respiratory to fermentative conditions. Second, by examining the gdiss values determined for the different experimental conditions, we found that gdiss first linearly increased with increasing growth rate µ, and then plateaued at µ values > 0.3 h−1 (Fig. 2). The existence of a plateau above a certain µ suggested, in line with our hypothesis, that there could be an upper rate limit, gdisslim, at which cells can dissipate Gibbs energy, here corresponding to −3.7 kJ gram cell dry weight (gCDW)−1 h−1. Because the growth rate at which this limit is reached coincided with the onset of ethanol excretion, we speculated that this limit might cause the switch to fermentation at high GURs.
Accurate predictions of metabolic phenotypes
To test whether such an upper limit on the Gibbs energy dissipation rate might govern metabolic operation, that is, be responsible for the different flux distributions at different GURs, we used FBA, which computes metabolic flux distributions on the basis of a stoichiometric metabolic network model and mathematical optimization using an evolutionary optimization criterion14. Specifically, we used the objective of growth maximization (that is, identifying the flux distribution that generates the maximal amount of biomass from the available nutrients) to simulate the combined thermodynamic and stoichiometric model, which we additionally constrained by the hypothesized gdisslim (Supplementary Methods 2.2). To solve this nonconvex bilinear optimization problem, we transferred it into a mixed-integer non-linear program, which we then solved using a branch-and-cut global optimization algorithm21 (Supplementary Methods 1.5, 1.6 and 2.3).
Previously, it was shown that the objective of growth maximization alone could not predict flux distributions across experimental conditions22. But by using it in combination with the identified upper limit on gdiss, we correctly predicted physiologies as observed in glucose-limited chemostat cultures and in glucose batch cultures, solely using the respective glucose uptake rates as input. For instance, we correctly predicted growth rates (Fig. 3a), a respiratory metabolism at low GURs (<3 mmol gCDW−1 h−1, Fig. 3b–d) and aerobic fermentation with lowered oxygen uptake rates at GURs > 3 mmol gCDW−1 h−1 (Fig. 3b,c). At a GUR of 22 mmol gCDW−1 h−1, we predicted a maximal growth rate followed by a decrease in the growth rate and glycerol production at greater GURs, while still maximizing the growth rate in the optimization. The fact that we could not find any experimental values with GURs > 22 mmol gCDW−1 h−1 suggests that cells restrict their glucose uptake rate to retain the maximal possible growth rate.
FBA simulations without a limit on gdiss predicted a respiratory metabolism for all GURs and no maximal growth rate (compare dotted lines in Fig. 3a–d). FBA simulations with other frequently used objectives (‘minimal sum of absolute fluxes,’ ‘maximal ATP yield,’ ‘maximal ATP yield per flux sum’ and ‘maximal biomass per flux sum’) and the gdisslim constraint did not correctly predict the physiologies (compare dashed lines in Fig. 3a–d and Supplementary Fig. 6). Together with exhaustive sensitivity analyses of various model parameters, for example lower and upper bounds of the intracellular metabolite concentrations, and Gibbs energies of reaction (Supplementary Figs. 3–5), this shows that the predictions obtained with growth maximization as objective and the constrained cellular Gibbs energy dissipation rate are not a trivial result of the earlier regression, nor are they enforced by isolated elements of our model.
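The qualitative effect of the dissipation limit can be illustrated with a two-pathway toy problem. The sketch below is not the model used in this work: it assumes fixed dissipation weights per pathway (which turns the bilinear problem into a small linear program), and the function name `max_growth`, the yields and the per-pathway dissipation values are hypothetical. Only the magnitude of the limit (3.7 kJ gCDW−1 h−1) is taken from the text.

```python
def max_growth(gur, g_lim=3.7, yields=(0.09, 0.018), diss=(1.2, 0.15)):
    """Toy growth maximisation under a dissipation limit.

    Maximise   mu = y_r*v_r + y_f*v_f
    subject to v_r + v_f <= gur              (glucose uptake, mmol/gCDW/h)
               d_r*v_r + d_f*v_f <= g_lim    (dissipation magnitude, kJ/gCDW/h)
               v_r, v_f >= 0
    v_r: respiratory flux, v_f: fermentative flux.
    All yields and dissipation weights are hypothetical illustration values.
    The two-variable linear program is solved by enumerating its vertices.
    """
    y_r, y_f = yields
    d_r, d_f = diss
    vertices = [
        (0.0, 0.0),
        (min(gur, g_lim / d_r), 0.0),   # respiration only
        (0.0, min(gur, g_lim / d_f)),   # fermentation only
    ]
    # Vertex where both constraints are active (mixed metabolism).
    v_r = (g_lim - d_f * gur) / (d_r - d_f)
    v_f = gur - v_r
    if v_r >= 0.0 and v_f >= 0.0:
        vertices.append((v_r, v_f))
    v_r, v_f = max(vertices, key=lambda v: y_r * v[0] + y_f * v[1])
    return y_r * v_r + y_f * v_f, v_r, v_f


if __name__ == "__main__":
    for gur in (1.0, 3.0, 5.0, 10.0, 20.0, 30.0):
        mu, v_r, v_f = max_growth(gur)
        print(f"GUR={gur:5.1f}  mu={mu:.3f}  respiration={v_r:.2f}  fermentation={v_f:.2f}")
```

In this toy setting, respiration gives more growth per glucose but fermentation gives more growth per unit of dissipated Gibbs energy, so the optimum shifts from respiration to a mixture and finally to a growth plateau as the uptake rate rises. Passing `g_lim=float("inf")` removes the limit and yields a purely respiratory solution at every uptake rate.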
To further examine the predictions obtained with the model constrained by the rate limit on Gibbs energy dissipation, we compared intracellular flux predictions with results from 13C-based metabolic flux analysis (13C-MFA). We found that our predictions were in agreement with fluxes determined with 13C-MFA, as was evident from metabolic reactions located at key branch points in central metabolism (Fig. 4a–d and Supplementary Fig. 7). We found the flux reorganization patterns we expected; for instance, redirection of flux from the pentose-phosphate pathway to glycolysis with increasing GUR (Fig. 4a,b).
The fact that we could correctly predict extracellular physiologies including the maximal growth rate, as well as the experimentally observed reorganization pattern of intracellular metabolic fluxes with increasing GURs, suggests that the objective of growth maximization under the constraint of an upper limit on the Gibbs energy dissipation rate could have governed the evolution of metabolism and its regulation, at least in yeast.
Identified principle also applies to Escherichia coli
Because we conjectured that the two elements of this principle, growth maximization and the upper limit on the Gibbs energy dissipation rate, might be universal, we next investigated whether this principle also applies to prokaryotes, using E. coli as model. By following the same workflow as we outlined for S. cerevisiae, we formulated a combined thermodynamic and stoichiometric metabolic model, this time at genome scale, encompassing 626 unique metabolites involved in 1,062 metabolic processes23 (Supplementary Methods 1.1–1.5, Supplementary Note 2 and Supplementary Data 2). Using this model and non-linear regression (Supplementary Methods 3.1 and 3.2) with data from glucose-limited chemostat cultures24, we found, similarly to the yeast results, that gdiss first linearly increased with increasing GURs and then reached a plateau (at −4.9 kJ gCDW−1 h−1) at conditions in which acetate is excreted (Supplementary Figs. 9 and 10). When we performed FBA simulations with growth maximization as the objective and the identified gdisslim as constraint (Supplementary Methods 3.3 and 3.4), we again correctly predicted the shift from respiration to fermentation with increasing GURs, as well as the maximal growth rate (Fig. 5a). Notably, this flux reorganization pattern was reflected in measured changes in protein abundance (Supplementary Fig. 11).
Next, we used this model to perform FBA simulations with different nutrients in which we allowed for unlimited substrate uptake. Specifically, we simulated growth in unlimited batch cultures on eight different carbon sources (acetate, fructose, galactose, gluconate, glucose, glycerol, pyruvate and succinate), on simultaneously present glucose and succinate and on either glucose or glycerol supplemented with all proteinogenic amino acids; notably, none of these conditions were used in the regression. Here, we found that our model could predict maximal growth rates, as well as uptake and excretion rates (Fig. 5b and Supplementary Fig. 12). Notably, this was true even for cases in which we simulated complex media with the possibility of unlimited uptake of all proteinogenic amino acids. The same model, not constrained by the upper rate limit on Gibbs energy dissipation, did not predict maximal growth rates (as maximization of growth would lead to infinite substrate uptake and thus to infinite growth), and did not predict the fermentative phenotypes (Supplementary Fig. 13). A comparison of the FBA-predicted intracellular fluxes with 13C-MFA-inferred flux distributions also showed good agreement (Supplementary Fig. 14).
As our model connects fluxes and metabolite levels through Gibbs energies of reaction and the second law of thermodynamics, we next asked whether metabolic rearrangements, which are necessary with increasing GURs, would require metabolite levels to follow certain trends. Indeed, for 36 metabolites we found a correlation (Spearman correlation coefficient > 0.6) between their concentrations and GUR. Of these 36 metabolites, experimental data as a function of GUR were available for coenzyme A, ribose 5-phosphate and α-ketoglutarate. The profiles of these metabolites matched well with the predicted profiles (Fig. 5c). Notably, α-ketoglutarate is an important metabolic regulatory molecule25. Our analysis suggests that the concentration of this metabolite is constrained in a GUR-dependent manner by thermodynamics, thus making it an ideal candidate as a regulatory metabolite.
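The correlation criterion used above can be sketched in a few lines. The example below computes a Spearman rank correlation as the Pearson correlation of average ranks; the function names and the concentration values are hypothetical and serve only to illustrate the 0.6 threshold mentioned in the text.

```python
def ranks(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical glucose uptake rates and predicted concentrations of one metabolite.
gur = [1.0, 3.0, 6.0, 10.0, 15.0, 20.0]
conc = [0.20, 0.25, 0.40, 0.38, 0.60, 0.90]

rho = spearman(gur, conc)
print(f"Spearman rho = {rho:.3f}; exceeds 0.6: {rho > 0.6}")
```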
The agreement of these E. coli predictions with respective experimental data, extending even to the predictions of some metabolite concentrations, suggests that growth maximization under the constraint of a limited cellular Gibbs energy dissipation rate as a metabolism-governing principle also applies to E. coli and carbon sources other than glucose, including complex media. This provides evidence that this principle universally shaped cellular metabolism across organisms. Furthermore, as the E. coli model used is a genome-scale model, this shows that the concept can also be implemented and applied on the genome scale.
Maximal growth under the rate limit on Gibbs energy dissipation
Finally, we aimed to understand how gdisslim governs metabolism. To this end, we revisited the yeast FBA simulations and determined from them the Gibbs energy dissipation rate of each metabolic process, g, at different GURs. From these process- and GUR-specific dissipation rates, we identified seven clusters of metabolic processes that showed similar Gibbs energy dissipation trends with increasing GURs (Fig. 6a and Supplementary Fig. 15). We found that, below GURs of 3 mmol gCDW−1 h−1, processes related to respiration (respiration and energy metabolism clusters in Fig. 6a) contributed 45% to the total cellular Gibbs energy dissipation rate, which, in absolute terms, is still low at this point. Once gdisslim was reached and GUR further increased, cells redirected metabolic fluxes from dissipation-intense pathways to less dissipation-intense pathways, that is, to fermentative processes (pyruvate decarboxylase and pyruvate kinase clusters in Fig. 6a), which produced ~40% of the gdiss at GURs >20 mmol gCDW−1 h−1.
Such flux redirection occurred not only between respiration and fermentation, but also between other processes as indicated by the changes in the directionality patterns (Supplementary Fig. 17). Thus, the flux redirection, which occurs at increasing GURs, allows cells to achieve higher growth rates while staying below gdisslim. Such flux redirection leads to the usage of pathways with lower carbon efficiencies and thus lower biomass yields (Fig. 6b). Once all possibilities for flux redirection are exhausted, upon a further enforced increase in the nutrient uptake, cells need to reduce their growth rate and to excrete other by-products (for instance, glycerol) to stay below the Gibbs energy dissipation rate limit. This defines the maximal growth rate (compare Fig. 2).
Our findings answer central questions in metabolic research, for example what shapes metabolic fluxes, what limits growth rate, and what causes cells to change the way they operate their metabolism, as exemplified by the paradigm switch from respiration to aerobic fermentation. Although we cannot exclude the possibility of a third correlated factor explaining our results, our work proposes growth maximization under the constraint of an upper limit on the cellular Gibbs energy dissipation rate as the basic principle underlying metabolism; this also offers an explanation for the empirical description of Pareto optimality in metabolism26 (Supplementary Fig. 18). The limit on cellular Gibbs energy dissipation rate leads to a redirection of metabolic fluxes (for instance, from respiration to fermentation) as substrate uptake rates increase and cells try to maximize growth.
Although the second law of thermodynamics was traditionally formulated for isolated systems close to equilibrium12, here we applied it to cells—open systems out of equilibrium—similarly to how the law has been applied to cellular metabolism13,19,27,28,29,30,31,32. Following Erwin Schrödinger’s notion that “the essential thing in metabolism is that the organism succeeds in freeing itself from all the entropy it cannot help producing while alive”33, our work suggests that there is an upper rate limit at which cells can do so.
The identified upper rate limit on cellular Gibbs energy dissipation suggests that higher rates of Gibbs energy dissipation cannot be sustained, because this presumably has detrimental consequences for the functioning of cells. What could such consequences be? If the dissipated Gibbs energy is dissipated as heat, then the identified limit could be understood as a limit on heat transfer. Although it was suggested that mitochondria (a compartment in which, at certain conditions, we predicted >50% of the total cellular Gibbs energy dissipation; compare Fig. 6) could have an elevated temperature34,35, theoretical considerations argue against a significant and detrimental temperature increase inside individual cells36. On the other hand, during enzymes’ catalytic cycle, enzymes might be set in motion, and Gibbs energy is therefore translated into work37,38,39,40. In fact, active metabolism was shown to increase cytoplasmic diffusion rates above those expected from thermal motion alone41,42,43. In turn, cytoplasmic motion can negatively affect biomolecular functions, such as kinetic proofreading and gene regulation44,45. Therefore, the upper limit on the rate of cellular Gibbs energy dissipation could reflect the limit of critical non-thermal motion inside the cell, beyond which biomolecular function would be compromised.
To maximize growth rate and at the same time avoid exceeding the critical Gibbs energy dissipation rate, cells must have evolved respective sensing mechanisms and means to control metabolic fluxes by adjusting enzyme abundance and kinetics. If intracellular molecule motion reflects the cellular Gibbs energy dissipation rate, then this could directly lead to differential regulation of gene expression. Alternatively, the recently uncovered cellular capability for metabolic flux sensing and flux-dependent regulation11,46 could have evolved in a manner to ultimately avoid detrimental Gibbs energy dissipation rates.
Our approach of using a limit in the cellular Gibbs energy dissipation rate is structurally similar to recent approaches using protein allocation constraints8,9, with a weighted sum of fluxes being the limiting element in both. In the protein allocation approaches, metabolic fluxes are weighted, for example by the molecular mass and the catalytic efficiency of the respective enzymes9. In contrast to these static weights, weighting in our approach is provided by the Gibbs energies of reaction, which can vary to some extent, being a function of flexible metabolite concentrations. We argue that the similarity not only is technical, but also probably has a biological or physical reason: to harness the energy released during catabolism, cells need to partition their metabolism into reaction steps that release Gibbs energy amounts that can be stored, for example as ATP. Thus an overall larger change in Gibbs energy in a pathway (for example, as in respiration compared to fermentation) requires more reaction steps, and thus a larger amount of enzyme.
Our work shows that cellular metabolism could be constrained by a limit on the cellular Gibbs energy dissipation rate. This limit is likely a universal, physical constraint on metabolism and could also explain the Warburg effect in cancer cells. Future work will need to show how the Gibbs energy dissipation rate limits biomolecular function, and how it could have shaped the evolution of enzyme expression and kinetics. Moreover, our concept for metabolic flux prediction, although computationally demanding, offers an advantage over current FBA-based methods as it does not require assumptions about reaction directionalities and does not require any organism-specific information that is hard to obtain, such as information on protein abundances and catalytic efficiencies47. Thus, with this work, we not only present a fundamental understanding of metabolism, but also provide an important contribution to predictive metabolic modelling.
Formulation of the combined thermodynamic and stoichiometric model
The combined thermodynamic and stoichiometric network model is based on steady-state mass balances for the metabolites i:
where Sij are the stoichiometric coefficients of the metabolic (j ∈ MET) and exchange (i ∈ EXG) processes; vj∈MET are the rates of metabolic processes, that is, chemical conversions and/or metabolite transport; and vi∈EXG are the rates of exchange processes, which describe the transfer of metabolites across the system boundary. In this stoichiometric network model, we included steady-state, pH-dependent proton and charge balances for each intracellular compartment; this imposes metabolic fluxes that keep the pH in the respective compartments and the membrane potentials across the membranes constant (Supplementary Methods 1.1).
In addition to the mass, proton and charge balances, we introduced a Gibbs energy balance, which states that gdiss equals both the sum of Gibbs energy exchange rates, gi∈EXG, and the sum of Gibbs energy dissipation rates, gj∈MET:
$$g^{\mathrm{diss}} = \sum_{i \in \mathrm{EXG}} g_i = \sum_{j \in \mathrm{MET}} g_j$$
The Gibbs energy exchange rates are defined as:
$$g_i = \Delta_f G'_i \, v_i \quad \forall\, i \in \mathrm{EXG}$$
where ∆fG′i∈EXG are the Gibbs energies of formation of the metabolites transferred across the system boundary. The Gibbs energy dissipation rates are defined as:
$$g_j = \Delta_r G'_j \, v_j \quad \forall\, j \in \mathrm{MET}$$
where ∆rG′j∈MET are the Gibbs energies of reaction of the cellular metabolic processes.
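The Gibbs energies of reaction of the metabolic processes, ∆rG′j∈MET, are due to chemical conversions and/or metabolite transport according to:
$$\Delta_r G'_j = \Delta_r G'^{o}_j + \Delta_r G'^{t}_j + RT \sum_i S_{ij} \ln c_i \quad \forall\, j \in \mathrm{MET}$$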
where ∆rG′oj∈MET are the standard Gibbs energies of the chemical conversions, ∆rG′tj∈MET the Gibbs energies of the metabolite transports, ln ci the natural logarithm of the concentration ci of the metabolite i, T the temperature and R the universal gas constant.
To define the Gibbs energy exchange rates, we used the Gibbs energies of formation, ∆fG′i∈EXG, of the respective metabolites i ∈ EXG that are transferred across the system boundary:
$$\Delta_f G'_i = \Delta_f G'^{o}_i + RT \ln c_i \quad \forall\, i \in \mathrm{EXG}$$
where ∆fG′oi ∈ EXG are the standard Gibbs energies of formation of the metabolites i ∈ EXG.
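As a consistency check, the Gibbs energy balance can be related to the mass balances. The derivation below is a sketch under stated assumptions that are not taken from the text: the steady-state mass balance is assumed to read \(\sum_{j \in \mathrm{MET}} S_{ij} v_j = v_i\) with \(v_i = 0\) for metabolites that are not exchanged, transport contributions are neglected, and the Gibbs energy of each reaction is assumed to be the stoichiometric sum of the Gibbs energies of formation of its metabolites.

```latex
\begin{align}
\sum_{j \in \mathrm{MET}} g_j
  &= \sum_{j \in \mathrm{MET}} \Delta_r G'_j \, v_j
   = \sum_{j \in \mathrm{MET}} \Big( \sum_i S_{ij} \, \Delta_f G'_i \Big) v_j \\
  &= \sum_i \Delta_f G'_i \sum_{j \in \mathrm{MET}} S_{ij} \, v_j
   = \sum_{i \in \mathrm{EXG}} \Delta_f G'_i \, v_i
   = \sum_{i \in \mathrm{EXG}} g_i
\end{align}
```

Under these assumptions, the Gibbs energy dissipated inside the cell equals the Gibbs energy exchanged with the environment, which is the statement of the Gibbs energy balance.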
All standard Gibbs energies were estimated using the component-contribution method18 and transformed16 (indicated by the prime symbol) to the pH values in the respective compartment. Furthermore, we used the extended Debye–Hückel equation to take into account the effect of electrolyte solution on charged metabolites16 (Supplementary Methods 1.2 and 1.3).
All metabolic processes j ∈ MET were in principle assumed to be reversible, but the directions of their fluxes must obey the second law of thermodynamics, according to:
$$g_j = \Delta_r G'_j \, v_j \le 0 \quad \forall\, j \in \mathrm{MET}$$
where the Gibbs energy dissipation rate, gj∈MET, must be <0, in case there is flux through this metabolic process (Supplementary Methods 1.4).
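The definitions above can be sketched numerically for a single process. The example below is a minimal illustration, not the implementation used in this work: the reaction, its standard Gibbs energy, the concentrations, the flux and the temperature of 30 °C are hypothetical, and the function names are chosen here for illustration.

```python
import math

R = 8.314e-3   # universal gas constant in kJ mol^-1 K^-1
T = 303.15     # temperature in K (assumption: 30 degrees C)


def gibbs_energy_of_reaction(dg0, stoich, ln_c, dg_transport=0.0):
    """Gibbs energy of reaction in kJ/mol.

    dg0          : standard Gibbs energy of the chemical conversion
    dg_transport : Gibbs energy of metabolite transport (0 if none)
    stoich       : dict metabolite -> stoichiometric coefficient S_ij
    ln_c         : dict metabolite -> natural log of the concentration (M)
    """
    return dg0 + dg_transport + R * T * sum(s * ln_c[m] for m, s in stoich.items())


def dissipation_rate(flux, dg):
    """Dissipation rate g_j = dG_j * v_j in kJ gCDW^-1 h^-1.

    flux is given in mmol gCDW^-1 h^-1, hence the division by 1000.
    """
    return dg * flux / 1000.0


def obeys_second_law(flux, dg, tol=1e-9):
    """A process carrying flux must dissipate Gibbs energy (g_j < 0)."""
    return abs(flux) <= tol or dg * flux < 0.0


# Hypothetical reaction A -> B with hypothetical concentrations.
stoich = {"A": -1, "B": 1}
ln_c = {"A": math.log(1e-3), "B": math.log(1e-4)}

dg = gibbs_energy_of_reaction(dg0=-5.0, stoich=stoich, ln_c=ln_c)
g = dissipation_rate(flux=2.0, dg=dg)
print(f"dG = {dg:.2f} kJ/mol, g = {g:.4f} kJ/gCDW/h, feasible: {obeys_second_law(2.0, dg)}")
```

Reversing the flux direction while keeping the same concentrations would give a positive dissipation rate, which `obeys_second_law` rejects; this is how concentrations constrain flux directions in a model where all processes are formally reversible.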
By combining the relevant equations mentioned above, we formulated the combined thermodynamic and stoichiometric model, M(v,ln c) ≤ 0, as a set of equalities and inequalities of the variables v (that is, the rates of the metabolic processes j ∈ MET and the exchange processes i ∈ EXG) and ln c (the natural logarithm of the concentrations of the metabolites i). This set comprises the mass, proton and charge balances, the Gibbs energy balance, the definitions of the Gibbs energies and the second law of thermodynamics.
Before performing mathematical optimizations with this non-linear and non-convex model, we applied two strategies to reduce the size of the model without reducing its degrees of freedom. First, we defined the scope of the predictions in terms of allowed exchange processes and removed all reactions from the model that could never carry metabolic flux under the specified conditions. Second, we identified reactions that are fully coupled (that is, whose fluxes are always proportional to each other) as described48, and reformulated the model, M(v,ln c) ≤ 0, by replacing the reaction fluxes v with the flux through the group of coupled reactions, vgrp. Note that the reduced model, Mgrp(v,ln c) ≤ 0, still depends only on the fluxes v and metabolite concentrations ln c. Whereas the mass balances and Gibbs energy balance are formulated using the flux through the reaction groups vgrp, the second law of thermodynamics is still formulated for every metabolic process individually so as not to lose any directionality constraints.
The reduced model, together with a set of bounds, B(v,ln c) ≤ 0, on the variables v and ln c, defines the solution space Ω. Ω contains the mass-, proton- and charge-balanced and thermodynamically feasible steady-state solutions, in terms of rates v and metabolite concentrations ln c. The set of bounds, B(v,ln c) ≤ 0, consists of constraints on the rates of the extracellular exchange processes (for example, the uptake rate of a carbon source), the physiological ranges of the intracellular metabolite concentrations, ln c, and Gibbs energies of reactions, ∆rG′, or an upper limit on gdiss. We analysed the solution space, Ω, using mathematical optimization, where we formulated different optimization problems, for example regression, flux balance and variability analyses (Supplementary Methods 1.5).
Because Ω is non-convex and non-linear, the optimization problems can contain multiple local optima. To efficiently solve these problems, we first determined an approximate solution by solving a linear relaxation of the optimization problem with the mixed-integer programming solver CPLEX 12 (IBM ILOG). Then, we used this approximate solution as starting point for the solution of the optimization problem with the global optimization solver ANTIGONE 1.0 (ref. 21) or the local solver CONOPT3 (ref. 49).
Generally, we implemented all optimization problems in the mathematical programming system General Algebraic Modeling System (GAMS) (GAMS Development Corporation, release 24.2.2). The optimization problems were solved on computational clusters; we used a small test cluster of 30 cores for model development. For the large-scale studies in which we solved >100,000 optimization problems, we set up a cluster in Amazon’s Elastic Compute Cloud comprising 1,248 cores, or used a managed HPC cluster comprising 5,640 cores. Solving these optimization problems typically took between 30 min and 14 h (Supplementary Methods 1.6).
We estimated the gdiss values and a thermodynamically consistent set of standard Gibbs energies of reactions, ∆rG′o, from experimental data and the reduced model, Mgrp(v,ln c) ≤ 0. The experimental data consisted of (1) measured extracellular physiological rates and (2) intracellular metabolite concentrations (only for S. cerevisiae), which were both determined for glucose-limited chemostat cultures at different dilution rates, and (3) standard Gibbs energies of reactions, determined from the component contribution method18.
We formulated a non-linear regression analysis that we regularized using the Lasso method50. This regularization, which was done to prevent overfitting the data, included a regularization parameter α, which was determined by model selection. The regression comprised two steps: (1) determining the minimal training error as a function of α and (2) determining the goodness of fit using the reduced chi-squared χ2red,α as a function of α. The model selection was performed by repeating these two steps for different α values and selecting the α with a reduced χ2red,α of 1, which means that the model and the data fit each other (Supplementary Methods 2.1 and 3.1).
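The model selection step can be illustrated with a deliberately small example. The sketch below does not reproduce the large-scale non-linear regression described here: it fits a single parameter (a mean) with an L1 penalty, for which the minimizer has a closed form, and then picks the regularization parameter whose reduced chi-squared is closest to 1. The data, the measurement uncertainty, the grid of α values and all function names are hypothetical.

```python
def lasso_mean(y, sigma, alpha):
    """Minimise sum(((y_i - theta)/sigma)^2) + alpha*|theta| over theta.

    The minimiser is the sample mean shrunk towards zero by
    alpha*sigma^2/(2n) (soft thresholding).
    """
    n = len(y)
    mean = sum(y) / n
    shrink = alpha * sigma ** 2 / (2 * n)
    magnitude = max(abs(mean) - shrink, 0.0)
    return magnitude if mean >= 0 else -magnitude


def reduced_chi2(y, sigma, theta, n_params=1):
    """Sum of squared, uncertainty-scaled residuals per degree of freedom."""
    dof = len(y) - n_params
    return sum(((v - theta) / sigma) ** 2 for v in y) / dof


def select_alpha(y, sigma, alphas):
    """Pick the alpha whose fit has a reduced chi-squared closest to 1."""
    def distance(alpha):
        theta = lasso_mean(y, sigma, alpha)
        return abs(reduced_chi2(y, sigma, theta) - 1.0)
    return min(alphas, key=distance)


# Hypothetical measurements with a common standard deviation.
y = [2.1, 1.8, 2.4, 1.9, 2.3]
sigma = 0.3
alphas = [0, 5, 10, 15, 20, 25]

best = select_alpha(y, sigma, alphas)
for a in alphas:
    chi2 = reduced_chi2(y, sigma, lasso_mean(y, sigma, a))
    print(f"alpha={a:2d}  reduced chi2={chi2:.3f}")
print("selected alpha:", best)
```

A reduced chi-squared well below 1 indicates that the fit follows the data more closely than the measurement uncertainty justifies, whereas values well above 1 indicate that the regularization is too strong for the model to describe the data.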
Next, we determined physiological bounds for the Gibbs energies, ∆rG′j∈MET, of the metabolic processes j ∈ MET and for the metabolite concentrations ci. These physiological bounds (lower, lo; upper, up) are needed to formulate the linear relaxation used in our strategy for solving the FBA optimizations. They were defined by the infimum and supremum, that is, the smallest and greatest possible values of c and ∆rG′ across all experimental conditions of the data sets, as determined by variability analysis (Supplementary Methods 2.2 and 3.2).
Flux balance analysis with the combined thermodynamic and stoichiometric model
For different growth conditions, that is, glucose uptake rates or carbon sources, we predicted metabolic fluxes using the reduced model, Mgrp(v,ln c) ≤ 0, and FBA. To this end, we defined the solution space of each FBA, ΩFBA, as follows. The metabolite concentrations, ln c, and the Gibbs energies of reaction, ∆rG′, were constrained by the regression-identified physiological bounds, and the standard Gibbs energies of reactions, ∆rG′o, were set to the identified thermodynamically consistent set. Furthermore, gdiss was constrained by gdisslim and the rates of exchange processes were constrained by the growth condition, such that any quantity of oxygen, phosphate, ammonium, water, protons, sulfate, and so on (resembling what was available in the growth medium) could be taken up, and all other compounds could be excreted.
Then, we used FBA14, in which we maximized the growth rate, µ, in the solution space ΩFBA:
$$\mu^{*} = \max_{(v,\, \ln c)\, \in\, \Omega_{\mathrm{FBA}}} \mu$$
We then characterized the solution space ΩFBAµ* for optimal growth rates, using flux variability analysis, and, as described14,26,51, using Markov-chain Monte Carlo (MCMC) sampling (Supplementary Methods 2.4 and 3.4).
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
The data that support the plots within this paper and other findings of this study are available from the corresponding author upon reasonable request. The code is available from the corresponding author upon request and the code to perform the flux balance analyses is deposited on GitHub (https://doi.org/10.5281/zenodo.1401220).
This work was funded by the Netherlands Organisation for Scientific Research (NWO) through the Systems Biology Centre for Metabolism and Ageing (Groningen), and by the BE-Basic R&D Program, which was granted as FES subsidy from the Dutch Ministry of Economic Affairs, Agriculture and Innovation (EL&I). We thank A. Canelas for sharing raw data, E. Noor for help with the component contribution method, E. Wit for statistics advice, G. Zampar for helpful discussions and B. Bakker, A. Bardow, D. Huberts, A. Ortega, U. Sauer, S. Stratmann and J. Radzikowski for helpful comments on the manuscript.