The space of enzyme regulation in HeLa cells can be inferred from its intracellular metabolome

Diener, Christian; Muñoz-Gonzalez, Felipe; Encarnación, Sergio; Resendis-Antonio, Osbaldo

doi:10.1038/srep28415

Article
Open access
Published: 23 June 2016

Scientific Reports volume 6, Article number: 28415 (2016)

Abstract

During the transition from a healthy state to a cancerous one, cells alter their metabolism to increase proliferation. The underlying metabolic alterations may be caused by a variety of different regulatory events on the transcriptional or post-transcriptional level whose identification contributes to the rational design of therapeutic targets. We present a mechanistic strategy capable of inferring enzymatic regulation from intracellular metabolome measurements that is independent of the actual mechanism of regulation. Here, enzyme activities are expressed by the space of all feasible kinetic constants (k-cone) such that the alteration between two phenotypes is given by their corresponding kinetic spaces. Deriving an expression for the transformation of the healthy to the cancer k-cone we identified putative regulated enzymes between the HeLa and HaCaT cell lines. We show that only a few enzymatic activities change between those two cell lines and that this regulation does not depend on gene transcription but is instead post-transcriptional. Here, we identify phosphofructokinase as the major driver of proliferation in HeLa cells and suggest an optional regulatory program, associated with oxidative stress, that affects the activity of the pentose phosphate pathway.

Introduction

During the development of cancer, cells undergo major metabolic changes to increase their capacity for proliferation. In many cases, that transition is characterized by an increased usage of fermentation, which is independent of the presence of oxygen and caused by a higher flux through glycolysis and diminished activity of the TCA cycle¹. The resulting decrease in respiration and increased secretion of lactate have been known since the 1920s and are named the Warburg effect after Otto Warburg. Although the Warburg effect has been well characterized in the last century, questions remain as to which regulatory changes are necessary to cause it². The recently increased availability of genome and proteome technologies has provided great advantages in the analysis of genome-level regulation during the formation of cancer^3,4,5. Thus, it has become apparent that regulatory events in cancer development are heterogeneous and that the regulatory events causing the Warburg effect may be distinct between different cancer types or even patients with the same cancer^6,7. Additionally, data on the transcription level may only detect a subset of regulatory events, such as changes in gene expression and mutations, but fail to find post-translational regulation events such as protein modifications, phosphorylation or allosteric regulation that might have a large impact on cancer development^8,9,10,11.

Therefore, it comes as no surprise that there has been an ongoing effort to combine data from the genome with the metabolome, the concentrations of all of the cell's metabolites^12,13. Metabolome data are inherently more complicated to obtain than genome data due to the necessity of different protocols for different metabolites. However, the metabolome is also closer to the cellular phenotype, as it forms the basis for growth and cellular health and has proven to deliver reliable markers for the detection of cancer^12,14. This holds particularly for the intracellular metabolome, which gives a detailed snapshot of a cell's metabolic state and is often more informative than extracellular measurements from biofluids¹⁵.

While there exists a wide selection of methods to analyze genomic data and infer regulation events in cancer up to the enzyme level, metabolome data are often analyzed solely on the abundance level, with only limited ability to extend this information to the entire biological system or to connect metabolome data to the inferences made from genome data. However, there exist some notable exceptions where methods from Systems Biology have been applied successfully^16,17. Here, a common strategy is to employ fluxome analysis, based mostly on Flux Balance Analysis (FBA) or control theory^18,19,20,21,22. FBA has been shown to be a valuable tool in cancer research, albeit with certain limitations²³. In particular, it is difficult to connect metabolome data to FBA because FBA does not treat kinetics explicitly and therefore has no direct quantifiable concept of how metabolite concentrations influence cellular fluxes. An alternative formulation termed “k-cone” remedies this situation by acting on the space of possible kinetic parameters rather than fluxes^24,25,26. In contrast to FBA, it makes assumptions about specific kinetics and may give better insight into the system's dynamics. In the analysis of regulation events, k-cone analysis is particularly useful because it expresses the system's properties in terms of individual enzyme activities, which can be connected to other omics data such as genome or proteome data that give estimates of enzyme concentrations.

In this manuscript, we present the first metabolome profiles for the cancerous HeLa and non-cancerous HaCaT cell lines. We use the k-cone formalism to obtain a differential analysis of enzyme activities between the two cell lines, as that formulation can use metabolome data to provide a mathematical transformation between the normal and disease state. Our analysis identifies alterations in the enzyme activities of HeLa cells, many of which are consistent with previously identified changes. Furthermore, we also show that there exists a set of optional enzyme regulations that may help HeLa cells to alleviate oxidative stress without compromising proliferation. Taken together, we propose that differential k-cone analysis, which may integrate genome-scale metabolic reconstructions and metabolome data, is a suitable conceptual scheme to identify and suggest the regulatory mechanisms required to establish the phenotype in cancer cell lines.

Results

Obtaining the k-cone and global stability from metabolome data

K-cone analysis is closely related to the steady state, the state of homeostasis for a metabolic system. During the steady state, metabolite concentrations are constant, which gives rise to the steady state equation

$$S v = 0.$$

Here, S denotes the stoichiometric matrix and v the vector of steady state fluxes. The space of all v that fulfill that equation is commonly known as the flux cone. The k-cone is based on the same equation but further assumes a distinct structure of the fluxes, where each flux can be decomposed into a kinetic constant and a metabolic term given by the metabolite concentrations x as v_i = k_i m_i(x). This case holds for mass-action kinetics with stoichiometries s_ji

$$m_i(x) = \prod_{j:\, s_{ji} < 0} x_j^{|s_{ji}|}$$

and gives rise to the k-cone equation

$$S M k = 0,$$

where M denotes a diagonal matrix containing the mass-action terms m_i(x) on its diagonal^24,26. This equation now defines a space for all feasible kinetic constants in the steady state, the k-cone. Dividing all reversible biochemical reactions into their irreversible individual forward and backward reactions further yields a k-cone that is strictly positive. It is important to note that the k-cone does not identify the exact kinetic constants for the system but rather the space in which those constants must reside. Furthermore, the k-cone can be constrained by known in vivo equilibrium constants (see Supplementary Text)²⁴. Because we were particularly interested in a differential analysis of enzyme activities, we tried to find expressions relating the k-cone of a normal cellular state to a disease state. As we show in detail in the Supplementary Text, given the k-cones for the normal state K_n and the disease state K_d, the corresponding mass action term matrices M_n and M_d define a diagonal transformation matrix T = M_nM_d⁻¹ such that K_d = TK_n. Thus, the difference between the normal and the disease state of the entire space of steady state kinetic parameters is completely defined by the mass-action terms which can be obtained from metabolome data. The diagonal matrix T explains all possible changes in enzyme activity between the normal and disease state in a quantitative manner and quantifies the prevalence with which an enzyme is regulated when the fluxes are unknown. Specifically, if feasible kinetic constants were randomly sampled from the normal and disease k-cones for the same cell, the expectation of their fold changes would be given by the diagonal of T, which is why we will denote the entries in T as the expected differential activities (EDAs).
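The relation between mass-action terms, the k-cone equation and the EDAs can be illustrated with a minimal sketch. The three-reaction pathway, the concentrations and the function names below are invented for illustration and are not taken from the model or the dycone package; the sketch only assumes that substrates carry negative stoichiometric coefficients.

```python
# Toy pathway (hypothetical): R0: -> A, R1: A -> B, R2: B ->
# Rows are metabolites (A, B), columns are irreversible reactions.
S = [
    [1, -1, 0],   # A
    [0, 1, -1],   # B
]

def mass_action_terms(S, x):
    """m_i(x): product of x_j^|s_ji| over the substrates (s_ji < 0) of reaction i."""
    terms = []
    for i in range(len(S[0])):
        m = 1.0
        for j, row in enumerate(S):
            if row[i] < 0:
                m *= x[j] ** abs(row[i])
        terms.append(m)
    return terms

def in_k_cone(S, m, k, tol=1e-9):
    """Check the k-cone equation S M k = 0 with M = diag(m)."""
    return all(
        abs(sum(row[i] * m[i] * k[i] for i in range(len(k)))) < tol
        for row in S
    )

x_normal = [2.0, 1.0]    # made-up concentrations of A and B
x_disease = [1.0, 4.0]

m_n = mass_action_terms(S, x_normal)    # [1.0, 2.0, 1.0]
m_d = mass_action_terms(S, x_disease)   # [1.0, 1.0, 4.0]

# Diagonal of T = M_n M_d^-1, the expected differential activities (EDAs).
eda = [mn / md for mn, md in zip(m_n, m_d)]

k_n = [1.0, 0.5, 1.0]                       # one point in the normal k-cone
k_d = [t * k for t, k in zip(eda, k_n)]     # its image under T

print(eda, in_k_cone(S, m_n, k_n), in_k_cone(S, m_d, k_d))
```

Because T is diagonal, applying it to any feasible k_n only rescales individual kinetic constants, which is why the entries of T can be read reaction by reaction.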

However, one particular normal or disease state only occupies one distinct point, k_n or k_d, in its respective k-cone, K_n or K_d. In the case where the fluxes are known along with the metabolome one can use the mass-action kinetics, v_i = k_im_i(x), to pinpoint the kinetic constants directly through dividing the fluxes by the corresponding mass-action terms m_i(x). Based on this argumentation, one can derive a relation that incorporates flux data and where k_d = TWk_n (Supplementary Text). Here, W is a diagonal weight matrix containing the steady state flux ratios v_d/v_n. Thus, the transformation of the entire k-cone is given by metabolome data alone, whereas the exact position in the respective k-cones is defined by fluxome data. The fluxome of human cell lines is usually unknown, but there exist several strategies for using prior information to estimate a feasible flux distribution of cancer cells. Here, the most common strategy is to use flux balance analysis (FBA), possibly incorporating other genome-scale data, such as gene expression and growth rate measurements²⁷.
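As a rough illustration of k_d = TWk_n, the following sketch uses made-up mass-action terms and fluxes for three reactions. Since T and W are both diagonal, the product reduces to elementwise multiplication, and the result can be compared with the direct estimate k_i = v_i / m_i(x). The function name is hypothetical.

```python
def disease_constants(k_n, m_n, m_d, v_n, v_d):
    """k_d = T W k_n with T = M_n M_d^-1 and W = diag(v_d / v_n)."""
    return [
        k * (mn / md) * (vd / vn)
        for k, mn, md, vn, vd in zip(k_n, m_n, m_d, v_n, v_d)
    ]

# Made-up mass-action terms and steady state fluxes for three reactions.
m_n = [1.0, 2.0, 1.0]
m_d = [1.0, 1.0, 4.0]
v_n = [1.0, 1.0, 1.0]
v_d = [2.0, 2.0, 2.0]   # the disease state carries twice the flux

# With known fluxes the kinetic constants follow from k_i = v_i / m_i(x).
k_n = [v / m for v, m in zip(v_n, m_n)]
k_d = disease_constants(k_n, m_n, m_d, v_n, v_d)
k_d_direct = [v / m for v, m in zip(v_d, m_d)]

print(k_d, k_d_direct)
```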

Metabolome measurements were obtained for the HaCaT keratinocyte and HeLa cell lines, both of which were selected for their ability to proliferate independently and have very similar doubling times^28,29,30. Here, HaCaT cells were used as an immortalized control, meaning the cell line exhibits unlimited proliferation but is not cancerous, whereas HeLa is a cervical cancer cell line. We quantified a large fraction of the metabolites participating in the central carbon metabolism for both cell lines (three biological replicates for each cell line, data in Table S1, see Materials and Methods). The obtained log-fold changes in metabolite concentrations are shown in Fig. 1 and were consistent with previously published works^15,31,32 on different cancers, particularly in showing a strong deregulation of metabolites and intermediates of the glycine and proline metabolisms between HeLa and HaCaT³³.

The measured metabolite concentrations were mapped to a model of the central carbon metabolism, extending a previously published model to yield one with 100 irreversible reactions assuming mass-action kinetics³⁴. The model contained all major pathways of central carbon metabolism, such as glycolysis/gluconeogenesis, the TCA cycle and the pentose phosphate pathway, as well as simplified versions of cellular respiration and oxidative stress (Table S2). Of the measured metabolites, 28 could be mapped to the 43 metabolites in the model and the remaining unmapped model metabolites were imputed by previous measurements and assumed to be the same in HaCaT and HeLa (see Supplementary Tables S1 and S2 as well as Materials and Methods). This yielded the 100 mass-action terms of the model, where 71 mass-action terms were based on at least one measured (non-imputed) metabolite and 43 mass-action terms contained at least one imputed metabolite.

Using this model, we derived the specific k-cones and transition matrix T from the metabolite measurements, yielding a k-cone containing over 80,000 basis vectors. Due to the high dimensionality of the k-cone, visualization requires a mapping to a lower-dimensional space. We employed two different strategies for this purpose. In the first, shown in Fig. 2B, we mapped the reduced k-cone onto two dimensions by principal component analysis (PCA) and proceeded by clustering the reduced k-cone vectors in order to eliminate identical vectors in the reduction (also see Materials and Methods for details on the projection). As an alternative, we also employed a strategy based on reducing the original k-cone space. Here, we used additional constraints taken from approximations of in vivo equilibrium constants (K_eq) obtained from the eQuilibrator database³⁵ (http://equilibrator.weizmann.ac.il/), which reduced the k-cone to only 40 basis vectors that were then mapped onto two dimensions by PCA (see Fig. S1, Table S2 and Supplementary Text). We observed that on a global scale, the space of enzyme activities, as given by the kinetic parameters, was almost identical between the HaCaT and HeLa cell lines, indicating that the differences were limited to reactions with relatively low enzyme activities (compare Fig. 2B and Fig. S1A). To see whether there exists a subspace where the k-cones differ, we also visualized the k-cone for only those reactions whose mass-action terms changed by at least 2-fold, thus indicating large influences in the matrix T (Fig. 2C and S1B). Here, we observed a stronger difference between the k-cone spaces of the HaCaT and HeLa cell lines, mostly achieved by slight rotation and strong scaling in the basis vectors. Those results could be observed for the complete k-cones as well as for the ones constrained by equilibrium constants.

Given a k-cone basis, it is also possible to evaluate the stability of the entire basis. Here, stability denotes the ability of the system to return to its steady state upon slight perturbation. For a detailed explanation refer to Supplementary Text. We calculated the stability for all basis vectors in each of the non-reduced k-cones. Because every possible steady state solution of kinetic constants must be a linear combination of the k-cone basis vectors, the global stability of the system must be a combination of the observed stabilities. The majority of all k-cone basis vectors were stable, which is to be expected of biologically relevant steady states (Fig. S1C). However, there was also a large group of unstable basis vectors, albeit with very small positive eigenvalues. One should note that due to the smaller absolute values of eigenvalues in the unstable states, the stable state will predominate, meaning that mixed states composed of stable and unstable basis vectors will most likely be stable (see Supplementary Text). The proportion of stable basis vectors in each k-cone remained the same between HaCaT and HeLa cells, suggesting that cancer in HeLa cells is essentially a stable state and as difficult to perturb as the non-cancerous HaCaT cells.
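A minimal sketch of such a stability check is given below. It assumes the usual local criterion, namely that a steady state is stable when all eigenvalues of the Jacobian of dx/dt = S M(x) k have negative real parts; the exact procedure is described in the Supplementary Text and may differ. The two-metabolite pathway and all function names are hypothetical, and the closed-form eigenvalue formula only works for 2x2 Jacobians (larger systems would need a numerical eigenvalue solver).

```python
import cmath

# Toy pathway (hypothetical): R0: -> A, R1: A -> B, R2: B ->
S = [[1, -1, 0], [0, 1, -1]]

def jacobian(S, k, x):
    """Jacobian of dx/dt = S M(x) k for mass-action terms (x must be nonzero)."""
    n_met, n_rxn = len(S), len(S[0])
    m = []
    for i in range(n_rxn):
        term = 1.0
        for j in range(n_met):
            if S[j][i] < 0:
                term *= x[j] ** abs(S[j][i])
        m.append(term)
    J = [[0.0] * n_met for _ in range(n_met)]
    for row in range(n_met):
        for j in range(n_met):
            for i in range(n_rxn):
                if S[j][i] < 0:
                    # d m_i / d x_j = |s_ji| * m_i / x_j for a substrate j
                    J[row][j] += S[row][i] * k[i] * abs(S[j][i]) * m[i] / x[j]
    return J

def eigenvalues_2x2(J):
    """Eigenvalues of a 2x2 matrix from its trace and determinant."""
    tr = J[0][0] + J[1][1]
    det = J[0][0] * J[1][1] - J[0][1] * J[1][0]
    disc = cmath.sqrt(tr * tr - 4 * det)
    return (tr + disc) / 2, (tr - disc) / 2

def is_stable(J):
    """Stable if every eigenvalue has a negative real part."""
    return all(ev.real < 0 for ev in eigenvalues_2x2(J))

J = jacobian(S, k=[1.0, 0.5, 1.0], x=[2.0, 1.0])
print(J, eigenvalues_2x2(J), is_stable(J))
```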

Inference of enzymatic regulation in HeLa

As shown in the previous section, the transition given by T gives an approximation of enzyme activity fold-changes between the HeLa and HaCaT cell lines and does not require explicit calculation of the k-cone. However, because the mass-action terms are based on the products of noisy data, special care must be taken to avoid an influence of the reaction order (number of multiplicands in the mass-action terms) on the fold-changes and their statistics. Thus, we performed all analyses in log-space considering log₂T rather than T. The problem of identifying differential enzyme activity is similar to the problem of identifying differential gene expression. Thus, analysis methods for microarray data can be applied to the mass-action terms if they follow the required log-normal distribution. Because we used mass-action kinetics, the logarithmic transformation of the mass-action terms is a weighted sum of the log-transformed metabolite concentrations and is approximately normally distributed as long as the log-transformed metabolite concentrations are as well. The validity of this assumption was verified by quantile-quantile plots as well as the empirical distribution function of the log-transformed metabolite concentrations (shown in Fig. S2). This assumption allowed us to employ methods from microarray analysis, as used by the limma package³⁶. The significance of the observed log-fold changes in enzyme activity between HaCaT and HeLa cells was obtained by Welch t-tests using an empirical Bayes estimator for the stable quantification of sample variances³⁷.
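The log-space treatment can be sketched as follows. The log₂ mass-action term of a reaction is computed as a weighted sum of log₂ concentrations, and the two cell lines are compared with a plain Welch t statistic. The concentrations and substrate names are invented, and the empirical Bayes variance moderation used by limma is deliberately left out, so this is a simplified stand-in rather than the analysis performed in the paper.

```python
import math
from statistics import mean, variance

# Substrate orders of one reaction (hypothetical two-substrate example).
SUBSTRATES = {"f6p": 1, "atp": 1}

def log2_mass_action(substrates, conc):
    """log2 m_i(x) as the order-weighted sum of log2 concentrations."""
    return sum(order * math.log2(conc[met]) for met, order in substrates.items())

def welch_t(a, b):
    """Welch t statistic and Welch-Satterthwaite degrees of freedom."""
    se_a, se_b = variance(a) / len(a), variance(b) / len(b)
    t = (mean(a) - mean(b)) / math.sqrt(se_a + se_b)
    df = (se_a + se_b) ** 2 / (
        se_a ** 2 / (len(a) - 1) + se_b ** 2 / (len(b) - 1)
    )
    return t, df

# Three made-up replicates per cell line (arbitrary concentration units).
hacat = [{"f6p": 8, "atp": 4}, {"f6p": 8, "atp": 8}, {"f6p": 16, "atp": 4}]
hela = [{"f6p": 2, "atp": 2}, {"f6p": 2, "atp": 4}, {"f6p": 4, "atp": 2}]

log_m_hacat = [log2_mass_action(SUBSTRATES, c) for c in hacat]  # [5.0, 6.0, 6.0]
log_m_hela = [log2_mass_action(SUBSTRATES, c) for c in hela]    # [2.0, 3.0, 3.0]

# log2 of the diagonal entry of T = M_n M_d^-1 for this reaction.
log2_eda = mean(log_m_hacat) - mean(log_m_hela)
t_stat, df = welch_t(log_m_hacat, log_m_hela)
print(log2_eda, t_stat, df)
```

Working with sums of logarithms rather than products of raw concentrations keeps the noise of a second-order reaction on the same additive scale as that of a first-order one.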

There is the possibility that only a subset of the reactions is actually required to maximize proliferation, leaving a large window of variation for fluxes that do not directly influence the growth rate. Because the model, and thus the flux cone, is identical for both conditions, those differences would not be detectable with standard methods for calculating metabolic fluxes, such as sampling from the flux cone or flux balance analysis, both of which would yield the same fluxes for both conditions on average. Because the accuracy of our approximation of differential enzyme activity is compromised by large changes in the fluxes between the cell lines, we performed flux variability analysis to identify the maximum log-fold change that could be caused by variation in the fluxes. Here, we first added a biomass reaction to the model and maximized its flux. This step was followed by flux variability analysis to obtain upper bounds for the absolute log-fold change for each reaction flux³⁸. The individual log-fold changes were then filtered by those upper bounds to leave only those log-fold changes that could not be counteracted by flux variation (see Materials and Methods). This process could be used to identify differential enzyme activities that were necessary for optimal proliferation, as they could not be explained by flux variation under optimal growth. This analysis can be interpreted as identifying dimensions in the k-cones that do not overlap between normal and disease conditions.
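The filtering step can be sketched as below. The sketch assumes that the upper bound for a reaction is the log₂ ratio of its maximum to its minimum flux under optimal growth and that a log-fold change is kept only if its absolute value exceeds that bound; both choices are assumptions made for illustration, and the reaction names and numbers are invented.

```python
import math

def max_flux_lfc(v_min, v_max):
    """Largest absolute log2 fold change explainable by flux variation alone."""
    if v_min <= 0:
        # A flux that can drop to zero can explain any fold change.
        return math.inf
    return math.log2(v_max / v_min)

def necessary_changes(lfc, flux_ranges):
    """Keep log-fold changes that flux variation cannot counteract."""
    return {
        rxn: value
        for rxn, value in lfc.items()
        if abs(value) > max_flux_lfc(*flux_ranges[rxn])
    }

# Made-up EDA log2 fold changes and flux ranges under optimal growth.
lfc = {"PFK": 3.0, "G6PD": 1.0, "ACO": -0.5}
flux_ranges = {"PFK": (9.0, 10.0), "G6PD": (0.0, 5.0), "ACO": (1.0, 4.0)}

kept = necessary_changes(lfc, flux_ranges)
print(kept)
```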

The mean log-fold changes for the EDAs in each reaction are shown in Fig. 3. Reactions with a significant p-value (FDR corrected p < 0.05) in their EDAs are indicated in Fig. 4A and their individual log-fold changes along with their flux variation are shown in Fig. 4B. Mean log fold changes along with their credible intervals are reported in Supplementary Table S3.
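The text does not state which FDR procedure was used. As one common possibility, the sketch below applies the Benjamini-Hochberg adjustment to a handful of invented p-values and flags those below the 0.05 threshold.

```python
def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    n = len(pvalues)
    order = sorted(range(n), key=lambda i: pvalues[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest to the smallest p-value to enforce monotonicity.
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * n / rank)
        adjusted[i] = running_min
    return adjusted

pvalues = [0.001, 0.04, 0.03, 0.5]   # made-up raw p-values, one per reaction
adjusted = benjamini_hochberg(pvalues)
significant = [p < 0.05 for p in adjusted]
print(adjusted, significant)
```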

Notably, many of the enzymes with significant EDAs are already known to be altered in cancer. Those enzymes include phosphofructokinase, phosphoglycerate mutase, pyruvate kinase, 6-phosphogluconate dehydrogenase, pyruvate dehydrogenase and aconitase^{39,40,41,42,43,44,45}. Previously unknown regulation events include a retention of glucose-6-phosphate, an increased production rate of PRPP (5-Phospho-alpha-D-ribose 1-diphosphate) and a strongly accelerated production of glyceraldehyde 3-phosphate. Additionally, the EDAs predict a strong dysregulation of TCA cycle enzyme activities as well as increased ATP usage and lactate export in HeLa cells, all consistent with the Warburg effect¹⁴. The set of necessary regulations for proliferation consisted of the up-regulation of four glycolytic enzymes, the up-regulation of ATP synthesis and the increased export of lactate. Thus, the Warburg effect in HeLa cells seems to be a consequence of maintaining a high proliferation rate. In general, the strongest regulation was observed for phosphofructokinase, which showed an 8-fold increase in enzyme activity in HeLa compared to the HaCaT cell line. Because phosphofructokinase is allosterically regulated by ATP, citrate and pH, this regulation is consistent with the observed lower concentrations of ATP, citrate and lactate as shown in Fig. 1⁴⁶.

Relation to gene expression and coregulation of heterogeneous enzyme activities

Given our predictions of differential enzyme activity based on metabolome data (EDAs), we also investigated how well these data would correlate with differential gene expression in HaCaT and HeLa cells and, thus, whether the observed changes in enzyme activity are due to changes in gene expression. For this purpose, we assembled a data set consisting of 58 microarray samples (20 HaCaT and keratinocyte samples and 38 HeLa samples) on a single platform (HGU133Plus 2.0) obtained from the GEO database⁴⁷. Here, the keratinocyte samples were added because only very few HaCaT samples were available in the GEO database. Their validity was checked by PCA and clustering over the expression values, which consistently grouped the keratinocyte samples together with the HaCaT samples (Fig. S2 and Supplementary Text). All samples in the list were curated manually to ensure that they described untreated conditions and they can be found in Supplementary Table S4. We found that log-fold changes obtained from EDAs and gene expression did not correlate on a global level (see Fig. 5A, Pearson product-moment correlation <0.01, p > 0.83). However, some of the enzymes with the largest changes in activity also showed significant changes in gene expression (compare Table S5), particularly phosphofructokinase, glucose-6-phosphate isomerase and phosphoglucokinase. As shown in Fig. 5A, enzymes participating in the pentose phosphate pathway and TCA cycle with a significant change in enzyme activity, as predicted by metabolome data, often showed significant changes in gene expression, but in many cases in the opposite direction (meaning they were up-regulated in their EDA but down-regulated in gene expression and vice versa; also see Supplementary Text). In total, genomic regulation is mostly active in glycolysis. However, gene expression in general is not a good predictor of differential enzyme activity, suggesting that the primary regulation of metabolism in HeLa cells occurs on the post-transcriptional level.
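The comparison between EDAs and gene expression can be sketched with a plain Pearson correlation and a check for opposite signs. The reaction labels and log-fold changes below are invented for illustration and were chosen so that the two vectors are uncorrelated.

```python
import math

def pearson(x, y):
    """Pearson product-moment correlation of two equally long sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Made-up log2 fold changes for four hypothetical reactions.
reactions = ["r1", "r2", "r3", "r4"]
eda_lfc = [3.0, -1.0, 1.0, -3.0]
expr_lfc = [1.0, -1.0, -1.0, 1.0]

r = pearson(eda_lfc, expr_lfc)

# Reactions regulated in opposite directions on the two levels.
opposite = [
    rxn for rxn, e, g in zip(reactions, eda_lfc, expr_lfc) if e * g < 0
]
print(r, opposite)
```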

Genomic analysis of human cancers has already shown that metabolic regulation in cancer can be highly heterogeneous and varies greatly among different cancers and even within patients with the same cancer⁷. Here, we aimed to analyze the heterogeneity of regulation on a metabolic level. Log-fold changes of enzyme activity within HaCaT samples and between HeLa and HaCaT samples both showed strong variations (compare Figs 3 and 4B). To analyze this phenomenon in more detail, we calculated standard deviations for all obtained log-fold changes of the EDAs for the control log-fold changes (within HaCaT) and differential log-fold changes (between HeLa and HaCaT). Here, standard deviations were mostly conserved on a reaction level between the HeLa and HaCaT cell lines, and most standard deviations for enzyme activities from the HeLa samples remained within 3-fold (0.33 to 3 times) of the standard deviations within the HaCaT samples (see Fig. 5B). The largest variations within HaCaT and between HeLa and HaCaT could be observed for reactions alleviating oxidative stress and the reactions of the TCA cycle, indicating that both HaCaT and HeLa cells show variations in the regulation of enzymes involved in oxidative stress and the TCA cycle. Although gene expression was not a good predictor of enzyme activity changes, the heterogeneity seemed to be consistent with previously reported measurements of gene expression, which also identified the TCA cycle and oxidative stress genes as the most heterogeneous ones in both normal and cancer cells⁷.

To identify reactions with specifically increased heterogeneity in cancer, we selected reactions whose variation increased by at least 3-fold in HeLa cells compared to HaCaT cells. For the EDAs, this selection produced a set of 11 reactions involved in the pentose phosphate pathway, glycolysis and respiration as well as increased ATP usage (see Fig. 5C). Log-fold changes of those reactions were highly correlated and formed two blocks, one connecting a high ATP usage, respiration and the late phase of glycolysis and another connecting reactions of the pentose phosphate pathway. Both of these blocks were connected by phosphofructokinase (PFK), showing a strong influence of PFK in the balancing of respiration with the pentose phosphate pathway.
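The selection criterion can be sketched as a ratio of standard deviations per reaction. The log-fold changes, reaction names and the function name below are made up, and the sample standard deviation is used as one reasonable choice.

```python
from statistics import stdev

def increased_heterogeneity(control_lfc, differential_lfc, fold=3.0):
    """Reactions whose standard deviation grows by at least `fold`."""
    selected = {}
    for rxn, control in control_lfc.items():
        sd_control = stdev(control)
        sd_diff = stdev(differential_lfc[rxn])
        if sd_control > 0 and sd_diff / sd_control >= fold:
            selected[rxn] = sd_diff / sd_control
    return selected

# Made-up log2 fold changes within HaCaT (control) and HeLa vs. HaCaT.
control_lfc = {"PFK": [-0.1, 0.0, 0.1], "TKT": [-0.5, 0.0, 0.5]}
differential_lfc = {"PFK": [2.6, 3.0, 3.4], "TKT": [0.5, 1.0, 1.5]}

selected = increased_heterogeneity(control_lfc, differential_lfc)
print(selected)
```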

Because in our results phosphofructokinase up-regulation is the most prominent change necessary for proliferation, we suggest the mechanism illustrated in Fig. 5D, where HeLa cells divert most of their metabolism towards glycolysis and use the pentose phosphate pathway only when a higher respiration requires it.

Discussion

As we have shown, differential k-cone analysis is capable of suggesting regulated enzymes in the transition to cancer for the central carbon metabolism of HeLa cells. We feel that this method is particularly well-suited to examining the regulation of non-essential enzymes or enzymes that are not visibly affected on the genome level, as it detects regulation on the metabolome level, explicitly including post-transcriptional regulation events. Analysis based on the k-cone, as performed here, combines well with existing methods such as FBA or control theory by integrating data from the metabolome and thus giving a more appropriate description of the phenotype⁴⁸. In particular, the quantities used in the k-cone method are very similar to FBA and create the possibility of incorporating prior knowledge in the form of the matrix W containing the flux ratios.

However, it is also important to note the limitations of our results. At least in the analysis we performed here, the method assumed the kinetics to be governed by the mass-action law, which is, at best, an approximation of the most likely more complex underlying kinetics. Another limitation is the lack of measurements for some of the metabolites included in the model. The analyses, as we performed them here, aimed to be conservative, meaning that we treated cases with missing data as non-differential.

Our results suggest that the activities of many enzymes in the central carbon metabolism of HeLa cells are similar to the activities found in the non-cancerous HaCaT cells. However, there is a small set of enzymes whose activities differ between the two cell lines, and those changes do not seem to affect the stability of the system. Those regulation events can be further subdivided into a small set of necessary regulations required for proliferation and a slightly larger set of enzymes that can be regulated on an on-demand basis. Here, the up-regulation of phosphofructokinase (PFK) seems to be the major driver for maintaining proliferation, which explains the requirement of the Warburg effect, as a high concentration of lactate, citrate or unused ATP can allosterically inhibit phosphofructokinase^46,49. As such, it might be beneficial for HeLa cells to limit the production of ATP in the TCA cycle to maintain a more active PFK. One of the optional regulation events in HeLa cells is a strong regulation of the entry and exits of the pentose phosphate pathway. This regulation caters to the needs of the cancer by draining the pentose phosphate pathway into ribonucleotide and glycolytic precursors while simultaneously producing NADPH, which is required to alleviate oxidative stress. Interestingly, our results propose an up-regulation of almost all enzymes in the central carbon metabolism using fructose 6-phosphate as a substrate, which might be associated with its necessity for glycolysis and the production of nucleotide precursors (compare Fig. 4). In our analysis, mitochondrial and oxidative stress enzyme activities show high variation in individual HeLa cell cultures and are correlated with pentose phosphate pathway enzyme activity (Fig. 5). Additionally, some of the genes associated with oxidative stress are among the most down-regulated ones (compare Fig. 5A and Table S5). This finding suggests a low tolerance of HeLa cells to oxidative stress in the default state, as most of the glucose-6-phosphate is diverted into glycolysis by a more active PFK. However, this low tolerance can be counteracted by a high fidelity in diverting flux into the pentose phosphate pathway. There is some evidence that fidelity to oxidative stress is indeed due to the precedence of faster metabolic regulations⁵⁰. However, at this stage, it is impossible to say whether this is an observation specific to the comparison of HeLa to HaCaT cells or a general mechanism in various human cancers. There is some slight evidence of abnormal transaldolase activity in cancer, but not to the extent that we observed here^51,52. Finally, experiments in yeast and B. subtilis suggest that large parts of the central carbon metabolism are regulated on a post-transcriptional level rather than on a transcriptional level, which seems to be the case particularly for the pentose phosphate pathway and late glycolysis, which we also find strongly regulated here^22,53,54,55. Further investigation of this putative phenomenon could be of medical interest. Many of the more severe treatment options such as chemotherapy rely upon increasing oxidative stress in cancer cells. Thus, optional signatures of enzyme activity indicating a strong ability to combat oxidative stress via the pentose phosphate pathway might have consequences for the treatment options of those particular cancers.

We also observed that the metabolic differences between cancer and healthy cells are caused by the alteration of only a few enzymes. This result stands in contrast to mRNA measurements, which often suggest that cancer alters the expression of the majority of enzymes in the central carbon metabolism⁷. Thus, it seems that not all genomic aberrations are capable of affecting the dynamics of the underlying metabolic network sufficiently and this result further outlines the necessity of methods that can map regulatory events in cancer to effects on metabolite abundances, which are closely connected to the resulting phenotype. Consequently, it would be worthwhile to combine the presented methods with existing omics data. For instance, one could study whether certain signatures in mRNA expression or protein abundance changes are associated with a particular change in enzyme activity. We feel that such a combination of methodologies could yield insights into the metabolic state of cancer cells and help understand their ruling principles, elucidate the heterogeneous causes of cancer and potentially identify new targets to halt or delay cancer progression.

Materials and Methods

Metabolome measurements

Measurement of the ionic metabolites was performed using a CE-MS system. The HeLa cell line was provided by the oncology laboratory of the Centro Medico Siglo XXI, which belongs to the Instituto Mexicano del Seguro Social. The HaCaT cell line was donated by the Centro de Investigación Sobre Enfermedades Infecciosas, which belongs to the Instituto Nacional de Salud Pública. Cell lines were cultured in RPMI-advanced as previously described by our group⁵⁶. For each of the two cell lines, we obtained three biological replicates. First, 5 · 10⁶ cells were harvested at 70% confluence with 2 ml MeOH including the internal standards. Then, 1.6 ml of cell suspension was transferred to microcentrifuge tubes containing 1.6 ml of CHCl₃ plus 640 μl of milliQ water, vortexed and then centrifuged at 2300 g and 4 °C for 5 minutes. Next, 1.5 ml of the transferred aqueous layer was filtered through a Millipore 5-kDa cutoff filter and evaporated to dryness using a centrifugal evaporator. The measurement and quantification of extracted metabolites were performed by a commercial provider using a capillary electrophoresis (CE) system connected to an ESI-TOF-MS with an electrophoresis buffer (Solution ID H3302-1021, Human Metabolome Technologies Inc., Tsuruoka, Japan).

Abundances were transformed into concentrations by dividing by the total volume of the 10⁶ cells, assuming an individual cell volume of 1.54 fl⁵⁷. Hydrogen was not considered in the model due to lack of concentration (or intracellular pH) measurements.
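
The conversion described above amounts to a single division. The following minimal sketch assumes that abundances are given as amounts in mol per 10⁶ cells; the unit, the function name and the example value are illustrative assumptions and are not taken from the dycone package.

```python
def abundance_to_concentration(amount_mol, n_cells=1e6, cell_volume_l=1.54e-15):
    """Convert a measured amount (mol, assumed unit) into a concentration in mol/l.

    The default cell volume corresponds to 1.54 fl, the value stated in the text.
    """
    total_volume_l = n_cells * cell_volume_l  # total volume of all cells
    return amount_mol / total_volume_l


# Made-up amount, chosen only to illustrate the arithmetic.
concentration = abundance_to_concentration(1.54e-12)
print(f"{concentration:.3e} mol/l")
```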

Missing metabolite concentrations required in the model were imputed in multiple steps. First, missing data were imputed from measurements within the same cell line, followed by imputation across the two cell lines. Concentrations for metabolites that could not be detected in either of the two cell lines were obtained from the Human Metabolome database, primarily using cytosolic concentration measurements and falling back to blood measurements if cytosolic measurements were not available⁵⁸. The actual values of those imputed concentrations were only of importance for the stability analysis described below. During differential analysis, metabolites with missing measurements for either cell line were assigned fold-changes of one due to the nature of the imputation procedure. As such, we made the implicit assumption that missing metabolite concentrations did not change across the two cell lines.
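
The stepwise imputation can be sketched as follows. The sketch assumes that gaps are filled with the mean of the available values at each step, which is an assumption made for illustration; the metabolite labels, values and the `reference` dictionary are hypothetical stand-ins for the measurements and the database lookup.

```python
from statistics import mean


def impute_concentrations(measurements, reference):
    """Fill missing replicates (None) in {metabolite: {cell_line: [values]}}.

    Step 1: use the mean of the observed replicates of the same cell line.
    Step 2: otherwise use the mean over the cell lines that have data.
    Step 3: otherwise fall back to an external reference value.
    """
    imputed = {}
    for met, by_line in measurements.items():
        line_means = {}
        for line, reps in by_line.items():
            observed = [x for x in reps if x is not None]
            line_means[line] = mean(observed) if observed else None
        available = [m for m in line_means.values() if m is not None]
        fallback = mean(available) if available else reference[met]
        imputed[met] = {}
        for line, reps in by_line.items():
            fill = line_means[line] if line_means[line] is not None else fallback
            imputed[met][line] = [fill if x is None else x for x in reps]
    return imputed


# Hypothetical toy data; "m3" was detected in neither cell line.
data = {
    "m1": {"HeLa": [1.0, None, 3.0], "HaCaT": [2.0, 2.0, 2.0]},
    "m2": {"HeLa": [None, None, None], "HaCaT": [4.0, 4.0, 4.0]},
    "m3": {"HeLa": [None, None, None], "HaCaT": [None, None, None]},
}
filled = impute_concentrations(data, reference={"m3": 5.0})
```

With this scheme a metabolite missing from one cell line receives identical values in both, so its fold-change is one, in line with the assumption stated above.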

Data availability and reproducibility

The raw data used for the analysis are provided in Supplementary Tables S1, S2 and S4. The methods of data handling, optimization and analysis were implemented in the dycone R package available at https://github.com/cdiener/dycone together with installation instructions (doi: 10.5281/zenodo.49987). A detailed protocol describing the steps taken to generate all figures and results in this paper is given in the Supplementary Text. To adhere to Open Science standards, the protocol is also available as an R Markdown document together with the raw data files at https://github.com/cdiener/kcone-paper and can be used to reproduce all analyses in this manuscript.

Model specification and k-cone calculations

The underlying kinetic model was obtained by extending a previously published model that had been validated by experimental data. The model was updated by annotating all reactions with their respective IDs from KEGG and adjusting the hydrogen balances to coincide with the ones reported in KEGG. We added additional reactions summarizing core mechanisms such as the neutralization of peroxide, import of glutamine, as well as the production of ATP and reduction of NADH in the mitochondria. Appropriate exchange reactions were added to molecules that could either be produced or consumed by metabolic processes not included in the model or obtained from the extracellular environment. The complete model specification can be found in Table S1, which is also the exact file read to generate the presented results.

A complete mathematical derivation of the formalism can be found in the Supplementary Text. The k-cones for various metabolite measurements are obtained from the flux cone V. Because the flux cone equations define the H-representation of a polyhedral cone, obtaining the basis for the flux cone is equivalent to the vertex enumeration problem, which was solved by the method of Fukuda et al.⁵⁹ as implemented in the rcdd package (https://cran.r-project.org/web/packages/rcdd) on an H-representation with redundancies removed. Basis elements were normalized to unit length. The non-reduced individual k-cones were calculated as M⁻¹V, where M denotes the diagonal matrix with the mass-action terms on its diagonal. The k-cones which were additionally constrained by equilibrium constants were calculated individually by adding the respective equilibrium constraints to the k-cone equation (see Supplementary Text). Visualization of the k-cones was performed by first performing dimensionality reduction using principal component analysis, followed by k-means clustering of the reduced vectors to avoid overlap for the k-cones not constrained by equilibrium constants. The convex hull, representing the shadow cast by the k-cone into the lower dimension, was calculated by identifying the set of non-redundant vectors in the reduced polytope.
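
As a small illustration of the step M⁻¹V, the sketch below scales a given flux cone basis by the inverse mass-action terms. It assumes plain mass-action kinetics in which each term is the product of substrate concentrations raised to their stoichiometries; the three-reaction chain, the concentrations and all function names are hypothetical, and the vertex enumeration itself is not shown.

```python
from math import prod, sqrt


def mass_action_terms(substrates, concentrations):
    """One term per reaction: product of substrate concentrations^stoichiometry.

    A reaction without substrates (for example an import) gets the term 1.
    """
    return [prod(concentrations[m] ** n for m, n in subs.items())
            for subs in substrates]


def normalize(vec):
    """Scale a basis vector to unit Euclidean length."""
    norm = sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def k_cone_basis(flux_basis, terms):
    """Apply M^-1 to each flux basis vector (M is diagonal, so divide entrywise)."""
    return [[v / m for v, m in zip(vec, terms)] for vec in flux_basis]


# Hypothetical chain: (import) -> A -> B -> (export), with one flux basis vector.
conc = {"A": 2.0, "B": 4.0}
substrates = [{}, {"A": 1}, {"B": 1}]
terms = mass_action_terms(substrates, conc)
flux_basis = [normalize([1.0, 1.0, 1.0])]
k_basis = k_cone_basis(flux_basis, terms)
```

Multiplying each kinetic constant in `k_basis` by its mass-action term recovers the original steady-state flux, which is a quick consistency check for the scaling.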

The stability of the k-cone was obtained from calculations as detailed in the Supplementary Text. Derivatives for the mass-action kinetics were derived analytically for each reaction, and the Jacobian matrix was constructed for each of the basis vectors of the corresponding flux cone. The stability for one basis vector was then evaluated based on the eigenvalues of the Jacobian matrices. The basis vector was identified as stable if all eigenvalues were smaller than −ε (where ε denotes the double-precision machine accuracy) and unstable if at least one eigenvalue was larger than ε.
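
The decision rule on the eigenvalues can be sketched as follows. The sketch takes the eigenvalues as given (their computation from the Jacobian is not shown) and compares real parts, which is an assumption for the case of complex eigenvalues; the label "undetermined" for the remaining case is also an illustrative choice.

```python
import sys

EPS = sys.float_info.epsilon  # double-precision machine epsilon


def classify_stability(eigenvalues, eps=EPS):
    """Classify one basis vector from the eigenvalues of its Jacobian."""
    real_parts = [complex(ev).real for ev in eigenvalues]
    if all(r < -eps for r in real_parts):
        return "stable"
    if any(r > eps for r in real_parts):
        return "unstable"
    # At least one eigenvalue lies within [-eps, eps] and none exceeds eps.
    return "undetermined"


print(classify_stability([-1.0, -0.5]))
print(classify_stability([-1.0, 0.2]))
print(classify_stability([-1.0, 0.0]))
```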

Optimization and differential analysis

Approximations of changes in enzyme activities were obtained from the transformation T by calculating its log-diagonal, log₂ diag(T) = log₂ (k₂/k₁) for T = M₁M₂⁻¹, as derived in the Supplementary Text.
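
Because M₁ and M₂ are diagonal, the log-diagonal of T reduces to an entrywise ratio of mass-action terms. A minimal sketch with made-up terms for three reactions (the function name and values are illustrative only):

```python
from math import log2


def log_fold_changes(terms_1, terms_2):
    """log2 diag(T) for T = M1 M2^-1, i.e. log2(m1_i / m2_i) per reaction."""
    return [log2(m1 / m2) for m1, m2 in zip(terms_1, terms_2)]


# Made-up mass-action terms for phenotypes 1 and 2.
lfc = log_fold_changes([4.0, 1.0, 2.0], [1.0, 1.0, 8.0])
print(lfc)  # [2.0, 0.0, -2.0]
```

A mass-action term that drops in phenotype 2 therefore yields a positive value, that is, a larger kinetic constant is required in phenotype 2 to sustain the same flux.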

Necessity for proliferation was assessed by first appending a biomass reaction to the model and identifying the maximum permissible flux through that reaction. The biomass reaction was adapted from Recon 2⁶⁰. Here, we mapped metabolites or precursors from our model to the metabolites included in the Recon 2 model (version 2.02, also see Supplementary Text). In the case that one precursor could produce several metabolites in Recon 2, we used the maximum stoichiometry of the associated products. Given the resulting biomass reaction flux v_bm, this procedure gave rise to the following linear program for the flux balance analysis¹⁸ of the fluxes v_i:
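
In standard flux balance notation, a program of this kind can be written as shown next. The symbols S (stoichiometric matrix of the model) and v_ub (common upper bound) are introduced here for illustration and are assumptions rather than the notation of the original display.

\[
\begin{aligned}
\max_{v} \quad & v_{bm} \\
\text{s.t.} \quad & S\,v = 0, \\
& 10^{-16} \le v_i \le v_{ub} \quad \text{for all } i.
\end{aligned}
\]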

Here, the upper bound could be chosen arbitrarily, as later calculations used only the flux ratios, which are invariant to the upper bound. Lower bounds for the fluxes were chosen as 10⁻¹⁶ to ensure a non-zero flux for each reaction, as all reactions of the central carbon metabolism should be active in the cell lines used.

After obtaining the maximum biomass flux v_max, the upper and lower flux limits were obtained by solving two linear programming problems for each flux v_i:
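
Under the usual formulation of flux variability analysis, and assuming that the biomass flux is held at its optimum v_max, the two problems for a flux v_i take the following form. S (stoichiometric matrix) and v_ub (common upper bound) are illustrative symbols introduced here, not the notation of the original display.

\[
v_i^{\min} = \min_{v} v_i
\quad \text{and} \quad
v_i^{\max} = \max_{v} v_i
\qquad \text{s.t.} \quad S\,v = 0, \quad v_{bm} = v_{max}, \quad 10^{-16} \le v_j \le v_{ub} \ \text{for all } j.
\]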

The largest absolute log-fold change that could be explained by flux variability analysis could now be obtained as

Finally, all individual log-fold changes obtained from the EDAs whose absolute value exceeded that of lfc_max were deemed necessary for proliferation.
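
The final comparison can be sketched as follows. The sketch assumes that lfc_max is taken as the largest absolute log₂ ratio between the upper and lower flux limits over all reactions, which is an assumption about the omitted expression; reaction names and numbers are hypothetical.

```python
from math import log2


def max_explainable_lfc(flux_ranges):
    """Largest |log2(v_max / v_min)| over all (v_min, v_max) pairs (assumed form)."""
    return max(abs(log2(hi / lo)) for lo, hi in flux_ranges)


def necessary_for_proliferation(lfcs, lfc_max):
    """Flag reactions whose absolute log-fold change exceeds lfc_max."""
    return {rxn: abs(lfc) > lfc_max for rxn, lfc in lfcs.items()}


# Hypothetical flux limits and log-fold changes.
lfc_max = max_explainable_lfc([(1.0, 2.0), (0.5, 4.0)])
flags = necessary_for_proliferation({"r1": 3.5, "r2": -1.0, "r3": -4.0}, lfc_max)
```

Strictly positive lower flux bounds keep the ratio inside the logarithm well defined.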

All log-fold changes obtained by the transformation T were handled in the same way. Significance measures were obtained by performing Welch t-tests on the log-transformed mass-action terms after validating their normal distributions (Fig. S1). Sample variances were estimated using the empirical Bayes method as implemented in limma³⁶,³⁷. P-values obtained for all reactions were finally adjusted to q-values (false discovery cutoffs) by the method of Benjamini-Hochberg⁶¹. As an alternative to hypothesis testing, we also estimated log-fold changes by a combinatorial method. First, control log-fold changes were obtained from all permutations of HaCaT samples, yielding 6 control log-fold changes for each reaction with a zero mean. Log-fold changes were also obtained for all combinations of a HeLa sample with a HaCaT sample, yielding 9 differential but possibly interdependent log-fold changes. Those combinatorial estimates were then used to obtain 95% credible intervals using the Bayesian bootstrap⁶². The 95% credible interval denotes the interval that contains the true log-fold change with 95% probability. The obtained p-values, as well as the mean log-fold changes, worst-case estimates, 95% credible intervals and standard deviations for the EDAs are reported in Table S3.
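
The combinatorial estimates and the Bayesian bootstrap can be sketched as follows for a single reaction. The inputs are assumed to be log₂-transformed values for three replicates per cell line; the sign convention, the number of bootstrap draws, the use of the weighted mean as the bootstrapped statistic and all names and numbers are illustrative assumptions.

```python
import random
from itertools import permutations, product


def control_lfcs(ref_logs):
    """Differences between all ordered pairs of distinct reference samples."""
    return [a - b for a, b in permutations(ref_logs, 2)]


def differential_lfcs(case_logs, ref_logs):
    """Differences for every pairing of a case sample with a reference sample."""
    return [c - r for c, r in product(case_logs, ref_logs)]


def bayes_bootstrap_ci(values, n_draws=2000, level=0.95, seed=0):
    """Credible interval for the mean from Dirichlet(1, ..., 1) weighted means."""
    rng = random.Random(seed)
    means = []
    for _ in range(n_draws):
        # Normalized exponential draws are Dirichlet(1, ..., 1) weights.
        g = [rng.expovariate(1.0) for _ in values]
        total = sum(g)
        means.append(sum(w * x for w, x in zip(g, values)) / total)
    means.sort()
    lo = means[int((1 - level) / 2 * n_draws)]
    hi = means[int((1 + level) / 2 * n_draws) - 1]
    return lo, hi


# Made-up log2 values for three reference and three case replicates.
ref = [1.0, 2.0, 4.0]
case = [3.0, 5.0, 6.0]
ctrl = control_lfcs(ref)
diff = differential_lfcs(case, ref)
ci_low, ci_high = bayes_bootstrap_ci(diff)
```

The six control values cancel pairwise, which is why their mean is zero by construction.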

Gene expression and coregulation

Fifty-eight microarray samples were manually selected from the GEO database by selecting for samples from a single platform (HGU133Plus 2.0), in untreated conditions and only for the previously used cell lines (HaCaT, keratinocytes and HeLa). The analysis of gene expression was performed by normalizing the raw data for the 58 samples by Frozen Robust Multiarray Analysis (fRMA), followed by differential analysis using the limma package, particularly its empirical Bayes method³⁶,⁶³. The exact protocol for the analysis can again be found in the Supplementary Text.

Standard deviations for log-fold changes of enzyme activities were obtained from the 9 possible HeLa/HaCaT sample combinations of log-fold changes as used in the approximation of credible intervals. To obtain an estimate for the standard deviation in untreated conditions we used the control samples obtained from the paired permutations of HaCaT samples, as previously described. Given the control standard deviation σ_c and the differential standard deviation σ_d (obtained from the log-fold change samples described before), a reaction was considered differentially heterogeneous if σ_d/σ_c > 3, based on the visualization in Fig. 5B that showed that the majority of reactions fell into that margin. Correlation between the selected heterogeneous enzymes was calculated by Pearson correlation between the corresponding 9 HeLa/HaCaT log-fold changes.
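
The heterogeneity criterion and the correlation step can be sketched as follows. The sketch assumes the sample standard deviation (n − 1 denominator), which is an assumption; all values and names are made up for illustration.

```python
from math import sqrt
from statistics import mean, stdev


def is_heterogeneous(diff_lfcs, ctrl_lfcs, ratio=3.0):
    """True if sigma_d / sigma_c exceeds the chosen ratio (3 in the text)."""
    return stdev(diff_lfcs) / stdev(ctrl_lfcs) > ratio


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


# Made-up log-fold changes: 6 control values, 9 differential values per reaction.
ctrl = [-0.1, 0.1, -0.2, 0.2, -0.1, 0.1]
diff_a = [1.0, 2.0, 3.0, 1.5, 2.5, 3.5, 0.5, 4.0, 2.0]
diff_b = [0.1, -0.1, 0.2, 0.0, 0.1, -0.2, 0.0, 0.1, -0.1]
wide = is_heterogeneous(diff_a, ctrl)
narrow = is_heterogeneous(diff_b, ctrl)
r = pearson(diff_a, [2 * v + 1 for v in diff_a])
```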

Additional Information

How to cite this article: Diener, C. et al. The space of enzyme regulation in HeLa cells can be inferred from its intracellular metabolome. Sci. Rep. 6, 28415; doi: 10.1038/srep28415 (2016).

References

1. Kim, J. W. & Dang, C. V. Cancer’s molecular sweet tooth and the Warburg effect. Cancer Research 66, 8927–8930 (2006).
2. Vander Heiden, M. G., Cantley, L. C. & Thompson, C. B. Understanding the Warburg Effect: The Metabolic Requirements of Cell Proliferation. Science 324, 1029–1033 (2009).
3. Verhaak, R. G. W. et al. Integrated Genomic Analysis Identifies Clinically Relevant Subtypes of Glioblastoma Characterized by Abnormalities in PDGFRA, IDH1, EGFR and NF1. Cancer Cell 17, 98–110 (2010).
4. Muzny, D. M. et al. Comprehensive molecular characterization of human colon and rectal cancer. Nature 487, 330–337 (2012).
5. Koboldt, D. C. et al. Comprehensive molecular portraits of human breast tumours. Nature 490, 61–70 (2012).
6. Meacham, C. E. & Morrison, S. J. Tumour heterogeneity and cancer cell plasticity. Nature 501, 328–337 (2013).
7. Hu, J. et al. Heterogeneity of tumor-induced gene expression changes in the human metabolic network. Nat. Biotechnol. 31, 522–9 (2013).
8. Bode, A. M. & Dong, Z. Post-translational modification of p53 in tumorigenesis. Nat. Rev. Cancer 4, 793–805 (2004).
9. Thomson, J. M. Extensive post-transcriptional regulation of microRNAs and its implications for cancer. Genes Dev. 20, 2202–2207 (2006).
10. Hitosugi, T. & Chen, J. Post-translational modifications and the Warburg effect. Oncogene 33, 4279–4285 (2014).
11. Lindsley, J. E. & Rutter, J. Whence cometh the allosterome? Proc. Natl. Acad. Sci. 103, 10533–10535 (2006).
12. Chiarugi, A., Dölle, C., Felici, R. & Ziegler, M. The NAD metabolome — a key determinant of cancer cell biology. Nat. Rev. Cancer 12, 741–752 (2012).
13. Aboud, O. A. & Weiss, R. H. New Opportunities from the Cancer Metabolome. Clin. Chem. 59, 138–146 (2013).
14. Cairns, R. A., Harris, I. S. & Mak, T. W. Regulation of cancer cell metabolism. Nat. Rev. Cancer 11, 85–95 (2011).
15. Sreekumar, A. et al. Metabolomic profiles delineate potential role for sarcosine in prostate cancer progression. Nature 457, 910–914 (2009).
16. Zelezniak, A., Sheridan, S. & Patil, K. R. Contribution of Network Connectivity in Determining the Relationship between Gene Expression and Metabolite Concentration Changes. PLoS Comput. Biol. 10, e1003572 (2014).
17. Bordbar, A. et al. Personalized Whole-Cell Kinetic Models of Metabolism for Discovery in Genomics and Pharmacodynamics. Cell Syst. 1, 283–292 (2015).
18. Orth, J. D., Thiele, I. & Palsson, B. Ø. What is flux balance analysis? Nat. Biotechnol. 28, 245–248 (2010).
19. Agren, R. et al. Reconstruction of Genome-Scale Active Metabolic Networks for 69 Human Cell Types and 16 Cancer Types Using INIT. PLoS Comput. Biol. 8, e1002518 (2012).
20. Agren, R. et al. Identification of anticancer drugs for hepatocellular carcinoma through personalized genome-scale metabolic modeling. Mol. Syst. Biol. 10, 721–721 (2014).
21. Yizhak, K., Chaneton, B., Gottlieb, E. & Ruppin, E. Modeling cancer metabolism on a genome scale. Mol. Syst. Biol. 11, 817–817 (2015).
22. Raamsdonk, L. M. et al. A functional genomics strategy that uses metabolome data to reveal the phenotype of silent mutations. Nat. Biotechnol. 19, 45–50 (2001).
23. Machado, D. & Herrgård, M. Systematic Evaluation of Methods for Integration of Transcriptomic Data into Constraint-Based Models of Metabolism. PLoS Comput. Biol. 10, e1003580 (2014).
24. Famili, I., Mahadevan, R. & Palsson, B. O. k-Cone Analysis: Determining All Candidate Values for Kinetic Parameters on a Network Scale. Biophys. J. 88, 1616–1625 (2005).
25. Schellenberger, J. & Palsson, B. O. Use of Randomized Sampling for Analysis of Metabolic Networks. J. Biol. Chem. 284, 5457–5461 (2009).
26. Resendis-Antonio, O. Filling Kinetic Gaps: Dynamic Modeling of Metabolism Where Detailed Kinetic Information Is Lacking. PLoS One 4, e4967 (2009).
27. Yizhak, K. et al. Phenotype-based cell-specific metabolic modeling reveals metabolic liabilities of cancer. Elife 3 (2014).
28. Posakony, J. W., England, J. M. & Attardi, G. Mitochondrial growth and division during the cell cycle in HeLa cells. J. Cell Biol. 74, 468–491 (1977).
29. Boukamp, P. Normal keratinization in a spontaneously immortalized aneuploid human keratinocyte cell line. J. Cell Biol. 106, 761–771 (1988).
30. Shi, Q. & King, R. W. Chromosome nondisjunction yields tetraploid rather than aneuploid cells in human cell lines. Nature 437, 1038–1042 (2005).
31. Hirayama, A. et al. Quantitative Metabolome Profiling of Colon and Stomach Cancer Microenvironment by Capillary Electrophoresis Time-of-Flight Mass Spectrometry. Cancer Res. 69, 4918–4925 (2009).
32. Tang, X. et al. A joint analysis of metabolomics and genetics of breast cancer. Breast Cancer Res. 16, 415 (2014).
33. Jain, M. et al. Metabolite Profiling Identifies a Key Role for Glycine in Rapid Cancer Cell Proliferation. Science 336, 1040–1044 (2012).
34. Resendis-Antonio, O., Checa, A. & Encarnación, S. Modeling core metabolism in cancer cells: Surveying the topology underlying the Warburg effect. PLoS One 5 (2010).
35. Noor, E., Haraldsdóttir, H. S., Milo, R. & Fleming, R. M. T. Consistent Estimation of Gibbs Energy Using Component Contributions. PLoS Comput. Biol. 9, e1003098 (2013).
36. Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47–e47 (2015).
37. Smyth, G. K. Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments. Stat. Appl. Genet. Mol. Biol. 3, 1–25 (2004).
38. Mahadevan, R. & Schilling, C. H. The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab. Eng. 5, 264–276 (2003).
39. Yi, W. et al. Phosphofructokinase 1 Glycosylation Regulates Cell Growth and Metabolism. Science 337, 975–980 (2012).
40. Webb, B. A. et al. Structures of human phosphofructokinase-1 and atomic basis of cancer-associated mutations. Nature 523, 111–114 (2015).
41. Christofk, H. R. et al. The M2 splice isoform of pyruvate kinase is important for cancer metabolism and tumour growth. Nature 452, 230–233 (2008).
42. Vander Heiden, M. G. et al. Evidence for an Alternative Glycolytic Pathway in Rapidly Proliferating Cells. Science 329, 1492–1499 (2010).
43. Chan, B., VanderLaan, P. A. & Sukhatme, V. P. 6-Phosphogluconate dehydrogenase regulates tumor cell migration in vitro by regulating receptor tyrosine kinase c-Met. Biochem. Biophys. Res. Commun. 439, 247–251 (2013).
44. Gruer, M. J., Artymiuk, P. J. & Guest, J. R. The aconitase family: Three structural variations on a common theme. Trends in Biochemical Sciences 22, 3–6 (1997).
45. Tsui, K. H. et al. Hypoxia upregulates the gene expression of mitochondrial aconitase in prostate carcinoma cells. J. Mol. Endocrinol. 51, 131–141 (2013).
46. Kemp, R. G. & Foe, L. G. Allosteric regulatory properties of muscle phosphofructokinase. Mol. Cell. Biochem. 57, 147–154 (1983).
47. Barrett, T. et al. NCBI GEO: archive for functional genomics data sets–update. Nucleic Acids Res. 41, D991–D995 (2013).
48. Hernández Patiño, C. E., Jaime-Muñoz, G. & Resendis-Antonio, O. Systems biology of cancer: moving toward the integrative study of the metabolic alterations in cancer cells. Front. Physiol. 3, 481 (2012).
49. Hasawi, N. Al., Alkandari, M. F. & Luqmani, Y. A. Phosphofructokinase: A mediator of glycolytic flux in cancer progression. Crit. Rev. Oncol. Hematol. 92, 312–321 (2014).
50. Ralser, M. et al. Metabolic reconfiguration precedes transcriptional regulation in the antioxidant response. Nat. Biotechnol. 27, 604–605 (2009).
51. Basta, P. et al. Genetic variation in Transaldolase 1 and risk of squamous cell carcinoma of the head and neck. Cancer Detect. Prev. 32, 200–208 (2008).
52. Wang, C. et al. Identification of transaldolase as a novel serum biomarker for hepatocellular carcinoma metastasis using xenografted mouse model and clinic samples. Cancer Lett. 313, 154–166 (2011).
53. Daran-Lapujade, P. et al. The fluxes through glycolytic enzymes in Saccharomyces cerevisiae are predominantly regulated at posttranscriptional levels. Proc. Natl. Acad. Sci. USA 104, 15753–15758 (2007).
54. Link, H., Kochanowski, K. & Sauer, U. Systematic identification of allosteric protein-metabolite interactions that control enzyme activity in vivo. Nat. Biotechnol. 31, 357–361 (2013).
55. Chubukov, V. et al. Transcriptional regulation is insufficient to explain substrate-induced flux changes in Bacillus subtilis. Mol. Syst. Biol. 9, 709–709 (2014).
56. Higareda-Almaraz, J., Enríquez-Gasca, M., Hernández-Ortiz, M., Resendis-Antonio, O. & Encarnación-Guevara, S. Proteomic patterns of cervical cancer cell lines, a network perspective. BMC Syst. Biol. 5, 96 (2011).
57. Zhao, L. et al. Intracellular water-specific MR of microbead-adherent cells: The HeLa cell intracellular water exchange lifetime. NMR Biomed. 21, 159–164 (2008).
58. Wishart, D. S. et al. HMDB 3.0–The Human Metabolome Database in 2013. Nucleic Acids Res. 41, D801–D807 (2013).
59. Avis, D. & Fukuda, K. A pivoting algorithm for convex hulls and vertex enumeration of arrangements and polyhedra. Discrete Comput. Geom. 8, 295–313 (1992).
60. Thiele, I. et al. A community-driven global reconstruction of human metabolism. Nat. Biotechnol. 31, 419–425 (2013).
61. Benjamini, Y. & Hochberg, Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. R. Stat. Soc. Ser. B 57, 289–300 (1995).
62. Rubin, D. B. The bayesian bootstrap. Ann. Stat. 9, 130–134 (1981).
63. McCall, M. N., Bolstad, B. M. & Irizarry, R. A. Frozen robust multiarray analysis (fRMA). Biostatistics 11, 242–53 (2010).

Acknowledgements

The authors acknowledge financial support from the Research Chair on Systems Biology (INMEGEN-FUNTEL Mexico) and from an internal grant of the National Institute of Genomic Medicine. FMG is funded through a Ph.D. scholarship of the CONACyT. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The authors are grateful for the comments and suggestions of two anonymous referees during the review process of this paper.

Author information

Authors and Affiliations

Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, 14610, Mexico
Christian Diener, Felipe Muñoz-Gonzalez & Osbaldo Resendis-Antonio
Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, 62210, México
Sergio Encarnación
Coordinación de la Investigación Científica - Red de Apoyo a la Investigación UNAM, Mexico
Osbaldo Resendis-Antonio

Contributions

C.D. and O.R.A. developed the methods. C.D. performed the analysis, S.E. performed the experiments and F.M.G. analyzed the gene expression data. C.D. and O.R.A. wrote the paper.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Supplementary Dataset 1

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

About this article

Cite this article

Diener, C., Muñoz-Gonzalez, F., Encarnación, S. et al. The space of enzyme regulation in HeLa cells can be inferred from its intracellular metabolome. Sci Rep 6, 28415 (2016). https://doi.org/10.1038/srep28415

Received: 17 December 2015
Accepted: 31 May 2016
Published: 23 June 2016
DOI: https://doi.org/10.1038/srep28415
