Systematic identification of metabolites controlling gene expression in E. coli

Lempp, Martin; Farke, Niklas; Kuntz, Michelle; Freibert, Sven Andreas; Lill, Roland; Link, Hannes

doi:10.1038/s41467-019-12474-1


Article
Open access
Published: 02 October 2019

Nature Communications volume 10, Article number: 4463 (2019) Cite this article


Abstract

Metabolism controls gene expression through allosteric interactions between metabolites and transcription factors. These interactions are usually measured with in vitro assays, but there are no methods to identify them at a genome-scale in vivo. Here we show that dynamic transcriptome and metabolome data identify metabolites that control transcription factors in E. coli. By switching an E. coli culture between starvation and growth, we induce strong metabolite concentration changes and gene expression changes. Using Network Component Analysis we calculate the activities of 209 transcriptional regulators and correlate them with metabolites. This approach captures, for instance, the in vivo kinetics of CRP regulation by cyclic-AMP. By testing correlations between all pairs of transcription factors and metabolites, we predict putative effectors of 71 transcription factors, and validate five interactions in vitro. These results show that combining transcriptomics and metabolomics generates hypotheses about metabolism-transcription interactions that drive transitions between physiological states.

Introduction

Transcriptional regulation of metabolism is well characterized regarding the canonical flow of genetic information, which considers how transcription modulates the abundance of enzymes, and thereby metabolic flux and metabolites^1,2,3,4. In reverse, metabolites convey information back to the transcription network by directly or indirectly interacting with a transcription factor (TF)^5,6,7,8,9 (Fig. 1a). In Escherichia coli, for example, the amino acid arginine allosterically regulates the activity of ArgR, which is a TF that controls genes involved in arginine biosynthesis, but the total regulon includes more than 400 genes¹⁰. Allosteric TF regulation allows a cell to tune gene expression depending on its metabolic state and theory shows that this regulation is robust and predictable by models¹¹. An important consequence of allosteric TF regulation is that metabolites are not just biomass building blocks but they serve as internal signals with the potential to actively drive transitions between different physiological states.

It is largely unexplored which of the many intracellular metabolites interact with TFs^12,13, yet many transcriptional regulators are expected to bind a small molecule⁵. Currently, a major obstacle to filling this knowledge gap is the lack of methods to identify the most functionally relevant metabolite–TF interactions that control gene expression in vivo. Detection of physical interactions between metabolites and transcriptional regulators is mainly based on in vitro assays, which are low-throughput, feasible for only certain compounds, and unable to assay combinatorial effects¹⁴. An alternative approach is to probe protein structural changes with proteomics, which can detect binding of a single metabolite across thousands of proteins in cell extracts, but this approach cannot distinguish unspecific binding from interactions that are functional in vivo¹⁵. An in vivo approach has been proposed, which searches for correlations between metabolomics data and data from fluorescent transcriptional reporters. This method could indeed recover a few of the known metabolites that are relevant for gene regulation of central carbon metabolism in E. coli¹⁶.

Here, we measure the E. coli transcriptome and metabolome changes during a 20 h dynamic transition, and show that integrating these two data-types generates hypotheses about metabolite–TF interactions that may have functional relevance in vivo. We also construct a metabolite–TF network for E. coli from the literature and databases, and show that our approach recovers more than 50% of the interactions in this network that were covered by our data. Moreover, we validate five predicted interactions with in vitro binding assays, i.e. lysine–ArgR, tyrosine–TrpR, glutamate–SgrR, tryptophan–SoxR, and dihydroxyacetone phosphate–DhaR, showing that our methodology generates physiologically meaningful results.

Results

Switching E. coli between growth and carbon starvation

We used a 1 L bioreactor to switch the culture conditions of E. coli between 6 h growth, 12 h carbon starvation, and 2 h growth resumption. First, cells grew on minimal medium with glucose, and when the culture reached an optical density (OD) of 2, we transferred the cells to minimal medium without a carbon source. This rapid medium exchange caused an immediate growth arrest, and cells starved for a period of 12 h (Fig. 1b). After 12 h we added glucose to the culture again, and within 10 min cells resumed growing exponentially (Fig. 1b). Apart from the fast growth resumption, oxygen uptake and CO₂ production also increased rapidly upon glucose addition and reached the same rate as before starvation (Supplementary Fig. 1). Thus, physiological parameters like growth and respiration change in a fast and reversible fashion when E. coli cells enter and exit carbon starvation. Next, we investigated metabolism and transcription during the growth–starvation–growth switch, and measured the concentration of 123 metabolites by LC-MS/MS (Supplementary Fig. 2) and 4242 transcripts by RNA-sequencing (Fig. 1c, Supplementary Data 1 and 2). In total, we collected transcriptomics samples at 29 and metabolomics samples at 35 different time points in duplicates from a single bioreactor (Supplementary Data 3), with average errors of 18% for metabolites and 16% for transcripts. Only 8% of metabolites and 17% of the transcripts did not change significantly in either phase. To explore global dynamics of the metabolome and transcriptome data, we grouped each data set into four clusters (hierarchical clustering, z-score normalized). The clusters showed that the largest group of metabolites (63%) and transcripts (68%) decreased during the starvation phase and increased during the exit phase (Cluster A, Supplementary Fig. 3). This group included intermediates in glycolysis like fructose-bisphosphate, dihydroxyacetone phosphate, and acetyl-CoA, as well as the nucleotide-triphosphates ATP and GTP (Fig. 2). Another group of metabolites and transcripts accumulated during the first 4–6 h into starvation, such as the amino acids lysine and phenylalanine that originate from degradation of proteins¹⁷ (Cluster C, Supplementary Fig. 3). Similarly, accumulation of nucleotide derivatives like hypoxanthine was presumably a consequence of RNA degradation (Fig. 2). These data indicate that starving E. coli cells catabolize RNA and proteins during the early phase of starvation, an interpretation that is consistent with the relatively high production of CO₂ in this phase (Supplementary Fig. 1), and also with the expression of genes involved in RNA, protein, and glycogen degradation processes (Supplementary Fig. 4). Notably, expression of genes in glycogen degradation preceded the expression of genes in RNA and protein degradation (Supplementary Fig. 4), confirming that glycogen functions as short-term energy storage. After switching cells back to glucose, 95% of the metabolites and 78% of transcripts reached the same steady-state levels that they had before the starvation phase. However, for many metabolites and transcripts, it took over 1 h until they reached a steady state, thus indicating extensive regulation during the exit phase.

Integrating metabolomics and transcriptomics data

To identify metabolites that are potential regulators of gene expression during the growth–starvation-growth switch, we searched for correlations between dynamics of metabolites and transcripts. Because metabolites modulate transcription through allosteric TF regulation, we sought to determine the activity of TFs and other regulators like σ⁷⁰ and σ^S. The relationship between transcriptional regulators and their target genes is well-characterized in E. coli, in the form of a transcription regulation network¹⁸. A well-mapped transcription regulation network allowed us to infer activities of transcriptional regulators from measured gene expression profiles using algorithms like network component analysis (NCA)^19,20. The NCA algorithm estimates activity profiles of transcriptional regulators, which minimize the error between theoretical and measured gene expression profiles (for 2167 genes that are mapped to 209 transcriptional regulators in the E. coli transcription regulation network). In total, we performed 100 searches with the NCA algorithm, each with a different randomized initial condition, such that we obtained means and confidence intervals for activity profiles of the 209 transcriptional regulators (Fig. 3a, Supplementary Data 5). These 209 activity profiles were able to reproduce 75% of the transcript dynamics and were consistent with the expected responses of transcriptional regulators during starvation and growth²¹. For example, σ⁷⁰, the major sigma factor during exponential growth, was deactivated upon entry to starvation, and the stress response regulator σ^S was immediately activated (Supplementary Fig. 5).

Allosteric regulation of a TF by a metabolite is often described by Hill-type kinetics¹¹, which assumes a sigmoidal relationship between TF activity and the concentration of an effector metabolite. In a canonical example of this regulation, the secondary messenger cyclic-AMP activates CRP, which is a global TF in E. coli^22,23. On the basis of Hill kinetics, we tested how well the measured cyclic-AMP concentration predicts the activity profile of CRP (Fig. 3b). Cyclic-AMP and CRP activity indeed revealed a Hill-type relationship with an activation constant (K_H) of 39 µM, which is very close to the in vitro determined value of 27 µM²³. Thus, in vivo metabolite and transcript data identify the known interaction between cyclic-AMP and CRP, and additionally capture the underlying kinetics of allosteric TF regulation. Another well-known metabolite–TF pair is tryptophan and the repressor of the tryptophan operon (TrpR), which also showed Hill-type kinetics, and the in vivo K_H of 355 µM was again relatively close to the in vitro value of 160 µM²⁴ (Fig. 3c).

Next, we wondered how many of the known metabolite–TF interactions are covered by our data, and whether they show a Hill-type relationship. Therefore, we first constructed a “literature network” of known metabolite–TF interactions by mining RegulonDB¹⁸, the EcoCyc database²⁵ and the Allosteric Database²⁶. This literature metabolite–TF network included in total 134 interactions between 87 TFs and 106 metabolites (Supplementary Fig. 6). Of these interactions, 41% are activating, 38% are inhibiting, and for 21% it is not known whether the metabolite inhibits or activates the TF. Our data covered interactions for 21 out of the 87 TFs, and 12 of them correlated with at least one of the known regulatory metabolites (Fig. 4a, Pearson’s correlation coefficient R² > 0.75). Thus, our data recovered known interactions in more than 50% of the cases, and in each of these cases the correlation correctly reflected whether the metabolite activates or inhibits the TF. In the case of NadR and ExuR, our data suggest that they are inhibited by ATP and lysine, respectively.

Mapping metabolism-transcription interactions systematically

A problem of the correlation analysis was that several metabolites correlated with the activity of a TF, resulting in many false positives (Fig. 4b). The large number of false positives is mainly caused by metabolites that have similar dynamics. The same problem was previously reported for a multi-omics analysis of yeast metabolism, which searched for correlations between metabolites and fluxes²⁷. In that study, correlations between metabolites also caused many false positives, and including prior knowledge about metabolic flux regulation solved the problem. Here, we could not adopt such an approach, due to the limited information about allosteric TF regulation. Instead, we reduced the number of putative interactions by using a distance criterion for metabolites and TFs: metabolite–TF pairs were only considered if at least one target gene of the TF encodes an enzyme that participates in the same metabolic subsystem as the metabolite, or for which the metabolite is a substrate or a product. The hypothesis behind this distance criterion is that metabolites are more likely to regulate genes that are involved in their own biosynthesis. This assumption is supported by a recent study in cancer cells, which showed that metabolite-gene pairs have a higher correlation when they are close in the metabolic network²⁸. We observed a similar proximity of metabolite–TF interactions in our literature network, because more than 80% of these interactions have a small distance in the E. coli genome-scale metabolic model²⁹ (Supplementary Fig. 6). We then applied the distance criterion to our data and only considered metabolite–TF pairs that fulfilled it (black dots in Fig. 4b). Of the 12 known metabolite–TF interactions that showed a Hill-type relationship, 11 fulfilled the distance criterion, and only the interaction between ExuR and lysine was rejected. The advantage of the distance filter was that it reduced the number of highly correlating metabolites from an average of 34 metabolites per TF to an average of 9 (Fig. 4b).

Among the false positives that remained after the distance filter were lysine–ArgR and tyrosine–TrpR (orange dots in Fig. 4b). Because lysine and tyrosine share structural similarity with the known allosteric effectors (arginine for ArgR and tryptophan for TrpR), we tested whether lysine and tyrosine are additional and previously unidentified regulators of ArgR and TrpR. Therefore, we purified the two TFs and tested binding of lysine and tyrosine in vitro using micro-scale thermophoresis (MST). The in vitro MST assays indeed showed binding of lysine and tyrosine to ArgR and TrpR, respectively, thus validating the in vivo prediction (Fig. 4c, Supplementary Fig. 9). The in vitro assays also confirmed the known arginine–ArgR and tryptophan–TrpR interactions (Fig. 4c, Supplementary Fig. 9). Because ArgR regulates essential steps in lysine biosynthesis, as well as two lysine transporters, the physiological function of the lysine–ArgR interaction is presumably a metabolic feedback that inhibits lysine production and import when lysine is abundant^30,31. In the case of TrpR, previous studies showed that deletion of TrpR affects expression of tyrA in the tyrosine biosynthesis pathway³². Here, we show that tyrosine is also linked to TrpR, and the crosstalk between the two aromatic amino acids could potentially coordinate their biosynthesis.

Finally, we tested whether we can generate hypotheses about the existence of metabolite–TF interactions in an unbiased fashion by fitting Hill functions to all pairs of metabolites and TFs. We first reduced the number of TFs from 209 to 125 by excluding: (i) TFs that followed simple on-off-on dynamics, (ii) TFs with poor estimates of activity profiles (confidence interval >100%), and (iii) TFs that are part of two-component systems (these regulators are more likely modulated by external signals than by internal metabolites). The remaining 125 TFs and 123 metabolites resulted in 15,375 metabolite–TF pairs, which we tested for a Hill-type relationship. A total of 3067 metabolite–TF pairs (20%) showed a Hill-type relationship (R² > 0.75, Supplementary Data 6), and by applying the distance criterion again we reduced this number to 513, which we considered as putative metabolite–TF interactions (Supplementary Figs. 7, 8 and Supplementary Data 7).

The 513 putative interactions included 71 TFs, and we focused on the 30 TFs that correlated with only one or two metabolites (Supplementary Data 7). The resulting network shows mostly interactions of TFs with metabolites from amino acid and nucleotide metabolism but also with intermediates in carbon and cofactor metabolism (Fig. 5a). We purified three of the identified TFs to test whether they bind the predicted metabolite. In vitro MST assays validated that SoxR binds tryptophan, SgrR binds glutamate, and DhaR binds the glycolysis intermediate dihydroxyacetone phosphate (DHAP) (Fig. 5b, Supplementary Fig. 9). SoxR is known to activate the expression of aroF and tyrA, which encode enzymes catalyzing the first step in the biosynthesis of all aromatic amino acids (aroF) and the tyrosine branch (tyrA)³³. By binding tryptophan, SoxR could be part of a feedback regulation circuit in aromatic amino acid biosynthesis, which reduces expression of aroF and tyrA when tryptophan levels are high. SgrR activates alaC, which encodes a transaminase that converts glutamate and pyruvate to alpha-ketoglutarate and alanine³⁴. This transaminase accounts, together with a corresponding isoenzyme, for 90% of the catalytic activity for biosynthesis of alanine in E. coli³⁵. As our in vivo data show an inhibition of SgrR by glutamate, low glutamate levels would upregulate alaC. Because a low glutamate level brings the transamination reaction closer to thermodynamic equilibrium, an accompanying upregulation of alaC might provide the necessary enzymatic capacity³⁶. The last new interaction is between DHAP and DhaR, a regulator of dihydroxyacetone kinases, which seems to be activated in response to increasing DHAP levels³⁷. As DhaR activates the dihydroxyacetone kinases, the interaction could be part of a positive feedback loop.

Discussion

In conclusion, data of the E. coli transcriptome and metabolome during a 20 h growth–starvation–growth switch generated hypotheses about potential interactions between metabolites and TFs. The scale of this approach is its biggest advantage, because it allows pair-wise testing of all TFs against all metabolites. Here, we provided a first proof-of-principle that the combination of transcriptomics and metabolomics has great potential to identify metabolite–TF interactions at a metabolism-wide scale. To this end, we showed that many of the known metabolite–TF interactions were reflected by our data (e.g., cyclic-AMP–CRP), and, therefore, that metabolite and gene expression data contain the information that is necessary to reconstruct metabolic-genetic networks. Moreover, we could validate five of the predicted metabolite–TF interactions with in vitro assays (lysine–ArgR, tyrosine–TrpR, glutamate–SgrR, tryptophan–SoxR and dihydroxyacetone phosphate–DhaR).

In our analysis, we excluded two-component systems, because they are likely responsive to external metabolites. By measuring the exo-metabolome it should be possible to identify effectors of two-component systems with the method proposed in this study. We also excluded TFs with poor estimates of activity profiles, and to include these TFs one could probe their activities with fluorescent transcriptional reporters as recently suggested¹⁶. Accurate information about TF activities was important for our approach because it allowed pairwise testing of Hill-type relationships between TF activities and metabolites. Here, we inferred TF activities with the NCA algorithm that requires a well-mapped transcription regulation network. While the transcription regulation network is known in E. coli, it is unknown for most other organisms. To overcome the need for a known transcription regulation network, the TF activities could be inferred from the transcriptome data directly without using prior knowledge about the transcription regulation network. Previous studies showed for example that machine learning methods can infer TF activities in E. coli based on transcriptomics data³⁸, and inference of regulatory metabolites with such methods was also suggested³⁹. Future approaches could even consider determining TF activities and regulatory metabolites simultaneously.

The main limitation in our study was that many metabolites showed similar dynamics, which in turn caused false positive predictions of metabolite–TF interactions. The high correlation among metabolites could be a general problem in metabolomics-based inference approaches²⁷. A solution for this problem is to enforce more specific metabolite concentration changes by localized perturbations of metabolism, for example by disturbing single enzymes. We anticipate that the transcriptome and metabolome of hundreds of locally perturbed metabolic states would provide sufficient information to faithfully map metabolite–TF interactions of an organism. An effective perturbation method is CRISPR interference, because of its potential to interfere with the expression of every enzyme of an organism.

A complete map of metabolite–TF interactions would advance our knowledge about the dynamic nature of metabolic regulation and enable the construction of dynamic metabolic models. Here, we focused mainly on interactions that are part of metabolic-genetic feedback circuits, because we considered the distance between TFs and metabolites. However, metabolites will not only affect the transcription of genes encoding enzymes, but also affect genes involved in various other physiological processes. Understanding these long-ranging metabolite–TF interactions would dramatically increase our understanding about how metabolism drives physiological responses, e.g. to oxidative stress⁴⁰ or antibiotics⁴¹. Finally, there is the possibility to exploit the knowledge about metabolite–TF interactions to engineer better strains for biotechnology, e.g. by designing genetic-metabolic feedback that acts as valves in production strains⁴² or growth switches⁴³.

Methods

Strains and cultivation

E. coli BW25113 (parent strain for the Keio Collection, CGSC#: 7636) was cultivated in a 1 L bioreactor with 500 mL of M9 minimal medium containing 5 g L⁻¹ glucose to an optical density at 600 nm (OD) of 2. Then the culture was centrifuged at 37 °C and 1800 × g for 5 min. Pelleted cells were resuspended in M9 medium without glucose at 37 °C and transferred back to the bioreactor. After 12 h, the culture was supplemented with glucose to a final concentration of 5 g L⁻¹. The M9 minimal medium consisted of the following components (per liter): 6 g Na₂HPO₄ · 2 H₂O, 3 g KH₂PO₄, 1.5 g (NH₄)₂SO₄, 0.5 g NaCl. The following components were sterilized separately and then added to the medium (final concentrations): 0.1 mM CaCl₂, 1 mM MgSO₄, 60 µM FeCl₃, 2.8 µM thiamine-HCl, and 10 mL trace salt solution. The trace salt solution contained (per liter) 180 mg ZnSO₄ · 7 H₂O, 120 mg CuCl₂ · 2 H₂O, 120 mg MnSO₄ · H₂O, 180 mg CoCl₂ · 6 H₂O. The dissolved oxygen in the bioreactor was kept at 30%, and the pH was controlled at 7 with 5 M NH₄OH and 20% H₃PO₄. The bioreactor was a BioFlo115 bioprocess system (Eppendorf, Hamburg, Germany), equipped with a pH-sensor (Mettler Toledo, Columbus, OH) and a DO-sensor (Mettler Toledo, Columbus, OH). Exhaust gas of the cultivation was analyzed by a DASGIP GasAnalyser (Eppendorf, Hamburg, Germany). The GasAnalyser was calibrated with a two-point calibration prior to the cultivation. The bioreactor cultivation was monitored with the BioCommand-Software (Eppendorf, Hamburg, Germany).

Metabolomics

For metabolomics, 1 mL culture aliquots were vacuum-filtered on a 0.45 µm pore size filter (HVLP02500, Merck Millipore). Filters were immediately transferred into 40:40:20 (v-%) acetonitrile/methanol/water at −20 °C for extraction. Extracts were centrifuged for 15 min at 11,000 × g at −9 °C. Centrifuged extracts were mixed with ¹³C-labeled internal standard. Chromatographic separations were performed on an Agilent 1290 Infinity II LC System (Agilent Technologies) equipped with an Acquity UPLC BEH Amide column (2.1 × 30 mm, particle size 1.8 µm, Waters) for acidic conditions and an iHilic-Fusion (P) HPLC column (2.1 × 50 mm, particle size 5 µm, Hilicon) for basic conditions. We applied the following binary gradients at a flow rate of 400 µl min⁻¹. Acidic condition: 0–1.3 min isocratic 10% A (water/formic acid, 99.9/0.1 (v/v), 10 mM ammonium formate), 90% B (acetonitrile/formic acid, 99.9/0.1 (v/v)); 1.3–1.5 min linear from 90 to 40% B; 1.5–1.7 min linear from 40 to 90% B; 1.7–2 min isocratic 90% B. Basic condition: 0–1.3 min isocratic 10% A (water/formic acid, 99.8/0.2 (v/v), 10 mM ammonium carbonate), 90% B (acetonitrile); 1.3–1.5 min linear from 90 to 40% B; 1.5–1.7 min linear from 40 to 90% B; 1.7–2 min isocratic 90% B. The injection volume was 3.0 µl (full loop injection).

Eluting compounds were detected using an Agilent 6495 triple quadrupole mass spectrometer (Agilent Technologies) equipped with an Agilent Jet Stream electrospray ion source in positive and negative ion mode. Source gas temperature was set to 200 °C, with 14 L min⁻¹ drying gas and a nebulizer pressure of 24 psi. Sheath gas temperature was set to 300 °C and flow to 11 L min⁻¹. Electrospray nozzle and capillary voltages were set to 500 and 2500 V, respectively. Metabolites were identified by multiple reaction monitoring (MRM), and MRM parameters were optimized and validated with authentic standards⁴⁴. Metabolites were measured in ¹²C− and ¹³C isoforms, and data were analyzed with published Matlab code⁴⁴. Metabolites were sampled four times at the first time point t₀; and two samples were collected at the remaining time points (see also reporting standards in Supplementary Data 10). Metabolomics metadata is accessible under the MetaboLights accession number MTBLS1044.

Transcriptomics

For transcriptomics, 0.5 mL culture was transferred into reaction tubes and centrifuged at 11,000 × g for 2 min, and the pellet was frozen in liquid nitrogen. The total RNA of the cells was isolated using the Total RNA Isolation Mini Kit (Agilent, Santa Clara, CA). The integrity of the RNA was measured using the BioAnalyzer Pico-Kit (Agilent, Santa Clara, CA). RNA-sequencing was performed by the Max Planck-Genome-Centre Cologne, Germany (https://mpgc.mpipz.mpg.de/home/). The sequencing reads were analyzed and mapped using the CLC Software (QIAGEN, Venlo, NL). For normalization, gene expression was calculated as transcripts per kilobase million (TPMs). RNA was sampled four times at the first time point t₀, and two samples were collected at the remaining time points. For the time points t₁₃, t₁₅, t₁₉ and t₂₄, one of the two replicates was excluded due to low quality of the sampled RNA. Transcriptomics metadata is accessible under the GEO number GSE131992.
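The TPM normalization mentioned above can be illustrated with a minimal sketch. It uses the standard TPM definition (length-normalized counts rescaled to sum to one million) and is not the CLC Software implementation; the function name `tpm` and the toy counts and gene lengths are assumptions made for illustration only.

```python
def tpm(read_counts, gene_lengths_bp):
    """Convert raw read counts to transcripts per million (TPM)."""
    # Reads per kilobase: divide each gene's count by its length in kb.
    rpk = [count / (length / 1000.0)
           for count, length in zip(read_counts, gene_lengths_bp)]
    total = sum(rpk)
    if total == 0:
        return [0.0] * len(rpk)
    # Rescale so that all values of one sample sum to one million.
    return [value / total * 1e6 for value in rpk]


# Toy example (hypothetical numbers): three genes in one sample.
counts = [100, 200, 300]
lengths = [1000, 2000, 1000]
values = tpm(counts, lengths)
```

Because every sample is rescaled to the same total, TPM values are comparable across the time points of a series.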

Network component analysis (NCA)

NCA was performed by iteratively optimizing connectivity strength and TF-activity by using the connectivity matrix of the transcription regulation network and the measured gene expression. The optimization is a least square optimization between the gene expression and the product of connectivity and TF-activity:

$$\min_{A,P} \left\| E - AP \right\|^{2} \tag{1}$$

where E is the log₁₀-transformed gene expression data (in TPMs) (Supplementary Data 9), A is the connectivity matrix of the transcription regulation network (matrix with regulator–gene interactions, Supplementary Data 8), and P is the TF activity¹⁹. To generate the connectivity matrix, a matrix of transcriptional regulator–gene interactions was generated by combining the matrices of TF–gene interactions and sigma factor–gene interactions from RegulonDB¹⁸. The (p)ppGpp regulon and transcriptional attenuation were added as additional regulation, as described in the EcoCyc database²⁵. To account for basal expression of every gene by the RNA polymerase, we added a global regulator, which was connected to all genes in the connectivity matrix. Randomized starting points were used for each calculation cycle of the algorithm. A calculation cycle was aborted if the summed squared 2-norm of the residuals did not change by more than 1%.
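The alternating optimization of Eq. (1) can be sketched as follows. This is a pure-Python illustration on toy data, not the authors' Matlab code: the function names, the small ridge term added for numerical stability, and the example matrices are all assumptions. Entries of A are only fitted where the connectivity mask allows a regulator–gene interaction, and iteration stops when the summed squared residuals change by no more than 1%.

```python
import random


def solve_least_squares(X, y, ridge=1e-9):
    """Solve min ||X b - y||^2 via ridge-stabilized normal equations."""
    n = len(X[0])
    # Augmented system [X^T X + ridge * I | X^T y].
    M = [[sum(row[i] * row[j] for row in X) + (ridge if i == j else 0.0)
          for j in range(n)] + [sum(row[i] * yk for row, yk in zip(X, y))]
         for i in range(n)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(n):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [a - f * b for a, b in zip(M[r], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]


def nca(E, mask, n_iter=200, tol=0.01, seed=0):
    """Alternating least squares for E ~ A P, with A restricted to `mask`."""
    rng = random.Random(seed)
    n_genes, n_times, n_regs = len(E), len(E[0]), len(mask[0])
    # Randomized starting point for the regulator activities.
    P = [[rng.uniform(-1, 1) for _ in range(n_times)] for _ in range(n_regs)]
    A = [[0.0] * n_regs for _ in range(n_genes)]
    prev = None
    for _ in range(n_iter):
        # Step 1: fix P, fit the allowed connectivity strengths per gene.
        for g in range(n_genes):
            regs = [r for r in range(n_regs) if mask[g][r]]
            X = [[P[r][t] for r in regs] for t in range(n_times)]
            for r, coef in zip(regs, solve_least_squares(X, E[g])):
                A[g][r] = coef
        # Step 2: fix A, fit the activities per time point.
        for t in range(n_times):
            col = solve_least_squares(A, [E[g][t] for g in range(n_genes)])
            for r in range(n_regs):
                P[r][t] = col[r]
        # Summed squared residuals of the current fit.
        sse = sum((E[g][t] - sum(A[g][r] * P[r][t] for r in range(n_regs))) ** 2
                  for g in range(n_genes) for t in range(n_times))
        if prev is not None and abs(prev - sse) <= tol * prev:
            break
        prev = sse
    return A, P, sse


# Toy data (hypothetical): 4 genes, 2 regulators, 5 time points.
mask = [[1, 0], [1, 1], [0, 1], [1, 1]]
A_true = [[1.0, 0.0], [0.5, -1.0], [0.0, 2.0], [-1.0, 0.5]]
P_true = [[0.0, 1.0, 2.0, 1.0, 0.0], [1.0, 0.5, 0.0, 0.5, 1.0]]
E = [[sum(A_true[g][r] * P_true[r][t] for r in range(2)) for t in range(5)]
     for g in range(4)]
A_fit, P_fit, sse = nca(E, mask)
```

The factorization is only determined up to a scaling of each regulator, so the fitted activities are comparable in shape rather than in absolute value. Repeating the call with different `seed` values mimics the randomized restarts described above.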

Correlations between metabolites and TF activities

Metabolite concentrations and TF activities were first correlated linearly. In case of a positive linear correlation, we used activating Hill kinetics as the basis for a non-linear fit. In case of a negative linear correlation we used inhibition kinetics:

$$\text{Activation kinetics:}\quad y = y_{\mathrm{max}} \cdot \frac{x^{\mathrm{h}}}{x^{\mathrm{h}} + K_{\mathrm{H}}^{\mathrm{h}}} \tag{2}$$

$$\text{Inhibition kinetics:}\quad y = y_{\mathrm{max}} \cdot \frac{K_{\mathrm{H}}^{\mathrm{h}}}{x^{\mathrm{h}} + K_{\mathrm{H}}^{\mathrm{h}}} \tag{3}$$

where y is the TF activity and x is the metabolite concentration. K_H is the activation constant, h is the Hill coefficient and y_max is the maximal TF activity, which was assumed to be constant over time. Parameters of the Hill equations (K_H and h) were estimated 50 times in total per metabolite–TF pair. The Hill coefficient h was constrained to an upper value of 10. For each pair of metabolite and TF, we tested whether a negative time-shift of the TF activity by one time point would improve the parameter estimation. This accounts for the fact that TF activities are derived from gene expression data, which could potentially follow changes of metabolite levels (Supplementary Fig. 10). The correlation coefficient R² was calculated between the measured TF activity and the transformed metabolite levels using the estimated Hill parameters.
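A minimal sketch of the fitting procedure for Eqs. (2) and (3) is shown below. It replaces the repeated nonlinear parameter estimation with a plain grid search over K_H and h, which is an assumption made to keep the example dependency-free, and it omits the one-time-point shift. The function names, the parameter grids, and the synthetic data are hypothetical.

```python
import math


def hill(x, y_max, k_h, h, activating=True):
    """Hill-type activation (Eq. 2) or inhibition (Eq. 3)."""
    numerator = x ** h if activating else k_h ** h
    return y_max * numerator / (x ** h + k_h ** h)


def pearson_r(a, b):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((u - mean_a) * (v - mean_b) for u, v in zip(a, b))
    var_a = sum((u - mean_a) ** 2 for u in a)
    var_b = sum((v - mean_b) ** 2 for v in b)
    return cov / math.sqrt(var_a * var_b)


def fit_hill(x, y, k_grid, h_grid):
    """Grid-search K_H and h; the kinetics type follows the linear correlation."""
    activating = pearson_r(x, y) >= 0
    best = None
    for k_h in k_grid:
        for h in h_grid:
            shape = [hill(xi, 1.0, k_h, h, activating) for xi in x]
            # For fixed K_H and h, the least-squares y_max has a closed form.
            y_max = (sum(s * yi for s, yi in zip(shape, y))
                     / sum(s * s for s in shape))
            sse = sum((y_max * s - yi) ** 2 for s, yi in zip(shape, y))
            if best is None or sse < best[0]:
                best = (sse, y_max, k_h, h)
    _, y_max, k_h, h = best
    predicted = [hill(xi, y_max, k_h, h, activating) for xi in x]
    return {"activating": activating, "y_max": y_max, "K_H": k_h, "h": h,
            "R2": pearson_r(predicted, y) ** 2}


# Synthetic example (hypothetical): activity generated with K_H = 40, h = 2.
x = [1.0, 5.0, 10.0, 20.0, 40.0, 80.0, 160.0]
y = [hill(xi, 1.0, 40.0, 2, activating=True) for xi in x]
fit = fit_hill(x, y, k_grid=[10.0, 20.0, 40.0, 80.0], h_grid=[1, 2, 3, 4])
```

A pair would then count as showing a Hill-type relationship when `fit["R2"]` exceeds the threshold of 0.75 used in the Results.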

Distances of metabolite–TF interactions

First, we remove all cofactors, as well as periplasmic and extracellular metabolites, from the stoichiometric matrix of the iJO1366 genome-scale metabolic model of E. coli. Next, we create a metabolite-gene adjacency matrix, F, by calculating the inner product of the modified stoichiometric matrix, N, and the reaction-gene matrix, G. We then compute the Boolean of F, F’, and transform F’ into an undirected, bipartite graph whose nodes denote metabolites and genes, respectively. For this graph, we calculate a distance matrix, D, containing all pairwise distances between metabolites and genes in F. For known metabolite–TF interactions, we look for the distances between the regulating metabolite and each of the target genes of the TF and take the smallest distance. In case a regulating metabolite is not part of iJO1366, we omit the distance calculation⁴⁵.
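The distance calculation can be sketched with a breadth-first search on the bipartite graph. The example below is an illustration on a toy network, not the iJO1366 model; the function names are hypothetical. As a stated assumption, a metabolite and a gene are treated as linked whenever any reaction connects them, which corresponds to the Boolean of the product of the absolute-value matrices and avoids cancellation of signed stoichiometric coefficients.

```python
from collections import deque


def metabolite_gene_adjacency(N, G):
    """Boolean F': metabolite m is linked to gene g through some reaction."""
    n_rxn, n_genes = len(G), len(G[0])
    return [[any(row[r] != 0 and G[r][g] != 0 for r in range(n_rxn))
             for g in range(n_genes)] for row in N]


def bipartite_distances(F_bool, source_met):
    """Breadth-first distances from one metabolite to all reachable nodes."""
    n_met, n_genes = len(F_bool), len(F_bool[0])
    dist = {("met", source_met): 0}
    queue = deque([("met", source_met)])
    while queue:
        kind, idx = queue.popleft()
        if kind == "met":
            neighbors = [("gene", g) for g in range(n_genes) if F_bool[idx][g]]
        else:
            neighbors = [("met", m) for m in range(n_met) if F_bool[m][idx]]
        for node in neighbors:
            if node not in dist:
                dist[node] = dist[(kind, idx)] + 1
                queue.append(node)
    return dist


def min_distance(F_bool, metabolite, target_genes):
    """Smallest distance between a metabolite and any target gene of a TF."""
    dist = bipartite_distances(F_bool, metabolite)
    reachable = [dist[("gene", g)] for g in target_genes if ("gene", g) in dist]
    return min(reachable) if reachable else None


# Toy network (hypothetical): reaction r0 converts m0 to m1 (gene g0),
# reaction r1 converts m1 to m2 (gene g1).
N = [[-1, 0], [1, -1], [0, 1]]   # metabolites x reactions
G = [[1, 0], [0, 1]]             # reactions x genes
F_bool = metabolite_gene_adjacency(N, G)
d_far = min_distance(F_bool, 0, [1])       # m0 to a TF that targets only g1
d_near = min_distance(F_bool, 0, [0, 1])   # m0 to a TF that targets g0 and g1
```

In this toy network the path from m0 to g1 runs through g0 and m1, so a TF that only targets g1 is three edges away from m0, while a TF that also targets g0 is one edge away.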

The distance criterion for correlating metabolite–TF pairs (Fig. 4b) was also based on the genome-scale model iJO1366²⁹. Pairs of metabolites and TFs were only considered if at least one of the following two criteria was fulfilled. Criterion 1: the metabolite is a product or a substrate of an enzyme that is encoded by a target gene of the TF. Criterion 2: the metabolite is listed in the same metabolic subsystem as an enzyme that is encoded by a target gene of the TF. Subsystems of TFs were defined as the metabolic pathways controlled by the TF in the genome-scale model. Subsystems of metabolites were defined according to Supplementary Data 1.
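The two criteria amount to a simple set-membership test, sketched below. The data structures (dictionaries mapping genes to reactants and subsystems) and all gene, metabolite, and subsystem names are hypothetical placeholders, not entries taken from iJO1366 or from the supplementary data.

```python
def passes_distance_criterion(metabolite, tf_targets, gene_reactants,
                              gene_subsystems, metabolite_subsystems):
    """Return True if the metabolite-TF pair fulfils criterion 1 or 2."""
    met_subsystems = metabolite_subsystems.get(metabolite, set())
    for gene in tf_targets:
        # Criterion 1: metabolite is a substrate or product of the enzyme.
        if metabolite in gene_reactants.get(gene, set()):
            return True
        # Criterion 2: metabolite and enzyme share a metabolic subsystem.
        if gene_subsystems.get(gene, set()) & met_subsystems:
            return True
    return False


# Hypothetical toy annotations.
gene_reactants = {"geneA": {"met1", "met2"}, "geneB": {"met3"}}
gene_subsystems = {"geneA": {"pathway X"}, "geneB": {"pathway Y"}}
metabolite_subsystems = {"met1": {"pathway X"}, "met4": {"pathway Y"},
                         "met5": {"pathway Z"}}

direct = passes_distance_criterion(
    "met2", ["geneA"], gene_reactants, gene_subsystems, metabolite_subsystems)
same_subsystem = passes_distance_criterion(
    "met4", ["geneB"], gene_reactants, gene_subsystems, metabolite_subsystems)
rejected = passes_distance_criterion(
    "met5", ["geneA", "geneB"], gene_reactants, gene_subsystems,
    metabolite_subsystems)
```

Here `direct` passes through criterion 1, `same_subsystem` passes through criterion 2, and `rejected` fails both, which is the situation described for the ExuR–lysine pair in the Results.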

Protein overexpression and purification

TFs were purified from the E. coli ASKA strains⁴⁶. Cells were grown in 200 mL TB medium containing 30 µg mL⁻¹ chloramphenicol at 37 °C. When cells reached an OD of 0.6, we added 0.5 mM IPTG. Cells were incubated at 37 °C for a further 3 h and harvested by centrifugation. Proteins were purified from the pellets using the Protino™ Ni-TED-IDA 1000 Kit (Macherey-Nagel, Düren, Germany). Protein purity was confirmed by SDS-PAGE, and concentrations were determined by the Pierce protein BCA Assay (Thermo Fisher Scientific, Waltham, MA).

Quantitation of interactions by microscale thermophoresis

Microscale Thermophoresis (MST)⁴⁷ was performed on a Monolith NT.115 (NanoTemper Technologies GmbH, Munich, Germany) at 21 °C (red LED power was set to 75% and infrared laser power to 80%). 50 nM of the respective protein was labeled with the RED-tris-NTA 2nd Generation dye of the Monolith His-Tag Labeling Kit (MO-L018) supplied by NanoTemper Technologies. Labeled proteins were titrated as indicated with the respective metabolite in buffer T (50 mM NaH₂PO₄, 500 mM NaCl, pH 5.7). At least nine independent MST experiments (three technical replicates of three biological replicates) were performed at 680 nm and processed by NanoTemper Analysis package 1.2.009 and Origin8 (OriginLab, Northampton, MA).

Reporting summary

Further information on experimental design is available in the Nature Research Reporting Summary linked to this paper.

Code availability

Matlab code to perform Network Component Analysis and kinetic correlations is available from the GitHub repository at https://github.com/nfarke/Lempp_Metabolite_TF_interaction_Ecoli.

Data availability

Gene expression data that support the findings of this study have been deposited in NCBI’s Gene Expression Omnibus with the accession code GSE131992. Metabolomics data that support the findings of this study have been deposited in the MetaboLights database with the accession code MTBLS1044. The source data of Figs. 1a, b, 2, 3a–c, 4b, c and 5b and Supplementary Figs. 1, 3, 4, 5, 7, and 9 are provided as a Source Data file. All other data are available from the corresponding author on reasonable request.

References

Buescher, J. M. et al. Global network reorganization during dynamic adaptations of Bacillus subtilis metabolism. Science 335, 1099–1103 (2012).
Kresnowati, M. T. A. P. et al. When transcriptome meets metabolome: Fast cellular responses of yeast to sudden relief of glucose limitation. Mol. Syst. Biol. 2, 49 (2006).
Bradley, P. H., Brauer, M. J., Rabinowitz, J. D. & Troyanskaya, O. G. Coordinated Concentration Changes of Transcripts and Metabolites in Saccharomyces cerevisiae. PLOS Comput. Biol. 5, e1000270 (2009).
Redestig, H. & Costa, I. G. Detection and interpretation of metabolite–transcript coresponses using combined profiling data. Bioinformatics 27, i357–i365 (2011).
Chubukov, V., Gerosa, L., Kochanowski, K. & Sauer, U. Coordination of microbial metabolism. Nat. Rev. Microbiol. 12, 327–340 (2014).
Browning, D. F. & Busby, S. J. W. Local and global regulation of transcription initiation in bacteria. Nat. Rev. Microbiol. 14, 638–650 (2016).
Donati, S., Sander, T. & Link, H. Crosstalk between transcription and metabolism: how much enzyme is enough for a cell? Wiley Interdiscip. Rev. Syst. Biol. Med. 10, https://doi.org/10.1002/wsbm.1396 (2018).
Jozefczuk, S. et al. Metabolomic and transcriptomic stress response of Escherichia coli. Mol. Syst. Biol. 6, 1–16 (2010).
Geiger, R. et al. L-Arginine modulates T cell metabolism and enhances survival and anti-tumor activity. Cell 167, 829–842.e13 (2016).
Cho, B.-K., Federowicz, S., Park, Y.-S., Zengler, K. & Palsson, B. Ø. Deciphering the transcriptional regulatory logic of amino acid metabolism. Nat. Chem. Biol. 8, 65–71 (2012).
Razo-Mejia, M. et al. Tuning transcriptional regulation through signaling: a predictive theory of allosteric induction. Cell Syst. 6, 456–469.e10 (2018).
Rinschen, M. M., Ivanisevic, J., Giera, M. & Siuzdak, G. Identification of bioactive metabolites using activity metabolomics. Nat. Rev. Mol. Cell Biol. 20, 353–367 (2019).
Yugi, K. & Kuroda, S. Metabolism-centric trans-omics. Cell Syst. 4, 19–20 (2017).
Folly, B. B. et al. Assessment of the interaction between the flux-signaling metabolite fructose-1,6-bisphosphate and the bacterial transcription factors CggR and Cra. Mol. Microbiol. 109, 278–290 (2018).
Piazza, I. et al. A map of protein-metabolite interactions reveals principles of chemical communication. Cell 172, 358–372.e23 (2018).
Kochanowski, K. et al. Few regulatory metabolites coordinate expression of central metabolic genes in Escherichia coli. Mol. Syst. Biol. 13, 903–903 (2017).
Link, H., Fuhrer, T., Gerosa, L., Zamboni, N. & Sauer, U. Real-time metabolome profiling of the metabolic switch between starvation and growth. Nat. Methods 12, 1091–1097 (2015).
Gama-Castro, S. et al. RegulonDB version 9.0: high-level integration of gene regulation, coexpression, motif clustering and beyond. Nucleic Acids Res. 44, D133–D143 (2016).
Liao, J. C. et al. Network component analysis: reconstruction of regulatory signals in biological systems. Proc. Natl Acad. Sci. USA 100, 15522–15527 (2003).
Kao, K. C. et al. Transcriptome-based determination of multiple transcription regulator activities in Escherichia coli by using network component analysis. Proc. Natl Acad. Sci. USA 101, 641–646 (2004).
Sharma, U. K. & Chatterji, D. Transcriptional switching in Escherichia coli during stress and starvation by modulation of sigma activity. FEMS Microbiol. Rev. 34, 646–657 (2010).
Kolb, A., Busby, S., Buc, H., Garges, S. & Adhya, S. Transcriptional regulation by cAMP and its receptor protein. Annu. Rev. Biochem. 62, 749–797 (1993).
Małecki, J., Polit, A. & Wasylewski, Z. Kinetic studies of cAMP-induced allosteric changes in cyclic AMP receptor protein from Escherichia coli. J. Biol. Chem. 275, 8480–8486 (2000).
Arvidson, D. N., Bruce, C. & Gunsalus, R. P. Interaction of the Escherichia coli trp aporepressor with its ligand, L-tryptophan. J. Biol. Chem. 261, 238–243 (1986).
Keseler, I. M. et al. The EcoCyc database: reflecting new knowledge about Escherichia coli K-12. Nucleic Acids Res. 45, D543–D550 (2017).
Shen, Q. et al. ASD v3.0: unraveling allosteric regulation with structural mechanisms and biological networks. Nucleic Acids Res. 44, D527–D535 (2016).
Hackett, S. R. et al. Systems-level analysis of mechanisms regulating yeast metabolic flux. Science 354, aaf2786 (2016).
Ortmayr, K., Dubuis, S. & Zampieri, M. Metabolic profiling of cancer cells reveals genome-wide crosstalk between transcriptional regulators and metabolism. Nat. Commun. 10, 1841 (2019).
Orth, J. D. et al. A comprehensive genome‐scale reconstruction of Escherichia coli metabolism—2011. Mol. Syst. Biol. 7, 535 (2011).
Pathania, A. & Sardesai, A. A. Distinct paths for basic amino acid export in Escherichia coli: YbjE (LysO) mediates export of L-lysine. J. Bacteriol. 197, 2036–2047 (2015).
Peterkofsky, B. & Gilvarg, C. N-Succinyl-l-diaminopimelic-glutamic transaminase. J. Biol. Chem. 236, 1432–1438 (1961).
Sander, T. et al. Allosteric feedback inhibition enables robust amino acid biosynthesis in E. coli by enforcing enzyme overabundance. Cell Syst. 8, 66–75.e8 (2019).
Seo, S. W., Kim, D., Szubin, R. & Palsson, B. O. Genome-wide reconstruction of OxyR and SoxRS transcriptional regulatory networks under oxidative stress in Escherichia coli K-12 MG1655. Cell Rep. 12, 1289–1299 (2015).
Vanderpool, C. K. & Gottesman, S. The novel transcription factor SgrR coordinates the response to glucose-phosphate stress. J. Bacteriol. 189, 2238–2248 (2007).
Kim, S. H., Schneider, B. L. & Reitzer, L. Genetics and regulation of the major enzymes of alanine synthesis in Escherichia coli. J. Bacteriol. 192, 5304–5311 (2010).
Flamholz, A., Noor, E., Bar-Even, A., Liebermeister, W. & Milo, R. Glycolytic strategy as a tradeoff between energy yield and protein cost. Proc. Natl Acad. Sci. USA 110, 10039–10044 (2013).
Bächler, C., Schneider, P., Bähler, P., Lustig, A. & Erni, B. Escherichia coli dihydroxyacetone kinase controls gene expression by binding to transcription factor DhaR. EMBO J. 24, 283–293 (2005).
Fang, X. et al. Global transcriptional regulatory network for Escherichia coli robustly connects gene expression to transcription factor activities. Proc. Natl Acad. Sci. USA 114, 10286–10291 (2017).
Noor, E., Cherkaoui, S. & Sauer, U. Biological insights through omics data integration. Curr. Opin. Syst. Biol. https://doi.org/10.1016/j.coisb.2019.03.007 (2019).
Campbell, K., Vowinckel, J., Keller, M. A. & Ralser, M. Methionine metabolism alters oxidative stress resistance via the pentose phosphate pathway. Antioxid. Redox Signal. 24, 543–547 (2016).
Campos, A. I. & Zampieri, M. Metabolomics-driven exploration of the chemical drug space to predict combination antimicrobial therapies. Mol. Cell 74, 1291–1303.e6 (2019).
Gupta, A., Brockman Reizman, I. M., Reisch, C. R. & Prather, K. L. J. Dynamic regulation of metabolic flux in engineered bacteria using a pathway-independent quorum-sensing circuit. Nat. Biotechnol. 35, 273–279 (2017).
Burg, J. M. et al. Large-scale bioprocess competitiveness: the potential of dynamic metabolic control in two-stage fermentations. Curr. Opin. Chem. Eng. 14, 121–136 (2016).
Guder, J. C., Schramm, T., Sander, T. & Link, H. Time-optimized isotope ratio LC–MS/MS for high-throughput quantification of primary metabolites. Anal. Chem. 89, 1624–1631 (2017).
Reznik, E. et al. Genome-scale architecture of small molecule regulatory networks and the fundamental trade-off between regulation and enzymatic activity. Cell Rep. 20, 2666–2677 (2017).
Kitagawa, M. et al. Complete set of ORF clones of Escherichia coli ASKA library (a complete set of E. coli K-12 ORF archive): unique resources for biological research. DNA Res. 12, 291–299 (2005).
Jerabek-Willemsen, M., Wienken, C. J., Braun, D., Baaske, P. & Duhr, S. Molecular interaction studies using microscale thermophoresis. Assay. Drug Dev. Technol. 9, 342–353 (2011).


Acknowledgements

We thank G. Bange and K. Drescher for discussions. This work was supported by the Deutsche Forschungsgemeinschaft through the Collaborative Research Center SFB 987, and by the ERC starting grant 715650. M.K. acknowledges funding by the IMPRS graduate school for environmental, cellular and molecular microbiology from the Max Planck Society. We thank the Max Planck-Genome-Centre Cologne (http://mpgc.mpipz.mpg.de/home/) for performing RNA sequencing in this study.

Author information

Authors and Affiliations

Max Planck Institute for Terrestrial Microbiology, Marburg, 35043, Germany
Martin Lempp, Niklas Farke, Michelle Kuntz & Hannes Link
Institut für Zytobiologie und Zytopathologie, Philipps-Universität Marburg, 35033, Marburg, Germany
Sven Andreas Freibert & Roland Lill
LOEWE Zentrum für Synthetische Mikrobiologie SYNMIKRO, Philipps-Universität Marburg, 35032, Marburg, Germany
Roland Lill & Hannes Link

Contributions

M.L. and M.K. performed experiments. M.L. performed LC-MS/MS measurements, Network Component Analysis, and kinetic correlations. M.L., S.F. and R.L. performed MST. N.F. constructed and analyzed the literature metabolite-TF network. M.L. and H.L. co-wrote the paper. H.L. directed the project.

Corresponding author

Correspondence to Hannes Link.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer Review Information Nature Communications thanks Julio Collado-Vides, Alisdair Fernie and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Supplementary Data 7

Supplementary Data 8

Supplementary Data 9

Supplementary Data 10

Reporting summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Lempp, M., Farke, N., Kuntz, M. et al. Systematic identification of metabolites controlling gene expression in E. coli. Nat Commun 10, 4463 (2019). https://doi.org/10.1038/s41467-019-12474-1

Received: 17 April 2019
Accepted: 11 September 2019
Published: 02 October 2019
DOI: https://doi.org/10.1038/s41467-019-12474-1
