Distinct metabolic network states manifest in the gene expression profiles of pediatric inflammatory bowel disease patients and controls

Knecht, Carolin; Fretter, Christoph; Rosenstiel, Philip; Krawczak, Michael; Hütt, Marc-Thorsten

doi:10.1038/srep32584

Download PDF

Article
Open access
Published: 02 September 2016

Distinct metabolic network states manifest in the gene expression profiles of pediatric inflammatory bowel disease patients and controls

Carolin Knecht¹^na1,
Christoph Fretter²^na1,
Philip Rosenstiel³^na1,
Michael Krawczak¹^na1 &
…
Marc-Thorsten Hütt²^na1

Scientific Reports volume 6, Article number: 32584 (2016) Cite this article

2103 Accesses
15 Citations
Metrics details

Subjects

Abstract

Information on biological networks can greatly facilitate the function-orientated interpretation of high-throughput molecular data. Genome-wide metabolic network models of human cells, in particular, can be employed to contextualize gene expression profiles of patients with the goal of both, a better understanding of individual etiologies and an educated reclassification of (clinically defined) phenotypes. We analyzed publicly available expression profiles of intestinal tissues from treatment-naive pediatric inflammatory bowel disease (IBD) patients and age-matched control individuals, using a reaction-centric metabolic network derived from the Recon2 model. By way of defining a measure of ‘coherence’, we quantified how well individual patterns of expression changes matched the metabolic network. We observed a bimodal distribution of metabolic network coherence in both patients and controls, albeit at notably different mixture probabilities. Multidimensional scaling analysis revealed a bisectional pattern as well that overlapped widely with the metabolic network-based results. Expression differences driving the observed bimodality were related to cellular transport of thiamine and bile acid metabolism, thereby highlighting the crosstalk between metabolism and other vital pathways. We demonstrated how classical data mining and network analysis can jointly identify biologically meaningful patterns in gene expression data.

Meta-analysis of gene expression disease signatures in colonic biopsy tissue from patients with ulcerative colitis

Article Open access 14 September 2021

Systems biology approach highlights mechanistic differences between Crohn’s disease and ulcerative colitis

Article Open access 01 June 2021

Inflammatory bowel disease biomarkers revealed by the human gut microbiome network

Article Open access 08 November 2023

Introduction

Over the past decade, the advent and further development of the high-throughput molecular techniques of genomics, proteomics and metabolomics have rendered possible the generation of rich molecular data sets at ever increasing speed. Due to the mere size and complexity of these data, however, both hypothesis-driven analyses and agnostic data mining exercises are usually hampered by serious multiple comparison problems. In consequence, molecular studies of human disease have rarely led to more than long lists of uninterpretable fold changes and p values, with little direct benefit to scientific scrutiny. Occasionally, selected experimental targets may also accrue from the expertise of individual research groups, but the evidence basis of such ‘good guesses’ is usually subjective or sparse, or both. Based upon previous experience in other areas of scientific research, it may thus be surmised that proper contextualization of molecular data by additional biological information would greatly facilitate their interpretation at different levels of cellular organization.

The term ‘network medicine’ has been coined to summarize attempts at gaining a systemic understanding of biological processes by mapping experimental data onto networks¹. These networks serve as abstractions of the underlying biological processes and, in this way, render them more amenable to statistical and mathematical analysis. In fact, throughout the distinguished career of network-based science², the question of how to use biological networks to interpret high-throughput molecular data has played an important role^3,4. Yet, all strategies brought forth so far essentially follow the same principle: Data attributes are associated with vertices in a network of interest and are given statistical weight depending upon their bonding by network edges.

Crohn disease (CD) and ulcerative colitis (UC) are inflammatory bowel diseases (IBD), characterized by relapsing-remitting episodes of intestinal inflammation. Both entities provide prime examples of a complex disease that is caused by a poorly understood interplay between environmental and genetic risk factors. Usually, both diseases first arise between the 2^nd and 4^th decade of life and have a strong effect upon the quality of life of patients. More specifically, CD and UC are associated with pain and bloody diarrhea, have debilitating inflammatory extra-intestinal manifestations (e.g. arthritis, uveitis), and require strong and long-term immunosuppressive medication. Both diseases are associated with a Western lifestyle and have become dramatically more frequent in the second half of the 20^th century⁵. Genetic studies identified a wealth of replicated disease associations to over 160 genomic regions⁶, suggesting an important role of immune signaling, endoplasmic reticulum (ER) stress, autophagy and cytoskeletal organization in IBD etiology. Despite the large number of risk loci and the improved understanding of their functional role, however, the exact causes of IBD still remain to be elucidated. There is currently no cure for either CD or UC, and primary and secondary non-response to induction and maintenance therapy represent a major problem of IBD clinical care.

Unsupervised gene expression analysis of patient samples aims at a better understanding of those gene regulatory processes that are critical for disease etiology, progression and treatment response. However, despite several fruitful attempts to follow this paradigm in the case of IBD^7,8,9,10, ways and means to infer different functional states of patient tissue from gene expression profiles, and to relate these states to the disease phenotype of interest, are still missing. Here, we follow an archetypical ‘network medicine’ approach to infer hitherto unrecognized patterns in gene expression data from IBD and control mucosal samples. We hypothesized that one or more deregulated states of a biological network may exist in the patients and that this variation can be identified from gene expression profiles taking the natural variation between patients properly into account.

Metabolic networks seem to suggest themselves as plausible candidates for network medicine in the IBD context because the human body makes many metabolic adjustments in response to, and in order to compensate for, inflammatory processes. The relevance of metabolic organization in IBD pathophysiology has been recognized early on¹¹ but systematic studies of IBD-related metabolic gene activity are still lacking. Therefore, extracting effective metabolic networks from gene expression changes in IBD patients may be an ideal test case for such a systems-based approach and, at the same time, may reveal new hints at the biological mechanisms underlying the disease. Moreover, distinct metabolic states may be associated with differences in disease progression and may therefore point towards a meaningful stratification of patients with a view on treatment and surveillance. Finally, complementing networks with standard enrichment analysis may allow metabolism-related states to be linked to the utterance of other biological functions.

Results and Discussion

In the present study, we focused upon the utility of metabolic networks to contextualize molecular data. More specifically, we used the Recon2 metabolic model¹² as a template to interpret publicly available gene expression profiles¹³ of intestinal tissue from control individuals and treatment-naive pediatric patients diagnosed with either Crohn disease (CD) or ulcerative colitis (UC). This age group may be rather untypical for IBD. However, we surmise that the analysis of pediatric patients may shed some extra light on the etiological link between gene expression and disease manifestation because, around the incidence peak of 20 to 40 years, this relationship may already be confounded to a considerable extend by past or present environmental influences. Our study involved multiple data processing and analysis steps (Fig. 1) that combine a metabolic network-based approach to data analysis with classical data mining, jointly facilitating a more function-orientated interpretation of the expression profiles.

Quantification of metabolic coherence

The concept of metabolic network coherence employed here^14,15 is based upon genome-wide metabolic networks that are subjected to flux-balance analysis (FBA), a variant of constraint-based modeling¹⁶. FBA starts from the solution space of a linear system, N∙v = 0, with stoichiometric matrix N and metabolic flux vector v. After the inclusion of necessary constraints (e.g., maximal nutrient uptake rates or reversibility of biochemical reactions), an objective function (e.g., biomass maximization) is defined and the optimal flux is found by linear programming^17,18. FBA has been applied successfully in microbiology before, for example, to predict gene essentiality with high accuracy for Escherichia coli¹⁹ and Saccharomyces cerevisiae²⁰. With the publication of the first metabolic models of human cells^21,22 and their multiple refinements^12,23, an application of the concept of metabolic network coherence in human medical research has become feasible. Our analysis strategy¹⁵ was first applied to gene expression profiles from patients with aldosterone-producing adenomas of the adrenal gland, where it revealed several distinct metabolism-related states in the data. Similar approaches combining flux prediction with gene expression profiling have been used, for example, to establish cell type-specific metabolic models^23,24,25.

The metabolic network derived from the Recon2 model is a bipartite graph with metabolite nodes and reaction nodes. A projection of this bipartite graph onto the reaction nodes (i.e. the reaction-centric metabolic network) and the evaluation of the gene-reaction associations contained in Recon2 lead to a (gene-centric) metabolic network with vertices representing genes and edges representing paths of length 2 between the gene-associated reactions in the original bipartite graph. We analyzed effective metabolic networks that were obtained by mapping significantly altered gene expression levels onto the gene-centric metabolic network. Here, ‘significantly altered’ gene expression was defined by way of calling a gene ‘saliently expressed’ in a given profile when the normalized expression (DESeq; see below) value for that gene exceeded ± 3. Note that ± 3 is an appropriate threshold for z scores like the normalized DESeq values because ± 3 roughly demarcates the 1% quantile of the standard Gaussian distribution. The general principle of metabolic network coherence analysis is depicted in Supplementary Figure S1.

A central problem of metabolic network coherence analysis in its original form¹⁵ has been the choice of an appropriate objective function and of suitable input to the metabolic system (i.e., a suitable cellular environment). We circumvented this problem by using a static network rather than a network comprising predicted active fluxes obtained via FBA. Statistically, the main effect of FBA in network coherence analysis is meaningful pruning of the original (usually dense) reaction-centric metabolic network. We achieved a similar effect by eliminating currency metabolites (ATP, H₂O, etc.) from the bipartite metabolic network before projecting the set of reaction nodes onto the network (see Methods section for additional information). Examples of both high and low coherence effective networks generated in the course of our study are shown in Fig. 2.

Network analysis yielded a single global quantity per individual, called the ‘metabolic network coherence’ of the corresponding gene expression profile. Formal assessment by means of a Kruskall-Wallis test revealed a highly significant difference in metabolic network coherence between the three diagnostic groups (χ² = 9.305, 2 d.f., p = 0.0095). The observed heterogeneity was entirely due to a lower level of coherence prevailing in the expression profiles of controls (median: −0.195) compared to CD (0.596) and UC (0.723) patients. No significant difference was observed between CD and UC (p > 0.2).

Multi-modality of metabolic network coherence

Visual inspection further revealed that the distribution of metabolic coherence values was characterized by prominent multi-modality (Fig. 3). The significance and precise stochastic nature of this finding were formally evaluated by mixture analysis as implemented in SAS procedure FMM (version 9.5; SAS Institute Inc., Cary, NC, USA). Since FMM is unsuitable for the analysis of heavily skewed distributions, we applied a standardized extreme deviation criterion^26,27 to define outliers as values more than 5.2 median absolute deviations away from the median (equivalent to a metabolic network coherence value > 3.578). Applying this threshold highlighted seven IBD samples and four control samples as outliers. Upon the exclusion of these values, use of a Bayes Information Criterion (BIC) yielded the best fit to the data for a mixture of two Gaussian distributions with mixing probabilities 0.267 (A) and 0.733 (B) (see Fig. 4A). Mean and variance were estimated as −0.272 and 0.017, respectively, for distribution A, and 1.029 and 1.206, respectively, for distribution B. Mixture analysis of individual patient subgroups yielded similar results for CD and UC, with nearly identical means but somewhat different variances (Supplementary Figure S2; Supplementary Table S1). Statistically significant substructure, as judged by a BIC, was also detected in the control profiles. Again, the best fit to the data was obtained with a mixture of two Gaussian distributions, and the respective mean and variance estimates were −0.278 and 0.080 for distribution A, and 1.618 and 0.403 for distribution B. Whilst these parameters were strikingly similar to those characterizing the metabolic network coherence distributions in patients, however, the mixing probabilities were reversed at 0.777 for distribution A, and 0.223 for distribution B (Fig. 4B).

High metabolic network coherence is obtained when expression level differences between different genes fit to the topology of the metabolic network, i.e. when expression levels tend to be more similar for genes that are connected in the network than would be expected by chance alone. This kind of coherence can be interpreted as meaning that the expression profile is partially ‘explicable’ by the network. For individuals with low metabolic network coherence, by contrast, other functional characteristics (beyond the metabolism-related state) would have to be invoked to ‘explain’ their gene expression profile.

The above results suggest that the intestinal gene expression profiles of children can be subdivided into two groups, one with metabolic network coherence of high average level and large variance, and one with notably lower average and smaller variance. These two subgroups are present at relative frequencies of approximately 1:3 in pediatric treatment-naive IBD patients, and 3:1 in same-aged controls, i.e. IBD is strongly associated with intestinal gene expression of high metabolic coherence. In principle, there are two basic explanations for this observation. Either high metabolic coherence or the biological causes thereof represent a risk factor for IBD at young age per se. In this case, our results potentially point towards novel disease mechanisms worth further exploration. Alternatively, the development or presence of pediatric IBD may cause a shift of gene expression from low to high coherence in some patients, but not in others. Even although our results would then lack immediate etiological relevance they may nevertheless lead to new insights into the mechanisms of disease progression, with potential benefits in terms of therapy and disease management.

Data mining

Classical data mining aims at discerning patterns in data without invoking additional contextual information. We applied multi-dimensional scaling (MDS) analysis to the original expression profile data of the pediatric IBD patients and controls. When the Euclidean distances between the original DESeq values were subjected to MDS, no particular pattern became apparent (Fig. 5A). However, a different result was obtained when the DESeq values were dichotomized according to whether or not they exceeded ± 3, in which case the respective gene was termed ‘saliently expressed’. Note that ± 3 is an appropriate threshold for z scores like the normalized DESeq values because ± 3 roughly demarcates the 1% quantile of the standard Gaussian distribution. With the dichotomous data, MDS revealed two clusters of expression profiles that could be distinguished well in the first dimension (Fig. 5C,D).

MDS analysis did not reveal any relationship between disease type or case-control status and cluster affiliation (Fig. 5A,C). However, virtually all expression profiles from the low coherence group, assigned to distribution (A) with > 80% certainty, were found to fall into only one of the two binary-distance based MDS clusters. The high coherence group (B) predominated the other cluster (Fig. 5D). Although less well-structured, the Euclidean distance-based MDS plots exhibited a bipartite partition as well (Fig. 5B). Similar results were obtained for IBD patients alone (Supplementary Figure S3).

The fact that MDS of the binary distance data yielded a more clear-cut result than MDS of the original DESeq values may appear surprising at first glance because, from a statistical point of view, dichotomization usually entails a loss of information. However, in the present situation, focusing the analysis upon saliently (i.e., particularly highly or lowly) expressed genes may have been equivalent to highlighting the relevant links between gene activity and metabolism and, at the same time, filtering out the noise that is likely to constitute intermediate expression levels.

In order to assess the possible role of known biological determinants of both gene expression and metabolism, we stratified the distribution of metabolic network coherence values by both age and sex. However, no influence of these two covariates became apparent (Fig. 6).

Saliently expressed genes

For each gene and each coherence group, we determined the proportion of profiles in which the gene was saliently expressed (I,e, DESeq > + 3 or DESeq < −3). When the two proportions were assessed for a statistically significant difference among IBD patients using a Fisher or chi-squared test as appropriate, and allowing for multiple testing, seven genes were found to be saliently expressed more often in one of the two coherence groups (Fig. 7, Table 1).

A change in metabolism has been hypothesized for long to play a role in the etiology of IBD. Early work, focused upon energy homeostasis in intestinal epithelial cells¹¹, revealed diminished butyrate oxidation to CO₂ and ketones as well as a shift to increased glucose and glutamine oxidation in UC patients in a process that potentially compensates for the concurrent decrease in fatty acid oxidation. The importance of fatty acid metabolism in IBD was further highlighted by the observation that the expression of fatty acid synthase and long chain acyl-CoA synthetases (ACSL) 1 and 4 genes is altered in IBD patients, and that this change probably reflects impaired sensing of bile acids via the LXR receptor²⁸. Intriguingly, we found two UDP glucuronosyltransferase genes to be saliently expressed more often in the high than the low coherence group of pediatric IBD patients (Table 1). For decades, the UDP glucuronosyltransferases of the intestinal mucosa have been known to contribute to the extrahepatic metabolism of bile acids^29,30, even though the precise role of this process in inflammatory responses is still poorly understood.

Table 1 Genes expressed saliently at significantly different proportions in low and high coherence expression profiles (IBD patients only).

Full size table

IL6 is a cytokine, known to promote intestinal inflammation, that has a clear role in fatty acid metabolism, for example, by stimulating apolipoprotein (a) expression and lipoprotein (a) synthesis in hepatocytes³¹. Along the same vein, the TM4SF4 gene encodes a transmembrane protein that stimulates thiamine resorption in intestinal epithelial cells³². Thiamine, in turn, is an essential component of several co-enzyme complexes, including pyruvate dehydrogenase that catalyzes the formation of Acetyl CoA as a first step in fatty acid synthesis. Interestingly, a variant in TM4SF4 was recently found to increase the risk for gallstone formation³³, a disease that involves impaired enterohepatic circulation of bile acids.

In summary, we may surmise that a functional link exists between fatty acid metabolism and inflammation that partly explains why high metabolic network coherence was more prevalent in IBD patients than controls in our study.

Conclusions

The pronounced heterogeneity of disease progression and therapy response observed among patients with inflammatory bowel diseases (IBD) calls for a more refined classification of cases to benefit both medical research and clinical care³⁴. Therefore, a careful assessment of the functional state of patient tissues as captured by high-throughput molecular data appears well warranted.

We used a network approach to analyze gene expression data from pediatric IBD patients and controls, not only to resolve otherwise indiscernible patterns in these data, but also to improve our understanding of the underlying disease mechanisms. The latter was facilitated by our drawing upon more general insights into a particular type of biological system, namely metabolic networks. Two distinct subgroups of expression profiles were identified on the basis of these considerations: one where the metabolic network coherence was high on average and varied substantially between individuals, and one where metabolic network coherence was distinctly lower and less variable. Whilst the latter group dominated the control group, the former was most prevalent in IBD patients. Whether this discrepancy reflects causes or consequences of disease manifestation remains unclear but warrants further exploration.

The metabolic network coherence-based classification of transcriptome profiles showcased here also bears potential for translation into clinical practice in that it opens an additional perspective for the biology-driven stratification of IBD patients. Since the success prospects of pharmacological therapies in IBD or in any other inflammatory disease are likely to be influenced by the peculiarities of the individual metabolism, metabolic network coherence may represent a suitable biomarker to distinguish between responder and non-responders, or to predict side effects, for certain treatments. In addition, as was evidenced by the different prevalence of high and low coherence in patients and controls, metabolic network coherence may also serve as a diagnostic marker, for example, to allow differentiation between IBD and non-IBD intestinal health problems.

Classical data mining was capable of identifying substructure in the gene expression data as well that mirrored the results of the metabolic network coherence analysis. The fact that the two coherence groups could be discerned without invoking the metabolic network itself suggests that the differences between the two patient groups reside at a more comprehensively systemic level, and that metabolism only served as a marker for these differences.

We employed publicly available transcriptome data from intestinal biopsies of mostly therapy-naive pediatric IBD patients. Even though some of the clinical characteristics (no previous immunosuppressive medication, sampling close to first diagnosis, narrow age range) render this group ideal for metabolism-centered analyses, it must be emphasized that pediatric IBD differs from adult IBD in several ways³⁵. Moreover, the controls employed in our study were considered “non-IBD” by the treating physicians, but still presented with intestinal health problems. Therefore, it cannot be excluded that presence of the high metabolic network coherence state in this group reflected particular non-inflammatory factors such as, for example, a specific infection. Therefore, it must be verified explicitly whether metabolic network coherence is also bimodal in adult IBD patients or in adults in general.

The present study also highlighted two synergistic aspects of the combination of network analysis and classical data mining. On the one hand, network analysis provides a means to use external contextual information to facilitate a better understanding of the results of classical data mining. On the other hand, classical data mining can lend statistical support to the qualitative results of network analysis. Nevertheless, experimental studies are now required to link the two distinct states of gene expression inferred by our combined in silico approach to etiological pathways. Such linkage would represent yet another critical step towards network medicine fulfilling its ultimate claim, namely to benefit patients by way of clinically actionable results.

Methods

Data

In this study, we used RNA-seq data of the RISK cohort¹³ comprising 321 intestinal tissue samples from treatment-naive pediatric patients with a confirmed diagnosis of Crohn disease (CD) or ulcerative colitis (UC), and from age-matched controls. The proband age ranged from 2 to 17 years, 40% of individuals were female. The CD group comprised 218 patients, 61 individuals were diagnosed with UC and 42 were controls. Ileal biopsies were taken from all individuals and gene expression was measured by RNA-seq. The original data were processed further using the DESeq algorithm for RPKM normalization. Recruitment procedure, data quality measures and data processing are described in detail in the original report¹³. Our analyses employed data publicly available at http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE57945. The original DESeq data consisted of one continuous score per gene (or transcript). Since metabolic network coherence analysis requires a binary score per gene, however, we had to dichotomize the data, labeling genes with a DESeq value < −3 or > + 3 as ‘saliently expressed’ (Fig. 8). The choice of this threshold was motivated by the fact that DESeq values are z scores, and that ± 3 roughly demarcates the 1% and 99% quantile, respectively, of the standard Gaussian distribution.

Metabolic network coherence

For metabolic network coherence analysis, we mapped the expression profiles of patients onto reaction-centric metabolic networks and studied the ensuing effective metabolic networks (i.e. subnetworks spanned by the saliently expressed genes e). For an effective network G_e(V,E) with a set of vertices (reactions) V = {r₁, r₂, …, r_K} and edges E, metabolic coherence C is computed as follows: Let k_i denote the degree of vertex r_i in the effective network and let K_c be the number of vertices r_i for which k_i > 0. The connectivity of the effective network (i.e., the number of reactions with non-zero degree divided by the size K of the effective network, R = K_c/K) reveals how ‘meaningful’ the gene-gene correlation in different expression state is from a metabolic perspective. An observed ratio R can be tested for statistical significance by means of comparing it to the null distribution. Here, the null distribution was simulated by randomly drawing the same number of saliently expressed metabolic genes from the set of all metabolic genes, leading to a set of ratios {R₁^(r), R₂^(r), … R_N^(r)} for random data with mean <R^(r)> and standard deviation σ(R^(r)). The metabolic coherence C(e) of a gene expression profile e is then defined as the z-score with respect to the null distribution, i.e. C(e) = (R-<R^(r)>)/σ(R^(r)). In cases, where the effective network comprised less than two nodes (19 CD, 7 UC, 4 controls), no metabolic coherence value could sensibly be computed.

Statistical analysis

The distribution of metabolic network coherence in different sub-groups was subjected to mixture analysis as implemented in SAS procedure FMM (version 9.5; SAS Institute Inc., Cary, NC, USA). In each case, the best fit was observed for two Gaussian distribution, albeit mixed at different proportions. Then, the posterior probability of being sampled from one of the two distributions was calculated of each individual profile. If one of the two posterior probabilities exceeded 0.8, the profile was classified as ‘highly’ or ‘lowly’ coherent, depending upon the respective distribution; otherwise, the profile was classified as ‘undetermined’ (Table 2). Differences between the metabolic network coherence distributions in different groups of profiles were assessed for statistical significance using a Kruskal-Wallis as implemented in SAS procedure NPAR1WAY.

Table 2 Number of high and low coherence expression profiles in different phenotypic subgroups.

Full size table

Data mining

Multidimensional scaling (MDS) analysis was performed with R v.3.1.3³⁶. As continuous input, we used Euclidean distances between gene-specific DESeq values. In addition, binary distances between dichotomized expression levels were calculated as implemented in R-command mds.

Graphs

Metabolic gene networks were generated from Recon 2 v.3 by connecting any two genes that shared a gene-enzyme-reaction-enzyme-gene relationship while excluding metabolites belonging to a list of ‘currency metabolites’ (e.g., ATP, H₂O). Currency metabolites were eliminated by removing the top 5% of metabolites after sorting them by their node degree in the gene-centric metabolic network. This way, 1009 of the 1101 original nodes remained in the network.

Additional Information

How to cite this article: Knecht, C. et al. Distinct metabolic network states manifest in the gene expression profiles of pediatric inflammatory bowel disease patients and controls. Sci. Rep. 6, 32584; doi: 10.1038/srep32584 (2016).

References

Barabási, A.-L., Gulbahce, N. & Loscalzo, J. Network medicine: a network-based approach to human disease. Nature Rev Genet 12, 56–68 (2011).
Article Google Scholar
Barabási, A.-L. The network takeover. Nature Physics 8, 14–16 (2012).
Article ADS Google Scholar
Hütt, M.-T. Understanding genetic variation - the value of systems biology. Br J Clin Pharmacol 77, 597–605 (2014).
Article Google Scholar
Ideker, T. & Krogan, N. J. Differential network biology. Molecular systems biology 8, 565 (2012).
Article Google Scholar
Schreiber, S., Rosenstiel, P., Albrecht, M., Hampe, J. & Krawczak, M. Genetics of Crohn disease, an archetypal inflammatory barrier disease. Nature Rev Genet 6, 376–388 (2005).
Article CAS Google Scholar
Liu, J. Z. et al. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat Genet 47, 979–986, doi: 10.1038/ng.3359 (2015).
Article CAS PubMed PubMed Central Google Scholar
Fukushima, K., Yonezawa, H. & Fiocchi, C. Inflammatory bowel disease-associated gene expression in intestinal epithelial cells by differential cDNA screening and mRNA display. Inflamm Bowel Dis 9, 290–301 (2003).
Article Google Scholar
Costello, C. M. et al. Dissection of the inflammatory bowel disease transcriptome using genome-wide cDNA microarrays. PLoS Med 2, e199, doi: 10.1371/journal.pmed.0020199 (2005).
Article CAS PubMed PubMed Central Google Scholar
Ben-Shachar, S. et al. Gene expression profiles of ileal inflammatory bowel disease correlate with disease phenotype and advance understanding of its immunopathogenesis. Inflamm Bowel Dis 19, 2509–2521, doi: 10.1097/01.MIB.0000437045.26036.00 (2013).
Article PubMed Google Scholar
Holgersen, K. et al. High-resolution gene expression profiling using RNA sequencing in patients with inflammatory bowel disease and in mouse models of colitis. J Crohns Colitis 9, 492–506, doi: 10.1093/ecco-jcc/jjv050 (2015).
Article PubMed Google Scholar
Roediger, W. E. The colonic epithelium in ulcerative colitis: an energy-deficiency disease? Lancet 2, 712–715 (1980).
Article CAS Google Scholar
Thiele, I. et al. A community-driven global reconstruction of human metabolism. Nature Biotechnol 31, 419–425 (2013).
Article CAS Google Scholar
Haberman, Y. et al. Pediatric Crohn disease patients exhibit specific ileal transcriptome and microbiome signature. J Clin Invest 124, 3617 (2014).
Article CAS Google Scholar
Sonnenschein, N., Geertz, M., Muskhelishvili, G. & Hütt, M.-T. Analog regulation of metabolic demand. BMC Syst Biol 5, 40 (2011).
Article Google Scholar
Sonnenschein, N. et al. A network perspective on metabolic inconsistency. BMC Syst Biol 6, 41 (2012).
Article Google Scholar
Price, N. D., Reed, J. L. & Palsson, B. Ø. Genome-scale models of microbial cells: evaluating the consequences of constraints. Nature Rev Microbiol 2, 886–897 (2004).
Article CAS Google Scholar
Kauffman, K., Prakash, P. & Edwards, J. S. Advances in flux balance analysis. Curr Opin Biotechnol 14, 491–496 (2003).
Article CAS Google Scholar
Varma, A. & Palsson, B. Ø. Stoichiometric flux balance models quantitatively predict growth and metabolic by-product secretion in wild-type Escherichia coli W3110. AEM 60, 3724–3731 (1994).
CAS Google Scholar
Edwards, J. S., Ibarra, R. U. & Palsson, B. Ø. In silico predictions of Escherichia coli metabolic capabilities are consistent with experimental data. Nature Biotechnol 19, 125–130 (2001).
Article CAS Google Scholar
Famili, I., Forster, J., Nielsen, J. & Palsson, B. Ø. Saccharomyces cerevisiae phenotypes can be predicted by using constraint-based analysis of a genome-scale reconstructed metabolic network. PNAS 100, 13134–13139 (2003).
Article CAS ADS Google Scholar
Duarte, N. C. et al. Global reconstruction of the human metabolic network based on genomic and bibliomic data. PNAS 104, 1777–1782 (2007).
Article CAS ADS Google Scholar
Ma, H. et al. The Edinburgh human metabolic network reconstruction and its functional analysis. Molecular systems biology 3, 135 (2007).
Article Google Scholar
Jerby, L., Shlomi, T. & Ruppin, E. Computational reconstruction of tissue-specific metabolic models: application to human liver metabolism. Molecular systems biology 6, 1–9 (2010).
Article Google Scholar
Gille, C. et al. HepatoNet1: a comprehensive metabolic reconstruction of the human hepatocyte for the analysis of liver physiology. Molecular systems biology 6, 411, doi: 10.1038/msb.2010.62 (2010).
Article CAS PubMed PubMed Central Google Scholar
Mintz-Oron, S. et al. Reconstruction of Arabidopsis metabolic network models accounting for subcellular compartmentalization and tissue-specificity. PNAS 109, 339–344 (2012).
Article CAS ADS Google Scholar
Hedderich, J. & Sachs, L. Angewandte Statistik. Vol. 14 (Springer-Verlag, 2012).
Hampel, F. R. The Breakdown Points of the Mean Combined with Some Rejection Rules. Technometrics 27, 95–107, doi: 10.2307/1268758 (1985).
Article MathSciNet MATH Google Scholar
Heimerl, S. et al. Alterations in intestinal fatty acid metabolism in inflammatory bowel disease. Biochim Biophys Acta 1762, 341–350, doi: 10.1016/j.bbadis.2005.12.006 (2006).
Article CAS PubMed Google Scholar
Rachmilewitz, D. & Saunders, D. R. Metabolism of chenodeoxycholate by intestinal mucosa. Gastroenterology 71, 82–86 (1976).
Article CAS Google Scholar
Matern, S., Matern, H., Farthmann, E. H. & Gerok, W. Hepatic and extrahepatic glucuronidation of bile acids in man. Characterization of bile acid uridine 5′-diphosphate-glucuronosyltransferase in hepatic, renal, and intestinal microsomes. The Journal of clinical investigation 74, 402–410, doi: 10.1172/JCI111435 (1984).
Article CAS PubMed PubMed Central Google Scholar
Müller, N. et al. IL-6 blockade by monoclonal antibodies inhibits apolipoprotein (a) expression and lipoprotein (a) synthesis in humans. J Lipid Res 56, 1034–1042, doi: 10.1194/jlr.P052209 (2015).
Article CAS PubMed PubMed Central Google Scholar
Subramanian, V. S., Nabokina, S. M. & Said, H. M. Association of TM4SF4 with the human thiamine transporter-2 in intestinal epithelial cells. Dig Dis Sci 59, 583–590, doi: 10.1007/s10620-013-2952-y (2014).
Article CAS PubMed Google Scholar
Joshi, A. D. et al. Four Susceptibility Loci for Gallstone Disease Identified in a Meta-analysis of Genome-Wide Association Studies. Gastroenterology, doi: 10.1053/j.gastro.2016.04.007 (2016).
Vermeire, S., Van Assche, G. & Rutgeerts, P. Role of genetics in prediction of disease course and response to therapy. World J Gastroenterol 16, 2609–2615 (2010).
Article CAS Google Scholar
Dubinsky, M. Special issues in pediatric inflammatory bowel disease. World J Gastroenterol 14, 413–420 (2008).
Article Google Scholar
R Development Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, 2016).

Download references

Acknowledgements

This work was supported by the German Federal Ministry of Education and Research (BMBF) within the e:Med research and funding framework (grants 01ZX1306A and 01ZX1306D).

Author information

Knecht Carolin and Fretter Christoph contributed equally to this work.

Authors and Affiliations

Institute of Medical Informatics and Statistics, Christian-Albrechts University Kiel, Kiel, Germany
Carolin Knecht & Michael Krawczak
Department of Life Sciences and Chemistry, Jacobs University, Bremen, Germany
Christoph Fretter & Marc-Thorsten Hütt
Institute of Clinical Molecular Biology, Center for Molecular Biosciences, Christian-Albrechts University Kiel, Kiel, Germany
Philip Rosenstiel

Authors

Carolin Knecht
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Fretter
View author publications
You can also search for this author in PubMed Google Scholar
Philip Rosenstiel
View author publications
You can also search for this author in PubMed Google Scholar
Michael Krawczak
View author publications
You can also search for this author in PubMed Google Scholar
Marc-Thorsten Hütt
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.T.H. and M.K. conceived this study. C.F. and C.K. performed the numerical simulations and data analyses. All authors participated in the interpretation of the findings and drafted the manuscript. All authors read and approved the final manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Knecht, C., Fretter, C., Rosenstiel, P. et al. Distinct metabolic network states manifest in the gene expression profiles of pediatric inflammatory bowel disease patients and controls. Sci Rep 6, 32584 (2016). https://doi.org/10.1038/srep32584

Download citation

Received: 21 April 2016
Accepted: 10 August 2016
Published: 02 September 2016
DOI: https://doi.org/10.1038/srep32584

This article is cited by

Identifying metabolic shifts in Crohn's disease using 'omics-driven contextualized computational metabolic network models
- Philip Fernandes
- Yash Sharma
- Sana Syed
Scientific Reports (2023)
Network location and clustering of genetic mutations determine chronicity in a stylized model of genetic diseases
- Piotr Nyczka
- Johannes Falk
- Marc-Thorsten Hütt
Scientific Reports (2022)
A hexokinase isoenzyme switch in human liver cancer cells promotes lipogenesis and enhances innate immunity
- Laure Perrin-Cocon
- Pierre-Olivier Vidalain
- Olivier Diaz
Communications Biology (2021)
The metabolic network coherence of human transcriptomes is associated with genetic variation at the cadherin 18 locus
- Kristina Schlicht
- Piotr Nyczka
- Michael Krawczak
Human Genetics (2019)
An integrative network-based approach to identify novel disease genes and pathways: a case study in the context of inflammatory bowel disease
- Ryohei Eguchi
- Mohammand Bozlul Karim
- Md. Altaf-Ul-Amin
BMC Bioinformatics (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results and Discussion

Quantification of metabolic coherence

Multi-modality of metabolic network coherence

Data mining

Saliently expressed genes

Conclusions

Methods

Data

Metabolic network coherence

Statistical analysis

Data mining

Graphs

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links