Genetically personalised organ-specific metabolic models in health and disease

Foguet, Carles; Xu, Yu; Ritchie, Scott C.; Lambert, Samuel A.; Persyn, Elodie; Nath, Artika P.; Davenport, Emma E.; Roberts, David J.; Paul, Dirk S.; Di Angelantonio, Emanuele; Danesh, John; Butterworth, Adam S.; Yau, Christopher; Inouye, Michael

doi:10.1038/s41467-022-35017-7

Download PDF

Article
Open access
Published: 29 November 2022

Genetically personalised organ-specific metabolic models in health and disease

Carles Foguet ORCID: orcid.org/0000-0001-8494-9595^1,2,3,4,
Yu Xu ORCID: orcid.org/0000-0002-7304-5045^1,3,4,
Scott C. Ritchie ORCID: orcid.org/0000-0002-8454-9548^1,3,4,5,6,
Samuel A. Lambert ORCID: orcid.org/0000-0001-8222-008X^1,2,3,4,
Elodie Persyn^1,3,4,
Artika P. Nath^1,6,
Emma E. Davenport⁷,
David J. Roberts^8,9,10,
Dirk S. Paul ORCID: orcid.org/0000-0002-8230-0116^3,4,5,
Emanuele Di Angelantonio^2,3,4,5,9,11,
John Danesh^2,3,4,5,7,9,
Adam S. Butterworth ORCID: orcid.org/0000-0002-6915-9015^2,3,4,5,9,
Christopher Yau^12,13 &
…
Michael Inouye ORCID: orcid.org/0000-0001-9413-6520^{1,2,3,4,5,6,14}

Nature Communications volume 13, Article number: 7356 (2022) Cite this article

6870 Accesses
8 Citations
29 Altmetric
Metrics details

Subjects

Abstract

Understanding how genetic variants influence disease risk and complex traits (variant-to-function) is one of the major challenges in human genetics. Here we present a model-driven framework to leverage human genome-scale metabolic networks to define how genetic variants affect biochemical reaction fluxes across major human tissues, including skeletal muscle, adipose, liver, brain and heart. As proof of concept, we build personalised organ-specific metabolic flux models for 524,615 individuals of the INTERVAL and UK Biobank cohorts and perform a fluxome-wide association study (FWAS) to identify 4312 associations between personalised flux values and the concentration of metabolites in blood. Furthermore, we apply FWAS to identify 92 metabolic fluxes associated with the risk of developing coronary artery disease, many of which are linked to processes previously described to play in role in the disease. Our work demonstrates that genetically personalised metabolic models can elucidate the downstream effects of genetic variants on biochemical reactions involved in common human diseases.

Genome-scale models in human metabologenomics

Article 19 September 2024

A dynamic multi-tissue model to study human metabolism

Article Open access 22 January 2021

Systems genetics applications in metabolism research

Article 21 October 2019

Introduction

Genome-wide association studies (GWAS) have identified more than 50,000 genetic variants associated with complex traits or diseases¹. While the contribution of individual variants to a given phenotype is generally small, the effect of multiple genetic variants can be aggregated into polygenic scores (PGS), which are highly predictive of disease incidence and enhance existing risk models^2,3,4. However, while GWAS and PGS can be useful for risk stratification^5,6,7, the mechanisms by which genetic variants influence disease risk, i.e., variant to function (V2F), remain largely unsolved. Addressing V2F is a major challenge in human genetics and has the potential to unveil many new therapeutic targets^6,8,9.

An approach to address the V2F challenge is to quantify how genetic variation causes disease through the regulation of molecular traits. To this end, genetic variants affecting gene expression are identified and subsequently aggregated into models that can impute the abundance of transcripts and proteins^10,11,12. For example, PredictDB is a database that offers a collection of linear models to impute transcript levels in specific organs of the human body¹³. PredictDB models were trained in the GTEx dataset, which contains genotype profiling and tissue-specific transcript abundance from post-mortem donors¹⁴. Imputed transcript or protein levels can be used to perform transcriptome-wide or proteome-wide association analyses, respectively, to identify gene products associated with disease^10,15. Alternatively, PGSs for disease can be used to identify proteins and other gene products which may disrupt polygenic risk¹⁶. However, transcripts and proteins do not exert their effects in isolation but in highly connected and complex biological networks. Indeed, previous studies have shown the merit of analysing genetic variation in the context of gene co-expression and gene interaction networks to characterise how the effects of genetic variants contribute to complex traits or diseases by propagating through biological networks^17,18,19,20.

Metabolism is one of the most prominent biological networks and a comparatively tractable experimental setting in which to address the V2F challenge. Essentially, metabolism is a set of interconnected chemical reactions and transport processes occurring in a highly ordered, regulated and coordinated manner across multiple organs in the human body²¹. The metabolic phenotype of a given organ is defined by both metabolite concentrations and metabolic fluxes (i.e., the rates at which substrates are converted to products through reactions) and emerges from the complex interaction of metabolites, enzymes, and transmembrane carriers^22,23. Metabolite concentrations offer a static snapshot of metabolite distributions, whereas metabolic fluxes provide a map of metabolite traffic through metabolic pathways²⁴.

Genome-scale metabolic models (GSMMs), mathematical representations of the metabolic reaction network arising from the human genome^25,26, simulate steady-state metabolic fluxes by formulating network stoichiometry as sets of linear equations and directionality constraints²⁷. GSMMs have emerged as a useful approach to integrate transcriptomics, proteomics, and metabolomics to characterise metabolic flux maps^28,29. For example, proteomics, metabolomics, and physiological data have been used to build human organ-specific GSMMs³⁰. Similarly, there is increasing interest in integrating individual measures to build personalised GSMMs that reflect the specific metabolic phenotype in each individual, thus facilitating personalised medicine^{30,31,32,33,34}.

Since gene expression is highly heritable^10,13, it may be feasible to leverage human genome-scale metabolic networks to analyse the system-wide effects of genetic variants on metabolism and build genetically personalised GSMMs. To this end, we present a framework where transcript levels imputed from genetic data can be used to simulate personalised and organ-specific, genome-scale flux maps using the quadratic metabolic transformation algorithm (qMTA). Such flux maps provide genetically personalised metabolic models at a genome scale for each tissue. As proof of concept, we build personalised organ-specific flux maps for over 520,000 individuals across the INTERVAL^35,36 and UK Biobank (UKB)³⁷ cohorts, then perform a fluxome-wide association study (FWAS) to test the association between organ-specific flux values and directly measured blood metabolite levels. Finally, we apply FWAS to identify fluxes associated with coronary artery disease (CAD), thus demonstrating the potential of genome-scale flux maps for V2F by elucidating intermediary biochemical reactions between genetic variation and common disease.

Results

A computational framework for genetically personalised organ-specific GSMMs

We developed a framework for building personalised organ-specific flux maps from genotype data (Fig. 1; Methods). First, we extract the organ-specific models from the Harvey/Harvetta multiorgan model³⁰, which provide a set of curated metabolic networks for the main organs of the human body. Harvey/Harvetta models were built from the Recon3D human GSMM²⁵, which has been superseded by HUMAN1²⁶. HUMAN1 shares 97% of reactions with Recon3D, but it incorporates a myriad of improvements in gene-reaction rules, reaction reversibility and stoichiometric consistency compared to the latter. Hence, we performed a liftover of the Harvey/Harvetta organ-specific models to HUMAN1 (Methods).

**Fig. 1: Framework for computing organ-specific personalised genome-scale flux maps from genotype data and performing fluxome-wide association study (FWAS).**

With the HUMAN1-based organ-specific models, the next step is to compute a reference flux distribution for each organ under consideration. This is achieved by defining organ-specific metabolic objectives that must be fulfilled (e.g., synthesis of neurotransmitters in the brain), obtaining average organ transcript abundances from GTEx¹⁴ and using them as an input for the GIM3E algorithm³⁸. GIM3E is an algorithm that, subject to fulfilling the organ-specific metabolic objectives, seeks to minimise the overall flux through the network using transcript abundance data to give each reaction a minimisation weight inversely proportional to the expression of the enzymes catalysing it. Subsequently, flux sampling³⁹ is applied to identify a representative flux distribution (i.e., sets of flux values) in the solution space within 99% of the GIM3E optimal solution. The resulting set of flux values, termed reference flux distribution, is both enzymatically efficient and consistent with the average transcript abundances in each organ (Supplementary Fig. 1). The flux distribution can be assumed to represent the average metabolic state of each modelled organ in the general population.

Subsequently, models from PredictDB¹³ are used to impute personalised organ-specific transcript abundances from genotype data. The resulting imputed transcript data are mapped to reactions in the organ-specific subnetworks as putative reaction activity fold changes relative to the average organ-specific transcript expression in GTEx¹⁴. The imputed personalised reaction activity fold changes and the reference flux distributions are then utilised by the qMTA to compute genetically personalised organ-specific flux maps. Briefly, qMTA finds the flux distributions most consistent with the putative reaction activity fold changes in each individual (Supplementary Fig. 2; Methods).

Building flux maps for >520,000 individuals

Using the above framework, we built personalised organ-specific flux maps for 37,220 and 487,395 individuals from the INTERVAL^35,36 and UKB³⁷ cohorts, respectively. Personalised models were generated for skeletal muscle, adipose tissue, liver, brain, and heart, which together account for roughly 66% of body weight in an average adult⁴⁰. Overall, 14,220 reaction flux values were computed for each individual. Metabolic fluxes “flow” through pathways where the product of one reaction is the substrate of successive reactions; thus, many of the flux values computed in each individual will have inherent dependencies (Supplementary Fig. 3A–C). As such, from the 14,220 reaction flux values, we selected a subset of 4300 flux values without strong pairwise correlations (ρ < 0.9) for further analysis (Supplementary Fig. 3D; Methods).

Principal component analysis of the personalised organ-specific flux values for individuals of INTERVAL and UKB showed the underlying structure in the data (Supplementary Fig. 4). Fluxes with the greatest loadings on top principal components (PCs) tended to be related to the known metabolism of each organ (Supplementary Data 1). For example, in the liver, fluxes through reactions and transport processes of amino acid, glycerophospholipid, and nucleotide metabolism exhibited large loadings along the first five PCs. Key reactions in cholesterol and bile acid biosynthesis also had large PC loadings, reflecting the function of the liver in cholesterol homoeostasis²¹. In both skeletal muscle and heart, the top PCs were associated with fluxes through transport processes of amino acids and reactions related to fatty acid β-oxidation, processes which play key roles in skeletal muscle and heart^41,42,43,44. Notably, in the brain, the main loadings on the top principal components were distributed across a wide range of pathways. For instance, PC1 was associated with reactions and transport processes involving bile acids and their precursors. Bile acids, which can be synthesised within the brain and can also be transported across the blood-brain barrier, have been reported to act as regulators of neurological functions^45,46. Likewise, PC2, and to a lesser extent PC3, were related to reactions and transport processes from amino acid metabolism, including reactions linked to neurotransmitters such as dopamine, glycine, glutamate, and nitric oxide. PC4 was associated with reactions of fatty acid metabolism, most notably several reactions involving arachidonic acid, a conditionally essential fatty acid with many roles in brain function in health and disease^47,48. Lastly, PC5 was primarily associated with reactions of nucleotide metabolism. Finally, in adipose tissue, all PCs were strongly associated with fatty acid metabolism, including reactions involved in their oxidation, biosynthesis and transport. However, PC2 and PC3 were also associated with reactions of steroid metabolism, reflecting adipose tissue’s capacity to synthesise and convert steroids⁴⁹.

Fluxome-wide association study for blood metabolites

We next validated that genetically personalised GSMMs could generate reliable and meaningful flux predictions across cohorts. As phenotypes, organ-specific flux maps are expected to lead to distinct profiles in the blood metabolome. To demonstrate this, we performed an association analysis by individually regressing each measured blood metabolic feature against the 4300 personalised fluxes computed in the INTERVAL^35,36 and UKB³⁷ cohorts (Supplementary Fig. 2; Methods). The blood metabolome for INTERVAL comprised both Nightingale Health NMR assays (N = 37,720 participants) and Metabolon HD4 mass spec assays (N = 8115 participants)⁵⁰. In UKB, blood samples for 120,266 participants were profiled with Nightingale Health NMR⁵¹.

For INTERVAL, an FDR-adjusted significance threshold of P < 1.0 × 10⁻⁶ was applied to control for all tested pairs (Methods). We identified 4312 significant associations between flux values and blood metabolic features in total, of which 1066 were for the Nightingale platform and 3246 for Metabolon (Supplementary Data 2, Fig. 2A, B). Consistent with the role of the liver in whole-body metabolic homoeostasis²¹, the liver was the organ with the most associations (1301), followed by the heart (1005), skeletal muscle (896), brain (593), and adipose tissue (517) (Fig. 2C, Supplementary Data 2). We externally validated the INTERVAL flux associations with Nightingale metabolites using UKB (Fig. 2D). We found 83% of the INTERVAL associations replicated in UKB with an FDR-adjusted significance threshold of P < 1.0 × 10⁻⁶ and consistent direction of the effect sizes. Effect sizes were themselves highly correlated (ρ = 0.82) between INTERVAL and UKB (Supplementary Fig. 5).

**Fig. 2: Fluxome-wide association study (FWAS) between genetically personalised flux maps and blood metabolic features.**

Finally, we also evaluated the effect of the underlying genome-scale reconstructions of human metabolism in the FWAS for blood metabolic features. With this aim, we used organ-specific models built from the Recon3D human GSMM^25,30 to compute genetically-personalised fluxes for the INTERVAL cohort^35,36, test their association to blood metabolic features, and compare the results to the above-described FWAS that had used fluxes computed with HUMAN1-based models. We identified 3895 significant associations between blood metabolic features and the genetically personalised flux values computed using Recon3D-based organ-specific models (Supplementary Data 2). There was a significant overlap with HUMAN1 models as 1761 of these associations could be replicated in the HUMAN1-based FWAS, and the associated effect sizes on blood metabolites were highly correlated between HUMAN1 and Recon3D analyses (ρ = 0.72). However, 2134 associations were only statistically significant on the Recon3D-based analysis and could not be replicated with HUMAN1 models. Likewise, of the 4312 significant associations between blood metabolic features and fluxes computed using HUMAN1 models, 2551 associations could not be detected with Recon3D-based models. Such discrepancy between HUMAN1- and Recon3D-based analyses is not surprising; HUMAN1²⁶, which is a newer reconstruction of human metabolism than Recon3D²⁵, expands gene reaction annotations and refines reaction reversibility, both of which can have significant effects on how genetic variation propagates through the network and, thus, can lead to significant differences in the resulting personalised flux maps and the downstream FWAS. Indeed, many discrepancies between the Recon3D and HUMAN1 results are likely artefacts emerging from erroneous or incomplete annotations in Recon3D. Throughout this work, we focus on the analyses and discussion of HUMAN1-based fluxes, as HUMAN1 has been established to be a better representation of human metabolism²⁶, but results obtained with Recon3D-based models will also be provided in the appropriate supplementary data (Supplementary Data 2, Supplementary Data 3, and Supplementary Data 4).

Fluxome associations by metabolic feature class and reaction system

The 4312 significant associations comprised 229 unique blood metabolic features and 763 unique organ-specific metabolic fluxes. Consistent with the coverage of the Nightingale Health and Metabolon HD4 platforms, we found that most of these blood metabolic features were lipid-related (Fig. 3A, Supplementary Data 2). Glycerides and phospholipids were enriched in associations across all organs relative to all features profiled with the Metabolon HD4 assay (Methods), suggesting an association with core reactions (i.e., active in all modelled organs). The liver and adipose tissue were also enriched in associations with steroids, reflecting the role of such organs in cholesterol²¹ and steroid hormone metabolism⁴⁹.

**Fig. 3: Characterisation of the significant associations between blood metabolic features and metabolic fluxes.**

We further assessed the metabolic systems of the 763 organ-specific metabolic fluxes from the significant associations (Fig. 3B, Supplementary Data 2) and found that most reactions were functionally part of lipid metabolism, consistent with a large number of associations with lipid metabolic features. Reactions of fatty acid metabolism were significantly enriched in associations with blood metabolic features in all organs relative to all analysed reactions in each organ-specific metabolic network. In the liver, reactions of glyceride and phospholipid metabolism and bile acid metabolism were also enriched.

There was widespread consistency between biochemical pathways and blood metabolic feature classes (Fig. 3C, Supplementary Data 2). For example, reactions from the glycerides and phospholipids system were primarily associated with blood metabolic features of glycerides and phospholipids as well as lipoprotein fractions and their constituents. We found that reactions of fatty acid metabolism were associated mainly with blood glyceride and phospholipids, followed by fatty acids, which themselves provide acyl chains to glycerides and phospholipids. Similarly, reactions from nucleotide metabolism and amino acid metabolism were primarily associated with blood metabolic features of nucleotides and amino acids, respectively.

Fluxes of the hepatic triacylglycerol to cholesteryl ester pathway and blood lipoproteins

In the liver, we identified 555 associations between fluxes and lipoprotein fractions (Supplementary Data 2). Most of these associations were to reactions of glycerides and phospholipids metabolism, which were enriched in associations relative to all analysed liver fluxes (Methods; Fig. 3B). FWAS revealed that a major determinant of triacylglycerols (TAG), free cholesterol (FC), and cholesteryl esters (CE) fractions in lipoproteins was a sequential set of reactions which we term the TAG to cholesterol esterification (TAG-CE) pathway (Fig. 4). In the TAG-CE pathway, TAGs are hydrolysed to diglycerides and fatty acids in the liver, diacylglycerides are then used as a substrate to synthesise phospholipids (i.e., phosphatidylcholine and phosphatidylethanolamine) which are subsequently used as substrates to esterify FC. We found that fluxes through reactions of the TAG-CE pathway were strongly associated with an increased percentage of CE in HDL and decreased TAG levels in LDL and HDL (Table 1, Supplementary Data 3). The pathway was also strongly associated with reduced HDL size, likely driven by a reduction of TAG levels in HDL⁵². While the associations were primarily found in the liver-specific flux map with mediation by liver-expressed enzymes, these pathways are not necessarily constrained to the liver. For example, the hepatic TAG lipase localises to both the liver and blood⁵³. Similarly, phospholipids synthesised in the liver may be transferred to HDL in circulation, where they can fuel cholesterol esterification catalysed by the liver-secreted lecithin-cholesterol acyltransferase (LCAT)^52,54.

**Fig. 4: Triacylglycerol to cholesteryl ester pathway in the liver.**

Table 1 Top associations between metabolic fluxes and lipoproteins in the hepatic triacylglycerol to cholesteryl ester pathway

Full size table

Components of the TAG-CE pathway have been the subject of various studies. For example, rare deficiencies in hepatic lipase activity have been linked to increased TAG levels and decreased CE levels in HDL⁵⁵. Similarly, genetic variants in hepatic lipase have been associated with total cholesterol levels in HDL^56,57. For phospholipids, blocking phosphatidylcholine synthesis has been shown to result in cellular accumulation of TAG both in vitro and in vivo^58,59. Similarly, LCAT deficiencies have been associated with reduced cholesterol esterification and increased triglycerides in plasma^60,61. It has also been suggested that cholesterol may be inefficiently esterified by LCAT in patients with CAD, leading to a lower CE to FC ratio⁶².

Conversely, flux through reactions disrupting TAG-CE, such as cholesterol esterase, are predicted by our FWAS to have the opposite effect and are associated with increased TAG levels and decreased cholesterol esterification (Table 1, Supplementary Data 3). Among such reactions, there is the hydrolysis of retinyl esters which can act as an alternative source of free fatty acids inhibiting TAG lipase activity (Fig. 4). Retinyl esters are the most abundant form of vitamin A in the human body and are its most common form in diets and vitamin supplements⁶³. Dietary retinol is esterified in enterocytes, and most of it is transported to hepatocytes by means of lipoproteins, where it is subsequently hydrolysed and transferred to stellate cells for storage⁶⁴. Notably, the administration of high doses of retinol derivatives has been reported to increase total triglyceride levels and, in some instances, increase total cholesterol in LDL while decreasing total cholesterol in HDL^65,66,67,68. We hypothesise that this occurs because retinyl esters disrupt the hepatic TAG-CE pathway, inhibiting triglyceride lipase and reducing cholesterol esterification, thus reducing the capacity of HDL to collect FC from other lipoproteins such as LDL^52,62,69.

FWAS identifies metabolic fluxes associated with coronary artery disease

We extended our approach of fluxome-wide analysis to common diseases and performed a multi-tissue FWAS for CAD in UKB. We evaluated the association of the 4300 metabolic fluxes with CAD using Cox regression (Methods), which identified 92 significant associations (FDR-adjusted P value < 0.05 controlling for all tested fluxes). Of such associations, 31 could be replicated with genetically personalised fluxes computed with Recon3D-based models, whereas 61 were specific to the HUMAN1-based models. Liver fluxes yielded the largest number of significant associations with CAD (N = 32), followed by fluxes from the adipose tissue (N = 26), heart (N = 15), brain (N = 10), and skeletal muscle (N = 9) (Fig. 5; Supplementary Data 4).

**Fig. 5: Fluxome-wide association analysis (FWAS) between genetically personalised flux values and coronary artery disease.**

The flux of histamine synthesis through histidine decarboxylase was shown to be strongly associated with CAD in adipose tissue with a hazard ratio (HR) per s.d. of log-transformed flux value of 1.060 and a P value of 2.33 × 10⁻²⁷ (Supplementary Data 4). Such association was also detected in the liver, where both the fluxes through histidine uptake (HR = 1.024 per s.d., P = 1.65 × 10⁻⁵) and histidine decarboxylation (HR = 1.027 per s.d., P = 8.60 × 10⁻⁷) were associated with increased CAD risk. Histamine is an inflammatory mediator synthesised from histidine primarily in mast cells⁷⁰, which reside in most tissues, including liver and adipose tissue^71,72. Histamine has been reported to be associated with atherosclerosis via blood lipids and lipoprotein fractions as well as by promoting inflammation^73,74,75,76. In adipose tissue, polyamine synthesis was also associated with reduced CAD risk (spermidine synthase: HR = 0.9517 per s.d., P = 6.06 × 10⁻²¹). Notably, polyamine-rich diets have been established to have a protective effect against cardiovascular disease^77,78. Moreover, it has recently been determined that polyamines produced by adipose endothelial cells might protect against obesity, a known risk factor for CAD⁷⁹, by promoting vascularisation and lipolysis in white adipose tissue⁸⁰.

Concerning lipid metabolism, the fluxes through the TAG lipase reactions in adipose tissue (HR = 0.9652 per s.d., P = 6.48 × 10⁻¹¹), heart (HR = 0.9675 per s.d., P = 1.06 × 10⁻⁹) and skeletal muscle (HR = 0.9693 per s.d., P = 9.83 × 10⁻⁹) were strongly associated with reduced CAD risk, consistent with the anti-atherogenic effect of lipoprotein lipase activity in these organs⁸¹. Similarly, the release of free fatty acids from adipose tissue was also associated with reduced CAD risk (e.g., the release of oleic acid: HR = 0.9688 per s.d., P = 5.09 × 10⁻¹⁹ and release of myristic acid: HR = 0.9691 per s.d., P = 6.64 × 10⁻⁹). Interestingly, not only is the release of free fatty acids part of normal adipocyte function²¹, but it is also a key part of the polyamine-driven signalling cascade in adipose tissue⁸⁰. Conversely, the flux through the phospholipase reaction was associated with increased CAD risk in adipose tissue (HR = 1.027 per s.d., P = 9.48 × 10⁻⁷). Notably, phospholipase activities have been suggested to have a causal role in atherosclerosis and have been investigated as potential pharmacological targets to prevent atherosclerosis and CAD^82,83,84,85.

The fluxes through several transport processes were also identified as associated with CAD. For instance, histamine transport in the liver appears to be associated with CAD risk in a transport process-specific manner, with histamine transport through uniport being associated with decreased CAD risk (HR = 0.942 per s.d., P = 2.60 × 10⁻²⁸) and its antiport with glutathione being associated to increased risk (HR = 1.051 per s.d., P = 9.77 × 10⁻²⁰). Also in the liver, transport of bilirubin conjugates was associated to decreased CAD risk (transport of bilirubin-monoglucuronoside: HR = 0.9629 per s.d., P = 2.74 × 10⁻¹² and transport of bilirubin-bisglucuronoside: HR = 0.9808 per s.d., P = 3.49 × 10⁻⁴). Notably, the transport process of bilirubin-monoglucuronoside is mediated by SLCO1B1, which also mediates the hepatic uptake of statins, enhancing their therapeutic efficacy^86,87. Interestingly, high levels of total bilirubin in blood have been associated with decreased risk of CAD^88,89, likely mediated by the modulation of arterial diameter and reactivity⁹⁰. In both the brain and heart, the flux of prostaglandin E2 transport was also associated with increased CAD risk (brain: HR = 1.058 per s.d., P = 4.23 × 10⁻²⁶ and heart: HR = 1.049 per s.d., P = 5.27 × 10⁻¹⁹). Prostaglandin E2 is an inflammatory mediator that promotes inflammation and has been reported to contribute to the development of atherosclerotic lesions^91,92. Additionally, in the brain, the transport of norepinephrine, a neurotransmitter that can increase blood pressure and may play a role in atherosclerosis^93,94, was also associated with increased risk of CAD (HR = 1.041 per s.d., P = 4.49 × 10⁻¹⁴).

Discussion

Here, we present a new framework that uses metabolic modelling to leverage the stoichiometric relationships of enzymes in human genome-scale metabolic networks to characterise how genetic variants affect metabolic phenotypes. We achieve this by integrating genetic effects on transcript levels into organ-specific GSMMs and simulating how they propagate and interact into genome-scale flux maps of major human organs. To validate our method, we built organ-specific models for the liver, heart, skeletal muscle, brain, and adipose tissue for over 520,000 individuals from the INTERVAL^35,36 and UKB³⁷ cohorts, surpassing by more than two orders of magnitude the number of personalised GSMMs built in previous works^30,31,32,33. Association analyses were performed between genetically-personalised flux values and directly measured blood metabolites in both INTERVAL and UKB, identifying many significant and replicable associations. As expected, we found that most blood metabolic features were associated with functionally related flux pathways. Finally, we demonstrate that fluxome-wide analysis can be used to identify putative metabolic drivers of CAD.

With cardiovascular disease being a leading cause of mortality and comorbidity worldwide⁹⁵, the identification of specific biochemical reactions linked to CAD using population-scale genomic data is of significant interest to both basic discovery science and the development of therapeutics. Indeed, many of the 92 flux associations we identified involve pathways or metabolites that have been associated with CAD in existing studies, such as histamine^73,76, TAGs⁹⁶, or phospholipase activity^82,83,85. The modulation of some of these fluxes has been explored as therapies for CAD, namely several phospholipase inhibitors^83,84.

Our analysis has several limitations. For instance, as a proof of concept, this study focused on modelling only the five most prominent human organs⁴⁰, and thus we can only identify flux to phenotype associations in the liver, heart, skeletal muscle, brain, and adipose tissue. However, given the availability of models to impute tissue- or cell-specific transcript abundance from genotype¹³, this analysis can easily be expanded to other tissues and cell types. Indeed, we envision that future applications may select organs for modelling based on the target diseases or phenotypes. Furthermore, the modelling framework presented here is limited to only simulating the effect of genetic variants affecting transcript levels. In the future, it could also be expanded to model the impact of gain or loss of function variants⁹⁷ and environmental variables (e.g., diet, lifestyle, and medication) on the personalised flux maps. Additionally, while transcript levels are widely used in genome-scale metabolic modelling^26,29,38, protein levels have a more direct effect on enzymatic activity, and new methods are being developed to fully integrate them into GSMMs^26,98. With models to impute the levels of proteins becoming increasingly available^12,15, we expect that the framework for computing genetically personalised fluxes will be extended to integrate the protein layer in the future. Finally, an inherent limitation of our analysis is that it is dependent on the quality of the underlying metabolic networks and their gene-reaction annotations. Indeed, we determined that an important number of the associations between fluxes and blood metabolomics or CAD risk could not be replicated with models based on an earlier reconstruction of human metabolism (i.e., Recon3D²⁵). However, with human GSMMs becoming increasingly more well-annotated²⁶, differences in FWAS results using models built from different genome-scale reconstructions of human metabolism will progressively become more subtle.

Concerning translating genetically personalised models and fluxes to clinical applications, GSMMs have already been established to have utility for drug discovery and repositioning^{32,99,100,101,102}. Therefore, FWAS may enable identifying fluxes associated with disease states and, by extension, the gene knockdowns or metabolic interventions (e.g., dietary supplements or metabolic inhibitors) to target them. FWAS to blood metabolic features may also help screen for potential adverse side effects of metabolic interventions. For example, we identified that retinyl esters might increase TAG levels and reduce cholesterol esterification in lipoproteins, consistent with reports that administering high doses of vitamin A derivatives results in hypertriglyceridemia and dysregulation of cholesterol levels^65,66,67,68. Furthermore, while it is very early days, personalised fluxes associated with disease risk could also be incorporated into existing risk prediction models, potentially enhancing their predictive capacity.

Overall, this work demonstrates that genome-scale metabolic modelling can contribute to addressing the V2F challenge by characterising how the effects of genetic variants propagate through the metabolic networks of specific human organs.

Methods

INTERVAL cohort

INTERVAL is a cohort of approximately 50,000 participants nested within a randomised trial studying the safety of varying the frequency of blood donation (https://clinicaltrials.gov/ct2/show/NCT01610635). Participants were blood donors aged 18 years and older (median 44 years of age; 50% women) recruited between 2012 and 2014 from 25 NHS Blood and Transplant centres^35,36. Genetically personalised fluxes were computed for the 37,220 individuals with genotype and blood metabolome data that had passed quality control.

Genotyping of INTERVAL samples, their quality control and imputation were performed as previously described:¹⁰³ Participants were genotyped in ten batches using Affymetrix UK Biobank arrays. Duplicate samples, samples with extreme heterozygosity or sex mismatch, were removed, and participants of non-European descent were excluded. First- or second-degree relatives (identity-by-descent $\hat{\pi } > 0.187$) were also removed, keeping one sample at random from each pair of close relatives. Genotyped variants were removed if they had a call rate <99%, were monomorphic, or had Hardy-Weinberg equilibrium P value < 5 × 10⁻⁶. Variants were subsequently phased using SHAPEIT3, then imputed to the UK10K/1000 Genomes reference panels using the Sanger Imputation Server (https://imputation.sanger.ac.uk).

UK Biobank

UKB is a cohort of approximately 500,000 participants from the general UK population (https://www.ukbiobank.ac.uk/). Participants were between age 40 and 69 at recruitment (median 58 years of age; 54% women) and accepted an invitation to attend one of the assessment centres that were established across the United Kingdom between 2006 and 2010³⁷. Genetically personalised fluxes were computed for the 487,395 individuals in the version 3 release of the UK Biobank genotype data¹⁰⁴ (https://biobank.ndph.ox.ac.uk/showcase/label.cgi?id=263), which has been imputed to the UK10K/1000 genomes and haplotype reference consortium (HRC)¹⁰⁵ panels.

Building organ-specific models

For each analysed organ (i.e., adipose tissue, brain, liver, heart, and skeletal muscle), the set of organ-specific metabolic reactions was extracted from the Harvey/Harvetta models (version 1_03c)³⁰, which contain manually curated metabolic networks for the major organs of the human body. To avoid any gender biases, any reaction present in either the male (Harvey) or female (Harvetta) models was included.

Harvey/Harvetta models were built from the Recon3D human GSMM²⁵, which has been superseded by HUMAN1²⁶. Hence, we performed a liftover to update the Harvey/Harvetta organ-specific models to HUMAN1. Briefly, the IDs of the organ-specific metabolic reactions from the Harvey/Harvetta models were mapped to HUMAN1 (version 1.11.0) using the mapping provided in the HUMAN1 reaction annotations²⁶. Subsequently, the resulting set of HUMAN1 reaction IDs was used to assemble organ-specific models from HUMAN1 reactions. Manual curation was used to identify and, when possible, correct gaps and missmaps. Some reactions in the Harvey/Harvetta models that were not present on the base Recon3D and thus could not be mapped to HUMAN1, were also added to the resulting network. These reactions included phospholipase, cholesterol esterase, and extracellular LCAT. Additionally, the side acyl chains of triglycerides and phospholipids were simplified to a stoichiometric mix of 1/3 oleoyl, 1/6 palmitoleoyl, 1/6 palmitoyl, 1/6 stearoyl, 1/6 myristoyl in line with the ratio used in Harvey/Harvetta for non-essential fatty acids³⁰. Boundaries for the exchange reactions fluxes (i.e., rates of metabolite uptake or secretion) between each organ-specific model and blood or bile were set as the average bounds of the corresponding reactions in the Harvey and Harvetta models. In some instances, the ranges of metabolite uptake and secretion were further constrained to ensure that they were physiologically relevant. In the brain-specific model, exchange reactions to blood were mapped to the exchange reactions between blood and cerebrospinal fluid defined in Harvey/Harvetta. Such reactions had been defined, taking into consideration the selective permeability of the blood-brain barrier³⁰. Thus, only metabolites permeable to this barrier can be exchanged between blood and the brain-specific model. Next, metabolites in blood or bile were made boundary conditions (i.e., assumed constant), allowing each organ subnetwork to function independently. Finally, given that most HUMAN1 reactions lack a name attribute, unnamed reactions in the resulting network were named using their corresponding name in Recon3D. In a number of instances, ambiguously named reactions were manually renamed.

To validate the resulting organ-specific models, we performed flux variability analysis (FVA)¹⁰⁶ to test the capacity of reactions in the networks to carry a significant amount of flux (>10⁻⁶ mol/day), and 93% were shown to be capable of carrying a significant flux. Furthermore, models were also evaluated against a set of essential metabolic tasks (i.e., tasks all organs are expected to perform to be viable) and organ-specific metabolic tasks obtained from the HUMAN1 repository²⁶ (Supplementary Data 5). Each organ-specific model was shown to be capable of successfully performing all essential tasks as well as its organ-specific tasks. The resulting organ-specific GSMMs are available on GitHub and permanently archived by Zenodo¹⁰⁷.

Additionally, a set of Recon3D-based organ-specific models were also built. Such models were obtained by applying the steps described above without performing the liftover to HUMAN1.

Computing organ-specific reference flux maps

The GIM3E algorithm was applied to compute the reference flux map for each organ. The GIM3E algorithm applies a flux minimisation weighted by transcript abundances allowing to find solutions that are enzymatically efficient, consistent with gene expression data and fulfil a set of metabolic objectives³⁸ (Supplementary Fig. 1). First, a set of metabolic objectives was defined for each organ representing major metabolic functions that each organ fulfils in the conditions under study (Supplementary Data 6). These were added in each organ subnetwork as lower bounds for flux values through reactions associated with those metabolic objectives. Lower bounds were set relative to the maximum flux feasible through such reactions identified with FVA¹⁰⁶.

Next, organ-specific transcript abundances were obtained as transcripts per million from the GTEx Portal¹⁴ (GTEx Analysis Release V8; dbGaP Accession phs000424.v8.p2; accessed on 05/05/2021) and the average abundance of each transcript in each organ was computed. In the heart, adipose tissue, and brain, there were transcripts abundances measured from multiple source sites. Hierarchical clustering analysis indicated that source sites from each organ were clustered together (Supplementary Fig. 6). Hence, the average of the source sites in each organ was used for the heart, adipose tissue and brain. Average transcript abundances were mapped to the organ-specific subnetworks using the gene reaction annotations of HUMAN1²⁶. More in detail, transcript abundances of isoenzymes and enzyme subunits catalysing each reaction or transport process were added and, subsequently, log₂ transformed. The resulting values were used as input to apply the flux minimisation weighted by reaction expression³⁸:

$${{{\mathrm{minimise}}}}\mathop{\sum }_{i}{{{{{{\bf{v}}}}}}}_{i}\cdot \left({{\max }}\left(0,\,{P}_{95}-{\bar{{{{{{\bf{x}}}}}}}}_{{{{{{{\bf{GTEx}}}}}}}_{i}}\right)+1\right)$$

(1)

subject to:

$${{{{{\bf{s}}}}}}.{{{{{\bf{v}}}}}}=0$$

$${{{{{\bf{lb}}}}}}\le {{{{{\bf{v}}}}}}\le {{{{{\bf{ub}}}}}}$$

where, ${{{{{\bf{v}}}}}}$ is a vector of steady-state flux values; ${\bar{{{{{{\bf{x}}}}}}}}_{{{{{{\bf{GTEX}}}}}}}$ is a vector of average transcript abundances mapped to reactions of the organ-specific network; ${P}_{95}$ is the 95^th percentile of the average transcript abundance values mapped to reactions of the organ-specific network; s is the stoichiometric matrix. Its product with ${{{{{\bf{v}}}}}}$ defines the metabolic steady state constraint (i.e., input and output fluxes must be balanced for each metabolite in the network); lb and ub are vectors defining the lower and upper bounds of reactions, respectively. The organ-specific metabolic objectives are defined as lower bounds greater than 0 (i.e., constraining such reactions to being active) for the relevant reactions.

Subsequently, FVA was used to identify the feasible flux ranges within 99% of the optimal value of the GIM3E objective function³⁸. Finally, the resulting solution space was sampled using the Artificially Centred hit-and-run (ACHR) algorithm³⁹ implemented into COBRApy^108,109. ACHR was run with a thinning factor of 1000, and 1000 sets of steady-state flux distributions were computed. The average of those flux samples was used as each organ’s reference flux map.

Following the same approach, reference fluxes were also computed for the Recon3D organ-specific models using the gene reaction annotations of Recon3D²⁵.

Imputing individual-specific gene expression data

The elastic net models from PredictDB¹³ were used to impute organ-specific gene expression levels from individual-level genotypes. These are well-established models that have been extensively validated^{13,110,111,112}. The latest release of PredictDB models, which had been trained with GTEx v8 data, were obtained from https://predictdb.org/. They were used with PLINK2¹¹³ to predict relative transcripts abundances using genotype data from the INTERVAL^35,36 and UKB³⁷ cohorts. For adipose, brain and heart tissue, the average of the imputed abundances in each source site was used.

Mapping individual-specific gene expression data to reactions in the model

Imputed individual-specific expression patterns from metabolic genes (i.e., genes coding for enzymes, enzyme subunits, or transmembrane carriers) were mapped to organ-specific models using the gene reaction annotations of HUMAN1²⁶. Imputed values were expressed as log₂ fold changes relative to average gene expression in GTEx and mapped to reactions in the organ-specific model considering the relative transcript abundance of isoenzymes and enzyme subunits in GTEx:

$${{{\bf{FC}}}}_{R,n}=\frac{\mathop{\sum}\limits_{g\in {{{{\bf{g}}}}_{{{\bf{R}}}}}}{{{\bf{GTEx}}}}_{g}\cdot {2}^{{{{{\bf{S}}}}_{g,n}}}}{\mathop{\sum}\limits_{g\in {{{{\bf{g}}}}_{{{\bf{R}}}}}}{{{{\bf{GTEx}}}}_{g}}}$$

(2)

where, ${{{{{{\bf{S}}}}}}}_{g,n}$ is the organ-specific score for gene g in individual n computed using the elastic net models from PredictDB; ${{{{{{\bf{GTEx}}}}}}}_{g}$ is the average organ-specific gene expression of gene g in GTEx; ${{{{{{\bf{g}}}}}}}_{{{{{{\bf{R}}}}}}}$ are the genes associated with reaction R in the organ-specific network; ${{{{{{\bf{FC}}}}}}}_{R,n}$ is the imputed reaction activity fold change for reaction R in individual n.

Reaction activity fold changes were also computed for the Recon3D organ-specific models using the gene reaction annotations of Recon3D²⁵.

The quadratic metabolic transformation algorithm

Building upon the principle of the metabolic transformation algorithm^101,102, we developed qMTA. qMTA seeks to identify the flux map most consistent with a set of reaction activity fold changes starting from a reference flux distribution (Supplementary Fig. 2). To this end, it minimises the difference between the simulated flux values and the target fluxes (i.e., the product of the flux value in the reference flux distribution and the reaction activity fold change). Additionally, it also minimises the deviation from the reference flux distribution in reactions not mapped to any gene expression fold changes. Furthermore, the two terms of the optimisation function are scaled by the reference flux distribution to prevent biases towards reactions with high flux values.

$${{{\mathrm{minimise }}}}\,w{\sum }_{i \in {{{\bf{Ru}}}}}\frac{{\left({{{{\bf{v}}}}}_{i}^{{{{\bf{ref}}}}}-{{{{\bf{v}}}}}_{i,n}^{{{{\bf{qMTA}}}}}\right)}^{2}}{{{\max }}\left(\left|{{{{\bf{v}}}}}_{i}^{{{{\bf{ref}}}}}\right|,\,m\right)}+\mathop{\sum}\limits_{i\in {{{\bf{Re}}}}}\frac{{\left({{{{\bf{v}}}}}_{i}^{{{{\bf{ref}}}}}{{\cdot }}{{{{\bf{FC}}}}}_{i,n}-{{{{\bf{v}}}}}_{i,n}^{{{{\bf{qMTA}}}}}\right)}^{2}}{{{\max }}\left({\left({{{{\bf{v}}}}}_{i}^{{{{\bf{ref}}}}}({{{{\bf{FC}}}}}_{i,n}-1)\right)}^{2},\,m\right)}$$

(3)

Subject to:

$${{{{{\bf{s}}}}}}.\,{{{{{{\bf{v}}}}}}}_{n}^{{{{{{\bf{qMTA}}}}}}}=0$$

$${{{{{\bf{lb}}}}}}\, < \,{{{{{{\bf{v}}}}}}}_{n}^{{{{{{\bf{qMTA}}}}}}}\, < \,{{{{{\bf{ub}}}}}}$$

where, $w$ is the weight given to minimising variation in reactions not mapped to imputed gene expression; ${{{{{\bf{Ru}}}}}}$ are reactions not mapped to imputed gene expression;${{{{{{\bf{v}}}}}}}^{{{{{{\boldsymbol{ref}}}}}}}$ is the flux vector of the reference flux distribution; ${{{{{{\bf{v}}}}}}}_{i,n}^{{{{{{\bf{qMTA}}}}}}}$ is the simulated flux value for reaction i in individual n; m is the minimum value allowed for the scaling factor;${{{{{\bf{Re}}}}}}$ are reactions mapped to imputed gene expression.

Personalised flux maps computed with qMTA were subsequently log₂ transformed and standardised to zero-mean and unit variance.

Additionally, two hyperparameters in qMTA ($w$ and m) were tuned using the regression analysis with blood metabolic features in the INTERVAL cohort. For each simulated organ, a grid search (w → [100,10,1,0.1,0.01], m → [10⁻⁶, 10⁻⁷, 10⁻⁸, 10⁻⁹, 10⁻¹⁰, 10⁻¹¹, 10⁻¹²]) was performed to identify the parameters that resulted in flux maps with the strongest association with both Nightingale Health and Metabolon HD4 metabolic features. This was measured as the summation of the amount of variance explained (R²) for each blood feature-flux value pair when testing associations between metabolic fluxes and blood metabolic features. The resulting parameters were subsequently used in the analysis of the samples from UKB. The process was repeated to identify the best set of hyperparameters for the Recon3D-based models.

Metabolomics

The Nightingale NMR platform quantifies 230 and 249 analytes in INTERVAL and UKB, respectively, including lipoprotein subfractions and ratios, lipids and low molecular weight metabolites (e.g., amino acids)⁵¹. In INTERVAL, blood samples were profiled with the Nightingale platform at the baseline of the blood donation assay (N = 37,720). In UKB, metabolite concentrations were determined in 117,981 participants at baseline assessment and 5141 participants at repeat assessment, among which there were 1427 participants with measurements at both time points. For participants with measurements at both baseline and repeat assessment, the measurement at baseline assessment was used¹¹⁴. Values were adjusted for technical covariates using the ukbnmr R package¹¹⁴ and subsequently regressed for age, sex, BMI, and the first 5 PCs of genetic ancestry. Composite biomarkers and ratios were recomputed after adjustment, including 98 and 76 additional biomarker ratios in INTERVAL and UKB, respectively, not provided by the Nightingale platform. Metabolic features not present in both INTERVAL and UKB were excluded from downstream analyses. Likewise, 68 features with markedly distinct variance between INTERVAL and UKB (|log2(sd_INTERVAL/sd_UKB)| > log2(2.5)) were also excluded. Finally, acetate was excluded due to a large number of NA (>75%) in INTERVAL. Subsequently, measures were standardised to zero-mean and unit variance.

The Metabolon HD4 assay measures ~1000 metabolites (~700 named, ~300 unknown), including lipids, xenobiotics, amino acids and energy-related metabolites. A subset of INTERVAL participants (N=8,115) had their blood profiled with this assay, predominantly using baseline blood samples. Nineteen features were excluded due to a large number of NA (>75%). Values were regressed against technical covariates age, sex, BMI, and the first 5 PCs of genetic ancestry. Subsequently, measures were standardised to zero mean and unit-variance.

Testing associations between metabolic fluxes and blood metabolic features

Due to the linear nature of many metabolic pathways, some flux values were highly intercorrelated (Fig. S3). To remove reaction flux pairs with a strong correlation, for each pair of reaction flux values with ρ > 0.9, the feature with the largest mean absolute correlation to other flux values was removed¹¹⁵. Likewise, both the Nightingale and Metabolon platforms had some metabolic features with strong correlations, and those features with ρ > 0.75 were removed using the same approach used for reaction fluxes. Overall, 4300 scaled flux values and 57 Nightingale Health and 718 Metabolon HD4 blood metabolic features were selected to perform FWAS.

Then, the association of each metabolic feature to each personalised flux value was evaluated using linear regression (Supplementary Fig. 2).

$${{{{{\rm{Met}}}}}}={a}_{{{{{{\rm{Met}}}}}},i} \, {{\cdot }} \, {{{{{{\bf{v}}}}}}}_{i}^{{{{{{\bf{qMTA}}}}}}}+\varepsilon$$

(4)

where, ${{{{{\rm{Met}}}}}}$ are the measured levels of a blood metabolic feature; ${a}_{{{{{{\rm{Met}}}}}},i}$ is the effect size of flux i on ${{{{{\rm{Met}}}}}}$; $\varepsilon$ is the residual.

Statistical significance was evaluated with a t-test (two-tailed) on effect sizes. The resulting P values were adjusted for multiple testing against all evaluated blood metabolic features—reaction flux pairs using the Benjamini and Yosef Hochberg (i.e., FDR) method.

To evaluate the association between metabolic fluxes computed with Recon3D-based models and blood metabolic features, the set of 4300 uncorrelated flux values in HUMAN1 was mapped to equivalent reactions in the Recon3D-based models. This set of flux values was then used to perform FWAS to the same set of 57 Nightingale Health and 718 Metabolon HD4 blood metabolic features as the HUMAN1 analysis.

Classes of blood metabolic features

Nightingale/Metabolon platforms provide sets of Groups/Sub-pathways to stratify metabolic features. We harmonised both annotations systems to define a set of curated groups that could be applied to both Nightingale and Metabolon features. For instance, the Metabolon features annotated to “Glycerolipid Metabolism” and “Phospholipid metabolism”, and the Nightingale features annotated to “Phospholipids” were all assigned to the curated group “Glycerides and phospholipids”. Some Metabolon features were not annotated (i.e., unknown) and could not be assigned to any curated group. Unknown features were included in the FWAS but omitted from the enrichment analysis. Fisher’s exact test (one-sided) was used to identify metabolite classes enriched in features with significant association to personalised flux values relative to the set of all uncorrelated blood metabolic features. An FDR-adjusted significance threshold of P < 0.05 was applied to control for all tested classes of blood metabolic features across all organs.

Reaction systems

Subsystem annotations for reactions were obtained from the HUMAN1 model²⁶. As some subsystems contained a low number of reactions, functionally related subsystems were merged into larger reaction systems. For instance, the purine metabolism, pyrimidine metabolism and nucleotide metabolism subsystems were aggregated into a reaction system termed nucleotide metabolism. Additionally, transport processes (i.e., annotated in the transport or exchange reactions subsystems) were assigned a system based on the specific metabolites being transported in each process. Briefly, we first assigned a system to each metabolite based on the most frequent reaction system annotation in the reactions in which it participates. For instance, alanine was assigned to the system “amino acid metabolism” since it was the system annotated most in reactions in which alanine participated. Next, each transport process/exchange reaction in HUMAN1 was assigned the system most numerous in the metabolites being transported. For the purpose of this assignment, metabolites that are often cofactors in transport processes (e.g., Na⁺, K⁺, H⁺, and ATP/ADP) were set to give less weight than other metabolites. For instance, the alanine-sodium symporter (alanine[e] + Na⁺[e] → alanine[c] + Na⁺[c]) was assigned to the system “amino acid metabolism” as alanine (system: amino acid metabolism) was given more weight than Na⁺ (system: Miscellaneous). Reaction systems are solely used as annotations and have no influence on network stoichiometry or genetically personalised flux values.

Fisher’s exact test (one-sided) was used to identify reaction systems enriched in reactions with significant association with blood metabolic features relative to the set of all evaluated reactions in each organ. An FDR-adjusted significance threshold of P < 0.05 was applied to control for all tested systems across all organs.

Testing associations between metabolic fluxes and coronary artery disease

Using PheWAS Catalogue (version 1.2), we used the WHO International Classification of Diseases (ICD) diagnosis codes in versions 9 (ICD-9) and 10 (ICD-10) of Phecode 411.4 for CAD case definition in UKB. In detail, we searched for the presence of any of the constituent ICD-9/10 codes in linked health records (including in-patient Hospital Episode Statistics data, and primary and secondary cause of death information from the death registry) and converted the earliest coded date to the age of phenotype onset. Individuals without any codes for CAD were recorded as controls and censored according to the maximum follow-up of the health linkage data (January 31, 2020) or the date of death.

We recorded 34,121 events of CAD and 428,669 controls in UKB, which were used to evaluate the association of genetically personalised fluxes to CAD risk. Association was tested using an age-as-time-scale Cox proportional hazards regression. The Cox models were stratified by sex and adjusted by genotyping array, 10 genetic PCs, BMI and smoking status and fitted using the CoxPHFitter function from the lifelines package for python¹¹⁶. The significance of the flux to CAD risk associations was evaluated with a two-tailed Wald test for the flux HRs. The resulting P values were adjusted for multiple testing against all tested fluxes using the Benjamini and Yosef Hochberg (i.e., FDR) method.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The data from the INTERVAL^35,36 and UK Biobank³⁷ cohorts is under restricted access as it contains potentially identifying and sensitive patient information. It can be accessed by making a reasoned request to the INTERVAL coordination centre (https://www.intervalstudy.org.uk) and UKB (https://www.ukbiobank.ac.uk/), respectively. Response times from the data access committees are typically under one month. The summary statistics for the FWAS to blood metabolic features and CAD are provided in Supplementary Data 2, Supplementary Data 3 and Supplementary Data 4. The organ-specific genome-scale metabolic models generated in this work are available on the cobrafunctions GitHub repository, which is permanently archived by Zenodo¹⁰⁷. HUMAN1²⁶ (version 1.11.0) can be obtained from the Human-GEM GitHub repository. The Harvey and Harvetta models (1_03c) are available in the Supporting Information of reference 30. The elastic net PredictDB models (GTEx v8) models¹³ are available at https://predictdb.org. The GTEx¹⁴ gene expression data (GTEx Analysis Release V8; dbGaP Accession phs000424.v8.p2) can be obtained from https://gtexportal.org.

Code availability

The code used to generate personalised organ-specific flux maps from imputed gene expression data is available on GitHub and permanently archived by Zenodo¹⁰⁷. qMTA requires the proprietary solver CPLEX(2.6 or newer), which is freely available to academic users.

References

Tam, V. et al. Benefits and limitations of genome-wide association studies. Nat. Rev. Genet. 20, 467–484 (2019).
Article CAS PubMed Google Scholar
Lambert, S. A. et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nat. Genet. 53, 420–425 (2021).
Article CAS PubMed Google Scholar
Wand, H. et al. Improving reporting standards for polygenic scores in risk prediction studies. Nature 591, 211–219 (2021).
Article CAS PubMed PubMed Central Google Scholar
Polygenic Risk Score Task Force of the International Common Disease Alliance. Responsible use of polygenic risk scores in the clinic: potential benefits, risks and gaps. Nat. Med. https://doi.org/10.1038/s41591-021-01549-6 (2021).
Brittain, H. K., Scott, R. & Thomas, E. The rise of the genome and personalised medicine. Clin. Med. 17, 545–551 (2017).
Article Google Scholar
Dugger, S. A., Platt, A. & Goldstein, D. B. Drug development in the era of precision medicine. Nat. Rev. Drug Discov. 17, 183–196 (2018).
Article CAS PubMed Google Scholar
Lambert, S. A., Abraham, G. & Inouye, M. Towards clinical utility of polygenic risk scores. Hum. Mol. Genet. 28, R133–R142 (2019).
Article CAS PubMed Google Scholar
Nandakumar, S. K., Liao, X. & Sankaran, V. G. In the blood: connecting variant to function in human hematopoiesis. Trends Genet. 36, 563–576 (2020).
Article CAS PubMed PubMed Central Google Scholar
Claussnitzer, M. & Susztak, K. Gaining insight into metabolic diseases from human genetic discoveries. Trends Genet. 37, 1081–1094 (2021).
Article CAS PubMed Google Scholar
Wainberg, M. et al. Opportunities and challenges for transcriptome-wide association studies. Nat. Genet. 51, 592–599 (2019).
Article CAS PubMed PubMed Central Google Scholar
Brandes, N., Linial, N. & Linial, M. PWAS: proteome-wide association study—linking genes and phenotypes by functional variation in proteins. Genome Biol. 21, 173 (2020).
Article CAS PubMed PubMed Central Google Scholar
Xu, Y. et al. An atlas of genetic scores to predict multi-omic traits. Preprint at bioRxiv https://doi.org/10.1101/2022.04.17.488593 (2022).
Gamazon, E. R. et al. A gene-based association method for mapping traits using reference transcriptome data. Nat. Genet. 47, 1091–1098 (2015).
Article CAS PubMed PubMed Central Google Scholar
GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Article Google Scholar
Wingo, T. S. et al. Brain proteome-wide association study implicates novel proteins in depression pathogenesis. Nat. Neurosci. 24, 810–817 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ritchie, S. C. et al. Integrative analysis of the plasma proteome and polygenic risk of cardiometabolic diseases. Nat. Metab. https://doi.org/10.1038/s42255-021-00478-5 (2021).
Zhu, X., Duren, Z. & Wong, W. H. Modeling regulatory network topology improves genome-wide analyses of complex human traits. Nat. Commun. 12, 1–15 (2021).
CAS Google Scholar
Carlin, D. E. et al. A fast and flexible framework for network-assisted genomic association. iScience 16, 155–161 (2019).
Article CAS PubMed PubMed Central Google Scholar
Talukdar, H. A. et al. Cross-tissue regulatory gene networks in coronary artery disease. Cell Syst. 2, 196–208 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ghosh, S. et al. Systems genetics analysis of genome-wide association study reveals novel associations between key biological processes and coronary artery disease. Arterioscler. Thromb. Vasc. Biol. 35, 1712–1722 (2015).
Article CAS PubMed PubMed Central Google Scholar
Frayn, K. N. Metabolic Regulation: A Human Perspective. (2010).
Stephanopoulos, G. Metabolic fluxes and metabolic engineering. Metab. Eng. 1, 1–11 (1999).
Article CAS PubMed Google Scholar
Nielsen, J. It is all about metabolic fluxes. J. Bacteriol. 185, 7031–7035 (2003).
Article CAS PubMed PubMed Central Google Scholar
Zamboni, N., Saghatelian, A. & Patti, G. J. Defining the metabolome: size, flux, and regulation. Mol. Cell 58, 699–706 (2015).
Article CAS PubMed PubMed Central Google Scholar
Brunk, E. et al. Recon3D enables a three-dimensional view of gene variation in human metabolism. Nat. Biotechnol. 36, 272–281 (2018).
Article CAS PubMed PubMed Central Google Scholar
Robinson, J. L. et al. An atlas of human metabolism. Sci. Signal. 13, 1–12 (2020).
Article Google Scholar
Orth, J. D., Thiele, I. & Palsson, B. Ø. What is flux balance analysis? Nat. Biotechnol. 28, 245–248 (2010).
Article CAS PubMed PubMed Central Google Scholar
de Mas, I. M. et al. Cancer cell metabolism as new targets for novel designed therapies. Future Med. Chem. 6, 1791–1810 (2014).
Article Google Scholar
Jamialahmadi, O., Hashemi-Najafabadi, S., Motamedian, E., Romeo, S. & Bagheri, F. A benchmark-driven approach to reconstruct metabolic networks for studying cancer metabolism. PLoS Comput. Biol. 15, e1006936 (2019).
Article PubMed PubMed Central Google Scholar
Thiele, I. et al. Personalized whole‐body models integrate metabolism, physiology, and the gut microbiome. Mol. Syst. Biol. https://doi.org/10.15252/msb.20198982 (2020).
Agren, R. et al. Reconstruction of genome-scale active metabolic networks for 69 human cell types and 16 cancer types using INIT. PLoS Comput. Biol. 8, e1002518 (2012).
Article CAS PubMed PubMed Central Google Scholar
Folger, O. et al. Predicting selective drug targets in cancer through metabolic networks. Mol. Syst. Biol. 7, 501–501 (2014).
Article Google Scholar
Lewis, J. E., Forshaw, T. E., Boothman, D. A., Furdui, C. M. & Kemp, M. L. Personalized genome-scale metabolic models identify targets of redox metabolism in radiation-resistant tumors. Cell Syst. 12, 68–81.e11 (2021).
Article CAS PubMed PubMed Central Google Scholar
Heinken, A., Basile, A., Hertel, J., Thinnes, C. & Thiele, I. Genome-scale metabolic modeling of the human microbiome in the era of personalized medicine. Annu. Rev. Microbiol. 75, 199–222 (2021).
Article PubMed Google Scholar
Moore, C. et al. The INTERVAL trial to determine whether intervals between blood donations can be safely and acceptably decreased to optimise blood supply: study protocol for a randomised controlled trial. Trials 15, 363 (2014).
Article PubMed PubMed Central Google Scholar
Di Angelantonio, E. et al. Efficiency and safety of varying the frequency of whole blood donation (INTERVAL): a randomised trial of 45 000 donors. Lancet 390, 2360–2371 (2017).
Article PubMed PubMed Central Google Scholar
Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, 1–10 (2015).
Article Google Scholar
Schmidt, B. J. et al. GIM3E: condition-specific models of cellular metabolism developed from metabolomics and expression data. Bioinformatics 29, 2900–2908 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kaufman, D. E. & Smith, R. L. Direction choice for accelerated convergence in hit-and-run sampling. Oper. Res. 46, 84–95 (1998).
Article MathSciNet MATH Google Scholar
Gallagher, D., Chung, S. & Akram, M. Body Composition. in Encyclopedia of Human Nutrition 191–199 (Elsevier, 2013). https://doi.org/10.1016/B978-0-12-375083-9.00027-1.
Grynberg, A. & Demaison, L. Fatty acid oxidation in the heart. J. Cardiovasc. Pharmacol. 28, S11–S17 (1996).
CAS PubMed Google Scholar
Drake, K. J., Sidorov, V. Y., McGuinness, O. P., Wasserman, D. H. & Wikswo, J. P. Amino acids as metabolic substrates during cardiac ischemia. Exp. Biol. Med. 237, 1369–1378 (2012).
Article CAS Google Scholar
Dickinson, J. M. & Rasmussen, B. B. Amino acid transporters in the regulation of human skeletal muscle protein metabolism. Curr. Opin. Clin. Nutr. Metab. Care 16, 638–644 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lundsgaard, A.-M., Fritzen, A. M. & Kiens, B. Molecular regulation of fatty acid oxidation in skeletal muscle during aerobic exercise. Trends Endocrinol. Metab. 29, 18–30 (2018).
Article CAS PubMed Google Scholar
Monteiro-Cardoso, V. F., Corlianò, M. & Singaraja, R. R. Bile acids: a communication channel in the gut-brain axis. Neuromol. Med. 23, 99–117 (2021).
Article CAS Google Scholar
McMillin, M. & DeMorrow, S. Effects of bile acids on neurological function and disease. FASEB J. 30, 3658–3668 (2016).
Article CAS PubMed PubMed Central Google Scholar
Rapoport, S. I. Arachidonic acid and the brain. J. Nutr. 138, 2515–2520 (2008).
Article CAS PubMed PubMed Central Google Scholar
Bosetti, F. Arachidonic acid metabolism in brain physiology and pathology: lessons from genetically altered mouse models. J. Neurochem. 102, 577–586 (2007).
Article CAS PubMed PubMed Central Google Scholar
Li, J., Papadopoulos, V. & Vihma, V. Steroid biosynthesis in adipose tissue. Steroids 103, 89–104 (2015).
Article CAS PubMed Google Scholar
Lotta, L. A. et al. A cross-platform approach identifies genetic regulators of human metabolism and health. Nat. Genet. 53, 54–64 (2021).
Article CAS PubMed PubMed Central Google Scholar
Julkunen, H., Cichońska, A., Slagboom, P. E. & Würtz, P. Metabolic biomarker profiling for identification of susceptibility to severe pneumonia and COVID-19 in the general population. Elife 10, 1–20 (2021).
Article Google Scholar
Rye, K.-A., Bursill, C. A., Lambert, G., Tabet, F. & Barter, P. J. The metabolism and anti-atherogenic properties of HDL. J. Lipid Res. 50, S195–S200 (2009).
Article PubMed PubMed Central Google Scholar
Santamarina-Fojo, S., González-Navarro, H., Freeman, L., Wagner, E. & Nong, Z. Hepatic lipase, lipoprotein metabolism, and atherogenesis. Arterioscler. Thromb. Vasc. Biol. 24, 1750–1754 (2004).
Article CAS PubMed Google Scholar
Rousset, X., Vaisman, B., Amar, M., Sethi, A. A. & Remaley, A. T. Lecithin: cholesterol acyltransferase—from biochemistry to role in cardiovascular disease. Curr. Opin. Endocrinol. Diabetes Obes. 16, 163–171 (2009).
Article CAS PubMed PubMed Central Google Scholar
Connelly, P. W. & Hegele, R. A. Hepatic lipase deficiency. Crit. Rev. Clin. Lab. Sci. 35, 547–572 (1998).
Article CAS PubMed Google Scholar
Hodoğlugil, U., Williamson, D. W. & Mahley, R. W. Polymorphisms in the hepatic lipase gene affect plasma HDL-cholesterol levels in a Turkish population. J. Lipid Res. 51, 422–430 (2010).
Article PubMed PubMed Central Google Scholar
McCaskie, P. et al. The C-480T hepatic lipase polymorphism is associated with HDL-C but not with risk of coronary heart disease. Clin. Genet. 70, 114–121 (2006).
Article CAS PubMed Google Scholar
Machado, M. V. et al. Mouse models of diet-induced nonalcoholic steatohepatitis reproduce the heterogeneity of the human disease. PLoS ONE 10, e0127991 (2015).
Article PubMed PubMed Central Google Scholar
Testerink, N., van der Sanden, M. H. M., Houweling, M., Helms, J. B. & Vaandrager, A. B. Depletion of phosphatidylcholine affects endoplasmic reticulum morphology and protein traffic at the Golgi complex. J. Lipid Res. 50, 2182–2192 (2009).
Article CAS PubMed PubMed Central Google Scholar
Sakai, N. et al. Targeted disruption of the mouse lecithin:cholesterol acyltransferase (LCAT) gene. J. Biol. Chem. 272, 7506–7510 (1997).
Article CAS PubMed Google Scholar
Yamashita, S. & Matsuzawa, Y. Low HDL and high HDL syndromes. in Encyclopedia of Endocrine Diseases. vol. 1 327–339 (Elsevier, 2018).
Gerl, M. J. et al. Cholesterol is inefficiently converted to cholesteryl esters in the blood of cardiovascular disease patients. Sci. Rep. 8, 14764 (2018).
Article PubMed PubMed Central Google Scholar
Ross, A. C. Retinol: properties and determination. in Encyclopedia of Food and Health vol. 1821 604–609 (Elsevier, 2016).
Shirakami, Y., Lee, S. A., Clugston, R. D. & Blaner, W. S. Hepatic metabolism of retinoids and disease associations. Biochim. Biophys. Acta 1821, 124–136 (2012).
Article CAS PubMed Google Scholar
Murray, J. C., Gilgor, R. S. & Lazarus, G. S. Serum triglyceride elevation following high-dose vitamin A treatment for pityriasis rubra pilaris. Arch. Dermatol. 119, 675–676 (1983).
Article CAS PubMed Google Scholar
Vahlquist, C., Michaëlsson, G., Vahlquist, A. & Vessby, B. A sequential comparison of etretinate (Tigason) and isotretinoin (Roaccutane) with special regard to their effects on serum lipoproteins. Br. J. Dermatol. 112, 69–76 (1985).
Article CAS PubMed Google Scholar
Bershad, S. et al. Changes in plasma lipids and lipoproteins during isotretinoin therapy for acne. N. Engl. J. Med. 313, 981–985 (1985).
Article CAS PubMed Google Scholar
Redlich, C. A. et al. Effect of long-term beta-carotene and vitamin A on serum cholesterol and triglyceride levels among participants in the Carotene and Retinol Efficacy Trial (CARET). Atherosclerosis 145, 425–432 (1999).
Article PubMed Google Scholar
Duong, P. T., Weibel, G. L., Lund-Katz, S., Rothblat, G. H. & Phillips, M. C. Characterization and properties of pre beta-HDL particles formed by ABCA1-mediated cellular lipid efflux to apoA-I. J. Lipid Res. 49, 1006–1014 (2008).
Article CAS PubMed PubMed Central Google Scholar
Huang, H., Li, Y., Liang, J. & Finkelman, F. D. Molecular regulation of histamine synthesis. Front. Immunol. 9, 1–7 (2018).
Article Google Scholar
Lopez-Perez, D. et al. In patients with obesity, the number of adipose tissue mast cells is significantly lower in subjects with type 2 diabetes. Front. Immunol. 12, 1–13 (2021).
Article Google Scholar
Jarido, V. et al. The emerging role of mast cells in liver disease. Am. J. Physiol. 313, G89–G101 (2017).
Google Scholar
Clejan, S. et al. Blood histamine is associated with coronary artery disease, cardiac events and severity of inflammation and atherosclerosis. J. Cell. Mol. Med. 6, 583–592 (2002).
Article CAS PubMed PubMed Central Google Scholar
Inouye, M. et al. An immune response network associated with blood lipid levels. PLoS Genet. 6, e1001113 (2010).
Inouye, M. et al. Metabonomic, transcriptomic, and genomic variation of a population cohort. Mol. Syst. Biol. 6, 1–10 (2010).
Wang, K. Y. et al. Histamine deficiency decreases atherosclerosis and inflammatory response in apolipoprotein e knockout mice independently of serum cholesterol level. Arterioscler. Thromb. Vasc. Biol. 31, 800–807 (2011).
Article CAS PubMed Google Scholar
Eisenberg, T. et al. Cardioprotection and lifespan extension by the natural polyamine spermidine. Nat. Med. 22, 1428–1438 (2016).
Article CAS PubMed PubMed Central Google Scholar
Soda, K., Kano, Y. & Chiba, F. Food polyamine and cardiovascular disease—an epidemiological study. Glob. J. Health Sci. 4, 170–178 (2012).
Article PubMed PubMed Central Google Scholar
Cercato, C. & Fonseca, F. A. Cardiovascular risk and obesity. Diabetol. Metab. Syndr. 11, 74 (2019).
Article CAS PubMed PubMed Central Google Scholar
Monelli, E. et al. Angiocrine polyamine production regulates adiposity. Nat. Metab. 4, 327–343 (2022).
Article CAS PubMed Google Scholar
Mead, J. R. & Ramji, D. P. The pivotal role of lipoprotein lipase in atherosclerosis. Cardiovasc. Res. 55, 261–269 (2002).
Article CAS PubMed Google Scholar
Sato, H. et al. Analyses of group III secreted phospholipase A2 transgenic mice reveal potential participation of this enzyme in plasma lipoprotein modification, macrophage foam cell formation, and atherosclerosis. J. Biol. Chem. 283, 33483–33497 (2008).
Article CAS PubMed PubMed Central Google Scholar
Rosenson, R. S. & Hurt-Camejo, E. Phospholipase A2 enzymes and the risk of atherosclerosis. Eur. Heart J. 33, 2899–2909 (2012).
Article CAS PubMed Google Scholar
Giordanetto, F. et al. Discovery of AZD2716: a novel secreted phospholipase A2 (sPLA2) inhibitor for the treatment of coronary artery disease. ACS Med. Chem. Lett. 7, 884–889 (2016).
Article CAS PubMed PubMed Central Google Scholar
Akinkuolie, A. O. et al. Group IIA secretory phospholipase A2, vascular inflammation, and incident cardiovascular disease: an analysis from the JUPITER trial. Arterioscler. Thromb. Vasc. Biol. 39, 1182–1190 (2019).
Article CAS PubMed PubMed Central Google Scholar
Oshiro, C., Mangravite, L., Klein, T. & Altman, R. PharmGKB very important pharmacogene: SLCO1B1. Pharmacogenet. Genomics 20, 211–216 (2010).
Article CAS PubMed PubMed Central Google Scholar
Adhyaru, B. B. & Jacobson, T. A. Safety and efficacy of statin therapy. Nat. Rev. Cardiol. 15, 757–769 (2018).
Article CAS PubMed Google Scholar
Lin, J.-P. et al. Association between the UGT1A1*28 allele, bilirubin levels, and coronary heart disease in the Framingham Heart Study. Circulation 114, 1476–1481 (2006).
Article CAS PubMed Google Scholar
Suh, S. et al. Relationship between serum bilirubin levels and cardiovascular disease. PLoS ONE 13, e0193041 (2018).
Article PubMed PubMed Central Google Scholar
McArdle, P. F. et al. Association between bilirubin and cardiovascular disease risk factors: using Mendelian randomization to assess causal inference. BMC Cardiovasc. Disord. 12, 16 (2012).
Article CAS PubMed PubMed Central Google Scholar
Babaev, V. R. et al. Macrophage EP4 deficiency increases apoptosis and suppresses early atherosclerosis. Cell Metab. 8, 492–501 (2008).
Article CAS PubMed PubMed Central Google Scholar
Gomez, I., Foudi, N., Longrois, D. & Norel, X. The role of prostaglandin E2 in human vascular inflammation. Prostaglandins Leukot. Essent. Fat. Acids 89, 55–63 (2013).
Article CAS Google Scholar
Bauch, H. J., Grünwald, J., Vischer, P., Gerlach, U. & Hauss, W. H. A possible role of catecholamines in atherogenesis and subsequent complications of atherosclerosis. Exp. Pathol. 31, 193–204 (1987).
Article CAS PubMed Google Scholar
Foulon, P. & De Backer, D. The hemodynamic effects of norepinephrine: far more than an increase in blood pressure! Ann. Transl. Med. 6, S25–S25 (2018).
Article CAS PubMed PubMed Central Google Scholar
Pinaire, J., Azé, J., Bringay, S., Cayla, G. & Landais, P. Hospital burden of coronary artery disease: Trends of myocardial infarction and/or percutaneous coronary interventions in France 2009–2014. PLoS ONE 14, 1–21 (2019).
Article Google Scholar
Harchaoui, K. E. L., Visser, M. E., Kastelein, J. J. P., Stroes, E. S. & Dallinga-Thie, G. M. Triglycerides and cardiovascular risk. Curr. Cardiol. Rev. 5, 216–222 (2009).
Article CAS PubMed PubMed Central Google Scholar
Jamshidi, N. & Palsson, B. Systems biology of SNPs. Mol. Syst. Biol. 2, 1–4 (2006).
Article Google Scholar
Sánchez, B. J. et al. Improving the phenotype predictions of a yeast genome-scale metabolic model by incorporating enzymatic constraints. Mol. Syst. Biol. 13, 935 (2017).
Article PubMed PubMed Central Google Scholar
Agren, R. et al. Identification of anticancer drugs for hepatocellular carcinoma through personalized genome-scale metabolic modeling. Mol. Syst. Biol. 10, 721–721 (2014).
Article PubMed PubMed Central Google Scholar
Raškevičius, V. et al. Genome scale metabolic models as tools for drug design and personalized medicine. PLoS ONE 13, 1–14 (2018).
Article Google Scholar
Yizhak, K., Gabay, O., Cohen, H. & Ruppin, E. Model-based identification of drug targets that revert disrupted metabolism and its application to ageing. Nat. Commun. 4, 2632 (2013).
Article PubMed Google Scholar
Valcárcel, L. V, Torrano, V., Tobalina, L., Carracedo, A. & Planes, F. J. rMTA: Robust metabolic transformation analysis. Bioinformatics https://doi.org/10.1093/bioinformatics/btz231 (2019).
Astle, W. J. et al. The allelic landscape of human blood cell trait variation and links to common complex disease. Cell 167, 1415–1429.e19 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Article CAS PubMed PubMed Central Google Scholar
Loh, P.-R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 48, 1443–1448 (2016).
Article CAS PubMed PubMed Central Google Scholar
Gudmundsson, S. & Thiele, I. Computationally efficient flux variability analysis. BMC Bioinforma. 11, 489 (2010).
Article Google Scholar
Foguet, C. Cobrafunctions V1.0. Zenodo https://zenodo.org/record/7277058 (2022).
Ebrahim, A., Lerman, J. A., Palsson, B. O. & Hyduke, D. R. COBRApy: COnstraints-Based Reconstruction and Analysis for Python. BMC Syst. Biol. 7, 74 (2013).
Article PubMed PubMed Central Google Scholar
Heirendt, L. et al. Creation and analysis of biochemical constraint-based models using the COBRA Toolbox v.3.0. Nat. Protoc. 14, 639–702 (2019).
Article CAS PubMed PubMed Central Google Scholar
Li, B. et al. Evaluation of PrediXcan for prioritizing GWAS associations and predicting gene expression. Pac. Symp. Biocomput. 23, 448–459 (2018).
PubMed PubMed Central Google Scholar
Tavares, V., Monteiro, J., Vassos, E., Coleman, J. & Prata, D. Evaluation of genotype-based gene expression model performance: a cross-framework and cross-dataset study. Genes (Basel). 12, 1–12 (2021).
Hale, A. T. et al. Multi-omic analysis elucidates the genetic basis of hydrocephalus. Cell Rep. 35, 109085 (2021).
Article CAS PubMed PubMed Central Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Article PubMed PubMed Central Google Scholar
Ritchie, S. C. et al. Quality control and removal of technical variation of NMR metabolic biomarker data in ~120,000 UK Biobank participants. Preprint at medRxiv https://doi.org/10.1101/2021.09.24.21264079 (2021).
Kuhn, M. Building Predictive Models in R Using the caret Package. J. Stat. Softw. 28, 1–26 (2008).
Davidson-Pilon, C. lifelines: survival analysis in Python. J. Open Source Softw. 4, 1317 (2019).
Article Google Scholar

Download references

Acknowledgements

Participants in the INTERVAL trial were recruited with the active collaboration of NHS Blood and Transplant (www.nhsbt.nhs.uk), which has supported fieldwork and other elements of the trial. DNA extraction and genotyping were co-funded by the National Institute for Health and Care Research (NIHR), the NIHR BioResource (http://bioresource.nihr.ac.uk) and the NIHR Cambridge Biomedical Research Centre (BRC) (no. BRC-1215-20014)*. Nightingale Health NMR assays were funded by the European Commission Framework Programme 7 (HEALTH-F2-2012-279233). Metabolon Metabolomics assays were funded by the NIHR BioResource and the NIHR Cambridge Biomedical Research Centre (BRC-1215-20014)*. The academic coordinating centre for INTERVAL was supported by core funding from the NIHR Blood and Transplant Research Unit in Donor Health and Genomics (no. NIHR BTRU-2014-10024), NIHR BTRU in Donor Health and Behaviour (NIHR203337), UK Medical Research Council (MRC) (no. MR/L003120/1), British Heart Foundation (nos SP/09/002, RG/13/13/30194 and RG/18/13/33946) and the NIHR Cambridge BRC (no. BRC-1215-20014)*. *The views expressed are those of the author(s) and not necessarily those of the NIHR, NHSBT or the Department of Health and Social Care. A complete list of the investigators and contributors to the INTERVAL trial is provided in ref. 36. The academic coordinating centre would like to thank blood donor centre staff and blood donors for participating in the INTERVAL trial. This work was supported by Health Data Research UK, which is funded by the UK MRC, Engineering and Physical Sciences Research Council (EPSRC), Economic and Social Research Council, Department of Health and Social Care (England), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), British Heart Foundation and Wellcome. This research has been conducted using the UK Biobank Resource under Application 7439. This work was performed using resources provided by the Cambridge Service for Data-Driven Discovery (CSD3) operated by the University of Cambridge Research Computing Service (www.csd3.cam.ac.uk), provided by Dell EMC and Intel using Tier-2 funding from the Engineering and Physical Sciences Research Council (capital grant EP/P020259/1), and DiRAC funding from the Science and Technology Facilities Council (www.dirac.ac.uk). C.F. is funded Health Data Research UK. S.R. is funded by the NIHR Cambridge Biomedical Research Centre (BRC-1215-20014). S.L. is supported by a Canadian Institutes of Health Research postdoctoral fellowship (MFE-171279). E.P. was funded by the EU/EFPIA Innovative Medicines Initiative Joint Undertaking BigData@Heart grant 116074 and the NIHR BTRU in Donor Health and Genomics (NIHR BTRU-2014-10024) and is funded by the NIHR BTRU in Donor Health and Behaviour (NIHR203337). E.E.D. is supported by the Wellcome Trust grant (206194, 108413/A/15/D). J.D. holds a British Heart Foundation Professorship and an NIHR Senior Investigator Award. M.I. is supported by the Munz Chair of Cardiovascular Prediction and Prevention and the NIHR Cambridge Biomedical Research Centre (BRC-1215-20014). M.I. was also supported by the UK Economic and Social Research 878 Council (ES/T013192/1).

Author information

Authors and Affiliations

Cambridge Baker Systems Genomics Initiative, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
Carles Foguet, Yu Xu, Scott C. Ritchie, Samuel A. Lambert, Elodie Persyn, Artika P. Nath & Michael Inouye
Health Data Research UK Cambridge, Wellcome Genome Campus and University of Cambridge, Cambridge, UK
Carles Foguet, Samuel A. Lambert, Emanuele Di Angelantonio, John Danesh, Adam S. Butterworth & Michael Inouye
British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
Carles Foguet, Yu Xu, Scott C. Ritchie, Samuel A. Lambert, Elodie Persyn, Dirk S. Paul, Emanuele Di Angelantonio, John Danesh, Adam S. Butterworth & Michael Inouye
Heart and Lung Research Institute, University of Cambridge, Cambridge, UK
Carles Foguet, Yu Xu, Scott C. Ritchie, Samuel A. Lambert, Elodie Persyn, Dirk S. Paul, Emanuele Di Angelantonio, John Danesh, Adam S. Butterworth & Michael Inouye
British Heart Foundation Centre of Research Excellence, University of Cambridge, Cambridge, UK
Scott C. Ritchie, Dirk S. Paul, Emanuele Di Angelantonio, John Danesh, Adam S. Butterworth & Michael Inouye
Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
Scott C. Ritchie, Artika P. Nath & Michael Inouye
Wellcome Sanger Institute, Hinxton, UK
Emma E. Davenport & John Danesh
BRC Haematology Theme, Radcliffe Department of Medicine, and NHSBT-Oxford, John Radcliffe Hospital, Oxford, UK
David J. Roberts
National Institute for Health and Care Research Blood and Transplant Research Unit in Donor Health and Behaviour, University of Cambridge, Cambridge, UK
David J. Roberts, Emanuele Di Angelantonio, John Danesh & Adam S. Butterworth
NHS Blood and Transplant, John Radcliffe Hospital, Oxford, UK
David J. Roberts
Health Data Science Centre, Human Technopole, Milan, Italy
Emanuele Di Angelantonio
Nuffield Department of Women’s and Reproductive Health, University of Oxford, Oxford, OX3 9DU, UK
Christopher Yau
Health Data Research UK, Gibbs Building, 215 Euston Road, London, NW1 2BE, UK
Christopher Yau
The Alan Turing Institute, London, UK
Michael Inouye

Authors

Carles Foguet
View author publications
You can also search for this author in PubMed Google Scholar
Yu Xu
View author publications
You can also search for this author in PubMed Google Scholar
Scott C. Ritchie
View author publications
You can also search for this author in PubMed Google Scholar
Samuel A. Lambert
View author publications
You can also search for this author in PubMed Google Scholar
Elodie Persyn
View author publications
You can also search for this author in PubMed Google Scholar
Artika P. Nath
View author publications
You can also search for this author in PubMed Google Scholar
Emma E. Davenport
View author publications
You can also search for this author in PubMed Google Scholar
David J. Roberts
View author publications
You can also search for this author in PubMed Google Scholar
Dirk S. Paul
View author publications
You can also search for this author in PubMed Google Scholar
Emanuele Di Angelantonio
View author publications
You can also search for this author in PubMed Google Scholar
John Danesh
View author publications
You can also search for this author in PubMed Google Scholar
Adam S. Butterworth
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Yau
View author publications
You can also search for this author in PubMed Google Scholar
Michael Inouye
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualisation: C.F., C.Y., and M.I.; Formal Analysis: C.F., Y.X., S.C.R., and M.I.; Investigation: S.C.R., S.A.L., E.P., A.P.N., E.E.D., D.J.R., D.S.P., E.D.A., J.D., A.S.B., and M.I.; Writing—original draft: C.F., C.Y., and M.I.; Supervision: C.Y. and M.I.; All authors reviewed and approved the final paper.

Corresponding authors

Correspondence to Carles Foguet or Michael Inouye.

Ethics declarations

Competing interests

A.S.B. has received grants (outside of this work) from AstraZeneca, Bayer, Biogen, BioMarin, Bioverativ, Merck, Novartis, Regeneron, and Sanofi. J.D. serves on scientific advisory boards for AstraZeneca, Novartis, and UK Biobank, and has received multiple grants from academic, charitable and industry sources outside of the submitted work. During the preparation of the paper, D.S.P. became a full-time employee of AstraZeneca. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Priyanka Baloni and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Foguet, C., Xu, Y., Ritchie, S.C. et al. Genetically personalised organ-specific metabolic models in health and disease. Nat Commun 13, 7356 (2022). https://doi.org/10.1038/s41467-022-35017-7

Download citation

Received: 05 May 2022
Accepted: 15 November 2022
Published: 29 November 2022
DOI: https://doi.org/10.1038/s41467-022-35017-7

This article is cited by

EnzChemRED, a rich enzyme chemistry relation extraction dataset
- Po-Ting Lai
- Elisabeth Coudert
- Alan Bridge
Scientific Data (2024)
Genome-scale models in human metabologenomics
- Adil Mardinoglu
- Bernhard Ø. Palsson
Nature Reviews Genetics (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.