Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Review Article
  • Published:

Genetic variation in metabolic phenotypes: study designs and applications

Key Points

  • Technical advances in mass spectrometry and NMR spectroscopy enable genome-wide screens to be carried out for the association between genetic variants and hundreds of metabolic traits in a single experiment.

  • Metabolite concentrations are direct readouts of biological processes and can play the part of intermediate phenotypes, providing functional links between genetic variance and disease end points in genome-wide association studies (GWASs).

  • The first GWASs with metabolomics have already discovered many genetic variants in enzyme-, transporter- and other metabolism-related genes that induce major differences in the individual metabolic capabilities of the organism.

  • Knowledge of the genetic basis of human metabolic individuality holds the key to understanding the interactions of genetic, environmental and lifestyle factors in the aetiology of complex disorders.

  • We review emerging insights from recent GWASs with metabolomics and present design considerations for high-throughput metabolomics experiments with metabolic traits in epidemiological and clinical studies.

  • Using ratios between metabolite concentrations can drastically increase the power of a metabolomics study and can provide functional information on the perturbed underlying biochemical pathways.

  • Integration with other biochemical information, including data from other GWASs, can largely improve the value of the study.

  • Current challenges and future directions include the addition of new sample types (other than urine and blood), extension of the metabolite panels, standardization between platforms and the development of adapted statistical and data analysis tools.

Abstract

Many complex disorders are linked to metabolic phenotypes. Revealing genetic influences on metabolic phenotypes is key to a systems-wide understanding of their interactions with environmental and lifestyle factors in their aetiology, and we can now explore the genetics of large panels of metabolic traits by coupling genome-wide association studies and metabolomics. These genome-wide association studies are beginning to unravel the genetic contribution to human metabolic individuality and to demonstrate its relevance for biomedical and pharmaceutical research. Adopting the most appropriate study designs and analytical tools is paramount to further refining the genotype–phenotype map and eventually identifying the part played by genetic influences on metabolic phenotypes. We discuss such design considerations and applications in this Review.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1: The metabolic trait as an intermediate phenotype.
Figure 2: Workflow and considerations.

Similar content being viewed by others

References

  1. Garrod, A. E. The incidence of alkaptonuria a study in chemical individuality. Lancet 2, 1616–1620 (1902).

    Article  CAS  Google Scholar 

  2. Mootha, V. K. & Hirschhorn, J. N. Inborn variation in metabolism. Nature Genet. 42, 97–98 (2010). This comment provides an independant view on the potential of genetic studies with metabolomics.

    Article  CAS  Google Scholar 

  3. Garrod, A. E. Inborn Factors in Disease (Oxford Univ. Press, 1931). Archibald Garrod noted more than 80 years ago that “diathesis is nothing else but chemical individuality”.

  4. Psychogios, N. et al. The human serum metabolome. PLoS ONE 6, e16957 (2011).

    Article  CAS  Google Scholar 

  5. Link, E. et al. SLCO1B1 variants and statin-induced myopathy—a genomewide study. N. Engl. J. Med. 359, 789–799 (2008).

    Article  CAS  Google Scholar 

  6. Dumont, J. et al. FADS1 genetic variability interacts with dietary α-linolenic acid intake to affect serum non-HDL-cholesterol concentrations in European adolescents. J. Nutr. 141, 1247–1253 (2011).

    Article  CAS  Google Scholar 

  7. Lu, Y. et al. Dietary n-3 and n-6 polyunsaturated fatty acid intake interacts with FADS1 genetic variation to affect total and HDL-cholesterol concentrations in the Doetinchem Cohort Study. Am. J. Clin. Nutr. 92, 258–265 (2010).

    Article  CAS  Google Scholar 

  8. Krug, S. et al. The dynamic range of the human metabolome revealed by challenges. FASEB J. 26, 2607–2619 (2012). This paper reports a series of controlled physiological challenges that may be used in future GWASs with metabolic traits.

    Article  CAS  Google Scholar 

  9. Nicholson, G. et al. Human metabolic profiles are stably controlled by genetic and environmental variation. Mol. Syst. Biol. 7, 525 (2011). This paper addresses essential questions about the heritability of metabolic traits.

    Article  Google Scholar 

  10. Kettunen, J. et al. Genome-wide association study identifies multiple loci influencing human serum metabolite levels. Nature Genet. 44, 269–276 (2012). This paper reports a large GWAS with NMR-derived metabolic traits.

    Article  CAS  Google Scholar 

  11. Suhre, K. et al. Metabolic footprint of diabetes: a multiplatform metabolomics study in an epidemiological setting. PLoS ONE 5, e13953 (2010). This paper reports a pilot study on three different metabolomics platforms and provides practical insights into the possibilities and pitfalls of high-throughput metabolomics experiments.

    Article  Google Scholar 

  12. Mittelstrass, K. et al. Discovery of sexual dimorphisms in metabolic and genetic biomarkers. PLoS Genet. 7, e1002215 (2011).

    Article  CAS  Google Scholar 

  13. Nicholson, G. et al. A genome-wide metabolic QTL analysis in Europeans implicates two loci shaped by recent positive selection. PLoS Genet. 7, e1002270 (2011).

    Article  CAS  Google Scholar 

  14. Suhre, K. et al. A genome-wide association study of metabolic traits in human urine. Nature Genet. 43, 565–569 (2011).

    Article  CAS  Google Scholar 

  15. Suhre, K. et al. Human metabolic individuality in biomedical and pharmaceutical research. Nature 477, 54–60 (2011). This paper reports 37 loci of human metabolic individuality and provides examples for a wide range of biomedical and pharmaceutical applications.

    Article  CAS  Google Scholar 

  16. Illig, T. et al. A genome-wide perspective of genetic variation in human metabolism. Nature Genet. 42, 137–141 (2010).

    Article  CAS  Google Scholar 

  17. Kastenmuller, G. Romisch-Margl, W., Wagele, B., Altmaier, E. & Suhre, K. metaP-server: a web-based metabolomics data analysis tool. J. Biomed. Biotechnol. 2011, 839862 (2011).

    Article  Google Scholar 

  18. Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).

    Article  CAS  Google Scholar 

  19. Abecasis, G. R., Cherny, S. S., Cookson, W. O. & Cardon, L. R. Merlin—rapid analysis of dense genetic maps using sparse gene flow trees. Nature Genet. 30, 97–101 (2002).

    Article  CAS  Google Scholar 

  20. Marchini, J. & Howie, B. Genotype imputation for genome-wide association studies. Nature Rev. Genet. 11, 499–511 (2010).

    Article  CAS  Google Scholar 

  21. Gieger, C. et al. Genetics meets metabolomics: a genome-wide association study of metabolite profiles in human serum. PLoS Genet. 4, e1000282 (2008). This paper reports the first GWAS with metabolic traits and with ratios between metabolite concentrations.

    Article  Google Scholar 

  22. Tanaka, T. et al. Genome-wide association study of plasma polyunsaturated fatty acids in the InCHIANTI study. PLoS Genet. 5, e1000338 (2009).

    Article  Google Scholar 

  23. Hicks, A. A. et al. Genetic determinants of circulating sphingolipid concentrations in European populations. PLoS Genet. 5, e1000672 (2009).

    Article  Google Scholar 

  24. Demirkan, A. et al. Genome-wide association study identifies novel loci associated with circulating phospho- and sphingolipid concentrations. PLoS Genet. 8, e1002490 (2012).

    Article  CAS  Google Scholar 

  25. Wishart, D. S. et al. HMDB: a knowledgebase for the human metabolome. Nucleic Acids Res. 37, D603–D610 (2009).

    Article  CAS  Google Scholar 

  26. Kanehisa, M., Goto, S., Sato, Y., Furumichi, M. & Tanabe, M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40, D109–D114 (2012).

    Article  CAS  Google Scholar 

  27. Altmaier, E. et al. Bioinformatics analysis of targeted metabolomics--uncovering old and new tales of diabetic mice under medication. Endocrinology 149, 3478–3489 (2008).

    Article  CAS  Google Scholar 

  28. Petersen, A. K. et al. On the hypothesis-free testing of metabolite ratios in genome-wide and metabolome-wide association studies. BMC Bioinformatics 13, 120 (2012). This paper provides a statistical underpinning to using ratios between metabolite concentrations in association studies.

    Article  Google Scholar 

  29. Borenstein, M., Hedges, L. V., Higgins, J. P. T. & Rothstein, H. R. Introduction to Meta-Analysis (Wiley, 2009).

    Book  Google Scholar 

  30. Krumiesk, J. et al. Mining the unknown: A systems approach to metabolite identification combining genetic and metabolic information. PLoS Genet. (in the press).

  31. Cornelis, M. C. et al. Genome-wide meta-analysis identifies regions on 7p21 (AHR) and 15q24 (CYP1A2) as determinants of habitual caffeine consumption. PLoS Genet. 7, e1002033 (2011).

    Article  CAS  Google Scholar 

  32. Zhang, A., Sun, H., Wang, P., Han, Y. & Wang, X. Recent and potential developments of biofluid analyses in metabolomics. J. Proteomics 75, 1079–1088 (2011).

    Article  Google Scholar 

  33. Yu, Z. et al. Differences between human plasma and serum metabolite profiles. PLoS ONE 6, e21230 (2011).

    Article  CAS  Google Scholar 

  34. Tukiainen, T. et al. Detailed metabolic and genetic characterization reveals new associations for 30 known lipid loci. Hum. Mol. Genet. 21, 1444–1455 (2012).

    Article  CAS  Google Scholar 

  35. Inouye, M. et al. Metabonomic, transcriptomic, and genomic variation of a population cohort. Mol. Syst. Biol. 6, 441 (2010).

    Article  Google Scholar 

  36. Kottgen, A. et al. New loci associated with kidney function and chronic kidney disease. Nature Genet. 42, 376–384 (2010).

    Article  Google Scholar 

  37. Suhre, K. et al. Identification of a potential biomarker for FABP4 inhibition: the power of lipidomics in preclinical drug testing. J. Biomol. Screen 16, 467–475 (2011).

    Article  CAS  Google Scholar 

  38. Dupuis, J. et al. New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nature Genet. 42, 105–116 (2010).

    Article  CAS  Google Scholar 

  39. Sanna, S. et al. Common variants in the SLCO1B3 locus are associated with bilirubin levels and unconjugated hyperbilirubinemia. Hum. Mol. Genet. 18, 2711–2718 (2009).

    Article  CAS  Google Scholar 

  40. Johnson, A. D. et al. Genome-wide association meta-analysis for total serum bilirubin levels. Hum. Mol. Genet. 18, 2700–2710 (2009).

    Article  CAS  Google Scholar 

  41. Kolz, M. et al. Meta-analysis of 28,141 individuals identifies common variants within five new loci that influence uric acid concentrations. PLoS Genet. 5, e1000504 (2009).

    Article  Google Scholar 

  42. Zhai, G. et al. Eight common genetic variants associated with serum DHEAS levels suggest a key role in ageing mechanisms. PLoS Genet. 7, e1002025 (2011).

    Article  CAS  Google Scholar 

  43. Teslovich, T. M. et al. Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466, 707–713 (2010).

    Article  CAS  Google Scholar 

  44. Kathiresan, S. et al. Six new loci associated with blood low-density lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans. Nature Genet. 40, 189–197 (2008).

    Article  CAS  Google Scholar 

  45. Willer, C. J. et al. Newly identified loci that influence lipid concentrations and risk of coronary artery disease. Nature Genet. 40, 161–169 (2008).

    Article  CAS  Google Scholar 

  46. Kathiresan, S. et al. Common variants at 30 loci contribute to polygenic dyslipidemia. Nature Genet. 41, 56–65 (2009).

    Article  CAS  Google Scholar 

  47. Kronenberg, F. in Genetics Meets Metabolomics: from Experiment to Systems Biology (ed. Suhre, K.) 255–264 (Springer, 2012).

    Book  Google Scholar 

  48. The Wellcome Trust Case Control Conortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, 661–678 (2007).

  49. Franke, A. et al. Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci. Nature Genet. 42, 1118–1125 (2010).

    Article  CAS  Google Scholar 

  50. Kato, N. et al. Meta-analysis of genome-wide association studies identifies common variants associated with blood pressure variation in east Asians. Nature Genet. 43, 531–538 (2011).

    Article  CAS  Google Scholar 

  51. Rothman, N. et al. A multi-stage genome-wide association study of bladder cancer identifies multiple susceptibility loci. Nature Genet. 42, 978–984 (2010).

    Article  CAS  Google Scholar 

  52. Chambers, J. C. et al. Genetic loci influencing kidney function and chronic kidney disease. Nature Genet. 42, 373–375 (2010).

    Article  CAS  Google Scholar 

  53. Wallace, C. et al. Genome-wide association study identifies genes for biomarkers of cardiovascular disease: serum urate and dyslipidemia. Am. J. Hum. Genet. 82, 139–149 (2008).

    Article  CAS  Google Scholar 

  54. Li, S. et al. The GLUT9 gene is associated with serum uric acid levels in Sardinia and Chianti cohorts. PLoS Genet. 3, e194 (2007).

    Article  Google Scholar 

  55. Doring, A. et al. SLC2A9 influences uric acid concentrations with pronounced sex-specific effects. Nature Genet. 40, 430–436 (2008).

    Article  Google Scholar 

  56. Ferreira, M. A. & Purcell, S. M. A multivariate test of association. Bioinformatics 25, 132–133 (2009).

    Article  CAS  Google Scholar 

  57. Ried, J. S. et al. PSEA: phenotype set enrichment analysis—a new method for analysis of multiple phenotypes. Genet. Epidemiol. 36, 244–252 (2012).

    Article  Google Scholar 

  58. Raychaudhuri, S. et al. Identifying relationships among genomic disease regions: predicting genes at pathogenic SNP associations and rare deletions. PLoS Genet. 5, e1000534 (2009).

    Article  Google Scholar 

  59. Krumsiek, J., Suhre, K., Illig, T., Adamski, J. & Theis, F. J. Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data. BMC Syst. Biol. 5, 21 (2011). This paper introduces partial correlation networks to high-throughput metabolomics studies that may be used in a systems biology approach to GWAS with metabolic traits.

    Article  CAS  Google Scholar 

Download references

Acknowledgements

K.S. is supported by 'Biomedical Research Program' funds at Weill Cornell Medical College in Qatar, a program funded by the Qatar Foundation. The statements made herein are solely the responsibility of the authors. The authors thank G. Kastenmüller, A.-K. Petersen and J. Adamski for critical reading of the manuscript. We thank our reviewers for suggestions that led to the improvement of the manuscript.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Karsten Suhre or Christian Gieger.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Related links

Related links

FURTHER INFORMATION

Karsten Suhre's homepage

GeneCards

GRAIL

GWAS server

HapMap

Helmholtz Zentrum München (German Research Center for Environmental Health)

HMDB Serum Metabolome

Human Metabolome Database

Ingenuity Systems Pathway Analysis

KEGG

LIPID MAPS Lipidomics Gateway

MassBank

metaP-server

Nature Reviews Genetics Series on Study designs

NHGRI Catalog of Published Genome-Wide Association Studies

Online Mendelian Inheritance in Man (OMIM)

The Pharmacogenomics Knowledge Base (PharmGKB)

Software — Department of Epidemiology and Biostatistics — Karolinska Institutet

Weill Cornell Medical College in Qatar

Glossary

NMR spectroscopy

An experimental technique that identifies molecules by the specific pattern in the chemical shift of specific atoms.

High-performance liquid-phase chromatography

(HPLC). A chromatographic technique used to separate a complex mixture of metabolites. Often used in combination with mass spectrometry.

Metabolomics

The field of identifying metabolites in a biological sample using techniques such as NMR spectrometry and liquid- or gas-phase chromatography coupled with mass spectroscopy. 'Metabonomics' is often synonymously used in connection with NMR-based experiments.

Metabolic traits

Quantitative measures of the concentrations of a specific metabolite.

Genetically influenced metabotypes

(GIMs). Associations between a genetic variant and a metabolic phenotype.

Metabolic individuality

The metabolic capacities of an individual, as defined by the ensemble of all functional genetic variants (genetically influenced metabotypes) in their metabolism-related genes. Historically, Garrod introduced the term 'chemical individuality' to represent this concept.

Metabolome

The ensemble of all small molecules (metabolites) that are processed by the body's enzyme and transporter proteins.

Glycerophosphatidylethanolamines

Glycerophospholipids with ethanolamine head groups.

Q–Q plots

A graphical method for comparing probability distributions. In genome-wide association studies, it is used to verify whether the P values are normally distributed; an over-representation of low P values indicates possible true positive associations.

Additive linear model

A mathematical model used in statistical association analysis; here, it assumes a linear additive effect of the minor alleles on the metabolite concentrations.

Linkage disequilibrium

(LD). A nonrandom association between neighbouring gene variants; it is used to describe a region of high correlation between SNPs.

Glycerophospholipids

Glycerol-based phospholipids are major constituents of the membrane bi-layers and are found in association with low-density lipoprotein (LDL) and high-density lipoprotein (HDL) particles.

Sphingolipids

A class of lipids that contain a backbone of sphingoid bases

Rights and permissions

Reprints and permissions

About this article

Cite this article

Suhre, K., Gieger, C. Genetic variation in metabolic phenotypes: study designs and applications. Nat Rev Genet 13, 759–769 (2012). https://doi.org/10.1038/nrg3314

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/nrg3314

This article is cited by

Search

Quick links

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research