Abstract
Metabolites derived from dietary choline and L-carnitine, such as trimethylamine N-oxide and betaine, have recently been identified as novel risk factors for atherosclerosis in mice and humans. We sought to identify genetic factors associated with plasma betaine levels and determine their effect on risk of coronary artery disease (CAD). A two-stage genome-wide association study (GWAS) identified two significantly associated loci on chromosomes 2q34 and 5q14.1. The lead variant on 2q24 (rs715) localizes to carbamoyl-phosphate synthase 1 (CPS1), which encodes a mitochondrial enzyme that catalyses the first committed reaction and rate-limiting step in the urea cycle. Rs715 is also significantly associated with decreased levels of urea cycle metabolites and increased plasma glycine levels. Notably, rs715 yield a strikingly significant and protective association with decreased risk of CAD in only women. These results suggest that glycine metabolism and/or the urea cycle represent potentially novel sex-specific mechanisms for the development of atherosclerosis.
Similar content being viewed by others
Introduction
Metabolomics is an emerging field of biomedical research that is based on characterizing the repertoire of small molecules in biological samples. With respect to coronary artery disease (CAD), this has led to the identification of metabolites in several pathways that may serve as novel clinical biomarkers1,2,3. Furthermore, systems genetics analyses that combine metabolic profiles with genomic, transcriptomic, proteomic and clinical data have yielded comprehensive data sets that can potentially be leveraged to identify underlying biological networks that drive disease susceptibility4. For example, genome-wide association studies (GWASs) for CAD phenotypes and serum or urinary metabolite levels have provided a catalogue of numerous genetic associations, including variants that are often shown to have pleiotropic effects on multiple metabolites in the same biological pathway5. However, these studies have generally been carried out in an unbiased manner, both from a genetics and metabolomics perspective, and efforts are underway to intersect these high-dimensional data to identify pathways causally related to CAD and other diseases.
Another complementary metabolomics approach is to focus on targeted pathways that have been directly implicated in the development of CAD. In this regard, we recently described the role of metabolites derived from dietary choline and L-carnitine on aortic lesion formation in mice and risk of CAD in humans2,6,7. For example, high levels of trimethylamine N-oxide (TMAO), which is generated by one or more of the flavin monooxygenase (FMO) family of enzymes8,9 through gut microbiota-dependent metabolism of choline and L-carnitine, have been mechanistically linked to atherosclerosis and increased risk of CAD. Interestingly, naturally occurring DNA variation appears to only play a marginal role in the regulation of TMAO levels, particularly in humans, suggesting that dietary factors and/or gut bacterial composition are more important determinants10.
In parallel, dietary choline can also be absorbed by the intestine and oxidized to betaine in the liver (and kidney) via a two-step process involving choline dehydrogenase (CHDH) and aldehyde dehydrogenase 7 family member A1 (ALDH7A1)11,12. Although the precise role of TMAO in physiological processes and atherogenesis is not entirely known, both it and FMO3, the major enzyme responsible for generating TMAO8, have been linked to alterations in cholesterol and sterol metabolism13,14, as well as inflammatory gene expression6,14. By comparison, betaine provides one source of methyl groups required for the conversion of homocysteine to methionine, as well the being a methyl donor in folate metabolism15. However, the contribution of genetic factors to plasma betaine levels and, by extension, CAD are not well understood. Therefore, the aim of the present study was to integrate targeted metabolomics and GWAS data to identify genetic factors controlling plasma betaine levels and determine their relationship to risk of CAD. These analyses identified two significantly associated loci for betaine levels on chromosomes 2q34 and 5q14.1. The locus on 2q24 was further associated with intermediates in the metabolic pathway leading from choline to urea and also exhibited a protective and strikingly significant female-specific association with risk of CAD.
Results
Clinical characteristics of Genebank subjects
Table 1 describes the characteristics of the GeneBank subjects used in this study for whom complete clinical data were available. As expected for a patient population undergoing elective cardiac evaluation by coronary angiography, the majority of subjects were male, had prevalent CAD and were taking lipid-lowering medications (Table 1).
Two-stage GWAS for plasma betaine levels
To identify the genetic determinants of plasma betaine levels, we carried out a two-stage GWAS in sequentially consented and enrolled individuals from GeneBank. In stage 1, ∼2.4 million genotyped and imputed autosomal single-nucleotide polymorphisms (SNPs) were evaluated for association with plasma betaine levels in the first 1,985 subjects, with adjustment for age and sex. The observed genomic control factor in these analyses was 1.007, indicating that the GWAS results are not confounded by underlying population stratification. As shown by the Manhattan plot in Fig. 1, four loci on chromosomes 1q32.2, 2q34, 5q14.1 and 16q24.1 were associated with plasma betaine levels in stage 1 at the significant (P<5.0 × 10−8) or suggestive (P<5.0 × 10−6) genome-wide thresholds (Table 2). Multiple SNPs on chromosome 5q14.1 were significantly associated with plasma betaine levels despite not being in strong linkage disequilibrium (LD) with the lead SNP (Fig. 2a), whereas the association signal on chromosome 2q34 was primarily driven by rs715 (Fig. 2b). To determine whether the signals at chromosome 5q14.1 were independent, we performed an analysis conditioned on the lead SNP (rs617219) and identified rs16876394 as still being significantly associated with plasma betaine levels at the genome-wide level (conditioned P=8.5 × 10−9) as well as a nominal association with rs557302 (P=0.002). Aside from rs617219, rs16876394, rs557302 or variants in high LD with these SNPs, there were no other suggestive or significant associations at the chromosome 5q14.1 locus with plasma betaine levels.
In stage 2, we genotyped the lead SNPs at the four identified loci in an additional 1,895 sequential GeneBank subjects for whom plasma betaine levels were measured. The threshold for significance in these replication analyses was set at P<8.3 × 10−3 based on a Bonferroni correction for testing six SNPs (0.05/6). These analyses failed to replicate the association of rs674433 and rs2641698 with plasma betaine levels (Table 2); therefore, the loci on chromosomes 1q32.2 and 16q24.1 were not given further consideration. However, rs715 on chromosome 2q34 and the three independent SNPs on chromosome 5q14.1 (rs617219, rs16876394 and rs557302) all yielded directionally consistent and statistically significant associations with plasma betaine levels in stage 2, which became even more significant in a combined analysis with all subjects (Table 2).
Follow-up analyses with chromosome 5q14.1 locus
The lead SNPs on chromosome 5q14.1 span an ∼200 kb interval containing genes that encode enzymes known to be involved in betaine metabolism, including betaine-homocysteine S-methyltransferase (BHMT), BHMT2 and dimethylglycine dehydrogenase (DMGDH; Fig. 3). The rs617219 variant maps ∼1,500 bp downstream of BHMT and is associated with higher betaine levels, whereas rs16876394 and rs557302 are located in intron 5 of DMGDH and intron 4 of BHTM2, respectively, and lead to lower betaine levels (Fig. 2a and Table 2). The BHMT and BHMT2 enzymes catalyse the transfer of a methyl group from betaine (also known as trimethylglycine) to homocysteine and simultaneously produce dimethylglycine and methionine, respectively. Dimethylglycine is further metabolized by DMGDH through another demethylation reaction to sarcosine, which is subsequently converted to glycine via sarcosine dehydrogenase (SDH; Fig. 3).
As chromosome 5q14.1 harbours genes involved in betaine metabolism, we next determined the association of the lead variants at this locus with other analytes in this pathway that were measured in 400 GeneBank subjects selected from stages 1 and 2 who were matched with respect to age, sex and CAD status. At a Bonferroni-corrected threshold of P<2.1 × 10−3 (0.05/3 SNPs × 8 metabolites), the only significant associations revealed by these analyses were increased and decreased plasma dimethylglycine levels with rs557302 (P=6.1 × 10−4) and rs617219 (P=3.9 × 10−4), respectively (Table 3). Furthermore, none of the three chromosome 5q14.1 variants were associated with other pathway-related intermediates, including the pro-atherogenic metabolite TMAO, which is derived from the initial catabolism of choline to trimethylamine (TMA) by gut bacteria, followed by its oxidation to TMAO in the liver by one or more members of the FMO family of enzymes8 (Table 3 and Fig. 3).
To determine a functional basis for the association of these variants with plasma betaine levels, we searched the publicly available Genotype-Tissue Expression Project database (http://www.gtexportal.org/) for evidence of cis expression quantitative trait loci (eQTL) at the chromosome 5q14.1 locus. Of the metabolically relevant tissues available, rs557302 and rs617219, but not rs16876394, yielded cis eQTLs for BHMT, BHMT2 or DMGDH at varying levels of significance in subcutaneous adipose tissue and skeletal muscle (Supplementary Figs 1–3), but not in the liver or kidney (data not shown).
Follow-up analyses with chromosome 2q34
The lead SNP on chromosome 2q34 (rs715; T>C) is located in the 3′ untranslated region of the carbamoyl-phosphate synthase 1 gene (CPS1), and, in subjects of northern European ancestry, is in near perfect LD (r2=0.93) with rs1047891 (formerly designated as rs7422339; Fig. 2b). Interestingly, two other previous GWAS identified a different CPS1 variant (rs2216405) for plasma glycine levels16,17, which is in moderate LD with rs1047891 and rs715 in subjects of northern European ancestry (r2∼0.42). In our GWAS analyses, rs2216405 yielded a modest association with plasma betaine levels (P=0.005; Fig. 2b) but became non-significant after conditioning on rs715 (conditioned P-value=0.39). As a result, we focused our follow-up analyses for the CPS1 locus on rs715.
CPS1 encodes a mitochondrial enzyme that catalyses the first committed reaction and rate-limiting step in the urea cycle by generating carbamoyl phosphate from NH3 and CO2. One of the major routes leading to the formation of NH3 and CO2 in animals is the degradation of glycine by the glycine cleavage complex (GCC; Fig. 3). Therefore, we next determined the association of rs715 with plasma glycine levels and other metabolites leading from choline to the urea cycle in sequential GeneBank subjects and in the subset of 400 individuals in whom betaine-derived analytes were measured. At a Bonferroni-corrected threshold of P<4.2 × 10−3 (0.05/1 SNP × 12 metabolites), rs715 was significantly associated with increased plasma glycine and decreased citrulline levels (Table 4). Based on previously reported sexually dimorphic effects of rs715 on plasma glycine levels18, we also carried out association tests in males and females separately. The association of rs715 with plasma choline, betaine, glycine and citrulline was more pronounced and significant in female subjects, despite comparable or greater numbers of males in these analyses (Table 4). Rs715 also yielded nominally significant female-specific associations with plasma TMAO and other urea cycle metabolites but not with intermediates, such as homocysteine, methionine, dimethylglycine and sarcosine (Table 4). Of note, the significant sexually dimorphic associations of rs715 with plasma choline, betaine and glycine levels were supported by significant interactions with sex (P<0.05; Table 4).
Effect of betaine-associated variants on risk of CAD
To investigate the clinical significance of the loci on chromosomes 2q34 and 5q14.1, we determined whether the identified variants were associated with various CAD phenotypes. In addition to using the subjects in the metabolite analyses described above, we also genotyped additional sequential consenting subjects enrolled in GeneBank with available genomic DNA and clinical phenotype data (total n=8,668) for rs715, rs16876394, rs557302 and rs617219. Of these variants, rs715 yielded a protective association (P=2.6 × 10−3; Table 5) with severe CAD, defined as having ≥50% stenosis in three or more major epicardial arteries, which was significant at a Bonferroni-corrected threshold of testing four SNPs with three CAD phenotypes (0.05/12=4.2 × 10−3). By comparison, none of the chromosome 5q14.1 variants were significantly associated with any CAD phenotype (Table 5).
The sexually dimorphic metabolite associations with rs715 suggested to us that the association of this variant with CAD could also differ in men and women and/or be mediated through its effects on metabolite levels. To concurrently test these hypotheses, we carried out sex-stratified multivariate analyses in the subset of 400 age-, sex- and CAD-matched GeneBank subjects for whom genetic, metabolomic and clinical phenotype data were available. In addition, only betaine and glycine were included in the multivariate analyses as these metabolites were associated with rs715 in males and females separately at the Bonferroni-corrected significance threshold and exhibited significant interactions with sex. In a univariate model without the inclusion of metabolite levels, rs715 was associated with decreased risk of CAD in women (P=0.04) but not men (P=0.54; Supplementary Table 1). The female-specific protective association of rs715 with CAD was still significant in a multivariate model that included betaine levels, but not in models that adjusted for glycine or both metabolites (Supplementary Table 1).
To further evaluate the association of the variants on chromosomes 2q34 and 5q14.1 with prevalent CAD in a larger independent data set, we used the results of a meta-analysis of GWAS data with 22,233 cases and 64,762 controls from the CARDIoGRAM Consortium. Consistent with the results in GeneBank, 715 was associated with decreased risk of CAD (odds ratio (OR)=0.95, 95% confidence interval (CI)=0.92–0.99; P=0.01) in CARDIoGRAM (Table 6), whereas none of the variants at the chromosome 5q14.1 locus were associated with CAD (Supplementary Table 2). We also evaluated the association of rs715 in males and females separately using the sex-stratified results from CARDIoGRAM. Notably, the C allele of rs715 exhibited a particularly significant association with decreased risk of CAD in females (OR=0.88, 95% CI=0.83–0.94; P=6.3 × 10−5) but not in male subjects (Table 6).
Discussion
Based on recent studies in mice and humans implicating choline-derived metabolites in atherosclerosis, the goal of the present study was to identify the genetic determinants of plasma betaine levels in humans and determine their relationship with risk of CAD. Our GWAS analyses identified and validated two loci on chromosomes 2q34 and 5q14.1 that were significantly associated with plasma betaine levels in a cohort of patients undergoing elective cardiac evaluation. Interestingly, these efforts also identified chromosome 2q34 locus as having a more pronounced effect in women on a cascade of circulating metabolites that lead from choline to urea as well as a strong female-specific association with risk of CAD.
The variants on chromosome 5q14.1 are located in or near several genes (BHMT, BHMT2 and DMGDH) that metabolize betaine through a series of demethylation reactions. The association signals were restricted to plasma betaine and dimethylglycine levels and were derived from several independent SNPs with directionally opposite effects. It is possible that our analyses were underpowered to detect associations with other betaine pathway metabolites (that is, methionine or sarcosine) as these were only measured in a subset of ∼400 GeneBank subjects. We suspect that this is unlikely as the Twins UK, KORA and Framingham Heart Study cohorts reported directionally consistent associations between rs16876394, rs557302 and rs617219 with plasma betaine levels but not with other pathway intermediates19,20. However, these studies did not determine whether betaine-associated variants were associated with risk of CAD. Our analyses further revealed that rs557302 and rs617219 exhibit eQTLs for BHMT, BHMT2 and DMGDH in adipose tissue and skeletal muscle, thus providing functional evidence for association of chromosome 5q14.1 with plasma betaine levels. Interestingly, the alleles of rs557302 and rs617219 that increased expression of BHMT and BHMT2, which would presumably increase betaine catabolism, were unexpectedly associated with higher betaine levels. By contrast, the alleles that increased expression of DMGDH were associated with lower dimethylglycine levels, which is a biological effect that is consistent with the direction of the eQTL and metabolite associations. Furthermore, BHMT, BHMT2 and DMGDH are predominantly expressed in the liver and kidney but there were no cis eQTLs identified for these genes with rs16876394, rs557302, rs617219, or any other chromosome 5q14.1 variants in previously published hepatic gene expression data sets, at least based on the thresholds selected for genome-wide significance in these prior analyses21,22,23,24. Thus, additional studies will be required to elucidate the genetic and functional complexity underlying the association of plasma betaine levels with the chromosome 5q14.1 region.
The other locus identified by our GWAS on chromosome 2q34 localizes to CPS1 and revealed several interesting observations. For example, in both male and female GeneBank subjects, rs715 was primarily associated with decreased betaine and increased glycine levels but the effects were stronger in women. Although these results are consistent with prior studies18,19,20,25,26,27, we additionally demonstrated that rs715 was associated with decreased levels of choline, TMAO and urea cycle products (for example, citrulline) in women as well. With respect to the urea cycle, the strongest effect was on citrulline with increasingly weaker effects on more distal metabolites. Although similar trends were observed in men, the effect sizes on urea cycle metabolites were not as strong and the associations did not reach statistical significance. Moreover, the more prominent associations of rs715 with choline, TMAO, betaine and glycine levels in women were supported by significant statistical evidence for an interaction with sex. Taken together, these data revealed a pattern, at least in women, whereby rs715 was most strongly associated with increased glycine levels and more weakly associated with decreased levels of the most proximal precursors starting at choline and the most distal metabolites in the urea cycle. This pathway also provides at least one plausible unifying mechanism for the pleiotropic associations of CPS1 with metabolites leading from choline to urea (Fig. 3).
The direction of the associations we and others have detected with the CPS1 locus suggest that the minor C allele of rs715 and/or other tightly linked variants lead to decreased CPS1 activity/expression. Based on data from the HapMap and 1000 Genomes Projects for subjects of northern European ancestry, rs715 is only in very strong LD with rs1047891 (r2=0.93), a nonsynonymous Thr1405Asn (ACC>AAC) substitution that together with rs715 comprises a small haplotype block at the 3′ end of CPS1. Of these polymorphisms, we speculate that rs1047891 may be the more likely causal SNP as structure-function studies have shown this amino-acid substitution to be located within a CPS1 domain that is important for its allosteric activation by N-acetylglutamate (NAG)28,29. Thus, it is possible that an asparagine at position 1,405 disrupts the interaction between CPS1 and NAG, thereby reducing activation of the enzyme and flux through the urea cycle. Such a notion is supported by the development of severe hyperammonemia and lower plasma citrulline, arginine and ornithine levels in an NAG synthase-deficient mouse model30, as well as a previous in vivo functional genetics study demonstrating that carriers of rs1047891 had significantly decreased levels of arginine-derived nitric oxide metabolites and agonist-stimulated vasodilation during bradykinin infusion31.
Glycine is the downstream product of progressive betaine demethylation and one metabolite that is degraded to NH3 for subsequent entry into the urea cycle32. Although the functional consequences of rs715 and/or rs1047891 might be predicted to also lead to somewhat elevated blood NH3 levels, this genetic effect(s) is apparently not to the same extent as that conferred by loss-of-function alleles resulting in CPS1 deficiency33. For example, ∼10% of subjects in the general population with northern European ancestry are homozygous for the rs715/rs1047891 haplotype but there are no reports that such individuals exhibit overt symptoms of severe hyperammonemia. This observation implies that the minor alleles of the rs715/rs1047891 variants, either alone or in combination, do not decrease CPS1 activity/expression by more than 50%. Nonetheless, the effect of rs715 and/or rs1047891 on CPS1 function may still explain why these variants were associated with increase plasma glycine levels as NH3 can be converted back to glycine through bidirectional reactions catalysed by GCC32. This notion would also be consistent with the hyperglycinemia observed in CPS1-deficient patients33.
Another major aim of our study was to determine the relationship between betaine-associated loci and risk of CAD. Of the two validated loci for betaine levels, the rare allele of the CPS1 variant was associated with decreased risk of CAD in GeneBank and CARDIoGRAM. The sex-specific protective effect on CAD risk was also consistent in both studies, with a strikingly significant association observed in only women in CARDIoGRAM. This raises several interesting questions regarding the biological mechanism(s) by which rs715 decreases sex-specific risk of CAD. The most straightforward hypothesis is that the protective effect of rs715 is mediated through one or more of the metabolites/biomarkers that this variant is associated with. For example, the minor alleles of rs715 and rs1047891 have been associated with other CAD-related traits, including increased homocysteine34,35,36,37 and creatinine levels38, but decreased homoarginine39,40, high-density lipoprotein41, and fibrinogen levels42,43. However, with the possible exception of the association with fibrinogen levels, which in and of themselves do not appear to be causally related to CAD43, the direction of the associations with rs715 and these biomarkers is opposite to what would be expected for a variant that decreases risk of CAD.
Our results would also suggest that choline and betaine are not the likely causal factors for the association of rs715 with CAD as the chromosome 5q14.1 variants had similar effects on betaine levels but were not associated with CAD even in the large CARDIoGRAM Consortium. Furthermore, a recent analysis showed that neither choline nor betaine predicted incident cardiac events when plasma TMAO levels were added to the adjustment model, and that choline and betaine predicted future risk of adverse events only in the context of elevated TMAO levels44. Thus, it is possible that the association of rs715 with reduced risk of CAD is mediated through lower TMAO levels as this has been shown to be a pro-atherogenic metabolite in both mice and humans2. Other possibilities may be related to the association of rs715 with increased levels of glycine and/or decreased urea cycle metabolites. For example, we previously demonstrated strong clinical associations between increased prevalence of obstructive CAD and elevated citrulline and ornithine levels in GeneBank subjects45. Alternatively, the female-specific association of rs715 with CAD may involve increased glycine levels as it was no longer significant after inclusion of this amino acid, but not betaine, in the multivariate model. In this regard, glycine has been shown to have cardioprotective anti-inflammatory properties in endothelial cells, activated macrophages and other leukocytes46,47, thus providing another possible mechanism for how a genetic factor that increases glycine levels reduces risk of CAD. However, additional studies will still be required to determine whether plasma glycine levels are inversely related to CAD risk and whether glycine metabolism and/or the urea cycle are causally related to the development of atherosclerosis, particularly through sex-specific mechanisms.
In conclusion, the integration of targeted metabolomics with an unbiased genetic screen identified loci on chromosomes 2q34 and 5q14.1 as being associated with plasma levels of analytes related to betaine metabolism. Notably, we also identified a strong association of the chromosome 2q34 locus with decreased risk of CAD in only women. Importantly, this finding represents one of the first female-specific genetic associations for CAD and its magnitude (∼12% decreased risk) was equivalent to the most significantly associated loci identified for CAD to date48. Given that such loci still only explain ∼11% of the genetic variation in CAD risk48, our results also suggest that a portion of this ‘missing heritability’ may reside in sex-specific associations. This highlights the need for future genetics and metabolomics studies to be of sufficient size in order to permit adequately powered analyses in men and women separately.
Methods
Study population
The Cleveland Clinic GeneBank study is a single site sample repository generated from consecutive patients undergoing elective diagnostic coronary angiography or elective cardiac computed tomographic angiography with extensive clinical and laboratory characterization and longitudinal observation. Subject recruitment occurred between 2001 and 2007. Ethnicity was self-reported and information regarding demographics, medical history and medication use was obtained by patient interviews and confirmed by chart reviews. All clinical outcome data were verified by source documentation. CAD was defined as adjudicated diagnoses of stable or unstable angina, myocardial infarction (adjudicated definition based on defined electrocardiographic changes or elevated cardiac enzymes), angiographic evidence of ≥50% stenosis in one or more major epicardial vessel, and/or a history of known CAD (documented myocardial infarction, CAD or history of revascularization). The GeneBank Study has been used previously for discovery and replication of novel genes and risk factors for atherosclerotic disease49,50,51,52,53. The present study was approved by the Institutional Review Boards of the Cleveland Clinic and USC Keck School of Medicine.
Measurement of plasma metabolites
Metabolite levels in human plasma were quantified using stable isotope dilution high-performance liquid chromatography (HPLC) with online electrospray ionization tandem mass spectrometry on an ABI SCIEX QTRAP 5500 mass spectrometer (Applied Biosystems) interfaced with a Shimatzu HPLC equipped with a phenyl column (4.6 × 2,505 mm2, 5 μm RexChrom Phenyl; Regis). Separation was performed using a gradient starting from 10 mM ammonium formate over 0.5 min, then to 5 mM ammonium formate, 25% methanol and 0.1% formic acid over 3 min, held for 8 min, followed by 100% methanol and water washing for 3 min. Metabolites were monitored in multiple reaction monitoring mode using characteristic parent–daughter ion transitions at m/z ratios for each metabolite. Stable isotope labelled internal standards for each monitored analyte were added to plasma samples before protein precipitation and similarly monitored at the appropriate transitions in multiple reaction monitoring mode. Various concentrations of metabolite standards and a fixed amount of internal standards were spiked into control plasma to prepare the calibration curves for quantification of plasma analytes.
Genotyping
Genome-wide genotyping of SNPs in humans was performed on the Affymetrix Genome-Wide Human Array 6.0 chip. Using these data and those from 120 phased chromosomes from the HapMap CEU samples (HapMap r22 release, NCBI build 36), genotypes were imputed for untyped autosomal SNPs across the genome using MACH 1.0 software. All imputations were done on the forward (+) strand using 562,554 genotyped SNPs that had passed quality control filters. Analyses with the imputed data set excluded individuals with <90% call rates, and SNPs with Hardy-Weinberg equilibrium P-values<0.0001 and call rates <97% or minor allele frequencies <1%. This resulted in 2,421,770 autosomal SNPs that were available for a GWAS analysis in 1,985 GeneBank subjects. Genotyping of individual SNPs selected for replication in stage 2 and association with CAD phenotypes was performed using the TaqMan Allelic Discrimination system (Applied Biosystems). In samples from the GWAS data set that were also genotyped by Taqman, the concordance rate with genotypes obtained from the Affymetrix chip was >99%.
Statistical analyses
GWAS and individual SNP analyses for plasma betaine levels in GeneBank were carried out using linear regression analyses with natural log transformed values and adjustment for age and sex. To test for association of genetic variants with the presence and severity of CAD in the GeneBank Cohort, we used unconditional logistic or multinomial regression, with adjustment for age, sex, medication use (statins and/or aspirin) and Framingham ATP-III risk score (which includes smoking and diabetes status). To determine whether the association of rs715 with CAD was mediated through its effects on glycine or betaine levels, we carried out multivariate logistic regression with or without the metabolites included in the model. Adjusted ORs with 95% CIs are reported with two-sided P-values. All analyses were performed using PLINK 1.07 (ref. 54 or SAS 9.3 (SAS Institute Inc) assuming additive genetic models. For replication, the results of the Coronary Artery Disease Genome-wide Replication And Meta-Analysis (CARDIoGRAM) Consortium were used to determine whether variants identified for plasma betaine levels were associated with CAD. CARDIoGRAM represents a GWAS meta-analysis of CAD comprising a discovery set of 22,233 cases and 64,762 controls, in which logistic regression was first used in each cohort to test for association with CAD using a log-additive model with adjustment for age and sex and taking into account the uncertainty of possibly imputed genotypes. Subsequently, a meta-analysis was performed separately for every SNP from each study that passed the quality control criteria using a fixed effects model with inverse variance weighting or a random effects model, depending on the presence of heterogeneity between studies55. The same approach was used to test for association with CAD in males and females separately. The results of these meta-analyses were used to determine whether SNPs at the loci on chromosomes 2q34 and 5q14.1 were associated with CAD.
Additional information
How to cite this article: Hartiala, J. A. et al. Genome-wide association study and targeted metabolomics identifies sex-specific association of CPS1 with coronary artery disease. Nat. Commun. 7:10558 doi: 10.1038/ncomms10558 (2016).
References
Roberts, L. D. & Gerszten, R. E. Toward new biomarkers of cardiometabolic diseases. Cell Metab. 18, 43–50 (2013).
Wang, Z. et al. Gut flora metabolism of phosphatidylcholine promotes cardiovascular disease. Nature 472, 57–63 (2011).
Wang, Z. et al. Measurement of trimethylamine-N-oxide by stable isotope dilution liquid chromatography tandem mass spectrometry. Anal. Biochem. 455, 35–40 (2014).
Civelek, M. & Lusis, A. J. Systems genetics approaches to understand complex traits. Nat. Rev. Genet. 15, 34–48 (2014).
Hindorff, L. A., Junkins, H. A., Hall, P. N., Mehta, J. P. & Manolio, T. A. A catalog of published genome-wide association studies. Available at http://www.genome.gov/gwastudies/ (2015).
Koeth, R. A. et al. Intestinal microbiota metabolism of L-carnitine, a nutrient in red meat, promotes atherosclerosis. Nature Med. 19, 576–585 (2013).
Tang, W. H. et al. Intestinal microbial metabolism of phosphatidylcholine and cardiovascular risk. N. Engl. J. Med. 368, 1575–1584 (2013).
Bennett, B. J. et al. Trimethylamine-N-oxide, a metabolite associated with atherosclerosis, exhibits complex genetic and dietary regulation. Cell Metab. 17, 49–60 (2013).
Krueger, S. K. & Williams, D. E. Mammalian flavin-containing monooxygenases: structure/function, genetic polymorphisms and role in drug metabolism. Pharmacol. Ther. 106, 357–387 (2005).
Hartiala, J. et al. Comparative genome-wide association studies in mice and humans for trimethylamine N-oxide, a proatherogenic metabolite of choline and L-carnitine. Arterioscler. Thromb. Vasc. Biol. 34, 1307–1313 (2014).
Ueland, P. M., Holm, P. I. & Hustad, S. Betaine: a key modulator of one-carbon metabolism and homocysteine status. Clin. Chem. Lab. Med. 43, 1069–1075 (2005).
Lever, M. & Slow, S. The clinical significance of betaine, an osmolyte with a key role in methyl group metabolism. Clin. Biochem. 43, 732–744 (2010).
Shih, D. M. et al. Flavin containing monooxygenase 3 exerts broad effects on glucose and lipid metabolism and atherosclerosis. J. Lipid Res. 56, 22–37 (2015).
Warrier, M. et al. The TMAO-generating enzyme flavin monooxygenase 3 is a central regulator of cholesterol balance. Cell Rep S2211-1247, 01065–1 (2015).
Pajares, M. A. & Perez-Sala, D. Betaine homocysteine S-methyltransferase: just a regulator of homocysteine metabolism? Cell Mol. Life Sci. 63, 2792–2803 (2006).
Illig, T. et al. A genome-wide perspective of genetic variation in human metabolism. Nature Genet. 42, 137–141 (2010).
Suhre, K. et al. Human metabolic individuality in biomedical and pharmaceutical research. Nature 477, 54–60 (2011).
Mittelstrass, K. et al. Discovery of sexual dimorphisms in metabolic and genetic biomarkers. PLoS Genet. 7, e1002215 (2011).
Rhee, E. P. et al. A genome-wide association study of the human metabolome in a community-based cohort. Cell Metab. 18, 130–143 (2013).
Shin, S. Y. et al. An atlas of genetic influences on human blood metabolites. Nature Genet. 46, 543–550 (2014).
Schadt, E. E. et al. Mapping the genetic architecture of gene expression in human liver. PLoS Biol. 6, e107 (2008).
Greenawalt, D. M. et al. A survey of the genetics of stomach, liver, and adipose gene expression from a morbidly obese cohort. Genome Res. 21, 1008–1016 (2011).
Innocenti, F. et al. Identification, replication, and functional fine-mapping of expression quantitative trait loci in primary human liver tissue. PLoS Genet. 7, e1002078 (2011).
Schroder, A. et al. Genomics of ADME gene expression: mapping expression quantitative trait loci relevant for absorption, distribution, metabolism and excretion of drugs in human liver. Pharmacogenomics J. 13, 12–20 (2013).
Xie, W. et al. Genetic variants associated with glycine metabolism and their role in insulin sensitivity and type 2 diabetes. Diabetes 62, 2141–2150 (2013).
Yu, B. et al. Genetic determinants influencing human serum metabolome among African Americans. PLoS Genet. 10, e1004212 (2014).
Demirkan, A. et al. Insight in genome-wide association of metabolite quantitative traits by exome sequence analyses. PLoS Genet. 11, e1004835 (2015).
Rodriguez-Aparicio, L. B. Guadalajara AM, Rubio V. Physical location of the site for N-acetyl-L-glutamate, the allosteric activator of carbamoyl phosphate synthetase, in the 20-kilodalton COOH-terminal domain. Biochemistry (Mosc.) 28, 3070–3074 (1989).
Britton, H. G., Garcia-Espana, A., Goya, P., Rozas, I. & Rubio, V. A structure-reactivity study of the binding of acetylglutamate to carbamoyl phosphate synthetase I. Eur. J. Biochem. 188, 47–53 (1990).
Senkevitch, E., Cabrera-Luque, J., Morizono, H., Caldovic, L. & Tuchman, M. A novel biochemically salvageable animal model of hyperammonemia devoid of N-acetylglutamate synthase. Mol. Genet. Metab. 106, 160–168 (2012).
Summar, M. L. et al. Relationship between carbamoyl-phosphate synthetase genotype and systemic vascular function. Hypertension 43, 186–191 (2004).
Kikuchi, G., Motokawa, Y., Yoshida, T. & Hiraga, K. Glycine cleavage system: reaction mechanism, physiological significance, and hyperglycinemia. Proc. Jpn Acad. B Phys. Biol. Sci. 84, 246–263 (2008).
Freeman, J. M., Nicholson, J. F., Schimke, R. T., Rowland, L. P. & Carter, S. Congenital hyperammonemia. Association with hyperglycinemia and decreased levels of carbamyl phosphate synthetase. Arch. Neurol. 23, 430–437 (1970).
Pare, G. et al. Novel associations of CPS1, MUT, NOX4, and DPEP1 with plasma homocysteine in a healthy population: a genome-wide evaluation of 13 974 participants in the Women's Genome Health Study. Circ. Cardiovasc. Genet 2, 142–150 (2009).
Lange, L. A. et al. Genome-wide association study of homocysteine levels in Filipinos provides evidence for CPS1 in women and a stronger MTHFR effect in young adults. Hum. Mol. Genet. 19, 2050–2058 (2010).
van Meurs, J. B. et al. Common genetic loci influencing plasma homocysteine concentrations and their effect on risk of coronary artery disease. Am. J. Clin. Nutr. 98, 668–676 (2013).
Williams, S. R. et al. Genome-wide meta-analysis of homocysteine and methionine metabolism identifies five one carbon metabolism loci and a novel association of ALDH1L1 with ischemic stroke. PLoS Genet. 10, e1004214 (2014).
Kottgen, A. et al. New loci associated with kidney function and chronic kidney disease. Nature Genet. 42, 376–384 (2010).
Choe, C. U. et al. Homoarginine levels are regulated by L-arginine:glycine amidinotransferase and affect stroke outcome: results from human and murine studies. Circulation 128, 1451–1461 (2013).
Kleber, M. E. et al. Genome-wide association study identifies 3 genomic loci significantly associated with serum levels of homoarginine: the AtheroRemo Consortium. Circ. Cardiovasc. Genet. 6, 505–513 (2013).
Global Lipids Genetics, C. et al. Discovery and refinement of loci associated with lipid levels. Nature Genet. 45, 1274–1283 (2013).
Danik, J. S. et al. Novel loci, including those related to Crohn disease, psoriasis, and inflammation, identified in a genome-wide association study of fibrinogen in 17 686 women: the Women's Genome Health Study. Circ. Cardiovasc. Genet. 2, 134–141 (2009).
Sabater-Lleal, M. et al. Multiethnic meta-analysis of genome-wide association studies in >100 000 subjects identifies 23 fibrinogen-associated Loci but no strong evidence of a causal association between circulating fibrinogen and cardiovascular disease. Circulation 128, 1310–1324 (2013).
Wang, Z. et al. Prognostic value of choline and betaine depends on intestinal microbiota-generated metabolite trimethylamine-N-oxide. Eur. Heart J. 35, 904–910 (2014).
Tang, W. H., Wang, Z., Cho, L., Brennan, D. M. & Hazen, S. L. Diminished global arginine bioavailability and increased arginine catabolism as metabolic profile of increased cardiovascular risk. J. Am. Coll. Cardiol. 53, 2061–2067 (2009).
Hasegawa, S. et al. Cysteine, histidine and glycine exhibit anti-inflammatory effects in human coronary arterial endothelial cells. Clin. Exp. Immunol. 167, 269–274 (2012).
Wang, W. et al. Glycine metabolism in animals and humans: implications for nutrition and health. Amino Acids 45, 463–477 (2013).
Nikpay, M. et al. A comprehensive 1,000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat. Genet. 47, 1121–1130 (2015).
Bhattacharyya, T. et al. Relationship of paraoxonase 1 (PON1) gene polymorphisms and functional activity with systemic oxidative stress and cardiovascular risk. JAMA 299, 1265–1276 (2008).
Hartiala, J. et al. Genetic contribution of the leukotriene pathway to coronary artery disease. Hum. Genet. 129, 617–627 (2011).
Tang, W. H. et al. Clinical and genetic association of serum ceruloplasmin with cardiovascular risk. Arterioscler. Thromb. Vasc. Biol. 32, 516–522 (2012).
Tang, W. H. et al. Clinical and genetic association of serum paraoxonase and arylesterase activities with cardiovascular risk. Arterioscler. Thromb. Vasc. Biol. 32, 2803–2812 (2012).
Reiner, A. P. et al. Genome-wide and gene-centric analyses of circulating myeloperoxidase levels in the charge and care consortia. Hum. Mol. Genet. 22, 3381–3393 (2013).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Schunkert, H. et al. Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nature Genet. 43, 333–338 (2011).
Acknowledgements
We thank the CARDIoGRAM Consortium for providing access to the results of the meta-analyses of GWAS data for CAD. A list of authors and affiliations for the consortium is provided in the Supplementary Information. This study was supported in part by NIH grants R01HL103866, P20HL113452, R01ES021801, R01ES025786, American Heart Association Scientist Development Grant 12SDG12050473 and a Transatlantic Networks of Excellence Award from Foundation Leducq. GeneBank was supported in part by NIH grants P01HL098055, P01HL076491 and R01HL103931. R.R. has received research funding from Canadian Institutes of Health Research #MOT82810 and Canada Foundation for Innovation # 11966. S.L.H. is also partially supported by a gift from the Leonard Krieger Fund. Mass spectrometry instrumentation used was housed within the Cleveland Clinic Mass Spectrometry Facility with partial support through a Center of Innovation by AB SCIEX.
Author information
Authors and Affiliations
Contributions
H.A. conceived, designed and supervised the study. W.H.W.T. and S.L.H. are principal investigators of the GeneBank Cohort. Z.W. performed metabolomics measurements. A.L.C. performed genotyping. J.H. and C.W. performed bioinformatics and GWAS statistical analyses. W.H.W.T., S.L.H., A.F.R.S, R.R., R.M. and J.E. contributed important intellectual content to the study and manuscript. J.H. and H.A. wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
S.L.H. is named as co-inventor on pending and issued patents held by the Cleveland Clinic relating to cardiovascular diagnostics and therapeutics. S.L.H. reports he has been paid as a consultant or speaker for the following companies: Cleveland Heart Lab, Inc., Esperion and Procter & Gamble. S.L.H. reports he has received research funds from Abbott, Astra Zeneca, Cleveland Heart Lab, Esperion, Procter & Gamble and Takeda. S.L.H. has the right to receive royalty payments for inventions or discoveries related to cardiovascular diagnostics and therapeutics from Cleveland Heart Lab, Inc., Frantz Biomarkers, LLC, Procter & Gamble and Siemens. The remaining authors declare no competing financial interests.
Supplementary information
Supplementary Information
Supplementary Figures 1-3, Supplementary Tables 1-2 and Supplementary Note 1 (PDF 540 kb)
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Hartiala, J., Wilson Tang, W., Wang, Z. et al. Genome-wide association study and targeted metabolomics identifies sex-specific association of CPS1 with coronary artery disease. Nat Commun 7, 10558 (2016). https://doi.org/10.1038/ncomms10558
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/ncomms10558
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.