Genetic analysis in European ancestry individuals identifies 517 loci associated with liver enzymes

Pazoki, Raha; Vujkovic, Marijana; Elliott, Joshua; Evangelou, Evangelos; Gill, Dipender; Ghanbari, Mohsen; van der Most, Peter J.; Pinto, Rui Climaco; Wielscher, Matthias; Farlik, Matthias; Zuber, Verena; de Knegt, Robert J.; Snieder, Harold; Uitterlinden, André G.; Lynch, Julie A.; Jiang, Xiyun; Said, Saredo; Kaplan, David E.; Lee, Kyung Min; Serper, Marina; Carr, Rotonya M.; Tsao, Philip S.; Atkinson, Stephen R.; Dehghan, Abbas; Tzoulaki, Ioanna; Ikram, M. Arfan; Herzig, Karl-Heinz; Järvelin, Marjo-Riitta; Alizadeh, Behrooz Z.; O’Donnell, Christopher J.; Saleheen, Danish; Voight, Benjamin F.; Chang, Kyong-Mi; Thursz, Mark R.; Elliott, Paul

doi:10.1038/s41467-021-22338-2

Download PDF

Article
Open access
Published: 10 May 2021

Genetic analysis in European ancestry individuals identifies 517 loci associated with liver enzymes

Nature Communications volume 12, Article number: 2579 (2021) Cite this article

9004 Accesses
43 Citations
50 Altmetric
Metrics details

Subjects

Abstract

Serum concentration of hepatic enzymes are linked to liver dysfunction, metabolic and cardiovascular diseases. We perform genetic analysis on serum levels of alanine transaminase (ALT), alkaline phosphatase (ALP) and gamma-glutamyl transferase (GGT) using data on 437,438 UK Biobank participants. Replication in 315,572 individuals from European descent from the Million Veteran Program, Rotterdam Study and Lifeline study confirms 517 liver enzyme SNPs. Genetic risk score analysis using the identified SNPs is strongly associated with serum activity of liver enzymes in two independent European descent studies (The Airwave Health Monitoring study and the Northern Finland Birth Cohort 1966). Gene-set enrichment analysis using the identified SNPs highlights involvement in liver development and function, lipid metabolism, insulin resistance, and vascular formation. Mendelian randomization analysis shows association of liver enzyme variants with coronary heart disease and ischemic stroke. Genetic risk score for elevated serum activity of liver enzymes is associated with higher fat percentage of body, trunk, and liver and body mass index. Our study highlights the role of molecular pathways regulated by the liver in metabolic disorders and cardiovascular disease.

Genome-wide association study of serum liver enzymes implicates diverse metabolic and liver pathology

Article Open access 05 February 2021

A multiancestry genome-wide association study of unexplained chronic ALT elevation as a proxy for nonalcoholic fatty liver disease with histological and radiological validation

Article 02 June 2022

The effect of liver enzymes on adiposity: a Mendelian randomization study

Article Open access 14 November 2019

Introduction

Global mortality due to liver disease has been on the rise since 2005¹. Liver disease is now the third cause of premature mortality in the UK that kills 40 people a day in the UK alone overtaking deaths from diabetes and cancer². While 90% of liver diseases can be prevented, 75% of the patients are diagnosed in late stages². The great majority of liver disease in the UK is caused by alcohol consumption, obesity, and viral hepatitis, all of which may result in liver inflammation, cirrhosis, and hepatocellular carcinoma².

Obesity is linked to liver disease through association with non-alcoholic fatty liver disease (NAFLD) or its newly proposed term metabolic (dysfunction)-associated fatty liver disease^3,4,5. Research has shown an increased risk of cardiovascular disease (CVD) in people with NAFLD in both men and women⁶. Elevated serum activity of liver enzymes is an indicator of the underlying liver problems. Specific liver diseases such as NAFLD², alcohol liver disease⁷, viral hepatitis⁸, autoimmune hepatitis⁹, and cholestatic disorders may have genetic underlying factors contributing to the initiation of liver disease or progression of the clinical course of the disease. Genetic factors are known to alter serum concentrations of liver enzymes¹⁰ and several genetic loci have been identified associated with serum activity of liver enzymes.

A previous genome-wide association study (GWAS) of serum activities of liver enzymes¹¹ on ~60,000 individuals of European ancestry identified 44 genetic loci for serum level of alanine transaminase (ALT), alkaline phosphatase (ALP), and γ-glutamyl transferase (GGT).

Here, we sought to identify genetic factors involved in serum levels of ALT, ALP, and GGT using data from 437,438 individuals of European ancestry within the UK Biobank (UKB) and sought replication in 315,572 individuals of European ancestry from the Million Veteran Program (MVP), the Rotterdam Study, and the Lifelines Study. Our aim was to identify etiological genetic and molecular pathways underlying liver function and the link to metabolic disorders and CVDs. We identified and replicated the loci associated with serum activity of liver enzymes and highlighted the pathways involved in metabolic disorders and CVD.

We identified 517 liver enzyme single-nucleotide polymorphisms (SNPs) with evidence of involvement in liver development and function, lipid metabolism, insulin resistance, vascular formation, body mass index (BMI), and body and liver fat percentage. Liver enzyme SNPs show association with coronary heart disease and ischemic stroke.

Results

We performed a two-stage GWAS in European ancestry individuals on serum concentrations of ALT, ALP, and GGT using a discovery sample of 437,438 individuals (Fig. 1) and a replication sample of 315,572 individuals (Supplementary Data 1). At the discovery stage, Q–Q plots (Fig. 2) showed an early deviation from the expected line. To estimate if this is due to population stratification or polygenicity, we performed univariate linkage disequilibrium (LD) score regression (LDSR). The LDSR intercepts (standard error) in UKB were 1.12 (0.02) for ALT, 1.24 (0.02) for ALP, and 1.22 (0.02) for GGT, indicating that inflated test statistics are due to polygenicity of the traits. SNP heritability estimates (standard error) showed that 11% (0. 7%) of ALT, 20.9% (2%) of ALP, and 17% (1%) of GGT is heritable. At the discovery stage, we identified 328 SNPs for GGT, 230 for ALT, and 369 for ALP surpassing our pre-set stringent threshold at P < 1 × 10⁻⁸ (see “Methods”) within the UKB sample (Supplementary Data 2–4). Conditional analysis using the genome-wide complex traits analysis (GCTA) software¹² identified additional independent SNPs for ALT (n = 17), ALP (n = 118), and GGT (n = 43).

**Fig. 1: Overview of study design and findings.**

**Fig. 2: Overview of ALT, ALP, and GGT loci identified within the UKB study (discovery sample).**

We then sought replication of the discovered variants in three independent studies (total N = 315,572). We successfully replicated 517 SNPs including 144 ALT, 265 ALP, and 167 GGT SNPs (Fig. 3 and Supplementary Data 5–7) using our pre-set stringent replication criteria (see “Methods”).

**Fig. 3: Overview of nearest genes mapped to known and novel ALT, ALP, and GGT replicated SNPs and their overlap.**

We examined variance explained by the known and novel liver enzyme SNPs in the Airwave study¹³ cohort of UK police forces. We observed that ALT SNPs explained 10.3% variation in the circulating level of ALT; ALP SNPs explained 6.2% variation in the circulating level of ALP, and GGT SNPs explained 7.0% variation in the circulating level of GGT in the Airwave study.

Cross-trait associations

To investigate evidence for shared genetic components with other traits, we used LDSR, which supports the hypothesis for shared genetic contribution with lipid and glucose metabolism, as well as coronary heart disease (CHD) across all three liver enzymes (Supplementary Fig. 1 and Supplementary Data 8). Liver enzyme SNPs showed positive genetic correlations surpassing our pre-setP value threshold of 1.94 × 10⁻⁴ with several cardiometabolic factors such as waist-to-hip ratio (P_ALT = 1.52 × 10⁻⁵⁵; P_GGT = 1.19 × 10⁻⁴¹), type 2 diabetes (P_ALT = 1.77 × 10⁻³⁴P_GGT = 1.16 × 10⁻¹⁵), CHD (P_GGT = 3.79 × 10⁻²³; P_ALT = 2.17 × 10⁻²¹; P_ALP = 1.52 × 10⁻⁸), and high-density lipoprotein (HDL) cholesterol (P_ALT = 2.31 × 10⁻¹³). Meanwhile, liver enzyme SNPs showed negative genetic correlation with years of education (P_GGT = 1.13 × 10⁻³³; P_ALT = 4.40 × 10⁻²⁹; P_ALP = 6.45 × 10⁻²⁰), parental age of first birth (P_ALT = 2.13 × 10⁻²¹; P_GGT = 3.36 × 10⁻²¹; P_ALP = 3.59 × 10⁻¹⁰), lung function (P_ALT = 2.18 × 10⁻¹⁷; P_GGT = 9.98 × 10⁻¹¹; P_ALT = 5.67 × 10⁻⁰⁷), and intelligence (P_GGT = 1.73 × 10⁻¹⁰; P_ALT = 1.73 × 10⁻¹⁰). Association of replicated liver enzyme SNPs with these genetically correlated traits are presented in Supplementary Data 9.

Assessment of cross-trait associations on DisGeNET^14,15, a database on previously published gene–disease associations, showed that the ALT, ALP, and GGT known and novel SNPs were linked to multiple traits such as CVDs, lipid levels, alcohol consumption, NAFLD, and other cardiometabolic traits (Fig. 4). Metabolomics analysis showed that liver enzyme SNPs were mainly associated with lipid and drug metabolites (Supplementary Data 10).

**Fig. 4: Overview of diseases and traits known to be related to liver enzyme SNPs using DisGeNET.**

Tissue and protein expression assessment

We assessed gene expression of liver enzyme loci in 51 tissues (Supplementary Figs. 3–5). Genes mapped to liver enzyme genes showed medium to high gene expression in liver, adipose tissue, brain, artery, and urogenital system.

We compared the liver expression of genes mapped to our discovery stage SNPs with other tissues and we observed that among genes mapped to the identified SNPs, 26 ALP, 9 ALT, and 20 GGT SNPs were more expressed in the liver compared to all other 51 tissues. This result highlighted SERPINA1 gene with the highest expression in the liver among all genes assessed. We also sought to identify which of the associated SNPs affect gene expression (expression quantitative trait locus (eQTL)) within the Genotype-Tissue Expression (GTeX) database. We found that 21 ALT, 31 ALP, and 30 GGT SNPs affected the expression of genes (cis-eQTL) across tissues. We then specifically looked for eQTL effects in the liver and observed that 5 ALT, 4 ALP, and 8 GGT SNPs (with one SNP overlapping between GGT and ALT) affected expression of genes in the liver (Supplementary Data 11). For example, ALP SNP rs5760119 (proxy SNP for rs5751777) had an eQTL effect on the expression of several genes in the liver including DDT, DDTL, MIF, and GSTT2B. Evaluation of protein expression information on the Human Protein Atlas¹⁶ available from www.proteinatlas.org showed high RNA and protein expression for DDT, DDTL, and MIF in the liver. We observed evidence of expression of a further ten liver enzyme genes (SPTLC3, ACTG1, CD276, CHEK2, EFHD1, MIF, MLIP, NYNRIN, PGAP3, and SHROOM3) in the liver or gallbladder.

Pathway analysis

Using the Ingenuity Pathway Analysis (IPA)¹⁷ software, we found multiple canonical pathways involving gene lists mapped to the three liver enzyme SNPs. For example, the farnesoid X receptor (FXR) pathway that is involved in multiple biological systems including the metabolism of bile acid, lipids, glucose, and the immune system appeared as top canonical pathway across all three liver enzyme SNPs. Upstream regulator analysis identified multiple transcription regulators including nuclear receptors (RXRA, NR1I2, ESR1, NR1H3, and PPARG), and transcription regulators (TP53, HNF4A, FOXA2, and CEBPA).

We also used Data-driven Expression Prioritized Integration for Complex Traits (DEPICT)¹⁸ to find gene sets associated with molecular pathways and tissues enriched with genes mapped to the liver enzyme SNPs. We identified enrichment across multiple organs, tissues, and cells (Figs. 5 and 6). We observed enrichment for ALT SNPs in the liver, adrenal glands, and adipocytes within a range of adipose tissues. ALP SNPs were enriched in hepatocytes in the liver and GGT SNPs were enriched mainly in hepatocytes, embryoid bodies, and epithelial cells across digestive, mucus membranes, and urogenital systems. Evaluation of enriched mammalian phenotypes in relation to liver enzyme SNPs highlighted the importance of a range of phenotypes including abnormal liver physiology and morphology, liver fibrosis, and abnormalities in lipid, glucose, bile acid, and iron metabolisms (Supplementary Data 12). Evaluation of Gene Ontology data in relation to all three liver enzyme SNPs showed the importance of retinoic acid receptor-binding pathway (P = 3.14 × 10⁻⁷), regulation of lipid biosynthetic process (P = 7.48 × 10⁻⁷), basolateral plasma membrane (P = 5.40 × 10⁻⁹), and multiple other pathways involved mainly in liver development and lipid homeostasis. Within KEGG and REACTOME pathways, we observed that enrichment of REACTOME PPARA activates the gene expression (P = 1.93 × 10⁻⁹) pathway, and regulation of lipid metabolism by PPARA gene expression activation (P = 2.86 × 10⁻⁹) were consistently enriched pathways across the three liver enzymes.

**Fig. 5: Overview of tissue enrichment for GGT SNPs using DEPICT.**

**Fig. 6: Overview of tissue and physiological systems enrichment using DEPICT.**

Mendelian randomization (MR)

As our cross-trait assessment showed a link between liver enzyme loci with adiposity, lipid, and glucose metabolism that are the main risk factors for major cardiovascular events, we performed MR analysis to test the causality of the observed associations. To this end, we used the meta-analysis of discovery and replication samples to select the list of variants proxying liver enzyme levels, with genetic association estimates for CHD and stroke risk taken from previously published GWAS. We observed associations of genetically proxied serum levels of all three liver enzymes on CHD risk, although with heterogeneity in estimates obtained across methods that make different assumptions regarding the inclusion of pleiotropic variants. We also observed an MR association of ALT with ischemic stroke (Supplementary Data 13). MR using the inverse-variance-weighted (IVW) method showed that for 10-fold increase in genetically proxied serum level of ALT, the odds ratio (OR) for CHD was 5.84 (95% confidence interval (CI) = 2.52–13.52, P = 3.73 × 10⁻⁵). This was 2.15 (95% CI = 1.07–4.31, P = 0.03) per 10-fold increase in genetically proxied level of ALP and it was 1.46 (95% CI = 1.16–1.83, P = 0.001) per 10-fold increase in genetically proxied level of GGT. In addition, for 10-fold increase in genetically proxied ALT, the OR for ischemic stroke was 2.33 (95% CI = 1.30–4.19, P = 0.005).

Genetic risk score (GRS) analysis

To investigate cumulative effect of liver enzyme SNPs on various complex traits, we performed GRS analysis in the Airwave sample. The GRS was weighted according to the meta-analysis effect estimates for serum level of liver enzyme SNPs (Supplementary Tables 5–7). Here, each standard deviation of increase in ALT GRS was associated with 3.09 U/L in ALT (95% CI = 2.02–4.17; P = 3.5 × 10⁻⁸). Each standard deviation increase in ALP GRS was associated with 2.07 U/L in ALT (95% CI = 1.49–2.66; P = 3.05 × 10⁻¹¹), whereas each standard deviation increase in GGT GRS was associated with 1.43 U/L increase in GGT (95% CI = 1.35–1.52; P = 2.58 × 10⁻²¹⁰). We similarly observed association between GRSs and liver enzymes in NFBC1966 cohort for serum levels of ALT (OR = 1.72; 95% CI = 1.36–2.07; P = 7.55 × 10⁻²¹), ALP (OR = 1.88; 95% CI = 1.67–2.09; P = 1.32 × 10⁻⁶⁵), and GGT (OR = 1.96; 95% CI = 1.72–2.19; P = 2.98 × 10⁻⁵⁶).

We investigated the association of GRS with liver and metabolic traits (see “Methods”) within UKB (Supplementary Data 14). GRS was associated with the metabolic syndrome (β = 0.001; 95% CI = 0.001–0.01; P = 2.47 × 10⁻³⁸), and body fat distribution indices such as body fat percent (β = 0.07; 95% CI = 0.05–0.09; P = 5.97 × 10⁻¹³), and liver fat percent (β = 0.28; 95% CI = 0.13–0.42; P = 1.28 × 10⁻⁴). Our liver enzyme GRS showed a marginal inverse association with basal metabolic rate (β = −2.76; 95% CI = −5.3 to −0.23; P = 0.03) and left ventricular diastolic volume (β = −1.77; 95% CI = −3.51 to −0.03; P = 0.04). We additionally observed that liver enzyme GRS was associated with a small increase in the risk of incident CVD (OR = 1.03; 95% CI = 1.01–1.05; P = 6.47 × 10⁻⁴). To investigate the mediatory/confounding effect of adiposity, lipid, and glucose metabolism on the association of GRS and CVD, we corrected our CVD analysis for the effect of body fat percent, BMI, and the metabolic syndrome, as well as biomarkers of lipid and glucose metabolism. Of these factors, we observed that adjustment for metabolic syndrome or HDL cholesterol gave a partial reduction in risk of liver enzyme GRS on CVD (Supplementary Data 15) more than other factors.

Discussion

We performed a GWAS for serum activity of liver enzymes using a sample size of 437,438 participants from the UKB study and replicated the findings among 315,572 individuals from three independent cohorts of European ancestry, in a combined sample size of 753,010. Using this design, we identified 517 SNPs associated with the serum level of three liver enzymes. These SNPs explained 6–10% of the variation in the liver enzyme levels in an independent study. Our analysis indicates an SNP-based heritability of 11% for ALT, 17% for GGT, and 21% for ALP. These estimates are much higher (up to 10%) than previously reported SNP-based heritability estimates for serum activity of liver enzymes¹⁹.

Genetic correlation analysis supports that genetic determinants of liver enzyme serum levels are linked to lipid and glucose metabolism, adiposity, and CVDs. Metabolomics analysis highlighted the association of lipids and lipoproteins with individual liver enzyme loci. We additionally showed that liver enzyme SNPs collectively are associated with increased lipid levels, increased body fat distribution indices, increased insulin-like growth factor-1 and hemoglobin A1C, and increased NAFLD. In GRS association with CVD, we showed that adjustment for metabolic syndrome or HDL gave 10–15% reduction in the effect size of liver enzyme GRS on CVD, implying that some of this CVD risk may be attributable to the metabolic syndrome/ lipid metabolism.

The top canonical pathway analysis by IPA highlighted the role of FXR, a nuclear receptor involved in the regulation of bile acid synthesis and transport²⁰. The FXR pathway is known to protect against liver inflammation associated with non-alcoholic steatohepatitis²¹ and is involved in lipid transport and glucose metabolism. The biological links within the FXR pathway may provide a biological support for the observed link between liver enzyme loci, lipid dysregulation, diabetes, and obesity.

Furthermore, our gene-set enrichment analysis using DEPICT¹⁸ once again highlighted the regulation of lipid metabolism processes and abnormal liver physiology and morphology. These in silico analyses from multiple sources suggest interconnectivity of lipid and glucose metabolism with processes involved in liver physiology and morphology.

Among the genes identified, we found that LIPC (hepatic type of lipase C) is associated with liver enzyme levels. This gene is highly expressed in the liver and is involved in receptor-mediated lipoprotein uptake, affecting lipid levels²². Polymorphisms in LIPC have been associated with hypertension, type 2 diabetes, and metabolic syndrome²³. Since familial lipid disorders such as familial combined hyperlipidemia²⁴ that commence in infancy are known to cause NAFLD, changes in lipid levels due to polymorphisms in genes such as LIPC might occur prior to changes in serum activity of liver enzymes, perhaps due to the accumulation of fat in the liver. This also applies to another liver enzyme locus, APOE, that is a well-studied lipid-modulating locus linked to LIPC and hepatic injury.

We observed a genetic correlation between femoral neck bone mineral density and ALP in our discovery stage within the UKB. This was not the case for ALT or GGT. ALP has multiple isoforms with the bone and liver being the most abundant circulating isoforms²⁵. In our replication strategy, for each locus to be considered replicated, we implemented concordance of effect with another liver enzyme. This strategy filtered out the signals that were probably due to bone diseases rather than the liver and eventually none of the replicated ALP SNPs reported here show the previous link to bone traits.

Our study additionally confirms the association of various loci that have been shown to be involved in liver disorders. A recent GWAS on non-alcoholic fatty liver and steatohepatitis by Anstee et al.²⁶ highlighted the role of PNPLA3, TM6SF2, GCKR, PYGO1, HSD17B13, and LEPR in these liver disorders. In addition, a recent GWAS on NAFLD by Namjou et al.²⁷ highlighted the role of TRIB1, PNPLA3, TM6SF2, COL13A1, and GCKR in the pathogenesis of NAFLD. Our study confirms that SNPs in PNPLA3, TM6SF2, GCKR, and LEPR are associated with the serum activity of liver enzymes.

Some of the SNPs we replicated play a role in rare familial liver disorders. For instance, we identified and replicated SNPs in SLC22A1, LIPC, ABCC2, CYP7A1, NR1H4, ADH4, MTTP, and ATP8B1 regions that have been previously linked to familial intrahepatic cholestasis²⁸. The disease onset is in childhood and manifests with cholestasis in the liver, leading to liver failure. The pathologic underlying factors are defects in bile acid secretion and metabolism.

One of our lead SNPs in SERPINA1 gene rs28929474 has previously been associated with liver traits, and mutations in SERPINA1 is known to cause liver cirrhosis²⁹. Our study confirms a strong association between this locus across all three liver enzymes.

In summary, here we increase the number of SNPs identified so far for modulating circulating liver enzymes to a total of 561 SNPs. Our tissue expression lookup supported the role of genes with strong evidence of expression in the liver or gallbladder. We show evidence of involvement of liver enzyme SNPs in metabolic syndrome and in coronary artery disease. Our study shows that up to 10% of the variance in serum activity of liver enzymes is genetically determined and suggests the possible role of SNPs involved in liver fat percent in variation in serum activity of liver enzymes and a shared genetic contribution with CVD. Our study implies a role for genetic loci for liver enzyme levels in creating multiple abnormalities in lipid, glucose, and bile acid metabolism. These disturbances seem to be linked to the accumulation of fat in the liver and the body, as well as abnormalities in lipid levels, glucose control, and liver enzyme levels. Adiposity, hyperlipidemia, and abnormal glucose metabolisms are known to be linked to accelerated atherosclerosis and CVD risk. Dedicated investigations are needed on the biological effect of genes within the FXR pathway, their physical interaction, and their link to liver abnormalities and cardiometabolic changes.

Methods

Study design and participants

We used data from the UKB^30,31,32 and included 437,267 individuals aged 40–69 years in the discovery stage. Study participants were ascertained through United Kingdom National Health Service registers across 22 centers in Great Britain between 2006 and 2010³². We included individuals of European ancestry following quality measures and exclusions (sex discordance, high missingness, and/or heterozygosity). Allocating individuals to ethnicity groups was based on self-reported ethnicity matched with principal component analysis ancestry clustering using the k-means clustering method. We excluded participants who had withdrawn consent (n = 39), as well as those who were pregnant or unsure of their pregnancy status at recruitment (n = 372). Non-European ancestry individuals were excluded from the main analysis. We limited our analysis to individuals with complete values for ALT, ALP, and GGT concentration. After exclusions, there were 437,267 individuals for ALT analysis, 437,438 for ALP, and 437,194 for GGT (Fig. 1) analyses. Values of ALT, ALP, and GGT were log 10 transformed to approximate normal distribution. To replicate our SNPs, we used data for 315,572 individuals from three independent studies, namely (i) the Rotterdam Study (NL, N = 6943)³³; (ii) the Lifelines study (NL, N = 13,386)³⁴; and (iii) the MVP (USA, N = 294,043)³⁵ (see Supplementary information). For additional replication, we used GRS and sought the effect estimate and explained variance of the GRS on serum level of ALT, ALP, and GGT in independent samples from the Airwave health monitoring study¹³, a cohort of UK police forces, and in the Northern Finland Birth Cohort 1966^36,37 (NFBC1966; see Supplementary information).

For subsequent analyses such as the association of GRS with various trait and association testing with NAFLD within the UKB, we excluded 918 individuals who had (based on Hospital Episode Statistics [HES] at the time of recruitment) documented International Classification of Diseases Tenth Revision (ICD10) diagnosis code for osteopathy (M45-49 and M80-90), vitamin D deficiency (E55), any liver disorders (K70-K77) including NAFLD (ICD10 code K760), alcohol liver disorder (K70), primary biliary cholangitis (PBC; K74.3), primary sclerosing cholangitis (PSC and K83), autoimmune hepatitis (AIH; K75.4), diseases of the gallbladder (K80-K87), and parathyroid diseases (E214, E215, D351, C750, and D442).

Ethical consideration

The North West Multi-Center Research Ethics Committee has approved the UKB study. Any UKB participants who withdrew consent were removed from the current analysis. Local ethical approval was obtained for all independent replication cohorts.

The MVP received ethical and study protocol approval from the Veteran Affair Central Institutional Review Board and site-specific Research and Development Committees in accordance with the principles outlined in the Declaration of Helsinki. Informed consent was obtained from all participants of the MVP study.

Lifelines are conducted according to the principles of the Declaration of Helsinki and is approved by the medical ethics committee of the University Medical Centre Groningen, The Netherlands. Written informed consent was obtained from all participants.

The Rotterdam Study has been approved by the medical ethics committee according to the Population Screening Act: Rotterdam Study, executed by the Ministry of Health, Welfare, and Sports of the Netherlands. All participants from the Rotterdam Study in the present analysis provided written informed consent to participate and to obtain information from their treating physicians.

The Airwave Health Monitoring Study is approved by the National Health Service Multi-site Research Ethics Committee (MREC/13/NW/0588).

The NFBC1966 study was approved by the Ethics Committee of the Northern Ostrobothnia Hospital District, and the Ethics Committee of the University of Oulu. All participants gave written informed consent.

Liver and metabolic traits

The serum concentration of ALT, ALP, and GGT in stored blood samples was measured using the enzymatic rate analytical method on a Beckman Coulter AU5800. The manufacturer’s analytic range for ALT was 3–500 U/L, for ALP, 5–1500 U/L, and it was 5–1200 U/L for GGT. Details of quality control and sample preparation for the measurements of serum activity of liver enzymes have been published by the UKB³⁸.

We investigated the effect of genetic determinants of liver enzyme levels on BMI, basal metabolic rate (explain methods), electrocardiographic traits, left ventricular ejection fraction, cardiac index, bioimpedance measures using the Tanita BC418MA body composition analyzer including basal metabolic rate, body fat mass, body fat percentage (n = 415,692), fat-free mass, predicted muscle mass, and impedance for the trunk (n = 415,667), as well as coronary artery disease. Liver fat distribution was available in a subset of the UK Biobank, which had undergone imaging analysis of the liver and had genetic data available (n = 4085).

Cardiovascular events

UK Biobank data are linked to electronic health data including HES and Office for National Statistics cause of death data. HES data provide information on hospital admissions for diagnoses and procedures. Using HES we defined CVD as coronary artery disease, stroke, or myocardial infarction classified using our published algorithm³⁹ comprising codes from the ICD 9th (428, 410, 411, 412, 413, 414, 4297, 431, 430, 434, 436, 428, 425) and 10^th (I20, I21, I22, I23, I24, I25, I61, I60, I63, I64, I61, I60, I50, and I42) Revision codes. Prevalent cases were removed from the analyses.

We additionally investigated electrocardiographic traits, left ventricular ejection fraction, and cardiac index in relation to genes identified in this study.

Genotyping and Imputation

Genotyping and imputation in the UKB have been described in detail elsewhere^40,41. Briefly, two custom Affymetrix UKBileve and UKB Axiom arrays⁴² (designed to optimize imputation performance) were used for genotyping of DNA samples obtained from the UKB study participants. The UKB performed imputation centrally using an algorithm implemented in the IMPUTE2 program. Only markers that were present in both UKBileve and UKB Axiom arrays were used for imputation. To maximize the use of haplotypes with British and European ancestry, a special reference panel comprising a merged sample of UK10K sequencing and 1000 Genomes imputation reference panels was used for genotype imputation by the UKB. Genetic principal components to account for population stratification were computed centrally by UKB.

Genome-wide association analysis in UKB

We restricted the main association analysis to SNPs from the third release of UKB genetic data (GRCh37). For GWAS on serum activity of liver enzymes, we performed linear mixed models (LMM) as implemented in the BOLT-LMM (v2.3) software⁴³. The BOLT method accounts for the population structure and cryptic relatedness simultaneously. We assumed an additive genetic model on log 10-transformed ALT, ALP, and GGT values, adjusted for age, sex, and 40 genetic principal components for European ancestry. We applied several filters on a random subset of individuals and common SNPs (minor allele frequency [MAF] > 5%) to estimate parameters of LMM with Hardy–Weinberg equilibrium P > 1 × 10⁻⁶ and missingness <0.015 for the initial modeling step.

For the BOLT-LMM analysis to estimate the effect of SNPs on serum level liver enzymes, we set the discovery stage significance threshold of P < 1 × 10⁻⁸. This stringent threshold (compared with the usual GWAS threshold of P < 5 × 10⁻⁸) was used to robustly define lead SNPs to be put forward for replication and functional assessment. Multiallelic SNPs were removed from the database. We removed all SNPs in the HLA region (chr6:25-34 MB) and removed SNPs with MAF < 0.001. A total of 13,995,440 SNPs passed our quality control criteria and were included in ALP, ALT, and GGT GWAS.

Genetic data of the UKB include many SNPs in high LD that might inflate GWAS test statistics. To distinguish confounding due to population stratification from polygenicity in such data, we applied a univariate LDSR method⁴⁴. We calculated LDSR intercept for ALP, ALT, and GGT GWAS, which was then used as a genomic control factor to account for cryptic relatedness.

Locus definition

For the selection of lead SNPs at the discovery stage, all associations surpassing the stringent threshold of P < 1 × 10⁻⁸ were ranked in order of statistical significance with the strongest SNP associations located at the top of the list. We then removed all SNPs in the region of ±500 kb spanning the strongest ranking SNPs (lead SNP) that showed larger association P values than the lead SNP. We additionally LD pruned the list of final lead SNPs considering SNPs with LD threshold of r² < 0.1 as independent signals.

To detect any secondary signals, we used UKB GWAS summary-level data for ALT, ALP, and GGT and performed approximate conditional analysis using the GCTA software¹². We used locus-specific conditional analysis for ALT, ALP, and GGT conditioned on the lead SNPs within each locus. Our criteria for the selection of secondary signals included MAF ≥ 0.001 and P < 1 × 10⁻⁸ both in the BOLT-LMM GWAS and in joint conditional analysis within GCTA. The individual-level data for the European ancestry participants of UKB were used for LD calculation in GCTA analysis. We accepted and added the signals passing these selection criteria to the list of lead SNPs.

For further exploratory analyses, we searched proxy SNPs (r² > 0.8) within 1 Mb region spanning the final LD pruned lead SNPs. Our criteria to choose proxy SNPs included location within 1 Mb window around the sentinel SNP and r² ≥ 0.8 with the sentinel SNP. For proxy SNPs to be eligible for further analyses, we used MAF ≥ 0.001 and an imputation score >0.3. Both LD pruning and proxy search were performed using the PLINK2 software^45,46.

Replication and concordance

We sought replication for all independent lead SNPs from the BOLT-LMM and GCTA analysis in independent samples. We used data from multiple cohorts of (i) the Rotterdam Study (n = 6943)³³, (ii) the Lifelines study (n = 13,386)³⁴, (iii) and the Million Veterans Program (n = 294,043)³⁵, and performed a meta-analysis across all replication cohorts. Later, we carried out a meta-analysis of discovery and replication results using inverse-variance fixed-effects models in the METAL software⁴⁷. Our replication criteria included (i) stringent (P < 5 × 10⁻⁹) association P value in the meta-analysis of discovery and replication, to minimize false-positive signals; (ii) P < 0.01 in the meta-analysis of replication cohorts together with the concordant direction of effects in the meta-analysis of replication and discovery; (iii) concordant direction of effects on serum level of at least two of the three liver enzymes. In addition, we cross-referenced the ALP-replicated SNPs against reports of bone traits reported in GWAS Catalog⁴⁸ to exclude any potential bone signals. We listed all unique replicated SNPs across all three liver enzymes, and we considered every two SNPs in 500 kb distance of one another as a single locus.

Cross-trait associations

In addition to the final replicated SNPs, we included their proxy SNPs (r² ≥ 0.8) for functional assessment and cross-trait lookups.

To investigate shared heritable contribution between serum activity of liver enzymes and other phenotypes, we used the Broad institute LD hub⁴⁹ tool on 257 LD hub traits (excluding Neal’s lab GWAS analyses http://www.nealelab.is/uk-biobank/ that are based on UKB) to agnostically assess the genetic correlation between any two given traits using LDSR method⁴⁴ implemented in online LD hub tool. The LDSR method developed by Bulik-Sullivan lab uses summary statistics from previously published GWAs. The method estimates genome-wide genetic correlation calculated from the additive genetic variance and covariance between any pair of traits⁴⁴. We used three GWAS summary statistics data from our discovery stage for ALP, ALT, and GGT traits against 257 LD hub summary statistics creating 771 combinations of paired traits. LDSR method uses summary statistics from GWAs of two different traits to identify the genetic correlation between the two traits using SNP data and is described in detail by Bulik-Sullivan et al.⁴⁴. To claim significance, we used a P value threshold of 1.94 × 10⁻⁴ corresponding to a nominal P value (0.05) with Bonferroni correction for 257 LD hub traits.

To assess and identify disease traits that are linked to ALT, ALP, and GGT SNPs, we sought evidence of previous associations using DisGeNET^14,15. As input, we used ALT, ALP, and GGT lead SNPs and their proxy SNPs (r² > 0.8) within 1 Mb region.

To investigate the metabolomic signatures of the identified SNPs, we used individual-level metabolomics data on 1941 serum samples from the Metabolon platform in the Airwave study¹³, a cohort of UK police forces, and performed association tests using linear regression analyses, adjusted for age and sex and principal components of genetically inferred ancestry.

Tissue and Protein expression analysis

We used the online portal of the GTEx database^50,51,52 to obtain the multi-tissue eQTL summary statistics (V7) on gene expression levels by Transcripts per Million using expression data from 48 tissues. To account for multiple testing, we used Benjamini–Hochberg corrected P values to denote statistical significance.

We additionally retrieved median gene expression levels by Transcripts per Million for genes mapped to ALT, ALP, and GGT SNPs from the RNA seq GTEX (V7) database for 51 tissues. For each tissue, we calculated mean and standard deviations of gene expression values. We then standardized gene expression levels across gene transcript-tissue combinations from GTEx to facilitate comparison across tissues. We finally used proteomics (https://www.proteomicsdb.org), tissue expression databases (https://tissues.jensenlab.org), and human protein atlas¹⁶ (www.proteinatlas.org) to check for protein expression of the genes in eQTL with liver enzyme SNPs.

Pathway analysis and gene-set enrichment analysis

We annotated replicated SNPs to the nearest gene within a distance of ±500 kb using the University of California Santa Cruz (UCSC) genome browser. We performed gene-based variant effect analysis using the IPA¹⁷ software (IPA®, Qiagen Redwood City) on genes mapped to ALT, ALP, and GGT SNPs to evaluate over-representation of these genes in canonical pathways and in association with previously reported diseases and biological functions.

The P value of overlap implemented in IPA states the statistical significance of the enrichment of a biological attribute (e.g., canonical pathway, upstream analysis, etc.) in the user’s dataset. It compares the proportion of input molecules (e.g., genes) that are associated with a particular biological attribute to the proportion of molecules that we expect to see if the dataset were made up of randomly selected molecules. It is calculated using the right-tailed Fisher’s exact test. A P value < 0.05 or (−log P value = 1.3) is considered significant by IPA. The smaller the P value, the less likely that the association is random and the more statistically significant the association⁵³.

For our replicated SNPs for each of the three liver enzymes, we used DEPICT¹⁸ at enrichment false discovery rate <0.05 to highlight gene sets associated with specific molecular pathways and mammalian phenotypes.

GRS analysis

To estimate the cumulative contribution of genetic variants to liver enzyme concentrations, we created a GRS for the novel and known loci, weighted according to the effect estimates from the meta-analysis of discovery and replication (n = 753,010). This was separately done across all three liver enzyme SNPs and then an average value of the three GRSs was calculated. This averaged GRS was then standardized so that each unit in the GRS represents 1 SD. We tested the GRS against liver enzyme levels in the independent Airwave study (n_ALP = 331; n_ALT = 330; n_GGT = 13,420)¹³ and estimated the percentage of variance in serum activity of liver enzymes explained by the GRS. We additionally replicated the GRS results in the NFBC1966 cohort (n_ALP = 3619; n_ALT = 3620; n_GGT = 3617).

To test the involvement of replicated liver enzyme SNPs in complex conditions and diseases relevant to the liver, we created a GRS within the UKB weighted according to effect estimates from the meta-analysis of independent replication cohorts (n = 315,572). We investigated the association of this GRS with liver and metabolic traits (described above) within UKB.

Mendelian randomization

To further investigate the effect of circulating levels of the liver enzymes on the risk of cardiovascular outcomes, a two-sample MR approach was employed⁵⁴. We considered the outcomes of CHD, ischemic stroke, and intracerebral hemorrhage (ICH). Genetic association estimates on outcomes were obtained from the CARDIoGRAMplusC4D Consortium for CHD (60,801 cases and 123,504 controls, multiethnic)⁵⁵, the MEGASTROKE Consortium for ischemic stroke (60,341 cases and 454,450 controls, multiethnic)⁵⁶, and the International Stroke Genetic Consortium for ICH (1545 cases and 1481 controls, European ancestry)⁵⁷. For the main analysis, the random-effects IVW meta-analysis MR approach was used, with the simple and weighted median, and MR-Egger approaches also employed as sensitivity analyses as these are more robust to the inclusion of potentially pleiotropic variants⁵⁸.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Summary statistics will be made available through the NHGRI-EBI GWAS Catalog [https://www.ebi.ac.uk/gwas/downloads/summary-statistics] under accession number GCP000102. The direct links to download the summary statistics from GWAS Catalog are as follow: ftp://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/GCST90013405, ftp://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/GCST90013406, and ftp://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/GCST90013407. Genetic association estimates for outcomes considered in Mendelian randomization were obtained from publicly available sources. For coronary heart disease this was the CARDIoGRAMplusC4D Consortium, for ischemic stroke this was the MEGASTROKE Consortium, and for intracerebral hemorrhage this was the International Stroke Genetic Consortium (https://cd.hugeamp.org/downloads.html].

References

Harris, R., Harman, D. J., Card, T. R., Aithal, G. P. & Guha, I. N. Prevalence of clinically significant liver disease within the general population, as defined by non-invasive markers of liver fibrosis: a systematic review. Lancet Gastroenterol. Hepatol. 2, 288–297 (2017).
Article PubMed Google Scholar
Namjou, B. et al. GWAS and enrichment analyses of non-alcoholic fatty liver disease identify new trait-associated genes and pathways across eMERGE Network. BMC Med. 17, 135–135 (2019).
Article PubMed PubMed Central CAS Google Scholar
Fouad, Y. et al. What’s in a name? Renaming ‘NAFLD’ to ‘MAFLD’. Liver Int. 40, 1254–1261 (2020).
Article PubMed Google Scholar
Eslam, M., Sanyal, A. J. & George, J. MAFLD: a consensus-driven proposed nomenclature for metabolic associated fatty liver disease. Gastroenterology 158, 1999–2014.e1 (2020).
Article CAS PubMed Google Scholar
Tilg, H. & Effenberger, M. From NAFLD to MAFLD: when pathophysiology succeeds. Nat. Rev. Gastroenterol. Hepatol. 17, 387–388 (2020).
Motamed, N. et al. Non-alcoholic fatty liver disease (NAFLD) and 10-year risk of cardiovascular diseases. Clin. Res. Hepatol. Gastroenterol. 41, 31–38 (2017).
Article PubMed Google Scholar
Buch, S. et al. A genome-wide association study confirms PNPLA3 and identifies TM6SF2 and MBOAT7 as risk loci for alcohol-related cirrhosis. Nat. Genet. 47, 1443–8 (2015).
Article CAS PubMed Google Scholar
Wei, L. et al. Genetic variation in FCER1A predicts peginterferon alfa-2a-induced hepatitis B surface antigen clearance in East Asian patients with chronic hepatitis B. J. Viral Hepat. 26, 1040–1049 (2019).
CAS PubMed Google Scholar
de Boer, Y. S. et al. Genome-wide association study identifies variants associated with autoimmune hepatitis type 1. Gastroenterology 147, 443–52.e5 (2014).
Article CAS PubMed Google Scholar
Chambers, J. C. et al. Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma. Nat. Genet. 43, 1131 (2011).
Article CAS PubMed PubMed Central Google Scholar
Chambers, J. C. et al. Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma. Nat. Genet. 43, 1131–8 (2011).
Article CAS PubMed PubMed Central Google Scholar
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Article CAS PubMed PubMed Central Google Scholar
Elliott, P. et al. The Airwave Health Monitoring Study of police officers and staff in Great Britain: rationale, design and methods. Environ. Res. 134, 280–5 (2014).
Article CAS PubMed Google Scholar
Pinero, J. et al. DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes. Database 2015, bav028 (2015).
Article PubMed PubMed Central CAS Google Scholar
Pinero, J. et al. DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants. Nucleic Acids Res. 45, D833–D839 (2017).
Article CAS PubMed Google Scholar
Uhlen, M. et al. A pathology atlas of the human cancer transcriptome. Science 357, eaan2507 (2017).
Article PubMed CAS Google Scholar
Jiménez-Marín, Á., Collado-Romero, M., Ramirez-Boo, M., Arce, C. & Garrido, J. J. Biological pathway analysis by ArrayUnlock and Ingenuity Pathway Analysis. BMC Proc. 3, S6–S6 (2009).
Article PubMed PubMed Central Google Scholar
Pers, T. H. et al. Biological interpretation of genome-wide association studies using predicted gene functions. Nat. Commun. 6, 5890 (2015).
Article CAS PubMed Google Scholar
van Beek, J. H. D. A. et al. Heritability of liver enzyme levels estimated from genome-wide SNP data. Eur. J. Hum. Genet. 23, 1223–1228 (2015).
Article PubMed CAS Google Scholar
Zhu, Y., Liu, H., Zhang, M. & Guo, G. L. Fatty liver diseases, bile acids, and FXR. Acta Pharm. Sin. B 6, 409–412 (2016).
Article PubMed PubMed Central Google Scholar
Armstrong, L. E. & Guo, G. L. Role of FXR in liver inflammation during nonalcoholic steatohepatitis. Curr. Pharmacol. Rep. 3, 92–100 (2017).
Article CAS PubMed PubMed Central Google Scholar
Willer, C. J. et al. Discovery and refinement of loci associated with lipid levels. Nat. Genet. 45, 1274–1283 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kilpelainen, T. O. et al. Interaction of single nucleotide polymorphisms in ADRB2, ADRB3, TNF, IL6, IGF1R, LIPC, LEPR, and GHRL with physical activity on the risk of type 2 diabetes mellitus and changes in characteristics of the metabolic syndrome: the Finnish Diabetes Prevention Study. Metabolism 57, 428–36 (2008).
Article CAS PubMed Google Scholar
Kneeman, J. M., Misdraji, J. & Corey, K. E. Secondary causes of nonalcoholic fatty liver disease. Ther. Adv. Gastroenterol. 5, 199–207 (2012).
Article Google Scholar
Haarhaus, M., Brandenburg, V., Kalantar-Zadeh, K., Stenvinkel, P. & Magnusson, P. Alkaline phosphatase: a novel treatment target for cardiovascular disease in CKD. Nat. Rev. Nephrol. 13, 429–442 (2017).
Article CAS PubMed Google Scholar
Anstee, Q. M. et al. Genome-wide association study of non-alcoholic fatty liver and steatohepatitis in a histologically characterised cohort☆. J. Hepatol. 73, 505–515 (2020).
Article CAS PubMed Google Scholar
Namjou, B. et al. GWAS and enrichment analyses of non-alcoholic fatty liver disease identify new trait-associated genes and pathways across eMERGE Network. BMC Med. 17, 135 (2019).
Article PubMed PubMed Central CAS Google Scholar
Srivastava, A. Progressive familial intrahepatic cholestasis. J. Clin. Exp. Hepatol. 4, 25–36 (2014).
Article PubMed Google Scholar
Strnad, P. et al. Heterozygous carriage of the alpha1-antitrypsin Pi*Z variant increases the risk to develop liver cirrhosis. Gut 68, 1099–1107 (2019).
Article CAS PubMed Google Scholar
Elliott, P. & Peakman, T. C. The U.K. Biobank sample handling and storage protocol for the collection, processing and archiving of human blood and urine. Int. J. Epidemiol. 37, 234–44 (2008).
Article PubMed Google Scholar
UK Biobank Coordinating Centre. UK Biobank: protocol for a large-scale prospective epidemiological resource. Protocol No: UKBB-PROT-09-06 (Main Phase). (21 March 2007 (AMENDMENT ONE FINAL). http://www.ukbiobank.ac.uk/wp-content/uploads/2011/11/UK-Biobank-Protocol.pdf (2007).
Sudlow, C. et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
Article PubMed PubMed Central Google Scholar
Ikram, M. A. et al. Objectives, design and main findings until 2020 from the Rotterdam Study. Eur. J. Epidemiol. 35, 483–517 (2020).
Article PubMed PubMed Central Google Scholar
Scholtens, S. et al. Cohort Profile: LifeLines, a three-generation cohort study and biobank. Int. J. Epidemiol. 44, 1172–80 (2015).
Article PubMed Google Scholar
Gaziano, J. M. et al. Million Veteran Program: a mega-biobank to study genetic influences on health and disease. J. Clin. Epidemiol. 70, 214–23 (2016).
Article PubMed Google Scholar
Rantakallio, P. The longitudinal study of the northern Finland birth cohort of 1966. Paediatr. Perinat. Epidemiol. 2, 59–88 (1988).
Article CAS PubMed Google Scholar
Sovio, U. et al. Genetic determinants of height growth assessed longitudinally from infancy to adulthood in the northern Finland birth cohort 1966. PLoS Genet. 5, e1000409–e1000409 (2009).
Article PubMed PubMed Central CAS Google Scholar
Fry, D. A., R., Moffat, S., Gordon, M. & Singh, P. U.K. Biobank Biomarker Project; Companion Document to Accompany Serum BiomarkerData (11 March 2019). http://biobank.ctsu.ox.ac.uk/crystal/docs/serum_biochemistry.pdf (2019).
Pazoki, R. et al. Genetic predisposition to high blood pressure and lifestyle factors: associations with midlife blood pressure levels and cardiovascular events. Circulation 137, 653–661 (2018).
Article PubMed Google Scholar
UK Biobank. Genotype imputation and genetic association studies of UK Biobank: Interim Data Release (May 2015). http://www.ukbiobank.ac.uk/wp-content/uploads/2014/04/imputation_documentation_May2015.pdf (2015).
Clare Bycroft, C. F. et al. Genome-wide genetic data on ~500,000 UK Biobank participants. Preprint at bioRxivhttps://doi.org/10.1101/166298 (2017).
Bycroft, C. et al. The U.K. Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Loh, P. R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat. Genet. 47, 284–90 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bulik-Sullivan, B. K. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–5 (2015).
Article CAS PubMed PubMed Central Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–75 (2007).
Article CAS PubMed PubMed Central Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Article PubMed PubMed Central CAS Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Article CAS PubMed PubMed Central Google Scholar
MacArthur, J. et al. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res. 45, D896–D901 (2017).
Article CAS PubMed Google Scholar
Zheng, J. et al. LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics 33, 272–279 (2017).
Article CAS PubMed Google Scholar
Carithers, L. J. & Moore, H. M. The Genotype-Tissue Expression (GTEx) Project. Biopreserv. Biobank. 13, 307–8 (2015).
Article PubMed PubMed Central Google Scholar
Human Genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–60 (2015).
Article CAS Google Scholar
GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
QIAGEN Silicon Valley (Ingenuity Systems). Calculating and interpreting the p-values for functions, pathways and lists in IPA. https://www.ingenuity.com/wp-content/themes/ingenuity-qiagen/pdf/ipa/functions-pathways-pval-whitepaper.pdf (2016).
Davey Smith, G. & Ebrahim, S. Mendelian randomization: can genetic epidemiology contribute to understanding environmental determinants of disease? Int. J. Epidemiol. 32, 1–22 (2003).
Article Google Scholar
Nikpay, M. et al. A comprehensive 1,000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat. Genet. 47, 1121–1130 (2015).
Article CAS PubMed PubMed Central Google Scholar
Malik, R. et al. Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat. Genet. 50, 524–537 (2018).
Article CAS PubMed PubMed Central Google Scholar
Woo, D. et al. Meta-analysis of genome-wide association studies identifies 1q22 as a susceptibility locus for intracerebral hemorrhage. Am. J. Hum. Genet. 94, 511–21 (2014).
Article CAS PubMed PubMed Central Google Scholar
Burgess, S., Bowden, J., Fall, T., Ingelsson, E. & Thompson, S. G. Sensitivity analyses for robust causal inference from Mendelian randomization analyses with multiple genetic variants. Epidemiology 28, 30–42 (2017).
Article PubMed Google Scholar

Download references

Acknowledgements

This research has been conducted using the UKB Resource under application number 236 granting access to the corresponding UKB genetic and phenotype data (released 17 Nov. 2016). See Supplementary information for details of cohorts, GWAS resources, and funding. This research has been conducted using the UKB Resource under applications number 10035 and 236 granting access to the corresponding UKB genetic and phenotype data (released 17 Nov. 2016). UK Biobank genotyping was supported by the British Heart Foundation (grant SP/13/2/30111) for Large-scale comprehensive genotyping of UKB for cardiometabolic traits and diseases: UK CardioMetabolic Consortium. P.E. is Director of the Medical Research Council Centre for Environment and Health and acknowledges support from the Medical Research Council and Public Health England (MR/L01341X/1 and MR/S019669/1). P.E. also acknowledges support from the National Institute of Health Research Imperial Biomedical Research Centre. P.E. is a UK Dementia Research Institute professor, UK Dementia Research Institute at Imperial College London. The DRI receives funding from UK Dementia Research Institute Ltd funded by the UK Medical Research Council, Alzheimer’s Society and Alzheimer’s Research UK. P.E. is associate director of Health Data Research UK-London funded by a consortium led by the UK Medical Research Council. This work used the computing resources of the UK MEDical BIOinformatics partnership (UK MED-BIO), which is supported by the Medical Research Council (MR/L01632X/1). R.P. holds a fellowship supported by Rutherford Fund from Medical Research Council (MR/R0265051/1). The main replication sample was based on data from the Million Veteran Program (MVP), Office of Research and Development, Veterans Health Administration. The outlined work performed in MVP was supported by funding from the Department of Veterans Affairs Office of Research and Development, Million Veteran Program via #MVP000 and I01-BX003362 (P.S.T. and K.M.-C.) with additional support from the NIH/NIDDK (DK101478, B.F.V.; 1K23DK115897-01, M.S.), the NIH/NHGRI (HG010067, B.F.V.), NIH/NIAAA (RO1 AA026302, R.M.C.), Linda Pechenik Montague Investigator award (B.F.V.), and VA Informatics and Computing Infrastructure (VINCI) VA HSR RES 130457. The content of this manuscript does not represent the views of the Department of Veterans Affairs or the United States Government. The LifeLines Cohort Study, and generation and management of GWAS genotype data for the LifeLines Cohort Study is supported by the Netherlands Organization of Scientific Research NWO (grant 175.010.2007.006), the Economic Structure Enhancing Fund (FES) of the Dutch government, the Ministry of Economic Affairs, the Ministry of Education, Culture and Science, the Ministry for Health, Welfare and Sports, the Northern Netherlands Collaboration of Provinces (SNN), the Province of Groningen, University Medical Center Groningen, the University of Groningen, Dutch Kidney Foundation and Dutch Diabetes Research Foundation. The authors wish to acknowledge the services of the Lifelines Cohort Study, the contributing research centers delivering data to Lifelines, and all the study participants. NFBC1966 received financial support from the Academy of Finland (EGEA-project, no 285547), University Hospital Oulu, University of Oulu, Finland (75617), NHLBI grant 5R01HL087679-02 (STAMPEED program, 1RL1MH083268-01), the Medical Research Council, UK (PREcisE, JPI HDHL, MR/S03658X/1), H2020 DynaHEALTH action (Grant Agreement 633595), H2020 ALEC Action (Grant Agreement 633212) and H2020 EUCAN Connect (Grant Agreement 824989).

Author information

These authors contributed equally: Raha Pazoki, Marijana Vujkovic, Benjamin F. Voight, Kyong-Mi Chang, Mark R. Thursz, Paul Elliott.

Authors and Affiliations

Department of Epidemiology and Biostatistics, MRC Centre for Environment and Health, School of Public Health, London, UK
Raha Pazoki, Joshua Elliott, Evangelos Evangelou, Dipender Gill, Rui Climaco Pinto, Matthias Wielscher, Verena Zuber, Saredo Said, Abbas Dehghan, Ioanna Tzoulaki, Marjo-Riitta Järvelin & Paul Elliott
Division of Biomedical Sciences, Department of Life Sciences, College of Health, Medicine and Life Sciences, Brunel University London, Uxbridge, UK
Raha Pazoki & Xiyun Jiang
Corporal Michael J. Crescenz VA Medical Center, Philadelphia, PA, USA
Marijana Vujkovic, David E. Kaplan, Marina Serper, Rotonya M. Carr, Benjamin F. Voight & Kyong-Mi Chang
Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Marijana Vujkovic, David E. Kaplan, Marina Serper, Rotonya M. Carr & Kyong-Mi Chang
Royal Surrey County Hospital, Guildford, Surrey, UK
Joshua Elliott
Department of Hygiene and Epidemiology, University of Ioannina Medical School, Ioannina, Greece
Evangelos Evangelou & Ioanna Tzoulaki
British Heart Foundation Centre of Research Excellence, Imperial College London, London, UK
Dipender Gill & Paul Elliott
Department of Epidemiology, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
Mohsen Ghanbari & M. Arfan Ikram
Department of Genetics, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
Mohsen Ghanbari
Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Peter J. van der Most, Harold Snieder, H. Marike Boezen & Behrooz Z. Alizadeh
UK Dementia Research Institute, Imperial College London, London, UK
Rui Climaco Pinto, Abbas Dehghan & Paul Elliott
Department of Dermatology, Medical University of Vienna, Vienna, Austria
Matthias Wielscher & Matthias Farlik
Department of Gastroenterology and Hepatology, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
Robert J. de Knegt
Department of Internal Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
André G. Uitterlinden
VA Salt Lake City Health Care System, Salt Lake City, UT, USA
Julie A. Lynch & Kyung Min Lee
University of Massachusetts, Boston, MA, USA
Julie A. Lynch
School of Medicine, University of Utah, Salt Lake City, UT, USA
Kyung Min Lee
VA Palo Alto Health Care System, Palo Alto, CA, USA
Philip S. Tsao
School of Medicine, Stanford University, Stanford, CA, USA
Philip S. Tsao
Division of Digestive Diseases, Department of Metabolism, Digestion & Reproduction, Imperial College London, London, UK
Stephen R. Atkinson & Mark R. Thursz
Institute of Biomedicine, Medical Research Center Oulu, Oulu University, Oulu, Finland
Karl-Heinz Herzig
Oulu University Hospital, Oulu, Finland
Karl-Heinz Herzig
Institute of Pediatrics, Poznan University of Medical Sciences, Poznan, Poland
Karl-Heinz Herzig
Center for Life Course Health Research, Faculty of Medicine, Oulu University, Oulu, Finland
Marjo-Riitta Järvelin
Department of Life Sciences, College of Health and Life Sciences, Brunel University London, Uxbridge, UK
Marjo-Riitta Järvelin
Unit of Primary Care, Oulu University Hospital, Oulu, Finland
Marjo-Riitta Järvelin
VA Boston Healthcare System, Boston, MA, USA
Christopher J. O’Donnell, Saiju Pyarajan, Kelly Cho, J. Michael Gaziano, Scott Kinlay, Xuan-Mai T. Nguyen, Jessica V. Brewer, Mary T. Brophy, Nhan V. Do, Donald E. Humphries, Luis E. Selva, Shahpoor Shayan, Stacey B. Whitbourne, Jim L. Breeling, J. P. Casas Romero & Rachel B. Ramoni
Harvard Medical School, Boston, MA, USA
Christopher J. O’Donnell
Brigham Women’s Hospital, Boston, MA, USA
Christopher J. O’Donnell
Departments of Medicine and Cardiology, Columbia University, New York City, NY, USA
Danish Saleheen
Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Benjamin F. Voight
Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Benjamin F. Voight
Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Benjamin F. Voight
National Institute for Health Research, Imperial Biomedical Research Centre, Imperial College London, London, UK
Paul Elliott
Health Data Research UK at Imperial College London, London, UK
Paul Elliott
Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Lude Franke, Morris Swertz & Cisca Wijmenga
Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Pim van der Harst
Division of Nephrology, Department of Internal Medicine, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Gerjan Navis
Department of Pathology and Medical Biology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Marianne Rots & Bruce H. R. Wolffenbuttel
Department of Endocrinology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Bruce H. R. Wolffenbuttel
Iowa City VA Health Care System, Iowa City, IA, USA
Zuhair K. Ballas
VA North Texas Health Care System, Dallas, TX, USA
Sujata Bhushan
VA Puget Sound Health Care System, Seattle, WA, USA
Edward J. Boyko
Portland VA Medical Center, Portland, OR, USA
David M. Cohen
VA Connecticut Healthcare System, West Haven, CT, USA
John Concato, Michaela Aslan & Hongyu Zhao
Southeast Louisiana Veterans Health Care System, New Orleans, LA, USA
Joseph I. Constans
Birmingham VA Medical Center, Birmingham, AL, USA
Louis J. Dellitalia
VA Southern Nevada Healthcare System, North Las Vegas, NV, USA
Joseph M. Fayad
VA Loma Linda Healthcare System, Loma Linda, CA, USA
Ronald S. Fernando
Miami VA Health Care System, Miami, FL, USA
Hermes J. Florez
VA Eastern Kansas Health Care System, Leavenworth, KS, USA
Melinda A. Gaddy
John D. Dingell VA Medical Center, Detroit, MI, USA
Saib S. Gappy
Fayetteville VA Medical Center, Fayetteville, AR, USA
Gretchen Gibson
Richmond VA Medical Center, Richmond, VA, USA
Michael Godschalk
Sioux Falls VA Health Care System, Sioux Falls, SD, USA
Jennifer A. Greco
VA San Diego Healthcare System, San Diego, CA, USA
Samir Gupta
Edward Hines Jr. VA Medical Center, Hines, IL, USA
Salvador Gutierrez
Fargo VA Health Care System, Fargo, ND, USA
Kimberly D. Hammer
Ralph H. Johnson VA Medical Center, Charleston, SC, USA
Mark B. Hamner
Cincinnati VA Medical Center, Cincinnati, OH, USA
John B. Harley
VA Tennessee Valley Healthcare System, Nashville, TN, USA
Adriana M. Hung
VA Sierra Nevada Health Care System, Reno, NV, USA
Mostaqul Huq
W.G. (Bill) Hefner VA Medical Center, Salisbury, NC, USA
Robin A. Hurley
Hampton VA Medical Center, Hampton, VA, USA
Pran R. Iruvanti
Eastern Oklahoma VA Health Care System, Muskogee, OK, USA
Douglas J. Ivins
VA Northeast Ohio Healthcare System, Cleveland, OH, USA
Frank J. Jacono
Philadelphia VA Medical Center, Philadelphia, PA, USA
Darshana N. Jhala
VA Health Care Upstate New York, Albany, NY, USA
Laurence S. Kaminsky
Louisville VA Medical Center, Louisville, KY, USA
Jon B. Klein
Richard Roudebush VA Medical Center, Indianapolis, IN, USA
Suthat Liangpunsakul
Washington DC VA Medical Center, Washington, DC, USA
Jack H. Lichy, Jennifer Moser, Grant D. Huang & Sumitra Muralidhar
James A. Haley Veterans Hospital, Tampa, FL, USA
Stephen M. Mastorides
Columbia VA Health Care System, Columbia, SC, USA
Roy O. Mathew
Central Western Massachusetts Healthcare System, Leeds, MA, USA
Kristin M. Mattocks
Bay Pines VA Healthcare System, Bay Pines, FL, USA
Rachel McArdle
Southern Arizona VA Health Care System, Tucson, AZ, USA
Paul N. Meyer
VA Salt Lake City Health Care System, Salt Lake City, UT, USA
Laurence J. Meyer
James H. Quillen VA Medical Center, Johnson City, TN, USA
Jonathan P. Moorman
VA Long Beach Healthcare System, Long Beach, CA, USA
Timothy R. Morgan
Minneapolis VA Health Care System, Minneapolis, MN, USA
Maureen Murdoch
Michael E. DeBakey VA Medical Center, Houston, TX, USA
Olaoluwa O. Okusaga
Salem VA Medical Center, Salem, VA, USA
Kris-Ann K. Oursler
Manchester VA Medical Center, Manchester, NH, USA
Nora R. Ratcliffe
St. Louis VA Health Care System, St. Louis, MO, USA
Michael I. Rauchman
White River Junction VA Medical Center, White River Junction, VT, USA
R. Brooks Robey
VA Pacific Islands Health Care System, Honolulu, HI, USA
George W. Ross
Syracuse VA Medical Center, Syracuse, NY, USA
Richard J. Servatius
Providence VA Medical Center, Providence, RI, USA
Satish C. Sharma
VA New York Harbor Healthcare System, New York, NY, USA
Scott E. Sherman
VA Pittsburgh Health Care System, Pittsburgh, PA, USA
Elif Sonel
North Florida/South Georgia Veterans Health System, Gainesville, FL, USA
Peruvemba Sriram
VA Maine Healthcare System, Augusta, ME, USA
Todd Stapley
William S. Middleton Memorial Veterans Hospital, Madison, WI, USA
Robert T. Striker
Overton Brooks VA Medical Center, Shreveport, LA, USA
Neeraj Tandon
New Mexico VA Health Care System, Albuquerque, NM, USA
Gerardo Villareal
VA Greater Los Angeles Health Care System, Los Angeles, CA, USA
Agnes S. Wallbom
Edith Nourse Rogers Memorial VA Hospital, Bedford, MA, USA
John M. Wells
Clement J. Zablocki VA Medical Center, Milwaukee, WI, USA
Jeffrey C. Whittle
San Francisco VA Health Care System, San Francisco, CA, USA
Mary A. Whooley
Atlanta VA Medical Center, Decatur, GA, USA
Peter W. Wilson & Yan V. Sun
VA Western New York Healthcare System, Buffalo, NY, USA
Junzhe Xu
Northport VA Medical Center, Northport, NY, USA
Shing-Shing Yeh
Raymond G. Murphy VA Medical Center, Albuquerque, NM, USA
Todd Connor & Dean P. Argyres
Durham VA Medical Center, Durham, NC, USA
Elizabeth R. Hauser & Jean C. Beckham
Canandaigua VA Medical Center, Canandaigua, NY, USA
Brady Stephens
Phoenix VA Health Care System, Phoenix, AZ, USA
Samuel M. Aguayo
South Texas Veterans Health Care System, San Antonio, TX, USA
Sunil K. Ahuja

Authors

Raha Pazoki
View author publications
You can also search for this author in PubMed Google Scholar
Marijana Vujkovic
View author publications
You can also search for this author in PubMed Google Scholar
Joshua Elliott
View author publications
You can also search for this author in PubMed Google Scholar
Evangelos Evangelou
View author publications
You can also search for this author in PubMed Google Scholar
Dipender Gill
View author publications
You can also search for this author in PubMed Google Scholar
Mohsen Ghanbari
View author publications
You can also search for this author in PubMed Google Scholar
Peter J. van der Most
View author publications
You can also search for this author in PubMed Google Scholar
Rui Climaco Pinto
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Wielscher
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Farlik
View author publications
You can also search for this author in PubMed Google Scholar
Verena Zuber
View author publications
You can also search for this author in PubMed Google Scholar
Robert J. de Knegt
View author publications
You can also search for this author in PubMed Google Scholar
Harold Snieder
View author publications
You can also search for this author in PubMed Google Scholar
André G. Uitterlinden
View author publications
You can also search for this author in PubMed Google Scholar
Julie A. Lynch
View author publications
You can also search for this author in PubMed Google Scholar
Xiyun Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Saredo Said
View author publications
You can also search for this author in PubMed Google Scholar
David E. Kaplan
View author publications
You can also search for this author in PubMed Google Scholar
Kyung Min Lee
View author publications
You can also search for this author in PubMed Google Scholar
Marina Serper
View author publications
You can also search for this author in PubMed Google Scholar
Rotonya M. Carr
View author publications
You can also search for this author in PubMed Google Scholar
Philip S. Tsao
View author publications
You can also search for this author in PubMed Google Scholar
Stephen R. Atkinson
View author publications
You can also search for this author in PubMed Google Scholar
Abbas Dehghan
View author publications
You can also search for this author in PubMed Google Scholar
Ioanna Tzoulaki
View author publications
You can also search for this author in PubMed Google Scholar
M. Arfan Ikram
View author publications
You can also search for this author in PubMed Google Scholar
Karl-Heinz Herzig
View author publications
You can also search for this author in PubMed Google Scholar
Marjo-Riitta Järvelin
View author publications
You can also search for this author in PubMed Google Scholar
Behrooz Z. Alizadeh
View author publications
You can also search for this author in PubMed Google Scholar
Christopher J. O’Donnell
View author publications
You can also search for this author in PubMed Google Scholar
Danish Saleheen
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin F. Voight
View author publications
You can also search for this author in PubMed Google Scholar
Kyong-Mi Chang
View author publications
You can also search for this author in PubMed Google Scholar
Mark R. Thursz
View author publications
You can also search for this author in PubMed Google Scholar
Paul Elliott
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

Lifelines Cohort Study

Behrooz Z. Alizadeh
, H. Marike Boezen
, Lude Franke
, Pim van der Harst
, Gerjan Navis
, Marianne Rots
, Morris Swertz
, Bruce H. R. Wolffenbuttel
& Cisca Wijmenga

the VA Million Veteran Program

Zuhair K. Ballas
, Sujata Bhushan
, Edward J. Boyko
, David M. Cohen
, John Concato
, Michaela Aslan
, Hongyu Zhao
, Joseph I. Constans
, Louis J. Dellitalia
, Joseph M. Fayad
, Ronald S. Fernando
, Hermes J. Florez
, Melinda A. Gaddy
, Saib S. Gappy
, Gretchen Gibson
, Michael Godschalk
, Jennifer A. Greco
, Samir Gupta
, Salvador Gutierrez
, Kimberly D. Hammer
, Mark B. Hamner
, John B. Harley
, Adriana M. Hung
, Mostaqul Huq
, Robin A. Hurley
, Pran R. Iruvanti
, Douglas J. Ivins
, Frank J. Jacono
, Darshana N. Jhala
, Laurence S. Kaminsky
, Jon B. Klein
, Suthat Liangpunsakul
, Jack H. Lichy
, Jennifer Moser
, Grant D. Huang
, Sumitra Muralidhar
, Stephen M. Mastorides
, Roy O. Mathew
, Kristin M. Mattocks
, Rachel McArdle
, Paul N. Meyer
, Laurence J. Meyer
, Jonathan P. Moorman
, Timothy R. Morgan
, Maureen Murdoch
, Olaoluwa O. Okusaga
, Kris-Ann K. Oursler
, Nora R. Ratcliffe
, Michael I. Rauchman
, R. Brooks Robey
, George W. Ross
, Richard J. Servatius
, Satish C. Sharma
, Scott E. Sherman
, Elif Sonel
, Peruvemba Sriram
, Todd Stapley
, Robert T. Striker
, Neeraj Tandon
, Gerardo Villareal
, Agnes S. Wallbom
, John M. Wells
, Jeffrey C. Whittle
, Mary A. Whooley
, Peter W. Wilson
, Yan V. Sun
, Junzhe Xu
, Shing-Shing Yeh
, Todd Connor
, Dean P. Argyres
, Elizabeth R. Hauser
, Jean C. Beckham
, Brady Stephens
, Samuel M. Aguayo
, Sunil K. Ahuja
, Saiju Pyarajan
, Kelly Cho
, J. Michael Gaziano
, Christopher J. O’Donnell
, Scott Kinlay
, Xuan-Mai T. Nguyen
, Jessica V. Brewer
, Mary T. Brophy
, Nhan V. Do
, Donald E. Humphries
, Luis E. Selva
, Shahpoor Shayan
, Stacey B. Whitbourne
, Jim L. Breeling
, J. P. Casas Romero
, Rachel B. Ramoni
, Philip S. Tsao
& Kyong-Mi Chang

Contributions

R.P., P.E., and M.R.T. led this research. R.P., P.E., and M.R.T. drafted the manuscript. R.P. performed quality control and management of the UK Biobank phenotypes for this research and performed GWAS and secondary analyses with contributions from R.C.P. (metabolon analysis), D.G. (Mendelian randomization analysis), X.J. (cross-trait lookup), and S.S. (provided Supplementary Fig. 1). J.E. contributed to data analysis. S.R.A. provided clinical input. E.E. and V.Z. provided statistical advice. I.T., P.E., and A.D. acquired Airwave data. M.G. performed data analysis in the Rotterdam Study. R.J.d.K. acquired liver function test data in the Rotterdam Study. A.G.U. acquired Rotterdam Study genetics data. M.A.I. acquired the Rotterdam Study cohort data. M.V. and K.M.L. performed data analysis in the MVP cohort. J.A.L. performed data collection in MVP. D.E.K. and M.S. performed phenotype curation in MVP. R.C.P., P.S.T., C.J.O., D.S., B.F.V., and K.-M.C. performed study design in MVP cohort. P.J.v.d.M. performed data analysis in Lifelines. H.S. and B.Z.A. acquired lifelines data. M.W. and M.F performed data analysis in NFBC1966. M.-R.J. and K.-H.H. acquired the NFBC1966 data. All authors critically reviewed and approved the final version of the manuscript.

Corresponding authors

Correspondence to Raha Pazoki or Paul Elliott.

Ethics declarations

Competing interests

D.G. declares part-time employment by Novo Nordisk. The other authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pazoki, R., Vujkovic, M., Elliott, J. et al. Genetic analysis in European ancestry individuals identifies 517 loci associated with liver enzymes. Nat Commun 12, 2579 (2021). https://doi.org/10.1038/s41467-021-22338-2

Download citation

Received: 13 July 2020
Accepted: 05 February 2021
Published: 10 May 2021
DOI: https://doi.org/10.1038/s41467-021-22338-2

This article is cited by

Proteome-wide Mendelian randomization highlights AIF1 and HLA-DQA2 as targets for primary sclerosing cholangitis
- Lanlan Chen
- Yuexuan Zhao
- Guoyue Lv
Hepatology International (2024)
Integrative common and rare variant analyses provide insights into the genetic architecture of liver cirrhosis
- Jonas Ghouse
- Gardar Sveinbjörnsson
- Stefan Stender
Nature Genetics (2024)
Genome-wide characterization of circulating metabolic biomarkers
- Minna K. Karjalainen
- Savita Karthikeyan
- Johannes Kettunen
Nature (2024)
Gamma-glutamyl transferase and calculus of kidney incidence: a Mendelian randomization study
- Peizhe Li
- Yuewen Pang
- Jing Shi
Scientific Reports (2023)
Evidence for involvement of the alcohol consumption WDPCP gene in lipid metabolism, and liver cirrhosis
- Felix O’Farrell
- Benjamin Aleyakpo
- Raha Pazoki
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Cross-trait associations

Tissue and protein expression assessment

Pathway analysis

Mendelian randomization (MR)

Genetic risk score (GRS) analysis

Discussion

Methods

Study design and participants

Ethical consideration

Liver and metabolic traits

Cardiovascular events

Genotyping and Imputation

Genome-wide association analysis in UKB

Locus definition

Replication and concordance

Cross-trait associations

Tissue and Protein expression analysis

Pathway analysis and gene-set enrichment analysis

GRS analysis

Mendelian randomization

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

Lifelines Cohort Study

the VA Million Veteran Program

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links