Abstract
The QT interval is an electrocardiographic measure representing the sum of ventricular depolarization and repolarization, estimated by QRS duration and JT interval, respectively. QT interval abnormalities are associated with potentially fatal ventricular arrhythmia. Using genome-wide multi-ancestry analyses (>250,000 individuals) we identify 177, 156 and 121 independent loci for QT, JT and QRS, respectively, including a male-specific X-chromosome locus. Using gene-based rare-variant methods, we identify associations with Mendelian disease genes. Enrichments are observed in established pathways for QT and JT, and previously unreported genes indicated in insulin-receptor signalling and cardiac energy metabolism. In contrast for QRS, connective tissue components and processes for cell growth and extracellular matrix interactions are significantly enriched. We demonstrate polygenic risk score associations with atrial fibrillation, conduction disease and sudden cardiac death. Prioritization of druggable genes highlight potential therapeutic targets for arrhythmia. Together, these results substantially advance our understanding of the genetic architecture of ventricular depolarization and repolarization.
Introduction
The electrocardiogram (ECG) is a non-invasive tool that captures cardiac electrical activity1. The QT interval (QT) represents the sum of ECG measures that estimate intervals for ventricular depolarization (QRS duration; QRS) and repolarization (JT interval; JT) at an organ level (Fig. 1). The QT is used to diagnose congenital long or short QT syndromes and acquired QT-prolongation, which are associated with an increased risk for ventricular arrhythmia and sudden cardiac death (SCD)2,3,4,5. Susceptibility to congenital long QT syndrome (LQTS) is mediated by rare and common variation at 15 genes including KCNQ1, KCNH2, and SCN5A6,7. However, 25-30% of LQTS cases have a negative genetic test and LQTS genes do not adequately explain the heritability of the QT in the general population, or predisposition to QT-prolongation from precipitating factors such as medication8,9.
QT and JT phenotypes are highly correlated, whereas QRS has a modest positive and a negligible negative correlation with QT and JT, respectively10. While at a cellular level, repolarization starts directly after depolarization of the first ventricular cardiomyocyte, at an organ level the majority of ventricular repolarization occurs during the JT interval. Therefore, investigation of the QT and its individual components has the potential to identify both shared and specific biological mechanisms for ventricular depolarization and repolarization. Genome-wide association studies (GWAS) for QT and JT have reported common variants at genes regulating cardiac ion channels, calcium-handling proteins and myocyte structure11,12. GWAS for QRS have highlighted genes for sodium channels, kinases and transcription factors in cardiac embryonic development13,14. However, a large proportion of the heritability remains unexplained (~67% for QT, ~83% for QRS), and our limited understanding of the underlying biological networks restricts the potential translational opportunities15,16,17.
We have performed the largest multi-ancestry GWAS meta-analysis to date for QT, JT, and QRS in over 250,000 individuals, to discover additional candidate genes and pathways relevant to ventricular depolarization and repolarization, identify new therapeutic targets, and test the association of polygenic risk scores (PRSs) with cardiovascular disease.
Results
Meta-analysis of GWAS
Thirty-five studies contributed to our primary multi-ancestry GWAS meta-analysis for QT, JT, and QRS, comprising a maximum total of 252,977 individuals of European (84%), Hispanic/Latino (7.7%), African (6.7%), South and South-East Asian (<1%) ancestries (Supplementary Data 1–3, Supplementary Note 1). The meta-analysis workflow is summarized in Fig. 2. No evidence of inflation of test statistics was observed (Supplementary Figs. 1 and 2).
Workflow including single variant and gene-based meta-analyses, and downstream bioinformatics. VEP (Variant Effect Predictor), CADD (Combined Annotation Dependent Depletion), eQTL (expression Quantitative Trait Locus), GTEx (Genotype-Tissue Expression project), COLOC (Colocalization), GARFIELD (GWAS Analysis of Regulatory and Functional Information Enrichment with LD correction), DEPICT (Data-driven Expression-Prioritized Integration for Complex Traits, GWAS (Genome-Wide Association Study), EA (European Ancestry), PRS (Polygenic Risk Score), AF (Atrial Fibrillation), CAD (Coronary Artery Disease), CD (Conduction Disease), HF (Heart Failure), NICM (Non-Ischemic Cardiomyopathy), VA (Ventricular Arrhythmia), SCD (Sudden Cardiac Death).
QT GWAS meta-analysis
We discovered 176 genome-wide significant (GWS; P-value (P) < 5 × 10−8) lead variants at independent autosomal loci (114 unreported) associated with QT in the multi-ancestry meta-analysis (Table 1, Supplementary Fig. 3a). Of the previously reported loci for QT or JT (grouped as the phenotypic correlation is high), there was support for association at 62/66 (93.9%) loci (P < 5 × 10−8). There was weaker support for association at 3 loci (NRAP, MYH6, NACA, P < 1.29 × 10−4) and no evidence of support for SUCLA2 (P > 0.05) (Supplementary Data 4). Ancestry-specific analyses identified additional unreported loci in European (11), Hispanic/Latino (1), and African (1) individuals (Supplementary Data 5). All European and African ancestry-specific lead variants were supported in the multi-ancestry analysis (P < 5 × 10−5). The Hispanic ancestry locus was not supported in the multi-ancestry analysis (P = 0.07), however, the lead variant at this locus was rare (MAF = 0.002) and monomorphic in Europeans.
To identify additional signals, we performed joint and conditional analyses with Genome-wide Complex Trait Analysis (GCTA)18 using summary statistics from the European ancestry meta-analysis with the reference sample from UK Biobank (52,230 individuals of European ancestry). These analyses identified an additional 65 conditionally independent variants at 38 loci at Pjoint < 5 × 10−8 (Supplementary Data 6).
JT and QRS GWAS meta-analyses
For JT and QRS, we identified 155 and 121 lead variants at independent autosomal loci (96 and 77 unreported) in multi-ancestry meta-analyses respectively (Table 1, Supplementary Fig. 3, Supplementary Data 7 and 8). Ancestry-specific analyses identified additional unreported loci (N = 18) in European (4 JT, 6 QRS), African (4 JT, 2 QRS) and Hispanic/Latino (1 JT, 1 QRS) ancestries. Of these, 13 lead variants had evidence for support in the multi-ancestry analysis (P < 5 × 10−5). Joint and conditional analyses identified an additional 56 and 29 conditionally independent variants at 32 JT and 18 QRS loci, respectively (Supplementary Data 6).
X-chromosome meta-analyses
X-chromosome analyses (multi-ancestry sample size: 86,600 and 60,343 for separate female and male analyses, respectively) identified one locus in males in both multi- and European ancestry meta-analyses, for QT and JT (Table 1, Supplementary Data 5 and 7). There were no GWS findings for QRS or female X-chromosome analyses, and no suggestive evidence of association on a lookup of the lead QT/JT variant (rs55891214) in these analyses (P > 0.05). rs55891214 is highly correlated (r2 > 0.9) with lead variants reported in GWAS for serum testosterone, estradiol levels, male-pattern baldness, and heel bone mineral density19,20,21,22. The nearest gene, FAM9B, is exclusively expressed in the testis, and together these findings suggest the association may be driven by serum testosterone levels19,23.
Overlap of genetic contributions and heritability of QT, JT, and QRS
There was substantial overlap of multi-ancestry JT and QT GWAS loci (130/200, 65%) but less between QRS and QT (53/243, 21.8%) (overlap: r2 > 0.1 between lead variants or within ±500 kb). For QRS and JT, there was overlap at 34 loci, where a lead variant was genome-wide significant in both analyses (Supplementary Data 9). Predominantly discordant (27/34, 79.4%) directions of effect were observed at these lead variants (Fig. 3). Across all loci for QT, JT and QRS, overlap was observed with previously reported loci for PR interval (51 (29.1%), 46 (29.7%), 42 (34.7%), respectively) and resting heart rate (58 (33.1%), 57 (36.8%), 46 (38.0%)), demonstrating shared genetic contributions. 13 loci were common to all 5 ECG measures (Supplementary Data 10), highlighting these loci as integral genetic determinants of global cardiac electrophysiology. Estimated genetic correlations were calculated using LD Score Regression (LDSC)24,25. A strong positive correlation was observed between QT and JT (rg = 0.91, P < 0.001) and a weak positive correlation between QT and QRS (rg = 0.17, P = 0.05) (Supplementary Fig. 4). In contrast, a negative genetic correlation was observed between JT and QRS (rg = −0.25, P = 0.003).
Circular Manhattan plots for QT (outer, yellow), JT (middle, red), and QRS (inner, blue) multi-ancestry GWAS linear regression meta-analyses. The Y-axis has been restricted to -log10 P-value < 30. Two-sided P-values are reported. A Bonferroni-corrected threshold (<5 × 10−8) was used to declare significance. Overlapping JT and QRS loci are labeled with the most likely candidate at the locus color coded according to a concordant (green) or discordant (purple) direction of effect at a variant level. Direction of effect was compared by comparing the lead JT variant beta with the corresponding direction of effect of the same variant in the QRS GWAS meta-analysis. This plot was produced using the R package Circlize version 0.4.10. Gu, Z. (2014) circlize implements and enhances circular visualization in R. Bioinformatics. 10.1093/bioinformatics/btu393.
SNP-based heritability estimations in Europeans from UKB for QT, JT and QRS were 29.3%, 29.5% and 15.0% (standard error [SE]:1%) respectively. The percentage of overall variance explained by all lead and conditionally independent variants from the European meta-analysis was 14.6%, 15.9% and 6.3%. Therefore, these variants explain 49.8%, 53.9%, and 42.0% of the SNP-based heritability of QT, JT and QRS in the UKB individuals included in the heritability calculations (Supplementary Note 2).
Gene-based meta-analysis
To investigate whether rare variants (MAF < 0.01) in aggregate modulate ECG traits, we conducted gene-based meta-analyses of rare variants predicted by Variant Effect Predictor (VEP)26 to have high or moderate impact on protein function, using Sequence Kernel Association Testing (SKAT)27. These analyses discovered 13, 16, and 3 genes for QT, JT and QRS, respectively (P < 2.5 × 10−6; Bonferroni adjusted for ~20,000 genes). These genes were brought forward for conditional analyses, and 7, 7 and 2 genes remained associated with QT, JT and QRS, respectively, after conditioning on the rare variant with the lowest P-value at each gene (P < 0.05/number of genes) (Table 2). These results indicate that the gene-based associations were not a consequence of a single variant with a strong effect. We identified an association of rare variants in aggregate at Mendelian long-QT syndrome (LQTS) genes (KCNQ1 [QT and JT], KCNH2 [QT]). SCN5A was associated with JT and QRS, however, it did not reach the Bonferroni corrected threshold for significance for QT. This could be explained by the discordant directions of effect observed for QRS (Beta [β]: 0.04) and JT (β:−0.05), which subsequently reduce the strength of the association observed for QT (β:−0.03). MYH7 and TNNI3K were also associated with JT. TNNI3K, which was not associated using single variant analysis, encodes a cardiomyocyte-specific kinase previously linked to familial cardiac arrhythmia and dilated cardiomyopathy (OMIM:613932)28,29.
As several genes mapped to loci implicated in single variant GWAS analyses, we explored the relationship between these rare gene-based signals and common (MAF > 0.05) or low frequency (0.01 ≤ MAF ≤ 0.05) variants. Analyses were repeated in 76,202 individuals from UKB conditioning on independent significant variants identified in the corresponding European GWAS meta-analysis, and residing within the same locus as the gene. These conditional analyses showed that associations for KCNQ1, KCNH2, and RNF207 with QT and/or JT were independent of flanking variants identified by GWAS (Supplementary Data 11). Because conditional analyses required a large sample with a shared set of variants, we lacked adequate power to definitively determine the independence of DLEC1 from nearby common variants. For MYH7 and TNNI3K, there were no GWS variants at the locus in our single-variant meta-analysis.
Variant-level functional annotation
Most multi-ancestry QT lead variants, 160/176 (90.9%), were common, 13 were low frequency and 3 were rare (RUFY1, DNAJB5, CACNB2; Supplementary Data 5). At 25 loci, a lead variant or proxy (r2 > 0.8) was annotated with VEP, as missense (N = 24) or stop-gain (N = 1) (Supplementary Data 12a). Ten of these (40%) were predicted by SIFT or PolyPhen-2 to be “deleterious” or “damaging”. These included: rs1805128, a KCNE1 polymorphism D85N (c.253G>A), which is a recognized modifier of Long QT syndrome30, and a missense variant in NEXN (rs1166698). NEXN mutations are associated with cardiomyopathies (OMIM:613121)31,32. At 16 loci, a lead variant or proxy had a Combined Annotation Dependent Depletion (CADD) score \(\ge \)20 and therefore was predicted to be among the top 1% most deleterious in the genome (Supplementary Data 12b). Of these, the variants at 10 loci were non-coding.
A similar proportion of lead variants for JT (91%) and QRS (95.9%) were common. Missense variants were identified in 21 JT and 11 QRS loci (Supplementary Data 12a) and predicted to be “deleterious” or “damaging” at 8 JT and 7 QRS loci. At 18 JT and 13 QRS loci, the lead/proxy variant had a CADD score \(\ge \)20 (Supplementary Data 12b). Of these, the variant was non-coding in 13 JT and 7 QRS loci.
Association with gene expression levels in cardiac tissue
Using data from the Genotype-Tissue Expression (GTEx) project33, we identified 39 (22.2%) multi-ancestry QT loci where a variant was a cis-eQTL in left ventricle (LV) or right atrial appendage (RAA) tissue samples (Supplementary Data 13). There was strong support for pairwise colocalization (posterior probability>0.75) at 17 loci for LV tissue and 14 for RAA.
At 37 (23.9%) JT and 18 (14.9%) QRS multi-ancestry loci the lead variant or proxy was a significant cis-eQTL in LV or RAA tissue (Supplementary Data 13). There was support for colocalization at 12 JT and 7 QRS loci in LV tissue and 12 JT and 8 QRS loci in RAA. Comparing QT/JT and QRS, discordant directions of effect were identified at 3 overlapping loci (KLF12 [RAA], PRKCA, and TCEA3 [RAA and LV]), suggesting differences at a variant level may translate to opposing effects on tissue-specific gene expression (Fig. 4).
Colocalization analyses performed using data from GTEx (version 8), using the R package COLOC (methods). A posterior probability of >75% was used to declare significance. Boxes are color coded to show either increased (red) or decreased (blue) effect on tissue-specific gene expression. The degree of shading reflects the normalized effect sizes (and therefore no units) from the slope of the linear regression model for the effect allele relative to the non-effect allele (see methods for more information). The direction of effect has been aligned to the ECG trait prolonging allele. Y axis: Transcripts. RAA: Right atrial appendage, LV: Left ventricle.
Tissue- and cell-type specific effects of variants through regulatory elements
Potential target genes of regulatory variants were identified using two long-range chromatin interaction datasets (40 kb-resolution Hi-C and ~4 kb-resolution promoter-capture Hi-C) from LV and RV tissue34,35. Promoter interactions were identified at 39 (22.2%) QT loci (Supplementary Data 14a and 14b). Evaluation of cardiac cell-type specific effects using single nucleus Assay for Transposase-Accessible Chromatin using sequencing (snATAC-seq) data36, identified significant enrichment at open chromatin regions for QT in atrial and ventricular cardiomyocytes (Supplementary Fig. 5).
Promoter interactions were identified at 46 (29.7%) JT and 28 (23.1%) QRS loci (Supplementary Data 14a and 14b). Cardiac cell-type specific enrichment was significant in atrial and ventricular cardiomyocytes for both JT and QRS, and in adipocytes for JT (Supplementary Fig. 5).
Tissue-specific enrichment of variants in DNaseI hypersensitivity sites, using GWAS Analysis of Regulatory and Functional Information Enrichment with LD correction (GARFIELD, v2)37 identified strongest enrichment in fetal heart tissue for QT, JT, and QRS (Supplementary Fig. 6).
Gene-set tissue/cell-type enrichment and pathway analyses
Candidate genes were prioritized to common functional pathways using reconstituted gene sets in Data-driven Expression-Prioritization Integration for Complex Traits (DEPICT) software38. These gene sets were highly expressed (false discovery rate [FDR] < 0.01) in cardiac tissues for all three traits (Supplementary Data 15). Additionally, for QRS, connective tissue cell types including heart valve, chondrocytes, and joint/skeletal tissues, were significant (Supplementary Fig. 7).
The most significant (FDR < 0.01) gene-ontology (GO) biological processes for QT could be grouped into broad categories, including cardiac and muscle cell differentiation/development, and regulation of gene expression (Supplementary Fig. 8). In addition, and previously not reported for QT, response to insulin and insulin receptor signaling processes were identified (Fig. 5). In the Reactome database, top pathways were related to signal transduction, including protein kinase B mediated events, mechanistic target of rapamycin (mTOR) signaling, the phosphoinositide 3-kinases cascade and the insulin receptor signaling cascade (Supplementary Data 16). The top 10 mouse phenotypes enriched for these gene sets included abnormal myocardial layer/trabeculae morphology, decreased embryo size/growth retardation, and increased heart weight.
The first three panels (QT, JT, and QRS) were created using Cytoscape (v3.8.2). Significant GO biological processes (false discovery rate [FDR] < 0.01) from DEPICT pathway analyses (represented as a colored point in the image) were linked together (light orange line) when containing a minimum of 25% overlap of gene members. Orphan pathways or those with less than three edges were excluded. This created discrete “modules” of interlinked pathways, from which common themes could be identified. The final panel shows a bar graph with the most significant GO process members (Y-axis) for JT and QRS from each “common theme”, along with their enrichment P-values (X-axis) and color coded by FDR (see legend). Enrichment P-values are as output by DEPICT which compares z-scores derived from Welch’s t-test again the null hypothesis (see methods for more information). TGF-beta: Transforming growth factor beta, TRPS/TKS: transmembrane receptor protein serine/threonine kinase.
GO biological processes significant for JT (and QT), but not QRS, included regulation of gene expression, histone and chromatin modification, insulin receptor signaling and response to insulin stimulus (Fig. 5, Supplementary Data 16). Cellular growth, the transmembrane receptor protein serine/threonine kinase signaling pathway and vasculogenesis were enriched only for QRS. Reactome pathways significant for JT (and QT) but not QRS included regulation of lipid metabolism by peroxisome proliferator and its activated receptor effect on gene expression, along with insulin receptor signaling and related events (Supplementary Fig. 9). The top Reactome pathway for QRS, not significant for JT (or QT), was extracellular matrix interactions39. A summary of the findings for all bioinformatic analyses for previously unreported loci is in Supplementary Data 17–19.
Identification of potential drug targets for therapeutic opportunities
To identify potential drug targets for arrhythmia, we interrogated the druggable gene-set database published by Finan et al.40. We examined all 200, 173 and 155 plausible candidate genes from the QT, JT and QRS multi-ancestry meta-analyses respectively, including known and previously unreported loci. 53 (QT), 46 (JT), and 31 (QRS) genes were identified as potential drug targets, that are not current targets of anti-arrhythmic drugs (Supplementary Data 20). Of these, 21 QT, 17 JT, and 10 QRS genes were classed as Tier 1, encoding proteins that are targets of drugs either approved or in development. Genes from significant signals in cardiac tissue-specific eQTL co-localization and Hi-C analyses may be favored for prioritization. Of the 53 potential gene therapeutic targets for QT, ABCC8, KCNA7, KCNK13, PRKCA and THRB had support for co-localization in eQTL analyses and NFKB1, MITF, PLK2 and CASQ2 were significant Hi-C findings. CASQ2 and PLK2 were previously investigated as potential targets of gene transfer technology41,42.
Association of genetically determined QT, JT, and QRS with cardiovascular disease and sudden cardiac death
PRSs were constructed using European-ancestry lead variants to determine the relationship of genetically determined QT, JT and QRS with the directly measured ECG phenotype, and cardiovascular diseases that may have shared genetic contributions to risk. Each PRS was tested for association with the directly measured ECG trait in 4214 individuals from UKB not included in the GWAS. Associations observed for each PRS were (β [95% CI]): 6.4 ms (5.7–7.1) for QT; 6.4 ms (5.7–7.1) for JT; and 2.2 ms (1.7–2.7) for QRS (ms per standard deviation [SD] increase in the PRS). A significant difference in means was observed (two sample t-test, P < 2.2 × 10−16), when comparing individuals in the top and bottom quintiles of the PRS distribution (16.0 ms for QT; 16.0 ms for JT and 6.2 ms for QRS).
In ~357 K unrelated individuals of European ancestry from UKB not included in the GWAS meta-analysis, each PRS was tested for association with prevalent cardiovascular disease cases including atrial fibrillation (AF), “atrioventricular block (AVB), or permanent pacemaker implantation (PPM)”, “bundle branch block (BBB) or fascicular block”, and heart failure (Supplementary Data 21, Fig. 6). A Bonferroni threshold (0.05/number of conditions tested) was used to indicate significance (P < 6.3 × 10−3). Genetically determined QT and JT were associated with decreased risk for “AVB or PPM implantation” (odds ratio [OR] (95% CI) per SD: 0.94 [0.924–0.963] and 0.94 [0.918–0.956] respectively). In contrast, genetically determined QRS was associated with increased risk for “BBB or fascicular block” (1.07 [1.037–1.105]). Genetically determined QT and QRS were associated with decreased risk for AF (0.97 [0.954–0.979]) and 0.93 [0.921–0.945], respectively). Including the QRS PRS as a covariate in the QT model did not substantially change the point estimate (OR: 0.97 [0.958–0.983]), indicating the relationship with AF was not driven by overlap with the genetic contribution for QRS.
Data are presented as odds ratios (OR) and 95% confidence intervals (lower 2.5% and upper 97.5%) for association of each ECG (QT – blue, JT – green, QRS – yellow) polygenic risk score (PRS) with prevalent cases in UK Biobank from logistic regression analyses. Associations are reported as risk per standard deviation increase in the PRS and statistical tests were two sided. To adjust for multiple testing, a Bonferroni-corrected threshold (P < 6.3 × 10−3) was used to declare significance. A total of 371,951 individuals of European ancestry were included in this analysis. AF (Atrial Fibrillation), AVB (Atrioventricular block), PPM (Permanent pacemaker), BBB (Bundle branch block), HF (Heart Failure).
As these ECG measures are established risk markers for malignant ventricular arrhythmia and SCD, we also tested each PRS for association with SCD in the Atherosclerosis Risk in Communities (ARIC) study, Cardiac Arrest Blood Study (CABS), Finnish Genetic Study for Arrhythmic Events (FinGesture) and Northern Finland Birth Cohort of 1966 (NFBC1966) cohorts (Supplementary Data 22). Results are reported as risk for SCD, per unit increase in the average ms per allele. The PRS distribution (mean [SD]) was: 0.321 (0.02) for QT; 0.361 (0.02) for JT; 0.134 (0.008) for QRS. Therefore, findings are reported as log OR (95% CI). The lead variant at the NOS1AP locus for QT and JT (rs12042862, T-allele) was associated with increased risk for SCD (0.11 [0.036–0.190], P = 0.004), as previously reported43. There was no association observed between each PRS and SCD in the full sample (Supplementary Data 23). As the incidence of SCD is different between men and women, we performed sex-stratified analyses44. Increasing QT PRS was associated with SCD in females (8.2 [3.05-13.35], P = 1.8 × 10−3) with a concordant direction of effect across all studies. Sensitivity analyses in FinGesture suggested the association between the QT PRS and SCD in women, may be driven by non-ischemic aetiologies compared with ischaemic (P = 0.004 vs. 0.926 respectively).
Discussion
Our large-scale GWAS meta-analyses for QT and its components, QRS and JT, substantially advance our understanding of the genetic architecture of ventricular depolarization and repolarization. We more than double the number of autosomal loci associated with each trait and identify sex-specific effects at an X-chromosome locus (FAM9B). In addition to established processes, we report loci involved in energy metabolism and response to insulin, which have greater enrichment for ventricular repolarization. Extracellular matrix interactions, cell growth, and connective tissue components are significantly enriched among QRS-associated genes. We identify Mendelian genes for which a burden of rare variants are associated with these ECG measures (e.g. KCNQ1, KCNH2, TNNI3K). We also highlight potential therapeutic targets and together with the association of PRSs with AF, conduction disease and SCD, these indicate possible translational opportunities of our findings.
Previous knowledge of ventricular repolarization has centered on the role of cardiac ion channels, predominantly from the investigation of inherited arrhythmic syndromes3. However, ventricular repolarization is complex and influenced by multiple processes, as suggested by previous GWAS, and now advanced in our present study11,12. Our analyses have identified additional candidate genes involved in cardiomyocyte differentiation, tissue development, cardiac contraction and regulation of gene expression. In this study, we also report pathways related to insulin receptor and mTOR signaling, along with genes that implicate cardiac energy metabolism. Ion homeostasis is an energy-consuming process and mismatch in the supply and utilization of adenosine triphosphate (ATP) can lead to electrical and mechanical instability45. Appropriate cardiac energy utilization is therefore, necessary to maintain normal physiological activity. Candidate genes common to QT and JT within insulin related pathways include SLC2A4, PIGQ, and ABCC8. SLC2A4 (alias GLUT4), is a glucose transporter in cardiomyocytes with effects on cardiac contractility, development of hypertrophy, and susceptibility to atrial and ventricular arrhythmias46,47,48. PIGQ is involved in the biosynthesis of glycosylphosphatidylinositol-anchored proteins, which are involved in membrane protein transportation and cell surface protection49. Mutations involving these proteins cause cardiac-related glycosylation disorders, including congenital defects and arrhythmia50. ABCC8 (alias sulfonylurea receptor 1) modulates ATP-sensitive potassium channels and insulin release. It is a crucial component of sarcolemma K+ ATP channels in mouse atrial myocytes, however a similar role has not been identified in human cardiomyocytes51,52. In humans, the role of ABCC8 is predominantly within pancreatic beta cells, and therefore may indicate indirect effects on ventricular repolarization through insulin secretion53. Insulin is considered cardioprotective and receptors for insulin signaling are highly expressed in the heart54. Type-1 diabetic mice have altered ion channel kinetics in ventricular and atrial myocytes, that increase the risk of arrhythmia and can be reversed with insulin therapy55,56,57.
Sex differences in ventricular repolarization are well recognized and incorporated into clinical definitions for QT-prolongation58. Our study identified an X-chromosome locus (FAM9B), which may contribute to these differences through serum testosterone levels. In rat cardiomyocytes, testosterone upregulates KCNQ1 expression with a long-term effect on QT-shortening59. Androgen receptors are expressed in the atrial and ventricular myocardium in multiple species including humans of both sexes60,61. During puberty in males, appropriate shortening is driven by increasing testosterone, while the comparatively gradual development of a relatively hypogonadal state in post-pubertal males, may explain senescent increases58,62. Prolonged ventricular repolarisation is also a feature of several human diseases that share androgen deficiency as a common characteristic63,64,65,66. In addition to testosterone, a role for other hormones in ventricular repolarization is supported by the association of variants with QT and JT at the THRB locus, a nuclear hormone receptor for triiodothyronine67.
Loci for QRS in comparison, have greater enrichment in processes for vasculogenesis, cell growth, and embryonic development (Fig. 5). These include candidate genes encoding transcription factors (or their regulators) with roles in cellular proliferation and cardiac conduction system development (ID2, PRDM6 and PALLD)68,69,70,71. Gene sets were also enriched in connective tissues and cell types. PDE1A is an example of one of these genes. It encodes a cyclic nucleotide phosphodiesterase and regulates cardiac fibroblast activation and fibrosis formation72. Myocardial fibrosis is a pathophysiological process in ventricular remodeling, which impairs cardiac electrical conduction and increases the risk for arrhythmogenesis39,73.
In this study, we observed predominantly discordant directions of effect comparing overlapping QRS and JT loci. Despite the strong phenotypic and genetic correlation between QT and JT, the genetic contribution to the QT interval represents the combined effects of variants associated with QRS and JT. Therefore, overlap and shared biology are also observed across QT and QRS loci, along with a weak positive genetic correlation. These findings could inform drug development for arrhythmia, as genes or their encoded proteins could be targeted for their specific effects on predominantly ventricular depolarization or repolarization.
Inherited arrhythmic syndromes highlight the importance of rare variation on ECG traits and arrhythmic risk. However, our understanding of the relationship with common variation in the general population has previously been limited. A recent study suggests the phenotypic effects of rare variants on QT are modulated by polygenic variation in the general population74. Our gene-based meta-analyses identified a burden of rare coding variants associated with QT, JT, and QRS, in genes typically linked with inherited channelopathies (KCNQ1, KCNH2) and cardiomyopathies (MYH7, TNNI3K). Therefore, our findings support a continuum between the genetic architectures of polygenic traits and disorders that are classically considered monogenic, and highlight the utility of employing a rare-variant gene-based approach in large, unselected populations.
In PRS analyses, we observed decreased risk of AF with increasing QT and QRS PRSs. This is an opposite direction of effect compared with epidemiological studies using directly measured ECG intervals where an increase in QRS or QT is associated with increased risk of AF75,76,77. However, this relationship may be J-shaped, and an increased risk of AF is also observed in patients with short QT syndrome compared to the general population78,79. In addition, class-III anti-arrhythmics, used for the chemical cardioversion of AF and maintenance of sinus rhythm, inhibit hERG K + currents that both increase the atrial refractory period (thereby contributing to a protective effect) and prolong the QT interval80,81. However, our findings along with the association with conduction disease, may also reflect different biological information captured in the variance explained by the PRS, compared to the directly measured ECG trait; the latter being susceptible to modification by other factors such as coronary artery disease. This may also account for the differences observed when comparing phenotypic and genetic correlations of these traits. Additional research is warranted to investigate these observations. Furthermore, improved risk scores from our study could be used in future work to evaluate the modification of phenotypic expression in families with inherited channelopathies.
While cohorts have extracted ECG parameters using different methods, we believe the large sample sizes and averaging of effect estimates during meta-analysis should limit the impact of any variability on our findings. Of note, we did not observe substantial heterogeneity across results from previously unreported variants. Future GWAS meta-analyses could use the same algorithm across all cohorts to extract ECG phenotypes, but raw digitalized data are not available for all participants of the current study, so we were unable to do this without substantially reducing the total sample size. Our study includes extensive in-silico follow-up of variants; however, it does not identify causal relationships. Functional follow-up is warranted using the latest advances, including single-cell genomics to further evaluate the relationship of variants with gene-expression82 and gene-editing tools (e.g. CRISPR) to investigate the effects of coding and regulatory variants on target genes and cellular function in relevant cell types (e.g. human iPSC cardiomyocytes)83,84.
In summary, by analyzing the largest available sample size to date, we have substantially advanced the delineation of shared and distinct mechanisms influencing ventricular depolarization and repolarization. This work will inform functional follow-up and the prioritization of potential therapeutic targets for arrhythmia.
Methods
A summary of all input data and tools for each analysis performed in this study is provided in Supplementary Data 24.
Study cohorts
A total of 35 studies (involving 53 ancestry-specific sub-studies), including members of the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium85 contributed to this study (Supplementary Data 1). A maximum total of 252,977 individuals of European (84%), Hispanic (7.7%), African (6.7%), South and South-East Asian (<1%) ancestries were included. All participating institutions approved this project and informed consent was obtained for all individuals at the study level. Cohorts were predominantly population- or community-based with a small number of studies ascertained on a specific case status. Affymetrix or Illumina arrays were typically used for genotyping. Study-specific genotype quality control filters prior to imputation, including call rate, Hardy-Weinberg equilibrium (HWE) P-value, and minor allele frequency, are provided in Supplementary Data 2. All GWAS summary data utilized NCBI build 37. Most studies imputed ungenotyped rare and common variants using 1000G reference panels (40/53 sub-studies); the remainder used the Haplotype Reference Consortium (HRC) panel (r1.1 2016), (Supplementary Data 2)86,87.
Fourteen studies (including 31 sub-studies), imputed genotype data for X chromosome analysis. The pooled multi-ancestry sample size for X chromosome analyses was 86,600 (75,607 European, 7040 Hispanic, 1943 African, 709 South Asian, 590 South East Asian and 711 mixed) for females and 60,343 individuals (52,070 European, 5182 Hispanic, 1518 African, 806 South Asian, 479 South East Asian and 379 mixed) for males.
Cohort-level single variant association analyses
Single variant genome-wide association studies were performed by each participating cohort for QT, JT, and QRS. For each trait of interest, additive genetic models were implemented for two phenotypes: (1) the raw phenotype (on the millisecond (ms) scale)) and (2) the rank-based inverse normal transformation of each phenotype (on the standard deviation scale due to non-normal distributions of these traits). Per study summary statistics for each ECG measure and covariate are provided in Supplementary Data 3. Individuals were excluded at the study level for: prevalent myocardial infarction or heart failure, pregnancy at the time of recruitment, implantation of a pacemaker or implantable cardiac defibrillator, QRS duration >120 ms, or right or left bundle branch block or atrial fibrillation on ECG. The QRS duration criterion was used as a surrogate marker for bundle branch block and interventricular conduction delay that was not identified during ECG analysis. Additionally, if the data were available, individuals using digitalis, class I or III anti-arrhythmics or QT prolonging medication were excluded. These exclusions were chosen to reduce the risk of confounding in our analyses of ECG parameters, where the bulk of the power comes from normal variation of QT, JT, and QRS. This will have reduced the total sample size available, however the genetic contribution to ECG interval variation in these disease states could differ, warranting separate investigations. Cohorts including related individuals used appropriate software to account for this e.g BOLT linear mixed model software (BOLT-LMM)88 (which fits a linear mixed model on hard-called genotyped single nucleotide polymorphisms (SNPs)) or other software incorporating a kinship matrix or pedigree89,90,91. An imputation quality cut-off of Rsq >0.3 (or similar in IMPUTE) was applied in all cohorts to ensure high quality variants were included in the meta-analysis.
Covariates were included in the GWAS model and chosen for their known association with each ECG measure. These included age (years), sex (except in sex-stratified X chromosome analyses), RR interval (ms), height and body-mass index (BMI, kg/m2). Genetic principal components (PCs) were included to account for cryptic population stratification except in cohorts with pedigree data available or when analyses were performed using linear mixed models. As there may be ancestral differences in ECG measure, cohorts comprised of multiple ancestries performed separate analyses for each ancestry to control for underlying population stratification. Separate summary statistics for each ancestry were submitted for central analysis, and for secondary ancestry-specific meta-analyses. Additional cohort-specific covariates were included when deemed appropriate locally, for example, recruitment site or genotyping array. Autosome and X chromosome analyses were performed separately. For X chromosome analyses, male genotypes were coded as 0 or 2 and sex-stratified analyses were performed to account for random X chromosome inactivation in females. Pseudoautosomal regions were excluded from the analyses due to the high risk of genotyping errors in these regions.
Cohort level generation of covariance matrices for gene-based testing
To test for associations due to a burden of rare (MAF ≤ 0.01) variants with functional consequences within protein-coding genes, additional analyses were performed by participating cohorts using Rare variant test (Rvtests, version 2.0.6)92. Rvtests generates summary score statistics per variant using a separate matrix file containing the covariances between markers in a specified sliding window. To avoid associations driven by extreme outliers, these analyses were performed using only rank-based inverse normal transformed values for QT, JT, and QRS. To reduce computational demand, only variants with a MAF < 1% and Rsq >0.3 were included and LD windows of 500 kb were specified for construction of the genotype-covariance matrices. Rvtests uses a genomic kinship matrix to account for relatedness in each cohort and PCs were again included to adjust for residual population stratification. Analyses were only performed using individuals of European ancestry due to the potential of population differences in allele frequencies in rare variants. Only autosomes were included in these analyses.
Central quality control of study-level data
Quality control of all study-level GWAS summary statistics was performed centrally using the EasyQC R package (version 9.2)93. In brief, variants were aligned to either the 1000 G or HRC reference panel and allele frequency plots for each study were compared against the reference. Quantile – quantile (QQ) and P-value – Z-score statistics (P-Z) plots were visually inspected. Variants with invalid beta estimates, standard errors or P-values were removed. Per-study summary statistics were generated including standard error and beta estimate ranges. Genomic-control inflation factors (lambdas) were calculated to identify systematic inflation of test statistics, which can result from a variety of factors, including population stratification, and lead to a large number of true positive findings94.
GWAS Meta-Analysis
The meta-analysis workflow is summarized in Fig. 2. The primary analyses were pre-specified to be the multi-ancestry rank-based inverse normal transformed meta-analysis for each ECG trait, to avoid unstable normal approximation test statistics for low-frequency variants or outlier trait values. Ancestry-specific secondary meta-analyses were also performed for European, African, and Hispanic ancestries. Due to the lack of suitably sized replication datasets, we undertook a one-stage, single-discovery design. Additionally, to enable estimation of clinically recognizable effect sizes (inverse normal transformation produces results on a standard deviation scale), a meta-analysis for each trait and ancestry was also performed using the raw phenotype on the millisecond (ms) scale. All meta-analyses were conducted using an inverse variance-weighted, fixed effects model using METAL (version released 2011-03-25) and performed independently across two sites and checked for consistency95. Genomic control was applied during meta-analysis to studies in which the inflation factor (λ) was > 1.0. Summary genome-wide association (“Manhattan”) plots, QQ plots and lambdas for the entire meta-analysis were produced for each trait using qqman R package (v0.1.8). For all subsequent analyses, only variants present in >50% of the total sample size of the meta-analysis were included. Genome-wide significance (GWS) was defined as P\(\le \) 5 × 10−08. The 1000 Genome reference panel was used for calculating correlations between variants in downstream analyses, including all individuals for multi-ancestry summary statistics and individuals from relevant populations for European, African and Hispanic meta-analyses. Where pre-computed LD scores were required, or correlations calculated within tools that did not permit modification of individuals included in the reference panel, the European ancestry meta-analysis was used in place of the multi-ancestry recognizing a substantial proportion of the multi-ancestry meta-analysis included individuals of European descent. When this is the case, it is explicitly stated in the methodology and results.
Definition of known and novel loci
To identify novel associations in our results, we first defined boundaries for previously published loci (Supplementary Data 4) using the following process in PLINK(v1.9)96. Reported genome-wide significant (P\(\le \) 5 × 10−08) lead variants from each published GWAS were extracted and correlations calculated using the 1000 Genome phase 3 reference panel (Nov 2014)86 in a 4 Mb region centered on each variant. The locus start was defined as minus 50 kb from the most upstream variant that was in r2 > 0.1 with the lead variant and the locus end as plus 50 kb from the most downstream variant which was in r2 > 0.1. Overlapping boundaries were subsequently merged. This window was declared the genomic boundary for the locus or a minimum physical distance window of ±500 kb around the reported variant – whichever was larger. For the purpose of these analyses, as QT and JT phenotypes are highly correlated phenotypes (Fig. 1), previously reported JT and QT loci were pooled. A separate list was compiled for previously described QRS duration loci. Novel loci in our meta-analyses were identified by applying the same approach to variants meeting the GWS threshold in our results. Variants within loci boundaries not overlapping with known loci were declared novel associations. Subsequently, to evaluate for evidence of heterogeneity at each locus, a forest plot was produced for the lead variant using the R-package Metaviz (version 0.3.0) and manually inspected along with the I2 heterogeneity index for that variant. Finally, Locus-Zoom plots were generated for each locus to visually inspect correlations (r2) between lead variants and surrounding markers and their associated P-values using LocusZoom (v0.12)97.
Conditional and heritability analyses
We sought to determine whether any variants at a given locus were conditionally independent (i.e. independent signals of association). Conditional analyses were performed using Genome-wide Complex Trait Analysis (GCTA, v1.26.0), for loci that achieved genome-wide significance in the European-ancestry analysis with a reference sample of 52,230 individuals of European ancestry from UK Biobank18. Related pairs up to the 2nd-degree (kinship coefficient< 0.0884) were excluded. Due to insufficient reference sample size and an inability to effectively reproduce the ancestral mix, conditional analyses were not performed for other ancestries or the multi-ancestry meta-analyses. Stringent thresholds of PJoint < 5 × 10−08 and minimal correlation (r2 < 0.1) with the lead variant were used to declare a variant “conditionally independent”. Using the same dataset, heritability estimates for each trait in European samples were obtained using BOLT-Restricted Maximum Likelihood (REML, v2.3.2), which applies variance components analysis using modeled directly genotyped SNPs to calculate SNP-based heritability88. The percent variance explained (PVE) by lead and conditionally independent variants was subsequently calculated (Eq. 1)98;
The total PVE by all lead and conditionally independent variants was calculated as the sum of each variant’s PVE. The heritability explained was the total PVE divided by the heritability of the trait.
LD score regression
To calculate the genetic correlation between ECG traits, LD score regression was performed using LD Score (LDSC) software (v1.0.1)24. European meta-analysis summary statistics were filtered to include only variants present in the International HapMap Project (~1.1 million variants), along with pre-computed LD scores using the 1000 G reference panel provided by LDSC24. LDSC uses the LD scores as regression weights and subsequently calculates the genetic correlation using intersecting SNPs from each meta-analysis25.
Gene-based testing meta-analysis
Gene-based meta-analysis was performed using the R package rareMETALs (v.7.1)99. Analyses were restricted to up to 192,780 individuals of European descent (from 37 studies) due to potential differences in the allele frequency of rare (MAF≤ 0.01) variants between populations. QC of study-level data was performed as described above for the single-variant meta-analysis. Variants from all studies, however, were subsequently filtered to only include those predicted by VEP (Ensembl release 99) to have high or moderate impact (and thus be protein-altering). Score and covariance files used as input for gene-based meta-analyses in rareMETALS were generated using Rvtests as described above92. Gene-based meta-analysis was subsequently performed for inverse-normal transformed QT (N = 192,780), JT (N = 192,501) and QRS (N = 192,495) using Sequence Kernel Association Testing (SKAT)27, which considers the joint effects of multiple variants on the phenotype, while taking into account the effect size and direction of effect of each variant. Power under SKAT is maximal for genetic variation in a gene that causally increases and decreases a quantitative trait, but is less powered to detect genetic effects that all influence a quantitative trait in one direction. Gene-based meta-analysis was conducted for 18,751 genes that had more than one rare (MAF ≤ 0.01) variant annotated as high or moderate impact. A gene-based test was considered significant if P < 2.5 × 10−06 (Bonferroni correction for ~20,000 tested genes).
To follow-up on gene associations passing Bonferroni correction, additional conditional analyses were performed for each of the three traits. First, to confirm that the gene-based association is not solely driven by one rare variant, conditional gene-based meta-analyses were repeated while conditioning on the most significant variant in the gene. Genes were considered significant if Pconditional < 0.05/number of genes tested for the trait (13 genes for QT, 16 genes for JT and 3 genes for QRS), upon gene-based analysis conditioned on the most significant variant. Second, conditional analyses were restricted to 76,202 individuals from the UK Biobank to ensure a common set of variants were able to be examined. To confirm that the gene-based association was not solely driven by flanking low-frequency and common (MAF > 0.01) QT/JT/QRS variants at the locus in which the gene is located regardless of their annotation with VEP, (P < 5 × 10−08 and r2 < 0.1 or ±500 kb from the lead variant in the respective GWAS meta-analysis) analyses were repeated while conditioning on all independent variants at the locus. Furthermore, additional conditional analyses were conducted for associated genes that reside in the same locus. This was done to ensure that these are independent gene associations in the same locus and not attributable to rare variants in the other gene in the locus.
Biological annotation of GWAS loci
Identification of variant consequences
To identify variants with potential functional consequences, we annotated lead and conditionally independent variants, and their proxies (r2 > 0.8), using VEP (Ensembl release 99)26 to extract information on the impact of a variant on a transcript or protein including their deleteriousness scores using the Sorting Intolerant From Tolerant algorithm (SIFT, version 5.2.2)100 and PolyPhen-2 (Version 2.2.2101). Additionally, Combined Annotation Dependent Depletion (CADD, v1.4)102 and RegulomeDB (v.2.0.3)103 rank scores for these variants were extracted. CADD scores correlate with pathogenicity of both coding and non-coding variants and rate a variant according to its deleteriousness within the genome102. RegulomeDB annotates variants with known and predicted regulatory elements in intergenic regions including regions of Dnase hypersensitivity, transcription factors binding sites, and promotor regions utilizing publicly available datasets103.
Association with tissue-specific gene expression
To evaluate correlations between GWAS variants and tissue-specific gene expression, data from the Genotype-Tissue Expression project (GTEx, v8)33,104,105 were extracted for tissues relevant to cardiac electrophysiology including cardiac, vascular (coronary artery and aorta) and brain (as autonomic regulation influence ECG traits). First, lead and conditionally independent variants and their proxies (r2 > 0.8), were checked to determine whether they overlapped with the lead variant at an expression quantitative trait locus (eQTL) for each tissue. Additionally, to determine whether the same variant may be causal in both our GWAS meta-analysis and the original eQTL study, colocalization analyses were performed using the R package COLOC (v5.1.0), which uses Bayesian methods to determine the correlation between variants from the two datasets106. A posterior probability of >75% was used to determine significance.
Tissue- and cell-type specific regulatory elements
To identify tissue-specific enrichment of variants in DnaseI hypersensitivity sites, we used GWAS Analysis of Regulatory and Functional Information Enrichment with LD correction (GARFIELD, v2)37. GARFIELD performs greedy pruning of GWAS SNPs (r2 > 0.1) and annotates them based on overlapping functional information to assess the enrichment of association signals with features extracted from the ENCODE, GENCODE and Roadmap Epigenomics projects37. Odds ratios are quantified and assessed using a generalized linear model framework while matching for MAF, distance to the nearest transcription start site, and number of proxies (r2 > 0.8).
We sought to identify potential target genes of regulatory variants using long-range chromatin interaction (Hi-C) data analyzed using FUMA GWAS (Functional Mapping and Annotation of Genome-Wide Association Studies) software (v1.3.6)107. Within FUMA, pre-processed significant loops computed by Fit-Hi-C pipelines filtered at an FDR < 0.05 and overlap with lead and conditionally independent variants and their proxies (r2 > 0.8) was identified34. Additionally, we utilized recently published promoter capture Hi-C data which uses loops called from Knight-Ruiz normalized 5 kb, 10 kb and 25 kb resolution data35. Promoter interactions in left and right ventricular tissue for potential regulatory variants were extracted and variants with the highest regulatory potential were determined using a RegulomeDB score cut-off of ≤3b103.
To identify cardiac cell-type specific functional effects of non-coding variants, we integrated GWAS variants with cell-type chromatin marks from single nucleus Assay for Transposase-Accessible Chromatin using sequencing (snATAC-seq) data36. These data contain open/accessible chromatin information for nine cell types obtained from the heart, including atrial and ventricular cardiomyocyte, smooth muscle, endothelial, adipocyte, macrophage, fibroblast, lymphocyte and nervous cells. Haplotype blocks were created for each lead and conditionally independent variant including variants with r2 > 0.1 within a 2 Mb radius. Subsequently, using a SNP enrichment method, CHEERS (Chromatin Element Enrichment Ranking by Specificity, v2019)108, the peaks with the lowest 10th percentile of total read counts from the snATAC-seq data were removed, the peak counts were subsequently quantile normalized, and the Euclidean distance calculated108. A one-sided P-value for enrichment of variants in estimated haplotype blocks within cell-type specific ATAC-seq peaks was calculated and a Bonferroni-corrected threshold used to declare significance (0.05/number of cell types).
Candidate gene prioritization and pathway enrichment
To prioritize candidate genes at each locus, we used Data-driven Expression-Prioritized Integration for Complex Traits (DEPICT, v3). DEPICT prioritizes the most likely causal genes at associated loci according to common functional pathways using reconstituted gene sets containing a membership probability for each gene in the genome38. Additionally, it highlights enriched pathways and tissues/cell types where genes from associated loci are highly expressed. DEPICT uses a clumping method (r2 = 0.1, window size = 250 kb, P = 5 × 10−08) to identify uncorrelated variants from each meta-analysis using 1000 G reference data after excluding the major histocompatibility complex region on chromosome 6. Gene-set enrichment analysis was conducted based on 14,461 predefined reconstituted gene sets from various databases and data types, including Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), REACTOME, phenotypic gene sets derived from the Mouse genetics initiative, and molecular pathways derived from protein–protein interactions. Finally, tissue and cell type enrichment analyses were performed based on expression information in any of the 209 Medical Subject Heading (MeSH) annotations for the 37,427 human Affymetrix HGU133a2.0 platform microarray probes. Cytoscape (version 3.8.2, https://cytoscape.org/)109 was used to visualize significantly enriched (FDR < 0.01) DEPICT GO biological processes for each ECG trait. Processes were connected by overlap of significantly enriched genes (minimum number of 25% of member genes). Orphan pathways or those with less than three edges were excluded. Each module was labeled with a common theme to represent the group of biological processes.
DEPICT requires at least ten genome-wide significant loci to be able to perform the analysis. Therefore, for African and Hispanic ancestries, candidate genes were identified using g:Profiler110, a functional enrichment tool for the annotation of a list of genes. It also enables mapping of variants to gene names, where they overlap with at least one protein coding Ensembl gene with annotation of predicted variant effects.
In addition to DEPICT gene prioritization results, the list for each locus was supplemented with candidate genes highlighted by these bioinformatic analyses. A literature review was performed which also included a look up of genes using the Online Mendelian Inheritance in Man (OMIM, https://www.omim.org/) and Mouse Genome Informatics (MGI, http://www.informatics.jax.org/) databases.
Druggability analyses
To identify potential novel drug targets from our GWAS findings, we interrogated a previously published druggable gene set database developed by Finan et al. which includes detailed information on methods used to assemble and annotate the dataset40. In brief, this reference set contains predominantly protein-coding genes (as annotated from the Ensembl v.73). Genes were subsequently assembled into three tiers: Tier 1 includes targets of approved drugs and drugs in clinical development including targets of small molecule and biotherapeutic drugs. Tier 2 incorporates proteins closely related to drug targets or associated with drug-like compounds. Genes where one or more Ensembl peptide sequence shared ≥50% identity (over ≥75% of the sequence) with an approved drug target were included. Tier 3 incorporated extracellular proteins and members of key drug-target families (including G protein-coupled receptors, kinases, ion channels, nuclear hormone receptors, and phosphodiesterases). Using the list of “most likely candidate genes” at unreported GWAS loci and previously reported candidate genes for QT, JT and QRS, a look up was performed in the database. Genes which are existing drug targets for anti-arrhythmic drugs (annotated using KEGG drug (https://www.genome.jp/) were excluded. As genes which were significant findings in cardiac tissue-specific eQTL co-localization and Hi-C analyses may be favored for prioritization, a look up was performed in our data to highlight these.
Association between genetically determined QT, JT, and QRS and relevant cardiovascular diseases
To determine relationships of genetically pre-determined QT JT QRS with cardiovascular diseases PRSs were constructed using the lead variants from the European-ancestry meta-analyses and tested for association with prevalent cases of atrial fibrillation, stroke, coronary artery disease, heart failure, non-ischaemic cardiomyopathy, conduction disease (“AVB or PPM implantation” and “fascicular block/BBB”) and ventricular arrhythmia in the UK Biobank. Secondary analyses were also performed to test for association with subgroups including myocardial infarction and stroke. Outcomes were identified using self-reported data, operation codes, ICD-9/ICD-10 codes from hospital episodes statistics and mortality registry (Supplementary Data 25). Analyses were performed in individuals of European-ancestry without ECGs and therefore not included in the GWAS meta-analysis and related pairs up to the 2nd-degree (kinship coefficient <0.0884) were excluded. To take advantage of genotype probability data in BGEN format, PRSice-2 (v2.3.3)111 was used to calculate the PRS. The PRS was calculated by summing the dosage of the ECG trait prolonging allele, weighted by the effect size from the corresponding GWAS. Associations with prevalent cases were identified using logistic regression including covariates age, sex, 10 PCs and genotype array. A Bonferroni-corrected threshold of 0.05/number of outcomes tested (0.05/7 = 7.1 × 10−3) was used to determine significant associations. P-values < 0.05 but greater than 7.1 × 10−3 were considered as suggestive associations.
To determine whether genetically determined QT, JT, and QRS are associated with SCD, PRS were constructed for each trait and tested in four cohorts: the Atherosclerosis Risk in Communities study (ARIC)112,113, the Cardiac Arrest Blood study (CABS)114, the Finnish Genetic Study for Arrhythmic Events (FinGesture) and Northern Finland Birth Cohort of 1966 (NFBC1966)115. The latter three are independent cohorts as they were not included in the GWAS meta-analysis. While ARIC was included in the GWAS meta-analysis, it contributes a relatively small proportion of the GWAS meta-analysis sample size and thus effects on beta-estimates would be negligible. Individual study information and case definitions used are available in Supplementary Data 22. As FinGesture contains only cases, population controls were taken from NFBC1966116. For these analyses, only individuals of European-ancestry were included, due to low sample sizes of other ancestries. PRSs were constructed by averaging the dosage of each lead variant allele associated with prolongation of the ECG trait being tested, from the European ancestry meta-analysis, weighted by the effect size from the corresponding raw-phenotype meta-analysis. To ensure only high-quality variants were included, each study applied an imputation quality threshold (Rsq > 0.8). The PRS was included in a logistic regression (Cox for ARIC) model along with covariates age (when appropriate), sex, 10 PCs and genotyping array (when appropriate). Sex-stratified analyses were performed to identify sex-specific effects. Per study summary statistics were subsequently meta-analyzed using an inverse-variance weighted fixed effects model with the R package ‘Meta’ (v.5.5.0)117.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
Summary statistics from each genome-wide association study meta-analysis will be made available on the NHGRI-EBI Catalog of human genome-wide association studies website, https://www.ebi.ac.uk/gwas/. Electrocardiographic phenotype data derived from UK Biobank digitalized signals, will be returned to the study. The UK Biobank will make these data available to all bona fide researchers for all types of health-related research that is in the public interest, without preferential or exclusive access for any person. All researchers will be subject to the same application process and approval criteria as specified by the UK Biobank. Please see the UK Biobank’s website for the detailed access procedure (http:/www.ukbiobank.ac.uk/register-apply/). Other datasets used in these analyses are publicly available and can be sourced from: 1000 Genomes reference panel: https://www.internationalgenome.org/category/reference/; Haplotype reference consortium reference panel: http://www.haplotype-reference-consortium.org/; Variant level annotation from Variant Effect Predictor (VEP), Ensembl release 99: https://www.ensembl.org/info/docs/tools/vep/index.html; Variant level Combined Annotation Dependent Depletion scores from Combined Annotation Dependent Depletion (CADD, v1.4): https://cadd.gs.washington.edu/; Variant level tissue-specific gene expression from The GTEx portal (v8): https://gtexportal.org/home/; HiC data from the Functional Mapping and Annotation of Genome-Wide Association Studies (FUMA GWAS, v.1.3.6): https://fuma.ctglab.nl/; DNaseI hypersensivity site enrichment data from GWAS Analysis of Regulatory and Functional Information Enrichment with LD correction (GARFIELD, v2): https://www.ebi.ac.uk/birney-srv/GARFIELD/; Gene-set, biological pathways and tissue expression data from Data-driven Expression-Prioritization Integration for Complex Traits (DEPICT, v3): https://github.com/perslab/depict; Variant level RegulomeDB scores from RegulomeDB (v.2.0.3): https://regulomedb.org/regulome-search/; A compendium of promoter-centered long-range chromatin interactions in the human genome (Jung et al., 2019): https://doi.org/10.1038/s41588-019-0494-8; Cardiac cell type-specific gene regulatory programs and disease risk association (Hocker et al., 2021): https://doi.org/10.1126/sciadv.abf1444; Druggable genome dataset from Finan et al., 2017: DOI: 10.1126/scitranslmed.aag1166; g:Profiler (accessed May 2021): https://biit.cs.ut.ee/gprofiler/gost; Online Mendelian Inheritance in Man database: https://www.omim.org/; Mouse Genome Informatics: http://www.informatics.jax.org/; KEGG drug database: (https://www.genome.jp/).
Code availability
Codes are available from the original software used for each analysis (see methods and Reporting Summary).
References
Krittayaphong, R. et al. Electrocardiographic predictors of cardiovascular events in patients at high cardiovascular risk: a multicenter study. J. Geriatr. Cardiol. 16, 630–638 (2019).
Niemeijer, M. N., van den Berg, M. E., Eijgelsheim, M., Rijnbeek, P. R. & Stricker, B. H. Pharmacogenetics of drug-induced QT interval prolongation: an update. Drug Saf. 38, 855–867 (2015).
Schwartz, P. J., Crotti, L. & Insolia, R. Long-QT syndrome: from genetics to management. Circ. Arrhythm. Electrophysiol. 5, 868–877 (2012).
Straus, S. M. et al. Prolonged QTc interval and risk of sudden cardiac death in a population of older adults. J. Am. Coll. Cardiol. 47, 362–367 (2006).
Zhang, Y. et al. Electrocardiographic QT interval and mortality: a meta-analysis. Epidemiology 22, 660–670 (2011).
Tester, D. J. & Ackerman, M. J. Genetics of long QT syndrome. Methodist Debakey Cardiovasc J. 10, 29–33 (2014).
Lahrouchi, N. et al. Transethnic genome-wide association study provides insights in the genetic architecture and heritability of long QT syndrome. Circulation 142, 324–338 (2020).
Jamshidi, Y., Nolte, I. M., Spector, T. D. & Snieder, H. Novel genes for QTc interval. How much heritability is explained, and how much is left to find? Genome Med. 2, 35 (2010).
Wilde, A. A. M., Amin, A. S. & Postema, P. G. Diagnosis, management and therapeutic strategies for congenital long QT syndrome. Heart 108, 332–338 (2021).
Crow, R. S., Hannan, P. J. & Folsom, A. R. Prognostic significance of corrected QT and corrected JT interval for incident coronary heart disease in a general population sample stratified by presence or absence of wide QRS complex: the ARIC Study with 13 years of follow-up. Circulation 108, 1985–1989 (2003).
Bihlmeyer, N. A. et al. Exomechip-wide analysis of 95 626 individuals identifies 10 novel loci associated with QT and JT intervals. Circ. Genom. Precis Med. 11, e001758 (2018).
Arking, D. E. et al. Genetic association study of QT interval highlights role for calcium signaling pathways in myocardial repolarization. Nat. Genet. 46, 826–836 (2014).
Sotoodehnia, N. et al. Common variants in 22 loci are associated with QRS duration and cardiac ventricular conduction. Nat. Genet. 42, 1068–1076 (2010).
Desplantez, T., Dupont, E., Severs, N. J. & Weingart, R. Gap junction channels and cardiac impulse propagation. J. Membr. Biol. 218, 13–28 (2007).
Silva, C. T. et al. Heritabilities, proportions of heritabilities explained by GWAS findings, and implications of cross-phenotype effects on PR interval. Hum. Genet. 134, 1211–1219 (2015).
Duijvenboden, S. et al. Genomic and pleiotropic analyses of resting QT interval identifies novel loci and overlap with atrial electrical disorders. Hum. Mol. Genet. 30, 2513–2523 (2021).
van der Harst, P. et al. 52 genetic loci influencing myocardial mass. J. Am. Coll. Cardiol. 68, 1435–1448 (2016).
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Eriksson, A. L. et al. Genetic determinants of circulating estrogen levels and evidence of a causal effect of estradiol on bone density in men. J. Clin. Endocrinol. Metab. 103, 991–1004 (2018).
Kemp, J. P. et al. Identification of 153 new loci associated with heel bone mineral density and functional involvement of GPC6 in osteoporosis. Nat. Genet. 49, 1468–1475 (2017).
Yap, C. X. et al. Dissection of genetic variation and evidence for pleiotropy in male pattern baldness. Nat. Commun. 9, 5407 (2018).
Jin, G. et al. Genome-wide association study identifies a new locus JMJD1C at 10q21 that may influence serum androgen levels in men. Hum. Mol. Genet. 21, 5222–5228 (2012).
Martinez-Garay, I. et al. A new gene family (FAM9) of low-copy repeats in Xp22.3 expressed exclusively in testis: implications for recombinations in this region. Genomics 80, 259–267 (2002).
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
McLaren, W. et al. The ensembl variant effect predictor. Genome Biol. 17, 122 (2016).
Wu, M. C. et al. Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Hum. Genet. 89, 82–93 (2011).
Milano, A., Lodder, E. M. & Bezzina, C. R. TNNI3K in cardiovascular disease and prospects for therapy. J. Mol. Cell Cardiol. 82, 167–173 (2015).
Lal, H., Ahmad, F., Parikh, S. & Force, T. Troponin I-interacting protein kinase: a novel cardiac-specific kinase, emerging as a molecular target for the treatment of cardiac disease. Circ. J. 78, 1514–1519 (2014).
Nishio, Y. et al. D85N, a KCNE1 polymorphism, is a disease-causing gene variant in long QT syndrome. J. Am. Coll. Cardiol. 54, 812–819 (2009).
Wang, H. et al. Mutations in NEXN, a Z-disc gene, are associated with hypertrophic cardiomyopathy. Am. J. Hum. Genet 87, 687–693 (2010).
Zhang, X. L. et al. Genetic basis and genotype-phenotype correlations in han chinese patients with idiopathic dilated cardiomyopathy. Sci. Rep. 10, 2226 (2020).
GTEx Consortium. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Schmitt, A. D. et al. A compendium of chromatin contact maps reveals spatially active regions in the human genome. Cell Rep. 17, 2042–2059 (2016).
Jung, I. et al. A compendium of promoter-centered long-range chromatin interactions in the human genome. Nat. Genet. 51, 1442–1449 (2019).
Hocker, J. D. et al. Cardiac cell type-specific gene regulatory programs and disease risk association. Sci. Adv. 7, eabf1444 (2021).
Iotchkova, V. et al. GARFIELD classifies disease-relevant genomic features through integration of functional annotations with association signals. Nat. Genet 51, 343–353 (2019).
Pers, T. H. et al. Biological interpretation of genome-wide association studies using predicted gene functions. Nat. Commun. 6, 5890 (2015).
Perestrelo, A. R. et al. Multiscale analysis of extracellular matrix remodeling in the failing heart. Circ. Res 128, 24–38 (2021).
Finan, C. et al. The druggable genome and support for target identification and validation in drug development. Sci. Transl. Med. 9 (2017).
Lodola, F. et al. Adeno-associated virus-mediated CASQ2 delivery rescues phenotypic alterations in a patient-specific model of recessive catecholaminergic polymorphic ventricular tachycardia. Cell Death Dis. 7, e2393 (2016).
Li, W. et al. PLK2 modulation of enriched TAp73 affects osteogenic differentiation and prognosis in human osteosarcoma. Cancer Med. 9, 4371–4385 (2020).
Mitchell, R. N. et al. Effect of sex and underlying disease on the genetic association of QT interval and sudden cardiac death. J. Am. Heart Assoc. 8, e013751 (2019).
Ågesen, F. N. et al. Temporal trends and sex differences in sudden cardiac death in the Copenhagen City Heart Study. Heart 107, 1303–1309 (2021).
Barth, A. S. & Tomaselli, G. F. Cardiac metabolism and arrhythmias. Circ. Arrhythm. Electrophysiol. 2, 327–335 (2009).
Huang, J. P., Huang, S. S., Deng, J. Y. & Hung, L. M. Impairment of insulin-stimulated Akt/GLUT4 signaling is associated with cardiac contractile dysfunction and aggravates I/R injury in STZ-diabetic rats. J. Biomed. Sci. 16, 77 (2009).
Slot, J. W., Geuze, H. J., Gigengack, S., James, D. E. & Lienhard, G. E. Translocation of the glucose transporter GLUT4 in cardiac myocytes of the rat. Proc. Natl Acad. Sci. USA 88, 7815–7819 (1991).
Maria, Z., Campolo, A. R., Scherlag, B. J., Ritchey, J. W. & Lacombe, V. A. Dysregulation of insulin-sensitive glucose transporters during insulin resistance-induced atrial fibrillation. Biochim. Biophys. Acta Mol. Basis Dis. 1864, 987–996 (2018).
Bellai-Dussault, K., Nguyen, T. T. M., Baratang, N. V., Jimenez-Cruz, D. A. & Campeau, P. M. Clinical variability in inherited glycosylphosphatidylinositol deficiency disorders. Clin. Genet. 95, 112–121 (2019).
Manea, E. A step closer in defining glycosylphosphatidylinositol anchored proteins role in health and glycosylation disorders. Mol. Genet. Metab. Rep. 16, 67–75 (2018).
Fedorov, V. V. et al. Effects of KATP channel openers diazoxide and pinacidil in coronary-perfused atria and ventricles from failing and non-failing human hearts. J. Mol. Cell Cardiol. 51, 215–225 (2011).
Flagg, T. P. et al. Differential structure of atrial and ventricular KATP: atrial KATP channels require SUR1. Circ. Res 103, 1458–1465 (2008).
Gloyn, A. L., Siddiqui, J. & Ellard, S. Mutations in the genes encoding the pancreatic beta-cell KATP channel subunits Kir6.2 (KCNJ11) and SUR1 (ABCC8) in diabetes mellitus and hyperinsulinism. Hum. Mutat. 27, 220–231 (2006).
Abel, E. D. Insulin signaling in the heart. Am. J. Physiol. Endocrinol. Metab. 321, E130–E145 (2021).
Polina, I. et al. Loss of insulin signaling may contribute to atrial fibrillation and atrial electrical remodeling in type 1 diabetes. Proc. Natl Acad. Sci. USA 117, 7990–8000 (2020).
Lu, Z. et al. Decreased L-type Ca2+ current in cardiac myocytes of type 1 diabetic Akita mice due to reduced phosphatidylinositol 3-kinase signaling. Diabetes 56, 2780–2789 (2007).
Lu, Z. et al. Increased persistent sodium current due to decreased PI3K signaling contributes to QT prolongation in the diabetic heart. Diabetes 62, 4257–4265 (2013).
Rautaharju, P. M. et al. Sex differences in the evolution of the electrocardiographic QT interval with age. Can. J. Cardiol. 8, 690–695 (1992).
Masuda, K. et al. Testosterone-mediated upregulation of delayed rectifier potassium channel in cardiomyocytes causes abbreviation of QT intervals in rats. J. Physiol. Sci. 68, 759–767 (2018).
Marsh, J. D. et al. Androgen receptors mediate hypertrophy in cardiac myocytes. Circulation 98, 256–261 (1998).
McGill, H. C., Anselmo, V. C., Buchanan, J. M. & Sheridan, P. J. The heart is a target organ for androgen. Science 207, 775–777 (1980).
Gray, A., Feldman, H. A., McKinlay, J. B. & Longcope, C. Age, disease, and changing sex hormone levels in middle-aged men: results of the Massachusetts Male Aging Study. J. Clin. Endocrinol. Metab. 73, 1016–1025 (1991).
Mohamed, R., Forsey, P. R., Davies, M. K. & Neuberger, J. M. Effect of liver transplantation on QT interval prolongation and autonomic dysfunction in end-stage liver disease. Hepatology 23, 1128–1134 (1996).
Bidoggia, H. et al. Sex differences on the electrocardiographic pattern of cardiac repolarization: possible role of testosterone. Am. Heart J. 140, 678–683 (2000).
Charbit, B. et al. Effects of testosterone on ventricular repolarization in hypogonadic men. Am. J. Cardiol. 103, 887–890 (2009).
Schwartz, J. B. et al. Effects of testosterone on the Q-T interval in older men and older women with chronic heart failure. Int J. Androl. 34, e415–e421 (2011).
Hönes, G. S. et al. Noncanonical thyroid hormone signaling mediates cardiometabolic effects in vivo. Proc. Natl Acad. Sci. USA 114, E11323–E11332 (2017).
Moskowitz, I. P. et al. A molecular pathway including Id2, Tbx5, and Nkx2-5 required for cardiac conduction system development. Cell 129, 1365–1376 (2007).
Al Sayed, Z. R. et al. Human model of IRX5 mutations reveals key role for this transcription factor in ventricular conduction. Cardiovasc. Res. 117, 2092–2107 (2020).
Wu, L. et al. Bone morphogenetic protein 4 promotes the differentiation of Tbx18-positive epicardial progenitor cells to pacemaker-like cells. Exp. Ther. Med 17, 2648–2656 (2019).
Gewies, A. et al. Prdm6 is essential for cardiovascular development in vivo. PLoS ONE 8, e81833 (2013).
Miller, C. L. et al. Cyclic nucleotide phosphodiesterase 1A: a key regulator of cardiac fibroblast activation and extracellular matrix remodeling in the heart. Basic Res Cardiol. 106, 1023–1039 (2011).
Ten Tusscher, K. H. & Panfilov, A. V. Influence of diffuse fibrosis on wave propagation in human ventricular tissue. Europace 9, vi38–vi45 (2007).
Nauffal, V. et al. Monogenic and polygenic contributions to QTc prolongation in the population. medRxiv, 2021.06.18.21258578 (2021).
Cho, M. S. et al. Clinical implications of ventricular repolarization parameters on long-term risk of atrial fibrillation- longitudinal follow-up data from a general ambulatory Korean population. Circ. J. 84, 1067–1074 (2020).
Zhang, N. et al. Prolonged corrected QT interval in predicting atrial fibrillation: a systematic review and meta-analysis. Pacing Clin. Electrophysiol. 41, 321–327 (2018).
Aeschbacher, S. et al. Relationship between QRS duration and incident atrial fibrillation. Int J. Cardiol. 266, 84–88 (2018).
Borggrefe, M. et al. Short QT syndrome. Genotype-phenotype correlations. J. Electrocardiol. 38, 75–80 (2005).
Nielsen, J. B. et al. J-shaped association between QTc interval duration and the risk of atrial fibrillation: results from the Copenhagen ECG study. J. Am. Coll. Cardiol. 61, 2557–2564 (2013).
Feld, G. K. & Cha, Y. Electrophysiologic effects of the new class III antiarrhythmic drug dofetilide in an experimental canine model of pacing-induced atrial fibrillation. J. Cardiovasc Pharm. Ther. 2, 195–203 (1997).
Riera, A. R. et al. Relationship among amiodarone, new class III antiarrhythmics, miscellaneous agents and acquired long QT syndrome. Cardiol. J. 15, 209–219 (2008).
Tanay, A. & Regev, A. Scaling single-cell genomics from phenomenology to mechanism. Nature 541, 331–338 (2017).
Rao, S., Yao, Y. & Bauer, D. E. Editing GWAS: experimental approaches to dissect and exploit disease-associated genetic variation. Genome Med. 13, 41 (2021).
Krause, M. D. et al. Genetic variant at coronary artery disease and ischemic stroke locus 1p32.2 regulates endothelial responses to hemodynamics. Proc. Natl Acad. Sci. USA 115, E11349–E11358 (2018).
Psaty, B. M. et al. Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium: design of prospective meta-analyses of genome-wide association studies from 5 cohorts. Circ. Cardiovasc. Genet. 2, 73–80 (2009).
Auton, A. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 48, 1279–1283 (2016).
Loh, P. R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat. Genet. 47, 284–290 (2015).
O’Connell, JR. MMAP User Guide. http://edn.som.umaryland.edu/mmap/index.php. Accessed 7 December 2020.
Zhou, W. et al. Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat. Genet. 50, 1335–1341 (2018).
Kang, H. M. et al. Variance component model to account for sample structure in genome-wide association studies. Nat. Genet. 42, 348–354 (2010).
Zhan, X., Hu, Y., Li, B., Abecasis, G. R. & Liu, D. J. RVTESTS: an efficient and comprehensive tool for rare variant association analysis using sequence data. Bioinformatics 32, 1423–1426 (2016).
Winkler, T. W. et al. Quality control and conduct of genome-wide association meta-analyses. Nat. Protoc. 9, 1192–1212 (2014).
Devlin, B. & Roeder, K. Genomic control for association studies. Biometrics 55, 997–1004 (1999).
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Pruim, R. J. et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26, 2336–2337 (2010).
Shim, H. et al. A multivariate genome-wide association analysis of 10 LDL subfractions, and their response to statin treatment, in 1868 Caucasians. PLoS ONE 10, e0120758 (2015).
Feng, S., Liu, D., Zhan, X., Wing, M. K. & Abecasis, G. R. RAREMETAL: fast and powerful meta-analysis for rare variants. Bioinformatics 30, 2828–2829 (2014).
Kumar, P., Henikoff, S. & Ng, P. C. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat. Protoc. 4, 1073–1081 (2009).
Adzhubei, I. A. et al. A method and server for predicting damaging missense mutations. Nat. Methods 7, 248–249 (2010).
Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315 (2014).
Boyle, A. P. et al. Annotation of functional variation in personal genomes using RegulomeDB. Genome Res 22, 1790–1797 (2012).
Battle, A. et al. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).
GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014).
Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017).
Soskic, B. et al. Chromatin activity at GWAS loci identifies T cell states driving complex immune diseases. Nat. Genet. 51, 1486–1493 (2019).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Raudvere, U. et al. g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res. 47, W191–W198 (2019).
Choi, S. W. & O’Reilly, P. F. PRSice-2: polygenic Risk Score software for biobank-scale data. Gigascience 8, giz082 (2019).
The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators. Am. J. Epidemiol. 129, 687–702 (1989).
Wright, J. D. et al. The ARIC (Atherosclerosis Risk In Communities) Study: JACC Focus Seminar 3/8. J. Am. Coll. Cardiol. 77, 2939–2959 (2021).
Sotoodehnia, N. et al. Beta2-adrenergic receptor genetic variants and risk of sudden cardiac death. Circulation 113, 1842–1848 (2006).
Haukilahti, M. A. E. et al. Sudden cardiac death in women. Circulation 139, 1012–1021 (2019).
Sabatti, C. et al. Genome-wide association analysis of metabolic traits in a birth cohort from a founder population. Nat. Genet. 41, 35–46 (2009).
Balduzzi, S., Rücker, G. & Schwarzer, G. How to perform a meta-analysis with R: a practical tutorial. Evid. Based Ment. Health 22, 153–160 (2019).
Acknowledgements
All study acknowledgements can be found in Supplementary Note 3, and study funding information in Supplementary Note 4. William J Young was supported by an MRC grant MR/R017468/1. Alvaro Alonso is supported by an NHLBI award K24HL148521. Antonio Luiz P Ribeiro is supported in part by CNPq (310679/2016-8 and 465518/2014-1) and by FAPEMIG (PPM-00428-17 and RED-00081-16). Eduardo Tarazona-Santos, Maria Fernanda Lima-Costa and Thiago P Leal are supported by Brazilian Conselho Nacional de Desenvolvimento Científico e Tecnologico (CNPq). Eduardo Tarazona-Santos and Maria Fernanda Lima-Costa are also supported by the Brazilian Ministry of Health (DECIT/MS, EPIGEN-Brazil Project), Brazilian Ministry of Science and Technology (Financiadora de Estudos e Projetos (FINEP) and Fundação de Amparo a Pesquisa do Estado de Minas Gerais (FAPEMIG). Ruth Loos is supported by the National Institutes of Health (R01DK110113; R01DK107786; R01HL142302; R01DK124097). Michael J Cutler is supported by funding from the Dell Loy Hansen Heart Foundation. M.Benjamin Shoemaker is supported by grant NIHK23HL127704. Rozenn Lemaitre reports funding from grant: AHA 19SFRN34830063. Steven A Lubitz is supported by NIH grant 1R01HL139731 and American Heart Association 18SFRN34250007. Patrick T Ellinor was supported by grants from Bayer AG, the Foundation Leducq (14CVD01), the NIH (1RO1HL092577, R01HL128914, K24HL105780), and the American Heart Association (18SFRN34110082). Lu-Chen Weng was supported by National Institutes of Health (NIH) grant 1R01HL139731 and American Heart Association (AHA) grant 18SFRN34110082. Victor Nauffal is funded by a T32 training grant from the National Institutes of Health (5T32HL007604-35). Nona Sotoodehnia is supported by grants AHA19SFRN348300063, R01HL141989, Medic One Foundation, Laughlin Family. Ulrich Schotten received funding from the Netherlands Heart Foundation (CVON2014-09, RACE V Reappraisal of Atrial Fibrillation: Interaction between hypercoagulability, Electrical remodelling, and Vascular Destabilization in the Progression of AF), the European Union (ITN Network Personalize AF: Personalized Therapies for Atrial Fibrillation: a translational network, grant number 860974; MAESTRIA: Machine Learning Artificial Intelligence Early Detection Stroke Atrial Fibrillation, grant number 965286). Sébastien Thériault is supported by a Junior 1 Clinical Research Scholar award from the Fonds de Recherche du Québec-Santé (FRQS). Jonas L Isaksen is supported by CACHET. Caroline Hayward is supported by an MRC University Unit Programme Grant MC_UU_00007/10 (QTL in Health and Disease). Christy Avery and Antoine Baldassari were supported by NIH grants R01HL142825, and U01HG007416. Dawood Darbar was supported by NIH grants R01HL138737 and T32HL139439. Daniel S Evans is supported by NIH grant U24AG051129. Niels Grarup is supported by the Novo Nordisk Foundation (Grant number NNF18CC0034900). James G. Wilson is supported by U54GM115428 from the National Institute of General Medical Sciences. Hao Mei is also supported by the CHARGE infrastructure grant (HL105756). Dennis Mook-Kanamori is supported by Dutch Science Organization (ZonMW-VENI Grant 916.14.023). May E Montasser receives funding from Regeneron Pharmaceutical unrelated to this work. James F Wilson and Pau Navarro acknowledge support from the MRC Human Genetics Unit programme grant, “Quantitative traits in health and disease” (U. MC_UU_00007/10). Linda Repetto is funded by a University of Edinburgh studentship. Xia Shen was in receipt of a Starting Grant (2017-02543) from the Swedish Research Council (Vetenskaprådet). Lars Lind is funded by Uppsala University Hospital, Uppsala, Sweden. Niek Verweij was supported by NWO VENI (016.186.125). Jan-Walter Benjamins is funded by the Research Project CVON-AI (2018B017) financed by the PPP Allowance made available by Top Sector Life Sciences & Health to the Dutch Heart Foundation to stimulate public-private partnerships. J.Wouter Jukema is an Established Clinical Investigator of the Netherlands Heart Foundation (grant 2001 D 032). Oliver Hines is supported by a Medical Research Council DTP Studentship. Massimo Mangino is supported by the National Institute for Health Research (NIHR)-funded BioResource, Clinical Research Facility and Biomedical Research Centre based at Guy’s and St Thomas’ NHS Foundation Trust in partnership with King’s College London. Patricia B Munroe, Pier D Lambiase, Andrew Tinker and Michele Orini were supported by Medical Research Council grant MR/N025083/1. Julia Ramirez is funded by European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No.786833. Pier D Lambiase is supported by UCL/UCLH Biomedicine NIHR, Barts Heart Centre Biomedical Research Centre. Patricia B Munroe, Pier D Lambiase, Andrew Tinker and Helen Warren acknowledge the NIHR Cardiovascular Biomedical Centre at Barts and The London, Queen Mary University of London. Christopher Newton-Cheh acknowledges support from the National Institutes of Health (R01HL143070). The authors also wish to acknowledge the CHARGE infrastructure grant (HL105756).
Author information
Authors and Affiliations
Contributions
Interpreted results, writing, and editing the manuscript: W.J.Y., N.L., A.I., N.S., B.M., C.N.C., P.B.M. Conceptualization of project: N.S., C.N.C., P.B.M. Supervision of project: N.S., B.M., C.N.C., P.B.M. Contributed to GWAS analysis plan: W.J.Y., N.L., N.S., C.N.C., P.B.M. Performed meta-analyses: W.J.Y. and N.L. Performed GCTA, genetic correlations, heritability, variant annotations, GTEx analyses, HiC analyses, gene-set enrichment and pathway analyses, druggability analyses, PRS analyses in UKB: W.J.Y. Performed PRS analyses in SCD cohorts: T.D, J.A.B. Contributed data for PRS SCD analyses: M.-R.J., H.V.H., R.N.M., J.J., P.F., R.L., K.P. Performed gene literature review: W.J.Y., L.F., F.A., R.S., D.R. Visualization of enrichment pathways: S.A.G. Contributed to study-specific GWAS by providing phenotype, genotype and performing data analyses: W.J.Y., A.I., T.D., L.F., J.A.B., R.N., J.-W.B., J.H., L.-P.L., L.Repetto, M.P.C., M.B., S.W., A.R.B., T.M.B., J.P.C., D.S.E., R.F., O.H., J.L.I., H.L., H.M., A.M., M.M.-N., C.N., Y.Q., A.R., C.R., K.R., E.T.-S., S.Thériault, S.V.D., H.R.W., J.Y. S.A., G.A., A.A., L.A., J.C.B., E.B., A.C., E.C., M.C., M.J.C., D.D., A.D.G., A.D.L., J.D., C.E., P.T.E., S.B.F., C.F., M.Gögele, C.G., M.Graff, X.G., T.H., S.R.H., P.L.H., N.H.-K., A.I., R.D.J., M.J., J.J., M.Kavousi, J.A.K., T.P.L., H.J.L., L.L., A.L., S.L., P.M., M.Mangino, T.M., M.Mezzavilla, P.P.M., R.N.M., N.M., M.E.M., A.C.M., M.N., V.N., P.N., K.N., G.Pare, G.Pelliccione, A.P., D.J.P., P.P.P., M.H.P., O.T.R., A.P.R., A.L.P.R., K.M.R., L.Risch, D.S., U.S., C.S., X.S., M.B.S., G.S., M.F.S., E.Z.S., M.S., K.S., K.T., K.D.T., A.T., S.Trompet, A.U., U.V., H.V., M.W., L.-C.W., E.W., J.G.W., C.L.A., D.C., A.C., F.C., M.D., G.G., N.G., C.H., Y.J., J.W.J., S.K., M.Kähönen, J.K.K., C.K., T.L., M.F.L.-C., Y.L., R.J.F.L., S.A.L., D.O.M.-K., A.P.M., J.R.O., M.S.O., M.O., S.P., C.P., A.P., B.M.P., J.I.R., B.S., P.H., C.M.D., N.V., J.F.W., D.E.A., J.R., P.D.L., N.S., P.B.M. All authors read, revised, and approved the manuscript.
Corresponding authors
Ethics declarations
Competing interests
M.J.C. has consulted for Biosense Webster and Janssen Scientific. S.A.L. receives sponsored research support from Bristol Myers Squibb/Pfizer, Bayer AG, Boehringer Ingelheim, Fitbit, and IBM, and has consulted for Bristol Myers Squibb/Pfizer, Bayer AG, and Blackstone Life Sciences. P.T.E. has received grant funding from Bayer AG and served on advisory boards or consulted for Bayer AG, Quest Diagnostics, MyoKardia and Novartis. B.M.P. serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. D.O.M.-K. is a part time research consultant at Metabolon, Inc. D.C has received speaker fees from BMS/Pfizer and consultation fees from Roche Diagnostics. U.S received consultancy fees or honoraria from Università della Svizzera Italiana (USI, Switzerland), Roche Diagnostics (Switzerland), EP Solutions Inc. (Switzerland), Johnson & Johnson Medical Limited, (United Kingdom), Bayer Healthcare (Germany). U.S is co-founder and shareholder of YourRhythmics BV, a spin-off company of the University Maastricht. L.-C.W receives sponsored research support from IBM to the Broad Institute. The remaining authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Albert Henry, Pieter Postema and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Young, W.J., Lahrouchi, N., Isaacs, A. et al. Genetic analyses of the electrocardiographic QT interval and its components identify additional loci and pathways. Nat Commun 13, 5144 (2022). https://doi.org/10.1038/s41467-022-32821-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-022-32821-z
This article is cited by
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.