Genome-wide association and multi-omic analyses reveal ACTN2 as a gene linked to heart failure

Arvanitis, Marios; Tampakakis, Emmanouil; Zhang, Yanxiao; Wang, Wei; Auton, Adam; Dutta, Diptavo; Glavaris, Stephanie; Keramati, Ali; Chatterjee, Nilanjan; Chi, Neil C.; Ren, Bing; Post, Wendy S.; Battle, Alexis

doi:10.1038/s41467-020-14843-7

Download PDF

Article
Open access
Published: 28 February 2020

Genome-wide association and multi-omic analyses reveal ACTN2 as a gene linked to heart failure

Nature Communications volume 11, Article number: 1122 (2020) Cite this article

13k Accesses
40 Citations
40 Altmetric
Metrics details

Subjects

Abstract

Heart failure is a major public health problem affecting over 23 million people worldwide. In this study, we present the results of a large scale meta-analysis of heart failure GWAS and replication in a comparable sized cohort to identify one known and two novel loci associated with heart failure. Heart failure sub-phenotyping shows that a new locus in chromosome 1 is associated with left ventricular adverse remodeling and clinical heart failure, in response to different initial cardiac muscle insults. Functional characterization and fine-mapping of that locus reveal a putative causal variant in a cardiac muscle specific regulatory region activated during cardiomyocyte differentiation that binds to the ACTN2 gene, a crucial structural protein inside the cardiac sarcolemma (Hi-C interaction p-value = 0.00002). Genome-editing in human embryonic stem cell-derived cardiomyocytes confirms the influence of the identified regulatory region in the expression of ACTN2. Our findings extend our understanding of biological mechanisms underlying heart failure.

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Genome-wide association studies

Article 26 August 2021

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Article Open access 12 April 2024

Introduction

Heart failure is a highly prevalent disease¹ that constitutes a major medical and economic burden in the healthcare system, accounting for ~1–2% of the annual healthcare budget in developed countries². Although almost any disease that directly or indirectly affects myocardial function can lead to the eventual development of clinical heart failure, it is well-established that certain intrinsic homeostatic mechanisms like the renin–angiotensin–aldosterone axis and the sympathetic nervous system potentiate the effects of a variety of myocardial insults and cause adverse left ventricular remodeling³, suggesting that multiple cellular mechanisms that lead to the disease are shared regardless of the inciting condition.

The increasing appreciation of an underlying strong heritable component of clinical heart failure further strengthens the argument for shared, yet unidentified, disease mechanisms whose discovery could reveal novel targets for its treatment and prevention. Indeed, large recent pedigree studies estimate heart failure heritability to be 26–34%⁴. However, large-scale genome-wide associations studies (GWAS) for heart failure have been unsuccessful to-date at uncovering a significant proportion of this estimated heritability underscoring a major unmet need in cardiovascular genetics. In fact, the largest published GWAS for heart failure until recently had only identified one genome-wide significant locus for all-comers with the disease that the investigators attribute to its overlap with atrial fibrillation⁵. A larger GWAS performed and published in parallel to our study increased the number of identified loci to 11⁶. Even within this important work, however, many of the identified loci appear to be acting via heart failure risk factors and these loci have not yet been extensively functionally characterized, thereby limiting identification of actionable targets that predispose to heart failure development.

In the current work, we perform a large-scale GWAS for heart failure and replicate our findings in a comparably sized independent cohort. We identify and replicate associations between heart failure and one known locus in chromosome 4 near the PITX2 gene and two novel loci near the ABO (chromosome 9) and ACTN2 (chromosome 1) genes. One of the novel loci near ABO was also detected in the aforementioned recently published GWAS⁶. Heart failure sub-phenotyping and multi-trait conditional analyses show that the novel chromosome 1 locus affects heart failure and left ventricular remodeling independently of known risk factors and in response to a variety of initial cardiac muscle insults. Detailed functional characterization of that locus using epigenomic, Hi-C, and transcriptomic datasets in differentiating cardiomyocytes reveals a cardiac muscle-specific regulatory element that is dynamic during cardiomyocyte differentiation and binds to the promoter of the ACTN2 gene, whereas genome-editing confirms that ACTN2 expression is significantly reduced in cardiomyocytes that carry a deletion of the identified novel regulatory element.

Results and discussion

GWAS meta-analysis identifies novel heart failure loci

We performed a large-scale GWAS meta-analysis of five cohorts that study cardiovascular disease and two population genetics cohorts, all of European ancestry comprising a total of 10,976 heart failure cases and 437,573 controls. We used the 1000 Genomes phase 3 reference panel to impute variants from single nucleotide polymorphism (SNP) array data and analyzed a total of 13,066,955 unique genotyped or high-confidence imputed variants (INFO score > 0.7) with a minor allele frequency >1%. We analyzed each individual cohort using a logistic mixed model and meta-analyzed all studies with fixed effects inverse-variance meta-analysis.

The combined meta-analysis revealed one previously identified and two novel loci associated with clinical heart failure at a genome-wide significance threshold (p-value < 5e-8) (Fig. 1a, Table 1, and Supplementary Data 1). All identified leading variants are common (MAF > 10%) and are located in non-coding regions of the genome (Supplementary Fig. 1). We validated our genome-wide significant loci in an independent cohort of 24,829 self-reported heart failure cases and 1,614,513 controls of European ancestry from the personal genetics company 23andMe, Inc. with all three sentinel variant associations successfully replicating at a nominal p-value level (p < 0.05) (Table 1) and after Bonferroni adjustment. Demographic information comparing the Discovery and Replication cohorts is available in Supplementary Table 1.

**Fig. 1: Summary GWAS and genetic correlation plots.**

Table 1 Lead variant associations in the Discovery and Replication cohorts.

Full size table

Analysis links heart failure and musculoskeletal traits

We subsequently performed linkage disequilibrium (LD) score regression to estimate heart failure heritability driven by common variants and the genetic correlation between heart failure and other complex traits. Liability scale SNP heritability for the disease assuming a population prevalence of 1.8%⁷ was 5.9% (SE 0.7%), much lower than the pedigree-based estimates of 26%, a discrepancy that has been observed for other complex traits⁸ and could be explained by multiple factors, including rare variants. The LD score regression intercept was 0.99, indicating no inflation beyond what can be accounted for by polygenicity. As expected, we saw significant genetic correlation between heart failure and known heart failure risk factors, such as hypertension, ischemic heart disease, adverse lipid profiles, diabetes, and atrial fibrillation. We also found a strong association between heart failure and pulmonary, musculoskeletal, and GI traits (Fig. 1b and Supplementary Table 2). We should note that genetic correlation analysis should not be viewed as evidence for a causal relationship between the tested diseases and consequently these results do not indicate that heart failure is causally influenced by musculoskeletal disorders. However, these disorders may share some genetic factors or cellular pathways—since the heart is composed mostly of muscle and stromal tissue, it is plausible that it could share regulatory mechanisms with other organs of similar cell type composition.

Atrial fibrillation’s role in heart failure development

We investigated each genome-wide significant locus in depth. The chromosome 4 locus tagged by the SNP rs1906615 is found in an intergenic region close to the PITX2 gene. This locus was previously identified as containing the strongest evidence for association with atrial fibrillation⁹ and has been reported as a significant locus in recent heart failure GWAS^5,6. However, that association was thought to be mediated via the relative enrichment of the heart failure population in atrial fibrillation cases⁵. Indeed, via multi-trait conditional and joint analysis using summary statistics from GWAS of atrial fibrillation, we confirm that the effect of the PITX2 locus on heart failure is explained by its effect in atrial fibrillation (Table 2 and Supplementary Data 2). Mendelian randomization (MR) analysis using 110 independent (LD r² < 0.001) genome-wide significant atrial fibrillation-associated variants, provides further evidence for a directional effect of atrial fibrillation on heart failure development (weighted mode MR effect size 0.21 (odds ratio 1.23), z-test p < 0.0001). Sensitivity analysis using the MR Egger and weighted median approach to account for potential pleiotropy and/or invalid instruments confounding our MR estimates is statistically significant and supports our hypothesis for a causal effect of atrial fibrillation on heart failure (Supplementary Fig. 2). While MR methods alone cannot rule out reverse causation, 105/110 of the variants used here have a larger effect size for atrial fibrillation than heart failure, and atrial fibrillation displays greater SNP heritability, supporting that the MR result indicates an effect on Heart Failure that is mediated through atrial fibrillation rather than the reverse.

Table 2 Lead GWAS variants in multi-trait analysis, Heart Failure sub-phenotypes and echocardiographic traits.

Full size table

ACTN2 gene enhancer is associated with heart failure

The chromosome 1 locus tagged by the SNP rs580698 is found near ACTN2, a gene that encodes for a structural cardiac protein inside the sarcolemma, at which rare mutations have recently been associated with the development of cardiomyopathy and consequently heart failure¹⁰. Multi-trait conditional and joint analysis with common heart failure risk factors (atrial fibrillation, ischemic heart disease, hypertension, diabetes mellitus) does not result in a significant change in the effect of the ACTN2 locus on heart failure, suggesting that the association signal is not primarily mediated via these other diseases (Table 2 and Supplementary Data 2). A phenome-wide association approach (PheWAS) using echocardiographic and other phenotypic information available for a subset of our cohorts and participants demonstrates that the ACTN2 locus is significantly associated with both ischemic and non-ischemic heart failure and has a trend for an effect in left ventricular dilation and heart failure with reduced ejection fraction, thereby suggesting its potential role in mechanisms predisposing to left ventricular adverse remodeling in response to various initial insults (Table 2 and Supplementary Fig. 3). Chromatin state data for the ACTN2 locus from Roadmap Epigenomics reveal a broad area of muscle-specific active enhancer elements in the skeletal muscle, fetal heart, left and right ventricular tissues (Figs. 2 and 3a, and Supplementary Fig. 4). Integration with expression quantitative trait loci (eQTL) data does not reveal any compelling evidence of colocalization between the GWAS signal and altered expression of nearby genes in adult blood or post-mortem adult heart tissues (Supplementary Table 3). In addition, no significant association with the expression of nearby genes is detected in eQTL studies performed with freshly preserved heart tissue at the time of heart transplant/donor heart explant¹¹.

**Fig. 2: Epigenetic overview of the *ACTN2* locus.**

**Fig. 3: Fine-mapping of the *ACTN2* locus.**

Since eQTL analysis in adult tissue did not identify a target gene for the locus, in an effort to provide a credible hypothesis of how this locus is associated with heart failure, we proceeded with additional functional characterization. The first step was to fine map putative causal variants within the locus. First, we generated a credible set of SNPs for the ACTN2 locus based on the GWAS associations and linkage disequilibrium pattern for each region of interest using CAVIAR¹² and selected the 111 SNPs with GWAS p-value < 5e-7 from the credible set. Then, we intersected that set of SNPs with active chromatin states using the Roadmap Epigenomics ChromHMM 25-state model¹³ for cardiac tissues and with candidate cis-regulatory elements (ccREs) from the ENCODE registry¹⁴. Of 111 strongly associated variants in almost perfect LD within the credible set for ACTN2, only seven overlapped regulatory elements in both Roadmap and ENCODE and were, therefore, used in downstream analyses (Supplementary Data 3).

Next, we verified the presence of active chromatin states overlapping our ACTN2 locus SNPs in engineered human embryonic stem cell (hESC) derived cardiomyocytes during different stages of differentiation. We showed that one of the seven target variants, rs535411 (Supplementary Data 4) overlaps cardiomyocyte-specific ATAC-seq, H3K4me1, and H3K27ac peaks that start to appear on day 7 of hESC differentiation into a cardiomyocyte and persist until at least day 80 (Fig. 3b). The ATAC-seq signal onset at that region coincides temporally with the onset of ACTN2 expression based on RNA-seq data from the same differentiation experiment (Fig. 3b), with both occurring between day 5 and 7. High-resolution chromatin conformation capture (HiC) analysis of our hESC to cardiomyocyte model on day 80 of differentiation shows that the ATAC-seq peak is in contact with the ACTN2 gene promoter (observed/expected interaction frequency = 1.82, Poisson test p = 0.00002) (Fig. 4a and Supplementary Table 4) and its interaction is dynamic and increases during differentiation (Fig. 4b).

**Fig. 4: The fine-mapped regulatory element in chromosome 1 affects *ACTN2* gene expression.**

Although rare variants within the ACTN2 gene are known to be associated with cardiomyopathies, the credible set analysis does not support the coding region as being the primary driver of the GWAS signal. Moreover, conditioning on the sentinel variant eliminates the signal for association of the locus with heart failure, suggesting that the association is driven primarily by a single causal variant in high LD with the sentinel SNP (Supplementary Fig. 5). We should note however that we cannot exclude the possibility that the association signal could be caused or increased by other rare variants within our identified cardiac muscle enhancer region that contains rs535411.

We subsequently ventured to experimentally validate the effect of the putative enhancer element at the 1q43 locus on ACTN2 gene expression in cardiomyocytes. For that purpose, we generated engineered hESCs with a CRISPR-Cas9-induced deletion in the ~2200 bp region that delimits the enhancer element identified in our hESC-CM epigenomic data analyses. We differentiated these edited hESCs into cardiomyocytes and on day 15 of differentiation, we compared the expression of ACTN2 to that of isogenic hESC-CMs without the deletion. ACTN2 expression was reduced on average by half in the edited hESC-CMs compared to controls (Fig. 4c). We then assessed expression of other nearby genes, and none appeared to be affected by the deletion (Supplementary Fig. 6A). These experiments support the epigenetic predictions of a cardiac enhancer element in that region and validate the Hi-C data that suggest binding of that enhancer element to the ACTN2 gene promoter. More importantly, these results provide a mechanistic hypothesis of the GWAS association between the ACTN2 locus and heart failure. Indeed, previous studies have established that reduction of ACTN2 mRNA levels via a siRNA leads to defects in the number and size of cardiac sarcomeres along with a phenotype of dilated heart with thin walls and a decreased heart rate in zebrafish¹⁵. It is therefore plausible that smaller reductions of ACTN2 expression as those caused by variants within our identified enhancer could generate subtler cardiac sarcomeric defects in humans that become apparent later in life in individuals with an additional genetic or environmental insult to the heart muscle, thereby providing a tenable explanation for the detected heart failure association that deserves further exploration in future studies.

Independent experiments support the presence of cardiac muscle enhancer in the identified region. Specifically, ChiP-seq data of p300/CREBBP from an independent cardiomyocyte experiment show a peak at the identified region¹⁶ suggestive of chromatin-accessible active regulatory elements. Since the ACTN2 gene is known to be induced during cardiomyocyte maturation¹⁷ and our hESC experiments confirm a dynamic regulatory region that switches on during cardiomyocyte differentiation, the absence of evidence of cis-eQTL effects of our putative causal variant with the ACTN2 gene may reflect a dynamic effect of the enhancer on gene expression during the maturation process or could be the consequence of insufficient power, relevant cell type, or other context-specificity in eQTL studies to-date. Moreover, prior studies support the role of SNPs in this region in cardiac function. Our fine-mapped variant rs535411 is associated with left ventricular end diastolic dimension (beta = 0.022, t-test p = 5.07e-05) in a recent large-scale GWAS of echocardiographic traits¹⁸, which corroborates our hypothesis for a role of the locus in left ventricular remodeling. Beyond ACTN2, the data from our genetic correlation analysis (Fig. 1b) support a broader role of common variants related to structural musculoskeletal proteins in heart failure by revealing strongly shared heritability between heart failure and multiple musculoskeletal disorders (including osteoarthritis, enthesopathies, intervertebral disk disease) and smooth muscle disorders (esophageal, gastric, and duodenal diseases).

Regulatory variants of ABO predispose to heart failure

Lastly, the chromosome 9 locus tagged by the SNP rs9411378 is found in an intron of the ABO gene, a gene that determines blood type and has been linked to the development of ischemic heart disease¹⁹. A PheWAS of the sentinel variant across 4155 GWAS from the GWAS Atlas²⁰ shows its significant effects in hematologic (red blood cell count, white blood cell count, monocyte cell count, hemoglobin concentration) and metabolic traits (lipid disorders, diabetes, activated partial thromboplastin time) (Supplementary Data 5 and Fig. 5a), whereas a similar PheWAS approach on 1448 traits from the UK BioBank reveals its association with venous thromboembolism (Supplementary Fig. 7). Interestingly, conditioning on several traits associated with our sentinel variant for which GWAS summary statistics are available or on known heart failure risk factors does not significantly change the signal of association between the ABO locus and heart failure (Table 2 and Supplementary Data 2), suggesting a direct effect of the locus on heart failure independent of its effect on other human disorders. In addition, since ABO is a known locus for coronary disease, which in turn is one of the major disorders leading to heart failure, beyond conditioning on ischemic heart disease we also performed a sensitivity analysis in which we excluded all patients with coronary artery disease (CAD) and tested the association between the locus sentinel SNP and heart failure (log(Odds Ratio) = 0.1027, score test p-value = 1.3e-4). Since CAD is only one of the many causes of heart failure, and individuals can have both CAD and heart failure from a different cause, the restricted analysis is conservative and consequently underpowered compared to our discovery GWAS (N_cases = 4137 vs. N_cases = 10,976), which even at the expected, unrestricted effect size inevitably makes the association non-significant at a genome-wide p-value threshold of 5e-8. Nevertheless, the restricted effect size on HF did remain similar to our unrestricted analysis and the association remained nominally significant. Taken together, this sensitivity analysis and the multi-trait conditional analysis suggest the possibility of a role of the ABO locus on heart failure independent of its established influence on coronary disease risk. However, definitive proof of this hypothesis will require further study.

**Fig. 5: Functional characterization of the *ABO* locus.**

The locus did not show any active enhancer or promoter states in cardiac tissues but instead overlapped active enhancer states in primary hematopoietic stem cells and intestinal cells (Supplementary Fig. 8). The sentinel variant was a strong eQTL for ABO gene expression in eQTLGen and GTEx whole blood, consistent with our findings of active chromatin state overlap in hematopoietic lineage cells. In addition, the eQTL signal had strong evidence of colocalization with the GWAS signal for the same locus (posterior probability 96%) (Fig. 5b). Notably, the sentinel variant in our GWAS is in LD with the most common variant (rs8176719) associated with O-blood type via a frameshift mutation that is thought to inactivate ABO (LD r² 0.64 in 1000 Genome Europeans). However, the effects of our lead variants on the expression of ABO remain after stratifying by rs8176719 genotype (Fig. 5c), suggesting an additional regulatory role for our variants, which goes beyond tagging the O-blood type variant rs8176719. Similarly, in our GWAS, a strong signal for association remains within the locus after conditioning on rs8176719 (Supplementary Fig. 9). We should note though that rs8176719 is genotyped or accurately imputed only on a small subset of our participants (35,836 individuals), which may limit interpretation of this analysis as definitive evidence of an independent signal. Since our GWAS locus is intronic, we also examined whether it could affect splicing of the ABO gene using whole-blood RNA-seq data from GTEx v8. Indeed, although the variant’s effect on expression appears stronger than its splicing consequence, we found that the locus is also associated with splicing of ABO, promoting a splice variant that skips the exon on which rs8176719 is found, which provides additional evidence for a regulatory role not due to linkage disequilibrium with rs8176719 (Supplementary Table 5 and Supplementary Fig. 10).

Although having non-O blood type via a structural coding variation in the ABO gene has been linked to cardiovascular disease and cardiovascular mortality²¹, the mechanisms underlying this association are not fully understood. Our finding that regulatory variation of the ABO gene’s expression is linked to the development of heart failure highlights the importance of ABO in cardiovascular disease and opens the door to further studies to decipher the cellular mechanisms involved.

In summary, we performed a large-scale genome-wide association study for heart failure and replicated our findings in a similarly powered cohort. Our results validate the use of this approach to discover regulatory variants associated with heart failure predisposition in response to a variety of cardiac insults, reveal a new putative mechanism for the disease associated with the regulation of a structural cardiac muscle protein during differentiation, underscore the role of the ABO gene in cardiovascular disease and highlight broadly shared heritability between heart failure and musculoskeletal disorders.

Methods

Samples

We performed genome-wide association studies in five cohorts that study cardiovascular disease (Framingham Heart Study, Cardiovascular Health Study, Atherosclerosis Risk in Communities Study, Multi-Ethnic Study of Atherosclerosis, Women’s Health Initiative) and the eMERGE initiative. Genotype and phenotype raw data were downloaded from dbGAP (accession numbers phs000007.v29.p11, phs000287.v6.p1, phs000209.v13.p3, phs000280.v4.p1, phs000200.v11.p3, phs000888.v1.p1). Our work complies with all relevant ethical regulations for work with human participants. All individuals provided informed consent for participation in the individual cohorts. Our GWAS study was approved by the Johns Hopkins School of Medicine IRB (IRB #00163194). For each individual study we performed sample level filtering (excluding samples with assigned and genotype sex discrepancy, extreme deviations from heterozygosity or missingness). We also excluded individuals that were not of European Ancestry and for every group of individuals that were related (identity by descent (IBD) > 0.125) we randomly selected one.

In addition, for each study SNP level filtering was performed to exclude SNPs that had significant deviations from Hardy–Weinberg equilibrium in heart failure controls, minor allele frequency <0.01, missing call rate >0.05 and differential missingness between heart failure cases and controls²². For studies that analyzed their populations with different genotyping arrays, we also excluded SNPs that had significant deviation in minor allele frequencies (MAF) between the different arrays. For individuals that were genotyped in more than one genotyping array, we selected the array that had the most extensive genotyping. We proceeded with imputing and analyzing each array separately for every study.

Imputation

We imputed each study to the 1000 Genomes phase 3 reference panel using Minimac3²³ after pre-phasing with Eagle²⁴ on the Michigan Imputation Server. Prior to imputation, we lifted all SNPs to the hg19 human genome build using the UCSC liftOver tool, aligned all SNPs to the positive strand and filtered out SNPs whose minor allele frequencies deviated by >0.2 compared to the reference panel’s MAF and SNPs A/T or G/C SNPs with MAF > 0.4 as those are prone to strand alignment errors. After imputation, we excluded all imputed SNPs with imputation r squared (INFO score) <0.7, SNPs with MAF < 0.01 and SNPs with Hardy–Weinberg p-value <1e-4. For the eMERGE cohort, imputation was performed independently prior to the start of this study with procedures detailed elsewhere²⁵ and we subsequently applied the same post-imputation filters.

Genome-wide association

For each study, we performed a GWAS for heart failure controlling for age, sex and the first 10 genotype principal components (PCs). Heart failure definitions in the different cohorts are listed in Supplementary Table 6. PCs were calculated based on a set of independent (LD r² < 0.2) genotyped or high-quality imputed SNPs (INFO score>0.9) in an unrelated population (IBD < 0.08) and the SNP loadings were subsequently used to calculate the eigenvectors for all individuals included in the analysis. In the eMERGE cohort, since the population was collected from multiple different hospitals across the United States, we included an additional multilevel categorical covariate denoting the sample source. All GWAS were performed using a linear mixed model with the saddlepoint approximation (SAIGE)²⁶ to account for any residual relatedness structure in our analysis and for case–control imbalance, which is inherent in our phenotype of interest. For the UK BioBank cohort we used the summary statistics for all cause Heart Failure (PheCode 428) generated by analyzing the UK BioBank data in the SAIGE paper²⁶.

Meta-analysis

We meta-analyzed the results of all our GWAS using fixed effects inverse-variance meta-analysis via the software METAL²⁷. We kept only SNPs that were present in at least three studies and 5000 individuals. The following tools were used for the GWAS: Python, R, Bcftools, PLINK²⁸, SNPRelate²⁹, SAIGE²⁶, METAL²⁷.

Replication

We replicated our findings in an independent cohort of 24,829 Heart failure cases and 1,614,513 controls of European ancestry within the 23andMe research cohort. 23andMe participants provided informed consent and participated in the research online, under a protocol approved by the external AAHRPP-accredited IRB, Ethical and Independent Review Services (E&I Review). Heart failure in the replication population was self-reported as an answer to the question “Have you ever been diagnosed with or treated for Heart failure?”. All three replication variants were imputed with high quality (imputation r² > 0.95) using an imputation panel that combined the 1000 Genomes Phase 3 panel with the UK10k panel. The variants were analyzed via logistic regression assuming an additive model with covariates for age, sex, the first five genotype PCs, and indicator variables to represent the genotyping platform. The p-values were adjusted for an LD score regression intercept of 1.043.

Phenome-wide association of heart failure subtypes

For each of the five cohorts in our study and the eMERGE cohort, we classified heart failure individuals as having ischemic heart failure if they also had a history of diagnosed ischemic heart disease, myocardial infarction, percutaneous coronary intervention or coronary artery bypass graft surgery and non-ischemic heart failure otherwise. We also classified individuals as heart failure with reduced ejection fraction if they had heart failure and at least one echocardiogram showing a left ventricular ejection fraction (LVEF) <50%, and heart failure with preserved ejection fraction otherwise. Individuals that did not have information on myocardial infarction history or echocardiographic information were not included in the respective analyses. We also obtained continuous data of LVEF, left ventricular end diastolic diameter, and interventricular septum diameter from each individual’s most recent available echocardiogram. Each of our sentinel variants from the general heart failure GWA meta-analysis were tested for an effect in each of these variables using SAIGE for the categorical variables and linear regression assuming an additive genotype effect for the continuous variables with the same covariates as in our primary GWAS. The results cross-cohort were meta-analyzed using METAL.

Other phenotype associations

To evaluate if our lead GWAS variants had associations with other phenotypes we queried the NHGRI-EBI GWAS catalog and also evaluated the GWAS atlas²⁰, which contains data from 4155 GWAS across 2960 unique traits and the 1488 Electronic Health Record-Derived PheWAS codes from the Michigan Genomics Initiative²⁶.

Heritability and genetic correlation

We used LD score regression³⁰ with the 1000 Genomes European reference LD to evaluate the liability scale heritability explained by the common variants in our GWAS assuming a population prevalence of 0.018⁷. We subsequently analyzed our GWAS together with summary statistics from GWAS studies from the UK biobank³¹ using the genetic correlation method of the LD score regression pipeline to quantify the shared heritability between our phenotype and other traits³². For the genetic correlation analysis we selected traits to analyze based on the following procedure:

1.
Among all summary statistics analyzed in the SAIGE paper²⁶, we first excluded the categories Injuries and poisonings (as it is unlikely to have a major heritable component), as well as symptoms and pregnancy complications (as they are too general to have a meaningful interpretation of genetic correlation).
2.
We excluded general disease bundles that include the work “other” or “NOS” (e.g., other infectious and parasitic diseases) or are a sign/symptom (e.g., hematuria) or medication (e.g., chemotherapy).
3.
We reclassified all infections into the “Infectious Diseases” category and all congenital anomalies to their respective organ system.
4.
From every organ system or general disease category we selected the three diseases with the highest number of cases.
5.
For every selected disease, we excluded diseases and disorders that are subsets of the same disease or highly related (e.g., Selected disease: hypertension-excluded disease: essential hypertension).
6.
We excluded diseases whose z-score of observed heritability calculated via LD score regression was <1.

Conditional analysis based on summary statistics

We used the COJO package³³ from the GCTA pipeline to evaluate the residual association signal within our genome-wide significant loci after conditioning on our sentinel variants or other variants of interest using as reference the LD of the eMERGE heart failure dataset.

Multi-trait conditional and joint analysis

We used the mtCOJO package³⁴ from the GCTA pipeline to evaluate the effects of our variants conditioned to other heart failure risk factors (e.g., hypertension, atrial fibrillation, ischemic heart disease) and conditions associated with our sentinel variants in PheWAS studies using the 1000 Genomes Europeans reference LD scores.

Mendelian randomization analysis

We used the MR base package³⁵ to perform Mendelian Randomization analysis in order to evaluate the effect of atrial fibrillation on the development of heart failure using summary statistics from a large-scale GWAS meta-analysis of atrial fibrillation³⁶. The polygenic risk score was constructed using independent variants (LD r² < 0.001) at a genome-wide significance threshold (p < 5e-8) using as reference the LD of the 1000 Genomes European samples.

Variant fine-mapping

We followed a step-wise approach based on epigenomic annotations and LD structure for our variant fine-mapping efforts. We first used the Roadmap epigenomics ChromHMM 25-state model¹³ across all tested cell types and tissues to visualize our significant loci and identify broad patterns of active promoter or enhancer elements across tissues. We subsequently used CAVIAR¹² with the 1000 Genomes European reference LD and the assumption of at most two causal variants per locus to generate a credible set of SNPs for each locus. The CAVIAR analysis for the ACTN2 locus included all SNPs in a radius of 100 kilobases around the locus sentinel SNP. Since that analysis identified a large number of SNPs (N = 183), we selected only the 111 SNPs in the set that had a GWAS p-value < 5e-7 for downstream fine-mapping. Then, we intersected the SNPs in that set with active enhancer or promoter elements predicted by Roadmap epigenomics for heart tissues (fetal heart, left ventricle, right ventricle, and right atrium). Finally, we intersected SNPs selected by the previous step with candidate cis-regulatory elements predicted by ENCODE¹⁴. Bedtools³⁷ was used for all intersection tests. The WashU Epigenome browser was used for visualization of our loci in the ChromHMM context³⁸.

Cardiomyocyte differentiation model

To further probe the effects of the ACTN2 locus on cardiomyocyte function, we performed ATAC-seq, ChIP-seq of H3K4me3 and H3K27ac, RNA-seq and HiC experiments³⁹, in an engineered H9 hESC (WiCell Research Institute) modified into H9 hESC MLC2v:H2B-GFP reporter transgenic line, which expresses H2B-GFP in differentiated ventricular cardiomyocytes⁴⁰. This cell line was differentiated into cardiomyocytes using a well-established Wnt-based differentiation protocol⁴¹. Cardiomyocytes and their intermediate cell populations were collected and analyzed at different differentiation stages (Day 0, 2,5,7,15, and 80) and epigenomics, transcriptomics and three-dimensional chromatin conformation assays were performed on these cells³⁹. We queried our fine-mapped variants by intersecting them with ATAC-seq, H3K4me3, and H3K27ac peaks in the cardiomyocyte differentiation model and we subsequently assessed the HiC contacts between the identified peaks and nearby genes. HiC contacts were generated at 5 kb resolution. Expected contacts for each bin are calculated as the genome-wide average of contacts of the same distance, as Hi-C contacts follow a distance-based decay. The observed/expected value for each bin shows the enrichment of HiC contacts relative to the background. We tested the significance of enrichment of observed contacts with respect to the expected contacts using an upper-tail Poisson test with x equals observed contacts and lambda equals expected contacts⁴². The UCSC human genome browser was used to visualize the sequencing peaks⁴³.

Expression quantitative trait loci (eQTL)

We assessed the effect of our genome-wide significant variants in gene expression of nearby genes using two databases: (1) Genotype Tissue Expression (GTEx): we obtained whole-genome sequencing and RNA sequencing data from GTEx version 8. We followed the standard pipeline proposed by GTEx v7⁴⁴ to normalize gene expression and perform cis-eQTL analyses. In brief, we filtered out genes with <6 reads or <0.1 counts per million (cpm) in >20% of participants per tissue, performed normalization of expression values between samples using TMM⁴⁵ and for each gene, we normalized gene expression across samples by an inverse rank-based transform to the standard normal distribution. The effect of a variant in gene expression was analyzed using linear regression as implemented in MatrixEQTL⁴⁶ using age, sex, RNA-seq platform, five genotype PCs and 60 probabilistic estimation of expression residuals (PEER) factors⁴⁷ as covariates. Using the methods above, we tested the effect of the ACTN2 locus fine-mapped variants to the expression of genes within 1 megabase in left ventricle tissue and the effect of the ABO locus in whole blood. (2) To increase the power of detecting a cis-eQTL association in whole blood, we obtained cis-eQTL summary statistics from eQTLGen, which includes eQTL data from 31,684 samples⁴⁸. We queried our ABO locus sentinel variant in the dataset.

Colocalization analysis

For each identified significant eQTL result for our variants, we evaluated whether the eQTL and GWAS signals colocalize using a Bayesian colocalization method as implemented in coloc⁴⁹ to estimate the posterior probability of an identical causal variant per locus between eQTL and GWAS. Colocalization Manhattan plots for Supplementary Fig. 1 were generated using LocusZoom⁵⁰.

CRISPR-Cas9 enhancer deletion cardiomyocyte model

To generate a deletion of the candidate causal region within the ACTN2 locus in human embryonic stem cells (hESCs), we used the CRISPR/Cas9 system. More specifically, we used CHOPCHOP v2⁵¹ to find guide RNAs (gRNAs) that in combination with Cas9 will generate cuts within the ATAC-seq peak detected as causal in our epigenetic hESC-CM experiments. We then cloned both gRNAs and Cas9 in vectors carrying a puromycin resistant cassette and used the NEON electroporation system (ThermoFisher) to effectively transform hESCs (H9 line, Cat# WA09, WiCell). Cells were then plated in flasks coated with Gelltrex (LDEV-Free reduced growth factor basement membrane matrix, Thermofisher) and maintained in Essential 8 medium (Thermofisher). hESCs underwent electroporation using the Thermo Neon Transfection system and transformed hESCs were selected using Puromycin for 48 h. Cells were replated and colonies from single cells were manually picked and expanded. All colonies were subsequently screened using PCR with primers that bind outside the expected Cas9 cuts. DNA from colonies carrying the deletion generated a 490 bp PCR fragment confirming a ~2200 bp deletion in the target segment (Supplementary Fig. 6B, C). To generate hESC-derived cardiomyocytes, hESCs with enhancer deletion and hESCs from the parent isogenic line were differentiated to cardiomyocytes⁵². For the differentiation protocol, cells were sequentially treated with two small inhibitors, 6 μM of CHIR99021 (Tocris, GSK3b inhibitor) for 48 h followed by 2.5 μM of IWR-1 (Tocris, Wnt signaling antagonist) in RPMI-B27 without insulin medium (Thermofisher). Spontaneous beating was noted at day 7 of differentiation. Cardiomyocytes were further selected using sodium lactate⁵³. RNA was isolated from cardiomyocytes at day 15 using Trizol, complementary DNA (cDNA) was generated using the high-capacity cDNA reverse transcription kit (Thermofisher) and quantitative PCR was performed using Sybr select protocol⁵⁴. Gene expression levels were normalized with GAPDH. Expression in edited cardiomyocytes and controls was assessed in four replicates and compared with a two-tailed t-test. Whole list of primers is provided in Supplementary Table 7.

Splice-QTL analysis for the ABO locus

We used LeafCutter⁵⁵ to perform splice-QTL (sQTL) analysis for rs550057 (which is an LD surrogate for the sentinel SNP rs9411378 of the ABO GWAS locus, highly associated with heart failure in the GWAS discovery cohort). We obtained and normalized intron excision ratios from binary sequence alignment/map files for whole-blood tissue in GTEx v8 following the filtering and normalization steps provided with the LeafCutter software. We then used FastQTL⁵⁶ to perform nominal sQTL analysis for rs550057 using as covariates five genotype PCs, ten PCs calculated based on the normalized intron excision ratios, along with sex, age, and whole-genome sequencing library construction methodology.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Our GWAS summary statistics were made available in a public Zenodo repository (https://zenodo.org/record/3612522#.XiSE_i2ZOgA), and all the genotype data used to generate those summary statistics are available from dbGaP (accession numbers phs000007.v29.p11, phs000287.v6.p1, phs000209.v13.p3, phs000280.v4.p1, phs000200.v11.p3, phs000888.v1.p1) or via a request to the UK BioBank. Our analysis of eQTL data from GTEx and eQTLGen are all available in our Supplementary Tables and the corresponding GTEx v8 sequencing data are available from dbGaP (accession number phs000424.v8.p2) and on the GTEx project portal (https://gtexportal.org/home/). RNA-seq, H3K27ac-seq, and Hi-C data from the cardiomyocyte differentiation experiments have been deposited in the Gene Expression Omnibus under the accession number GSE116862, whereas the sequencing raw reads for ATAC-seq and H3K4me1-seq, as well as all processed epigenetic, RNA-seq, and HiC data in hESC-CMs for our loci of interest were made available at the following Zenodo repository (https://zenodo.org/record/3612522#.XiSE_i2ZOgA). Lastly, the source data underlying Figs. 1b, 3a, c, 4c and Supplementary Figs. 2, 6a, c, 10a are provided as a Source Data file.

Code availability

The code used for the GWAS analyses is available on the following Github repository: https://github.com/marvani88/HF_GWAS. UCSC genome browser plots were created using the genome browser website (http://genome.ucsc.edu/).

References

Roger, V. L. Epidemiology of heart failure. Circ. Res. 113, 646–659 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lesyuk, W., Kriza, C. & Kolominsky-Rabas, P. Cost-of-illness studies in heart failure: a systematic review 2004-2016. BMC Cardiovasc. Disord. 18, 3 (2018).
Article Google Scholar
Mann, D. L. Mechanisms and models in heart failure: A combinatorial approach. Circulation 100, 999–1008 (1999).
Article CAS PubMed Google Scholar
Lindgren, M. P. et al. A Swedish Nationwide Adoption Study of the heritability of heart failure. JAMA Cardiol. 3, 703–710 (2018).
Article PubMed PubMed Central Google Scholar
Aragam, K. G. et al. Phenotypic refinement of heart failure in a national biobank facilitates genetic discovery. Circulation 139, 489–501 (2019).
Article Google Scholar
Shah, S. et al. Genome-wide association and Mendelian randomisation analysis provide insights into the pathogenesis of heart failure. Nat. Commun. 11, 163–165 (2020).
Article PubMed PubMed Central CAS Google Scholar
Zarrinkoub, R. et al. The epidemiology of heart failure, based on data for 2.1 million inhabitants in Sweden. Eur. J. Heart Fail. 15, 995–1002 (2013).
Article PubMed Google Scholar
Wainschtein, P. et al. Recovery of trait heritability from whole genome sequence data. Preprint at: https://www.biorxiv.org/content/10.1101/588020v1 (2019).
Roselli, C. et al. Multi-ethnic genome-wide association study for atrial fibrillation. Nat. Genet. 50, 1225–1233 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chiu, C. et al. Mutations in alpha-actinin-2 cause hypertrophic cardiomyopathy: a genome-wide analysis. J. Am. Coll. Cardiol. 55, 1127–1135 (2010).
Article CAS PubMed Google Scholar
Cordero, P. et al. Pathologic gene network rewiring implicates PPP1R3A as a central regulator in pressure overload heart failure. Nat. Commun. 10, 276–5 (2019).
Article ADS CAS Google Scholar
Hormozdiari, F., Kostem, E., Kang, E. Y., Pasaniuc, B. & Eskin, E. Identifying causal variants at loci with multiple signals of association. Genetics 198, 497–508 (2014).
Article CAS PubMed PubMed Central Google Scholar
Roadmap Epigenomics Consortium. et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
Article PubMed Central CAS Google Scholar
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Article ADS CAS Google Scholar
Gupta, V., Discenza, M., Guyon, J. R., Kunkel, L. M. & Beggs, A. H. alpha-Actinin-2 deficiency results in sarcomeric defects in zebrafish that cannot be rescued by alpha-actinin-3 revealing functional differences between sarcomeric isoforms. FASEB J. 26, 1892–1908 (2012).
Article CAS PubMed PubMed Central Google Scholar
May, D. et al. Large-scale discovery of enhancers from human heart tissue. Nat. Genet. 44, 89–93 (2011).
Article PubMed PubMed Central CAS Google Scholar
Zhang, M. et al. Universal cardiac induction of human pluripotent stem cells in two and three-dimensional formats: implications for in vitro maturation. Stem Cells 33, 1456–1469 (2015).
Article CAS PubMed Google Scholar
Wild, P. S. et al. Large-scale genome-wide analysis identifies genetic variants associated with cardiac structure and function. J. Clin. Invest. 127, 1798–1812 (2017).
Article PubMed PubMed Central Google Scholar
He, M. et al. ABO blood group and risk of coronary heart disease in two prospective cohort studies. Arterioscler. Thromb. Vasc. Biol. 32, 2314–2320 (2012).
Article CAS PubMed PubMed Central Google Scholar
Watanabe, K. et al. A global overview of pleiotropy and genetic architecture in complex traits. Nat. Genet. 51, 1339–1348 (2019).
Article CAS PubMed Google Scholar
Franchini, M. & Lippi, G. The intriguing relationship between the ABO blood group, cardiovascular disease, and cancer. BMC Med. 13, y (2015).
Article Google Scholar
Anderson, C. A. et al. Data quality control in genetic case-control association studies. Nat. Protoc. 5, 1564–1573 (2010).
Article CAS PubMed PubMed Central Google Scholar
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287 (2016).
Article CAS PubMed PubMed Central Google Scholar
Loh, P. R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 48, 1443–1448 (2016).
Article CAS PubMed PubMed Central Google Scholar
Verma, S. S. et al. Imputation and quality control steps for combining multiple genome-wide datasets. Front. Genet. 5, 370 (2014).
Article PubMed PubMed Central CAS Google Scholar
Zhou, W. et al. Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat. Genet. 50, 1335–1341 (2018).
Article CAS PubMed PubMed Central Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Article CAS PubMed PubMed Central Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 8 (2015). eCollection 2015.
Article CAS Google Scholar
Zheng, X. et al. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28, 3326–3328 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bulik-Sullivan, B. K. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
Article CAS PubMed PubMed Central Google Scholar
Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 36–3 (2012).
Google Scholar
Zhu, Z. et al. Causal associations between risk factors and common diseases inferred from GWAS summary data. Nat. Commun. 9, 2 (2018).
Article ADS CAS Google Scholar
Hemani, G. et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife 7, https://doi.org/10.7554/eLife.34408 (2018).
Nielsen, J. B. et al. Biobank-driven genomic discovery yields new insight into atrial fibrillation biology. Nat. Genet. 50, 1234–1239 (2018).
Article CAS PubMed PubMed Central Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Li, D., Hsu, S., Purushotham, D., Sears, R. L. & Wang, T. WashU epigenome browser update 2019. Nucleic Acids Res. 47, W158–W165 (2019).
Article PubMed PubMed Central Google Scholar
Zhang, Y. et al. Transcriptionally active HERV-H retrotransposons demarcate topologically associating domains in human pluripotent stem cells. Nat. Genet. 51, 1380–1388 (2019).
Article CAS PubMed PubMed Central Google Scholar
Veevers, J. et al. Cell-surface marker signature for enrichment of ventricular cardiomyocytes derived from human embryonic stem cells. Stem Cell. Rep. 11, 828–841 (2018).
Article CAS Google Scholar
Lian, X. et al. Directed cardiomyocyte differentiation from human pluripotent stem cells by modulating Wnt/beta-catenin signaling under fully defined conditions. Nat. Protoc. 8, 162–175 (2013).
Article CAS PubMed Google Scholar
Rao, S. S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kent, W. J. et al. The human genome browser at UCSC. Genome Res. 12, 996–1006 (2002).
Article CAS PubMed PubMed Central Google Scholar
GTEx Consortium. Erratum: genetic effects on gene expression across human tissues. Nature 553, 530 (2018).
Article CAS Google Scholar
Robinson, M. D. & Oshlack, A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 11, r25 (2010). Epub 2010 Mar 2.
Article PubMed PubMed Central CAS Google Scholar
Shabalin, A. A. Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics 28, 1353–1358 (2012).
Article CAS PubMed PubMed Central Google Scholar
Stegle, O., Parts, L., Piipari, M., Winn, J. & Durbin, R. Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nat. Protoc. 7, 500–507 (2012).
Article CAS PubMed PubMed Central Google Scholar
Võsa, U. et al. Unraveling the polygenic architecture of complex traits using blood eQTL metaanalysis. Preprint at: https://www.biorxiv.org/content/10.1101/447367v1 (2018).
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014).
Article PubMed PubMed Central CAS Google Scholar
Pruim, R. J. et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26, 2336–2337 (2010).
Article CAS PubMed PubMed Central Google Scholar
Labun, K. et al. CHOPCHOP v3: expanding the CRISPR web toolbox beyond genome editing. Nucleic Acids Res. 47, W171–W174 (2019).
Article PubMed PubMed Central Google Scholar
Cho, G. S., Tampakakis, E., Andersen, P. & Kwon, C. Use of a neonatal rat system as a bioincubator to generate adult-like mature cardiomyocytes from human and mouse pluripotent stem cells. Nat. Protoc. 12, 2097–2109 (2017).
Article CAS PubMed PubMed Central Google Scholar
Tohyama, S. et al. Distinct metabolic flow enables large-scale purification of mouse and human pluripotent stem cell-derived cardiomyocytes. Cell. Stem Cell. 12, 127–137 (2013).
Article CAS PubMed Google Scholar
Andersen, P. et al. Precardiac organoids form two heart fields via Bmp/Wnt signaling. Nat. Commun. 9, 314–318 (2018).
Article CAS Google Scholar
Li, Y. I. et al. Annotation-free quantification of RNA splicing using LeafCutter. Nat. Genet. 50, 151–158 (2018).
Article CAS PubMed Google Scholar
Ongen, H., Buil, A., Brown, A. A., Dermitzakis, E. T. & Delaneau, O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics 32, 1479–1485 (2016).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We would like to thank the research participants and employees of 23andMe for making this work possible. We would also like to thank François Aguet for providing the processed Leafcutter intron excision ratios for the splicing QTL analysis and Princy Parsana for reviewing the GWAS code. The UK BioBank data was obtained under the UK BioBank resource application 17712. Dr. Arvanitis was supported by NIH T32-HL007227 for this work. Dr. Tampakakis was supported by NIH K08- HL145135-01, AHA 19CDA34660077 and the Johns Hopkins Magic That Matters Fund. Drs. Chatterjeee, Dutta and Battle were supported by NIH R01-HG010480-01.

Author information

Authors and Affiliations

Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
Marios Arvanitis, Diptavo Dutta & Alexis Battle
Department of Medicine, Division of Cardiology, Johns Hopkins University, Baltimore, MD, USA
Marios Arvanitis, Emmanouil Tampakakis, Stephanie Glavaris, Ali Keramati & Wendy S. Post
Ludwig Institute for Cancer Research, San Diego, CA, USA
Yanxiao Zhang & Bing Ren
23andMe, Inc., Mountain View, CA, USA
Wei Wang, Adam Auton, Michelle Agee, Stella Aslibekyan, Robert K. Bell, Katarzyna Bryc, Sarah K. Clark, Sarah L. Elson, Kipper Fletez-Brant, Pierre Fontanillas, Nicholas A. Furlotte, Pooja M. Gandhi, Karl Heilbron, Barry Hicks, David A. Hinds, Karen E. Huber, Ethan M. Jewett, Yunxuan Jiang, Aaron Kleinman, Keng-Han Lin, Nadia K. Litterman, Jennifer C. McCreight, Matthew H. McIntyre, Kimberly F. McManus, Joanna L. Mountain, Sahar V. Mozaffari, Priyanka Nandakumar, Elizabeth S. Noblin, Carrie A. M. Northover, Jared O’Connell, Steven J. Pitts, G. David Poznik, J. Fah Sathirapongsasuti, Anjali J. Shastri, Janie F. Shelton, Suyash Shringarpure, Chao Tian, Joyce Y. Tung, Robert J. Tunney, Vladimir Vacic, Xin Wang & Amir S. Zare
Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Diptavo Dutta & Nilanjan Chatterjee
Department of Oncology, School of Medicine, Johns Hopkins University, Baltimore, MD, USA
Nilanjan Chatterjee
Department of Medicine, Division of Cardiology, University of California, San Diego, La Jolla, CA, 92093, USA
Neil C. Chi
School of Medicine, Institute of Genomic Medicine, University of California, San Diego, La Jolla, CA, 92093, USA
Neil C. Chi & Bing Ren
Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Wendy S. Post

Authors

Marios Arvanitis
View author publications
You can also search for this author in PubMed Google Scholar
Emmanouil Tampakakis
View author publications
You can also search for this author in PubMed Google Scholar
Yanxiao Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Wei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Adam Auton
View author publications
You can also search for this author in PubMed Google Scholar
Diptavo Dutta
View author publications
You can also search for this author in PubMed Google Scholar
Stephanie Glavaris
View author publications
You can also search for this author in PubMed Google Scholar
Ali Keramati
View author publications
You can also search for this author in PubMed Google Scholar
Nilanjan Chatterjee
View author publications
You can also search for this author in PubMed Google Scholar
Neil C. Chi
View author publications
You can also search for this author in PubMed Google Scholar
Bing Ren
View author publications
You can also search for this author in PubMed Google Scholar
Wendy S. Post
View author publications
You can also search for this author in PubMed Google Scholar
Alexis Battle
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

23andMe Research Team

Michelle Agee
, Stella Aslibekyan
, Robert K. Bell
, Katarzyna Bryc
, Sarah K. Clark
, Sarah L. Elson
, Kipper Fletez-Brant
, Pierre Fontanillas
, Nicholas A. Furlotte
, Pooja M. Gandhi
, Karl Heilbron
, Barry Hicks
, David A. Hinds
, Karen E. Huber
, Ethan M. Jewett
, Yunxuan Jiang
, Aaron Kleinman
, Keng-Han Lin
, Nadia K. Litterman
, Jennifer C. McCreight
, Matthew H. McIntyre
, Kimberly F. McManus
, Joanna L. Mountain
, Sahar V. Mozaffari
, Priyanka Nandakumar
, Elizabeth S. Noblin
, Carrie A. M. Northover
, Jared O’Connell
, Steven J. Pitts
, G. David Poznik
, J. Fah Sathirapongsasuti
, Anjali J. Shastri
, Janie F. Shelton
, Suyash Shringarpure
, Chao Tian
, Joyce Y. Tung
, Robert J. Tunney
, Vladimir Vacic
, Xin Wang
& Amir S. Zare

Contributions

M.A. conceptualized and designed the study, performed the analyses, interpreted the results, and wrote the manuscript. E.T. designed and performed the CRISPR hESC-CM experiments and reviewed and revised the manuscript. Y.Z., N. C.C., and B.R. performed the experiments and analyses of the HiC cardiomyocyte differentiation data, interpreted the results, and reviewed and revised the manuscript. W.W., A.A., and the 23andMe Research Team performed the analyses pertaining to GWAS replication, and reviewed and revised the manuscript. D.D. and N.C. performed the sensitivity analysis of the ABO locus in individuals without coronary disease, and reviewed and revised the manuscript. S.G. assisted with the CRISPR hESC-CM experiments, and reviewed and revised the manuscript. A.K. contributed to the design of the primary GWAS meta-analysis, and reviewed and revised the manuscript. W.S.P. contributed to the design of the study, the interpretation of the results, and reviewed and revised the manuscript. A.B. conceptualized and designed the study, provided oversight for the analyses, interpreted the results, and reviewed and revised the manuscript.

Corresponding author

Correspondence to Alexis Battle.

Ethics declarations

Competing interests

W.W., A.A., and the members of the 23andMe Research Team are employees of 23andMe Inc. All other authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Rajat Gupta, Thomas Quertermous and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Information

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Arvanitis, M., Tampakakis, E., Zhang, Y. et al. Genome-wide association and multi-omic analyses reveal ACTN2 as a gene linked to heart failure. Nat Commun 11, 1122 (2020). https://doi.org/10.1038/s41467-020-14843-7

Download citation

Received: 02 August 2019
Accepted: 27 January 2020
Published: 28 February 2020
DOI: https://doi.org/10.1038/s41467-020-14843-7

This article is cited by

Functional genomics in stroke: current and future applications of iPSCs and gene editing to dissect the function of risk variants
- Alessandra Granata
BMC Cardiovascular Disorders (2023)
Bioinformatics analysis of immune cell infiltration patterns and potential diagnostic markers in atherosclerosis
- Haigang Ji
- Ling Yuan
- Jing Chen
Scientific Reports (2023)
Evaluating 17 methods incorporating biological function with GWAS summary statistics to accelerate discovery demonstrates a tradeoff between high sensitivity and high positive predictive value
- Amy Moore
- Jesse A. Marks
- Eric O. Johnson
Communications Biology (2023)
Fine mapping spatiotemporal mechanisms of genetic variants underlying cardiac traits and disease
- Matteo D’Antonio
- Jennifer P. Nguyen
- Kelly A. Frazer
Nature Communications (2023)
A global high-density chromatin interaction network reveals functional long-range and trans-chromosomal relationships
- Ruchi Lohia
- Nathan Fox
- Jesse Gillis
Genome Biology (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.