Introduction

Echocardiographic assessment of cardiac structure offers prognostic information about cardiac conditions such as heart failure (HF)1,2. Pathological processes including cardiomyocyte cell death, inflammatory cell response and changes in interstitial tissue of the heart are factors leading to adverse remodelling and HF3.

A number of apoptotic genes have been investigated as potential targets to prevent cardiomyocyte death, but it is now increasingly evident that caspase-dependent cell death plays a minor if any role in adult myocyte loss4, which involves Cyclophilin D5 and calpains6. By contrast, caspase proteins are now recognized as important factors for initial differentiation of stem cells to cardiomyocytes7, and its deficiency in vivo was shown to induce abnormal heart development8,9. In rodent cardiomyocytes, caspase-3 is involved in WNT signalling and myocyte growth10,11, contributing also to muscle-specific gene splicing by cleaving PTB12. In addition, apoptotic DNA nucleases were shown to play a role in the developmental process of C.elegans including the C.elegans Caspase-associated DNase (CAD), ENDOG and TATD orthologues13. Furthermore, ENDOG also contributes to the signalling pathways determining myocyte size through the control of reactive oxygen radicals (ROS)14,15. These facts lead us to hypothesize that caspases and the nucleases ENDOG and TATD play relevant functions in cardiomyocyte proliferation and maturation during development.

Recent genome wide association studies have been performed for evaluating comprehensive sets of echocardiographic traits in well characterized individuals included in large cohort studies16,17. Increased left ventricular mass (LVM) is a well-established predictor of adverse cardiovascular events and premature death18,19. End-Diastolic LV Internal Dimension (LVID) and LV Wall Thickness (LVWT) are other measures of left ventricular hypertrophy (LVH), usually a pathological compensatory mechanism of the LV overload which evolves towards progressive LV dysfunction and HF. The principal role of the left atrium (LA) is to modulate LV filling and cardiovascular performance, being LA enlargement and dysfunction also predictors of cardiovascular events, especially in patients with atrial fibrillation (AF)20. The risk of aortic aneurysm is strictly related to the diameter of the ascending aorta, and recent studies have shown high prevalence of aortic root (AROT) enlargement in the hypertensive population21. Using data from publicly available repositories, we aimed to explore the association between a selected group of candidate apoptosis-related genes and these echocardiographic phenotypes by means of meta-GWAS. Furthermore, we aimed to assess previously reported signals in our study datasets and performed an agnostic analysis to investigate relevant pathways revealed for each trait. Following the leading results of this analysis, we further explored the unsuspected genetic relationship between Alzheimer’s disease (AD) and these echocardiographic traits, by estimating their genetic correlation, and identifying common genetic determinants of these conditions.

Materials and Methods

Experimental design

This study is aimed at identified genetic variants associated with echocardiographic traits, using both a candidate gene approach and an agnostic approach in retrospective datasets available in public repositories.

Study cohorts

The four cardiovascular datasets analysed in this study were downloaded from dbGAP (https://www.ncbi.nlm.nih.gov/gap) after requesting the appropriate permissions. In the case of a multi-ethnic study, only Caucasian samples after principal component analysis (PCA) were retained for analysis. A summary of the clinical characteristics of these populations is shown in Table 1. A full description of each of them is provided in the Supplementary Info.

Table 1 Study sample characteristics.

A total of seven AD datasets were used to further explore the observed enrichment of top genes derived from the analysis of echocardiographic traits on genes involved in AD pathways. As for previously described datasets, only Caucasian samples after principal component analysis (PCA) were retained for analysis. Demographic characteristics of these datasets are summarized in the Supplementary Table 1.

Phenotypes

Data from the most recent available echocardiographic examinations of each cohort were included in this study. The following five phenotypes were analysed: LVM (g), AROT (cm), LVID (cm), LA size LAS, cm), and LVWT (cm). The latter was defined as the sum of the End-Diastolic Thicknesses of the Posterior Wall (TPW) and End-Diastolic Thicknesses of the Interventricular Septum (TIS). LVM was calculated using the formula 0.8 [1.04{(LVID + TIS + TPW)3 −(LVID)3}] + 0.622.

Genotyping and imputation

The cardiovascular datasets included in this study were genotyped using different platforms: CARDIA and MESA were genotyped using the Affymetrix Genome-Wide Human 6.0 array, whereas the FHS was genotyped using the Affymetrix Human 500k array and the CHS cohort with the Illumina HumanCNV370-Duo v1.0.

AD datasets were genotyped using the Illumina arrays Human 610‐Quad BeadChip (ADNI1, AddNeuroMed batch 1), the HumanOmniExpress BeadChip (ADNI2/GO, AddNeuroMed batch 2, ADGC dataset 3), the Human660W-Quad (ADGC datasets 1 and 2) and the HumanHap300-Duo BeadChip (The Mayo study) or the Affymetrix 250k NspI (the Neocodex-Murcia study), 500k (the TGEN and GenADA studies) or 6.0 (ROSMAP study) arrays.

Prior to imputation, we first performed an extensive quality control excluding individuals with more than 3% missing genotypes, with excess autosomal heterozygosity (>0.35), those showing a discrepancy between genotypic and reported sex, as well as individuals of non-European ancestry based on PCA using SMARTPCA23. At the genotype level, we removed single nucleotide polymorphisms (SNPs) with missing genotype rate > 5%, not in Hardy-Weinberg equilibrium (p < 10−6) and SNPs with minor allele frequency (MAF) < 1%. Duplicated and related individuals were identified and removed by means of Identity By State (IBS) estimates within and across studies.

Genotype imputation is aimed at estimating unobserved genotypes using as reference known haplotypes from a well characterized population. Imputation was performed using the minimac3 algorithm and the SHAPEIT tool for haplotype phasing at the University of Michigan server using the HRC reference panel24. After imputation, SNPs with an R2 quality estimate lower than 0.3 were excluded from further analyses in accordance with software instructions.

Statistical analysis

All analyses were performed in Caucasian populations only. Individuals with prevalent myocardial infarction (MI) or congestive heart failure (CHF) were excluded from the study. Linear regression models available from PLINK software25 were fitted to investigate the association between genotypes and quantitative phenotypes, with age, sex, body mass index and the four principal components as covariates. For each phenotype, we obtained summary estimates of association across studies by using a fixed-effects model meta-analysis procedure implemented also in PLINK. For the genome wide SNP analysis, the conventional GWAS significance threshold was used (p = 5 × 10−8)26, whereas signals with p value < 10−5 were considered as suggestive of association and reported in the Supplementary Tables 26.

Gene-wise statistics were computed using MAGMA software, which takes into account physical distance and linkage disequilibrium (LD) between markers to estimate a summary gene p-value using a known approximation of the sampling distribution27. All SNPs with MAF above 1% were used in these analyses. At each trait, genes were ranked according to the global p mean value.

Top results from MAGMA genome wide analyses were tested for over-representation of genes involved in known pathways, functions and diseases. For these enrichment analyses, we applied a moderated threshold of p < 10−3, which has been showed to have a good power for detecting relevant pathways and functions while maintaining type 1 error controlled27. For these analyses, we used the R packages Webgestalt (GO terms, KEGG, Wikipathways and Reactome) and enrichR28,29 (Aging perturbations from GEO, Biocarta, Panther Disease perturbations from GEO Disease signatures from GEO, dbGaP and OMIM) with default parameters.

In order to explore genetic correlation between different traits we used a bivariate GREML analysis with GTCA software30. This method allows to identify pleiotropic gene effects associated with different diseases or disorders, providing a single measure of the proportion of shared genetic determinants. Furthermore, we obtained summary estimates of association across phenotypes performing unweighted meta-analysis of Fisher p-values.

Results

Overall, our study included data from 11,559 individuals with echocardiographic phenotypes from four different datasets (Table 1). Figure 1 illustrates the data analysis roadmap as explained in Materials and Methods. After imputation and quality control, we obtained about 7 million SNPs with MAF > 0.01 that were tested for association with echocardiographic traits at each study. We then performed a meta-GWAS to obtain summary estimates of association for each SNP. Genomic inflation factor (λ) ranged from 0.994 to 1.022 in these analyses, indicating absence of population stratification due to hidden population structure (Fig. 2). MAGMA software was used for summarizing the meta-GWAS SNP results in order to obtain a gene-wise statistic of the association between 18,480 genes and the five phenotypes.

Figure 1
figure 1

Data analysis workflow.

Figure 2
figure 2

Manhattan plots of the meta-analyses for the different echocardiographic traits. The threshold for genome-wide significance (P < 5 × 10−8) is indicated by the red line, while the blue line represents the suggestive threshold (P < 1 × 10−5). Loci previously associated with echocardiographic traits are shown in blue, and newly associated loci are shown in red.

Association of apoptosis-related genes with cardiac phenotypes

Because there is experimental evidence supporting the role of some apoptosis-related genes with cardiac development and disease, we first analysed the potential association of polymorphisms in a series of apoptosis-related genes with the cardiac phenotypes. Table 2 shows the association results of the 20 apoptosis-related candidate genes. Study-wide statistically significant results were observed for the association of a genetic locus on 2q33.1 involving two initiator caspases (CASP8 and CASP10) and the apoptosis regulator protein CFLAR (CASP8 And FADD Like Apoptosis Regulator, c-FLIP) with LVM. The same three genes were also linked to LVID with p < 0.05, along with the Fas receptor-associated adaptor FADD (Fas-associated protein with death domain) and BCL2 (B-cell lymphoma-2) genes. BCL2, FADD and TATDN1 (TatD DNase domain containing-1) showed a trend for association with AROT. We did not find evidence of association of any calpain family member with any of the analysed traits (data not shown).

Table 2 Candidate genes for echocardiographic traits.

Agnostic GWAS of genetic variants associated with cardiac phenotypes and enrichment analysis of top genes on echocardiographic traits

Although we did not find any GWAS significant signal at SNP-level (p < 5 × 10−8) related with the analysed phenotypes, we observed several suggestive signals at the p < 10−5 level (Supplementary Tables 26), most of them intragenic (Fig. 2). For each genotype, we additionally performed a gene-level, ranking genes according to the MAGMA computed SNP-wise p-value, which uses the full spectrum of SNP-level meta-analysis results (Table 3 and Supplementary Tables 711). Overall, these top genes from MAGMA gene level analyses (p < 10−3) showed little overlap across phenotypes (Fig. 3a), with the exception of TECTB/ACSL5 locus for LVID and LVWT, genes associated to cholesterol and fatty acid metabolism respectively, and ZNF678, a zinc finger protein involved in immune response, for LVM and LVWT genes. Enrichment analysis (Fig. 3b, Supplementary Tables 1216) suggested some processes underlying or related to one or more echocardiographic traits such as lymphocyte activity, cardiomyopathy, left ventricular hypertrophy or oxidative stress response Interestingly, enrichment analysis also suggested an over-representation of genes differentially expressed in AD samples versus control as well as genes involved in tau, presenilin and amyloid biology (APH1B, KLC3, KLC4, MAPK4, MAPK13;MAPK14, PPP2R5D, RBPJ, TCF7L1).

Table 3 Top ranked genes for each echocardiographic trait after genome wide association meta-analysis and gene-wise statistics calculation.
Figure 3
figure 3

Results from gene-wise analysis of echocardiographic traits. (a) Venn diagram showing the overlap of top genes of the different echocardiographic traits analysed. (b) Plot summarising main results of the enrichment analysis of top genes from gene-wise statistics. RF (rich factor): number of genes from the input list included in each category divided by the total number of genes in the category.

Genetic correlation analysis between echocardiographic traits and AD

The association of gene variants related to echocardiographic measures with mental illnesses, the increasing interest in the vascular aspects of AD and availability of diverse AD datasets, prompted us to explore in more depth the relationship between the echocardiographic phenotypes and AD. We performed a genetic correlation analysis using the study dataset (comprised by 11,559 individuals with echocardiographic phenotypes) along with 12,730 AD cases and controls both from internal and publicly available databases. First, we estimated the proportion of variance, as a proxy of trait heritability, explained by all SNPs in each one of these traits31, which was higher for AD (0.38) than for the echocardiographic phenotypes (range: 0.17-0.36). Then, we looked for shared genetic loci across the genome between echocardiographic traits and AD using GREML analyses as suggested by the enrichment analyses. Specifically, we observed a positive correlation between AD and LAS (rG = 0.167, p = 0.0334), and negative correlations between AD and LVID (rG = −0.196, p = 0.0056), and AD and LVM (rG = −0.198, p = 0.0165); of note, LVM and LVID are the most correlated echocardiographic traits (rG = 0.988, p < 0.00001) (Fig. 4a). The sign of the rG estimates determines whether a direct or inverse relationship between the two phenotype traits is observed. Therefore, our results suggest that SNPs that increase the risk of AD tend to be associated with increasing LAS values. On the contrary, we found that SNPs associated with increased risk of AD tend to be associated with decreasing ventricular measures (or vice versa), in particular LVID and LVM.

Figure 4
figure 4

Results from the combined analysis of echocardiographic traits and AD. (a) Genetic correlations (rG) between echocardiographic traits and AD. (b) Venn diagram showing the overlap of top genes derived from gene-wise association analysis. (c) Enrichment analysis of top genes from meta-analysis of echocardiographic traits and Alzheimer’s disease. RF (rich factor): number of genes from the input list included in each category divided by the total number of genes in the category.

Based on these findings we performed a SNP-wise meta-analysis by pooling Fisher association p-values of two or more phenotypes aimed at identifying genes contributing most to both heart measures and AD. Thus, we combined in these meta-analyses p-values for LAS&AD, LVID&AD, LVM&AD, LAS&LVID&LVM&AD and LVID&LVM&AD and calculated gene-wise statistics (Table 4, Supplementary Tables 1721). We observed a large overlap of genes falling under the p < 10−3 threshold in the different meta-analyses performed, with a group of 30 genes consistently associated in all of them, including the CASP8/CASP10/CFLAR locus and the GABRR1 GABA receptor (Fig. 4b). Consequently, apoptosis related pathways driven by the CASP8, CASP10 and CFLAR locus were persistently observed in the subsequent enrichment analyses (Fig. 4c, Supplementary Tables 2225). We also observed an enrichment on genes present at the neuronal synapse such as glutamate (GRM5), cholinergic (CHRNA2) and GABA receptors (GRIN2C, GABRR1, GABRR2 and GABBR2), teneurin (TENM2), calsyntenin (CLSTN3), or the cytoskeletal alpha-actin protein (ACTN2) for the diverse meta-analyses involving ventricular measures and AD.

Table 4 Top ranked genes for the combined analysis of echocardiographic traits and AD after genome wide association meta-analysis and gene-wise statistics calculation.

Discussion

Our study, based on data from 11,559 individuals free of cardiovascular disease, shows that variants affecting diverse genes involved in apoptosis regulation associate with echocardiographic phenotypes in humans. We have obtained these results by both hypothesis-driven and agnostic approaches. In addition, novel findings from the analysis of our GWAS also include previously unnoticed associations of variants in genes involved in cell proliferation, DNA replication and mRNA splicing with left ventricular morphology. Our data also suggest the existence of a set of genes, mainly related to apoptosis/inflammation signalling, whose variants are associated with both cardiac phenotype and AD.

Our hypothesis-driven analysis showed that caspases 8 and 10 and the regulatory CFLAR (cFLIP) gene are strong predictors of LVM. Furthermore, our hypothesis-free approach found that caspase dependent pathways were overrepresented among top genes associated with the LVM phenotype. These results support our a priori hypothesis that the apoptotic signalling influences heart morphology with potential impact on heart performance. Our hypothesis was based in previous experimental work, including our own, showing that deficiency9,11 or overexpression32 of key apoptotic genes altered normal cardiomyocyte differentiation and heart development independently of cell death. Indeed, although these genes are best known for their role in regulating apoptotic cell death, experimental evidences show that the same genes also regulate myocyte proliferation, inflammation and hypertrophy in the heart9,11,33,34,35. Because cell death is not a major event during heart development, and based on the above experimental work, we suggest that the relationship between the apoptotic genes and the cardiac phenotype might involve non-apoptotic functions.

In order to estimate the robustness of our GWAS analysis, we asked whether we had reproduced some signals already observed in previous genetic studies. Indeed, we found the already published link between SMG6, TSR1 and SRR genes and AROT16,17, but failed to detect statistically significant associations for other gene variants, possibly due to the limited power of this study. Given the strong influence of age in echocardiographic parameters, the diverse population structure of the study cohorts could also affect our power in the meta-analysis, with signals not replicating in populations largely differing in their median age. However, our analysis demonstrated genetic association between variants of genes previously shown to influence heart function and cardiac hypertrophy in experimental models, such as MLF136, which associates with LVM phenotype in our study, and KCNIP2 (KChIP2)37,38,39 and TRAF3IP240,41, which have also been associated with LVWT and LVM phenotypes respectively in humans and in this study as well. Also, from the top list of genes whose variants are associated to LIVD, ANKS6 has been associated with heart malformations42.

The GWAS analysis also showed association of variants of a group of odorant receptors genes located in the 11p15.4 chromosomic region coding (subfamilies 51, 52 and 56) or in chromosome 16 (OR2C1) with LVWT and LVID. Expression of these genes in non-neural tissues is related to the control of different processes, including glucose and oxygen homeostasis or cell cycle control43, and has been shown to be involved in the regulation of cardiac function in rodent experimental models through interaction with fatty acids44. Therefore, our genetic results open the possibility that odorant receptors’ activity can influence cardiac function in humans. Interestingly, low OR expression has been found in the cortex of neuro-psychiatric patients45 and a genetic microduplication in the 11p15.4 region has been associated with familial intellectual disability and autism46.Unexpectedly, we found enrichment on AD related pathways for LAS, LVID and LVM that led us to explore comprehensively a possible link between AD and all these echocardiographic traits. Intriguingly, our results revealed for the first time a potential genetic link between AD and LAS, paired with a negative correlation between AD and either LVM or LVID. Interestingly, in line with our observation, recent reports found that LAS and LVID were independently associated to cognitive function in older adults, being predictors of cognitive decline after 14 years of follow-up47,48. The opposite direction of the correlation coefficients for atrial and ventricular measures could be related to the different development patterns of these chambers during embryogenesis49 and with the described strong association of left atrial size with long-term exposure to vascular risk factors, particularly high blood pressure and obesity. In fact, the CARDIA Brain MRI Substudy found association of higher left atrial volume in early adulthood with impairment of white matter integrity in midlife, but not for ventricular measures50.

A recent GWAS by our group on AD clinical endophenotypes revealed that vascular processes were the main causal mechanisms in pure AD51. But our findings are compatible to a broader definition of- dementia which includes other diseases such vascular dementia, fronto-temporal dementia or Lewy Body diseases among others. In fact, the rule is the presence of multiple pathologies in the brain of people with dementia including pervasive vascular lesions52. The link between dementia and cardiac conditions is not well understood. The old concept of cardiogenic dementia was based on the high incidence of cardiac dysrhythmias observed in patients with dementia due to vascular causes53. However, the relation between coronary heart disease (CHD) or HF and AD in epidemiological studies remains controversial, with some studies showing an association with cognitive impairment and dementia54,55,56 whereas others found no association57,58. The fact that both conditions are competing risks complicates the study of their relationship. Lower cardiac index levels are related to lower cerebral blood flow in older adults free of CVD59, but individuals with cardiac conditions that did not result in premature death might include many individuals chronically exposed to brain hypoperfusion due to reduced cardiac output that adaptively decreased cerebrovascular resistance through arteriolar dilatation. This kind of antagonistic pleiotropy between these phenotypes has been previously suggested by Beeri et al. after observing that better cognitive performance was associated with worse cardiac functioning in very elderly subjects60.

Moreover, whereas enlarged ventricular volume (LV hypertrophy) is a marker of diastolic dysfunction, LVM is also a marker of cardiovascular health, positively correlated with physical activity and cardiorespiratory fitness61,62. Population based studies have shown an inverted U-shaped association for LVM values and age, since they rise in adolescence and decline with increased age63,64. Furthermore, a U-shaped association between left ventricular ejection fraction (LVEF), a marker of systolic dysfunction, and abnormal cognitive decline has been reported, with increased dementia risk at the lowest and highest LVEF quintiles65.

To our knowledge, this is the first report analysing shared genetic factors between echocardiographic measures and AD. This method for estimating genome-wide pleiotropy has the advantage of being free of potential confounders determined by shared epidemiological risk factors such as high blood pressure or atherosclerosis. Our results show a negative genetic correlation for the ventricular measures LVID and LVM and AD, pointing to antagonist pleiotropic effects of shared genes by AD and LV cardiac measures, the main known functions of which are related to apoptosis (CASP8, CASP10, CFLAR, PHIP) and neurotransmission (GABRR1). Interestingly, some CASP8 variants have been previously associated with AD66, and the protein has been shown to be involved in amyloid related pathways and activated in both blood and brain cells from AD patients1,67,68,69. In our study, the GABA receptor GABRR1 was consistently associated in all meta-analyses involving heart traits and AD. Neurotransmission affects both cardiac and neuronal performance, and a few studies have examined synapse and neuron loss in AD brains and suggested that synaptic changes precede neuron loss70,71. In fact, GABA signalling is reduced in AD and other mental disorders such as dementia with Lewy bodies or frontotemporal lobar degeneration72,73,74. By contrast, augmented inhibitory GABAergic neurotransmission has been reported in animals models of HF and LVH through the involvement of the paraventricular nucleus of the hypothalamus75,76.

In summary, our GWAS data suggest the influence of gene variants affecting the apoptotic/inflammation signalling pathway on left ventricular morphology and cardiac function, uncover novel candidate gene variants regulating echocardiographic phenotypes and establish a genetic link between cardiac morphology alterations, mental illness and AD involving key genes in the regulation of apoptotic signalling that deserve functional assessment due to their diagnostic and therapeutic potential. These results should be replicated in larger datasets in order to confirm the observed pleiotropic effects for genes associated with echocardiographic traits.