Network analysis of coronary artery disease risk genes elucidates disease mechanisms and druggable targets

Lempiäinen, Harri; Brænne, Ingrid; Michoel, Tom; Tragante, Vinicius; Vilne, Baiba; Webb, Tom R.; Kyriakou, Theodosios; Eichner, Johannes; Zeng, Lingyao; Willenborg, Christina; Franzen, Oscar; Ruusalepp, Arno; Goel, Anuj; van der Laan, Sander W.; Biegert, Claudia; Hamby, Stephen; Talukdar, Husain A.; Foroughi Asl, Hassan; Pasterkamp, Gerard; Watkins, Hugh; Samani, Nilesh J.; Wittenberger, Timo; Erdmann, Jeanette; Schunkert, Heribert; Asselbergs, Folkert W.; Björkegren, Johan L. M.

doi:10.1038/s41598-018-20721-6

Download PDF

Article
Open access
Published: 21 February 2018

Network analysis of coronary artery disease risk genes elucidates disease mechanisms and druggable targets

Harri Lempiäinen¹,
Ingrid Brænne²^na1,
Tom Michoel ORCID: orcid.org/0000-0003-4749-4725^3,4^na1,
Vinicius Tragante⁵^na1,
Baiba Vilne^6,7^na1,
Tom R. Webb ORCID: orcid.org/0000-0001-5998-8226⁸^na1,
Theodosios Kyriakou⁹,
Johannes Eichner¹,
Lingyao Zeng⁶,
Christina Willenborg²,
Oscar Franzen ORCID: orcid.org/0000-0002-7573-0812¹⁰,
Arno Ruusalepp⁴,
Anuj Goel ORCID: orcid.org/0000-0003-2307-4021⁹,
Sander W. van der Laan ORCID: orcid.org/0000-0001-6888-1404¹¹,
Claudia Biegert¹,
Stephen Hamby⁸,
Husain A. Talukdar¹²,
Hassan Foroughi Asl ORCID: orcid.org/0000-0002-6925-9040¹²,
CVgenes@target consortium,
Gerard Pasterkamp^11,13,
Hugh Watkins⁹,
Nilesh J. Samani⁸,
Timo Wittenberger ORCID: orcid.org/0000-0002-7158-7887¹,
Jeanette Erdmann ORCID: orcid.org/0000-0002-4486-6231²,
Heribert Schunkert^6,7,
Folkert W. Asselbergs ORCID: orcid.org/0000-0002-1692-8669^5,14 &
…
Johan L. M. Björkegren^4,10,12

Scientific Reports volume 8, Article number: 3434 (2018) Cite this article

4512 Accesses
41 Citations
7 Altmetric
Metrics details

Subjects

Abstract

Genome-wide association studies (GWAS) have identified over two hundred chromosomal loci that modulate risk of coronary artery disease (CAD). The genes affected by variants at these loci are largely unknown and an untapped resource to improve our understanding of CAD pathophysiology and identify potential therapeutic targets. Here, we prioritized 68 genes as the most likely causal genes at genome-wide significant loci identified by GWAS of CAD and examined their regulatory roles in 286 metabolic and vascular tissue gene-protein sub-networks (“modules”). The modules and genes within were scored for CAD druggability potential. The scoring enriched for targets of cardiometabolic drugs currently in clinical use and in-depth analysis of the top-scoring modules validated established and revealed novel target tissues, biological processes, and druggable targets. This study provides an unprecedented resource of tissue-defined gene–protein interactions directly affected by genetic variance in CAD risk loci.

Refining the impact of genetic evidence on clinical success

Article Open access 17 April 2024

Genome-wide association studies

Article 26 August 2021

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Introduction

Genome-wide association studies (GWAS) have identified over 200 genome-wide significant and suggestive risk loci for coronary artery disease (CAD)¹. Most of the CAD associated variants are in non-coding regions and likely affect disease development by regulating gene expression². Several candidate genes regulated by lead risk SNPs have been suggested, predominantly by studies of expression quantitative trait loci (eQTLs)^2,3,4. However, the target tissues, biological processes, and pathways through which these loci affect CAD etiology are largely unknown. For example, although it has been postulated that many loci appear to affect CAD by regulating genes in the arterial wall¹, presumably by influencing the predominant disease process in CAD, atherosclerosis, no druggable gene targets addressing this aspect of atherosclerosis have yet been identified^2,5. Moreover, although CAD genes by virtue of their cellular function in themselves may not be druggable, neighboring genes in the same subnetwork or pathway may very well be.

Thus in the post-GWAS era, we need to go beyond the genetic susceptibility markers identified by GWAS, and understand how these markers actually affect disease etiology⁶. This is not a trivial task, as it is unclear whether these risk loci harbor one or several disease-causing genes and whether the effects of these genes on disease are mediated in one or several tissues. Therefore, a systems genetics approach is an unbiased way not only to better understand the molecular pathophysiology of individual GWAS risk loci and candidate genes (as provided by the subnetwork analyses) but also to determine the main target tissue(s) of a risk locus.

In this study, we first used several bioinformatics strategies to prioritize candidate genes in CAD risk loci⁴. Then, we sought for immediate neighbors of genes in the affected functional gene-networks by analyzing the unique seven-tissue Stockholm Atherosclerosis Gene Expression (STAGE)⁷ datasets (GSE40231). We integrated these functional gene-networks with data on known protein–protein interactions⁸ to infer regulatory-gene and protein networks (RGPNs) in and across the seven metabolic and vascular tissues relevant to CAD. Within these RGNPs, we computed subnetworks (“modules”) using Girvan-Newman algorithm⁹ and sought those that contained at least one of the CAD candidate genes. The motivation for going beyond individual CAD candidate genes but rather to identify modules with several genes/proteins surrounding the CAD candidate gene in the networks is driven by two principal concepts. First, unlike an isolated CAD candidate gene, the module is a community of co-expressed and interacting genes and proteins and may suggest how the locus drives CAD etiology. Second, although the CAD candidate genes themselves may not be druggable, the entire module or specific neighboring genes to the CAD candidate gene may be.

Next, to assess importance and reliability for CAD, each module was scored according to the proximity of its node/gene to the CAD candidate gene(s), tissue expression patterns (CAD vs common drug toxicity tissues) of the genes, presence of a CAD mouse phenotype, and druggability. High-scoring modules were further scrutinized to assess their content of known targets for cardiovascular drugs, general drug target enrichment, and biological functions according the gene ontology (GO).

Methods

Gene Prioritization

Genome-wide significant (P < 5 × 10⁻⁸) and genome-wide suggestive CAD SNPs were collected from recent association studies^1,2,10,11. The genome-wide suggestive SNPs included variants with a false-discovery rate (FDR) ≤ 5%¹ and variants identified in the discovery analysis in “Exome array” studies^10,11. SNPs in high linkage disequilibrium (LD) (r² > 0.8) with each lead variant was identified with the 1000 Genomes phase 1 v3 ALL reference panel. We used a similar approach to a previous study², genes were linked to CAD loci in three ways: (1) proximity to lead and high LD SNPs ascertained by searching the ENSEMBL database (GRCh38) for RefSeq annotated genes, (2) a long-range interaction between a chromosomal region containing a gene and the region containing the CAD-associated SNPs¹², and (3) the eQTL most significantly associated with the CAD lead SNP or proxy^{13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29}. A “Prioritized Gene List” was generated as described in the Results section. GWAVA software was used to identify transcribed SNPs and SNPs that alter the coding sequence³⁰. Deleterious SNPs were predicted with SIFT³¹, PolyPhen³², and CADD³³. Mouse phenotypes were assigned if the gene had been reported to cause an atherosclerosis phenotype³⁴. All gene symbols were confirmed with the HGNC Multi-symbol checker tool.

To prioritize the most likely causal gene at each locus, we scored each of the 475 genes using six criteria for functional relations: the SNP (1) has a chromatin interaction with the gene, (2) is transcribed, (3) causes a coding change, (4) is an eQTL, and (5) is predicted to be deleterious and (6) the mouse knockout of the gene has an atherosclerosis phenotype³⁴. We assigned each category a weight of 2^(L-1), where L is the rank of the category, and the score for each gene was the sum of the weighted score across the six categories (Supplementary Table 1A). The top-scoring gene at each locus was considered the most likely to be causal.

Network Construction

To refine and further supplement the cross-tissue co-expression networks from the Stockholm Atherosclerosis Gene Expression (STAGE) study^7,35, we added tissue-specific protein–protein interactions (PPIs) from the ConsensusPathDB (http://consensuspathdb.org) database⁸, which contains 261,085 protein interactions from 19 different resources, including IntAct³⁶, HPRD³⁷, and BioGRID³⁸. ConsensusPathDB assigns a confidence score to each binary PPI, an aggregate score based on network-topological and annotation features (e.g., literature evidence, pathway and Gene Ontology co-annotation), with scores <0.5 denoting low confidence and those >0.95 denoting high confidence. Otherwise, the interactions were considered to be of medium confidence. We selected PPIs with confidence scores ≥0.5. To ensure the tissue-specificity of PPIs, we used gene expression data from tissues in the STAGE study⁷: atherosclerotic arterial wall (AAW), internal mammary artery (IMA), liver (LIV), skeletal muscle (SM), subcutaneous fat (SF), and visceral fat (VF). For each cross-tissue co-expression network gene, we added its protein interaction partners to the network only if both interactors were expressed in the same tissue. A gene was considered to be expressed in a particular tissue if its median expression signal was above the background level (i.e., Robust Multi-array Average (RMA) log2 intensity values ≥5.0) across all samples. Subsequently, to increase network connectivity, we also searched for PPIs within these added nodes (i.e., the first neighbors of each cross-tissue co-expression network gene), again requiring that both interactors be expressed in the same tissue.

Informed consent was obtained from all participants in the STAGE study and all the methods were approved by the Ethics Committee of the Karolinska University Hospital (Ethical approval Dnr 02-004) and performed in accordance with relevant guidelines and regulations⁷.

Module Identification and Scoring

Gene modules within networks were identified with the Girvan-Newman algorithm⁹, implemented in R igraph (http://igraph.org)³⁹. Modules were defined as subsets of vertices within which vertex–vertex connections were dense but between which such connections were less dense. The Girvan-Newman algorithm detects modules hierarchically by progressively removing edges from the original network. Hence, the algorithm focuses on edges that are least central and thus most likely to be “between” modules (for details, see⁹).

The scoring of the extracted modules was performed in R (R version 3.0.2, Bioconductor version 2.13)^40,41, unless otherwise specified. BioMart^42,43 was used to ensure the gene names matched HGNC nomenclature. Gene-wise scoring was done individually for each selected network containing modules of interest. The integrated score was the sum of the scores in the following six categories:

(1)
The distance to a CAD candidate gene. The shortest path of each network gene to a CAD candidate gene (including both the genome-wide significant and suggestive loci associated genes) was calculated with the shortest.paths function in the R/Bioconductor package igraph. The CAD candidate genes used are listed in Supplementary Table 1B. If a gene was a CAD candidate gene (i.e., the distance was zero), it was assigned a score of 4. If the distance was one, the gene was assigned a score of 2. If the distance was two, the gene was assigned a score of 1.
(2)
The expression level of a gene in CAD tissues relative to that in toxicologically relevant tissues (Tox tissues), defined as those with common drug side effects. The CAD tissues were those used in the STAGE study (liver, skeletal muscle, visceral fat, subcutaneous fat, whole blood, atherosclerotic arterial wall, and unaffected arterial wall); RMA condensed Affymetrix gene expression values were used (provided by CGN). The Tox tissues were liver, heart, kidney, cerebral cortex, pancreas, and testis; the FPKM (fragments per kilobase per million) gene-level condensed RNA-seq was downloaded from the EBI-EMBL Expression Atlas (http://www.ebi.ac.uk/gxa/experiments/E-MTAB-1733). Since liver is both a CAD tissue and a Tox tissue, it was not assigned to either category but was used instead for normalization to facilitate the comparison of Affymetrix data and RNA-seq data.

For STAGE Affymetrix data, the RMA values per tissues were averaged (median) and normalized to expression in the reference tissue (liver) by taking the ratio. Data from atherosclerotic arterial wall and unaffected arterial wall were merged by taking the average. The average (mean) expression value for all CAD tissues was calculated from the normalized tissue expression values.

For RNA-seq data, the zero values were replaced by an arbitrary minimum FPKM value of 0.5 (to avoid data loss upon log transformation), and genes containing “LOWDATA” or “FAIL” tags were removed. For non-unique genes, average (mean) FPKM values were calculated. The tissue expression values were normalized by calculating the ratio to expression in the reference tissue (liver). The average (mean) expression value for Tox tissues was calculated from the normalized tissue expression values.

To correct for different dynamic ranges, the normalized expression values for CAD and Tox tissues were transformed to an interval [0,1]. For genes present in both data sets, the ratio of CAD expression to Tox expression was calculated. The scores were scaled between −2 and 2 to yield the desired weight for the summary score.
(3)
Vascular expression. All STAGE Affymetrix CEL files from the NCBI Gene Expression Omnibus (GEO) (series GSE40231) were loaded into Genedata Analyst software (version 8.2.4a) and MAS5 condensed. Samples from internal mammary artery and atherosclerotic aortic wall were selected. A gene was considered to be expressed in a particular vascular tissue if one or more of its probes was “present” as judged from MAS5 absent/present calls in ≥50% of samples. If a gene was present, it was assigned a score of 1.
(4)
Kinases and G-protein-coupled receptors (GPCRs). A list of human and mouse protein kinase coding genes from the Uniprot (http://www.uniprot.org/docs/pkinfam.txt) database was downloaded (on 9.5.2014, release 2014_04). A list of GPCR coding genes from the IUPHAR/BPS Guide to Pharmacology (http://www.guidetopharmacology.org/GRAC/GPCRListForward?class=A) was downloaded on 8.5.2014 (file targets_and_families.csv). All gene names were checked with HGNC Multi-symbol Checker (http://www.genenames.org/cgi-bin/symbol_checker). The lists of kinases and GPCRs were combined and collapsed to include only unique gene names (including HGNC and non-HGNC converted names). A score of 2 was assigned to each kinase and GPCR coding gene.
(5)
Existing drugs targeting the gene product. The genes measured in the STAGE study and the proteins/genes they interact with (from the ConsensusPathDB) were checked with the HGNC Multi-symbol Checker. All unique gene names (including HGNC and non-HGNC converted names) were listed and checked for drug interactions in DGIdb v2.22 (Interaction.tsv file). Genes with at least one drug in the database were assigned a score of 2.
(6)
Mouse phenotype. A list of all the genes that had atherosclerosis phenotypes in mice was obtained from the University of Utrecht³⁴. The gene names were converted to official gene symbols with HGNC Multi-symbol Checker, and a list of all the unique gene names (including HGNC and non-HGNC converted names) was compiled. Genes with atherosclerosis mouse phenotypes were assigned a score of 1.

To score the resulting network modules, we selected those containing at least one GWAS hit and summed the integrated scores of all genes in the module. The module score was then normalized to the number of genes in the module:

$${Module}\_{Score}=\frac{\sum Integrated\_Gene\_Score}{Module\_Size}$$

Functional Enrichment Analysis

To analyze the functional enrichment of network modules, we downloaded Gene Ontology (GO) (http://www.geneontology.org)⁴⁴ annotations from the AmiGO 2 browser (http://amigo.geneontology.org/amigo). The GO project provides information about gene product function by using ontologies to represent biological knowledge in three classes: (1) molecular functions, (2) the biological processes they contribute to, and (3) the cellular locations where they occur (cellular components). Fisher’s exact test⁴⁵ in R (http://www.r-project.org) was used to calculate the statistical significance of the overlaps; thereafter, the Benjamini-Hochberg (BH) procedure⁴⁶ was used to control for the FDR.

Network Visualization

Networks were visualized with the yFiles organic layout algorithm in Cytoscape v3.2.1⁴⁷.

Drug Enrichment Analysis

To test for drug target enrichment, we identified all genes targeted by drugs in the DGIdb⁴⁸. For each module, the enrichment was calculated by using Fisher’s exact test to compare the number of genes in the module targeted by drugs to the total number of genes targeted by the drugs. We used two approaches. First, we did an enrichment analysis, with Fisher’s exact test, for general drug targets and known cardiometabolic drug targets (Supplementary Table 5). In the analysis where the enrichment of cardiometabolic drug targets was compared to other drug targets, the gene was considered cardiometabolic drug target if one or more cardiometabolic drugs targeted the gene, and other drug target if no cardiometabolic drugs targeted the gene and the gene was on the list of all genes targeted by drugs derived from the DGIdb⁴⁸. Second, we created a dictionary of Anatomical Therapeutic Chemical Classification System (ATC) codes based on information from MedNet INN⁴⁹ and ATC/DDD⁵⁰. ATC codes reflect standard practices in drug administration, according to diseases for which each drug is most prescribed. Using these ATC codes, we created “drug sets”, a concept similar to gene sets but applied to drugs prescribed for the same human system. Fisher’s exact test was used to test enrichment in specific systems.

Results

Prioritizing CAD Candidate Genes in Risk Loci Identified by GWAS

We used integrative bioinformatics to link possible causal genes to CAD-associated SNPs². In brief, we first collected 264 genome-wide significant and suggestive CAD SNPs from the most recently reported GWAS^1,10,11. These variants corresponded to 213 independent association signals based on LD of r² < 0.2. We then identified all proxy SNPs in high LD (r² > 0.8) with the lead variants. Next, we linked candidate genes to lead SNPs and high LD proxies based on proximity, association with gene expression^{13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29} and long-range interaction with trans-genes outside the locus¹². This approach linked 475 genes to the CAD variants (Supplementary Table 1A).

We then scored each of the genes based on functional evidence and prioritized the most likely causal gene at each locus (Supplementary Table 1A) (for more details see Methods). In total, we prioritized 184 unique CAD candidate genes identified in this fashion, of which 68 were at genome-wide significant loci (Supplementary Table 1B). Some loci were linked to more than one equally scoring gene, and no candidate gene could be assigned to 49 SNPs, as no gene had a score above zero or were excluded as the linked gene did not have an HGNC gene name.

Identifying and Scoring Subnetworks (Modules) Containing CAD Candidate Genes

To investigate the molecular mechanisms, pathways, and regulatory networks in which the prioritized CAD genes operate within and across CAD-relevant tissues to affect CAD risk, we identified tissue-specific subnetworks containing CAD candidate genes as shown in Fig. 1A. In this process, first, we inferred regulatory gene networks (RGNs) from the 171 multi-tissue gene co-expression networks from the STAGE study^7,35. Second, tissue-specific nodes of the RGNs were extended with published, high-confidence protein-protein interactions (PPIs) from the ConsensusPathDB database⁸ to create regulatory gene protein networks (RGPNs). The CAD relevance, as assessed by enrichment of CAD candidate genes, of adding PPIs to STAGE gene-networks was confirmed by comparing modules inferred from both STAGE data and PPIs, to modules inferred from PPIs alone (i.e. building modules from PPIs alone, without using STAGE coexpression networks as prior). The details and results from the comparison are described in the Supplementary Material (Page 1). Third, subnetworks (modules) were computed from each RGPN⁵¹, resulting in 953 modules whereof 449 were found to contain at least one CAD candidate gene (Supplementary Table 2). However for the subsequent analysis, we focused on the 286 modules containing at least one of the 68 prioritized CAD genes linked to a genome-wide significant risk loci (P < 5 × 10⁻⁸) (Supplementary Tables 1B and 2). Among these, the most “promising” modules from a CAD perspective were identified by calculating a CAD-feasibility score (Fig. 1B), which is an average module score based on individual scores of each of the module genes according to four criteria: (1) proximity to the module CAD candidate gene(s), (2) genetically modified in mice with an atherosclerosis phenotype, (3) target tissue (i.e., toxic or CAD-relevant), and (4) druggability (e.g., GPCRs and kinases as the two largest druggable gene families in human genome⁵²). Based on the weights assigned to each scoring category (see Methods), the score can range from −2 to 12. The 25 top-scoring modules are shown in Table 1.

Table 1 Key features of the top 25 modules.

Full size table

Known Cardiometabolic Drug Targets Validate Module Scoring Theme

Although we judge the categories for scoring the modules as adequate and reasonable, the scoring categories and weights are arbitrary by nature. We therefore sought means to assess the validity of the module-scoring theme by taking advantage of known target genes of drugs currently in clinical use for CAD and CAD-related risk factors retrieved from the websites of the U.S. Food and Drug Administration (fda.gov) and the open drug resource drugs.com. Reassuringly, many of the 286 modules were enriched for cardiometabolic drugs (Supplementary Table 2 and Supplementary Text page 2) and modules strongly enriched in gene/protein targets for these cardiometabolic drugs also tended to have high CAD-feasibility scores (Supplementary Figure 1). This trend was further reflected in that the top scoring quintiles (4^th and 5^th quintile) of the modules were significantly (Kolmogorov-Smirnov test) more enriched in genes targeted by cardiometabolic drugs than lower scoring quintiles (Fig. 2A). Moreover, the 25 top-scoring modules were found even further enriched in cardiometabolic drug targets (i.e., top 9^th percentile of modules with >4.5 points scores, versus the remaining 33 modules in the top quintile, 12.2 ± 9.9% vs. 5.2 ± 5.8%, P = 0.0016 Kolmogorov-Smirnov test, Table 1). The fact that high scoring modules were also often enriched for targets of non-cardiometabolic drugs (Table 1 and Supplementary Table 2) was not surprising since the scoring theme favors genes/proteins that are known drug targets. However the observation that the ratios of cardiometabolic to other drug targets were the highest in top scoring modules is noteworthy (Fig. 2B) as this underscores their possible relevance for understanding CAD and to identify novel targets associated with genetic risk.

Characteristics of 25 Top-scoring Modules and Drug Target Enrichment

The 25 top-scoring gene/protein modules had four to 111 nodes (Table 1). Seven modules were active in several CAD tissues in STAGE, and 18 modules were tissue-specific. Eight of 25 modules had only one CAD candidate gene, six had two respective genes, five had three, and six had as many as four (Table 1). Thus, although we identified an average of ~1.1 candidate genes per CAD risk locus (184 genes in 169 risk loci), many candidate genes/proteins were found in the same module, implying cumulative effects across independent loci³. Among the 25 top-scoring modules, 22 had an average of four genes/proteins targeted by cardiometabolic drugs; in 13 modules, the genes/proteins targeted by cardiometabolic drugs coincided with the CAD candidate gene (P < 0.05, Fisher’s exact test; Table 1 and Supplementary Table 5). Importantly, however, GO and pathway analyses (Supplementary Table 7) showed that the functional enrichments of these 22 modules mostly did not coincide with the pathway or tissue targeted by the drugs. Thus, cardiometabolic drugs in clinical use today may have unknown pleiotropic effects outside their assumed target tissues and pathways.

Drug Targets and Categories in the 25 Top-scoring Modules

We also tested for the distribution of drug categories that targeted any of the genes in the 25 top-scoring modules (Table 2). To this end, we used the ATC (Anatomic Therapeutic Chemical classification system) codes from MedNet INN⁴⁹ and ATC/DDD⁵⁰. As expected, we noted a significant enrichment (Fisher’s exact test right-tail P < 0.05) in cardiovascular drugs targeting the genes/proteins in four of the 25 top-scoring modules, and two modules were enriched in drugs targeting blood and blood-forming organs), with one overlap (module 84_3). However, the most often enriched ATC categories were antineoplastic and immune-modulating agents (in 13 of the 25 top modules), suggesting some of these CAD modules contain target genes/proteins for non-cardiometabolic drugs that may be repurposed for CAD.

Table 2 ATC groups most represented by drugs that target gene products of the top 25 modules.

Full size table

Enrichment of the 25 Top-scoring Modules in Biological Processes

To help establish the function, pathway, and druggability of CAD candidate genes, we annotated the 25 top-scoring modules by GO biological process (Table 1, Supplementary Table 7). All modules were enriched (P < 6.12 × 10⁻⁶, Fisher’s exact test) in GO categories that were largely non-overlapping between the modules, suggesting that each module has a relatively distinct biological function. However, several categories were over-represented, including extracellular matrix organization and disassembly, blood coagulation, platelet activation, innate immune response, complement activation, leukocyte migration, and various cellular signaling mechanisms. In Fig. 3 and as discussed below, four of the top-scoring 25 modules were highly interesting, as judged from their tissue-belonging, CAD candidate genes, and representative biological processes. The most relevant biological products of these pathways are involved in extracellular matrix organization, blood coagulation, platelet activation, innate immune response, complement activation and leukocyte migration. Similar characteristics for all 286 module are available in the Supplementary Material (See Supplementary Information).

Discussion

Herein, we applied comprehensive bioinformatics to a number of both public and unique datasets of CAD to create a first repository of network modules associated with inherited risk for CAD as identified by GWAS^1,10,11. In tissue-specific manner, these networks describe both established and new biological processes and pathways providing new insights into the mechanisms by which genetic risk affects CAD etiology. Furthermore, the network repository also delivers new insights of both CAD-established, and novel drugs and drugs targets that now can be further evaluated to reassess their effects on CAD and related risk factors. In sum, the identified network modules represent an unprecedented resource for tissue-specific gene–protein interactions directly affected by genetic variance in CAD risk loci and should help expedite their translation into new opportunities for CAD diagnostics and therapies.

Module 91_4 was highly enriched in GO categories related to lipoprotein and cholesterol metabolism (Fig. 3A, Supplementary Table 7) and was mainly active in visceral fat (Fig. 3A and Table 1). Its four CAD risk candidate genes (LDLR, APOE, SCARB1, and NOS3) (Fig. 3A and Supplementary Tables 1 and 8) are well-established in lipoprotein and nitric oxide/cGMP metabolism and have been identified as candidate genes by GWAS^1,53,54. The cardiometabolic drugs targeting LDLR and APOE (e.g. statins), are among the most widely used cardiometabolic therapies worldwide⁵⁵, and NOS3 genotypes are indirectly related to activators of soluble guanylyl cyclase (e.g. riociguat) or inhibitors of phosphodiesterase 5 A (e.g. sildenafil) (Table 1 and Supplementary Tables 5 and 8)⁵⁶. Thus, this module serves as a proof-of-concept for our analysis pipeline. This module also harbored genes/proteins that are known targets of antineoplastic and immunomodulating agents (five drugs, Fisher’s exact test right-tail P = 0.006; Table 2 and Supplementary Table 9). Several other module genes, including immediate neighbors of CAD candidate genes (e.g., CSNK2A1 and HTRA1) are also druggable (Fig. 3A and Supplementary Table 8) and thus may be repurposed for modulating the effects of this module in the context of atherosclerosis prevention.

Ten of the top 25 modules were found to be implicated in extracellular matrix organization and disassembly, blood coagulation, or platelet degranulation/activation (Table 1). Module 82_4, a representative example (Fig. 3B, Supplementary Table 7), has three CAD candidate genes: LRP1, COL4A1 and FN1 (Fig. 3B, Table 1, and Supplementary Tables 1 and 8). This module mainly contained genes expressed in subcutaneous fat (Fig. 3B and Table 1). Four first or second neighbors of the CAD candidate genes in this module are targets of current cardiometabolic drugs: TFPI (dalteparin), ITGB3 (abciximab and eptifibatide), MMP2 (captopril) and VEGFA (dalteparin) (Table 1, Supplementary Tables 5 and 8), and for these gene-drug pairs CAD relevant physiological effects have been reported (for more details and references see https://www.drugbank.ca)^{57,58,59,60,61,62,63}. These genes are good examples of module genes that are directly adjacent to GWAS CAD candidate genes and that may be alternative or better drug targets than the GWAS-derived risk genes themselves. The drugbank database (https://www.drugbank.ca) also reports CAD candidate gene LRP1 and its first neighbor CALR as targets for the fibrinolytic agent tenecteplase (recombinant tissue plasminogen activator tPA). These drug-gene links are based on the binding of the LRP1 and CALR proteins to tPA in in-vitro models^64,65 and the physiological relevance of these interactions have not been established (for further discussion on the limitations of drug-gene interaction databases see below). Moreover, this module contains many nodes that (1) are druggable (the CAD candidate genes FN1 and MMP9⁶⁶ and their neighbors such as THBS1), (2) have an atherosclerosis phenotype in mice, and (3) have a preferable non-toxic target tissue (Fig. 3B and Supplementary Table 8). This module also harbored nodes targeted by antineoplastic and immunomodulating agents (n = 7, Fisher’s exact test right-tail P = 0.001) and by drugs targeting the musculoskeletal system (n = 4, Fisher’s exact test right-tail P = 0.001) (Table 2 and Supplementary Table 9).

Module 134_1 is representative of four modules enriched in biological processes related to the innate immune response (Fig. 3C, Table 1, and Supplementary Table 7). It is a cross-tissue module involving both the arterial wall and skeletal muscle (Fig. 3C and Table 1) and contains four CAD candidate genes: RELA, TNF, SHC1, and LRP1 (Table 1 and Supplementary Tables 1 and 8). Pentoxifylline regulates TNF production and release from white blood cells^67,68, however the physiological and clinical relevance of this in terms of CAD has not been tested (Table 1 and Supplementary Tables 5 and 8). Several other nodes in the module (e.g., LYN and SYK) are druggable, have atherosclerosis phenotypes in mice, and have preferable target tissues (Supplementary Table 8) and thus may be interesting targets for new CAD therapies. Other well-established drugs targeting this module are antineoplastic and immunomodulating agents (n = 7, Fisher’s exact test right-tail P = 4.78 × 10⁻¹²) (Table 2 and Supplementary Table 9).

Several signaling pathways are among the top GO biological processes in the top-scoring modules (Table 1). For example, module 130_2 is enriched in pathways for epidermal growth factor receptor signaling, fibroblast growth factor receptor signaling, JAK-STAT cascade involved in growth hormone signaling, neurotrophin TRK receptor signaling, Fc-epsilon receptor signaling, cytokine-mediated signaling, and insulin receptor signaling (Fig. 3D and Supplementary Table 8). This module contains three CAD candidate genes (IGF1R, SHC1, and IL6R) and is specific to skeletal muscle (Table 1 and Fig. 3D). Unlike the preceding examples, this module is not targeted by current cardiometabolic drugs (Table 1). However, it contains several CAD candidate genes (e.g., IGFR1) and their immediate neighbors (e.g., CXCR4) that are druggable, have atherosclerosis phenotypes in mice, and act in a non-toxic tissue (Supplementary Table 8). The drugs targeting this module are mainly used as antineoplastic and immunomodulating agents (54 drugs, Fisher’s exact test right-tail P = 2.32 × 10⁻⁵⁸) (Table 2 and Supplementary Table 9).

As further validation and extension of our approach and findings, we had a more detailed look at the atherosclerosis related mouse phenotypes reported for the CAD risk candidate genes and their neighbors in the top modules^34,69. In module 91_4, all the four CAD risk candidate genes (LDLR, APOE, SCARB1, and NOS3) display protective role against atherosclerosis (i.e. gene deletion increases atherosclerosis risk/phenotype) by regulating cholesterol or blood pressure levels^70,71,72,73. Interestingly, several of their neighboring genes (LCAT, VLDLR, PLTP, APP) in the modules have driver roles for atherosclerosis (i.e. gene deletion decreases atherosclerosis risk/phenotype)^74,75,76,77 and thereby could be attractive targets for drug inhibition. Mouse models of several CAD risk candidate genes (e.g. SCH1, RELA and FN1) in modules 82_4, 130_2 and 134_1 display phenotypes related macrophage accumulation and foam cell formation in atherosclerotic lesions^78,79,80, thereby highlighting the roles of different immune system elements and signaling pathways these modules regulate in CAD. The mouse phenotypes also confirm that the modules contain attractive CAD drug target candidates, as many of the genes have a driver role for atherosclerosis (e.g. TNF and FN1) (for more details and references see⁶⁹).

The multifactorial etiology of CAD and the diversity of causative mechanisms involved makes it likely that multiple pathways have to be addressed by medical treatment in order to neutralize risk. In this sense, the resource presented here can provide new ideas for combination therapies for the treatment of CAD. Combination therapies could provide beneficial therapeutic effects either by acting within a single module or across several modules, depending to which extend the relevant mechanism is being affected. Within module co-treatment of two different node targets with two (or more) medicines could result in a more efficient reduction of a single CAD driving mechanism and cause. A potential example of this is co-treatment with statins and PCSK9 inhibitors to regulate cholesterol levels and atherosclerosis⁸¹, as PCSK9 was identified in the same module with statin targets LDLR and APOE (see module 112_1 in Supplementary Table 2 and Supplementary Material). Cross module co-treatment could potentially lead to reduction of more than one of the CAD driving mechanisms and causes (e.g. regulation of cholesterol levels and platelet functions) which could lead to efficient overall improvements in CAD patients. A potential example for this would be co-targeting of modules 91_4 and 82_4 with statins and anticoagulants, which could have beneficial effect for treating atherosclerosis and indeed has been shown to be beneficial in the recent COMPASS trial⁸². It is also feasible that modules like we have discovered in our study will help to find new potential candidates for drug repositioning in CAD.

In our approach, we prioritized genes coding for kinases and GPCRs as they represent the largest druggable gene families in the human genome. Hence the reported top modules were found to contain several kinases and GPCRs, both for which (1) drug compounds already exists (e.g. CXCR4 and CCR1 in module 130_2) in different stages of drug life cycle (development to market) and for which (2) current drug compounds are yet to be reported (e.g. TRIB3 in module 82_4). The first category provides potential opportunities for drug compound repurposing²⁵ and second one for new drug development in CAD.

We acknowledge several possible limitations to our study. First, for our prioritization of CAD genes there are limitations inherent by the available data. For example, some effects of disease-associated variants on gene expression will be missed as we did not have data from all CAD-relevant cell types or tissues. This is likely reflected in our results in that we were unable to link genes to every CAD locus. Also, the information we used from mouse studies is limited because not all mouse genes have been studied for CAD related phenotypes, and are further potentially biased since negative results are often not reported. We also chose to exclude the results we found from other animal models of atherosclerosis from the analysis, since these studies are sparse and often inspired by preceding mouse experiments. Thus, their inclusion would potentially further bias our results. Second, the assessments of the tissue specificity and cross-tissue signaling in the combined gene expression and PPI networks may have limitations as the PPI data did not derive from specific tissues nor from CAD individuals as does the STAGE data⁷. However, as outlined in the Supplementary Materials (Page 1), extending PPIs to existing nodes in the STAGE gene networks was essential to infer modules relevant to CAD. We also used other databases in addition to STAGE and PPIs with their own limitations such as lacking filtering mechanisms for excluding false positives and negatives. Furthermore, many of these datasets are under continuous development and refinement. Thus results presented here will be subjected to changes when rerun as the data sources then likely will have been updated. Finally, the reported links between pathways and drugs are often based on indirect or pleiotropic effects of questionable quantitative relevance. However, these limitations do not, in our view, invalidate any of the reported results or used methodology but merely point to the fact that they can be further refined in future re-evaluations.

In conclusion, we present a unique strategy, that is new and innovative, and represents a significant improvement and as such, a substantial contribution toward a more integrative understanding of the molecular processes driving CAD and its therapeutic modulation.

References

Nikpay, M. et al. A comprehensive 1,000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat. Genet. 47, 1121–30 (2015).
Article CAS PubMed PubMed Central Google Scholar
Brænne, I. et al. Prediction of causal candidate genes in coronary artery disease loci. Arterioscler. Thromb. Vasc. Biol. 35, 2207–2217 (2015).
Article PubMed PubMed Central Google Scholar
Franzén, O. et al. Cardiometabolic risk loci share downstream cis- and trans-gene regulation across tissues and diseases. Science 353, 827–30 (2016).
Article ADS PubMed PubMed Central Google Scholar
Deloukas, P. et al. Large-scale association analysis identifies new risk loci for coronary artery disease. Nat. Genet. 45, 25–33 (2013).
Article CAS PubMed Google Scholar
Miller, C. L., Pjanic, M. & Quertermous, T. From locus association to mechanism of gene causality the devil is in the details. Arterioscler. Thromb. Vasc. Biol. 35, 2079–2080 (2015).
Article CAS PubMed PubMed Central Google Scholar
Björkegren, J. L. M., Kovacic, J. C., Dudley, J. T. & Schadt, E. E. Genome-Wide Significant Loci: How Important Are They? J. Am. Coll. Cardiol. 65, 830–845 (2015).
Article PubMed PubMed Central Google Scholar
Hääg et al. Multi-organ expression profiling uncovers a gene module in coronary artery disease involving transendothelial migration of leukocytes and LIM domain binding 2: The Stockholm Atherosclerosis Gene Expression (STAGE) study. PLoS Genet. 5 (2009).
Kamburov, A., Stelzl, U., Lehrach, H. & Herwig, R. The ConsensusPathDB interaction database: 2013 Update. Nucleic Acids Res. 41 (2013).
Girvan, M. & Newman, M. E. J. Community structure in social and biological networks. Proc. Natl. Acad. Sci. 99, 7821–7826 (2002).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Myocardial Infarction Genetics and CARDIoGRAM Exome Consortia Investigators. Coding Variation in ANGPTL4, LPL, and SVEP1 and the Risk of Coronary Disease. N. Engl. J. Med. 374, 1134–44 (2016).
Webb, T. R. et al. Systematic Evaluation of Pleiotropy Identifies 6 Further Loci Associated With Coronary Artery Disease. J Am Coll Cardiol. 69, 823–836 (2017).
Article CAS PubMed PubMed Central Google Scholar
Jin, F. et al. A high-resolution map of the three-dimensional chromatin interactome in human cells. Nature 503, 290–4 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Dimas, A. S. et al. Common regulatory variation impacts gene expression in a cell type-dependent manner. Science 325, 1246–50 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Fairfax, B. P. et al. Innate Immune Activity Conditions the Effect of Regulatory Variants upon Monocyte Gene Expression. Science (80-.). 343, 1246949–1246949 (2014).
Article Google Scholar
Fehrmann, R. S. N. et al. Trans-eqtls reveal that independent genetic variants associated with a complex phenotype converge on intermediate genes, with a major role for the hla. PLoS Genet. 7 (2011).
Garnier, S. et al. Genome-Wide Haplotype Analysis of Cis Expression Quantitative Trait Loci in Monocytes. PLoS Genet. 9 (2013).
Gibbs, J. R. et al. Abundant quantitative trait loci exist for DNA methylation and gene expression in Human Brain. PLoS Genet. 6, 29 (2010).
Article Google Scholar
Grundberg, E. et al. Mapping cis- and trans-regulatory effects across multiple tissues in twins. Nat. Genet. 44, 1084–9 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hao, K. et al. Lung eQTLs to Help Reveal the Molecular Underpinnings of Asthma. PLoS Genet. 8 (2012).
Liang, L. et al. A cross-platform analysis of 14,177 expression quantitative trait loci derived from lymphoblastoid cell lines. Genome Res. 23, 716–726 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lonsdale, J. et al. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–5 (2013).
Article CAS Google Scholar
Montgomery, S. B. et al. Transcriptome genetics using second generation sequencing in a Caucasian population. Nature 464, 773–777 (2010).
Article ADS CAS PubMed Google Scholar
Myers, A. J. et al. A survey of genetic human cortical gene expression. Nat. Genet. 39, 1494–9 (2007).
Article CAS PubMed Google Scholar
Pickrell, J. K. et al. Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature 464, 768–772 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Schadt, E. E. et al. Mapping the genetic architecture of gene expression in human liver. PLoS Biol. 6, 1020–1032 (2008).
Article CAS Google Scholar
Stranger, B. E. et al. Population genomics of human gene expression. Nat. Genet. 39, 1217–24 (2007).
Article CAS PubMed PubMed Central Google Scholar
Veyrieras, J. B. et al. High-resolution mapping of expression-QTLs yields insight into human gene regulation. PLoS Genet. 4 (2008).
Westra, H.-J. et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat. Genet. 45, 1238–43 (2013).
Article CAS PubMed PubMed Central Google Scholar
Zeller, T. et al. Genetics and beyond–the transcriptome of human monocytes and disease susceptibility. PLoS One 5, e10693 (2010).
Article ADS PubMed PubMed Central Google Scholar
Ritchie, G. R. S., Dunham, I., Zeggini, E. & Flicek, P. Functional annotation of noncoding sequence variants. Nat. Methods 11, 294–6 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kumar, P., Henikoff, S. & Ng, P. C. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat. Protoc. 4, 1073–1081 (2009).
Article CAS PubMed Google Scholar
Adzhubei, I. A. et al. A method and server for predicting damaging missense mutations. Nat. Methods 7, 248–249 (2010).
Article CAS PubMed PubMed Central Google Scholar
Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315 (2014).
Article CAS PubMed PubMed Central Google Scholar
Pasterkamp, G. et al. Human validation of genes associated with a murine atherosclerotic phenotype. Arterioscler. Thromb. Vasc. Biol. 36, 1240–1246 (2016).
Article CAS PubMed Google Scholar
Talukdar, H. A. et al. Cross-Tissue Regulatory Gene Networks in Coronary Artery Disease. Cell Syst. 2, 196–208 (2016).
Article CAS PubMed PubMed Central Google Scholar
Orchard, S. et al. The MIntAct project - IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res. 42 (2014).
Keshava Prasad, T. S. et al. Human Protein Reference Database–2009 update. Nucleic Acids Res. 37, D767–D772 (2009).
Article CAS PubMed Google Scholar
Chatr-Aryamontri, A. et al. The BioGRID interaction database: 2015 update. Nucleic Acids Res. 43, D470–D478 (2015).
Article CAS PubMed Google Scholar
Csárdi, G. & Nepusz, T. The igraph software package for complex network research. InterJournal Complex Syst. 1695, 1695 (2006).
Google Scholar
Ihaka, R. & Gentleman, R. R: A Language for Data Analysis and Graphics. J. Comput. Graph. Stat. 5, 299–314 (1996).
Google Scholar
Gentleman, R. C. et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 5, R80 (2004).
Article PubMed PubMed Central Google Scholar
Durinck, S., Spellman, P. T., Birney, E. & Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 4, 1184–91 (2009).
Article CAS PubMed PubMed Central Google Scholar
Durinck, S. et al. BioMart and Bioconductor: A powerful link between biological databases and microarray data analysis. Bioinformatics 21, 3439–3440 (2005).
Article CAS PubMed Google Scholar
The Gene Ontology Consortium. Gene Ontology Consortium: going forward. Nucleic Acids Res. 43, D1049–D1056 (2015).
Fisher, R. A. On the Interpretation of χ² from Contingency Tables, and the Calculation of P. J. R. Stat. Soc. 85, 87–94 (1922).
Article Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57, 289–300 (1995).
MathSciNet MATH Google Scholar
Shannon, P. et al. Cytoscape: A software Environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar
Wagner, A. H. et al. DGIdb 2.0: Mining clinically relevant drug-gene interactions. Nucleic Acids Res. 44, D1036–D1044 (2016).
Article ADS CAS PubMed Google Scholar
Van Bever, E. et al. Operational rules for the implementation of INN prescribing. Int. J. Med. Inform. 83, 47–56 (2014).
Article PubMed Google Scholar
World Health Organization. World Health Organization. Guidelines for ATC classification and DDD assignment (1996).
Segal, E. et al. Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data. Nat. Genet. 34, 166–76 (2003).
Article CAS PubMed Google Scholar
Hopkins, A. L. & Groom, C. R. The druggable genome. Nat. Rev. Drug Discov. 1, 727–730 (2002).
Article CAS PubMed Google Scholar
Zanoni, P. et al. Rare variant in scavenger receptor BI raises HDL cholesterol and increases risk of coronary heart disease. Science (80-.). 351, 1166–1171 (2016).
Article ADS CAS Google Scholar
Erdmann, J. et al. Dysfunctional nitric oxide signalling increases risk of myocardial infarction. Nature 504, 432–6 (2013).
Article ADS CAS PubMed Google Scholar
Taylor, F. et al. Statins for the primary prevention of cardiovascular disease. Cochrane database Syst. Rev. 1, CD004816 (2013).
Google Scholar
Muniz, J. J. et al. Endothelial nitric oxide synthase genotypes and haplotypes modify the responses to sildenafil in patients with erectile dysfunction. Pharmacogenomics J. 13, 189–96 (2013).
Article CAS PubMed Google Scholar
Amoroso, G. et al. Eptifibatide and abciximab exhibit equivalent antiplatelet efficacy in an experimental model of stenting in both healthy volunteers and patients with coronary artery disease. J.Cardiovasc.Pharmacol. 38, 633–641 (2001).
Article CAS PubMed Google Scholar
Batchelor, W. B. et al. Randomized comparison of platelet inhibition with abciximab, tirofiban and eptifibatide during percutaneous coronary intervention in acute coronary syndromes: The compare trial. Circulation 106, 1470–1476 (2002).
Article CAS PubMed Google Scholar
Naumnik, B., Rydzewska-Rosolowska, A. & Mysliwiec, M. Different effects of enoxaparin, nadroparin, and dalteparin on plasma TFPI during hemodialysis: a prospective crossover randomized study. Clin. Appl. Thromb. Hemost. 17, 480–486 (2011).
Article CAS PubMed Google Scholar
Takahashi, H. et al. A comparison of the effects of unfractionated heparin, dalteparin and danaparoid on vascular endothelial growth factor-induced tumour angiogenesis and heparanase activity. Br. J. Pharmacol. 146, 333–343 (2005).
Article CAS PubMed PubMed Central Google Scholar
Norrby, K. & Nordenhem, A. Dalteparin, a low-molecular-weight heparin, promotes angiogenesis mediated by heparin-binding VEGF-A in vivo. APMIS 118, 949–957 (2010).
Article CAS PubMed PubMed Central Google Scholar
Yamamoto, D., Takai, S., Hirahara, I. & Kusano, E. Captopril directly inhibits matrix metalloproteinase-2 activity in continuous ambulatory peritoneal dialysis therapy. Clin. Chim. Acta 411, 762–764 (2010).
Article CAS PubMed Google Scholar
Brower, G. L., Levick, S. P. & Janicki, J. S. Inhibition of matrix metalloproteinase activity by ACE inhibitors prevents left ventricular remodeling in a rat model of heart failure. Am. J. Physiol. Heart Circ. Physiol. 292, H3057–64 (2007).
Article CAS PubMed Google Scholar
Hu, K. et al. Tissue-type plasminogen activator acts as a cytokine that triggers intracellular signal transduction and induces matrix metalloproteinase-9 gene expression. J. Biol. Chem. 281, 2120–2127 (2006).
Article CAS PubMed Google Scholar
Allen, S. & Bulleid, N. J. Calnexin and calreticulin bind to enzymically active tissue-type plasminogen activator during biosynthesis and are not required for folding to the native conformation. Biochem.J. 328(Pt 1), 113–119 (1997).
Article CAS PubMed PubMed Central Google Scholar
Brænne, I. et al. A genomic exploration identifies mechanisms that may explain adverse cardiovascular effects of COX-2 inhibitors. Sci. Rep. 7, 10252 (2017).
Article ADS PubMed PubMed Central Google Scholar
Pollice, P. F. et al. Oral pentoxifylline inhibits release of tumor necrosis factor-alpha from human peripheral blood monocytes: a potential treatment for aseptic loosening of total joint components. J. Bone Joint Surg. Am. 83–A, 1057–61 (2001).
Article PubMed Google Scholar
Marques, L. J., Zheng, L., Poulakis, N., Guzman, J. & Costabel, U. Pentoxifylline inhibits TNF-alpha production from human alveolar macrophages. Am. J. Respir. Crit. Care Med. 159, 508–11 (1999).
Article CAS PubMed Google Scholar
von Scheidt, M. et al. Applications and Limitations of Mouse Models for Understanding Human Atherosclerosis. Cell Metabolism 25, 248–261 (2017).
Article Google Scholar
Trigatti, B. et al. Influence of the high density lipoprotein receptor SR-BI on reproductive and cardiovascular pathophysiology. Proc. Natl. Acad. Sci. USA 96, 9322–9327 (1999).
Article ADS CAS PubMed PubMed Central Google Scholar
Ishibashi, S., Goldstein, J. L., Brown, M. S., Herz, J. & Burns, D. K. Massive xanthomatosis and atherosclerosis in cholesterol-fed low density lipoprotein receptor-negative mice. J. Clin. Invest. 93, 1885–1893 (1994).
Article CAS PubMed PubMed Central Google Scholar
Knowles, J. W. et al. Enhanced atherosclerosis and kidney dysfunction in eNOS(−/−)Apoe(−/−) mice are ameliorated by enalapril treatment. J. Clin. Invest. 105, 451–458 (2000).
Article CAS PubMed PubMed Central Google Scholar
Zhang, S. H., Reddick, R. L., Piedrahita, J. A. & Maeda, N. Spontaneous hypercholesterolemia and arterial lesions in mice lacking apolipoprotein E. Science (80-.). 258, 468–471 (1992).
Article ADS CAS Google Scholar
Lambert, G. et al. Analysis of Glomerulosclerosis and Atherosclerosis in Lecithin Cholesterol Acyltransferase-deficient Mice. J. Biol. Chem. 276, 15090–15098 (2001).
Article CAS PubMed Google Scholar
Van Eck, M. et al. Role of the macrophage very-low-density lipoprotein receptor in atherosclerotic lesion development. Atherosclerosis 183, 230–237 (2005).
Article PubMed Google Scholar
Yang, X. P. et al. Increased atherosclerotic lesions in ApoE mice with plasma phospholipid transfer protein overexpression. Arterioscler. Thromb. Vasc. Biol. 23, 1601–1607 (2003).
Article CAS PubMed Google Scholar
Tibolla, G. et al. Increased atherosclerosis and vascular inflammation in APP transgenic mice with apolipoprotein E deficiency. Atherosclerosis 210, 78–87 (2010).
Article CAS PubMed Google Scholar
Ye, X., Jiang, X., Guo, W., Clark, K. & Gao, Z. Overexpression of NF- B p65 in macrophages ameliorates atherosclerosis in apoE-knockout mice. AJP Endocrinol. Metab. 305, E1375–E1383 (2013).
Article CAS Google Scholar
Tan, M. H. et al. Deletion of the alternatively spliced fibronectin EIIIA domain in mice reduces atherosclerosis. Blood 104, 11–18 (2004).
Article CAS PubMed Google Scholar
Martin-Padura, I. et al. p66Shc deletion confers vascular protection in advanced atherosclerosi in hypercholesterolemic apolipoprotein E knockout mice. Endothel. J. Endothel. Cell Res. 15, 276–287 (2008).
CAS Google Scholar
Nicholls, Stephen J. et al. Effect of Evolocumab on Progression of Coronary Disease in Statin-Treated PatientsThe GLAGOV Randomized Clinical Trial. JAMA 1–12, https://doi.org/10.1001/jama.2016.16951 (2016).
Eikelboom, J. W. et al. Rivaroxaban with or without Aspirin in Stable Cardiovascular Disease. N. Engl. J. Med. NEJMoa1709118 https://doi.org/10.1056/NEJMoa1709118 (2017).

Download references

Acknowledgements

Several figure elements in Fig. 1B (panels II-IV) have been obtained and adapted from Servier Medical Art (www.servier.com) which are distributed under Creative Commons license (https://creativecommons.org/licenses/by/3.0/). The research leading to these results has received funding from the European Union Seventh Framework Programme FP7/2007–2013 under grant agreement n° HEALTH-F2-2013-601456 (CVgenes-at-target). Folkert W. Asselbergs is supported by a Dekker scholarship-Junior Staff Member 2014T001 – Netherlands Heart Foundation and UCL Hospitals NIHR Biomedical Research Centre.

Author information

Ingrid Brænne, Tom Michoel, Vinicius Tragante, Baiba Vilne and Tom R. Webb contributed equally to this work.

Authors and Affiliations

Genedata AG, Basel, Switzerland
Harri Lempiäinen, Johannes Eichner, Claudia Biegert & Timo Wittenberger
Institute for Cardiogenetics, Lübeck, Germany
Ingrid Brænne, Christina Willenborg & Jeanette Erdmann
Division of Genetics and Genomics, The Roslin Institute, University of Edinburgh, Edinburgh, United Kingdom
Tom Michoel
Clinical Gene Networks AB, Stockholm, Sweden
Tom Michoel, Arno Ruusalepp & Johan L. M. Björkegren
Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, University of Utrecht, Utrecht, The Netherlands
Vinicius Tragante & Folkert W. Asselbergs
Deutsches Herzzentrum München, Klinik für Herz- und Kreislauferkrankungen, Technische Universität München, Munich, Germany
Baiba Vilne, Lingyao Zeng & Heribert Schunkert
DZHK (German Centre for Cardiovascular Research), Munich Heart Alliance, Munich, Germany
Baiba Vilne & Heribert Schunkert
Department of Cardiovascular Sciences, University of Leicester and NIHR Leicester Biomedical Research Centre, Leicester, United Kingdom
Tom R. Webb, Stephen Hamby & Nilesh J. Samani
Division of Cardiovascular Medicine, Radcliffe Department of Medicine, John Radcliffe Hospital, Oxford, United Kingdom
Theodosios Kyriakou, Anuj Goel & Hugh Watkins
Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, USA
Oscar Franzen & Johan L. M. Björkegren
Laboratory of Experimental Cardiology, Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, University of Utrecht, Utrecht, The Netherlands
Sander W. van der Laan & Gerard Pasterkamp
Integrated Cardio Metabolic Centre, Department of Medicine, Karolinska Institutet, Karolinska Universitetssjukhuset, Huddinge, Sweden
Husain A. Talukdar, Hassan Foroughi Asl & Johan L. M. Björkegren
Laboratory of Clinical Chemistry and Hematology, Division Laboratories and Pharmacy, University Medical Center Utrecht, University of Utrecht, Utrecht, The Netherlands
Gerard Pasterkamp
Institute of Cardiovascular Science, Faculty of Population Health Sciences, University College London, London, United Kingdom
Folkert W. Asselbergs
Ludwig-Maximilians-Universität, Munich, Germany
Martin Dichgans, Rainer Malik & Matthias Prestel
4SC Discovery GmbH, Mainz, Germany
Tobias Dreker
Fraunhofer Gesellschaft Zur Forderung Der Angewandten Forschung Ev, Munich, Germany
Mira Graettinger & Philip Gribbon
Deutsches Herzzentrum München, Munich, Germany
Thorsten Kessler & Barbara Stiller
Horizon Discovery Limited, Cambridge, United Kingdom
Christine Schofield

Author notes

A comprehensive list of consortium members appears at the end of the paper

Authors

Harri Lempiäinen
View author publications
You can also search for this author in PubMed Google Scholar
Ingrid Brænne
View author publications
You can also search for this author in PubMed Google Scholar
Tom Michoel
View author publications
You can also search for this author in PubMed Google Scholar
Vinicius Tragante
View author publications
You can also search for this author in PubMed Google Scholar
Baiba Vilne
View author publications
You can also search for this author in PubMed Google Scholar
Tom R. Webb
View author publications
You can also search for this author in PubMed Google Scholar
Theodosios Kyriakou
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Eichner
View author publications
You can also search for this author in PubMed Google Scholar
Lingyao Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Christina Willenborg
View author publications
You can also search for this author in PubMed Google Scholar
Oscar Franzen
View author publications
You can also search for this author in PubMed Google Scholar
Arno Ruusalepp
View author publications
You can also search for this author in PubMed Google Scholar
Anuj Goel
View author publications
You can also search for this author in PubMed Google Scholar
Sander W. van der Laan
View author publications
You can also search for this author in PubMed Google Scholar
Claudia Biegert
View author publications
You can also search for this author in PubMed Google Scholar
Stephen Hamby
View author publications
You can also search for this author in PubMed Google Scholar
Husain A. Talukdar
View author publications
You can also search for this author in PubMed Google Scholar
Hassan Foroughi Asl
View author publications
You can also search for this author in PubMed Google Scholar
Gerard Pasterkamp
View author publications
You can also search for this author in PubMed Google Scholar
Hugh Watkins
View author publications
You can also search for this author in PubMed Google Scholar
Nilesh J. Samani
View author publications
You can also search for this author in PubMed Google Scholar
Timo Wittenberger
View author publications
You can also search for this author in PubMed Google Scholar
Jeanette Erdmann
View author publications
You can also search for this author in PubMed Google Scholar
Heribert Schunkert
View author publications
You can also search for this author in PubMed Google Scholar
Folkert W. Asselbergs
View author publications
You can also search for this author in PubMed Google Scholar
Johan L. M. Björkegren
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

CVgenes@target consortium

Martin Dichgans
, Tobias Dreker
, Mira Graettinger
, Philip Gribbon
, Thorsten Kessler
, Rainer Malik
, Matthias Prestel
, Barbara Stiller
& Christine Schofield

Contributions

H.L. and J.L.M.B. conceived and designed the study. H.L., I.B., T.M., V.T., B.V., and T.R.W. conducted the majority of the data analyses. T.K., J. Eichner, L.Z., C.W., O.F., A.G., S.W.V.D.L., C.B., S.H., H.T. and H.F.A. assisted with the data analyses. T.K. and A.R. provided resources. CVgenes@Target Consortium members provided advice on analysis. G.P., H.W., N.J.S., T.W., J. Erdmann, H.S., F.W.A. and J.L.M.B. supervised the study. H.L., I.B., T.M., V.T., B.V., T.R.W. and J.L.M.B. wrote the first draft of the manuscript. H.L., I.B., T.M., V.T., B.V., T.R.W., G.P., H.W., N.J.S., T.W., J. Erdmann, H.S., F.W.A. and J.L.M.B. reviewed and edited the manuscript.

Corresponding authors

Correspondence to Harri Lempiäinen or Johan L. M. Björkegren.

Ethics declarations

Competing Interests

Harri Lempiäinen is an employee and shareholder of Genedata AG. Johannes Eichner, Claudia Biegert and Timo Wittenberger are employees of Genedata AG. Dr. Bjorkegren and Dr. Michoel are consultants and shareholders in Clinical Gene Networks AB (CGN). Dr. Franzen is part-time employee of CGN. Dr. Björkegren is also chairman of the board of directors in CGN. TRW and NJS are funded by the British Heart Foundation and SEH and NJS by the UK National Institute for Health Research.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary material

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Supplementary Table 4

Supplementary Table 5

Supplementary Table 6

Supplementary Table 7

Supplementary Table 8

Supplementary Table 9

Online Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lempiäinen, H., Brænne, I., Michoel, T. et al. Network analysis of coronary artery disease risk genes elucidates disease mechanisms and druggable targets. Sci Rep 8, 3434 (2018). https://doi.org/10.1038/s41598-018-20721-6

Download citation

Received: 08 August 2017
Accepted: 06 December 2017
Published: 21 February 2018
DOI: https://doi.org/10.1038/s41598-018-20721-6

This article is cited by

From ‘Omics to Multi-omics Technologies: the Discovery of Novel Causal Mediators
- Pedrum Mohammadi-Shemirani
- Tushar Sood
- Guillaume Paré
Current Atherosclerosis Reports (2023)
A role for artificial intelligence in molecular imaging of infection and inflammation
- Johannes Schwenck
- Manfred Kneilling
- Erik H. J. G. Aarntzen
European Journal of Hybrid Imaging (2022)
Systems biology in cardiovascular disease: a multiomics approach
- Abhishek Joshi
- Marieke Rienks
- Manuel Mayr
Nature Reviews Cardiology (2021)
The Interface of Therapeutics and Genomics in Cardiovascular Medicine
- E. F. Magavern
- J. C. Kaski
- M. Pirmohamed
Cardiovascular Drugs and Therapy (2021)
Prioritization of causal genes for coronary artery disease based on cumulative evidence from experimental and in silico studies
- Alexandra S. Shadrina
- Tatiana I. Shashkova
- Yurii S. Aulchenko
Scientific Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.