Network-based approach to prediction and population-based validation of in silico drug repurposing

Cheng, Feixiong; Desai, Rishi J.; Handy, Diane E.; Wang, Ruisheng; Schneeweiss, Sebastian; Barabási, Albert-László; Loscalzo, Joseph

doi:10.1038/s41467-018-05116-5

Download PDF

Article
Open access
Published: 12 July 2018

Network-based approach to prediction and population-based validation of in silico drug repurposing

Feixiong Cheng^1,2^na1,
Rishi J. Desai ORCID: orcid.org/0000-0003-0299-7273³^na1,
Diane E. Handy⁴,
Ruisheng Wang⁴,
Sebastian Schneeweiss³,
Albert-László Barabási^1,2,5,6 &
…
Joseph Loscalzo⁴

Nature Communications volume 9, Article number: 2691 (2018) Cite this article

26k Accesses
305 Citations
157 Altmetric
Metrics details

Subjects

Abstract

Here we identify hundreds of new drug-disease associations for over 900 FDA-approved drugs by quantifying the network proximity of disease genes and drug targets in the human (protein–protein) interactome. We select four network-predicted associations to test their causal relationship using large healthcare databases with over 220 million patients and state-of-the-art pharmacoepidemiologic analyses. Using propensity score matching, two of four network-based predictions are validated in patient-level data: carbamazepine is associated with an increased risk of coronary artery disease (CAD) [hazard ratio (HR) 1.56, 95% confidence interval (CI) 1.12–2.18], and hydroxychloroquine is associated with a decreased risk of CAD (HR 0.76, 95% CI 0.59–0.97). In vitro experiments show that hydroxychloroquine attenuates pro-inflammatory cytokine-mediated activation in human aortic endothelial cells, supporting mechanistically its potential beneficial effect in CAD. In summary, we demonstrate that a unique integration of protein-protein interaction network proximity and large-scale patient-level longitudinal data complemented by mechanistic in vitro studies can facilitate drug repurposing.

De novo generation of multi-target compounds using deep generative chemistry

Article Open access 06 May 2024

Causal machine learning for predicting treatment outcomes

Article 19 April 2024

Decrypting the molecular basis of cellular drug phenotypes by dose-resolved expression proteomics

Article Open access 07 May 2024

Introduction

Although investment in biomedical and pharmaceutical research and development has increased significantly over the past 20 years, the annual number of new treatments approved by the US Food and Drug Administration (FDA) has not significantly increased¹. Among the reasons for this shortcoming in contemporary drug development are a lack of well-established predictive pharmacokinetics/pharmacodynamics approaches, and concerning safety and tolerability profiles for new chemical entities from preclinical studies to clinical trials². In addition to these well recognized explanations, another important factor limiting more effective drug development may be continued adherence to the classical (one gene, one drug, one disease) hypothesis. Focusing on just single targets results in failure to anticipate off-target toxicity, unintended beneficial effects, or multiple target interactions leading to suboptimal efficacy^3,4. Without full knowledge of the broader network context of the molecular determinants of disease and drug targets in the protein–protein interaction network (human interactome), investigators cannot develop meaningful approaches for efficacious treatment of complex diseases⁵.

Novel approaches, such as network-based drug-disease proximity, that shed light on the relationship between drugs (drug targets) and diseases [molecular (protein) determinants in disease modules]^6,7,8 can serve as a useful tool for efficient screening of potentially new indications for approved drugs with well-established pharmacokinetics/pharmacodynamics, safety and tolerability profiles, or previously unidentified adverse events^9,10,11,12. However, in order to prioritize the repurposed candidates or suggest novel interventions based on drug-disease associations identified by network-based approaches, rigorous validation is mandatory. Since network-based drug repurposing focuses on drugs that are already approved and are used in clinical practice, such hypothesis testing is possible using large-scale patient-level data collected during routine healthcare. Such data are regularly used to generate actionable evidence regarding effectiveness, harm, use, and value of medications to supplement evidence generated in randomized controlled trials; these trials that lead to drug approval are typically limited in scope owing to a relatively modest study sample size, comparatively short follow-up time, and frequent underrepresentation of the most relevant populations¹³. The unique strengths of routine healthcare data that make them ideal for validating hypotheses generated by network-based predictions include their provision of large patient populations useful for detecting small differences, and the availability of a large number of patient factors recorded without any recall bias, including demographics, comorbid conditions, and medication use, that allow for high-dimensional covariate adjustment to minimize confounding^14,15,16.

In this study, we developed a systems pharmacology-based platform that quantifies the interplay between disease proteins and drug targets in the human protein–protein interactome with state-of-the-art pharmacoepidemiologic methods for hypothesis validation using longitudinal data with over 220 million patients. We followed this analysis with in vitro assays to test potential drug mechanisms. As proof of the utility of the overall approach, we focused on cardiovascular (CV) outcomes given their high prevalence in the population, as an exemplary set of diseases with which to identify associations between drugs used for non-cardiac indications and CV outcomes. We demonstrate that an integrated approach incorporating network proximity together with large-scale patient longitudinal data and in vitro experimental assays offers an effective platform by which to identify and validate novel associations that can be used to minimize unanticipated adverse drug effects and optimize drug repurposing. These results suggest that this integrative approach can be generalized to other drugs/disease combinations.

Results

An atlas of drug effects via network proximity

Our previous studies have demonstrated that disease gene products (proteins) are likely to cluster in the same network neighborhood or disease module within the human protein–protein interactome^6,17,18. Drug targets representing nodes within molecular networks are often intrinsically coupled in both therapeutic and adverse effects. We, therefore, proposed that for a drug with multiple targets to be on-target effective for a disease or to cause off-target adverse effects (Supplementary Fig. 1a), its target proteins should be within or in the immediate vicinity of the corresponding disease module in the human interactome^8,10. We chose CV diseases as a test case of this principle due to their prevalence in the population and their high morbidity and mortality. To examine drug effects on CV diseases, we used a network proximity measure that quantifies the relationship between CV-specific disease modules and drug targets in the human protein–protein interaction (PPI) network (Supplementary Fig. 1b). To improve the data quality of the human interactome, we used only five types of experimental data: (a) binary PPIs obtained using systematic, unbiased, high-throughput yeast-two-hybrid (Y2H) systems¹⁹; (b) kinase-substrate interactions from literature-derived low-throughput and high-throughput experiments; (c) binary PPIs from three-dimensional (3D) protein structures; (d) signaling networks from literature-derived low-throughput experiments; and (e) literature-curated PPIs identified by affinity purification followed by mass spectrometry (AP-MS), Y2H, and/or literature-derived low-throughput experiments in which every interaction is supported by multiple sources of experimental evidence (Methods section). The updated human interactome defined in this way includes 243,603 PPIs connecting 16,677 unique proteins (Supplementary Data 1). We also compiled 984 FDA-approved drugs by pooling the reported experimental drug-target binding affinity data: median effective concentration (EC₅₀), median inhibitory concentration (IC₅₀), inhibition constant/potency (K_i), or dissociation constant (K_d), each ≤10 micromolar (µM) as a cutoff. We first calculated a z-score $\left( {z = \frac{{d - \mu }}{\sigma }} \right)$ for quantifying the significance of the shortest path lengths d(s,t) between targets (t) of a drug (T) and proteins (s) associated with the CV module (S) where the closest distance between a drug and a disease d(S,T) is defined as:

$$d\left( {S,T} \right) =\frac{1}{\Vert {T} \Vert } {\mathop {\sum }\limits_{t \in T} {\min}_{s {\in} S}}d(s,t).$$

(1)

We constructed the reference distance distribution corresponding to the expected network topological distance between two randomly selected groups of proteins matched to size and degree (connectivity) as the original disease proteins and drug targets in the human interactome (cf. Methods). The z-score reduces the study bias (e.g., hub nodes or those nodes with high connectivity) in the shortest-path methods as described in our previous study¹⁰. In total, we computationally investigated 984 FDA-approved drugs [177 CV drugs and 807 non-CV drugs defined by first-level Anatomical Therapeutic Chemical (ATC) classification codes] and 23 types of CV outcomes (specific CV diseases) (Supplementary Table 1). Relying on 177 FDA-approved CV drugs and their known CV indications, we found that the area under the receiver operating characteristic curve (AUC) is over 70% using the network proximity measure (Supplementary Fig. 2), revealing high accuracy for identifying the well-known drug-disease relationships. In addition, we compared the network proximity measure, closest (z-score), against three other network distance-based measures between drug targets and the disease module¹⁰: (1) shortest, (2) kernel, and (3) centre. We found that the closest distance-based z-score outperformed all three alternative network distance measures (Supplementary Fig. 3). We, therefore, used the closest distance-based z-score in the follow-up studies. Figure 1 illustrates the high-confidence predicted drug-CV disease associations (z < −4.0) connecting 431 non-CV drugs to 22 specific CV disease modules. We next proposed that this atlas of the predicted associations between non-CV drugs and CV disorders offers a useful resource with which to prioritize new CV indications or highlight potential (unexpected) adverse cardiac events for various approved drugs.

Validating possible causal associations in patient data

We selected four target associations between non-CV drugs and CV diseases identified by the network proximity measure (closest) for hypothesis validation by analyzing over 220 million patients in healthcare databases (Fig. 2). Target associations were further selected using subject matter expertise based on a combination of factors: (i) strength of the network-based predicted associations (a higher network proximity score in Supplementary Data 2); (ii) novelty of the predicted associations through exclusion of known adverse CV events of non-CV drugs (Methods section); (iii) availability of sufficient patient data for meaningful evaluation (exclusion of infrequently used medications); (iv) availability of an appropriate comparator treatment that is used for the same underlying (non-CV) indication as the drug of interest and predicted to have no association with the intended CV diseases via network proximity analysis (defined reference groups or negative controls); and (v) the fidelity with which the predicted CV diseases were recorded in insurance claims databases. Applying these criteria resulted in four network-based predictions: (1) carbamazepine (z = −2.36) vs. levetiracetam (comparator control, z = −0.07), drugs normally used to treat epilepsy, with CAD; (2) mesalamine (z = −6.10) vs. azathioprine (comparator control, z = −0.09), drugs normally used to treat inflammatory bowel disease, with CAD; (3) hydroxychloroquine (z = −3.85) vs. leflunomide (comparator control, z = −1.87), drugs normally used to treat rheumatoid arthritis, with CAD; and (4) lithium (z = −5.97) vs. lamotrigine (comparator control, z = 0.19), drugs normally used to treat bipolar disorder, with stroke.

Using two large US-based commercial health insurance claims databases connected with the validated Aetion evidence platform²⁰, we next conducted four cohort studies to evaluate the predicted associations based on individual level longitudinal patient data and pharmacoepidemiologic methods, including a new-user active comparator design, propensity score (PS) adjustment for confounding, and multiple sensitivity analyses²¹. Figure 2 summarizes the total patients included in the four cohorts along with specific reasons for exclusion in the Truven MarketScan and Optum Clinformatics databases. Overall, based on more than 50 covariates included in the PS, we included: (1) 76,045 carbamazepine initiators matched 1:1 to 76,045 levetiracetam initiators; (2) 27,305 mesalamine initiators matched 1:1 to 27,305 azathioprine initiators; (3) 37,795 hydroxychloroquine initiators matched 1:1 to 37,795 leflunomide initiators; and (4) 141,294 lithium initiators matched 1:1 to 141,294 lamotrigine initiators. Supplementary Tables S2–S5 demonstrate the balance achieved in patient characteristics and outcome risk factors between the two treatment groups compared via 1:1 PS-matching. Table 1 shows the total number of person-years of follow-up, total event counts (incident diseases) in the patient groups, and incidence rates per 1000 person-years for the diseases of interest (95% confidence interval [CI]) for each of the four comparisons of interest stratified by data source.

Table 1 Summary of sample sizes, follow-up time, events, and incidence rates in pharmacoepidemiologic investigations

Full size table

Figure 3 summarizes the results after pooling the two patient databases for each of the four comparisons before and after PS-matching. In the primary analytical approach of censoring patient follow-up time at discontinuation of the initial treatment (“as-treated” approach), we observed that carbamazepine was associated with a 56% increased risk [hazard ratio (HR) 1.56, 95% confidence interval (CI) 1.12–2.18] of CAD compared with levetiracetam (Fig. 3a), and hydroxychloroquine (Fig. 3d) was associated with a 24% reduced risk of CAD compared to leflunomide (HR 0.76, 95% CI 0.59–0.97). Varying the follow-up assumptions used in the following ways—(1) excluding the first 60 days of follow-up to reduce residual baseline confounding, (2) truncating the follow-up to 1-year to minimize time-varying confounding, and (3) continuing the follow-up for 1-year regardless of treatment discontinuation under an intent-to-treat (ITT) principle–resulted in estimates that were consistent with the primary approach for both the carbamazepine (Fig. 3a) and hydroxychloroquine analyses (Fig. 3d). Mesalamine vs. azathioprine (HR 1.15, 95% CI 0.55–2.42) and lithium vs. lamotrigine (HR 0.71, 95% CI 0.31–1.60) were not consistently associated differentially with the risk of CAD or stroke (Fig. 3b, c and Supplementary Figs. 4 and 5). Therefore, two of the four predicted associations were validated by the large-scale patient data to either decrease the risk of CAD (hydroxychloroquine) or increase the risk of CAD (carbamazepine), supporting our network-based prediction.

In vitro assay of hydroxychloroquine’s mechanism-of-action

Figure 3d reveals that hydroxychloroquine is associated with a 24% reduced risk of CAD compared to leflunomide (HR 0.76, 95% CI 0.59–0.97). [These very robust data are in agreement with a recent study showing that hydroxychloroquine decreases the incidence of CV events in a small cohort of rheumatoid arthritis patients²².] Hydroxychloroquine has been approved for the treatment of malaria and rheumatoid arthritis for many years; however, only recently have studies provided relevant mechanistic insights. Hydroxychloroquine accumulates intracellularly in the endosomal/lysosomal compartment where its inhibitory effects on Toll-like receptors 7 and 9 (TLR7 and TLR9) suppress inflammatory responses²³. We integrated drug targets and disease proteins into the blood vessel-specific protein–protein interaction network (cf. Methods) to identify the overlapping pathways between hydroxychloroquine targets and CAD proteins (Fig. 4a). Two potential pathways were inferred to be involved in the protective effect of hydroxychloroquine in CAD: (a) hydroxychloroquine may activate ERK5 (encoded by MAPK7) to prevent endothelial inflammation via inhibition of cell adhesion molecule expression²⁴; and (b) hydroxychloroquine may inhibit endosomal activation of NADPH oxidase in response to pro-inflammatory agonists (TNF-α and IL-1β) and may decrease production of pro-inflammatory cytokines in stimulated immune cells²⁵. Notably, adhesion molecules (ICAM-1 and VCAM-1)²⁶ and pro-inflammatory cytokines²⁷ play essential roles in CAD. Furthermore, a recent meta-analysis has shown that elevated expression of TNF-α or IL-1β is significantly associated with increased risk of CAD²⁸. Thus, we sought to determine whether hydroxychloroquine has direct anti-inflammatory effects on endothelial cells via these pathways as a potential beneficial mechanism in CAD.

We pretreated human aortic endothelial cells with 10–50 µM hydroxychloroquine and monitored the expression of VCAM1 and IL1B genes in the presence and absence of the cytokine TNF-α. TNF-α (5 ng/ml) caused a robust increase in the expression of VCAM1 and IL1B, and this pro-inflammatory effect was significantly attenuated by all of the doses of hydroxychloroquine tested (Fig. 4b). Similarly, hydroxychloroquine decreased inflammatory responses to 10 and 20 ng/ml TNF-α, as demonstrated by its attenuation of TNF-α-mediated VCAM-1 and IL-1β protein upregulation (Fig. 4c).

Patients with rheumatoid arthritis are reported to have increased endothelial dysfunction²⁹ that correlates with cardiovascular disease risk³⁰. Therefore, we next tested whether hydroxychloroquine altered TNF-α-induced suppression of NOS3 expression³¹, a known marker of endothelial (dys) function. NOS3 encodes the endothelial nitric oxide synthase enzyme, which, via its synthesis of nitric oxide, regulates vascular tone, impairs platelet activation, and impairs adhesion molecule expression contributing to an anti-inflammatory (and anti-atherogenic) phenotype. TNF-α significantly suppressed NOS3 expression, and 50 µM hydroxychloroquine significantly attenuated (reversed) this suppression (Fig. 4d). Taken together, network proximity analysis of the human interactome not only identified a novel protective effect of hydroxychloroquine in CAD, but also offered testable hypotheses by which to elucidate the molecular mechanism(s) of its protective effect.

Although there may be additional pathways for the beneficial actions of hydroxychloroquine that are outside of the blood-vessel-specific protein–protein interaction network, these experimental findings suggest that hydroxychloroquine has a protective, anti-inflammatory effect on endothelial cells, consistent with its potential beneficial effect in CAD.

Discussion

We have demonstrated that an integrated, mechanism-based human protein–protein interactome strategy can successfully uncover novel drug-disease indications, undesirable side effects, and potential mechanisms for these actions of approved drugs, addressing a crucial issue in drug development and patient care. We showed that our network framework yielded over 70% accuracy for identification of well-known drug indications (Supplementary Fig. 2). Specifically, our network-prediction and pharmacoepidemiological analysis reveal that carbamazepine is associated with an increased risk of CAD compared with levetiracetam, which we are able to validate robustly in large-scale patient data (HR 1.56, 95% CI 1.12–2.18, Fig. 3a). Carbamazepine is a first-line widely used anticonvulsant for the treatment of epilepsy and pain associated with trigeminal neuralgia, and works via inhibition of sodium channel protein type 5 subunit alpha (SCN5A)³² and ATP-sensitive potassium (KATP) channels³³. Previous clinical studies have suggested that carbamazepine aggravates high-grade heart block³⁴ and is associated with various cardiovascular risk factors³⁵, consistent with our observations. Moreover, recent genetic studies have shown that mutations in SCN5A and KATP channel genes are associated with structural heart disease³⁶ and adverse cardiac events^37,38. Thus, it is mechanistically feasible that inhibition of SCN5A and KATP channel activities by carbamazepine may be associated with the increased risk of CV diseases. Further studies will be needed to provide experimental and clinical validation of this conclusion.

Pharmacoepidemiologic analyses from the two patient databases revealed an inconsistent association of lithium vs. lamotrigine on the risk of stroke: a null result (HR 1.02, 95% CI 0.79–1.33, Supplementary Fig. 4) in the Truven MarketScan database and a 52% reduced risk of stroke (HR 0.48, 95% CI 0.25–0.93, Supplementary Fig. 5) in the Optum Clinformatics database. Recent studies have shown the potentially protective effect of lithium in stroke³⁹. Thus, to explore the effect of lithium in stroke further, we examined the potential molecular mechanism of lithium in the CV system via network analysis and in vitro assays of lithium exposure in cultured human aortic endothelial cells (Supplementary Fig. 6). We found a subnetwork of stroke genes and genes up- or down-regulated by lithium (Supplementary Fig. 6a) that map to pathways involved in the production of nitric oxide, which not only has anti-thrombotic effects but also vascular and neural protective effects in the central nervous system; however, our subsequent analysis in human aortic endothelial cells suggested that lithium may attenuate activation of these protective pathways (Supplementary Figs. 6b–e). In vitro assay results are consistent with a recent study that maternal use of high-dose lithium during the first trimester is associated with an increased risk of cardiac malformation in the foetus⁴⁰. Thus, our findings suggest that larger clinical trials and additional mechanistic studies may be necessary to clarify lithium’s action in stroke prevention in a broad population or a well-defined sub-population.

Although patients with rheumatoid arthritis on hydroxychloroquine had a lower risk of CAD than rheumatoid arthritis patients treated with leflunomide, the ability of hydroxychloroquine to improve outcomes in patients with other underlying risk factors for CAD is unclear. Nonetheless, several CVD and CAD proteins are found within the hydroxychloroquine subnetwork (Fig. 4a). Furthermore, hydroxychloroquine has anti-inflammatory properties^41,42, and inflammation is a known major contributor to CAD⁴³. In a mouse model of atherosclerosis, hydroxychloroquine was found to have anti-atherogenic and vasculoprotective effects⁴⁴, suggesting its utility in preventing vascular remodeling. Herein, our in vitro assays reveal that hydroxychloroquine attenuates the pro-inflammatory cytokine-mediated activation of human aortic endothelial cells (Fig. 4b–d) by reducing the expression of adhesion molecules, decreasing the production of cytokines, and attenuating the suppression of endothelial nitric oxide synthase. Although additional mechanistic studies are necessary to confirm the beneficial effects of hydroxychloroquine on endothelial function in the context of CAD, the anti-inflammatory properties of hydroxychloroquine on other cell types is well known. In rheumatoid arthritis and systemic lupus erythematosus, hydroxychloroquine has been suggested to mediate its anti-inflammatory action by inhibiting the activation of TLR7 and TLR9 that reside in endosomal/lysosomal compartments²³. Recent evidence suggests that internalization of TNF-α receptors and other plasma membrane receptors to endosomal compartments may be a necessary step in the activation of certain ligand-induced signaling pathways⁴⁵. Thus, hydroxychloroquine, which accumulates in endosomes, may interfere with the inflammatory actions of multiple types of membrane receptors.

In support of an effect of hydroxychloroquine on endosomal signaling, assembly of NADPH oxidase 2 complexes in the endosome in response to pro-inflammatory stimuli was attenuated by hydroxychloroquine to reduce superoxide generation in monocytes⁴⁶. Additionally, in a monocytic cell line, hydroxychloroquine attenuated TNF-α and IL6 expression in response to IL-1β and TNF-α stimulation, respectively⁴⁶. Interestingly, a treatment trial (the OXI trial) has recently been initiated to assess the efficacy of hydroxychloroquine in preventing recurrent CV events in patients with myocardial infarction⁴⁷ owing to its anti-inflammatory effects as well as its additional biological actions⁴⁸. The results of this ongoing trial may provide further insights into the cardioprotective actions of hydroxychloroquine in a subset of non-rheumatoid arthritis patients.

Our pharmacoepidemiologic method, relying on very large patient-level longitudinal data, has several advantages. First, we used two large population-based cohorts to validate the hypothesized associations, which allowed for statistically robust testing of small effect sizes in relatively small treatment subpopulations. Pharmacy dispensing data from insurance claims were used to define exposure to medications. This approach is generally considered to be more accurate than self-reported drug use or medical records⁴⁹. We also applied a large number of covariates to account for confounding in our studies using the recommended approach of PS-matching for improving inference from large healthcare databases, which are increasingly recognized by regulators and payers as a vital source of information through which to understand the safety and effectiveness of medications used in routine care⁵⁰. We conducted multiple sensitivity analyses to rule out chance findings and attempted to replicate our analyses in a second large database. However, there remain certain limitations of this approach. Insurance claims data are primarily collected for administrative purposes and do not contain detailed clinical information; therefore, residual confounding is possible despite high-dimensional covariate adjustment. We defined outcomes purely by using a claims-based definition; although, we used validated and specific codes, endpoints could not be adjudicated. Finally, our databases did not contain information on patient ethnicity, which is also a limitation. Replication of the associations identified in this study using databases that contain information on ethnicity is recommended in future studies to rule out treatment effect heterogeneity by ethnicity.

In summary, we demonstrated that an integration of molecular network-based approaches and state-of-the-art pharmacoepidemiologic methods can facilitate rational strategies for drug repurposing and the detection of side effects. Specifically, we observed that hydroxychloroquine was associated with 24% reduced risk of CAD compared with leflunomide using large-scale patient data, effects that are supported by mechanistic in vitro data. In addition, carbamazepine was associated with a 56% increased risk of CAD compared with levetiracetam. We believe that the approach presented here, if broadly applied, would significantly catalyze innovation in drug discovery and development.

Methods

Building the human protein–protein interactome

To build the comprehensive human protein–protein interactome as currently available, we assembled 15 commonly used databases with multiple types of experimental evidence and the in-house systematic human protein–protein interactome: (1) binary PPIs tested by high-throughput yeast-two-hybrid (Y2H) systems in which we combined binary PPIs tested from two publicly available high-quality Y2H datasets^19,51 and one dataset available from our website: http://ccsb.dana-farber.org/interactome-data.html; (2) kinase-substrate interactions from literature-derived low-throughput and high-throughput experiments from KinomeNetworkX⁵², Human Protein Resource Database (HPRD)⁵³, PhosphoNetworks^54,55, PhosphositePlus⁵⁶, dbPTM 3.0⁵⁷, and Phospho.ELM⁵⁸; (3) carefully literature-curated PPIs identified by affinity purification followed by mass spectrometry (AP-MS), and from literature-derived low-throughput experiments from BioGRID⁵⁹, PINA⁶⁰, HPRD⁵³, MINT⁶¹, IntAct⁶², and InnateDB⁶³; (4) high-quality PPIs from three-dimensional (3D) protein structures reported in Instruct⁶⁴; and (5) signaling networks from literature-derived low-throughput experiments as annotated in SignaLink2.0⁶⁵. The genes were mapped to their Entrez ID based on the National Center for Biotechnology Information (NCBI) database⁶⁶ as well as their official gene symbols based on GeneCards (http://www.genecards.org/). Inferred data, such as evolutionary analysis, gene expression data, and metabolic associations, were excluded. The updated human interactome constructed in this way includes 243,603 protein–protein interactions (PPIs) (edges or links) connecting 16,677 unique proteins (nodes) (Supplementary Data 1), representing over 40% greater size compared to our previously utilized human interactome⁶.

Collection of human cardiovascular disease genes

We began with ~50 types of CV events defined by Medical Subject Headings (MeSH) and Unified Medical Language System (UMLS) vocabularies⁶⁷. For each CV event, we collected disease-associated genes from 8 commonly used data sources: The OMIM database (Online Mendelian Inheritance in Man)⁶⁸, The Comparative Toxicogenomics Database⁶⁹, HuGE Navigator⁷⁰, DisGeNET⁷¹, ClinVar⁷², GWAS Catalog⁷³, GWASdb⁷⁴, and PheWAS Catalog (phewas.mc.vanderbilt.edu)⁷⁵. We annotated all protein-coding genes using gene Entrez ID, chromosomal location, and the official gene symbols from the NCBI database⁶⁶. Here we selected CV events with at least 10 disease-associated genes in the human interactome, resulting in 23 types of CV events (Supplementary Table 1).

Construction of drug-target network

We assembled the physical drug-target interactions on FDA-approved drugs from 6 commonly used data sources, and defined a physical drug-target interaction using reported binding affinity data: inhibition constant/potency (K_i), dissociation constant (K_d), median effective concentration (EC₅₀), or median inhibitory concentration (IC₅₀) ≤10 µM. Drug-target interactions were acquired from the DrugBank database (v4.3)⁷⁶, the Therapeutic Target Database (TTD, v4.3.02)⁷⁷, and the PharmGKB database (30 December 2015)⁷⁸. Specifically, bioactivity data of drug-target pairs were collected from three commonly used databases: ChEMBL (v20)⁷⁹, BindingDB (downloaded in December 2015)⁸⁰, and IUPHAR/BPS Guide to PHARMACOLOGY (downloaded in December 2015)⁸¹. After extracting the bioactivity data related to the drugs from the prepared bioactivity databases, only those items meeting the following four criteria were retained: (i) binding affinities, including K_i, K_d, IC₅₀, or EC₅₀ ≤10 μM; (ii) proteins can be represented by unique UniProt accession number; (iii) proteins are marked as reviewed in the UniProt database⁸²; and (iv) proteins are from Homo sapiens.

Description of network proximity

Given S, the set of disease proteins, T, the set of drug targets, and d(S,T), the closest distance measured by the average shortest path length between nodes s and the nearest disease protein t in the human protein–protein interactome is defined as: $d\left( {S,T} \right) = \frac{1}{\Vert {T} \Vert }\mathop {\sum }\limits_{t \in T} \min_{s \in S}d(s,t)$. To evaluate the significance of the network distance between a drug and a given disease, we constructed a reference distance distribution corresponding to the expected distance between two randomly selected groups of proteins of the same size and degree distribution as the original disease proteins and drug targets in the network. This procedure was repeated 1000 times. The mean $\bar d$ and s.d. (σ_d) of the reference distribution were used to calculate a z-score (z_d) by converting an observed (non-Euclidean) distance to a normalized distance.

Pharmacoepidemiologic methodology

We conducted observational cohort studies using two large US-based health insurance claims databases: (1) Truven MarketScan (2003–2014), and (2) Optum Clinformatics (2004–2013). These data sources contain comprehensive longitudinal information on patient demographics, coded in-patient and out-patient diagnoses and procedures, and outpatient prescription dispensing for their enrollees. Use of the de-identified database was approved by the Institutional Review Board of Brigham and Women’s Hospital, Boston, MA.

We identified patients 18 years or older who initiated treatment with the drug of interest after 180 days of continuous enrollment⁸³. The date on which this new prescription was filled was defined as the index date. We further applied study-specific exclusion criteria (summarized in Fig. 2) in the 180-day pre-index period to include homogeneous groups of patients in each comparison and focused on incident events. The follow-up began on the day after the index date. For the primary analysis, we used an as-treated follow-up approach in which the follow-up was stopped if patients either filled a prescription for a drug in the other exposure group or discontinued the index exposure. Discontinuation was defined as no record of a subsequent prescription of the index medication for 60 days after accounting for the days’ supply of exposure provided by the most recent prescription. We varied the follow-up approach to evaluate the robustness of our results in three sensitivity analyses. First, we did not attribute the outcome occurring in the first 60-days post-index to the index treatment to avoid the possibility of unmeasured baseline confounding. Second, we truncated the follow-up to a maximum of 365-days to limit the potential for time-varying confounding. Finally, we conducted an intention-to-treat (ITT) equivalent analysis in which patients were followed in their index exposure group regardless of treatment change or discontinuation for up to 365 days. In all of the approaches, the follow-up was truncated at the first outcome occurrence, health insurance disenrollment, death, or the most recent date of data availability.

The outcome of CAD was identified as a composite endpoint of hospitalization for myocardial infarction as the primary discharge diagnosis or a coronary revascularization procedure. The ICD-9 codes and CPT codes used to identify these outcomes have been found to have >90% positive predictive value (PPV) in administrative claims databases^84,85. The outcome of stroke was identified using hospitalization claims where ischemic stroke or transient ischemic attack was recorded as the primary discharge diagnosis. The ICD-9 codes used to identify this outcomes have been found to have 96% positive predictive value (PPV) in administrative claims databases⁸⁶.

We identified the large number of covariates, which were measured in the 180-day baseline period preceding each patient’s index date, in each of the four studies to account for confounding. These variables were specifically selected to address clinical scenarios evaluated in each study. For example, in the study of inflammatory bowel disease (IBD) treatments (mesalamine vs. azathioprine), we measured and accounted for IBD severity-related variables, such as diagnosis for active fistula formation or internal penetrating disease, obstructing or stricturing disease, and intra-abdominal surgical procedures. Additionally, patient demographics (age and gender), risk factors for cardiovascular diseases (e.g., hypertension, hyperlipidemia, diabetes, cardiovascular medication use), and markers of contact with the healthcare system (e.g., number of emergency department visits, number of distinct prescription medications used) were measured in all four studies. Please refer to Supplementary Tables 2–5 for a full list of covariates included in each study.

We used propensity score (PS) methods to account for potential confounding⁸⁷. PSs were defined as the predicted probability of receiving the treatment of interest (vs. the comparator) conditional upon patients’ covariate constellations and were calculated using multivariable logistic regression models, including the covariates described above as independent variables. Initiators of each exposure of interest were matched to initiators of the reference exposure based on their PS in 1:1 ratio using a nearest-neighbor algorithm within a caliper of 0.05 on the probability scale⁸⁸. Cox-proportional hazards models were used to estimate the adjusted hazard ratios (HR) between the treatment of interest and the risk of outcome before and after PS-matching. All analyses were conducted separately in the two data sources to avoid any potential effect of differential measurement of study variables across the data sources on the comparative estimates. The results were presented after pooling estimates from the two databases using the DerSimonian and Laird random effects model with inverse variance weights⁸⁹. To address the possibility of population-overlap, we corrected the variance of our pooled hazard ratios assuming 20% overlap between the two databases as follows:

$ \widehat {\sigma ^2}_{{\rm corrected}} = \mathop {\sum }\limits_{i = 1}^2 w_i^2\widehat {\sigma ^2}_i + w_1w_2\frac{{n_1\widehat {\sigma ^2}_1 + n_2\widehat {\sigma ^2}_2}}{{n_1 + n_2}}p_{{\rm overlap}},$

$ \widehat {\sigma ^2}_{{\mathrm {corrected}}} = {\mathrm {corrected}}\,{\mathrm {variance}},$

$ w_i = {\mathrm {inverse}}\,{\mathrm {variance}}\,{\mathrm {weight}}\,{\mathrm {for}}\,{\mathrm {database}}\,i,$

$ \widehat {\sigma ^2}_i = {\mathrm {variance}}\,{\mathrm {of}}\,{\mathrm {the}}\,{\mathrm {estimate}}\,{\mathrm {from}}\,{\mathrm {database}}\,i,$

$ n_i = {\mathrm {sample}}\,{\mathrm {size}}\,{\mathrm {of}}\,{\mathrm {the}}\,{\mathrm {study}}\,{\mathrm {in}}\,{\mathrm {database}}\,i,$

$ p_{{\mathrm {overlap}}} = 0.2.$

All statistical analyses were conducted on the Aetion Platform version 2.1.2 using R (version 3.1.2), which has been validated against the FDA Sentinel system and randomized control trials²⁰.

Tissue-specific subnetwork analysis

We downloaded the RNA-seq data (RPKM value) of 32 tissues from GTEx V6 release (accessed on 01 April 2016, https://gtexportal.org/home/). For each tissue (e.g., blood vessel), we regarded those genes with RPKM ≥1 in >80% of samples as tissue-expressed genes and the remaining genes as tissue-unexpressed. To quantify the expression significance of tissue-expressed gene i in tissue t, we calculated the average expression 〈E(i)〉 and the standard deviation $\delta _E(i)$ of a gene’s expression across all considered tissues⁹⁰. The significance of gene expression in tissue t is defined as $z_E\left( {i,t} \right) = \left( {E\left( {i,t} \right) - \langle E\left( i \right)\rangle} \right) {\mathrm{/}} \delta _E(i)$. For stroke and CAD, we built a blood vessel-specific protein–protein interaction network by comparing genome-wide expression profiles of blood vessels to 31 other different tissues from GTEx.

In in vitro assays, human aortic endothelial cells (Lonza) were passaged in EGM-2 (Lonza) with the addition of hydroxychloroquine (Fig. 4) or lithium chloride (Supplementary Fig. 6) at the doses and times indicated. To assess VEGF-mediated activation of Akt/GSK/eNOS signaling, cells were cultured 24 h in EBM-2 with 0.1% fetal bovine serum in the presence of absence of lithium chloride prior to the addition of VEGF at 50 ng/ml for the times indicated (Supplementary Fig. 6). One hour after cells were exposed to hydroxychloroquine (10–50 μM), TNF-α (5–20 ng/ml) was added to the media. Cells were collected 24 h following TNF-α addition for RNA or protein analysis (Fig. 4).

RNA was collected from cells with the RNeasy kit (Qiagen) using the optional DNase I digestion. cDNA was synthesized from 0.5 μg of RNA using oligo dT primers and the Advantage RT-for-PCR kit (Clontech). Relative RNA levels were measured by quantitative RT-PCR method using the ΔΔC_t method of analysis. β-Actin was used as the endogenous control. The following TaqMan probes (Thermo Fisher) were used for gene expression analysis: VCAM1, Hs00365485_m1; IL1B, Hs01555410_m1; NOS3, Hs01574659_m1 and ACTB, Hs99999903_m1.

Radioimmunoprecipitation assay (RIPA) lysis buffer was supplemented with protease and phosphatase inhibitors (Calbiochem) and used to collect cell extracts. Cell lysates were separated on 4–15% polyacrylamide gradient gels (Biorad), and transferred to polyvinylidene fluoride (PVDF) membranes. Antibodies were obtained from Cell Signaling. VCAM-1 was detected by western blotting using an sc-8304 antibody (Santa Cruz) at a 1:4000 dilution; IL-1β actin was detected using a 1:1000 dilution of antibody #12703 (Cell Signaling); and actin was detected using a 1:4000 dilution of antibody #4970 (Cell Signaling). A secondary anti-rabbit-HRP antibody (Cell Signaling, #7074) was used at 1:2000 together with the ECL western blotting detection reagents from GE Healthcare. Blots were exposed to X-ray film, and the Biorad ChemiDoc Touch Imaging system was used to generate images. For western blot experiments designed to analyze the effects of hydroxychloroquine, each condition was tested in 6 independent experiments. Uncropped scans of the blots used in Fig. 4c are included in Supplementary Fig. 7.

Code availability

The toolbox package for the network proximity calculation can be downloaded at github.com/emreg00/toolbox.

Data availability

The human publicly available protein–protein interactome used in this study is freely available as a supplement to this manuscript (Supplementary Data 1). The unpublished binary human protein–protein interactions can be accessed at http://ccsb.dana-farber.org/interactome-data.html. The global predicted z-scores for 984 FDA-approved drugs and 23 types of cardiovascular events (diseases) via the network proximity approach are freely available in Supplementary Data 2. All other relevant data are available from the authors.

References

Mullard, A. 2016 FDA drug approvals. Nat. Rev. Drug Discov. 16, 73–76 (2017).
Article PubMed CAS Google Scholar
Shih, H. P., Zhang, X. & Aronov, A. M. Drug discovery effectiveness from the standpoint of therapeutic mechanisms and indications. Nat. Rev. Drug Discov. 17, 19–33 (2017).
Article PubMed CAS Google Scholar
Antman, E. M. & Loscalzo, J. Precision medicine in cardiology. Nat. Rev. Cardiol. 13, 591–602 (2016).
Article PubMed Google Scholar
MacRae, C. A., Roden, D. M. & Loscalzo, J. The future of cardiovascular therapeutics. Circulation 133, 2610–2617 (2016).
Article PubMed Google Scholar
Greene, J. A. & Loscalzo, J. Putting the patient back together - social medicine, network medicine, and the limits of reductionism. N. Engl. J. Med. 377, 2493–2499 (2017).
Article PubMed Google Scholar
Menche, J. et al. Disease networks. Uncovering disease-disease relationships through the incomplete interactome. Science 347, 1257601 (2015).
Article PubMed PubMed Central CAS Google Scholar
Yildirim, M. A., Goh, K. I., Cusick, M. E., Barabasi, A. L. & Vidal, M. Drug-target network. Nat. Biotechnol. 25, 1119–1126 (2007).
Article PubMed CAS Google Scholar
Wang, R. S. & Loscalzo, J. Illuminating drug action by network integration of disease genes: a case study of myocardial infarction. Mol. Biosyst. 12, 1653–1666 (2016).
Article PubMed PubMed Central CAS Google Scholar
Dudley, J. T. et al. Computational repositioning of the anticonvulsant topiramate for inflammatory bowel disease. Sci. Transl. Med. 3, 96ra76 (2011).
Article PubMed PubMed Central CAS Google Scholar
Guney, E., Menche, J., Vidal, M. & Barabasi, A. L. Network-based in silico drug efficacy screening. Nat. Commun. 7, 10331 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Zhao, S. et al. Systems pharmacology of adverse event mitigation by drug combinations. Sci. Transl. Med. 5, 206ra140 (2013).
Article PubMed PubMed Central CAS Google Scholar
Himmelstein, D. S. et al. Systematic integration of biomedical knowledge prioritizes drugs for repurposing. eLife 6, e26726 (2017).
Article PubMed PubMed Central Google Scholar
Schneeweiss, S. et al. Real world data in adaptive biomedical innovation: a framework for generating evidence fit for decision making. Clin. Pharmacol. Ther. 100, 633–646 (2016).
Article PubMed CAS Google Scholar
Schneeweiss, S. & Avorn, J. A review of uses of health care utilization databases for epidemiologic research on therapeutics. J. Clin. Epidemiol. 58, 323–337 (2005).
Article PubMed Google Scholar
Schneeweiss, S. et al. High-dimensional propensity score adjustment in studies of treatment effects using health care claims data. Epidemiology 20, 512 (2009).
Article PubMed PubMed Central Google Scholar
Brilliant, M. H. et al. Mining retrospective data for virtual prospective drug repurposing: L-DOPA and age-related macular degeneration. Am. J. Med. 129, 292–298 (2016).
Article PubMed CAS Google Scholar
Goh, K. I. et al. The human disease network. Proc. Natl Acad. Sci. USA 104, 8685–8690 (2007).
Article ADS PubMed CAS Google Scholar
Ghiassian, S. D., Menche, J. & Barabasi, A. L. A DIseAse MOdule Detection (DIAMOnD) algorithm derived from a systematic analysis of connectivity patterns of disease proteins in the human interactome. PLoS Comput. Biol. 11, e1004120 (2015).
Article ADS PubMed PubMed Central CAS Google Scholar
Rolland, T. et al. A proteome-scale map of the human interactome network. Cell 159, 1212–1226 (2014).
Article PubMed PubMed Central CAS Google Scholar
Wang, S. V. et al. Transparency and reproducibility of observational cohort studies using large healthcare databases. Clin. Pharmacol. Ther. 99, 325–332 (2016).
Article PubMed CAS Google Scholar
Schneeweiss, S. A basic study design for expedited safety signal evaluation based on electronic healthcare data. Pharmacoepidemiol. Drug. Saf. 19, 858–868 (2010).
Article PubMed PubMed Central Google Scholar
Sharma, T. S. et al. Hydroxychloroquine use is associated with decreased incident cardiovascular events in rheumatoid arthritis patients. J. Am. Heart Assoc. 5, e002867 (2016).
Article PubMed PubMed Central Google Scholar
Lamphier, M. et al. Novel small molecule inhibitors of TLR7 and TLR9: mechanism of action and efficacy in vivo. Mol. Pharmacol. 85, 429–440 (2014).
Article PubMed CAS Google Scholar
Le, N. T. et al. Identification of activators of ERK5 transcriptional activity by high-throughput screening and the role of endothelial ERK5 in vasoprotective effects induced by statins and antimalarial agents. J. Immunol. 193, 3803–3815 (2014).
Article PubMed PubMed Central CAS Google Scholar
Muller-Calleja, N., Manukyan, D., Canisius, A., Strand, D. & Lackner, K. J. Hydroxychloroquine inhibits proinflammatory signalling pathways by targeting endosomal NADPH oxidase. Ann. Rheum. Dis. 76, 891–897 (2017).
Article PubMed CAS Google Scholar
Hwang, S. J. et al. Circulating adhesion molecules VCAM-1, ICAM-1, and E-selectin in carotid atherosclerosis and incident coronary heart disease cases: the Atherosclerosis Risk In Communities (ARIC) study. Circulation 96, 4219–4225 (1997).
Article PubMed CAS Google Scholar
Tousoulis, D., Oikonomou, E., Economou, E. K., Crea, F. & Kaski, J. C. Inflammatory cytokines in atherosclerosis: current therapeutic approaches. Eur. Heart J. 37, 1723–1732 (2016).
Article PubMed CAS Google Scholar
Kaptoge, S. et al. Inflammatory cytokines and risk of coronary heart disease: new prospective study and updated meta-analysis. Eur. Heart J. 35, 578–589 (2014).
Article PubMed CAS Google Scholar
Herbrig, K. et al. Endothelial dysfunction in patients with rheumatoid arthritis is associated with a reduced number and impaired function of endothelial progenitor cells. Ann. Rheum. Dis. 65, 157–163 (2006).
Article PubMed CAS Google Scholar
Sandoo, A., Kitas, G. D., Carroll, D. & Veldhuijzen van Zanten, J. J. The role of inflammation and cardiovascular disease risk on microvascular and macrovascular endothelial function in patients with rheumatoid arthritis: a cross-sectional and longitudinal study. Arthritis Res. Ther. 14, R117 (2012).
Article PubMed PubMed Central CAS Google Scholar
Anderson, H. D., Rahmutula, D. & Gardner, D. G. Tumor necrosis factor-alpha inhibits endothelial nitric-oxide synthase gene promoter activity in bovine aortic endothelial cells. J. Biol. Chem. 279, 963–969 (2004).
Article PubMed CAS Google Scholar
Jaramillo, N. M. et al. Pharmacogenetic potential biomarkers for carbamazepine adverse drug reactions and clinical response. Drug Metabol. Drug Interact. 29, 67–79 (2014).
Article PubMed CAS Google Scholar
Chen, P. C. et al. Carbamazepine as a novel small molecule corrector of trafficking-impaired ATP-sensitive potassium channels identified in congenital hyperinsulinism. J. Biol. Chem. 288, 20942–20954 (2013).
Article PubMed PubMed Central CAS Google Scholar
Beermann, B., Edhag, O. & Vallin, H. Advanced heart-block aggravated by carbamazepine. Br. Heart J. 37, 668–671 (1975).
Article PubMed PubMed Central CAS Google Scholar
Svalheim, S. et al. Cardiovascular risk factors in epilepsy patients taking levetiracetam, carbamazepine or lamotrigine. Acta Neurol. Scand. 122, 30–33 (2010).
Article Google Scholar
Saffitz, J. E. Structural heart disease, SCN5A gene mutations, and Brugada syndrome: a complex menage a trois. Circulation 112, 3672–3674 (2005).
Article PubMed Google Scholar
Yamagata, K. et al. Genotype-phenotype correlation of SCN5A mutation for the clinical and electrocardiographic characteristics of probands with Brugada syndrome: a Japanese multicenter registry. Circulation 135, 2255–2270 (2017).
Article PubMed CAS Google Scholar
Nichols, C. G., Singh, G. K. & Grange, D. K. KATP channels and cardiovascular disease: suddenly a syndrome. Circ. Res. 112, 1059–1072 (2013).
Article PubMed PubMed Central CAS Google Scholar
Lan, C. C. et al. A reduced risk of stroke with lithium exposure in bipolar disorder: a population-based retrospective cohort study. Bipolar Disord. 17, 705–714 (2015).
Article PubMed CAS Google Scholar
Patorno, E. et al. Lithium use in pregnancy and the risk of cardiac malformations. N. Engl. J. Med. 376, 2245–2254 (2017).
Article PubMed PubMed Central CAS Google Scholar
Rainsford, K. D., Parke, A. L., Clifford-Rashotte, M. & Kean, W. F. Therapy and pharmacological properties of hydroxychloroquine and chloroquine in treatment of systemic lupus erythematosus, rheumatoid arthritis and related diseases. Inflammopharmacology 23, 231–269 (2015).
Article PubMed CAS Google Scholar
Kuznik, A. et al. Mechanism of endosomal TLR inhibition by antimalarial drugs and imidazoquinolines. J. Immunol. 186, 4794–4804 (2011).
Article PubMed CAS Google Scholar
Hansson, G. K. Inflammation, atherosclerosis, and coronary artery disease. N. Engl. J. Med. 352, 1685–1695 (2005).
Article PubMed CAS Google Scholar
Shukla, A. M. et al. Impact of hydroxychloroquine on atherosclerosis and vascular stiffness in the presence of chronic kidney disease. PLoS ONE 10, e0139226 (2015).
Article PubMed PubMed Central CAS Google Scholar
Cendrowski, J., Maminska, A. & Miaczynska, M. Endocytic regulation of cytokine receptor signaling. Cytokine Growth Factor Rev. 32, 63–73 (2016).
Article PubMed CAS Google Scholar
Muller-Calleja, N., Manukyan, D., Canisius, A., Strand, D. & Lackner, K. J. Hydroxychloroquine inhibits proinflammatory signalling pathways by targeting endosomal NADPH oxidase. Ann. Rheum. Dis. 76, 891–897 (2016).
Article PubMed CAS Google Scholar
Hartman, O., Kovanen, P. T., Lehtonen, J., Eklund, K. K. & Sinisalo, J. Hydroxychloroquine for the prevention of recurrent cardiovascular events in myocardial infarction patients: rationale and design of the OXI trial. Eur. Heart J. Cardiovasc. Pharmacother. 3, 92–97 (2017).
PubMed Google Scholar
Olsen, N. J., Schleich, M. A. & Karp, D. R. Multifaceted effects of hydroxychloroquine in human disease. Semin. Arthritis Rheum. 43, 264–272 (2013).
Article PubMed CAS Google Scholar
West, S. L. et al. Completeness of prescription recording in outpatient medical records from a health maintenance organization. J. Clin. Epidemiol. 47, 165–171 (1994).
Article PubMed CAS Google Scholar
Sherman, R. E. et al. Real-world evidence - what is it and what can it tell us? N. Engl. J. Med. 375, 2293–2297 (2016).
Article PubMed Google Scholar
Rual, J. F. et al. Towards a proteome-scale map of the human protein-protein interaction network. Nature 437, 1173–1178 (2005).
Article ADS PubMed CAS Google Scholar
Cheng, F., Jia, P., Wang, Q. & Zhao, Z. Quantitative network mapping of the human kinome interactome reveals new clues for rational kinase inhibitor discovery and individualized cancer therapy. Oncotarget 5, 3697–3710 (2014).
PubMed PubMed Central Google Scholar
Peri, S. et al. Human protein reference database as a discovery resource for proteomics. Nucleic Acids Res. 32, D497–D501 (2004).
Article PubMed PubMed Central CAS Google Scholar
Newman, R. H. et al. Construction of human activity-based phosphorylation networks. Mol. Syst. Biol. 9, 655 (2013).
Article PubMed PubMed Central Google Scholar
Hu, J. et al. PhosphoNetworks: a database for human phosphorylation networks. Bioinformatics 30, 141–142 (2014).
Article PubMed CAS Google Scholar
Hornbeck, P. V. et al. PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res. 43, D512–D520 (2015).
Article PubMed CAS Google Scholar
Lu, C. T. et al. DbPTM 3.0: an informative resource for investigating substrate site specificity and functional association of protein post-translational modifications. Nucleic Acids Res. 41, D295–D305 (2013).
Article PubMed CAS Google Scholar
Dinkel, H. et al. Phospho.ELM: a database of phosphorylation sites–update 2011. Nucleic Acids Res. 39, D261–D267 (2011).
Article PubMed CAS Google Scholar
Chatr-Aryamontri, A. et al. The BioGRID interaction database: 2015 update. Nucleic Acids Res. 43, D470–D478 (2015).
Article PubMed CAS Google Scholar
Cowley, M. J. et al. PINA v2.0: mining interactome modules. Nucleic Acids Res. 40, D862–D865 (2012).
Article PubMed CAS Google Scholar
Licata, L. et al. MINT, the molecular interaction database: 2012 update. Nucleic Acids Res. 40, D857–D861 (2012).
Article PubMed CAS Google Scholar
Orchard, S. et al. The MIntAct project–IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res. 42, D358–D363 (2014).
Article PubMed CAS Google Scholar
Breuer, K. et al. InnateDB: systems biology of innate immunity and beyond–recent updates and continuing curation. Nucleic Acids Res. 41, D1228–D1233 (2013).
Article PubMed CAS Google Scholar
Meyer, M. J., Das, J., Wang, X. & Yu, H. INstruct: a database of high-quality 3D structurally resolved protein interactome networks. Bioinformatics 29, 1577–1579 (2013).
Article PubMed PubMed Central CAS Google Scholar
Fazekas, D. et al. SignaLink 2 - a signaling pathway resource with multi-layered regulatory networks. BMC Syst. Biol. 7, 7 (2013).
Article PubMed PubMed Central Google Scholar
Coordinators, N. R. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 44, D7–D19 (2016).
Article CAS Google Scholar
Bodenreider, O. The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res. 32, D267–D270 (2004).
Article PubMed PubMed Central CAS Google Scholar
Amberger, J. S., Bocchini, C. A., Schiettecatte, F., Scott, A. F. & Hamosh, A. OMIM.org: Online Mendelian Inheritance in Man (OMIM(R)), an online catalog of human genes and genetic disorders. Nucleic Acids Res. 43, D789–D798 (2015).
Article PubMed CAS Google Scholar
Davis, A. P. et al. The comparative toxicogenomics database’s 10th year anniversary: update 2015. Nucleic Acids Res. 43, D914–D920 (2015).
Article ADS PubMed CAS Google Scholar
Yu, W., Gwinn, M., Clyne, M., Yesupriya, A. & Khoury, M. J. A navigator for human genome epidemiology. Nat. Genet. 40, 124–125 (2008).
Article PubMed CAS Google Scholar
Pinero, J. et al. DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes. Database 2015, bav028 (2015).
Article PubMed PubMed Central CAS Google Scholar
Landrum, M. J. et al. ClinVar: public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 42, D980–D985 (2014).
Article PubMed CAS Google Scholar
Welter, D. et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 42, D1001–D1006 (2014).
Article PubMed CAS Google Scholar
Li, M. J. et al. GWASdb v2: an update database for human genetic variants identified by genome-wide association studies. Nucleic Acids Res. 44, D869–D876 (2016).
Article PubMed CAS Google Scholar
Denny, J. C. et al. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nat. Biotechnol. 31, 1102–1110 (2013).
Article PubMed PubMed Central CAS Google Scholar
Law, V. et al. DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res. 42, D1091–D1097 (2014).
Article PubMed CAS Google Scholar
Zhu, F. et al. Therapeutic target database update 2012: a resource for facilitating target-oriented drug discovery. Nucleic Acids Res. 40, D1128–D1136 (2012).
Article PubMed CAS Google Scholar
Hernandez-Boussard, T. et al. The pharmacogenetics and pharmacogenomics knowledge base: accentuating the knowledge. Nucleic Acids Res. 36, D913–D918 (2008).
Article PubMed CAS Google Scholar
Gaulton, A. et al. ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res. 40, D1100–D1107 (2012).
Article PubMed CAS Google Scholar
Liu, T. Q., Lin, Y. M., Wen, X., Jorissen, R. N. & Gilson, M. K. BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res. 35, D198–D201 (2007).
Article PubMed CAS Google Scholar
Pawson, A. J. et al. The IUPHAR/BPS Guide to PHARMACOLOGY: an expert-driven knowledgebase of drug targets and their ligands. Nucleic Acids Res. 42, D1098–D1106 (2014).
Article PubMed CAS Google Scholar
Apweiler, R. et al. UniProt: the Universal Protein knowledgebase. Nucleic Acids Res. 32, D115–D119 (2004).
Article PubMed PubMed Central CAS Google Scholar
Ray, W. A. Evaluating medication effects outside of clinical trials: new-user designs. Am. J. Epidemiol. 158, 915–920 (2003).
Article PubMed Google Scholar
Kiyota, Y. et al. Accuracy of Medicare claims-based diagnosis of acute myocardial infarction: estimating positive predictive value on the basis of review of hospital records. Am. Heart J. 148, 99–104 (2004).
Article PubMed Google Scholar
Hlatky, M. A. et al. Use of medicare data to identify coronary heart disease outcomes in the Women’s health initiative. Circ. Cardiovasc. Qual. Outcomes 7, 157–162 (2014).
Article PubMed PubMed Central Google Scholar
Birman-Deych, E. et al. Accuracy of ICD-9-CM codes for identifying cardiovascular and stroke risk factors. Med. Care 43, 480–485 (2005).
Article PubMed Google Scholar
Rosenbaum, P. R. & Rubin, D. B. The central role of the propensity score in observational studies for causal effects. Biometrika 70, 41–55 (1983).
Article MathSciNet MATH Google Scholar
Austin, P. C. Some Methods of Propensity‐score matching had superior performance to others: results of an empirical investigation and monte carlo simulations. Biomet. J. 51, 171–184 (2009).
Article MathSciNet Google Scholar
DerSimonian, R. & Laird, N. Meta-analysis in clinical trials. Control Clin. Trials 7, 177–188 (1986).
Article PubMed CAS Google Scholar
Kitsak, M. et al. Tissue specificity of human disease module. Sci. Rep. 6, 35241 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

This work was supported by the NIH grants P50-HG004233, U01-HG001715, and U01-HG007690 from NHGRI, P50-GM107618 from NIGMS and PO1-HL083069, R37-HL061795, RC2-HL101543, U01-HL108630, RC4-HL106373, and K99HL138272 from NHLBI, and ME-1303–5638 from PCORI.

Author information

These authors contributed equally: Feixiong Cheng, Rishi J. Desai

Authors and Affiliations

Center for Complex Networks Research and Department of Physics, Northeastern University, Boston, MA, 02115, USA
Feixiong Cheng & Albert-László Barabási
Center for Cancer Systems Biology and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
Feixiong Cheng & Albert-László Barabási
Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02115, USA
Rishi J. Desai & Sebastian Schneeweiss
Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02115, USA
Diane E. Handy, Ruisheng Wang & Joseph Loscalzo
Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02115, USA
Albert-László Barabási
Center for Network Science, Central European University, Budapest, 1051, Hungary
Albert-László Barabási

Authors

Feixiong Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Rishi J. Desai
View author publications
You can also search for this author in PubMed Google Scholar
Diane E. Handy
View author publications
You can also search for this author in PubMed Google Scholar
Ruisheng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Schneeweiss
View author publications
You can also search for this author in PubMed Google Scholar
Albert-László Barabási
View author publications
You can also search for this author in PubMed Google Scholar
Joseph Loscalzo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.L., A.-L.B., and S.S. conceived the study. F.C., R.J.D., D.E.H., performed all experiments and analysis. R.W. performed data analysis. F.C., R.J.D., D.E.H., S.S., A.-L.B., and J.L. wrote the manuscript.

Corresponding author

Correspondence to Joseph Loscalzo.

Ethics declarations

Competing interests

A.-L.B. and J.L. are co-founders of Scipher, a startup that uses network concepts to explore human disease. S.S. is consultant to Aetion, Inc., a software manufacturer in which he also owns equity. The remaining authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cheng, F., Desai, R.J., Handy, D.E. et al. Network-based approach to prediction and population-based validation of in silico drug repurposing. Nat Commun 9, 2691 (2018). https://doi.org/10.1038/s41467-018-05116-5

Download citation

Received: 22 February 2018
Accepted: 08 June 2018
Published: 12 July 2018
DOI: https://doi.org/10.1038/s41467-018-05116-5

This article is cited by

scDrugPrio: a framework for the analysis of single-cell transcriptomics to address multiple problems in precision medicine in immune-mediated inflammatory diseases
- Samuel Schäfer
- Martin Smelik
- Mikael Benson
Genome Medicine (2024)
Golden bile powder prevents drunkenness and alcohol-induced liver injury in mice via the gut microbiota and metabolic modulation
- Yarong Wang
- Zhenzhuang Zou
- Guozhen Cui
Chinese Medicine (2024)
Identification of Potentially Repurposable Drugs for Lewy Body Dementia Using a Network-Based Approach
- Megha Manoj
- Siddarth Sowmyanarayan
- Jhinuk Chatterjee
Journal of Molecular Neuroscience (2024)
Network-based drug repurposing identifies small molecule drugs as immune checkpoint inhibitors for endometrial cancer
- Faheem Ahmed
- Anupama Samantasinghar
- Kyung Hyun Choi
Molecular Diversity (2024)
Integrating animal experiments, mass spectrometry and network-based approach to reveal the sleep-improving effects of Ziziphi Spinosae Semen and γ-aminobutyric acid mixture
- Airong Ren
- Tingbiao Wu
- Guozhen Cui
Chinese Medicine (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.