Proteome-wide mendelian randomization identifies causal plasma proteins in venous thromboembolism development

Li, Haobo; Zhang, Zhu; Qiu, Yuting; Weng, Haoyi; Yuan, Shuai; Zhang, Yunxia; Zhang, Yu; Xi, Linfeng; Xu, Feiya; Ji, Xiaofan; Hao, Risheng; Yang, Peiran; Chen, Gang; Zuo, Xianbo; Zhai, Zhenguo; Wang, Chen

doi:10.1038/s10038-023-01186-6

Download PDF

Article
Open access
Published: 03 August 2023

Proteome-wide mendelian randomization identifies causal plasma proteins in venous thromboembolism development

Haobo Li ORCID: orcid.org/0000-0002-1882-5831^1,2,
Zhu Zhang¹,
Yuting Qiu³,
Haoyi Weng⁴,
Shuai Yuan⁵,
Yunxia Zhang¹,
Yu Zhang^1,3,
Linfeng Xi^1,3,
Feiya Xu^1,3,
Xiaofan Ji^1,2,
Risheng Hao^1,3,
Peiran Yang⁶,
Gang Chen⁴,
Xianbo Zuo⁷,
Zhenguo Zhai ORCID: orcid.org/0000-0002-7096-8792¹ &
…
Chen Wang¹

Journal of Human Genetics volume 68, pages 805–812 (2023)Cite this article

4291 Accesses
2 Altmetric
Metrics details

Subjects

Abstract

Genome-wide association studies (GWAS) have identified numerous risk loci for venous thromboembolism (VTE), but it is challenging to decipher the underlying mechanisms. We employed an integrative analytical pipeline to transform genetic associations to identify novel plasma proteins for VTE. Proteome-wide association studies (PWAS) were determined by functional summary-based imputation leveraging data from a genome-wide association analysis (14,429 VTE patients, 267,037 controls), blood proteomes (1348 cases), followed by Mendelian randomization, Bayesian colocalization, protein-protein interaction, and pathway enrichment analysis. Twenty genetically regulated circulating protein abundances (F2, F11, ABO, PLCG2, LRP4, PLEK, KLKB1, PROC, KNG1, THBS2, SERPINA1, RARRES2, CEL, GP6, SERPINE2, SERPINA10, OBP2B, EFEMP1, F5, and MSR1) were associated with VTE. Of these 13 proteins demonstrated Mendelian randomized correlations. Six proteins (F2, F11, PLEK, SERPINA1, RARRES2, and SERPINE2) had strong support in colocalization analysis. Utilizing multidimensional data, this study suggests PLEK, SERPINA1, and SERPINE2 as compelling proteins that may provide key hints for future research and possible diagnostic and therapeutic targets for VTE.

Genomic and drug target evaluation of 90 cardiovascular proteins in 30,931 individuals

Article 16 October 2020

Mapping biological influences on the human plasma proteome beyond the genome

Article Open access 26 September 2024

Genetic associations of protein-coding variants in venous thromboembolism

Article Open access 01 April 2024

Introduction

Venous thromboembolism (VTE), including deep vein thrombosis (DVT) and pulmonary thromboembolism (PTE), is the third most common life-threatening cardiovascular disease after myocardial infarction and stroke. The global incidence rate of VTE is estimated to range between 115 and 269 per 100,000 and mortality rates related to VTE is estimated to range between 9.4 and 32.3 per 100,000 [1,2,3,4]. VTE is a complex disease caused by a combination of genetic predisposing factors and acquired risk factors. Additionally, more than 60% of the variation in susceptibility to common thrombosis is attributable to genetic factors [5, 6].

Genome-wide association study (GWAS) is a research method based on linkage disequilibrium in a population and uses single-nucleotide polymorphisms (SNPs) as markers to search for genetic factors associated with complex diseases, which can reveal the genetic mechanisms related to the occurrence, development, and treatment of diseases [7] in a comprehensive manner. In recent years, GWAS has been applied to uncover the genetic etiology of VTE [8,9,10,11], and several SNPs and genes have been identified to be related to its pathogenesis. For instance, coagulation factors including coagulation factor II (F2), coagulation factor V (F5), and coagulation factor XI (F11) are typical factors participating in the coagulation process, while protein C (PROC) plays a role in anticoagulation. Glycoprotein 6 (GP6) and phospholipase C gamma 2 (PLCG2) are related to platelet generation and regulation [9, 12].

Since the results of GWAS are in the form of SNPs, the isolated outcomes of GWAS are difficult to reflect the impact on genes or proteins. Recently, a novel analytical method called proteome-wide association studies (PWAS) was developed to clarify how proteins involve in the occurrence and development of diseases [13]. Previous PWAS were mostly conducted on nervous system diseases such as depression, lacunar stroke, Alzheimer’s disease, and post-traumatic stress disorder [13,14,15,16] because these studies used tissue-specific proteins instead of blood proteins [17, 18]. Recent release of data on human plasma proteomes [19] enables the explorations of the associations of proteins and the risk of blood diseases, such as VTE.

In this study, we performed PWAS combining the data from GWAS and protein quantitative trait locus (pQTL) to investigate the proteins deserved further investigation as diagnostic and therapeutic targets for VTE. Mendelian randomization (MR) analysis and Bayesian colocalization analysis were also conducted to clarify the causal relationship between the discovered proteins and VTE pathogenesis. Protein-protein interaction (PPI), and pathway enrichment were applied to explore the potential mechanisms of candidate proteins.

Materials and methods

Summary statistics of genome-wide association studies

We generated genome-wide association study summary statistics from the UK Biobank (UKB, access code 56,719) [20]. We identified 14,429 unrelated British Caucasian cases based on a self-report questionnaire (1068, 1093, 1094 from data field 20002) and hospital records (ICD-10: I260, I269, I801, I802, I803, I808, I809, I828, I829, O223, O229, O871; ICD-9: 41511, 41512, 41513, 41519, 45111, 45119, 4512, 4519, 4531, 4532, 45340, 45341, 45342, 4539; OPER4: L791, L902). Patients diagnosed with portal vein thrombosis (ICD-10: I81), Budd-Chiari syndromes (ICD-10: I820), and other coagulation defects (ICD-10: D68.X) were excluded from the cases. We then selected 267,037 unrelated controls of the same ancestry to form a cohort of 281,466 individuals.

For UKB data, we performed variant level quality control by criteria of MAF > 0.01, genotype missingness <0.02, and pHWE >1 × 10^–10 on 96 million imputed variants. We then obtained 8,473,913 variants for downstream analysis. All genotyped variants passing quality control on autosomal chromosomes were tested for association with VTE through logistic regression adjusting for age, sex, and top ten principal components using PLINK [21].

Human blood proteome reference weight for PWAS

We next obtained whole-blood pQTL data from the Atherosclerosis Risk in Communities (ARIC) study [19] including 1348 cis-heritable plasma proteins from 7213 European Americans (EA) to match the GWAS datasets. Proteomic profiling was performed using the SomaScan technology using the v.4.1 platform. Genotyping was conducted using the Infinium Multi-Ethnic Global BeadChip array (Illumina, GenomeStudio) and imputed to the TOPMed reference panel (Freeze 5 on GRCh38). These pQTL data were used as reference weights for subsequent PWAS analysis.

PWAS

Using the FUSION pipeline (http://gusevlab.org/projects/fusion/), we integrated the GWAS summary statistics with the reference human plasma proteomes (ARIC study [19]) to perform the PWAS analysis [22]. We calculated the VTE genetic effect (PWAS z-score) and combined it with the pre-calculated plasma proteome reference weight (z-score × proteome weight) to evaluate the effects of significant SNPs in the GWAS on the protein abundance. Finally, FUSION identified candidate genes associated with VTE regulating the abundance of proteins in the plasma. To control the potential effect of multiple testing on the study results, the false discovery rate (FDR) of P value < 0.05 was used as the significance threshold in our PWAS analysis.

MR analysis

MR was used to verify whether PWAS-significant genes were associated with VTE via their cis-regulated plasma protein abundance [23, 24]. In the MR analysis, protein-relevant SNPs were used as instrumental variables (IVs) to test the causal effect of the exposure (protein expression) on the outcome (VTE). For MR analysis, the inverse variance weighted (IVW) method [25] was used as the main MR analyses. The MR-Egger [26] method was used to detect directional pleiotropy according to the intercept of weighted linear regression of the SNP‐outcome coefficients on SNP‐exposure coefficients. We used the default parameters and P value < 0.0025 was set as the significance level (0.05/20 = 0.0025).

Bayesian colocalization analysis

We applied COLOC to assess the probability of the same variant being responsible for both changing VTE risk and protein expression [27]. We used the FUSION parameter to perform colocalization based on the GWAS and pQTL data. We used the default COLOC priors of p1 = 10⁻⁴, p2 = 10⁻⁴, and p12 = 10⁻⁵, where p1 is the probability that a given variant is associated with VTE, p2 is the probability that a given variant is a significant pQTL, and p12 is the probability that a given variant is significant in both GWAS and pQTL. A posterior colocalization probability (PP4) of 80% was used to denote a shared causal signal. The regional association plots were generated by the R package “LocusCompareR” [28].

Protein-protein interaction and pathway enrichment analysis

We used STRING, a web platform to investigate networks among the 20 significant proteins based on PPI [29]. Additionally, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment assays were performed to conduct gene set enrichment analysis on genes within a PPI network using the R package “clusterProfiler” [30].

Expression analysis of candidate proteins

GSE19151 [31] (https://www.ncbi.nlm.nih.gov/geo/, accessed on 5 February 2023) provided proteins expression in healthy and VTE blood tissues. We verified whether candidate proteins were differentially expressed in VTE patients compared with healthy controls. GSE19151 contained 95 blood tissue samples from Caucasians (47 VTE patients and 48 healthy controls). Differentially expression between VTE patients and healthy controls were screened with P < 0.0083 (0.05/6 = 0.0083).

Results

PWAS identifies 20 candidate proteins associated with VTE using plasma pQTL

There are 1529 genome-wide significant loci for GWAS of VTE (Supplementary Figs. 1 and 2). The PWAS conducted in the GWAS identified 20 genes whose cis-regulated plasma protein levels were associated with VTE at a FDR of P < 0.05 (Table 1, Fig. 1, Supplementary Table 1, and Supplementary Fig. 2) [F2, F11, ABO Blood Group (ABO), PLCG2, LDL receptor-related protein 4 (LRP4), pleckstrin (PLEK), kallikrein B1 (KLKB1), PROC, kininogen 1 (KNG1), thrombospondin 2 (THBS2), serpin family A member 1 (SERPINA1), retinoic acid receptor responder 2 (RARRES2), carboxyl ester lipase (CEL), GP6, serpin family E member 2 (SERPINE2), serpin family A member 10 (SERPINA10), odorant binding protein 2B (OBP2B), EGF containing fibulin extracellular matrix protein 1 (EFEMP1), F5, and macrophage scavenger receptor 1 (MSR1)].

Table 1 The results of the PWAS of VTE, followed by Mendelian randomization and COLOC

Full size table

MR verifies the causal relationship of 13 proteins with VTE using plasma pQTL

Most of the analyzed proteins could be instrumented using several SNPs. MR estimates were mainly based on the IVW method. We further confirmed that thirteen proteins, including F11, ABO, PLCG2, LRP4, PLEK, PROC, KNG1, THBS2, SERPINA1, SERPINE2, SERPINA10, EFEMP1, and F2, have a causal relationship with VTE. Associations between lower EFEMP1, PLEK, PROC, SERPINA1, SERPINA10, SERPINE2, and THBS2 levels and higher VTE risk were identified, as well as associations between higher ABO, F2, F11, KNG1, LRP4, and PLCG2 levels and higher VTE risk (Table 1, Fig. 2, Supplementary Tables 2 and 3).

Colocalization between VTE GWAS and pQTL in the plasma

The colocalization analysis reported for each protein, the probability that the GWAS and pQTL share the same variant, referred to as hypothesis 4 (PP4). This analysis found that 6 of the 20 proteins provided evidence of genetic colocalization based on a PP4 > 80%. The results indicated that F2, F11, PLEK, SERPINA1, RARRES2, and SERPINE2 play important roles in VTE risk (PP4 = 100.0, 99.9, 100.0, 95.5, 80.3, and 91.2%, respectively; Table 1, Fig. 3, Supplementary Table 4, Supplementary Table 5, and Supplementary Fig. 4).

Protein-protein interaction and pathway enrichment analysis

We used the STRING database to investigate the connectivity among the 20 VTE-related proteins from the PWAS and found a protein community based on PPIs. A module is a set of proteins that are more connected to one another than they are to other groups of proteins. The module included F2, F5, F11, PROC, SERPINA1, SERPINE2, KLKB1, and KNG1 (Fig. 4, and Supplementary Table 6).

We also carried out pathway enrichment analysis on genes within a PPI network. We found that these 20 significant genes were involved in complement and coagulation cascades (F2, F5, F11, KLKB1, PROC, KNG1, SERPINA1, and SERPINE2), platelet activation (F2, PLCG2, PLEK, GP6, and SERPINE2), and the immune response (F2 and RARRES2; Fig. 4 and Supplementary Table 7).

Examination of expression in VTE patients

To further confirm the dysregulation of these proteins in VTE, we analyzed blood tissues from VTE patients and healthy control in GSE19151. Gene expression analysis showed that three out of six VTE-related proteins, PLEK, SERPINA1, and SERPINE2, had significantly lower expression levels in the VTE group compared with healthy control group (p < 0.001) (Fig. 5A–C), while the expression level of RARRES2, F2, and F11 was no significant between the VTE group and healthy control group (p > 0.05).

Diagnosis performance of candidate plasma biomarkers

We inspected the individual diagnostic performance of PLEK, SERPINA1, SERPINE2, RARRES2, F2 and F11 between the VTE patients and healthy controls. In Fig. 5D, the area under the ROC curve (AUC) of SERPINE2, SERPINA1, and PLEK were 0.83, 0.74, and 0.71, respectively those of other proteins were in the range of 0.52–0.56 in the VTE diagnosis. SERPINE2, SERPINA1, and PLEK showed good performance for the VTE diagnosis.

Significance of the protein findings

Our research identified the lowest P values for the SNPs within 1 Mb of each of these 20 genes using the summary statistics from the UKB. We determined that ten genes (PROC, KNG1, THBS2, SERPINA1, RARRES2, GP6, SERPINE2, SERPINA10, EFEMP1, and MSR1) had SNPs with P values ranging from 4.05 × 10⁻⁴ to 5.83 × 10⁻⁷ (Supplementary Table 8), implying that these genes could not be significant in GWAS of VTE. These findings are consistent with observations from other PWAS studies that found the novel genes from regions below genome-wide significant P values [32,33,34].

Discussion

In this research, we performed a PWAS analysis of VTE and found 20 proteins that may be involved in VTE pathogenesis, including several known VTE-associated genes, such as F2, F5, F11, PROC, and ABO gene-encoded proteins. The MR analysis proved that 13 of the 20 PWAS-selected genes were linked with VTE via cis-regulated protein abundance. COLOC analysis found 6 representative SNPs (F2 rs1799963, F11 rs2289252, PLEK rs1867312, SERPINA1 rs28929474, SERPINE2 rs3735167, and RARRES2 rs13412535) that are responsible for both VTE risk and protein level modulation. In the following PPI and functional analysis, relevant proteins were enriched in pathways complement and coagulation cascades (F2, F5, F11, KLKB1, PROC, KNG1, SERPINA1, and SERPINE2), platelet activation (F2, PLCG2, PLEK, GP6, and SERPINE2), and immune response (F2 and RARRES2).

A cross-ancestry investigation of VTE genomic predictors conducted by Thibord et al revealed some protein encoding genes related to VTE risk, some of which are supported by our study, including findings on F2, F11, GP6, PLEK, PROS1, and MSR1 [10]. Furthermore, we performed COLOC analysis to assess the probability of some SNPs might be responsible for VTE by changing particular protein expression, which makes outcomes more robust to some extent.

In our analysis, we identified some classical VTE-proteins, such as F2 and F11 [35]. F2 is also known as prothrombin, which is an indispensable key factor in both endogenous and exogenous coagulation pathways and possesses procoagulant, anticoagulant, and antifibrinolytic activities. The G20210A variant (rs1799963) of the F2 gene is common in the Caucasian population. It can promote an increase in prothrombin expression and activity and the synthesis of prothrombin, and thus increase the risk of thrombosis [36]. F11 affects VTE as a part of the extrinsic coagulation pathway. Interestingly, as another well-known gene, the F5 was only significant in our PWAS study. Maybe the result of the MR study is caused by weak IVs for F5.

Furthermore, we also discovered four novel proteins in VTE including SERPINA1, SERPINE2, and PLEK, which are the most meaningful findings in our research. Some existing evidence has implied their potential relationship with VTE. Genetic variations of the SERPINA1 gene rs28929474 are associated with the risk of VTE [37, 38]. The genetic variant of the SERPINA1 rs2749527 has also been reported to influence plasma cortisol levels [39, 40] and an MR analysis discovered that higher plasma cortisol levels were associated with a reduced risk of VTE [41]. SERPINE2 has been discovered to function in many vascular disorders, such as atherosclerosis and restenosis [42]. It differs from conventional thrombosis-related factors as it exists on the surface of most cells but is barely expressed in plasma. SERPINE2 can play an inhibitory role in the coagulation system as well as in the fibrinolytic system [43, 44]. As a result, it is a significant regulator of hemostasis, thrombosis, and vascular disorders, although its function in VTE has not been clarified [45]. The PLEK gene is an active factor in VTE, and it was confirmed in our research as well. A large GWAS meta-analysis found that PLEK rs1867312 was an independent genetic risk signal for VTE [8]. Additionally, the transcribed protein of PLEK, pleckstrin, is found in platelets and is involved with platelet biology [46,47,48].

Our study has several advantages. First, PWAS of VTE was conducted using the human proteome and summary statistics from the UKB, a large population-based prospective study with deep genetic and phenotypic data. Second, we performed the PWAS and verified the risk proteins with independent MR validation analysis. Third, based on Bayesian colocalization used to estimate the probability that two associated signals were observed at a particular site with a common causal variant, we confirmed the pathogenetic proteins (F2, F11, PLEK, SERPINA1, RARRES2, and SERPINE2) of VTE. Finally, PWAS could detect proteins which could be ignored by GWAS.

Our research also has some limitations. First, the impact of ethnic differences on the genetic architecture of VTE cannot be ignored. The current proteomic information was from the European population; thus, the applicability of this study to other populations needs to be discussed. Second, this study only used GWAS and pQTL for PWAS analysis and barely obtained results at the level of protein, which might have minimized the robustness of our conclusions. We could further screen and explore through TWAS to achieve a more complete understanding of the molecular mechanism involved in VTE. Finally, based on our PWAS conclusion, we only performed MR, COLOC, PPI, and functional analysis to verify their rationality, and more diverse and multi-level exploration should be carried out for verification.

Conclusion

This research conducted a PWAS analysis to explore the proteomic pathogenesis of VTE. Several proteins including SERPINA1, SERPINE2, and PLEK, were considered to play a role in the development of VTE and are of great value for further research to find new diagnostic and therapeutic targets for VTE.

Data availability

For access to GWAS summary statistics from the UK Biobank (UKB, access code 56719) in this manuscript see: www.ukbiobank.ac.uk/. For access to the results of the pQTL analysis and protein weights described in this manuscript see: https://doi.org/10.1038/s41588-022-01051-w. All codes analysed in this study can be obtained by a reasonable request to corresponding authors.

References

Di Nisio M, van Es N, Büller HR. Deep vein thrombosis and pulmonary embolism. Lancet. 2016;388:3060–73.
Article PubMed Google Scholar
Wendelboe AM, Raskob GE. Global burden of thrombosis: epidemiologic aspects. Circ Res. 2016;118:1340–7.
Article CAS PubMed Google Scholar
Schulman S, Ageno W, Konstantinides SV. Venous thromboembolism: past, present and future. Thromb Haemost. 2017;117:1219–29.
Article PubMed Google Scholar
Zhang Z, Lei J, Shao X, Dong F, Wang J, Wang D, et al. Trends in hospitalization and in-hospital mortality From VTE, 2007 to 2016, in China. Chest. 2019;155:342–53.
Article PubMed Google Scholar
Souto JC, Almasy L, Borrell M, Blanco-Vaca F, Mateo J, Soria JM, et al. Genetic susceptibility to thrombosis and its relationship to physiological risk factors: the GAIT study. Genetic analysis of idiopathic thrombophilia. Am J Hum Genet. 2000;67:1452–9.
Article CAS PubMed PubMed Central Google Scholar
Morange PE, Suchon P, Trégouët DA. Genetics of venous thrombosis: update in 2015. Thromb Haemost. 2015;114:910–9.
Article PubMed Google Scholar
Hirschhorn JN, Daly MJ. Genome-wide association studies for common diseases and complex traits. Nat Rev Genet. 2005;6:95–108.
Article CAS PubMed Google Scholar
Lindstrom S, Wang L, Smith EN, Gordon W, van Hylckama Vlieg A, de Andrade M, et al. Genomic and transcriptomic association studies identify 16 novel susceptibility loci for venous thromboembolism. Blood. 2019;134:1645–57.
Article PubMed PubMed Central Google Scholar
Klarin D, Busenkell E, Judy R, Lynch J, Levin M, Haessler J, et al. Genome-wide association analysis of venous thromboembolism identifies new risk loci and genetic overlap with arterial vascular disease. Nat Genet. 2019;51:1574–79.
Article CAS PubMed PubMed Central Google Scholar
Thibord F, Klarin D, Brody JA, Chen MH, Levin MG, Chasman DI, et al. Cross-ancestry investigation of venous thromboembolism genomic predictors. Circulation. 2022;146:1225–42.
Article CAS PubMed PubMed Central Google Scholar
Zhang Z, Li H, Weng H, Zhou G, Chen H, Yang G, et al. Genome-wide association analyses identified novel susceptibility loci for pulmonary embolism among Han Chinese population. BMC Med. 2023;21:153.
Article CAS PubMed PubMed Central Google Scholar
Croles FN, Nasserinejad K, Duvekot JJ, Kruip MJ, Meijer K, Leebeek FW. Pregnancy, thrombophilia, and the risk of a first venous thrombosis: systematic review and bayesian meta-analysis. BMJ. 2017;359:j4452.
Article PubMed PubMed Central Google Scholar
Wingo TS, Liu Y, Gerasimov ES, Gockley J, Logsdon BA, Duong DM, et al. Brain proteome-wide association study implicates novel proteins in depression pathogenesis. Nat Neurosci. 2021;24:810–17.
Article CAS PubMed PubMed Central Google Scholar
Zhang C, Qin F, Li X, Du X, Li T. Identification of novel proteins for lacunar stroke by integrating genome-wide association data and human brain proteomes. BMC Med. 2022;20:211.
Article CAS PubMed PubMed Central Google Scholar
Wingo AP, Liu Y, Gerasimov ES, Gockley J, Logsdon BA, Duong DM, et al. Integrating human brain proteomes with genome-wide association data implicates new proteins in Alzheimer’s disease pathogenesis. Nat Genet. 2021;53:143–46.
Article CAS PubMed PubMed Central Google Scholar
Wingo TS, Gerasimov ES, Liu Y, Duong DM, Vattathil SM, Lori A, et al. Integrating human brain proteomes with genome-wide association data implicates novel proteins in post-traumatic stress disorder. Mol Psychiatry. 2022;27:3075–84.
Article CAS PubMed PubMed Central Google Scholar
Ou YN, Yang YX, Deng YT, Zhang C, Hu H, Wu BS, et al. Identification of novel drug targets for Alzheimer’s disease by integrating genetics and proteomes from brain and blood. Mol Psychiatry. 2021;26:6065–73.
Article CAS PubMed Google Scholar
Liu J, Li X, Luo XJ. Proteome-wide association study provides insights into the genetic component of protein abundance in psychiatric disorders. Biol Psychiatry. 2021;90:781–89.
Article CAS PubMed Google Scholar
Zhang J, Dutta D, Köttgen A, Tin A, Schlosser P, Grams ME, et al. Plasma proteome analyses in individuals of European and African ancestry identify cis-pQTLs and models for proteome-wide association studies. Nat Genet. 2022;54:593–602.
Article CAS PubMed PubMed Central Google Scholar
Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–09.
Article CAS PubMed PubMed Central Google Scholar
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75.
Article CAS PubMed PubMed Central Google Scholar
Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BW, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet. 2016;48:245–52.
Article CAS PubMed PubMed Central Google Scholar
Zhu Z, Zhang F, Hu H, Bakshi A, Robinson MR, Powell JE, et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet. 2016;48:481–7.
Article CAS PubMed Google Scholar
Porcu E, Rüeger S, Lepik K, Santoni FA, Reymond A, Kutalik Z. Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits. Nat Commun. 2019;10:3300.
Article PubMed PubMed Central Google Scholar
Burgess S, Bowden J, Fall T, Ingelsson E, Thompson SG. Sensitivity analyses for robust causal inference from mendelian randomization analyses with multiple genetic variants. Epidemiology 2017;28:30–42.
Article PubMed Google Scholar
Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol. 2015;44:512–25.
Article PubMed PubMed Central Google Scholar
Giambartolomei C, Vukcevic D, Schadt EE, Franke L, Hingorani AD, Wallace C, et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014;10:e1004383.
Article PubMed PubMed Central Google Scholar
Liu B, Gloudemans MJ, Rao AS, Ingelsson E, Montgomery SB. Abundant associations with gene expression complicate GWAS follow-up. Nat Genet. 2019;51:768–69.
Article CAS PubMed PubMed Central Google Scholar
Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47:D607–d13.
Article CAS PubMed Google Scholar
Wu T, Hu E, Xu S, Chen M, Guo P, Dai Z, et al. clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innov (Camb). 2021;2:100141.
CAS Google Scholar
Lewis DA, Stashenko GJ, Akay OM, Price LI, Owzar K, Ginsburg GS, et al. Whole blood gene expression analyses in patients with single versus recurrent venous thromboembolism. Thromb Res. 2011;128:536–40.
Article CAS PubMed PubMed Central Google Scholar
Wu BS, Chen SF, Huang SY, Ou YN, Deng YT, Chen SD, et al. Identifying causal genes for stroke via integrating the proteome and transcriptome from brain and blood. J Transl Med. 2022;20:181.
Article CAS PubMed PubMed Central Google Scholar
Toikumo S, Xu H, Gelernter J, Kember RL, Kranzler HR. Integrating human brain proteomic data with genome-wide association study findings identifies novel brain proteins in substance use traits. Neuropsychopharmacology. 2022;47:2292–9.
Article CAS PubMed Google Scholar
Zhang Z, Meng P, Zhang H, Jia Y, Wen Y, Zhang J, et al. Brain proteome-wide association study identifies candidate genes that regulate protein abundance associated with post-traumatic stress disorder. Genes (Basel). 2022;13:1341.
Article CAS PubMed Google Scholar
Yuan S, Burgess S, Laffan M, Mason AM, Dichgans M, Gill D, et al. Genetically proxied inhibition of coagulation factors and risk of cardiovascular disease: a Mendelian randomization study. J Am Heart Assoc. 2021;10:e019644.
Article CAS PubMed PubMed Central Google Scholar
Zhang Y, Zhang Z, Shu S, Niu W, Xie W, Wan J, et al. The genetics of venous thromboembolism: a systematic review of thrombophilia families. J Thromb Thrombolysis. 2021;51:359–69.
Article PubMed Google Scholar
Riis J, Nordestgaard BG, Afzal S. α(1) -Antitrypsin Z allele and risk of venous thromboembolism in the general population. J Thromb Haemost. 2022;20:115–25.
Article CAS PubMed Google Scholar
Manderstedt E, Halldén C, Lind-Halldén C, Elf J, Svensson PJ, Engström G, et al. Thrombotic risk determined by rare and common SERPINA1 variants in a population-based cohort study. J Thromb Haemost. 2022;20:1421–27.
Article CAS PubMed PubMed Central Google Scholar
Bolton JL, Hayward C, Direk N, Lewis JG, Hammond GL, Hill LA, et al. Genome wide association identifies common variants at the SERPINA6/SERPINA1 locus influencing plasma cortisol and corticosteroid binding globulin. PLoS Genet. 2014;10:e1004474.
Article PubMed PubMed Central Google Scholar
Yuan S, Titova OE, Zhang K, Gou W, Schillemans T, Natarajan P, et al. Plasma protein and venous thromboembolism: prospective cohort and mendelian randomisation analyses. Br J Haematol. 2023;201:783–92.
Article CAS PubMed Google Scholar
Allara E, Lee WH, Burgess S, Larsson SC. Genetically predicted cortisol levels and risk of venous thromboembolism. PLoS One. 2022;17:e0272807.
Article CAS PubMed PubMed Central Google Scholar
Kanse SM, Chavakis T, Al-Fakhri N, Hersemeyer K, Monard D, Preissner KT. Reciprocal regulation of urokinase receptor (CD87)-mediated cell adhesion by plasminogen activator inhibitor-1 and protease nexin-1. J Cell Sci. 2004;117:477–85.
Article CAS PubMed Google Scholar
Boulaftali Y, Adam F, Venisse L, Ollivier V, Richard B, Taieb S, et al. Anticoagulant and antithrombotic properties of platelet protease nexin-1. Blood. 2010;115:97–106.
Article CAS PubMed Google Scholar
Boulaftali Y, Ho-Tin-Noe B, Pena A, Loyau S, Venisse L, François D, et al. Platelet protease nexin-1, a serpin that strongly influences fibrinolysis and thrombolysis. Circulation. 2011;123:1326–34.
Article CAS PubMed PubMed Central Google Scholar
Bouton MC, Boulaftali Y, Richard B, Arocas V, Michel JB, Jandrot-Perrus M. Emerging role of serpinE2/protease nexin-1 in hemostasis and vascular biology. Blood. 2012;119:2452–7.
Article CAS PubMed Google Scholar
Coppinger JA, Cagney G, Toomey S, Kislinger T, Belton O, McRedmond JP, et al. Characterization of the proteins released from activated platelets leads to localization of novel platelet proteins in human atherosclerotic lesions. Blood. 2004;103:2096–104.
Article CAS PubMed Google Scholar
Fröbel J, Cadeddu RP, Hartwig S, Bruns I, Wilk CM, Kündgen A, et al. Platelet proteome analysis reveals integrin-dependent aggregation defects in patients with myelodysplastic syndromes. Mol Cell Proteom. 2013;12:1272–80.
Article Google Scholar
Schmidt GJ, Reumiller CM, Ercan H, Resch U, Butt E, Heber S, et al. Comparative proteomics reveals unexpected quantitative phosphorylation differences linked to platelet activation state. Sci Rep. 2019;9:19009.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank the participants of the ARIC and UK Biobank for their time and participation. We thank Dr. Wingo (Division of Mental Health, Atlanta VA Medical Center, Decatur, GA, USA) for making the pipeline for PWAS. We thank the editors and reviewers for their comments and instruction.

Funding

This research was funded by Beijing Nova Program (No. Z211100002121057), National Natural Science Foundation of China (No. 82100065), CAMS Innovation Fund for Medical Sciences (2021-I2M-1-061), and Elite Medical Professionals project of China-Japan Friendship Hospital (No. ZRJY2021-QM12).

Author information

Authors and Affiliations

National Center for Respiratory Medicine; State Key Laboratory of Respiratory Health and Multimorbidity; National Clinical Research Center for Respiratory Diseases; Institute of Respiratory Medicine, Chinese Academy of Medical Sciences; Department of Pulmonary and Critical Care Medicine, Center of Respiratory Medicine, China-Japan Friendship Hospital, Beijing, China
Haobo Li, Zhu Zhang, Yunxia Zhang, Yu Zhang, Linfeng Xi, Feiya Xu, Xiaofan Ji, Risheng Hao, Zhenguo Zhai & Chen Wang
China-Japan Friendship Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
Haobo Li & Xiaofan Ji
Capital Medical University, Beijing, China
Yuting Qiu, Yu Zhang, Linfeng Xi, Feiya Xu & Risheng Hao
WeGene, Shenzhen, China; Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, China
Haoyi Weng & Gang Chen
Unit of Cardiovascular and Nutritional Epidemiology, Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
Shuai Yuan
State Key Laboratory of Respiratory Health and Multimorbidity, Department of Physiology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and School of Basic Medicine, Peking Union Medical College; National Center for Respiratory Medicine; Institute of Respiratory Medicine, Chinese Academy of Medical Sciences; National Clinical Research Center for Respiratory Diseases, Beijing, China
Peiran Yang
Department of Pharmacy, China-Japan Friendship Hospital, Beijing, China
Xianbo Zuo

Authors

Haobo Li
View author publications
You can also search for this author in PubMed Google Scholar
Zhu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yuting Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Haoyi Weng
View author publications
You can also search for this author in PubMed Google Scholar
Shuai Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Yunxia Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Linfeng Xi
View author publications
You can also search for this author in PubMed Google Scholar
Feiya Xu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaofan Ji
View author publications
You can also search for this author in PubMed Google Scholar
Risheng Hao
View author publications
You can also search for this author in PubMed Google Scholar
Peiran Yang
View author publications
You can also search for this author in PubMed Google Scholar
Gang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xianbo Zuo
View author publications
You can also search for this author in PubMed Google Scholar
Zhenguo Zhai
View author publications
You can also search for this author in PubMed Google Scholar
Chen Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Zhu Zhang, Zhenguo Zhai, and CW designed the research. HL, HW and SY analyzed data. PY, GC, and XZ verified the data. HL and YQ wrote the manuscript. YZ, YZ, LX, FX, RH, and XJ participated in editing the manuscript. All authors were involved in the revision of the manuscript for important intellectual content and approved the final version.

Corresponding authors

Correspondence to Zhu Zhang, Zhenguo Zhai or Chen Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, H., Zhang, Z., Qiu, Y. et al. Proteome-wide mendelian randomization identifies causal plasma proteins in venous thromboembolism development. J Hum Genet 68, 805–812 (2023). https://doi.org/10.1038/s10038-023-01186-6

Download citation

Received: 27 April 2023
Revised: 19 June 2023
Accepted: 23 July 2023
Published: 03 August 2023
Issue Date: December 2023
DOI: https://doi.org/10.1038/s10038-023-01186-6

Subjects

Abstract

Similar content being viewed by others

Genomic and drug target evaluation of 90 cardiovascular proteins in 30,931 individuals

Mapping biological influences on the human plasma proteome beyond the genome

Genetic associations of protein-coding variants in venous thromboembolism

Introduction

Materials and methods

Summary statistics of genome-wide association studies

Human blood proteome reference weight for PWAS

PWAS

MR analysis

Bayesian colocalization analysis

Protein-protein interaction and pathway enrichment analysis

Expression analysis of candidate proteins

Results

PWAS identifies 20 candidate proteins associated with VTE using plasma pQTL

MR verifies the causal relationship of 13 proteins with VTE using plasma pQTL

Colocalization between VTE GWAS and pQTL in the plasma

Protein-protein interaction and pathway enrichment analysis

Examination of expression in VTE patients

Diagnosis performance of candidate plasma biomarkers

Significance of the protein findings

Discussion

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links