Introduction

HPT arises from dysregulated parathyroid hormone (PTH) secretion, leading to disturbances in calcium and phosphorus homeostasis. It is primarily categorized into three types: primary, secondary, and tertiary hyperparathyroidism. Primary hyperparathyroidism (PHPT) represents the third most prevalent endocrine disorder worldwide. Over the past 50 years, the global incidence and prevalence of PHPT have notably increased. Notably, it is associated with parathyroid adenoma in more than 80% of cases. Secondary hyperparathyroidism (SHPT) arises from hypocalcemia, typically due to chronic kidney disease (CKD). CKD poses an escalating global health burden, with an estimated worldwide prevalence of 8–16%. Tertiary hyperparathyroidism occurs rarely by comparison, marked by autonomous and excessive parathyroid hormone secretion from hyperplastic glands triggered by prolonged SHPT1.

HPT is a complex and diverse condition that significantly impacts the quality of life for affected patients with various complications, including cardiovascular diseases2, kidney stones, and bone diseases3. In rare cases, prolonged hyperparathyroidism can ultimately induce end-stage parathyroid carcinoma—an aggressive malignancy with limited treatment options4,5. Along with the disorder of PTH, calcium, and phosphate levels, thorough evaluation and tight management are crucial for the treatment of HPT. The Fifth International Workshop guidelines provide a Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) methodology to guide surgical management of asymptomatic primary hyperparathyroidism as well as non-surgical approaches for symptomatic cases6. However, these guidelines have only applied the GRADE approach to systematic review evidence rather than the process of formulating final recommendations. As a result, clinical practices continue to lack standardization, especially regarding the selection of pharmacological therapies for PHPT7. Furthermore, surgery is one of the recommendations for most primary hyperparathyroidism and tertiary hyperparathyroidism patients, but the long-term effect is poor and a substantial subgroup exists who are asymptomatic and may not derive significant benefits from surgery8. Therefore, it is essential to explore genetics and identify effective drug targets for the prevention of HPT.

While large-scale randomized controlled trials (RCTs) can rigorously assess pharmacological interventions, undertaking such studies requires extensive planning, prolonged execution timelines, and substantial resource allocation. Additionally, RCTs are not feasible to examine causal links between thousands of druggable genes and hyperparathyroidism in the absence of definitive evidence.

Recently, MR analysis has gained widespread usage in drug target development9. MR leverages the random assortment of genetic variants during conception to minimize confounding from environmental and lifestyle factors. Expression quantitative trait locis (eQTLs) linked to changes in gene expression may mimic long-term exposure to drugs targeting the encoded proteins10,11. Genome-Wide Association Study (GWAS) data on the outcome (e.g. hyperparathyroidism risk) can then be integrated with eQTL information to examine causal relationships between gene expression and disease using MR. Such approaches have been implemented for diseases like aortic aneurysms, COVID-19, rheumatoid arthritis, and Parkinson’s disease12,13,14,15. However, MR has yet to be applied to systematically evaluate possible treatment targets for hyperparathyroidism.

In this work, our objective was to identify potential drug targets related to HPT through MR analysis. In two separate exposure cohorts, we found a significant association between the genetically predicted expression of SLC40A1 and PIK3C3 and the risk of HPT, supported by robust evidence for colocalization. The result supports that drug-targeting PIK3C3 and SLC40A1 may slow down HPT progression, which offers a potential approach to prioritize medical treatment for the prevention of HPT.

Methods

Study design and data sources

The present study sought to identify novel therapeutic targets relevant to hyperparathyroidism (HPT). The methodological pipeline is outlined through a flow diagram (Fig. 1) and directed acyclic graph (Fig. 2), with detailed information on data sources provided in Supplementary Table 1. First, two-sample Mendelian randomization was performed using eQTL data from eQTLGen and hyperparathyroidism GWAS from FinnGen to pinpoint potential causal genes. Second, findings were further validated via heterogeneity testing, horizontal pleiotropy assessment, and reverse causality detection. Third, analysis was replicated in GTEx data as external validation. Then, multivariable MR analysis was conducted to delineate joint causal effects of multiple risk factors. Finally, Bayesian colocalization analysis, PheWAS studies, and drug prediction were employed to investigate the feasibility of the identified drug targets for potential future clinical applications.

Figure 1
figure 1

Study design overview (By Figdraw).

Figure 2
figure 2

A directed acyclic graph (DAG) of this work (By Figdraw).

Our study was a secondary analysis of publicly available data. Specifically, summary-level data were obtained from three large-scale genome-wide association resources: the eQTLGenConsortium (https://eqtlgen.org/)16, GTEx dataset (https://gtexportal.org/home/eqtlDashboardPage)17, and FinnGen (https://www.finngen.fi/en)18. The included studies had received ethical approval from their respective institutional review boards. The current study was performed following the current guideline and the STROBE-MR checklist has been attached as supplemental material19.

Actionable drug targets

A total of 4464 druggable genes located on autosomal chromosomes with HGNC annotations were identified for analysis. This set encompassed 1425 genes encoding protein targets in current clinical development, 674 genes linked to proteins engaged by approved drugs or compounds, and 2360 genes belonging to established drug target families. The study carried out by Chris et al. provides further detailed information on these feasible pharmacological targets20.

eQTL data for the identification of genetic instrumental variables

Genetic instrumental variables were derived from two distinct eQTL datasets. The discovery cohort dataset utilized for this purpose comprised eQTL data sourced from the eQTLGen Consortium. Additionally, the replication cohort dataset was obtained from GTEx, ensuring a robust validation of our findings across independent datasets.

eQTLGen consortium

Genome-wide significant cis-eQTLs (false discovery rate < 0.05) within ± 1 Mb of probe locations were identified. The dataset utilized for this analysis, eQTLGen, encompasses 16,987 genes and was derived from the analysis of 31,684 blood samples, primarily collected from individuals of European ancestry and characterized by their overall good health. Emphasizing the relevance of eQTLs in drug development studies due to their proximity to the target genes and direct impact on gene expression, we narrowed down our selection to SNPs situated within 100 kb upstream of transcription start sites or 100 kb downstream of transcription end sites of druggable genes. Consequently,we identified eQTLs associated with 2327 druggable genes.

GTEx

As independent replication cohorts, eQTL data for 670 whole blood 574 and thyroid tissue was obtained from version 8 of the GTEx database (https://gtexportal.org/home/)21.

Outcome data

Hyperparathyroidism

The GWAS data for hyperparathyroidism were sourced from FinnGen Release 8 (https://www.finngen.fi/en), which was published in December 2022. Hyperparathyroidism in FinnGen was defined based on the International Classification of Disease (ICD), encompassing 5,590 cases (ICD-8(2520), ICD-9(2520), ICD-10(550)), and 361,988 controls.

Risk factors

We identified 12 hyperparathyroidism risk factors encompassing chronic kidney disease22, chronic tubulointerstitial nephritis18, glomerulonephritis18, IgA nephropathy18, nephrolithiasis18, blood calcium levels23, serum/plasma magnesium24, serum/plasma 25-hydroxyvitamin D2/D324, serum phosphate levels25, parathyroid hormone26, intestinal malabsorption18, and osteomalacia18 (Supplementary Table S2).

Mendelian randomization analysis

The TwoSampleMR R package (version 0.5.7, https://mrcieu.github.io/ TwoSampleMR/) was employed to execute two-sample MR analysis27. Stringent quality control of the SNP instruments was implemented prior to MR testing. First, variants with weak instrument strength (F-statistic < 10, where F = (beta/se)2) were excluded. Conditionally independent variants in low linkage disequilibrium (LD r2 < 0.1 per 1000 Genomes European panel) were then selected. Finally, Steiger filtering removed genes wherein SNPs explained greater outcome than exposure variance (Supplementary Table S3).

In the primary analysis, the Wald ratio was employed for proposed instruments comprising a single SNP. For proposed instruments that encompassed more than one SNP, a comprehensive approach was adopted. This included the use of the inverse variance weighted (IVW) method, MR-Egger, and weighted median MR. The IVW method assumes all genetic instruments are valid and has the highest statistical power when this assumption holds 28. The weighted median approach allows some (50%) invalid instruments by weighting SNP-specific MR estimates based on magnitude and taking the median estimate with bootstrapped standard errors29. MR-Egger allows for horizontal pleiotropy, wherein some SNPs influence the outcome via alternative pathways, but has lower statistical power. The MR-Egger intercept test can determine the presence of horizontal pleiotropy30. To account for multiple testing in the sensitivity analyses, Bonferroni corrections were implemented to establish adjusted significance thresholds. In the eQTLGen cohort, p values below 2.15e−05 (calculated as p = 0.05/2327, 2327 is the number of druggable genes in eQTLGen data) were deemed significant. Following this, quality control procedures were applied to genes identified as significant, ensuring consistency in the direction of estimated effects across the three methods and confirming the absence of horizontal pleiotropy via the MR-Egger test. Genes passing quality control were further evaluated in the GTEx cohort. A conservative significance threshold of P < 1.25e−02 (calculated as 0.05/4 for Bonferroni correction, 4 is the number of genes replicated in GTEx data) was applied. Replication at this stringent cutoff enhanced robustness of associations by reducing likelihood of false positives, providing external validation complementing the original eQTLGen discovery analysis.

Additionally, multivariable MR analysis was conducted using the MVMR R package, examining associations of PIK3C3 and SLC40A1 with hyperparathyroidism risk while adjusting for the other variable. As an extension of univariate MR, multivariable MR can delineate joint causal effects of multiple risk factors 31. For PIK3C3, the effect of intestinal malabsorption was adjusted18. For SLC40A1, adjustments were made for potential effects of chronic kidney disease22, blood calcium levels23, and glomerulonephritis18.

Reverse causality detection

Adhering to analogous screening criteria employed for eQTLs, genetic instruments for hyperparathyroidism were meticulously chosen from the GWAS dataset of FinnGen. These instruments were then utilized in a bidirectional MR analysis to discern possible reverse causation. Effect estimates were derived using three distinct methods: MR-Inverse Variance Weighted (MR-IVW), MR-Egger, and weighted median. Furthermore, a Steiger filtering procedure was executed to confirm the directionality of the association between expression quantitative trait loci (eQTL) and hyperparathyroidism32. Statistical significance was established at a threshold of P < 0.05, providing a rigorous criterion for the validity and reliability of the observed associations.

Colocalization analysis

For genes with significant Mendelian randomization associations in both eQTLGen and GTEx cohorts, colocalization analysis was performed using the coloc R package with default priors 33. This Bayesian approach tested whether the gene expression-hyperparathyroidism links were driven by shared causal variants at a given locus rather than linkage disequilibrium. Specifically, five mutually exclusive hypotheses were assessed: (H0) no association with either trait; (H1) association with expression only; (H2) association with disease only; (H3) association with both traits, but independent causal variants; (H4) association with both traits driven by a shared causal variant34. Posterior probabilities were furnished for each hypothesis. Prior probabilities were set at 1e−04 for trait 1 only (p1) and trait 2 only (p2), and 1e−05 for both traits (p12). Strong evidence of colocalization was defined as a posterior probability ≥ 0.8 for H4, with values between 0.5 and 0.8 suggesting moderate colocalization.

Phenome‑wide association analysis

Utilizing the AstraZeneca PheWAS Portal (https://azphewas.com/) and PheWeb database (https://pheweb.org/), PheWAS was carried out to thoroughly evaluate the horizontal pleiotropy of possible therapeutic targets and probable adverse effects35,36.

Candidate drug prediction

In the course of this study, the identified target genes are submitted to the Drug Signatures Database (DSigDB, http://dsigdb.tanlab.org/DSigDBv1.0/) for the assessment of protein-drug interactions. DSigDB stands as a significant repository, housing 22,527 gene sets and 17,389 distinct compounds, associated with 19,531 genes. This comprehensive database facilitates the establishment of links between pharmaceuticals, diverse chemicals, and their respective target genes. PIK3C3 and SLC40A1 were analyzed using the DSigDB drug database on Enrichr (https://maayanlab.cloud/modEnrichr/) to identify potential targeting agents37.

Results

MR analysis identifies 5 actionable therapeutic targets for hyperparathyroidism

At Bonferroni significance (P < 2.15e−05), the MR analysis revealed that five genes are causally related to the risk of hyperparathyroidism, as illustrated in Figs. 3 and 4. These genes include chemerin chemokine-like receptor 1 (CMKLR1), follistatin like 1 (FSTL1), immunoglobulin superfamily member 11 (IGSF11), phosphatidylinositol 3-kinase catalytic subunit type 3 (PIK3C3), and solute carrier family 40 member 1 (SLC40A1). Specifically, increased expression of CMKLR1 (OR = 1.16; 95% CI 1.09–1.23; P = 1.56e−06), FSTL1 (OR = 1.35; 95% CI 1.2–1.53; P = 1.05e−06), IGSF11 (OR = 1.62; 95% CI 1.35–1.94; P = 2.89e−07), and PIK3C3 (OR = 1.27; 95% CI 1.15–1.4; P = 1.08e−06) increased the risk of hyperparathyroidism, whereas elevated SLC40A1 (OR = 0.86; 95% CI 0.81–0.92; P = 1.31e−05) decreased the risk of hyperparathyroidism. All 5 genes showed consistent direction of effect across the three methods (Supplemental Table S4) and no heterogeneity (P > 0.05, Supplemental Table S5) and horizontal pleiotropy (P > 0.05, Supplemental Table S6) was detected in the primary analysis. Bidirectional Mendelian randomization analysis showed no evidence for causal effects of hyperparathyroidism on expression of the five identified genes. Application of Steiger filtering provided additional confirmation of directionality from gene expression to disease status (Supplementary Tables S7 and S8).

Figure 3
figure 3

Manhattan plots for associations of genetically predicted 2327 druggable genes levels with hyperparathyroidism in MR analysis.

Figure 4
figure 4

Forest plots illustrating the results of the discovery stage for 5 essential genes.

Replication phase 5 genes remain significant in GTEx eQTL database

As IGSF11 was not available in GTEx whole blood samples, it was excluded from subsequent replication analyses. The replication MR analysis employed identical methodology as the original discovery cohort. Genetically predicted expression of CMKLR1, PIK3C3, and SLC40A1 retained significant causal associations with hyperparathyroidism risk in GTEx whole blood samples (P < 1.25e−02, inverse-variance weighted methods) (Fig. 5; Supplementary Table S9). Importantly, the directionality of effect was consistent across both the discovery and replication sets for all 3 genes.

Figure 5
figure 5

Forest plots illustrating the results of the replication stage for 4 essential genes.

Insights from thyroid tissue analysis

The FSTL1 was not found in thyroid tissue from the GTEx V8 dataset, precluding its inclusion in replication analyses. Using an identical Mendelian randomization approach as the original discovery cohort, we examined the causal relationship between genetically predicted expression levels of several genes in the GTEx thyroid tissue cohort and hyperparathyroidism risk. Genetically predicted expression of IGSF11, PIK3C3, and SLC40A1 retained significant associations with hyperparathyroidism risk in GTEx thyroid tissue cohort (P < 1.25e−02, inverse-variance weighted methods) (Fig. 6; Supplementary Table S10). Importantly, the directionality of effect was consistent between the whole blood samples and thyroid tissue for all three genes.

Figure 6
figure 6

Forest plots illustrating the results of the replication stage for 4 essential genes in thyroid tissue of the GTEx V8 dataset.

Colocalization analysis

The MR results for PIK3C3 and SLC40A1 consistently demonstrated positive or negative estimated effects in both the discovery and two testing cohorts. This consistency suggests a correlation between elevated expression of PIK3C3 and decreased expression of SLC40A1 with an increased risk of hyperparathyroidism. Consequently, antagonists of PIK3C3 and protagonists of SLC40A1, may represent a novel and robust strategy for reducing the risk of hyperparathyroidism. Prior studies have noted that significant MR findings can arise from loci where SNPs in linkage disequilibrium have associations with exposure and outcome mediated through distinct causal variants, conferring false positive results38. Colocalization analysis can elucidate whether exposure and outcome share the same causal SNP when SNPs demonstrate associations with both traits33. Bayesian co-localization strongly suggested that PIK3C3 (coloc.abf-PPH4 = 0.93) and SLC40A1 (coloc.abf-PPH4 = 0.58) shared the same variant with hyperparathyroidism , indicating a higher likelihood that these genes may serve as promising drug targets and have an increased probability of obtaining regulatory approval (Fig. 7, Supplemental Table S11).

Figure 7
figure 7

(A) Regional plot of colocalization evidence of PIK3C3 and hyperparathyroidism susceptibility. (B) Regional plot of colocalization evidence of SLC40A1 and hyperparathyroidism susceptibility.

Multivariable MR analysis identified PIK3C3 and SLC40A1 were independent associations with hyperparathyroidism

Among the 12 hyperparathyroidism risk factors, PIK3C3 exhibited significant associations with intestinal malabsorption (P = 9.11e−04 [IVW], Fig. 8A). To examine the potential independent relationship between PIK3C3 and hyperparathyroidism, multivariable MR analysis was conducted, revealing significant independent associations (P = 2.7e−10 [IVW], Supplemental Table S12). Additionally, SLC40A1 showed weak links to chronic kidney disease (P = 1.26e−03), glomerulonephritis (P = 4.43e−02) and blood calcium levels (P = 2.93e−03) (Fig. 8B). Multivariable MR analysis adjusting for these potential confounders still detected a strong association between SLC40A1 and hyperparathyroidism (P = 6.8e−03[IVW], Supplemental Table S13).

Figure 8
figure 8

Associations between two drug targets and hyperparathyroidism risk factors. (A) Forest plot revealed that intestinal malabsorption is linked to the genetically predicted expression of PIK3C3. (B) Forest plot revealed that chronic kidney disease, glomerulonephritis and blood calcium levels is linked to the genetically predicted expression of SLC40A1.

PheWAS

We conducted phenome-wide MR using the PheWAS Portal and PheWeb database35,36, in order to investigate the possible adverse consequences of these two previous druggable genes. Neither of the two drug targets showed significant associations with other traits at the gene level (P < 5e−08 for genomic association) in PheWAS Portal (Supplemental Figures S1–S2) and PheWeb database (Supplementary Table S1415). This further supports the validity of the study results by indicating that the likelihood of adverse medication reactions against these targets and the existence of horizontal pleiotropy in these genes are probably low.

Candidate drug prediction

Evaluating protein-drug interactions is critical for determining the viability of utilizing target genes as potential drug targets. PIK3C3 and SLC40A1 were analyzed using the DSigDB drug database on Enrichr to identify potential targeting agents. The findings indicate that emodin (CTD 00005893), celecoxib (CTD 00003448), and temozolomide (CTD 00002088) are the most significant drugs linked to PIK3C3, while Panobinostat (CTD 00004412), folic acid (CTD 00005997), and ethanol (CTD 00005337) are the most significant drugs linked to SLC40A1(Table 1).

Table 1 Candidate drug predicted using DSigDB.

Discussion

Hyperparathyroidism significantly impacts the quality of life and survival time of patients39. Currently, conventional drugs and surgical methods fall short of fully meeting clinical treatment needs. The development of effective and minimally invasive treatment methods is extremely necessary40. Thus far, this study represents the first application of MR leveraging eQTL data to systematically identify potentially molecular targets that could potentially provide protection against hyperparathyroidism.

This work identified two drug targets for hyperparathyroidism: PIK3C3 and SLC40A1 employing a variety of MR methods (MR-Egger, Wald ratio/IVW, weighted median method, Cochran’s Q heterogeneity test, horizontal multiplicity test, bidirectional MR analysis, Steiger test, colocalization analysis and multivariable MR analysis). Moreover, the association between PIK3C3 and SLC40A1 with hyperparathyroidism was corroborated using a parallel analysis in the GTEx database, reinforcing the credibility of the potential drug targets identified in this investigation. To further elucidate potential pleiotropy and off-target effects, PheWAS analysis was also conducted. Ultimately, the study also showed the druggable value of these target genes by predicting compounds matching to these targets.

PIK3C3, also known as VPS34, is a member the family of phosphoinositide 3-kinases—lipid kinases that play a key role in autophagy41. There are several consecutive stages that make up the autophagy machinery: initiation, nucleation, elongation, fusion, and degradation42. Specifically, PIK3C3/VPS34 interacts with BECN1/Beclin 1 to form a complex central to autophagosome nucleation41. Previous studies have demonstrated that autophagy inhibitors can potentiate effects of 5-aminolevulinic acid-mediated photodynamic therapy in secondary hyperparathyroidism models40. Additionally, calcimimetics that activate calcium-sensing receptors in parathyroid tissue have been utilized to clinically manage hyperparathyroidism, where they are proposed to ameliorate deleterious tissue remodeling in part by modulating autophagy43,44. Intriguingly, our analyses newly identify genotype–phenotype associations between PIK3C3 variants and altered hyperparathyroidism risk (eQTLGen odds ratio OR = 1.27, P = 1.08e−06; GTEx whole blood samples OR = 1.91, P = 1.53e−140; GTEx thyroid tissue OR = 1.25, P = 4.84e−27), strongly supported by colocalization (PPH4 = 0.933). Moreover, phenome-wide and multivariate Mendelian randomization indicate effects likely represent direct modulation of hyperparathyroidism pathology rather than widespread pleiotropy. These results suggest that the PIK3C3 inhibition as a promising hyperparathyroidism therapeutic strategy worthy of future investigation.

The protein encoded by SLC40A1 functions as a cell membrane protein, potentially facilitating the transport of Fe (2 +) from the interior to the exterior of the duodenal epithelial cells45,46. A portion of ferric iron in the duodenum binds to phosphorus, forming an insoluble compound that is subsequently excreted in feces47. This process contributes to the reducing of serum phosphorus, thereby inhibiting parathyroid hormone secretion. Our MR analysis suggested that SLC40A1, a mediator of Fe translating, may be a promising hyperparathyroidism therapeutic targets (eQTLGen odds ratio OR = 0.86, P = 1.31e−05; GTEx whole blood samples OR = 0.52, P = 1.23e−03; GTEx thyroid tissue OR = 0.75, P = 2.89e−137; PPH4 = 0.581).

Pharmacological treatments, including calcium and calcium-binding agents (calcium acetate and carbonic acid), non-calcium-binding agents (velamum and lanthanum carbonate), vitamin D and its analogs, and pseudocalcifier (sikansei), have advanced recently due to significant advancements in the molecular mechanisms of HPT. In addition, although parathyroidectomy (PTX) is effective in partly SHPT patients, the tolerance of patients with SHPT is poor and the incidence risk of perioperative period is still high. Given the side effects of drugs, inconclusive clinical trial outcomes, and the limitations of surgery, it becomes imperative to explore alternative therapeutic targets that are potentially effective and safe. Thus, to ensure the credibility of results derived from the MR analysis, we employed gene expression as the exposure and disease summary statistics from the UK Biobank cohorts to perform phenome-wide MR based on PheWAS Portal and PheWeb database. The findings indicate that the potential side effects of drugs are minimal and substantially reducing the likelihood of biased results attributable to pleiotropy.

This study has several strengths, including an extensive collection of eQTL, a substantial number of hyperparathyroidism cases, mutual validation in two independent exposure datasets, supportive colocalization analysis, and the exclusion of horizontal pleiotropy through PheWAS and multivariate MR analyses. Limitations should be considered when interpreting our findings. Firstly, our analyses were restricted to individuals of European descent, which limits the generalizability of the results to other ethnic populations. Secondly, due to the paucity of genetic studies on protein levels, we did not identify protein quantitative trait loci (pQTL) for PIK3C3 and SLC40A1. Consequently, we were unable to validate whether protein levels of PIK3C3 and SLC40A1 consistently associated with hyperparathyroidism risk. Additionally, clinical trials are imperative to evaluate the efficacy and safety of targeting PIK3C3 or SLC40A1 for therapeutic management of hyperparathyroidism. Furthermore, while Mendelian randomization provides insight into potential causal relationships, it assumes low-dose chronic exposure and linear dose–response links that may not fully extrapolate to real-world clinical trials that often feature short-term high-dose treatments. Consequently, effect sizes estimated in our analyses may not precisely mirror those observed under standard pharmacological parameters. Finally, owing to the lack of additional GWAS datasets for hyperparathyroidism itself, we could only verify conclusions using independent exposure but not outcome data.

In summary, this cis-eQTL-wide Mendelian randomization and colocalization analysis identified PIK3C3 inhibitors and SLC40A1 antagonists as promising therapeutic targets for hyperparathyroidism. Nevertheless, randomized controlled trials are imperative to conclusively assess the efficacy and safety of these potential drug targets.