Abstract
Mutations in the LRRK2 gene are the most common genetic cause of familial Parkinson’s Disease (LRRK2-PD) and an important risk factor for sporadic PD (sPD). Multiple clinical trials are ongoing to evaluate the benefits associated with the therapeutical reduction of LRRK2 kinase activity. In this study, we described the changes of transcriptomic profiles (whole blood mRNA levels) of LRRK2 protein interactors in sPD and LRRK2-PD cases as compared to healthy controls with the aim of comparing the two PD conditions. We went on to model the protein-protein interaction (PPI) network centred on LRRK2, which was weighted to reflect the transcriptomic changes on expression and co-expression levels of LRRK2 protein interactors. Our results showed that LRRK2 interactors present both similar and distinct alterations in expression levels and co-expression behaviours in the sPD and LRRK2-PD cases; suggesting that, albeit being classified as the same disease based on clinical features, LRRK2-PD and sPD display significant differences from a molecular perspective. Interestingly, the similar changes across the two PD conditions result in decreased connectivity within a topological cluster of the LRRK2 PPI network associated with protein metabolism/biosynthesis and ribosomal metabolism suggesting protein homoeostasis and ribosomal dynamics might be affected in both sporadic and familial PD in comparison with controls.
Similar content being viewed by others
Introduction
Leucine-rich repeat kinase 2 is a large (>250 kDa), multifunctional enzyme encoded by the LRRK2 gene, possessing 2 enzymatic (GTPase and kinase) and 4 scaffold (armadillo, ankyrin, LRR and WD40 motifs) domains1. LRRK2 is able to interact with a large number of protein partners (Zhao et al. 2), and is involved in a range of biological processes including vesicular transport, autophagy, regulation of cellular response to stress, regulation of cell cycle, etc3,4,5,6. Mutations in the LRRK2 gene are an important genetic cause of familial PD (fPD), with 1 to 40% of fPD cases associated with coding variants in LRRK2, depending on the population under study7,8,9,10. Since 2004, when the first variants in the LRRK2 gene were associated with fPD, numerous coding and non-coding variants of LRRK2 have been identified in PD families. These include the G2019S and R1441C/G mutations, which are the 2 most common pathogenic variants occurring on the kinase and GTPase domains of the LRRK2 protein, leading to an increased kinase activity and decreased GTPase activity respectively11,12,13,14,15. Additionally, polymorphisms, mainly in the promoter of LRRK2 and proposed to modulate expression of LRRK2, have been linked to lifetime risk of developing sporadic PD (sPD) (Nalls et al. 16), while upregulated LRRK2 kinase activity (in the absence of pathogenic mutations) has also been related with sPD.
The molecular mechanism(s) underlying the contribution of LRRK2 to both fPD and sPD are as yet unclear, despite the extensive efforts made to investigate LRRK2 function in health and disease. For example, PD-related inflammation both in the Central Nervous System (CNS) and at the periphery, was linked to an increased LRRK2 expression level and strengthened LRRK2 kinase activity in microglia and peripheral immune cells in sPD patients as compared to controls (Cook et al. 17; Di Maio et al. 18). These data indicate that LRRK2 is crucial for the understanding of PD etiopathogenesis, and that LRRK2 might constitute a link between familial and sporadic forms of the disease. There is extensive evidence suggesting that LRRK2 is a crucial regulator of the crosstalk between the CNS and the periphery, possibly via modulation of the immune system19,20. LRRK2 exhibits high expression levels in monocytes, and its protein levels are increased in peripheral immune cells in PD patients as compared to controls21,22. Some evidence suggests that the presence of the LRRK2-G2019S and the LRRK2-R1441G mutations in peripheral immune cells alone is sufficient for inflammation-induced dopaminergic neuronal loss in genotypically normal mouse brain, suggesting that the dysfunction in the peripheral immune system may be pivotal for the disease, at least in the fPD condition23. Our previous study found that a large number of LRRK2 protein interactors are characterised by a significantly higher expression in whole blood, as compared to their levels in the brain regions and other peripheral tissues such as liver, lung and kidney2, again suggesting the relevance of LRRK2 within the peripheral immune system.
A second observation is that, from a clinical perspective, LRRK2-PD and sPD have been reported to present with distinct features. Despite having similar motor-symptoms (such as bradykinesia, tremor, rigidity, and postural instability) as well as sharing some of the non-motor symptoms24,25,26, patients with LRRK2-PD display slower decline considering both movement and cognitive impairment27,28. In addition, LRRK2-PD and sPD show slightly different pathological features. For example, LRRK2-PD patients exhibit less α-synuclein aggregation in Cerebrospinal Fluid (CSF), feature that is a hallmark of sPD29,30 as well as increased basal forebrain volume, which is probably a compensation of the cholinergic system31. Such differences might highlight an intrinsic variation at the molecular level between these 2 forms of PD thus suggesting different model systems might be required to investigate them. Also, this consideration may pose a problem in translational research, for example the use of LRRK2 inhibitors in clinical trials32 might require patient stratification.
Based on the above observations, in this study, we hypothesised that despite important overlaps in their clinical presentation, sPD and LRRK2-PD might have a different molecular signature and therefore the molecular alterations contributing to disease onset and progression might be functionally different. We decided to focus on whole blood transcriptomics, based upon the relevance of LRRK2 as potential regulator of the immune activity. We constructed the protein-protein interaction (PPI) network around LRRK2 (LRRK2net) and evaluated the whole-blood mRNA expression changes within the LRRK2net in a cohort of sPD and LRRK2-PD patients in comparison with healthy controls. The results provide a bioinformatic demonstration that the signature of expression changes in the LRRK2net shows both similarities and differences in sPD vs LRRK2-PD. Of particular interest are the statistically significant differences in both gene expression and co-expression that suggest some of the molecular pathways at the base of these two conditions might be different. Our finding is relevant for the understanding of the different molecular mechanisms of PD, and it highlights the necessity for patient stratification in both discovery research and clinical trials, suggesting different therapeutic approaches might be needed if we intend to move from symptomatic to effective disease treatment.
Results
Whole blood transcriptomic profiling of the LRRK2 interactome
A total of 418 protein interactors of LRRK2 (LRRK2int) were retrieved via an in-house pipeline developed in our previous study2 (Table S1). Tissue-specific expression scores of 378 LRRK2 interactors in the whole blood were extracted from the same study. Among the 418 LRRK2 interactors, 140 (37.0%) presented significantly higher expression scores in blood when compared to other peripheral tissues (liver, lung and kidney) and brain regions (Figure S1). Functional enrichment analysis performed on this selection of LRRK2 interactors returned 234 GO-BP terms (Table S2). After semantic grouping of GO:BP terms, text cloud analysis of the enrichment results showed that terms in the groups of “response to stimulus”, “immune response” and “apoptosis” (N = 100/234, accounting for 42.7% of all enriched GO terms) contained keywords highly associated with immune functions, such as “cytokine”, “leucocyte”, “lymphocyte”, etc (Figure S1). Overall, these results suggested that a substantial proportion of LRRK2 protein interactors might be involved in the regulation of the immune functions at the periphery.
Whole blood transcriptomic profiling of the LRRK2 interactome in the PPMI cohort
Whole blood RNA-Seq read counts were retrieved from the PPMI dataset for 415 (out of 418) LRRK2 interactors and for 657 subjects with validated genotyping data (controls = 170; sPD cases = 371; and LRRK2-PD cases = 116). A total of 38 interactors were removed due to low read counts. No subjects were identified as outliers by PCA (Figure S2). Hence, mRNA levels of the remaining 377 LRRK2 interactors of 657 PPMI subjects formed the PPMI_Matrix. Demographic features of the included cohorts are listed in Table 1 with no significant difference in sex and age nor in motor symptom severity when the sPD and LRRK2-PD cohorts were compared. Of note, 95.5% of the 3 cohorts have white ancestry. Among the LRRK2-PD cases, 100 (86.2%) were LRRK2-G2019S carriers while 16 (13.8%) were LRRK2-R1441C/G carriers. Therefore, and to avoid bias induced by different LRRK2 variants, LRRK2-R1441C/G carriers were removed, leaving a total of 641 PPMI subjects for further analysis.
DEA results (Table S3) showed that: the mRNA levels of 67 interactors (17.7%) were significantly altered in the LRRK2-PD cases vs. controls, with 39 down-regulated and 28 up-regulated interactors (|log2(FC) > 0.05|, adjusted-p < 0.05, Fig. 1A, Table S3). Functional enrichment analysis related the up-regulated interactors mainly to cytoskeletal dynamics/transport, while the down-regulated interactors were mainly related to biosynthetic processes and ribosome biogenesis (Fig. 1B, C, Table S4).
As for the sPD cases, a total of 55 interactors (14.6%) presented significant changes in expression levels in comparison with controls, including 28 down-regulated and 27 up-regulated interactors (|log2(FC) > 0.05|, adjusted-p < 0.05, Fig. 2A, Table S3). Functional enrichment analysis showed that the up-regulated interactors were associated with GO-BP terms related to protein metabolic processes and signalling, while the down-regulated interactors, similar to the LRRK2-PD condition, were predominantly related to biosynthetic processes and ribosome biogenesis (Fig. 2B, C, Table S4).
Of note, only a total of 13 interactors exhibited the same alteration in LRRK2-PD and sPD (Fig. 3A), in which 9 interactors were down-regulated while 4 interactors were up-regulated, suggesting these LRRK2 interactors were consistently affected during PD progression regardless of the existence of the LRRK2-G2019S mutation. Interestingly, 9 out of these 13 LRRK2 interactors were found in the interactome of other PD genes SNCA, PRKN, PINK1, PARK7, VPS35, FBXO7; Supplementary Data S1.) Functional enrichment analysis associated these 13 proteins with protein metabolism/biosynthesis and ribosomal metabolism (Fig. 3B).
Transcriptomic profiles of the LRRK2 interactors differentiate LRRK2-PD and sPD cases
Univariate logistic regression was performed on each of the 109 LRRK2 interactors with significant differential expression in the sPD and/or LRRK2-PD cohorts vs. controls, out of which 11 interactors with p-value < 0.05 were selected for further model construction, including TUBB6, SNCA, HSPA1A, BAG3, ACTA2, TUBG1, LMNB1, CDK2, RAB5B, LRRK2 and SLC25A6 (Table S5). A LASSO regression model constructed with these 11 interactors was then trained on a randomly picked cohort of 296 sPD cases (coded as 1) and 80 LRRK2-PD cases (coded as 0). A λ value of 0.006 (log(λ) = −5.062 was chosen to reach the minimum MSE = 0.316 (Fig. 4A), leaving a total of 9 interactors in the model, including CDK2 (beta = −0.069), RAB5B (beta = −0.967), ACTA2 (beta = −0.910), TUBB6 (beta = −1.051), LRRK2 (beta = 0.224), HSPA1A (beta = 3.103), LMNB1 (beta = 0.611), SNCA (beta = −1.027), SLC25A6 (beta = −1.328) (Fig. 4B). The cut-off on the predicted value was optimised as 0.54 to reach the maximum accuracy in the training set of 80.3%, with True Positive (TP) of 82.8% and True Negative (TN) of 57.9% (Fig. 4C). The refined model was then validated on the test set, containing 75 sPD cases and 20 LRRK2-PD cases. ROC curve showed an AUC = 0.735 (95% CI: 0.617–0.853), suggesting a good classification performance of the LASSO model built upon the mRNA levels of LRRK2 interactors (Fig. 4D).
Co-expression modules of LRRK2 interactors in the sPD and LRRK2-PD conditions
A signed gene co-expression network was constructed across the 3 examined cohorts via WGCNA with the soft power β = 12 (Fig. S3), within which a total of 3 co-expression modules were identified: MTurquoise (N = 92 interactors), MBlue (N = 72 interactors) and MBrown (N = 42 interactors) (Fig. 5A, Table S6). Of note, LRRK2 was found in none of these 3 modules, suggesting that the overall co-expression level between LRRK2 and its interactors was relatively low in the whole blood, which is in accordance with the findings in our previous study2. Functional enrichment analysis associated the MBlue with metabolism (N = 16/82 GO-BPs, 19.5%), protein modification (N = 13/82 GO-BPs, 15.8%), biosynthesis (N = 13/82 GO-BPs, 15.8%), ribosomal function (N = 11/82 GO-BPs, 13.4%) and apoptosis (N = 8/82 GO-BPs, 9.7%) (Fig. 5B, Table S7); MBrown was enriched for transport (N = 7/15 GO-BPs, 46.7%), intracellular organisation (N = 3/15 GO-BPs, 20.0%), cytoskeleton organisation (N = 3/15 GO-BPs, 20.0%) and cell cycle (N = 2/15 GO-BPs, 13.3%) (Fig. 5C, Table S7); while MTurquoise was associated with localisation (N = 12/62 GO-BPs, 19.4%), metabolism (11/62 GO-BPs, 17.7%), protein modification (11/62 GO-BPs, 17.7%) and transport (9/62 GO-BPs, 14.5%) (Fig. 5D, Table S7). Module-Trait correlation analysis showed that the eigengene of MBlue (MEblue) was significantly down-regulated in the LRRK2-PD and sPD cases as compared to the control cohort (p < 0.05), while MEbrown was significantly down-regulated in the LRRK2-PD cases only. No significant changes were identified for the MEturquoise in the 2 PD conditions vs. controls (Fig. 5E). Interestingly, 58 out of 76 LRRK2 interactors within the blue module were found in the interactome of other PD genes (SNCA, PRKN, PINK1, GBA1, PARK7, VPS35, ATP13A2 FBXO7, SLC6A3, DNAJC6, DCTN1 and SYNJ1; Supplementary Data S1).
Construction of the LRRK2net
A total of 4860 connections (interactions) across the LRRK2 interactors were extracted from the HIPPIE database (v2.3), among which 1466 (30.2%) were scored as “high confidence” (HIPPIE confidence score ≥ 0.72), out of which 121 self-interactions were removed from the list, thereby leaving 1345 “2nd-layer” PPIs for 338 LRRK2 interactors for LRRK2net construction (Fig. 6A, Table S8). Degree (i.e., the number of PPIs connected to a given interactor) distribution analysis showed that 216 interactors (51.7%) had degrees ≤ 4; 141 interactors (33.7%) presented degrees between 5 and 14; 41 interactors (9.8%) presented degrees between 15 and 24; 24 interactors showed degree ≥ 24, suggesting that the LRRK2net follows the Power Law distribution (log–log plot R-square = 0.8606) (Fig. 6B, C, Table S9). Interactors with degree ≥ 24 (the top 5% of all) were defined as “sub seed” proteins in the LRRK2net, with TP53 (degree = 68), CDK2 (degree = 48), HSPA8 (degree = 46), HSP90AB1 (degree = 44), HSP90AA1 (degree = 43), YWHAZ (degree = 43), LAPR7 (degree = 39), NPM1 (degree = 37), TRAF2 (degree = 32), IQGAP1 (degree = 32), LIMA1 (degree =31), CAPZA2 (degree = 31), PRKN (degree = 28), DBN1 (degree = 28), YWHAQ (degree = 27), RPS8 (degree = 27), YWHAG (degree = 26), TRADD (degree = 26), RPS3 (degree = 26), AKT1 (degree = 25), YWHAB (degree = 24), HSPA1A (degree = 24), RPS3A (degree = 24) presenting the highest degree, suggesting that these proteins may play an essential role in maintaining the local connectivity of the LRRK2net.
Weighted network analysis on the LRRK2net
A total of 14 topological clusters were identified in the trimmed-LRRK2net using the Fast Greedy algorithm based on the measure of edge betweenness (Fig. 7A, Table S9). Of note, 3 clusters containing less than 5 interactors each were removed (considering a cut-off threshold on cluster connectivity ≥5 proteins), leaving a total of 11 clusters for further analysis. For each of the 11 topological clusters, edges were classified as up/down-regulated or unchanged bases on the differential expression and co-expression levels of LRRK2 interactors in the sPD and LRRK2-PD conditions as compared to the controls. The distribution of the edges across these 3 categories was compared via One Sample Proportion Test to identified clusters significantly altered in expression in sPD or LRRK2-PD in comparison with controls (Fig. 7B, C).
Among the 11 clusters, Cluster A was significantly altered (down-regulated) in both sPD and LRRK2-PD cases vs. controls (p < 0.05), with 72/115 (62.6%) and 57/115 (49.5%) edges down-regulated, respectively (Fig. 7B, C, Fig. 8A). Of note, out of the 14 down-regulated interactors, 12 of them were ribosomal proteins. Functional enrichment analysis associated Cluster A with gene translation and ribosomal functions, suggesting that the sPD and LRRK2-PD pathologies potentially contribute to perturbed ribosomal homoeostasis and translation process by down-regulating this cluster of LRRK2 interactors (Fig. 8B, Table S10).
Discussion
Multifactorial neurodegenerative disorders such as PD present with a complicated aetiopathogenesis, triggered by multiple causative events (or risk factors) from the environment and from the genome. In PD, for example, the majority of the patients have a sporadic form of the disease, with no large effect size genetic variants contributing to etiology; these cases are considered to be due to a complex interplay of small effect size genetic risk factors in combination with a triggering environmental exposure. The often-transient nature of environmental exposures are difficult to study, however PD has been linked to long-term exposure to air pollution and chemicals33,34,35,36, while old age remains the major risk factor for PD37,38. In contrast, a minority of patients present with a familial pattern of disease, with at least one mutation with effect size large enough to drive neurodegeneration. The sporadic and the genetic forms of PD are, therefore, by definition triggered by different combinations of risk factors. This poses the question as to whether, despite the similar clinical presentation and the classification under the same disease name, sporadic and genetic forms might represent a more nuanced spectrum of disorders. This nosological question holds the key to a very practical issue: sporadic disorders are difficult to be modelled in vitro, thus the scientific community frequently relies on genetic models based on the familial forms of the same disease to simulate the disease scenario in vitro and in vivo. These experimental models might not be accurate if we are indeed studying a spectrum disorder where the same clinical manifestation may be triggered by different molecular scenarios. Similarly, a therapeutic approach targeted to the molecular core of the neurodegeneration developed for the genetic forms of the disease might not be fully effective on the sporadic disease, thus requiring cohort-specific interventions. In this study, we applied a systems biology approach to generate a model and investigate the potential molecular differences between sPD and LRRK2-PD, focusing on the transcriptomic expression profile of the LRRK2 protein interactome. We considered that the LRRK2 functionality is orchestrated by the protein interactions that interlink LRRK2 with the cell proteome. It has previously been reported that LRRK2 interaction behaviour is affected by the presence of mutations39; we therefore speculated that the presence of PD causing mutations in LRRK2 (LRRK2-PD) would modify the LRRK2 connectivity and in turn trigger expression changes within the LRRK2 interactome. These changes might be specific for the LRRK2-PD scenario since no LRRK2 mutations are present in sPD. However, it is also possible that expression changes of the LRRK2 interactome happen just as a consequence of PD, in a feedback response to the molecular alterations induced by the disease; in this case, these alterations should be evident in both presence (LRRK2-PD) and absence (sPD) of LRRK2 mutations.
There is increasing evidence that the immune system and immune-related functions are deeply linked to the pathogenesis of PD40. Indeed, we demonstrate that a large portion of the LRRK2 interactome (37%) is enriched for immune related functions and highly expressed in whole blood in comparison with other peripheral tissues and the CNS. We therefore evaluated expression changes of the LRRK2int (cases vs controls) in whole blood mRNA and found that 28.9% (109/377) of the LRRK2 interactors presented significant changes, among which only 13 showed a similar trend of alteration (4 up-regulated and 9 down-regulated) in both the sPD and LRRK2-PD cases. Among these 109 interactors, 9 were selected by the LASSO regression model differentiating the LRRK2-PD and sPD cohorts including: CDK2, RAB5B, ACTA2, TUBB6, LRRK2, HSPA1A, LMNB1, SNCA, and SLC25A6. This model suggested that, globally, sPD and LRRK2-PD might be differentiated by looking at the transcriptomics profiles of the LRRK2 interactors in whole blood. These findings supported our hypothesis that LRRK2-PD and sPD might be triggered by different molecular alterations and thereby need to be treated as different conditions for biomarker discovery and drug development.
When we carried out a functional analysis of the 109 significantly-altered LRRK2 interactors, we found that proteins up-regulated in the LRRK2-PD condition were mainly related to cytoskeletal dynamics and transport, while those up-regulated in the sPD condition were associated with signalling and protein metabolic processes, again suggesting divergent functional profiles for the LRRK2int in the PD scenario depending on the presence/absence of the LRRK2G2019S mutation. However, the downregulated proteins in both cohorts were associated with metabolic processes and ribosomal assembly suggesting these functions to be consistently altered in both PD scenarios regardless of the presence/absence of the LRRK2-G2019S mutation. Interestingly, the 13 protein whose expression profile was altered in both LRRK2-PD and sPD were similarly related to ribosomal activity and protein biosynthesis, suggesting that there are commonalities at the molecular level between LRRK2-PD and sPD.
The results obtained via DEA were further corroborated by the co-expression analysis. We analysed the co-expression behaviour of the LRRK2 interactome using the classical WGCNA pipeline to identify modules of LRRK2 interactors that are co-express across the sPD, LRRK2-PD and control cohorts. A total of 3 co-expression modules were consistently identified in the 3 conditions, and they might indicate functional units of LRRK2 interactors that participate in communal processes. Module-Trait analyses found that one of the 3 modules (MBlue) was down-regulated in both sPD and LRRK2-PD cases as compared to controls, one (MBrown) was down-regulated only in the LRRK2-PD cohort while the other remained unchanged in both PD cohorts vs. controls. Interestingly, MBlue, altered in both LRRK2-PD and sPD, contained 31 ribosomal proteins (RPs).
All these findings suggested the existence of molecular alterations that are specific to the LRRK2 and sporadic PD conditions, however molecular and functional similarities can also be found. For example, in this in silico investigation, we suggest altered ribosomal functionality and protein biosynthetic processes to be an hallmark of PD, regardless the presence of pathogenic LRRK2 mutations. In additions, we confirmed that the LRRK2 interactors that are similarly altered in LRRK2-PD and sPD considering DEA and WGCNA are largely represented within the interactome of other PD genes. This observation might suggest their importance to the molecular pathogenesis of PD, regardless the absence or presence of familial mutations.
Similar alterations in protein synthesis/ribosomal functions were indeed observed in previous studies in the blood and substantia nigra tissues of PD patients as well as related animal models41,42,43,44.
We finally proceeded to identify topological clusters within the LRRK2 interactome, based on the protein connections across LRRK2 interactors. Topological clusters might indicate functional local communities within a larger network based on how proteins relate/connect with each other. The topological clustering algorithm identified 11 clusters in the LRRK2net, these are portions of the network that are more connected within each other than the average connection of the entire network. Among these 11 clusters, cluster A presented lower connectivity in both LRRK2-PD and sPD vs controls and this cluster was functionally related to ribosomal functions. The majority of the RP were, as expected, contained within cluster A; this cluster was significantly downregulated, again potentially suggesting that the functionality of RPs and ribosomal/protein biosynthetic processes are universally reduced during PD.
There are a number of limitations to this study: (1) the sample size of the cohorts are relatively small, especially for the LRRK2-PD cohort and larger sample size would improve statistical power and thereby provide more robust results; (2) PD cases recruited by PPMI were at the early stages of the disease and the whole blood mRNA sequencing was run at the first visit; therefore, the alterations of some LRRK2 interactors could be too subtle to be detected by DEA or WGCNA; 3) expression changes (used as proxy for protein levels) in the LRRK2 interactome are dynamic and affected by the local environment (such as absence/presence of inflammation) while (due to the data available) in our analyses protein interactions have been considered as static.
In conclusion, our study suggests that although sPD and LRRK2-PD share defining aspects of neuropathology and clinical characteristics, the molecular pathways underlying the etiology and pathogenesis of the two conditions have important distinct features. There are shared changes of the LRRK2 interactome that can be appreciated at the transcriptome level in both the conditions, mainly associated with alterations of RPs and proteins whose function is important for protein biosynthesis. However, there are also substantial differences between the two conditions suggested by their unique transcriptomics signatures. This conclusion cautions against considering LRRK2-PD and sPD as identical conditions, highlights the need to for specific experimental models to be generated to differentially study sporadic and LRRK2 PD, and confirms the requirement for patient stratification in clinical trials.
Methods
LRRK2 protein interactors download and quality control (QC)
LRRK2 protein interactors were downloaded via PINOT v1.1 (http://www.reading.ac.uk/bioinf/PINOT/PINOT_form.html), HIPPIE v2.3 (http://cbdm-01.zdv.uni-mainz.de/~mschaefer/hippie/index.php) and MIST v5.0 (https://fgrtools.hms.harvard.edu/MIST/)45,46,47 on 16th March 2023 and the LRRK2 interactome was built following the pipeline in ref. 2. In summary: to access the most comprehensive set of LRRK2 interactors, “Lenient” filter level was applied in PINOT; while no filter was applied for HIPPIE and MIST to download the entire set of raw interaction to be filtered in a second step. Interactors retrieved from the 3 tools were merged and QC-ed to identify interactors with missing publication identifier, missing interaction detection method, no conversion to a standard gene identifier, and with low interaction confidence score.
LRRK2 protein interactome (LRRK2int) in whole blood
In our previous study2, we compared the mRNA levels of LRRK2 interactors pair-wise across 11 brain regions and 4 peripheral tissues in healthy individuals derived from the GTEx database (https://www.gtexportal.org/). Tissues were scored based on the pair-wise comparison results for each LRRK2 interactor. Briefly, the higher the score, the higher a certain interactor is expressed in a certain tissue. A specifically high expression level was defined as tissue score ≥ 12, meaning that a given interactor exhibited significantly higher mRNA levels in tissue X as compared to other 12 tissues. For this current study, scores of LRRK2 interactors in the whole blood were extracted from2 and the interactors with tissue scores ≥ 12 were analysed via functional enrichment analysis.
Whole blood RNA-Seq data download and QC
Baseline (BL = time at diagnosis) whole blood mRNA data (read counts) of LRRK2 interactors for healthy controls (HC), sPD patients and LRRK2-PD patients were retrieved using Ensembl gene ID from the Parkinson’s Progression Marker Initiative (PPMI) dataset on 24th March 2023. PPMI is an ongoing observational, international, multicentred cohort study aimed at identifying the biomarkers of PD progression in a large cohort of participants (https://www.ppmi-info.org/). The current PPMI Clinical study protocol (#002) is WCG approved (IRB Tracking #20200597). The previous PPMI Clinical protocol (#001), was IRB reviewed by the University of Rochester Research Subjects Review Board. Informed consent was obtained from all human participants. PPMI is a public-private partnership—is funded by The Michael J. Fox Foundation for Parkinson’s Research and funding partners, including those reported at https://www.ppmi-info.org/about-ppmi/who-we-are/studysponsors. Study protocol and manuals are available online (http://www.ppmi-info.org/study-design). PPMI enroled patients with early, untreated (de novo) Parkinson’s disease as well as healthy controls of similar age and sex as well as genetic cohorts with Parkinson’s disease and non-manifesting carriers of mutations. Pathogenic variants used to define genetic PD cases included 7 PD-related genes, namely LRRK2, GBA1, VPS35, SNCA, PRKN, PARK7 and PINK1.
In this study, we included the “de novo”, “genetic (with LRRK2 mutations)” and “healthy control” cohorts. Subjects from the 3 cohorts were further filtered to keep only those with robust genetic status records using the following criteria: confirmed by at least 3 out of 6 detection techniques (WGS, WES, RNA-Seq, GWAS, CLIA, SANGER) of which 1 should be a next generation sequencing technique (WGS, WES, RNA-Seq) and 1 should be a screening technique (GWAS, CLIA, SANGER). For the healthy control cohort, subjects with pathogenic variants in the above-mentioned PD-related genes were excluded. PD patients with no pathogenic variants were defined as the sporadic PD (sPD) cohort, while those with pathogenic variants in the LRRK2 gene only were defined as the LRRK2-PD cohort. Principal Component Analysis (PCA) was performed on mRNA read counts to remove potential outliers. Metadata of QC-ed subjects at BL were derived from the PPMI database, including gender, age at screen, motor symptom severity (as evaluated by the MDS-Unified Parkinson’s Disease Rating Scale IIII (MDS-UPDRS III) and LRRK2 mutation type (for the LRRK2-PD cases only). MDS-UPDRS III scores of the sPD and LRRK2-PD cohort were compared via t-test. Transcripts of LRRK2 interactors with read counts ≤ 15 in more than 75% QC-ed subjects were removed48. Read counts of LRRK2 interactors retrieved from PPMI were extracted for the 3 cohorts, thereby forming the “PPMI_Matrix”.
Differential Expression Analysis (DEA) and classification models for sPD and LRRK2-PD
The PPMI_Matrix was then normalised via the median of ratios method using the “count” function in the R package “DESeq2”49. The normalised PPMI_Matrix (hereby referred as “norm_PPMI_Matrix”) was utilised to perform DEA to compare the expression levels of LRRK2 interactors in the control, sPD and LRRK2-PD cohorts using “DESeq2” and calculating fold change (FC) for each of the LRRK2 interactors (i) in [sPD vs control] and [LRRK2-PD vs control]. P-value adjustment for multiple comparisons were performed via Bonferroni’s method. Of note, results from DEA were adjusted for sex. LRRK2 interactors were considered significantly altered when |log2FC| > 0.05 and adjusted-p < 0.05 in [sPD vs control] or [LRRK2-PD vs control]50. Up/down-regulated LRRK2 interactors in the 2 PD conditions were functionally annotated via Gene Ontology Biological Process (GO-BP) enrichment analysis. The read counts of LRRK2 interactors with significant alterations in the 2 PD conditions as compared to controls were utilised to construct a machine learning model via Least Absolute Shrinkage and Selection Operator (LASSO) algorithm using the R package “glmnet”. Of note, in order to reduce the risk for model overfitting, univariate logistic regression was performed on each LRRK2 interactor prior to model training and only those with p-value < 0.05 were included in the LASSO regression model. The train-test split ratio for the LASSO regression model was set as 4:1. The tunning parameter lambda (λ) were optimised by a 10-fold cross-validation (CV) to reach the minimum Mean-Squared Error (MSE) via the “cv.glmnet” function of the “glmnet” package. The refined models were then assessed on the test set. Receiver Operating Characteristic (ROC) curves were generated via the “roc.glmnet” function of the “glmnet” package.
Weighted Gene Co-expression Network Analysis (WGCNA)
Signed Weighted Gene Co-expression Network Analyses (WGCNA) were performed on the norm_PPMI_Matrix via the R package “WGCNA” to identify co-expression modules within the LRRK2net across the sPD, LRRK2-PD and control conditions. Module-Trait correlation was evaluated via the “corPvalueStudent” function in the “WGCNA” package.
LRRK2 PPI network (LRRK2net) construction and weighted network analysis
To construct the LRRK2net, the 2nd-layer PPIs (i.e., PPIs among LRRK2 interactors) were downloaded via HIPPIE (v2.3) on 16th March 2023. The 2nd-layer PPIs with high confidence score (≥ 0.72) were kept for network construction (the LRRK2net). The Fast Greedy Clustering algorithm51 was utilised to detect topological clusters in the LRRK2net based on edge betweenness (i.e., calculating the number of shortest paths between any pair of nodes in the network that pass-through a given edge), via the “cluastermaker2” Cytoscape add-in (v2.3.4). For each obtained topological cluster, edges were classified as up/down-regulated or unchanged based on the following criteria: A) up-regulated edge: i) at least 1 of the 2 nodes connected by the edge had increased expression level in sPD and/or LRRK2-PD vs controls or ii) a strong positive co-expression (Pearson’s coefficient > 0.6) was observed for the 2 nodes connected via the edge in sPD and/or LRRK2-PD but not in controls; B) downregulated edge: i) at least 1 of the 2 nodes connected by the edge presented decreased expression level in sPD and/or LRRK2-PD vs controls or ii) a strong positive co-expression (Pearson’s coefficient > 0.6) for the 2 nodes connected via the edge was observed in the controls but not in sPD and/or LRRK2-PD cases. The percentage of upregulated, downregulated and unchanged edges for each single topological cluster were calculated for the sPD and the LRRK2-PD scenarios and compared via One Sample Proportion Test to identify the trend of each topological cluster and qualitatively define whether a cluster was mainly up/down regulated or unchanged in sPD or LRRK2-PD vs controls.
Functional enrichment analysis
In this study, functional enrichment analyses for LRRK2 interactors were performed via the webtool “g:Profiler” (https://biit.cs.ut.ee/gprofiler/gost)52. The parameters were set as follows: organism—Homo sapiens (Human); data source—GO biological process (GO-BPs) only; statistical domain scope—annotated genes only; statistical method—Fisher’s one-tailed test; significance threshold—Bonferroni correction (threshold = 0.05). No hierarchical filtering was included. To increase the sensitivity of analysis, a cut-off of ≤ 2500 was set for the “term size” of enriched GO terms. For larger GO term lists, GO-BPs were grouped based on semantic similarity and text cloud analysis was performed to extract keywords from term names via the R package “wordcloud” (https://CRAN.R-project.org/package=wordcloud).
Data availability
Data used in the preparation of this article were obtained from the Parkinson’s Progression Markers Initiative (PPMI) database (www.ppmi-info.org/access-data-specimens/download-data), RRID:SCR_006431. For up-to-date information on the study, visit www.ppmi-info.org.
Code availability
The pipelines used in this work are based on publicly available R packages as detailed in the methods section.
References
Berwick, D. C., Heaton, G. R., Azeggagh, S. & Harvey, K. LRRK2 Biology from structure to dysfunction: research progresses, but the themes remain the same. Mol. Neurodegener. 14, 1–22 (2019).
Zhao, Y. et al. Tissue specific LRRK2 interactomes reveal a distinct striatal functional unit. PLoS Comput. Biol. 19, 1–23 (2023).
Sanders, L., Goodson, S. D., Toste, C. M. & Gonzalez-Hunt, C. P. ATM mediated DNA double-strand breaks accumulate in LRRK2 G2019S Parkinson’s disease. FASEB J. 35 (2021).
Komori, T. et al. Phosphorylation of Rab29 at Ser185 regulates its localization and role in the lysosomal stress response in concert with LRRK2. J. Cell Sci. 136 (2023).
Alexander Boecker, C., Goldsmith, J., Dou, D., Cajka, G. G. & Holzbaur, E. L. Increased LRRK2 kinase activity alters neuronal autophagy by disrupting the axonal transport of autophagosomes graphical abstract. Curr. Biol. 31, 2140–2154.e6 (2021).
Tombesi, G. et al. LRRK2 regulates synaptic function through BDNF signaling and actin cytoskeleton. Elife 13 (2024).
Bras, J. M. et al. G2019S dardarin substitution is a common cause of Parkinson’s disease in a Portuguese cohort. Mov. Disord. 20, 1653–1655 (2005).
Lesage, S. et al. LRRK2 G2019S as a cause of Parkinson’s disease in North African Arabs. N. Engl. J. Med. 354, 422–423 (2006).
Ozelius, L. J. et al. LRRK2 G2019S as a cause of Parkinson’s disease in Ashkenazi Jews. N. Engl. J. Med. 354, 424–425 (2006).
Greggio, E. et al. The Parkinson disease-associated leucine-rich repeat kinase 2 (LRRK2) is a dimer that undergoes intramolecular autophosphorylation. J. Biol. Chem. 283, 16906–16914 (2008).
Dauer, W. & Ho, C. C. Y. The biology and pathology of the familial Parkinson’s disease protein LRRK2. Mov. Disord. 25, S40–S43 (2010).
Esteves, A. R., Swerdlow, R. H. & Cardoso, S. M. LRRK2, a puzzling protein: insights into Parkinson’s disease pathogenesis. Exp. Neurol. 261, 206–216 (2014).
Paisán-Ruiz, C., Lewis, P. A. & Singleton, A. B. LRRK2: cause, risk, and mechanism. J. Parkinsons Dis. 3, 85–103 (2013).
Henry, A. G. et al. Pathogenic LRRK2 mutations, through increased kinase activity, produce enlarged lysosomes with reduced degradative capacity and increase ATP13A2 expression. Hum. Mol. Genet. 24, 6013–6028 (2015).
Giesert, F. et al. The pathogenic LRRK2 R1441C mutation induces specific deficits modeling the prodromal phase of Parkinson’s disease in the mouse. Neurobiol. Dis. 105, 179–193 (2017).
Nalls, M. A. et al. Identification of novel risk loci, causal insights, and heritable risk for Parkinson’s disease: a meta-analysis of genome-wide association studies. Lancet Neurol. 18, 1091–1102 (2019).
Cook, D. A. et al. LRRK2 levels in immune cells are increased in Parkinson’s disease. NPJ Parkinsons Dis. 3, 11 (2017).
Di Maio, R. et al. LRRK2 activation in idiopathic Parkinson’s disease. Sci. Transl. Med. 10, eaar5429 (2018).
Wallings, R. L., Herrick, M. K. & Tansey, M. G. LRRK2 at the interface between peripheral and central immune function in Parkinson’s. Front. Neurosci. 14, 531890 (2020).
Russo, I., Bubacco, L. & Greggio, E. LRRK2 as a target for modulating immune system responses. Neurobiol. Dis. 169, 105724 (2022).
Bliederhaeuser, C. et al. LRRK2 contributes to monocyte dysregulation in Parkinson’s disease. Acta Neuropathol. Commun. 4, 123 (2016).
Cook, D. A. et al. LRRK2 levels in immune cells are increased in Parkinson’s disease. npj Parkinson’s Dis. 3, 1–12 (2017).
Kozina, E. et al. Mutant LRRK2 mediates peripheral and central immune responses leading to neurodegeneration in vivo. Brain 141, 1753–1769 (2018).
Haugarvoll, K. et al. Lrrk2 R1441C parkinsonism is clinically similar to sporadic Parkinson disease. Neurology 70, 1456–1460 (2008).
Healy, D. G., Wood, N. W. & Schapira, A. H. V. Test for LRRK2 mutations in patients with Parkinson’s disease. Pr. Neurol. 8, 381–385 (2008).
Kluss, J. H., Mamais, A. & Cookson, M. R. LRRK2 links genetic and sporadic Parkinson’s disease. Biochem Soc. Trans. 47, 651 (2019).
Srivatsal, S. et al. Cognitive profile of LRRK2-related Parkinson’s disease. Mov. Disord. 30, 728–733 (2015).
Alcalay, R. N. et al. Neuropsychological performance in LRRK2 G2019S carriers with Parkinson’s disease. Parkinsonism Relat. Disord. 21, 106–110 (2015).
Garrido, A., Fairfoul, G., Tolosa, E. S., Martí, M. J. & Green, A. α-synuclein RT-QuIC in cerebrospinal fluid of LRRK2-linked Parkinson’s disease. Ann. Clin. Transl. Neurol. 6, 1024–1032 (2019).
Rivero-Ríos, P., Romo-Lozano, M., Fasiczka, R., Naaldijk, Y. & Hilfiker, S. LRRK2-related Parkinson’s disease due to altered endolysosomal biology with variable lewy body pathology: a hypothesis. Front Neurosci. 14, 556 (2020).
Batzu, L. et al. Increased basal forebrain volumes could prevent cognitive decline in LRRK2 Parkinson’s disease. Neurobiol. Dis. 183, 106182 (2023).
Tolosa, E., Vila, M., Klein, C. & Rascol, O. LRRK2 in Parkinson disease: challenges of clinical trials. Nat. Rev. Neurol. 16, 97–107 (2020).
Breckenridge, C. B., Berry, C., Chang, E. T., Sielken, R. L. & Mandel, J. S. Association between Parkinson’s Disease and Cigarette Smoking, Rural Living, Well-water Consumption, Farming and Pesticide Use: Systematic Review and Meta-analysis. PLoS ONE 11 (2016).
Levesque, S., Surace, M. J., McDonald, J. & Block, M. L. Air pollution & the brain: subchronic diesel exhaust exposure causes neuroinflammation and elevates early markers of neurodegenerative disease. J. Neuroinflammation 8, 105 (2011).
Murata, H., Barnhill, L. M. & Bronstein, J. M. Air pollution and the risk of Parkinson’s disease: a review. Mov. Disord. 37, 894–904 (2022).
Yan, D., Zhang, Y., Liu, L., Shi, N. & Yan, H. Pesticide exposure and risk of Parkinson’s disease: dose-response meta-analysis of observational studies. Regul. Toxicol. Pharm. 96, 57–63 (2018).
White, D. L. et al. Post-traumatic stress disorder is associated with further increased Parkinson’s disease risk in veterans with traumatic brain injury. Ann. Neurol. 88, 33–41 (2020).
Funayama, M., Nishioka, K., Li, Y. & Hattori, N. Molecular genetics of Parkinson’s disease: Contributions and global trends. J. Hum. Genet. 68, 125–130 (2022). 2022 68:3.
Manzoni, C., Denny, P., Lovering, R. C. & Lewis, P. A. Computational analysis of the LRRK2 interactome. PeerJ 2015 (2015).
Tan, E. K. et al. Parkinson disease and the immune system—associations, mechanisms and therapeutics. Nat. Rev. Neurol. 16, 303–318 (2020).
Garcia-Esparcia, P. et al. Altered machinery of protein synthesis is region- and stage-dependent and is associated with α-synuclein oligomers in Parkinson’s disease. Acta Neuropathologica Commun. 3, 1–25 (2015).
Plum, S. et al. Proteomic characterization of synaptosomes from human substantia Nigra indicates altered mitochondrial translation in Parkinson’s disease. Cells 9, 2580 (2020).
Chu, Y. & Kordower, J. H. GDNF signaling in subjects with minimal motor deficits and Parkinson’s disease. Neurobiol. Dis. 153, 105298 (2021).
Qian, Y. et al. Detection of microbial 16S rRNA gene in the blood of patients with Parkinson’s disease. Front. Aging Neurosci. 10, 368915 (2018).
Tomkins, J. E. et al. PINOT: An intuitive resource for integrating protein-protein interactions. Cell Commun. Signal. 18, 92 (2020).
Alanis-Lobato, G., Andrade-Navarro, M. A. & Schaefer, M. H. HIPPIE v2.0: enhancing meaningfulness and reliability of protein-protein interaction networks. Nucleic Acids Res. 45, D408–D414 (2017).
Hu, Y. et al. Molecular Interaction Search Tool (MIST): An integrated resource for mining gene and protein interaction data. Nucleic Acids Res. 46, D567–D574 (2018).
Langfelder, P. & Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinform. 9, 559 (2008).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 1–21 (2014).
Whittle, B. J. et al. Early-stage idiopathic Parkinson’s disease is associated with reduced circular RNA expression. npj Parkinson’s Dis. 10, 1–14 (2024).
Newman, M. E. J. Fast algorithm for detecting community structure in networks. Phys. Rev. E: Stat. Phys. Plasmas Fluids Relat. Interdiscip. Top. 69, 5 (2004).
Peterson, H., Kolberg, L., Raudvere, U., Kuzmin, I. & Vilo, J. gprofiler2—an R package for gene list functional enrichment analysis and namespace conversion toolset g:Profiler. F1000Research 9, 709 (2020).
Balestrino, R. et al. Applications of the European Parkinson’s Disease Association sponsored Parkinson’s Disease Composite Scale (PDCS). npj Parkinson’s Dis. 5, 1–7 (2019). 2019 5:1.
Skorvanek, M. et al. Differences in MDS-UPDRS scores based on Hoehn and Yahr stage and disease duration. Mov. Disord. Clin. Pr. 4, 536–544 (2017).
Acknowledgements
This research was funded by The Michael J. Fox Foundation for Parkinson’s Research (MJFF) (grant number MJFF-023427 to C.M. and K.H.). C.M. and V.E.P. receive funding from MJFF (grant number MJFF-021335). For the purpose of open access, the author has applied a CC BY public copyright license to all Author Accepted Manuscripts arising from this submission. PAL is supported by a Royal Society Industry fellowship in partnership with LifeArc technologies (IF\R2\222002). PAL and CM acknowledge funding from the Biomarkers Across Neurodegenerative Diseases Grant Program 2019, BAND3 (The Michael J. Fox Foundation for Parkinson’s Research, Alzheimer’s Association, Alzheimer’s Research UK, and the Weston Brain Institute [grant number 18063]). The authors are thankful for having been provided access to the PPMI cohort. Funding PPMI: Parkinson’s Progression Markers Initiative (PPMI)—a public-private partnership—is funded by The Michael J. Fox Foundation for Parkinson’s Research and funding partners, including 4D Pharma, Abbvie, AcureX, Allergan, Amathus Therapeutics, Aligning Science Across Parkinson’s, AskBio, Avid Radiopharmaceuticals, BIAL, Biogen, Biohaven, BioLegend, BlueRock Therapeutics, Bristol-Myers Squibb, Calico Labs, Celgene, Cerevel Therapeutics, Coave Therapeutics, DaCapo Brainscience, Denali, Edmond J. Safra Foundation, Eli Lilly, Gain Therapeutics, GE HealthCare, Genentech, GSK, Golub Capital, Handl Therapeutics, Insitro, Janssen Neuroscience, Lundbeck, Merck, Meso Scale Discovery, Mission Therapeutics, Neurocrine Biosciences, Pfizer, Piramal, Prevail Therapeutics, Roche, Sanofi, Servier, Sun Pharma Advanced Research Company, Takeda, Teva, UCB, Vanqua Bio, Verily, Voyager Therapeutics, the Weston Family Foundation and Yumanity Therapeutics.
Author information
Authors and Affiliations
Contributions
Y.Z. performed the analyses and was a major contributor to the writing of the paper; Y.L. contributed to the analyses for the revision of the paper; M.B.S. supervised the machine learning analyses; K.H., P.A.L., and V.E.P. contributed to the interpretation of the data, supervised the study design and revised the writing of the paper; C.M. was responsible for the study design and was a major contributor to the writing of the paper. C.M., K.H., P.A.L., and V.E.P. supported the study via grants as per the acknowledgment section. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhao, Y., Bracher-Smith, M., Li, Y. et al. Transcriptomics and weighted protein network analyses of the LRRK2 protein interactome reveal distinct molecular signatures for sporadic and LRRK2 Parkinson’s Disease. npj Parkinsons Dis. 10, 144 (2024). https://doi.org/10.1038/s41531-024-00761-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41531-024-00761-8