Systems level analysis of sex-dependent gene expression changes in Parkinson’s disease

Tranchevent, Léon-Charles; Halder, Rashi; Glaab, Enrico

doi:10.1038/s41531-023-00446-8

Download PDF

Article
Open access
Published: 21 January 2023

Systems level analysis of sex-dependent gene expression changes in Parkinson’s disease

npj Parkinson's Disease volume 9, Article number: 8 (2023) Cite this article

2907 Accesses
5 Citations
26 Altmetric
Metrics details

Subjects

Abstract

Parkinson’s disease (PD) is a heterogeneous disorder, and among the factors which influence the symptom profile, biological sex has been reported to play a significant role. While males have a higher age-adjusted disease incidence and are more frequently affected by muscle rigidity, females present more often with disabling tremors. The molecular mechanisms involved in these differences are still largely unknown, and an improved understanding of the relevant factors may open new avenues for pharmacological disease modification. To help address this challenge, we conducted a meta-analysis of disease-associated molecular sex differences in brain transcriptomics data from case/control studies. Both sex-specific (alteration in only one sex) and sex-dimorphic changes (changes in both sexes, but with opposite direction) were identified. Using further systems level pathway and network analyses, coordinated sex-related alterations were studied. These analyses revealed significant disease-associated sex differences in mitochondrial pathways and highlight specific regulatory factors whose activity changes can explain downstream network alterations, propagated through gene regulatory cascades. Single-cell expression data analyses confirmed the main pathway-level changes observed in bulk transcriptomics data. Overall, our analyses revealed significant sex disparities in PD-associated transcriptomic changes, resulting in coordinated modulations of molecular processes. Among the regulatory factors involved, NR4A2 has already been reported to harbor rare mutations in familial PD and its pharmacological activation confers neuroprotective effects in toxin-induced models of Parkinsonism. Our observations suggest that NR4A2 may warrant further research as a potential adjuvant therapeutic target to address a subset of pathological molecular features of PD that display sex-associated profiles.

Single cell transcriptome analysis of the THY-Tau22 mouse model of Alzheimer’s disease reveals sex-dependent dysregulations

Article Open access 07 March 2024

The landscape of multiscale transcriptomic networks and key regulators in Parkinson’s disease

Article Open access 20 November 2019

Transcriptomic differences in MSA clinical variants

Article Open access 25 June 2020

Introduction

Parkinson’s disease (PD) has a worldwide prevalence projected at 12 million by 2040 and no disease-modifying treatments are available¹. Current therapies focus on the replacement of dopamine, but can alleviate only some of the motor symptoms, are hampered by severe adverse effects and loss of efficacy over time, and do not address many of the other heterogeneous symptoms^2,3,4,5. There is widespread agreement in the field that PD patients with diverse clinical features and disease course have different therapeutic needs and would benefit from more personalized medical approaches^6,7,8,9,10.

A striking aspect in the heterogeneity of PD are the pronounced and multifaceted sex differences observed in previous epidemiological and clinical studies. Both the incidence and prevalence of PD is approximately 1.5–2 times greater in men than in women^{11,12,13,14,15,16}, but female patients present significantly more often with a phenotype dominated by disabling tremors^3,17 and, irrespective of their body weight, have an almost 3-fold increased risk to develop treatment-related complications (e.g., involuntary muscle movements known as dyskinesias)^18,19. Moreover, while males tend to display a lower striatal dopamine transporter binding^17,20,21,22, females tend to be affected more often by motor and non-motor symptom fluctuations^23,24,25.

Previous studies have suggested that potential neuroprotective functions of certain sex-related hormones may contribute to sex differences in neurologic disorders^26,27,28,29. However, it is unclear how exactly they may influence the molecular hallmarks of PD, and why in spite of generic sex differences in hormone levels, an almost opposite association between biological sex and disease risk is observed in PD as compared to other degenerative disorders, such as Alzheimer’s disease^30,31,32. Life-style and occupation related differences have been proposed as contributing factors to PD sex differences, e.g., exposure to PD-associated toxicants and head trauma are more common among males¹⁶, but these associations are not strong enough to explain the full extent of the reported disparities. While a variety of meta-analyses of PD omics data have previously already been conducted^{33,34,35,36,37}, to the best of our knowledge, similar integrative analyses have not yet been applied to study molecular sex differences.

In order to contribute to a more detailed molecular-level understanding of disease-associated sex differences, we therefore present a comprehensive statistical meta-analysis of PD transcriptomics data from brain tissue samples of post-mortem case–control studies. Both sex-specific changes, i.e., significant molecular alterations occurring either only in females or only in males, and sex-dimorphic changes, i.e., alterations with opposite direction across both analyses are determined (see “Methods” for details). Finally, in order to understand the coordination and regulation of sex-related PD-associated molecular alterations, we determine pathway- and network activity changes with significant disease-related sex differences and identify transcription factors that may play a key role in regulating the observed downstream changes. The results derived from bulk transcriptomics data are then further examined and characterized in cell-type-specific analyses of corresponding single-cell transcriptomics datasets.

Results

Gene-level analysis of PD-associated molecular sex differences

The gene-level statistical meta-analysis, conducted independently for each biological sex on the substantia nigra (SN) tissue samples from twelve transcriptomics datasets (see “Methods”), identified 1146 significantly differentially expressed genes (DEGs) in males and 118 DEGs in females after multiple testing adjustments (out of 11,959 and 11,975 genes, respectively). Overall, the meta-analysis allowed us to identify more significantly differentially expressed genes than when using the individual datasets in isolation. Since males were over-represented among the samples from the available datasets, in line with the previously reported higher relative risk for males of developing PD¹⁶, we decided to further investigate whether the larger number of male-specific DEGs may have resulted, at least partly, from a higher statistical power of the associated analysis. We have therefore repeated the meta-analyses with three equally sized random subsets of male samples as compared to the female samples. Interestingly, we obtained similar differential expression patterns when sub-sampling male samples (see Supplementary Note 1). These results indicate that the lower number of significant DEGs in females may at least partly reflect a different disease manifestation and progression, with a distinct extent of disease-related gene expression variations in females, in line with the findings from prior studies^38,39,40.

Detailed differential expression statistics for the top-ranked DEGs, including the base 2 log. fold changes (LFC), false-discovery rates (FDR) and consistency scores for both males and females, are presented in Table 1. Expression profiles for selected genes in representative datasets are presented in Fig. 1. The DEGs are split into three categories, depending on whether their alterations are male- or female-specific, or whether they display sex-dimorphic alterations with an increase/decrease of expression levels in males and the opposite change in females. In total, we identified 36 female-specific genes (i.e., significantly differentially expressed between PD and controls in females only with FDR < 0.05, and not approaching significance in males, based on comparing male and female π-value rankings⁴¹) 539 male-specific genes (i.e., FDR < 0.05 in males and not approaching significance in females) and 37 candidate sex-dimorphic genes (i.e., FDR < 0.05 in at least one sex, but with opposite signs of the log. fold change, and a minimum absolute cross-study log. fold change of 0.25 to ensure the robustness of the difference). Table 1 shows the top ten genes for each of these categories, and the complete lists of significant sex-specific and candidate sex-dimorphic DEGs are provided in Supplementary Table 5.

Table 1 Top sex-specific and candidate sex-dimorphic genes.

Full size table

**Fig. 1: Expression levels of selected differentially expressed genes in representative datasets.**

The meta-analysis used bulk transcriptomics data, and the relative proportions of cell types may therefore differ between the compared sample groups. Prior studies have shown that differential gene expression patterns derived from PD bulk transcriptomics datasets of cortical tissues can be confounded by cell type composition differences^42,43. To estimate how this could affect our differential analysis, we analyzed the differential expression profiles of 38 selected cell type markers corresponding to 5 distinct brain cell types (more details about the markers can be found in Supplementary Note 1, Supplementary Table 13, and Supplementary Figs. 3–8). We observe that TH, a commonly used marker for dopaminergic neurons (DA), is the only cell-type marker significantly differentially expressed between patients and controls. This matches with prior expectations, as Parkinson’s disease is associated with a progressive loss of dopaminergic neurons. Since none of the other cell-type markers is differentially expressed, this suggests that the differential expression observed for other genes is unlikely to be driven significantly by differences in cell-type proportions (see also the complementary single-cell transcriptomics analyses below). In addition, we checked whether substantia nigra specific eQTLs from the GTEx database⁴⁴ overlap with both the identified sex-associated DEGs and PD-associated GWAS variants, but this was not the case (see Supplementary Note 1).

Overall, many of the genes with significant sex-dependent PD-associated alterations are involved in cellular processes and organelles previously described to display pathological alterations in PD. In particular, they include genes involved in dopamine metabolism (NR4A2), lysosomal genes (CXCR4, SGSH), and mitochondrial genes (NDUFA10, CA2). Moreover, they cover genes previously implicated in PD relevant phenotypes such as dementia (CXCR4) and neurodegeneration (MAPK1) (see Discussion section for details on prior functional implications of these genes in PD and molecular sex differences).

Pathway-level analysis of PD-associated molecular sex differences

To interpret sex-dependent molecular alterations in PD at the level of global shifts in cellular pathway and process activity, the significant DEGs derived from the statistical meta-analyses were further investigated using functional enrichment analyses across multiple pathway databases. For both the male and female analyses, we specifically analyzed pathway associations of the DEGs tagged as either sex-specific or sex-dimorphic. Selected enriched biological processes from the Gene Ontology (GO), Reactome, and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases are presented in Table 2. Supplementary Tables 6 and 7 contain the complete pathway ranking results.

Table 2 Functional terms representative of the results of the functional enrichment analyses.

Full size table

The pathway analysis results for the male-specific DEGs highlighted significantly enriched alterations in particular in mitochondria and energy metabolism related processes, such as proton-transporting two-sector ATPase complex (GO:0016469, FDR = 4.8e−2), oxidative phosphorylation (hsa00190, FDR = 1.4e−2) and Citrate cycle (hsa00020, FDR = 1.7e−2). These results are in line with prior observations of sex-specific differences in mitochondrial function (see refs. ^45,46,47 and the “Discussion” section for the gene-level analyses) and the previous implication of mitochondrial impairment in PD^48,49. Moreover, cellular processes related to synapses and associated signaling reactions were significantly enriched, including the synaptic vesicle cycle pathway in the KEGG database (hsa04721, FDR = 4.3e−3).

Given the lower number of significant female-specific DEGs, the analysis of the female-specific genes did not identify any significant functional enrichment after adjustment of the significance scores for multiple hypothesis testing. However, the top nominally significant functional gene sets are mostly associated with the inflammatory response, and more specifically with chemokine signaling, including processes such as Chemokine signaling pathway (hsa04061, FDR = 7.4e−2) and Chemokine receptors bind chemokines (R-HSA-380108, FDR = 8e−2). No significant overlap was observed between the pathways with female DEGs and male DEGs, suggesting that instead of affecting different genes in the same pathways, sex-specific changes tend to affect diverse pathways. Given a smaller number of female samples in the present meta-analysis, follow-up studies with larger numbers of female biospecimens are warranted to determine whether an enrichment of female-specific genes may be detectable in further cellular processes.

As a representative example illustration for coordinated sex-dependent expression alterations occurring in a cellular process, we show male and female differential expression profiles for the KEGG TNF signaling pathway in Fig. 2, where we observe that downstream targets are often associated with dimorphic differential expression patterns.

**Fig. 2: Overlay of the differential expression statistics on part of the map for the TNF signaling pathway.**

Overall, the pathway analyses revealed a significant enrichment of male-specific PD-associated DEGs in multiple processes previously implicated in PD, highlighting different top-ranked pathways for male and female DEGs. While male-specific DEGs were mainly over-represented in mitochondrial and energy metabolism related pathways, female-specific DEGs only showed nominally significant associations with inflammation and immune response related processes. Further detailed results from the functional enrichment analyses are described in Supplementary Note 1.

Regulatory network analysis

In order to identify key transcription factors (TFs) that control many of the observed downstream sex-dependent expression changes in PD, the sex-specific DEGs were further investigated using target expression levels to estimate TF activity levels (see “Methods”). In total, we identified 18 TFs whose activity was estimated to differ between males and females. We observed that 13 of them displayed fold-changes across the male and female analyses which were consistent with the sex-dependent changes of their downstream targets. The top 10 selected TFs are presented in Table 3, the complete results are shown in Supplementary Table 8. We also reconstructed the regulatory network around these selected transcription factors to highlight specific regulatory mechanisms (see Fig. 3 and Supplementary Fig. 9). Interestingly, several of the transcription factors predicted to control sex-dependent gene regulatory mechanisms are members of the statin family (STAT3, STAT1) or members of the NFκB complex (NFKB1, REL, RELA and RELB; see the discussion in section “Analysis of key transcription factors and molecular sub-networks”).

Table 3 Top ten transcription factors enriched in differentially expressed target genes.

Full size table

**Fig. 3: Visualization of the regulatory sub-networks centered around selected transcription factors.**

Cell-type-specific transcriptomic data analyses

The post-mortem substantia nigra samples considered in this study cover multiple different cell types, including dopaminergic neurons, astrocytes, oligodendrocytes and their progenitors, and microglial cells, among others^50,51,52,53.

To investigate cell-type specific alterations, we therefore analyzed two single-cell transcriptomics datasets derived from post-mortem substantia nigra samples of PD patients and controls, performing dedicated differential expression and functional enrichment analyses for each cell type and on each dataset (see “Methods”). While the significantly smaller sample sizes compared to the bulk transcriptomic data analysis limited the number of detectable significant changes, the pathway analysis results highlighted sex-associated differential activities in mitochondrial processes, apoptosis and cytokine signaling, consistent with the results of the bulk analysis. These changes were particularly pronounced in oligodendrocytes and astrocytes, which may either suggest that these could be the main cell types affected by sex-dependent changes (see details in Supplementary Note 1 and Supplementary Tables 14−16) or reflect a greater power to detect variations in oligodendrocytes, since they represent 46%, and 51% respectively, of all cells in the two scRNA datasets. Moreover, we applied a deconvolution analysis to the bulk RNA-sequencing dataset (NBB), which confirmed the above-mentioned cell proportions. However, the same analysis is not feasible for the microarray datasets considered in this study (see details in Supplementary Note 1).

Discussion

As a first general observation of the meta-analysis, many of the identified sex-specific and candidate sex-dimorphic genes are members of cellular processes previously implicated in PD pathogenesis and progression. In particular, the significant sex-dependent alterations affect dopamine metabolism (NR4A2), mitochondrial (NDUFA10, CA2) and lysosomal processes (CXCR4, SGSH), whose pathological alterations have been confirmed by several prior studies as common molecular hallmarks of PD^{48,49,54,55,56}. However, the corresponding genes have previously not been associated with sex differences in the disease. Generic hormone-dependent sex differences have been reported for NR4A2 expression, but they were only studied in white adipose tissue and not in the context of neurologic disorders⁵⁷. Similarly, for CXCR4, estrogen was described to increase mRNA and protein levels, but this observation was limited to endometrial epithelial cells⁵⁸.

No prior report of sex-specific differences were identified for the other top-ranked significant genes; however, an influence of biological sex on mitochondrial and lysosomal function in general is in line with the results from previous in vitro and in vivo studies across multiple cell types, covering data from healthy adult humans as well as various animal models^{45,46,47,59,60,61,62}. Sex differences in the regulation of the autophagosome–lysosome system in particular have also been proposed to modulate the severity of neurodegenerative disorders, because prior studies suggest that women have a lower basal autophagy⁶¹.

Among the sex-associated DEGs with previously described regulatory functions in PD, the transcription factor NR4A2 (Nuclear Receptor Subfamily 4 Group A Member 2; synonym: NURR1) stands out due to its key regulatory role in the maintenance of dopamine metabolism⁶³ and inflammatory gene expression in glial cells⁶⁴. As a member of the steroid-thyroid hormone-retinoid receptor superfamily, it controls the expression of many essential genes for the development of meso-diencephalic dopaminergic (mdDA) neurons, such as SLC6A3, SLC18A2, TH and DRD2. For all these genes, significant decreased expression values were observed, which are consistent with the upstream decrease in NR4A2. In addition, NR4A2 down-regulation is significantly associated with adult human brain aging and has been demonstrated to increase the expression of the known PD-associated gene alpha-synuclein^65,66. In the transcriptomics meta-analysis presented here, NR4A2 generally displays lower expression in PD patients than in matched controls, but the expression change is significantly more pronounced in males than in females (females: LFC = −0.6, FDR = 1; males: LFC = −1.4, FDR = 1.6e−8).

Expression and abundance alterations of NR4A2 have been associated with PD-like phenotypes and adult aging in both human and animal studies. Specifically, homozygous NR4A2 knockout mice displayed Parkinsonism-like molecular phenotypes in PD-associated brain regions⁶⁷, and heterozygous knockout mice were characterized by reduced locomotor activities⁶⁸ and lower brain dopamine levels⁶⁹.

Interestingly, a recent study showed that a synthetic ligand that activates NR4A2 is neuroprotective in a mouse model of MPTP-Induced Parkinsonism, suppressing loss of dopaminergic neurons in the substantia nigra pars compacta and DA terminals in the striatum⁶⁴. A further study found that activating compounds for NR4A2 prevent neurotoxin (6-OHDA)-induced death in primary DA neurons and rat PC12 cells, and significantly ameliorate behavioral deficits (rotation behavior toward the lesion side) in a 6-OHDA lesioned rat model of PD without detectable dyskinesia-like side effects⁷⁰. NR4A2 heterodimerizes with Retinoid X receptor alpha (RXRα) in midbrain dopaminergic neurons, and synthetic ligands binding to the RXRα binding pocket were reported to confer neuroprotection in C57BL/6 mouse models using different toxins (6-OHDA, MPTP)⁷¹. Overall, these prior findings suggest that both genetic, environmental and sex-associated influences on NR4A2 activity may be involved in modulating the risk and severity of PD. Given the prior evidence for the druggability of NR4A2, the beneficial effects of its activation in PD model systems, and the sex-dependent changes among its downstream target genes, the protein may warrant further investigation as target for pharmacological modulation of a subset of pathological molecular changes in PD that display sex-associated activity profiles.

A further gene of interest with PD-associated regulatory functions among the identified sex-dependent DEGs is CA2 (Carbonic Anhydrase 2). It encodes an enzyme that catalyzes the reversible hydration of carbon dioxide, and is involved in mitochondrial pH regulation. Increased CA2 levels in mitochondria have previously already been associated with neurodegeneration and aging. Specifically, an age-dependent increase of tissue-specific carbonic anhydrases in mitochondria has been observed in mouse brain samples, in particular in the Purkinje cell degeneration (pcd5J) mouse model⁷². The same study also showed that the exposure of C. elegans to CA2 results in a dose-dependent shorter lifespan. Interestingly, many dopaminergic small molecule compounds (i.e., compounds with structure similar to the endogenous neurotransmitter dopamine, which are often used in the treatment of PD) are inhibitors of human carbonic anhydrases, such as CA2⁷³, which may result in compensatory expression alterations in PD. Disambiguation between direct disease-associated effects and treatment effects on CA2 expression will require further studies on drug-naïve patients. Regarding the observed sex differences, in the present meta-analysis, the over-expression of CA2 in PD patients compared to matched controls is significantly stronger in females than in males (females: LFC = 1.1, FDR = 8.2e−4; males: LFC = 0.6, FDR = 0.5). No prior studies reporting sex differences in CA2 levels in the brain could be identified; however, in prostate tissue from rats, a differential regulation of CA2 by the hormones androgen and estrogen has been reported⁷⁴.

Finally, the most significant candidate sex-dimorphic gene identified in the meta-analysis is EFNA1 (Ephrin A1). This gene from the ephrin family binds to multiple ephrin-related receptors to mediate developmental events, in particular as part of nervous system development⁷⁵. Interestingly, EFNA1 was shown to mediate dopaminergic neurogenesis and angiogenesis in a rat model of PD, and its activation proposed as a potential target for the treatment of neurodegenerative diseases⁷⁶. In the meta-analysis, EFNA1 is significantly over-expressed in male PD patients compared to controls, whereas female patients display a slight, non-significant under-expression (females: LFC = −0.3, FDR = 1; males: LFC = 0.8, FDR = 9.3e−5). No previous reports on sex differences in EFNA1 gene expression in the brain were identified.

In summary, the meta-analysis identified genes with statistically significant sex-specific or sex-dimorphic expression changes in PD samples compared to controls, including genes previously implicated in the regulation of dopamine metabolism, mitochondrial or lysosomal functions in the context of PD, and genes associated with general neurodegeneration or aging. An overview of the sex-dependent and PD-associated expression profiles for the three genes of interest discussed above, NR4A2, CA2 and EFNA1, in representative transcriptomics datasets is presented in Fig. 1.

As a main result of the pathway analyses, we observed that DEGs which are either male-specific or sex-dimorphic are over-represented in mitochondrial and energy metabolism related pathways. Considering our results together with previous observations for other diseases involving mitochondrial dysfunction, such as Leber’s hereditary optic neuropathy (LHON), which also display a higher prevalence in males than in females as described for PD, this could indicate a potential generic increased vulnerability of males to mitochondrial impairments. This matches with findings from prior studies for healthy individuals showing a higher activity of mitochondrial respiratory complexes in females compared to males⁷⁷. Furthermore, in this context, matrilineal inheritance of mitochondrial DNA (mtDNA) has previously been proposed to lead to a male-female asymmetry in the expected severity of mitochondrial diseases, because natural selection of mitochondria occurs only in females, and mitochondrial mutations are therefore expected to result more frequently in deleterious effects in men than women⁷⁸. While damage in mtDNA has previously been proposed to be linked with PD and other diseases involving mitochondrial dysfunction^79,80, the specific role of mtDNA in idiopathic PD is still unclear and conflicting results have been obtained from studies on mtDNA variation in PD⁸¹. Hormones, such as estrogen, also have important roles in the regulation of mitochondrial biogenesis and function⁸², and should therefore also be considered as potential contributing factor to sex-differences in mitochondrial function. Follow-up studies will be required to assess the specific influences of these different candidate factors on mitochondria-related sex differences in PD.

The most significant transcription factors identified in the network analysis are involved in inflammatory and immune response related processes (see Supplementary Table 9). A representative example is STAT3 (Signal Transducer and Activator Of Transcription 3), which shows an non-significant increased expression in male patients (LFC = 0.41, FDR = 1.8e−1) and a non-significant decrease in female patients (LFC = −0.29, FDR = 1). Its target genes are enriched among the male DEGs (predicted activity = 5.73), and not among female DEGs (predicted activity = −1.3), in agreement with its known activating effect on its targets. STAT3 is known to play a key regulatory role in determining the balance between astrogliogenesis and neurogenesis in brain neuroinflammation, and is activated in response to the pro-inflammatory cytokines IL-1β and TNF-α⁸³. Multiple functional involvements of STAT3 in PD-associated processes have previously been described, including the modulation of astrogliosis^84,85,86,87, microglia activation^88,89,90,91, and mitochondrial protein expression^92,93. Interestingly, STAT3 has been proposed as a potential drug target for neurodegenerative disorders, because its suppression during brain inflammation was found to promote neurogenesis and inhibit astrogliogenesis⁸³.

Apart from cytokines, steroid hormones have also been described to influence STAT3 response⁹⁴, and correspondingly, STAT3 activity has already been associated with sexual dimorphism in different organs, including the brain^92,95. Moreover, STAT3 activity correlates with clinical descriptors in a sex-dependent manner for different medical conditions, including brain cancer⁹⁶ and inflammatory disorders of different organs^{97,98,99,100,101,102}.

Among the other top-ranked TFs identified in the regulatory network analysis (see the top ten in Table 3) three of the five members of the NF-κB family are also included: NFKB1, RELA and RELB. The targets of these three TFs are all enriched among the DEGs for males (predicted activity >4.4) and not for females (predicted activity <−0.4). Among the TFs themselves, one gene, NFKB1, also displays significantly increased expression in male patients (LFC = 0.42, FDR = 6.2e−3) and not in female patients (LFC = 0.25, FDR = 4.5e−1). The other two genes do not exhibit significant changes, which is in line with the common observation that TFs tend to display smaller alterations than their downstream targets, and the activity of these targets often provides a more robust indication of activated or deactivated regulatory mechanisms.

Due to its central role in the regulation of inflammation-associated processes, the NF-κB pathway has previously also been investigated in the context of PD. Multiple sub-units of the NF-κB complex were found to be over-expressed in the substantia nigra of PD patients^103,104, where they are thought to enhance neuroinflammation^105,106. In model organisms, NF-κB activity correlates with the severity of PD-like symptoms and relevant cellular phenotypes, including mitochondrial homeostasis^107,108. Therefore, NFKB1 has also been proposed as a potential drug target for PD^104,109. Regarding the potential mechanisms linking NF-κB alterations to PD, previous studies have shown that NFKB1 is regulated by the PRKN gene (Parkin), which harbors mutations associated with familial forms of PD¹¹⁰, and that NFKB1 activity is also modulated by STAT3¹¹¹ (see the discussion of STAT3 above).

NF-κB activity differs between males and females under a variety of physiological conditions in different organs^112,113,114, including the brain¹¹⁵. In a cellular model, it was observed that NFKB1 activation protects glutamatergic neurons against oxidative stress-induced neuronal death in a sex-dependant manner (with superior protection of neurons from female donors)¹¹⁵. In the context of brain disorders, genomic variants of the NF-κB sub-unit RELA have been associated with schizophrenia in males¹¹⁶, but to the best our knowledge, NF-κB family members have previously not been linked specifically to sexual dimorphism in PD.

Finally, we note that NFKB1 inhibits the expression of NR4A2¹¹⁷, in line with NFKB1’s increased expression in male patients and the male-specific decrease in its downstream target genes. Similarly, the network in Fig. 3 also contains EFNA1, the top candidate sex-dimorphic gene, which is regulated by STAT3^118,119.

This study has the following limitations: First, while the statistical power was sufficient to identify differentially expressed genes in both males and females after multiple testing correction, only a limited number of relevant samples were available to detect significant changes for smaller effect sizes, in particular for the female analysis. In total, our substantia nigra meta-analysis used 198 samples (after data processing and filtering). This is in particular impacting our definition of candidate sex-dimorphic genes, which is mainly based on effect size differences. Follow-up studies with greater statistical power and a more balanced representation to detect sexual dimorphism will have the potential to show statistical significance for a larger number of variations with small effect sizes. Moreover, the incomplete availability of metadata for most datasets limited our ability to filter out all potential effects of confounding factors. For instance, RNA integrity numbers (RIN) and post-mortem intervals (PMI) are available for only one, respectively two, of the datasets used for the main meta-analysis. Considering this constraint, we conducted dedicated confounder correlation analyses on these datasets (see Supplementary Note 1 and Supplementary Table 10), which suggest only limited influences of PMI/RIN on the presented results. Further follow-up analyses on larger-scale, fully annotated datasets (i.e., with complete metadata) will be required for additional independent validation of these results. Another limitation is linked to the technology used to measure the transcriptomics profiles (mostly microarrays and bulk RNA-sequencing). It has been demonstrated that cell type proportions can influence differential analyses in many different tissues, including brain tissues^42,43. We conducted dedicated cell-type marker analyses to assess these potential influences in our meta-analysis, but cannot entirely rule out the possibility that cell type proportions differ more significantly between PD patients and healthy controls in the substantia nigra than these initial analyses suggest. Further research using single-cell technologies and large sample sizes will be required to better assess the potential role of differential cell-type proportions. A main strength of the applied methodology is that it integrates information from multiple different cohorts, experimental platforms and data types, and aggregates information from individual genes using pathway and network analyses. Thus, it provides both a transcriptome-wide ranking of gene-level sex-dependent alterations in PD and a global overview of the cellular processes impacted by these changes.

Follow-up studies will also be needed to more comprehensively characterize and confirm the mechanisms by which NR4A2 and other regulatory genes control or modulate the identified molecular sex differences in PD. These could include targeted in vitro and in vivo perturbation experiments of NR4A2 and other regulators with sex-dependent activity, such as CA2 and EFNA1, to investigate their impact on PD molecular phenotypes in disease models for both males and females. If the druggability, sex-dependency, and neuroprotective roles of NR4A2 or other candidate targets can be further substantiated, this could pave the way for subsequent preclinical investigations of adjuvant pharmacological strategies to reduce or alleviate sex-dependent molecular pathology in PD.

Methods

An overview of the entire processing and analysis workflow is presented in Fig. 4. Briefly, relevant Parkinson’s disease transcriptomics datasets were collected from public data repositories^{120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136} and complemented by a new dataset generated through RNA sequencing (RNA-seq) of samples from the Netherlands Brain Bank. All datasets were pre-processed and analyzed in order to detect significant transcript abundance differences between patients and controls (separately for males and females). Integrated differential expression statistics and rankings for each sex were obtained using a meta-analysis of all datasets. The genes with PD-associated expression alterations that differ significantly between males and females were further investigated using cellular pathway and network analyses. These analyses of bulk transcriptomic data were complemented by independent analyses of single-cell transcriptomics datasets, in order to further confirm the main identified pathway alterations and characterize their cell-type specificity. The following sections describe the methodologies for the data collection, processing, and downstream analyses.

**Fig. 4: Global workflow of the analysis.**

Sample preparation

Post-mortem human brain samples were obtained from The Netherlands Brain Bank, Netherlands Institute for Neuroscience (Amsterdam, The Netherlands). The tissue samples were homogenized (6875D Freezer/Mill Spex –Instrument Solutions Benelux BV). Fifty mg amount of sample was used for RNA isolation using the inhouse Tecan robot using AllPrep DNA/RNA/Protein kit (Qiagen) as mentioned before¹³⁷. One μg of total RNA was used for library preparation using TruSeq stranded mRNA library preparation kit (Illumina) as per the protocol provided by the manufacturer. Briefly, the mRNA pull down was done using the magnetic beads with oligodT primer. To preserve the strand information, the second strand synthesis was done such that during PCR amplification only first strand was amplified. The libraries were quantified using Qubit dsDNA HS assay kit (Thermofisher) and the size distribution was determined using Agilent 2100 Bioanalyzer. Pooled libraries were sequenced at a the LCSB sequencing platform using NextSeq500. This dataset was integrated with pre-processed and quality-controlled transcriptomics data from public data repositories using a meta-analysis to increase the statistical power to detect expression changes and to ensure the robustness of the findings across data from different studies.

Data collection

Relevant public transcriptomics datasets were identified through keyword-based searches in omics data repositories and literature databases. In particular, ArrayExpress (RRID: SCR_002964)¹³⁸, Gene Expression Omnibus (GEO, RRID: SCR_005012)¹³⁹, BioProject (RRID: SCR_004801)¹⁴⁰ and relevant resources such as the Parkinson’s Progression Markers Initiative (PPMI)¹⁴¹, PubMed (RRID: SCR_004846) and Google Scholar (RRID: SCR_008878) were queried using the following keywords: “parkinson,” “substantia nigra,” and “dopaminergic” in combination with “transcriptomics,” “microarray,” and “RNA-seq.” The list of all online resources, together with the associated queries is provided in the Supplementary Table 1.

The retrieved datasets were further filtered by considering only human transcriptomics datasets (i) covering at least five thousand genes, (ii) focusing on both idiopathic PD patients and controls, and (iii) with at least ten samples. Only datasets analyzing the midbrain region substantia nigra, dopaminergic neurons from the substantia nigra or induced pluripotent stem cells derived dopaminergic neurons were retained. Two of the retrieved datasets (GSE42966 and GSE43490) were merged into a single dataset (Moreira) because they were generated by the same research group, from the same cohort, and shared most of their samples (13 and 15 samples each, 18 unique samples in total). Other types of brain tissues, such as the cortex or putamen were covered by an insufficient number of datasets to be further considered for a meta-analysis. The 20 selected transcriptomics datasets include 428 samples in total and were derived from 12 different measurement platforms^{120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136}. Datasets focusing only on familial PD cases were not considered. Twelve, three and three datasets, respectively, covered the substantia nigra (SN), dopaminergic neurons from the substantia nigra (DA) and induced pluripotent stem cells derived dopaminergic neurons (iPSC-DA). The remaining two datasets are single-cell transcriptomics datasets (10× Genomics/Chromium) derived from substantia nigra samples from PD patients and controls (SC-SN), used to confirm key pathway alterations observed in the bulk transcriptomics data and to characterize their cell-type specificity. All datasets are described in further detail in Supplementary Tables 1–4.

We also ensured that there was no duplicate sample across the different datasets (i.e., samples derived from the same patient). By analyzing the dataset metadata, we identified two datasets with potential duplicates (GSE8397 and GSE26927), i.e., samples derived from the same brains stored at the UK Parkinson’s Disease Society Tissue Bank at Imperial College London. We therefore removed the five potential duplicated samples from the dataset GSE8397 prior to performing the meta-analysis.

For each platform, probe/transcript annotations, such as the chromosomal location and the corresponding gene names were retrieved from the Ensembl database (RRID: SCR_002344)¹⁴², whenever possible, and otherwise from the metadata provided with the dataset, and then matched using current Ensembl annotations for consistency (v102, Nov. 2020).

The original clinical annotations were extracted from the metadata obtained along with the datasets. Additional descriptors were provided by the data generators and missing values were predicted when possible (see next section).

Missing value imputation

For some datasets, clinical descriptors, such as disease status, biological sex and age, were not available for all samples. Contacting the data generators allowed us to obtain 15 additional annotations for 14 samples. However, there were still missing values and we therefore investigated whether these could be predicted.

For eight samples across three datasets, the biological sex could not be retrieved. We therefore estimated these missing values by considering the expression levels of genes located on sex chromosomes. For each dataset, we selected loci that best discriminate between males and females on training data (i.e., samples associated with male or female patients), and used only these loci to make predictions for the samples for which no annotations were available. When using a leave-one-out cross-validation scheme, the predicted values matched with the already known annotations for two of the three datasets (accuracy was 100% and 85%, respectively, for GSE24378 and GSE20163), suggesting that, for these datasets, reliable predictions can be made for the few samples with missing annotations. This allowed us to update the clinical data with the predicted biological sex for five out of eight samples with missing information. However, for the remaining dataset with three sample annotations missing, the expression levels from sex chromosome genes did not provide sufficiently reliable estimations of the biological sex (accuracy for GSE20141 was 73%). Therefore, for this dataset, the three samples with unknown sex were simply removed. For some datasets, reliable predictions could be made, and potential annotation errors were detected (samples annotated as female but predicted to be male or vice versa) but these were not corrected.

Regarding age, the predictions derived from models based on the R package missRanger were not considered accurate enough to be considered. More precisely, and using a leave-one-out cross-validation scheme, the average RMSE (root mean square error) for the three datasets with missing ages was 11.3 years (values between 9.8 and 12.5). This means that the average difference between the real and predicted age was above 11 years, which was not considered acceptable given the important influence of age on expression data.

Detecting experimental batches

The expression metadata was analyzed in order to detect the experimental batches. For the datasets extracted from the Gene Expression Omnibus (GEO) database (RRID: SCR_005012), the retrieved batches were harmonized with annotations from the Gemma database (RRID: SCR_008007)¹⁴³.

For each dataset, we then checked whether there could be a potential batch artifact. Datasets with small batches (less than four samples) or with many batches (more than four) were not further investigated because it is difficult to take the batch effect into account without introducing some bias. For the remaining datasets, the experimental batches were included as covariate in the differential analysis. The summary of this analysis is presented in Supplementary Table 2.

Age matching

The Parkinson’s disease cohorts usually contain age-matched patients and controls, however males and females might not always be age-matched as well, which is equally important in our study. We therefore investigated the age differences between (i) patients and controls, (ii) males and females and (iii) all four categories (female controls, female patients, male controls and male patients).

The results for all datasets are summarized in Supplementary Fig. 1. We observe a significant difference between the patient and control age values. This difference is less than five years for only four of the sixteen datasets (for two datasets, age is completely missing) and less than ten years for ten datasets. A similar situation is observed between males and females although there are some variations per dataset. The age distribution for the four categories of interest are plotted in Supplementary Fig. 1 for three representative datasets. The general trend is that patients are older than controls and males are younger than females. This motivated us to compare patients and controls for each sex independently instead of comparing the four categories of interest all at once with a more complex limma model (another motivation is that several datasets contain enough female samples but not enough male samples or vice versa). This also motivated us to include age as a covariate in the differential expression analysis when possible (see Supplementary Table 3 for more details about the linear models).

Bulk data processing

For microarray datasets, a quality control analysis of the data was conducted using the R package ArrayQualityMetrics (v3.42.0, RRID: SCR_001335)¹⁴⁴ prior to the raw data pre-processing in order to remove outlier samples. This package performs sample outlier detection analysis using three main methods. Any sample that was flagged as an outlier at least twice out of the three checks was removed from the analysis. In total, four samples from three datasets were removed according to these standard filtering criteria.

Affymetrix microarray datasets were then pre-processed using the Affymetrix PowerTools suite with the GC-RMA algorithm (v1.20.0)¹⁴⁵. In general, the signal was summarized at the probe level, except for exon arrays, where a transcript level summary was computed. Illumina and Agilent microarray datasets were pre-processed using manufacturer-specific R software packages (beadarray v2.36.1 and limma v3.42.2)^146,147.

The in-house generated RNA sequencing data was processed using the alignment-free quantification software Kallisto (v0.46)¹⁴⁸ and the homo sapiens Ensembl v96 transcriptome, and then normalized by total read counts and mean-variance relationship estimation using voom¹⁴⁹). For the RNA-sequencing data derived from GSE110717, only pre-processed data was available, and this dataset was therefore directly post-processed after quality control.

The post-processing always included the removal of samples from conditions other than idiopathic PD and healthy controls, of lowquality genetic probes, and of genetic probes/transcripts with zero variance. The heteroscedasticity of the measured signal intensities was plotted for all datasets and if variance-dependent signal intensities were observed, a variance stabilizing normalization was applied (R package vsn v3.54.0^150,151).

The quality control analyses using ArrayQualityMetrics were repeated after completing these processing steps. Three additional samples were removed at this stage, according to the criteria described above, including two with sample quality metrics that were already close to the outlier thresholds in the initial quality control. After these preprocessing steps, the 18 filtered datasets contained in total 331 from the original 346 samples. Supplementary Table 1 contains a flow diagram that summarizes the number of datasets/samples that are kept after each preprocessing step, together with summary statistics on the demographics (e.g., disease status, sex, age).

Removing irrelevant probes/transcripts

The lists of probes and transcripts were cleaned prior to the differential expression analyses in order to focus only on relevant entities. Probes and transcripts were selected using the same criteria but these are mostly relevant for probes as transcripts are in general well defined and associated with a single gene. First, probes associated with five genes or more were discarded as we wanted to focus on signals that are specific enough to be easily interpreted. Second, we also discarded the probes that match more than one gene if there also exists another probe that matches a subset of these genes (e.g., a probe that matches both gene A and gene B will be removed if there is another probe that matches only A or only B). Complex cases of probes matching overlapping sets of genes were removed as their interpretation would have been difficult (e.g., two probes matching respectively genes A+B and genes A+C will be removed regardless of the existence of probes matching only gene A or only gene B or only gene C).

Differential expression analyses

Two differential expression analyses were performed on each dataset individually, using only female or male samples, to identify transcripts that display significant sex-dependent differences between Parkinson’s disease patients and controls.

For the processed bulk datasets, these differential analyses were conducted using the R package limma (v3.42.2, RRID: SCR_010943), which relies on linear models¹⁴⁷. Available clinical covariates were included in the models in order to correct for potential confounders and biases and detect only the disease-associated biological alterations of interest. These covariates cover experimental batches, age and sample pairing information. Experimental sample batches were included as variable in the model when they confounded with the clinical outcomes and sufficient numbers of samples per batch enabled an estimation of possible batch effects (at least 4 samples per batch and maximum 4 batches in total, see section “Detecting experimental batches”). Information on the age of the subjects was also included in the model, whenever available, since ages were often not perfectly matched between patients and controls. The GSE8397 dataset contained two biological replicate samples per subject and the corresponding sample pairing information was also integrated into the model design. The configuration used for each linear model and for each dataset is described in more detail in the Supplementary Table 3.

Dataset weights

The datasets were associated to weights in the meta-analysis. These weights represent a level of confidence and, as such, are derived from the number of samples for the current analysis based on the assumption that datasets with more samples are associated with larger, aka better detection powers and their differential expression analyses should therefore weight more in the meta-analysis. Only relevant samples are taken into account so that, for instance, the number of male samples is not considered for all female analyses. More precisely, the weights are derived from the lowest number of samples across the relevant patient categories (i.e., for a female analysis, this would mean the smallest between the number of female patients and the number of female controls). The weights are computed using an adaptation of the formula by Marot and Mayer (Eq. (1))¹⁵².

$$\begin{array}{ll}{\omega }_{d,s}\!\!&=\sqrt{\frac{{n}_{d,s}}{{\sum }_{\delta \in \Delta }{n}_{\delta ,s}}}\\ {{\mbox{with}}}\,{n}_{\delta ,s}&={\rm{min}}(| {\rm{patients}}_{\delta ,s}| ,| {\rm{controls}}_{\delta ,s}| )\end{array}$$

(1)

with ω_d,s the weight of dataset d for sex s, Δ the set of all omics datasets, patients_δ,s and controls_δ,s respectively the sets of patients and controls in dataset δ for sex s.

Data integration

The statistical results for the differential expression analyses of the individual transcriptomics datasets were integrated using a meta-analysis across all pre-filtered datasets derived from the same tissue. For genes covered by multiple probes/transcripts, it was necessary to define the most relevant entity to select for the meta-analysis. We decided to select the probe/transcript with the highest average expression across the considered datasets since low-signal is associated with lower reliability. This means that, for a given gene, it is possible that two distinct probes/transcripts are selected for the male and female analyses. The list of the 211 genes for which this happens in at least one dataset can be found in Supplementary Table 5. These genes were not considered further in the present study. Prior to the meta-analysis, each gene is therefore associated with a base 2 log. fold change and a nominal p value per dataset.

Next, for each gene, the consensus activity shift (i.e., up- or down-expression) was determined through a weighted voting scheme, where the weights reflect the relative number of samples per dataset. Nominal p value significance scores obtained from the differential expression analyses on the individual datasets were then integrated using the weighted meta-analysis approach by Marot and Mayer¹⁵², again using weights corresponding to the relative number of samples per dataset. For each gene, the meta-analysis focused on the datasets with log. fold change consistent with the overall direction of the estimated cross-study log. fold change (see above) to ensure that the integrated p values represent gene activity shifts in the same direction. Finally, the resulting integrated meta-analysis p values were adjusted for multiple hypothesis testing using the Benjamini and Hochberg method¹⁵³. The male and female analyses were run separately, adjusting however the nominal p values once using both sets of genes. A gene was considered significantly differentially expressed if the false-discovery rate was below 0.05.

We wanted to focus our meta-analysis on genes with a reliable signal. We therefore computed, for each gene, a reliability score by summing up the weights of the datasets for which the gene was present and dividing that value by the sum of all weights. If a given gene has no missing value, its reliability score is 1, if all values are missing, it is 0. In our case, genes whose reliability scores were below 2/3 were not considered further (i.e., representing a maximum of 33.34% missing values if all datasets were to have the same weights). This filter was implemented after the meta-analysis since it might not be fully p value independent. Additionally, apart from providing information on significance as main ranking criteria, we determined a cross-study estimate of the log. fold change for each gene by computing a weighted average of the log. fold changes that are consistent with the consensus activity shift. We also wanted to distinguish between genes that are consistently found to be differentially expressed (i.e., reported as up-expressed in 100% of the datasets) and genes with less clear patterns (i.e., reported as up-expressed in 51% of the datasets). A consistency score was established by computing the weighted percentage of datasets that report a log. fold change in the same direction than the cross-study log. fold change. Only genes with at least 60% consistency were taken into consideration for further analysis. In the evaluation of results, the reader should take into consideration both the final integrated significance score and the consistency score (see Tables 1 and 3 as well as Supplementary Tables 5, 11, and 12).

Sex-specific and candidate sex-dimorphic genes

Next, the significantly differentially expressed genes (DEGs) with FDR < 0.05 in at least one analysis were categorized into female-specific, male-specific and candidate sex-dimorphic genes according to their differential expression profiles in males and in females. The overall process is described as a decision chart in Supplementary Fig. 2.

Due to the difference in detection power between the male and female analyses, the corresponding FDR values have different distributions (male FDR values are a lot smaller than female FDR values). This means that a categorization based only on FDR values will identify most male DEGs as male-specific and only few female DEGs as female-specific. We therefore decided to use a rank based strategy to define sex-specificity. More precisely, we have defined a sex-specificity index that is based on the rank ratios of the gene π-values⁴¹ (see Eq. (2)).

$$\begin{array}{l}{\rm{Spe}}_{g}\,=\,{\rm{rank}}\;{\rm{ratio}}({\pi }_{{\rm{F}},g})-{\rm{rank}}\;{\rm{ratio}}({\pi }_{{\rm{M}},g})\\ {{\mbox{with}}}\,{\pi }_{s,g}\,=\,-{\rm{log}}_{10}({\hat{p}}_{s,g})\times {\rm{abs}}({\rm{LFC}}_{s,g})\\ {{\mbox{with}}}\,{\hat{p}}_{s,g}\,=\,\left\{\begin{array}{ll}p-\epsilon \quad &\,{{\rm{if}}\;p = 1,}\,\\ p\quad &\,{{\mbox{otherwise.}}}\,\end{array}\right.\\ \end{array}$$

(2)

with Spe_g the sex-specificity score of gene g, π_s,g the π value of gene g for sex s, LFC_s,g and p_s,g, respectively, the log. fold change and nominal p value of gene g obtained in the analysis for sex s. To avoid ties in the rank ratios, we introduced ${\hat{p}}_{s,g}$ (to avoid setting π_s,g to 0 when p_s,g is equal to 1) and relied on the nominal p value instead of the FDR.

Sex-specific DEGs were defined as genes that were only significantly differentially expressed in males (FDR < 0.05), but not approaching significance in females, or vice versa (based on their sex-specificity score Spe, see decision chart in Supplementary Fig. 2). The rationale was to avoid calling sex-specific a gene that is significant in males and close to significant in females (or vice versa), which would result in spurious assignments of sex-specificity. Candidate sex-dimorphic DEGs were defined as those with an opposite direction of the change between PD and control samples across males and females (up in one case and down in the other, as determined by the signs of the log. fold changes, and at least one of them showing statistically significant alteration) and requiring the genes to have a minimum absolute cross-study log. fold change >0.25 in both cases to ensure robustness (preventing misinterpretation of gene expression changes with small absolute effect sizes). Genes that showed consistent changes across the analyses, i.e., that were significantly differentially expressed for both males and females with the same direction of the change, were not further considered for subsequent sex-related analyses (for the interested reader, we provide a list of these shared genes in Supplementary Table 5).

Cellular pathway and process analyses

Sex-dependent PD-associated cellular pathway and process alterations were determined using Fisher’s exact test to quantify the significance of the over-representation of sex-specific or sex-dimorphic DEGs among the members of each considered pathway. All genes associated with a reliable profile (i.e., not having more than 33.34% missing values) were used to build a relevant background. The resulting p value significance scores were then adjusted for multiple hypothesis testing according to the method by Benjamini and Hochberg. The pathway analysis was implemented using the clusterProfiler R package (v4.2.0, RRID: SCR_016884)¹⁵⁴.

Regulatory network analysis

Sex-dependent changes in gene regulatory sub-networks were identified in two steps. First, an enrichment analysis method was used to determine the transcription factors whose known target genes are over-represented among the DEGs. The enrichment is considered as a proxy of the transcription factor activities. It was computed using the strategy defined in Dorothea¹⁵⁵ and implemented in the software Funki¹⁵⁶, and the analysis was performed independently for male and female DEGs. The difference between the predicted activity for the sex-specific analyses was then used to identify the most relevant transcription factors involved in the control of downstream sex-dependent expression changes (i.e., transcription factors whose targets are enriched in one sex and relatively underrepresented in the other).

Regulatory interactions between the transcription factors of interest and the differentially expressed genes were extracted from two repositories, MetaCore GeneGo and OmniPath/Dorothea^155,157 (OmniPath/Dorothea version of the 2022/09/01, GeneGo version of the 2022/09/01, RRID: SCR_008125). Gene names were unified through matching to official gene names from Ensembl (v107, Jul2022). The regulatory consistency of the interactions was further checked and interactions associated with inconsistent gene expression log. fold changes were filtered out (i.e., activated targets must show the same direction of change as the source gene, inhibited targets must show an opposite direction of change). Network visualizations presented in Fig. 3 and Supplementary Fig. 9 were created using Cytoscape¹⁵⁸.

Single-cell transcriptomics data analyses

Two single-cell RNA-seq datasets derived from substantia nigra samples of PD patients and controls were analyzed to confirm the main pathway analysis findings from the bulk transcriptomics analysis, and to study the cell-type specificity of sex-dependent disease-associated alterations. Both datasets were pre-processed and quality-filtered using the R software package Seurat (v3.2.0)¹⁵⁹. Cells for which more than 10% of the counts mapped to ribosomal genes were removed after confirming that they also had lower numbers of total counts. In addition, outlier cells associated with less than 200 or more than 10,000 detected genes were also removed. The data was then normalized by applying the SC transform method¹⁶⁰, using the mitochondrial gene/ribosomal gene/ribosomal RNA proportions, cell cycle estimates and patient age as covariates, whenever the relevant variables were available.

For datasets with existing metadata, the original cell cluster annotations corresponding to the different cell types were used, and otherwise, cell clusters were determined as follows. First, a PCA was run to extract the 50 most variable components. These were then used to create a cell-cell association network and identify cell clusters in this network using the enhanced Louvain algorithm (using multilevel refinement, with a resolution of 0.3, 500 different repetitions and maximum 500 iterations per repetition)¹⁶¹. Finally, the expression of known cell markers was used to estimate the most probable cell type for each cluster (see Supplementary Note 1). For each cell cluster, a differential expression analysis was performed separately for males and females using a Poisson distribution model¹⁶². Due to the limited number of available single-cell RNA-seq datasets, no integrative meta-analysis was performed, and the differentially expressed genes for each dataset were further investigated in separate functional enrichment analyses.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All data and materials associated with this publication, including links to the data as well as all the Supplementary Files mentioned in the manuscript, are hosted on a dedicated webpage (https://doi.org/10.17881/hpbx-y095). For the RNA-seq data measured in-house, the post-mortem human brain samples were obtained from The Netherlands Brain Bank, Netherlands Institute for Neuroscience (Amsterdam, The Netherlands; open access: http://www.brainbank.nl). All material has been collected from donors for or from whom a written informed consent for a brain autopsy and the use of the material and clinical information for research purposes had been obtained by the NBB. The data can be accessed on the Gene Expression Omnibus database under the identifier GSE168496.

Code availability

The analyses were implemented in R (v3.6.0) and bash (v4.4.20). The code of the entire analysis workflow is made available on the following GitLab repository under the MIT license: https://gitlab.lcsb.uni.lu/bds/geneder/geneder_core (tag corresponding to the version used in the manuscript: v1.3).

References

Adrissi, J. & Fleisher, J. Moving the dial toward equity in Parkinson’s disease clinical research: a review of current literature and future directions in diversifying PD clinical trial participation. Curr. Neurol. Neurosci. Rep. 22, 475–483 (2022).
Article Google Scholar
Jankovic, J. Parkinson’s disease: clinical features and diagnosis. J. Neurol. Neurosurg. Psychiatry 79, 368–376 (2008).
Article CAS Google Scholar
Solla, P. et al. Gender differences in motor and non-motor symptoms among Sardinian patients with Parkinson’s disease. J. Neurol. Sci. 323, 33–39 (2012).
Article Google Scholar
Müller, B., Assmus, J., Herlofson, K., Larsen, J. P. & Tysnes, O. B. Importance of motor vs. non-motor symptoms for health-related quality of life in early Parkinson’s disease. Parkinsonism Relat. Disord. 19, 1027–1032 (2013).
Article Google Scholar
Kalia, L. V. & Lang, A. E. Parkinson’s disease. Lancet 386, 896–912 (2015).
Article CAS Google Scholar
Bu, L. L. et al. Toward precision medicine in Parkinson’s disease. Ann. Transl. Med. 4, 26 (2016).
Google Scholar
Gasser, T. Personalized medicine approaches in Parkinson’s disease: the genetic perspective. J. Parkinsons Dis. 6, 699–701 (2016).
Article Google Scholar
Kim, H. J. & Jeon, B. How close are we to individualized medicine for Parkinson’s disease? Expert Rev. Neurother. 16, 815–830 (2016).
Article CAS Google Scholar
Sherer, T. B., Frasier, M. A., Langston, J. W. & Fiske, B. K. Parkinson’s disease is ready for precision medicine. Per. Med. 13, 405–407 (2016).
Article CAS Google Scholar
Titova, N. & Chaudhuri, K. R. Personalized medicine in Parkinson’s disease: time to be precise. Mov. Disord. 32, 1147–1154 (2017).
Article Google Scholar
Baldereschi, M. et al. Parkinson’s disease and parkinsonism in a longitudinal study: two-fold higher incidence in men. Neurology 55, 1358–1363 (2000).
Article CAS Google Scholar
Clavería, L. E. et al. Prevalence of Parkinson’s disease in Cantalejo, Spain: a door-to-door survey. Mov. Disord. 17, 242–249 (2002).
Article Google Scholar
Benito-León, J. et al. Prevalence of PD and other types of parkinsonism in three elderly populations of central Spain. Mov. Disord. 18, 267–274 (2003).
Article Google Scholar
Van Den Eeden, S. K. et al. Incidence of Parkinson’s disease: variation by age, gender, and race/ethnicity. Am. J. Epidemiol. 157, 1015–1022 (2003).
Article Google Scholar
De Lau, L. M. et al. Incidence of parkinsonism and Parkinson disease in a general population: the Rotterdam Study. Neurology 63, 1240–1244 (2004).
Article Google Scholar
Wooten, G. F., Currie, L. J., Bovbjerg, V. E., Lee, J. K. & Patrie, J. Are men at greater risk for Parkinson’s disease than women? J. Neurol. Neurosurg. Psychiatry 75, 637–639 (2004).
Article CAS Google Scholar
Haaxma, C. A. et al. Gender differences in Parkinson’s disease. J. Neurol. Neurosurg. Psychiatry 78, 819–824 (2007).
Article Google Scholar
Zappia, M. et al. Sex differences in clinical and genetic determinants of levodopa peak-dose dyskinesias in Parkinson disease: an exploratory study. Arch. Neurol. 62, 601–605 (2005).
Article Google Scholar
Bjornestad, A. et al. Risk and course of motor complications in a population-based incident Parkinson’s disease cohort. Parkinsonism Relat. Disord. 22, 48–53 (2016).
Article Google Scholar
Lavalaye, J., Booij, J., Reneman, L., Habraken, J. B. & Van Royen, E. A. Effect of age and gender on dopamine transporter imaging with [123I]FP-CIT SPET in healthy volunteers. Eur. J. Nucl. Med. 27, 867–869 (2000).
Article CAS Google Scholar
Staley, J. K. et al. Sex differences in [123I]β-CIT SPECT measures of dopamine and serotonin transporter availability in healthy smokers and nonsmokers. Synapse 41, 275–284 (2001).
Article CAS Google Scholar
Laakso, A. et al. Sex differences in striatal presynaptic dopamine synthesis capacity in healthy subjects. Biol. Psychiatry 52, 759–763 (2002).
Article CAS Google Scholar
Sato, K. et al. Prognosis of Parkinson’s disease: time to stage III, IV, V and to motor fluctuations. Mov. Disord. 21, 1384–1395 (2006).
Article Google Scholar
Colombo, D. et al. The “gender factor” in wearing-off among patients with parkinson’s disease: a post hoc analysis of DEEP study. Scientific World J. 2015, 787451 (2015).
Article Google Scholar
Picillo, M. et al. Gender and non motor fluctuations in Parkinson’s disease: a prospective study. Parkinsonism Relat. Disord. 27, 89–92 (2016).
Article Google Scholar
Henderson, V. W., Watt, L. & Buckwalter, J. G. Cognitive skills associated with estrogen replacement in women with Alzheimer’s disease. Psychoneuroendocrinology 21, 421–430 (1996).
Article CAS Google Scholar
Stein, D. G. Progesterone exerts neuroprotective effects after brain injury. Brain Res. Rev. 57, 386–397 (2008).
Article CAS Google Scholar
Pike, C. J. Testosterone attenuates β-amyloid toxicity in cultured hippocampal neurons. Brain Res. 919, 160–165 (2001).
Article CAS Google Scholar
Moffat, S. D. et al. Free testosterone and risk for Alzheimer disease in older men. Neurology 62, 188–193 (2004).
Article CAS Google Scholar
Fratiglioni, L. et al. Very old women at highest risk of dementia and Alzheimer’s disease: incidence data from the Kungsholmen Project, Stockholm. Neurology 48, 132–138 (1997).
Article CAS Google Scholar
Andersen, K. et al. Gender differences in the incidence of AD and vascular dementia: the EURODEM Studies. EURODEM Incidence Research Group. Neurology 53, 1992–1997 (1999).
Article CAS Google Scholar
Miech, R. A. et al. Incidence of AD may decline in the early 90s for men, later for women: the Cache County study. Neurology 58, 209–218 (2002).
Article CAS Google Scholar
Pan, H.-X. et al. GCH1 variants contribute to the risk and earlier age-at-onset of Parkinson’s disease: a two-cohort case-control study. Transl. Neurodegener. 9, 31 (2020).
Article CAS Google Scholar
Huang, P., Yang, X.-D., Chen, S.-D. & Xiao, Q. The association between Parkinson’s disease and melanoma: a systematic review and meta-analysis. Transl. Neurodegener. 4, 21 (2015).
Article Google Scholar
Phung, D. M. et al. Meta-analysis of differentially expressed genes in the substantia nigra in Parkinson’s disease supports phenotype-specific transcriptome changes. Front. Neurosci. 14, 596105 (2020).
Article Google Scholar
Su, L., Wang, C., Zheng, C., Wei, H. & Song, X. A meta-analysis of public microarray data identifies biological regulatory networks in Parkinson’s disease. BMC Med. Genomics 11, 40 (2018).
Mariani, E. et al. Meta-analysis of Parkinson’s disease transcriptome data using TRAM software: whole substantia nigra tissue and single dopamine neuron differential gene expression. PLoS ONE 11, e0161567 (2016).
Crispino, P. et al. Gender differences and quality of life in Parkinson’s disease. Int. J. Environ. Res. Public Health 18, 198 (2021).
Article CAS Google Scholar
Gillies, G. E., Pienaar, I. S., Vohra, S. & Qamhawi, Z. Sex differences in Parkinson’s disease. Front. Neuroendocrinol. 35, 370–384 (2014).
Article CAS Google Scholar
Shulman, L. M. & Bhat, V. Gender disparities in Parkinson’s disease. Expert Rev. Neurother. 6, 407–416 (2006).
Article CAS Google Scholar
Xiao, Y. et al. A novel significance score for gene selection and ranking. Bioinformatics 30, 801–807 (2014).
Article CAS Google Scholar
Nido, G. S. et al. Common gene expression signatures in Parkinson’s disease are driven by changes in cell composition. Acta Neuropathol. Commun. 8, 55 (2020).
Article CAS Google Scholar
Feleke, R. et al. Cross-platform transcriptional profiling identifies common and distinct molecular pathologies in Lewy body diseases. Acta Neuropathol. 142, 449–474 (2021).
Article Google Scholar
Nalls, M. A. et al. Large-scale meta-analysis of genome-wide association data identifies six new risk loci for Parkinson’s disease. Nat. Genet. 46, 989–993 (2014).
Article CAS Google Scholar
Silaidos, C. et al. Sex-associated differences in mitochondrial function in human peripheral blood mononuclear cells (PBMCs) and brain. Biol. Sex Differ. 9, 34 (2018).
Article CAS Google Scholar
Farhat, F., Amérand, A., Simon, B., Guegueniat, N. & Moisan, C. Gender-dependent differences of mitochondrial function and oxidative stress in rat skeletal muscle at rest and after exercise training. Redox Rep. 22, 508–514 (2017).
Article CAS Google Scholar
Ferreira, L. F. Mitochondrial basis for sex-differences in metabolism and exercise performance. Am. J. Physiol. Regul. Integr. Comp. Physiol. 314, R848–R849 (2018).
Article CAS Google Scholar
Bose, A. & Beal, M. F. Mitochondrial dysfunction in Parkinson’s disease. J. Neurochem. 139, 216–231 (2016).
Article CAS Google Scholar
Antony, P. M., Diederich, N. J., Krüger, R. & Balling, R. The hallmarks of Parkinson’s disease. FEBS J. 280, 5981–5993 (2013).
Article CAS Google Scholar
Welch, J. D. et al. Single-cell multi-omic integration compares and contrasts features of brain cell identity. Cell 177, 1873.e17–1887.e17 (2019).
Article Google Scholar
Reynolds, R. H. et al. Moving beyond neurons: the role of cell type-specific gene regulation in Parkinson’s disease heritability. NPJ Parkinsons Dis. 5, 6 (2019).
Article Google Scholar
Saunders, A. et al. Molecular diversity and specializations among the cells of the adult mouse brain. Cell 174, 1015.e16–1030.e16 (2018).
Article Google Scholar
Agarwal, D. et al. A single-cell atlas of the human substantia nigra reveals cell-specific pathways associated with neurological disorders. Nat. Commun. 11, 4183 (2020).
Article Google Scholar
Masato, A., Plotegher, N., Boassa, D. & Bubacco, L. Impaired dopamine metabolism in Parkinson’s disease pathogenesis. Mol. Neurodegener. 14, 35 (2019).
Article Google Scholar
Burbulla, L. F. et al. Dopamine oxidation mediates mitochondrial and lysosomal dysfunction in Parkinson’s disease. Science 357, 1255–1261 (2017).
Article CAS Google Scholar
Moors, T. et al. Lysosomal dysfunction and α-synuclein aggregation in Parkinson’s disease: diagnostic links. Mov. Disord. 31, 791–801 (2016).
Article CAS Google Scholar
Pérez-Sieira, S., López, M., Nogueiras, R. & Tovar, S. Regulation of NR4A by nutritional status, gender, postnatal development and hormonal deficiency. Sci. Rep. 4, 4264 (2014).
Article Google Scholar
Mo, R. et al. Estrogen regulates CCR gene expression and function in T lymphocytes. J. Immunol. 174, 6023–6029 (2005).
Article CAS Google Scholar
Miotto, P. M., McGlory, C., Holloway, T. M., Phillips, S. M. & Holloway, G. P. Sex differences in mitochondrial respiratory function in human skeletal muscle. Am. J. Physiol. Regul. Integr. Comp. Physiol. 314, R909–R915 (2018).
Article CAS Google Scholar
Ventura-Clapier, R. et al. Mitochondria: a central target for sex differences in pathologies. Clin. Sci. 131, 803–822 (2017).
Article CAS Google Scholar
Congdon, E. E. Sex differences in autophagy contribute to female vulnerability in Alzheimer’s disease. Front. Neurosci. 12, 372 (2018).
Article Google Scholar
Harris, V. M., Harley, I. T., Kurien, B. T., Koelsch, K. A. & Scofield, R. H. Lysosomal pH is regulated in a sex dependent manner in immune cells expressing CXORF21. Front. Immunol. 10, 578 (2019).
Article CAS Google Scholar
Sacchetti, P., Carpentier, R., Ségard, P., Olivé-Cren, C. & Lefebvre, P. Multiple signaling pathways regulate the transcriptional activity of the orphan nuclear receptor NURR1. Nucleic Acids Res. 34, 5515–5527 (2006).
Article CAS Google Scholar
Hammond, S. L. et al. The nurr1 ligand,1,1-bis(39-Indolyl)-1-(p-Chlorophenyl)methane, modulates glial reactivity and is neuroprotective in MPTP-induced parkinsonisms. J. Pharmacol. Exp. Ther. 365, 636–651 (2018).
Article CAS Google Scholar
Yang, Y. X. & Latchman, D. S. Nurr1 transcriptionally regulates the expression of α-synuclein. Neuroreport 19, 867–871 (2008).
Article CAS Google Scholar
Glaab, E. & Schneider, R. Comparative pathway and network analysis of brain transcriptome changes during adult aging and in Parkinson’s disease. Neurobiol. Dis. 74, 1–13 (2015).
Article CAS Google Scholar
Le, W. D. et al. Selective agenesis of mesencephalic dopaminergic neurons in Nurr1- deficient mice. Exp. Neurol. 159, 451–458 (1999).
Article CAS Google Scholar
Jiang, C. et al. Age-dependent dopaminergic dysfunction in Nurr1 knockout mice. Exp. Neurol. 191, 154–162 (2005).
Article CAS Google Scholar
Zetterström, R. H. et al. Dopamine neuron agenesis in Nurr1-deficient mice. Science 276, 248–250 (1997).
Article Google Scholar
Kim, C. H. et al. Nuclear receptor Nurr1 agonists enhance its dual functions and improve behavioral deficits in an animal model of Parkinson’s disease. Proc. Natl Acad. Sci. USA 112, 8756–8761 (2015).
Article CAS Google Scholar
Spathis, A. D. et al. Nurr1:RXRα heterodimer activation as monotherapy for Parkinson’s disease. Proc. Natl Acad. Sci. USA 114, 3999–4004 (2017).
Article CAS Google Scholar
Pollard, A., Shephard, F., Freed, J., Liddell, S. & Chakrabarti, L. Mitochondrial proteomic profiling reveals increased carbonic anhydrase II in aging and neurodegeneration. Aging 8, 2425–2436 (2016).
Article CAS Google Scholar
Şentürk, M., Ekinci, D., Göksu, S. & Supuran, C. T. Effects of dopaminergic compounds on carbonic anhydrase isozymes I, II, and VI. J. Enzyme Inhib. Med. Chem. 27, 365–369 (2012).
Article Google Scholar
Härkönen, P. L. et al. Differential regulation of carbonic anhydrase ii by androgen and estrogen in dorsal and lateral prostate of the rat. Endocrinology 128, 3219–3227 (1991).
Article Google Scholar
Cramer, K. S. & Miko, I. J. Eph-ephrin signaling in nervous system development. F1000Res5 5, F1000 Faculty Rev-413 (2016).
Jing, X. et al. Ephrin-A1-mediated dopaminergic neurogenesis and angiogenesis in a rat model of Parkinson’s disease. PLoS ONE 7, e32019 (2012).
Article CAS Google Scholar
Silaidos, C. et al. Sex-associated differences in mitochondrial function in human peripheral blood mononuclear cells (PBMCs) and brain. Biol. Sex Differ. 9, 34 (2018).
Article CAS Google Scholar
Frank, S. A. & Hurst, L. D. Mitochondria and male disease. Nature 383, 224 (1996).
Article CAS Google Scholar
Martín-Jiménez, R., Lurette, O. & Hebert-Chatelain, E. Damage in mitochondrial DNA associated with parkinson’s disease. DNA Cell Biol. 39, 1421–1430 (2020).
Article Google Scholar
Di Monte, D. A. Mitochondrial DNA and Parkinson’s disease. Neurology 41, 38–42 (1991).
Article Google Scholar
Müller-Nedebock, A. C. et al. The unresolved role of mitochondrial DNA in Parkinson’s disease: an overview of published studies, their limitations, and future prospects. Neurochem. Int. 129, 104495 (2019).
Article Google Scholar
Klinge, C. M. Estrogenic control of mitochondrial function and biogenesis. J Cell. Biochem. 105, 1342–1351 (2008).
Article CAS Google Scholar
Chen, E. et al. A novel role of the STAT3 pathway in brain inflammation-induced human neural progenitor cell differentiation. Curr. Mol. Med. 13, 1474–1484 (2013).
Article CAS Google Scholar
Hashioka, S. et al. Interferon-γ-induced neurotoxicity of human astrocytes. CNS Neurol. Disord. Drug Targets 14, 251–256 (2015).
Article CAS Google Scholar
Samidurai, M. et al. Tumor necrosis factor-like weak inducer of apoptosis (TWEAK) enhances activation of STAT3/NLRC4 inflammasome signaling axis through PKCδ in astrocytes: implications for Parkinson’s disease. Cells 9, 1831 (2020).
Zhu, Y.-F. et al. Characteristic response of striatal astrocytes to dopamine depletion. Neural Regen. Res. 15, 724–730 (2020).
Article CAS Google Scholar
Choi, D.-J., Kwon, J.-K. & Joe, E.-H. A Parkinson’s disease gene, DJ-1, regulates astrogliosis through STAT3. Neurosci. Lett. 685, 144–149 (2018).
Article CAS Google Scholar
Zhang, J. et al. miR-let-7a suppresses α-Synuclein-induced microglia inflammation through targeting STAT3 in Parkinson’s disease. Biochem. Biophys. Res. Commun. 519, 740–746 (2019).
Article CAS Google Scholar
Qin, H. et al. Inhibition of the JAK/STAT pathway protects against α-synuclein-induced neuroinflammation and dopaminergic neurodegeneration. J. Neurosci. 36, 5144–5159 (2016).
Article CAS Google Scholar
Huang, C. et al. JAK2-STAT3 signaling pathway mediates thrombin-induced proinflammatory actions of microglia in vitro. J. Neuroimmunol. 204, 118–125 (2008).
Article CAS Google Scholar
Przanowski, P. et al. The signal transducers Stat1 and Stat3 and their novel target Jmjd3 drive the expression of inflammatory genes in microglia. J. Mol. Med. 92, 239–254 (2014).
Article CAS Google Scholar
Di Domenico, F. et al. Involvement of STAT3 in mouse brain development and sexual dimorphism: a proteomics approach. Brain Res. 1362, 1–12 (2010).
Article Google Scholar
Wegrzyn, J. et al. Function of mitochondrial Stat3 in cellular respiration. Science 323, 793–797 (2009).
Article CAS Google Scholar
Reed, D. K. & Arany, I. Sex hormones differentially modulate STAT3-dependent antioxidant responses during oxidative stress in renal proximal tubule cells. In Vivo 28, 1097–1100 (2014).
CAS Google Scholar
Heck, A. L., Thompson, M. K., Uht, R. M. & Handa, R. J. Sex-dependent mechanisms of glucocorticoid regulation of the mouse hypothalamic corticotropin-releasing hormone gene. Endocrinology 161, bqz012 (2020).
White, C. L. et al. A sexually dimorphic role for STAT3 in sonic Hedgehog medulloblastoma. Cancers 11, 1702 (2019).
Article CAS Google Scholar
Wang, M. et al. Sex differences in endothelial STAT3 mediate sex differences in myocardial inflammation. Am. J. Physiol. Endocrinol. Metab. 293, E872–E877 (2007).
Article CAS Google Scholar
Wang, M., Crisostomo, P. R., Markel, T. A., Wang, Y. & Meldrum, D. R. Mechanisms of sex differences in TNFR2-mediated cardioprotection. Circulation 118, S38–45 (2008).
CAS Google Scholar
Caetano, M. S. et al. Sex specific function of epithelial STAT3 signaling in pathogenesis of K-ras mutant lung cancer. Nat. Commun. 9, 4589 (2018).
Article Google Scholar
Nacka-Aleksić, M. et al. Sexual dimorphism in rat thymic involution: a correlation with thymic oxidative status and inflammation. Biogerontology 20, 545–569 (2019).
Article Google Scholar
You, D. J., Lee, H. Y., Taylor-Just, A. J., Linder, K. E. & Bonner, J. C. Sex differences in the acute and subchronic lung inflammatory responses of mice to nickel nanoparticles. Nanotoxicology 14, 1058–1081 (2020).
Article CAS Google Scholar
Wu, H., Lai, C.-F., Chang-Panesso, M. & Humphreys, B. D. Proximal tubule translational profiling during kidney fibrosis reveals proinflammatory and long noncoding RNA expression patterns with sexual dimorphism. J. Am. Soc. Nephrol. 31, 23–38 (2020).
Article CAS Google Scholar
Hunot, S. et al. Nuclear translocation of NF-κB is increased in dopaminergic neurons of patients with Parkinson disease. Proc. Natl Acad. Sci. USA 94, 7531–7536 (1997).
Article CAS Google Scholar
Ghosh, A. et al. Selective inhibition of NF-κB activation prevents dopaminergic neuronal loss in a mouse model of Parkinson’s disease. Proc. Natl Acad. Sci. USA 104, 18754–18759 (2007).
Article CAS Google Scholar
Mitra, S., Ghosh, N., Sinha, P., Chakrabarti, N. & Bhattacharyya, A. Alteration of nuclear factor-kappaB pathway promote neuroinflammation depending on the functions of estrogen receptors in substantia nigra after 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine treatment. Neurosci. Lett. 616, 86–92 (2016).
Article CAS Google Scholar
Kaminska, B., Mota, M. & Pizzi, M. Signal transduction and epigenetic mechanisms in the control of microglia activation during neuroinflammation. Biochim. Biophys. Acta Mol. Basis Dis. 1862, 339–351 (2016).
Article CAS Google Scholar
Laforge, M. et al. NF- κB pathway controls mitochondrial dynamics. Cell Death Differ. 23, 89–98 (2016).
Article CAS Google Scholar
Parrella, E. et al. NF-κB/c-Rel deficiency causes Parkinson’s disease-like prodromal symptoms and progressive pathology in mice. Transl. Neurodegener. 8, 16 (2019).
Article Google Scholar
Flood, P. M. et al. Transcriptional factor NF-κB as a target for therapy in Parkinson’s disease. Parkinsons Dis. 2011, 216298 (2011).
Henn, I. H. et al. Parkin mediates neuroprotection through activation of IκB kinase/nuclear factor-κb signaling. J. Neurosci. 27, 1868–1878 (2007).
Article CAS Google Scholar
Warner, N. et al. A genome-wide siRNA screen reveals positive and negative regulators of the NOD2 and NF-κB signaling pathways. Sci. Signal. 6, rs3 (2013).
Article Google Scholar
Muralimanoharan, S., Maloyan, A. & Myatt, L. Evidence of sexual dimorphism in the placental function with severe preeclampsia. Placenta 34, 1183–1189 (2013).
Article CAS Google Scholar
Muralimanoharan, S., Guo, C., Myatt, L. & Maloyan, A. Sexual dimorphism in miR-210 expression and mitochondrial dysfunction in the placenta with maternal obesity. Int. J. Obesity 39, 1274–1281 (2015).
Article CAS Google Scholar
Gaignebet, L. et al. Sex-specific human cardiomyocyte gene regulation in left ventricular pressure overload. Mayo Clin. Proc. 95, 688–697 (2020).
Article CAS Google Scholar
Ruiz-Perera, L. M. et al. NF-κB p65 directs sex-specific neuroprotection in human neurons. Sci. Rep. 8, 16012 (2018).
Article Google Scholar
Hashimoto, R. et al. Variants of the RELA gene are associated with schizophrenia and their startle responses. Neuropsychopharmacology 36, 1921–1931 (2011).
Article CAS Google Scholar
Graham, J. R., Tullai, J. W. & Cooper, G. M. GSK-3 represses growth factor-inducible genes by inhibiting NF-kappaB in quiescent cells. J. Biol. Chem. 285, 4472–4480 (2010).
Article CAS Google Scholar
Xiong, H. et al. Constitutive activation of STAT3 is predictive of poor prognosis in human gastric cancer. J. Mol. Med. 90, 1037–1046 (2012).
Article CAS Google Scholar
Chang, C.-C., Wu, M.-J., Yang, J.-Y., Camarillo, I. G. & Chang, C.-J. Leptin-STAT3-G9a signaling promotes obesity-mediated breast cancer progression. Cancer Res. 75, 2375–2386 (2015).
Article CAS Google Scholar
Durrenberger, P. F. et al. Selection of novel reference genes for use in the human central nervous system: a BrainNet Europe Study. Acta Neuropathol. 124, 893–903 (2012).
Article Google Scholar
Durrenberger, P. F. et al. Common mechanisms in neurodegeneration and neuroinflammation: a BrainNet Europe gene expression microarray study. J. Neural Transm. 122, 1055–1068 (2015).
Article CAS Google Scholar
Moran, L. B. et al. Whole genome expression profiling of the medial and lateral substantia nigra in Parkinson’s disease. Neurogenetics 7, 1–11 (2006).
Article CAS Google Scholar
Duke, D. C., Moran, L. B., Pearce, R. K. B. & Graeber, M. B. The medial and lateral substantia nigra in Parkinson’s disease: mRNA profiles associated with higher brain tissue vulnerability. Neurogenetics 8, 83–94 (2007).
Article CAS Google Scholar
Cantuti-Castelvetri, I. et al. Effects of gender on nigral gene expression and parkinson disease. Neurobiol. Dis. 26, 606–614 (2007).
Article CAS Google Scholar
Kikuchi, T. et al. Human iPS cell-derived dopaminergic neurons function in a primate Parkinson’s disease model. Nature 548, 592–596 (2017).
Article CAS Google Scholar
Devine, M. J. et al. Parkinson’s disease induced pluripotent stem cells with triplication of the α-synuclein locus. Nat. Commun. 2, 440 (2011).
Article Google Scholar
Simunovic, F. et al. Gene expression profiling of substantia nigra dopamine neurons: further insights into Parkinson’s disease pathology. Brain 132, 1795–1809 (2009).
Article Google Scholar
Zheng, B. et al. PGC-1α, a potential therapeutic target for early intervention in Parkinson’s disease. Sci. Transl. Med. 2, 52ra73 (2010).
Article Google Scholar
Corradini, B. R. et al. Complex network-driven view of genomic mechanisms underlying Parkinson’s disease: analyses in dorsal motor vagal nucleus, locus coeruleus, and substantia nigra Biomed. Res. Int. 2014, 543673 (2014).
Schulze, M. et al. Sporadic Parkinson’s disease derived neuronal cells show disease-specific mRNA and small RNA signatures with abundant deregulation of piRNAs. Acta Neuropathol. Commun. 6, 58 (2018).
Article Google Scholar
Lesnick, T. G. et al. A genomic pathway approach to a complex disease: axon guidance and Parkinson disease. PLoS Genet. 3, e98 (2007).
Article Google Scholar
Dijkstra, A. A. et al. Evidence for immune response, axonal dysfunction and reduced endocytosis in the substantia nigra in early stage Parkinson’s disease. PLoS ONE 10, e0128651 (2015).
Article Google Scholar
Fernández-Santiago, R. et al. Aberrant epigenome in iPSC-derived dopaminergic neurons from Parkinson’s disease patients. EMBO Mol. Med. 7, 1529–1546 (2015).
Article Google Scholar
Badanjak, K. et al. iPSC-derived microglia as a model to study inflammation in idiopathic Parkinson’s disease. Front. Cell Dev. Biol. 9, 3037 (2021).
Article Google Scholar
Smajić, S. et al. Single-cell sequencing of the human midbrain reveals glial activation and a Parkinson-specific neuronal state. Brain 145, 964–978 (2020).
Kamath, T. et al. Single-cell genomic profiling of human dopamine neurons identifies a population that selectively degenerates in Parkinson’s disease. Nat. Neurosci. 25, 588–595 (2022).
Article CAS Google Scholar
Shah, P., Muller, E. E. L., Lebrun, L. A., Wampach, L. & Wilmes, P. Sequential isolation of DNA, RNA, protein, and metabolite fractions from murine organs and intestinal contents for integrated omics of host–microbiota interactions. Methods Mol. Biol. 1841, 279–291 (2018).
Parkinson, H. et al. ArrayExpress - a public database of microarray experiments and gene expression profiles. Nucleic Acids Res. 35, D747–D750 (2007).
Clough, E. & Barrett, T. The Gene Expression Omnibus database. Methods Mol. Biol. 1418, 93–110 (2016).
Article Google Scholar
Barrett, T. et al. BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata. Nucleic Acids Res. 40, D57–D63 (2012).
Parkinson Progression Marker Initiative. The Parkinson Progression Marker Initiative (PPMI). Prog. Neurobiol. 95, 629–635 (2011).
Article Google Scholar
Yates, A. D. et al. Ensembl 2020. Nucleic Acids Res. 48, D682–D688 (2020).
CAS Google Scholar
Zoubarev, A. et al. Gemma: A resource for the reuse, sharing and meta-analysis of expression profiling data. Bioinformatics 28, 2272–2273 (2012).
Article CAS Google Scholar
Kauffmann, A., Gentleman, R. & Huber, W. arrayQualityMetrics - a Bioconductor package for quality assessment of microarray data. Bioinformatics 25, 415–416 (2009).
Article CAS Google Scholar
Wu, Z., Irizarry, R. A., Gentleman, R., Martinez-Murillo, F. & Spencer, F. A model-based background adjustment for oligonucleotide expression arrays. J. Am. Stat. Assoc. 99, 909–917 (2004).
Article Google Scholar
Dunning, M. J., Smith, M. L., Ritchie, M. E. & Tavaré, S. Beadarray: R classes and methods for Illumina bead-based data. Bioinformatics 23, 2183–2184 (2007).
Article CAS Google Scholar
Ritchie, M. E. et al. Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47–e47 (2015).
Article Google Scholar
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
Article CAS Google Scholar
Law, C. W., Chen, Y., Shi, W. & Smyth, G. K. Voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 15, R29 (2014).
Article Google Scholar
Lin, S. M., Du, P., Huber, W. & Kibbe, W. A. Model-based variance-stabilizing transformation for Illumina microarray data. Nucleic Acids Res. 36, e11 (2008).
Huber, W., von Heydebreck, A., Sültmann, H., Poustka, A. & Vingron, M. Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics 18, S96–S104 (2002).
Article Google Scholar
Marot, G., Foulley, J. L., Mayer, C. D. & Jaffrézic, F. Moderated effect size and P-value combinations for microarray meta-analyses. Bioinformatics 25, 2692–2699 (2009).
Article CAS Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57, 289–300 (1995).
Google Scholar
Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
Article CAS Google Scholar
Garcia-Alonso, L., Holland, C. H., Ibrahim, M. M., Turei, D. & Saez-Rodriguez, J. Benchmark and integration of resources for the estimation of human transcription factor activities. Genome Res. 29, 1363–1375 (2019).
Article CAS Google Scholar
Hernansaiz-Ballesteros, R., Holland, C. H., Dugourd, A. & Saez-Rodriguez, J. Funki: interactive functional footprint-based analysis of omics data. Bioinformatics 38, 2075–2076 (2021).
Türei, D., Korcsmáros, T. & Saez-Rodriguez, J. OmniPath: guidelines and gateway for literature-curated signaling pathway resources. Nat. Methods 13, 966–967 (2016).
Article Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS Google Scholar
Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888.e21–1902.e21 (2019).
Article Google Scholar
Hafemeister, C. & Satija, R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biol. 20, 296 (2019).
Article CAS Google Scholar
Waltman, L. & van Eck, N. J. A smart local moving algorithm for large-scale modularity-based community detection. Eur. Phys. J. B 86, 471 (2013).
Article Google Scholar
Witten, D. M. Classification and clustering of sequencing data using a Poisson model. Ann. Appl. Stat. 5, 2493–2518 (2011).
Article Google Scholar

Download references

Acknowledgements

We thank the members of the Biomedical Data Science group of the Luxembourg Centre for Systems Biomedicine (LCSB) and in particular Armin Rauschenberger for his help with the statistical analyses, Diana Hendrickx for providing feedback on the differential expression analyses, Quentin Klopfenstein for providing support for the deconvolution of the RNA-seq data, and Muhammad Ali for his help with the regulatory networks. We also thank Laurent Heirendt for providing help with respect to git and GitLab and Manuel Buttini for providing insightful comments on the manuscript. We wish to express our gratitude to Frank Middleton, Renée Miller, Ippolita Cantuti-Castelvetri, David Standaert, and Anke Dijkstra for providing additional clinical data associated with their datasets. The computational analyses presented in this paper were partly conducted using the HPC facilities at the University of Luxembourg (see http://hpc.uni.lu). EG received support from the Luxembourg National Research Fund (FNR) as part of the National Centre for Excellence in Research on Parkinson’s disease (NCER-PD, grant no. I1R-BIC-PFN-15NCER), the ERA-Net ERACoSysMed JTC-2 project PD-Strat (INTER/11651464), and the European Union’s Horizon 2020 research and innovation program under the grant no. ERAPERMED 2020-314 for the project DIGI-PD.

Author information

Authors and Affiliations

Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
Léon-Charles Tranchevent, Rashi Halder & Enrico Glaab

Authors

Léon-Charles Tranchevent
View author publications
You can also search for this author in PubMed Google Scholar
Rashi Halder
View author publications
You can also search for this author in PubMed Google Scholar
Enrico Glaab
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.-C.T.: conceptualization, methodology, software, formal analysis, investigation, data curation, writing—original draft, writing—review and editing. R.H.: investigation, resources, data curation, writing—review and editing. E.G.: conceptualization, methodology, formal analysis, investigation, writing—original draft, writing—review and editing, supervision, funding acquisition.

Corresponding author

Correspondence to Enrico Glaab.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tranchevent, LC., Halder, R. & Glaab, E. Systems level analysis of sex-dependent gene expression changes in Parkinson’s disease. npj Parkinsons Dis. 9, 8 (2023). https://doi.org/10.1038/s41531-023-00446-8

Download citation

Received: 10 June 2022
Accepted: 03 January 2023
Published: 21 January 2023
DOI: https://doi.org/10.1038/s41531-023-00446-8

This article is cited by

Unravelling cell type-specific responses to Parkinson’s Disease at single cell resolution
- Araks Martirosyan
- Rizwan Ansari
- Matthew G. Holt
Molecular Neurodegeneration (2024)
NFKB1 variants were associated with the risk of Parkinson´s disease in male
- Sergio Perez-Oliveira
- Daniel Vazquez-Coto
- Victoria Álvarez
Journal of Neural Transmission (2024)
Multiomics analysis identifies novel facilitators of human dopaminergic neuron differentiation
- Borja Gomez Ramos
- Jochen Ohnmacht
- Lasse Sinkkonen
EMBO Reports (2023)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Gene-level analysis of PD-associated molecular sex differences

Pathway-level analysis of PD-associated molecular sex differences

Regulatory network analysis

Cell-type-specific transcriptomic data analyses

Discussion

Methods

Sample preparation

Data collection

Missing value imputation

Detecting experimental batches

Age matching

Bulk data processing

Removing irrelevant probes/transcripts

Differential expression analyses

Dataset weights

Data integration

Sex-specific and candidate sex-dimorphic genes

Cellular pathway and process analyses

Regulatory network analysis

Single-cell transcriptomics data analyses

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links