Allele-specific expression of Parkinson’s disease susceptibility genes in human brain

Langmyhr, Margrete; Henriksen, Sandra Pilar; Cappelletti, Chiara; van de Berg, Wilma D. J.; Pihlstrøm, Lasse; Toft, Mathias

doi:10.1038/s41598-020-79990-9

Download PDF

Article
Open access
Published: 12 January 2021

Allele-specific expression of Parkinson’s disease susceptibility genes in human brain

Margrete Langmyhr^1,2,
Sandra Pilar Henriksen¹,
Chiara Cappelletti³,
Wilma D. J. van de Berg⁴,
Lasse Pihlstrøm¹ &
…
Mathias Toft^1,2

Scientific Reports volume 11, Article number: 504 (2021) Cite this article

3649 Accesses
10 Citations
12 Altmetric
Metrics details

Subjects

Abstract

Genome-wide association studies have identified genetic variation in genomic loci associated with susceptibility to Parkinson’s disease (PD), the most common neurodegenerative movement disorder worldwide. We used allelic expression profiling of genes located within PD-associated loci to identify cis-regulatory variation affecting gene expression. DNA and RNA were extracted from post-mortem superior frontal gyrus tissue and whole blood samples from PD patients and controls. The relative allelic expression of transcribed SNPs in 12 GWAS risk genes was analysed by real-time qPCR. Allele-specific expression was identified for 9 out of 12 genes tested (GBA, TMEM175, RAB7L1, NUCKS1, MCCC1, BCKDK, ZNF646, LZTS3, and WDHD1) in brain tissue samples. Three genes (GPNMB, STK39 and SIPA1L2) did not show significant allele-specific effects. Allele-specific effects were confirmed in whole blood for three genes (BCKDK, LZTS3 and MCCC1), whereas two genes (RAB7L1 and NUCKS1) showed brain-specific allelic expression. Our study supports the hypothesis that changes to the cis-regulation of gene expression is a major mechanism behind a large proportion of genetic associations in PD. Interestingly, allele-specific expression was also observed for coding variants believed to be causal variants (GBA and TMEM175), indicating that splicing and other regulatory mechanisms may be involved in disease development.

Exploring the genetic and genomic connection underlying neurodegeneration with brain iron accumulation and the risk for Parkinson’s disease

Article Open access 06 April 2023

A transcriptome-wide association study identifies susceptibility genes for Parkinson’s disease

Article Open access 09 September 2021

Alternative polyadenylation transcriptome-wide association study identifies APA-linked susceptibility genes in brain disorders

Article Open access 03 February 2023

Introduction

Variants altering gene expression play a critical role in human health and disease and may be particularly important in neurological disorders as small changes in the gene expression of neurons may affect disease risk^1,2. A frequent disorder of the brain is Parkinson’s disease (PD), a neurodegenerative movement disorder affecting approximately 1% of the world’s population over 60 years of age. The majority of PD patients are considered to have sporadic disease, likely caused by the cumulative effects of common or rare risk factors, each with a small increase in risk for PD development³. Recent genome-wide association studies (GWAS) have discovered a large number of genetic loci, mostly single nucleotide polymorphisms (SNPs), associated with increased risk for PD^4,5,6. Although some studies have suggested causal mechanisms for PD risk variants⁷, the functional genes responsible for most of the susceptibility loci remain unknown.

Follow-up analyses of GWAS susceptibility loci aim to establish causal mechanisms underlying the identified associated genetic variants. This is not a straight-forward task since the identified loci normally span numerous genes as a result of the complicated local linkage disequilibrium (LD) structure of many human chromosomal loci. Furthermore, most susceptibility variants map to non-coding regions of the genome, suggesting that the variants may affect the trait through alterations of gene regulation^8,9,10. GWAS loci typically contain several regulatory elements that may affect genes at some distance^11,12. Consequently, there is still much to learn regarding the precise biological mechanisms underlying GWAS associations¹³.

One approach to link functional relevance to trait-associated SNPs in GWAS is to identify genotypes that correlate significantly with the expression level of a gene, so called expression quantitative trait loci (eQTLs)^14,15. It has been reported that SNPs associated with complex diseases are more likely to be eQTLs than other SNPs¹⁶. Allelic expression profiling provides a direct way to measure the effect of cis-regulatory variation on gene expression. By measuring transcripts from each allele of a gene using a transcribed SNP as a marker to differentiate between the two mRNA copies of heterozygous individuals, the effect of trans-acting factors on gene expression is essentially removed since the output from one allele serves as a within-sample control for the other¹⁷. When allele-specific expression is detected, it indicates the presence of cis-regulatory variation in high LD with the transcribed marker used to differentiate between alleles¹⁸.

We determined the allele-specific expression of genes located in PD-associated loci in the brain and whole blood of PD patients and age-matched controls to gain insight into disease-specific molecular mechanisms. We validated the sensitivity of real-time quantitative PCR (qPCR) to detect allele-specific expression effects and analysed gene expression of PD-associated GWAS loci in post-mortem human brain. By pinpointing genes with allele-specific expression, we highlight some of the genes likely to be involved in disease pathogenesis. Interestingly, we also report that transcribed lead variants identified in GWAS are also eQTLs, suggesting that the molecular mechanism modulating disease risk of such variants may affect splicing or other regulatory functions in addition to amino acid changes in the protein structure as often expected.

Results

Identifying transcribed variants for allelic expression analysis of Parkinson’s disease-associated risk loci

In 2014, a large-scale meta-analysis of PD GWAS identified and verified 26 genomic loci associated with an increased risk of developing the disease⁵. To analyse the allelic expression of disease-associated risk variants, we selected transcribed variants to compare the relative expression of the two alleles in heterozygous individuals as outlined in Fig. 1. In brief, 2792 proxy SNPs were identified in the Broad Institute HaploReg v4.1 catalogue (last accessed December 2019), applying an LD cut-off threshold of r² ≥ 0.6. Out of the total number of lead and proxy SNPs, 86 out of 2,818 (3.1%) map to transcribed regions of 20 human RefSeq genes. Two loci, rs14235 in BCKDK and rs34311866 in TMEM175, had coding lead SNPs. We also included the coding secondary signal rs34884217 in TMEM175, whereas rs17649553 and its transcribed proxy SNPs in five genes were excluded from further analysis due to their location within a highly complex polymorphic inversion locus on chromosome 17q21. We selected one transcribed SNP per locus and excluded assays where > 50% of the cDNA samples yielded Ct values > 36 (rs2273596 in TMEM229B), as this indicates a very low level of gene expression¹⁹. We excluded the SNCA locus because the only transcribed proxy SNP in high LD with lead SNP rs356182, rs356165 (r² = 0.76), is located in the 3′-UTR of SNCA. SNCA transcripts have variable 3′-UTR lengths and rs356165 is only present in a transcript isoform that contains an extended 3′-UTR region²⁰. A rs356165 assay would therefore not be a representative assay to detect allele-specific expression for total SNCA mRNA levels, but for a sub-population. Based on these inclusion criteria, allelic expression could be determined for 13 variants in 12 selected genes (Supplementary Fig. S1). The transcribed proxy SNPs that we selected act as markers for allele-specific expression and are not the only candidate causal SNPs in the locus. Table 1 lists all the included SNPs with information on the GWAS-associated gene, the functional annotation of the location of the transcribed proxy SNP and the degree of LD between the transcribed proxy SNP and the lead SNP.

Table 1 Lead and transcribed, proxy SNPs selected for allelic expression analysis.

Full size table

Validation of genotyping assays as a sensitive method for determining allelic expression

We initially set out to test whether the standard genotyping assays from KASP (LGC Biosearch Technologies) and TaqMan (Applied Biosystems) were sensitive enough to identify small changes in allelic input. To achieve this, we tested the assays using samples that consisted of a mixture of gDNA from the two homozygotes for each SNP combined at nine fixed ratios (4:1, 2:1, 1.5:1, 1.25:1, 1:1, 1:1.25, 1:1.5, 1:2, and 1:4). A heterozygous gDNA sample for each SNP was included in each assay as a control for a 1:1 ratio of the alleles. One KASP assay for rs749670 in ZNF646 did not show the expected distribution of the ratios of allele-specific signals and was excluded from further analysis (Fig. 2a). The assay was replaced by a TaqMan assay for rs749671 in ZNF646. All remaining tested assays showed a clear distribution of the alleles at mixed ratios (Fig. 2b–k). These results demonstrate that real-time qPCR is sufficiently sensitive to detect subtle differences in allelic ratio and so is a robust method to determine allele-specific expression.

Identifying allele-specific expression at Parkinson’s disease GWAS risk loci in human brain tissue

To test whether a cis-acting PD-associated SNP affects transcript levels, we measured the relative allelic expression levels in heterozygous samples by determining the ratio of transcripts for each allele. RNA and gDNA was available from 101 human frontal cortex tissue samples from 37 PD patients and 64 age-matched controls. First, every selected transcribed SNP was genotyped, and the heterozygous samples were further analysed for allele-specific expression by quantification of the alleles in cDNA. The cDNA ratio was then normalized to the mean gDNA allelic ratio as gDNA from the heterozygous donors determine the difference between signals when the alleles are equally represented¹⁸.

The results of the allelic expression analysis for all selected PD-associated loci are summarized in Table 2, including the total number of heterozygous samples available in each assay and the overall P value per assay when testing for differences between all cDNA samples pooled against all gDNA samples on a global level. Among the 13 transcribed SNPs studied in 12 human genes, we observed significant allele-specific expression in 10 SNPs (Fig. 3a,b,d,e,g,i,j,k,l,m). The assay for rs58241213 in LZTS3 (Fig. 3g) displayed a single significant outlier. In the other nine assays, we observed a consistent imbalanced expression level of one allele compared to the other in all brain samples. Importantly, the mean difference in allelic expression in cDNA compared with gDNA for each of the ten assays was highly significant (P < 10^–5). For three out of 10 assays with allele-specific expression, the risk allele was more expressed than the alternative allele, whereas the other seven assays showed the opposite trend. However, there were no differences when comparing the results for PD patients and controls, indicating that the observed allele-specific expression is independent of disease status. For three SNPs, rs4649383 in SIPA1L2, rs199355 in GPNMB, and rs76179989 in STK39 (Fig. 3c,f,h), there was no clear allele-specific expression observed. Our allelic expression analyses provide evidence that the majority of these 13 PD-associated risk loci exert their effects through cis-regulatory mechanisms.

Table 2 Allelic expression results for 12 Parkinson’s disease genome-wide associated genes in human brain samples.

Full size table

We selected a single locus, GCH1/WDHD1, from the 10 loci that displayed allele-specific expression to take forward for a preliminary examination of the regulatory mechanism using the UCSC Genome Browser^21,22. The GCH1/WDHD1 locus contains the lead SNP rs11158026 located in the intron of GCH1 (Supplementary Fig. S2a) and the proxy SNP rs28481699 (r² = 0.71) in the 3′-UTR of WDHD1 for which we showed allele-specific expression (Fig. 3i). There are 53 additional SNPs in LD (r² ≥ 0.6) with rs11158026, although only one has an r² value over 0.9, corresponding to the proxy SNP rs3783640 that is also found in the intron of GCH1. rs3783640 is located in a region of closed chromatin across multiple brain regions and without regulatory element annotations in the UCSC Genome Browser^21,22. On the other hand, data from the ENCODE Registry of Candidate cis-Regulatory Elements (cCREs) in the human genome²³ suggests the lead SNP rs11158026 lies within a predicted distal enhancer element (Supplementary Fig. S2b, ENCODE accession ‘EH38E1715755′). Moreover, publicly available ATAC-seq data²⁴ from neuronal cells from cortex brain regions reveal accessible chromatin peaks surrounding lead SNP rs11158026 that are not present in non-neuronal cells, making rs11158026 a more likely candidate for cis-regulation of WDHD1.

Selected PD-associated loci are brain-specific eQTLs

Next, we investigated whether the effects of the cis-regulatory mechanisms we observed through allele-specific expression in brain samples were tissue-specific. We repeated five allelic expression assays in cDNA generated from whole blood. Blood samples were collected from 54 PD patients and 40 healthy controls and genotyped for the SNPs. The five assays were chosen based on the minor allele frequency (MAF) of the PD-associated SNPs to ensure a high number of heterozygous samples. The results are summarized in Table 3. All five selected SNPs displayed allele-specific expression in brain samples. For three out of five assays, rs14235 in BCKDK, rs58241213 in LZTS3, and rs2270968 in MCCC1 (Fig. 4a,d,e), the allele-specific expression in whole blood was consistent with our observations in the brain samples. The overall significance of differences in allelic expression between all cDNA samples compared with all gDNA samples were P < 10^–13 in the three assays with allele-specific expression (Table 3). For the remaining two assays, rs708723 in RAB7L1 and rs951366 in NUCKS1 (Fig. 4b,c), we did not observe allele-specific expression in whole blood and the effect appears to be brain-specific.

Table 3 Allelic expression analysis results for five Parkinson’s disease-associated genes in blood samples.

Full size table

Discussion

The GWAS era, where thousands of genetic associations have been identified without a known causal mechanism, has created an enormous potential for understanding and treating human disease. Studies that reveal regulatory mechanisms contributing to individual diseases give considerable possibilities for improving human health by enabling the development of new diagnostic markers and novel therapeutics. Current evidence suggests that cis-regulatory SNPs have causal roles in many complex human diseases^25,26. In PD, previous studies have found a non-random distribution of risk SNPs overlapping with tissue-specific putative regulatory elements²⁷. Given this background, we report in the present study that the majority of the PD genome-wide associated loci examined in prefrontal cortex tissue from patients and controls had allele-specific expression effects, emphasizing the importance of investigating variation in cis-regulation of gene expression in disease-relevant tissue. For a small subset of genes, we also identified allele-specific effects in peripheral blood cells, raising the possibility for their use as biomarkers of PD risk.

Allele-specific expression is often associated with subtle differences in transcript levels²⁸, likely a consequence of genetic and epigenetic variation in cis-regulatory DNA regions. To determine the precise molecular mechanisms underlying the allele-specific expression in all of the studied GWAS loci was beyond the scope of this paper, thus the interpretation of our findings remains open to discussion. A genetic interpretation of the results could involve DNA sequence variants that affect cis-regulatory elements, such as changes to transcription factor binding sites or microRNA binding sites resulting in altered transcription efficiency or transcript stability. Also, the allele-specific expression observed in our dataset could have an epigenetic basis, such as allele-specific methylation of DNA or allele-specific histone modifications^29,30.

The allele-specific expression of PD GWAS-associated genes provide evidence for a number of candidate causal genes for future follow-up studies. Viewing our results in the context of already published pathway analyses can help prioritize candidate causal genes for functional studies. The results presented here are in agreement with several large-scale pathway analyses that highlight the involvement of molecular processes leading to dysregulation in mitochondrial homeostasis and lysosomal dysfunction as main contributors to PD etiology^6,31,32,33. We find it interesting that two of the variants with the highest mean fold changes, rs14235 in BCKDK and rs2270968 in MCCC1, are located within genes involved in mitochondrial function. Mitochondrial impairment is a well-established pathological pathway in PD and a shared feature between the monogenic and sporadic forms of the disease³⁴. Furthermore, multiple of the 24 loci identified in the meta-analysis of PD GWAS from 2014⁵ include genes that are involved in the functions of the autophagy-lysosome-pathway³⁵. In line with our results, a previous study observed allele-specific expression of coding variants in two lysosomal genes, GBA and TMEM175³⁶. These variants are non-synonymous and were top-hit SNPs in GWAS so are believed to be the functional variants. The allele-specific effect was particularly strong for rs34884217, the secondary signal in TMEM175. TMEM175 encodes a lysosomal potassium channel and has been linked to the dysfunction of glucocerebrosidase and accumulation of α-synuclein³⁶. The rs34884217 variant encodes a Q65P amino acid substitution in the TMEM175 protein, which is predicted to be benign and without effects on TMEM175 protein function. However, there are multiple TMEM175 transcripts and rs34884217 is located in the first codon of exon 4, a potential splice acceptor site. The rs34884217 variant disrupts this potential splice site (AG to CG) and may affect how the TMEM175 transcript is spliced, leading to altered expression of the TMEM175 isoforms (Supplementary Fig. S3). We also showed allele-specific expression in the GCH1/WDHD1 locus and found preliminary evidence for a mechanistic interpretation of our results. The lead SNP in this locus, rs11158026, is located the intron of GCH1. The GCH1 gene encodes GTP cyclohydrolase 1; an essential enzyme for dopamine production in nigrostriatal cells³⁷. We found that rs11158026 is located in a genomic region marked by open chromatin and a predicted enhancer element (Supplementary Fig. S2). Although further studies are warranted to confirm the true causal variant in this locus, our results are in agreement with these reports. As we have shown that allele-specific expression occurs in WDHD1, one interpretation is that lead SNP rs11158026 results in an altered enhancer effect influencing one or more genes, not excluding a similar enhancer effect on GCH1 expression levels.

To analyse the allele-specific effects, we used the gDNA allelic ratio as a reliable internal control for transcript imbalance. Moreover, by examining the expression levels of alleles within the same biological sample, we have removed potential variation introduced by the environment³⁸. It should also be emphasized that analysing allelic expression using real-time qPCR produced highly significant data even with a limited sample size, an important restriction when working with brain disorders. Real-time qPCR removes the presence of confounding known and unknown trans-acting factors, which would have added significant variation to the expression measurements. Interestingly, our study identified novel eQTLs that are not listed in the GTEx database (https://gtexportal.org/home/, last accessed November 2019). Data in the GTEx database was generated by RNA-seq, a common method for allelic expression analysis. The difference in sensitivity between the two tools, real-time qPCR and RNA-seq, could explain why our results were not in agreement with data from the GTEx database. Furthermore, it highlights the need for studies such as ours, as valuable genetic information may be overlooked when relying solely on publicly available databases.

Mendelian randomization can be used to nominate likely causal gene candidates, taking advantage of eQTL data in bination with GWAS summary statistics. The largest meta-analysis of PD GWAS to date⁶ included a Mendelian randomization analysis where the expression or methylation of 151 (64%) of 237 genes was significantly associated with a possible causal change in PD. Notably, most loci tested contained multiple putatively causal genes. In our analysis, two genes within the PARK16 locus (rs708723 in RAB7L1 and rs951366 in NUCKS1) were tested and both showed allele-specific expression. In such cases, we cannot pinpoint a single gene, nor can we exclude the involvement of both genes in disease susceptibility. Thus, while allelic expression analysis has the potential to prioritize gene candidates in association regions containing more than one candidate gene, the method cannot separate between multiple genes when they all show allele-specific expression effects. Interestingly, although our data largely support the results of the Mendelian randomization analysis by Nalls et al.⁶, there are some notable differences. First, we included transcribed proxy SNPs from GBA, BCKDK, RAB7L1 and LZTS3, however, these genes were not included in the Mendelian randomization analysis. Also, whereas Nalls et al. report that expression changes in GPNMB is associated with PD-susceptibility⁶, we did not detect any allele-specific expression effects for this gene in our brain samples. Moreover, as previously discussed, we found that WDHD1 is an eQTL with the potential causal SNP being located in GCH1. Nalls et al. do not find significant changes in WDHD1 expression in their brain or blood samples. However, the expression of GCH1 was significantly associated with a possible causal change in PD⁶, supporting our hypothesis that the putative causal variant is located in this gene. These discrepancies may reflect the different sources of data as the Mendelian randomization analysis⁶ used expression summary statistics from GTEx, derived from RNA-seq, whereas our allelic expression analysis was generated by real-time qPCR.

The biological impact of common functional variation is cell context dependent³⁹, and SNPs that are preferentially expressed in disease-relevant cell types contribute more to risk⁴⁰. Therefore, as many traits manifest themselves only in certain tissues and cells, it is important to integrate data from the tissue of interest for the studied disease when interpreting GWAS results using gene expression as an intermediate phenotype. Neurological disorders are particularly challenging to study as brain tissue samples need to be collected post-mortem and contain a large degree of cellular heterogeneity⁴¹. Consequently, the choice of brain cell type is highly relevant when studying PD. It is reported that genes near PD-associated risk variants are enriched for expression in the brain, specifically in neurons from the substantia nigra pars compacta and the frontal cortex among others⁶. We therefore included tissue samples obtained from the medial frontal gyrus, a brain region also known to be affected by α-synuclein pathology in advanced stages of PD⁴². Furthermore, the importance of tissue choice when studying eQTLs was highlighted in a study from Hernandez and colleagues, specifically addressing whether the use of blood versus brain tissue would differ for brain phenotypes¹. They demonstrated that while some eQTLs were shared between blood and brain tissues, there were a number of examples where brain tissue was required for eQTL discovery. This correlates well with our findings, as three out of five genes showed allele-specific expression in both blood and brain tissue and two genes were brain-specific.

A major task in the post-GWAS era is to identify the disease-conferring risk gene within a disease-associated locus. Allelic expression analysis has identified multiple genes influenced by cis-regulatory SNPs^{43, 44} and our results suggest that measuring allelic expression using real-time qPCR could help prioritize functional candidate genes. Moreover, the absence of allele-specific expression weakens the evidence for the disease involvement of an examined gene. In our study, the assays for rs199355 in GPNMB, rs76179989 in STK39 and rs4649393 in SIPA1L2 did not display any allele-specific expression effects. The two former SNPs were both in very high LD (r² > 0.9) with the lead SNP, and so these genes may not be directly involved in PD pathogenesis. However, it should be mentioned that only 15 brain samples were heterozygous for the STK39 variant and available for analysis, limiting the power to detect a true difference. For SIPA1L2, the proxy SNP rs4649393 is only moderately correlated (r² = 0.64) to the lead SNP, also affecting the statistical power. This highlights two of the limitations to this approach; the dependency on transcribed proxy SNPs that meet the LD correlation threshold requirement and the availability of a sufficient number of heterozygous samples from the tissue of interest.

Additional loci have been identified in subsequent PD GWAS meta-analyses resulting in a total of 90 independent genome-wide significant signals^{4, 6}. Although the majority of these novel variants have small effect sizes, increasing the odds for PD by as little as 1.05⁶, the pipeline used in this study can be applied to highlight novel disease-associated genes. We have demonstrated that allelic expression analysis is both a time- and cost-efficient method to establish cis-regulatory effects present in susceptibility loci, provided that a transcribed proxy SNP and disease-relevant tissue material is available. This approach can therefore easily be applied when nominating functionally relevant genes in post-GWAS follow-up work for PD and other complex disorders.

Methods

DNA and RNA collection from human prefrontal cortex tissue

Post-mortem brain tissue was collected from 101 individuals and kindly received from the Netherlands Brain Bank (NBB; http://www.brainbank.nl/) and Normal Aging Brain Collection (NABCA; Amsterdam UMC—location VUMC, The Netherlands)⁴⁵. For all donors, a written informed consent for a brain autopsy and the use of the material and clinical information for research purposes had been obtained from the donor or the next of kin⁴⁶. Autopsy was performed using a standardized protocol by NBB (open access: www.brainbank.nl). The Medical Ethics Committee of the VU University Medical Centre, Amsterdam, approved all procedures of NBB and NABCA.

We retrieved information from medical records on symptoms and signs of PD as described by MDS criteria⁴⁷. The inclusion criteria for the PD patients in this study were: (1) clinical diagnosis of probable or established PD according to MDS criteria⁴⁷, (2) presence of Lewy body pathology at autopsy, and (3) cases with other major neurological or psychiatric diseases were excluded from the study. Control cases included in this study did not have any neurological or psychiatric diseases during life and their brains did not show any Lewy body, AD or vascular pathology at autopsy.

Post-mortem examination was performed by an experienced neuropathologist (Annemieke JM Rozemuller, AmsterdamUMC, location VUmc) and neuroanatomist (WvB). The presence of neuropathological hallmarks was assessed following consensus criteria for diagnosis of PD⁴⁸, Thal amyloid-β phase, Braak stage for neurofibrillary pathology and CERAD neuritic plaque scores were determined according to the National Institute on Aging—Alzheimer’s Association guidelines⁴⁹, and Braak and McKeith stages for α-synuclein were determined according to the Brain Net Europe guidelines⁴².

Fresh-frozen tissue blocks of the right medial frontal gyrus were included, as this region is affected in late stage PD⁵⁰ and was available for all cases in the retrospective study. The frozen tissue blocks were sectioned using a cryostat and only grey matter was included and isolated from 37 PD patients and age-matched 64 non-demented controls. DNA and RNA isolation were performed with the AllPrep DNA/RNA/miRNA Universal Kit (Qiagen) according to the manufacturer’s instructions. DNA and RNA concentrations were quantified using the NanoDrop spectrophotometer (Thermo Fisher Scientific). The purity and integrity of all RNA samples were assessed using the Bioanalyzer 2100 instrument (Agilent Technologies, mean RIN = 8.2, standard deviation = 1.4).

Ethical approval and consent to participate

For all donors, a written informed consent for a brain autopsy and the use of the material and clinical information for research purposes had been obtained from the donor or the next of kin⁴⁶. Autopsy was performed using a standardized protocol defined by NBB (open access: www.brainbank.nl). The Medical Ethics Committee of the VU University Medical Centre, Amsterdam, approved all procedures of NBB and NABCA.

DNA and RNA collection from whole blood

Blood samples were collected for DNA (EDTA tubes, Greiner Bio-One) and RNA (PAXgene RNA tubes, PreAnalytiX) isolation from 54 PD patients and 40 controls. DNA isolation was performed using the Maxwell 16 LEV Blood DNA Kit (Promega) according to the manufacturer’s instructions. RNA was isolated using the Maxwell 16 LEV Blood simplyRNA Kit (Promega) according to the manufacturer’s instructions. DNA and RNA concentrations were quantified using the NanoDrop spectrophotometer (Thermo Fisher Scientific).

cDNA synthesis

Total complementary DNA (cDNA) was prepared using 20 ng total RNA isolated from brain and blood samples which was reverse transcribed with the High-Capacity cDNA Reverse Transcription Kit (Applied Biosystems) according to the manufacturers’ protocol in a 20 μl reaction. A negative control (no-RT) was always included to monitor genomic DNA (gDNA) contamination in our samples.

SNP selection for allelic expression analysis

Transcribed proxy SNPs were identified from the PD GWAS meta-analysis⁵ in HaploReg v4.1 (https://pubs.broadinstitute.org/mammals/haploreg/haploreg.php, last accessed December 2019). To select appropriate markers for detecting allele-specific expression, we applied a LD correlation cut-off of r² ≥ 0.6 for transcribed, linked variants and selected data from individuals of European descent.

SNP genotyping

To identify heterozygous donor samples suitable for allelic expression measurements, genotyping of all gDNA samples from blood and brain tissues was performed using either Kompetitive Allele Specific PCR (KASP, LGC Biosearch Technologies) or TaqMan (Applied Biosystems) technology (KASP and TaqMan Genotyping assay IDs available upon request). KASP genotyping reactions were carried out using 10 ng gDNA, 2.5 μl 2 × Master Mix with low ROX (LGC Biosearch Technologies) and 0.07 μl primer probes in a 5 μl total reaction. TaqMan genotyping was carried out in a 5 μl total reaction with 10 ng genomic DNA, 2.5 μl TaqMan Genotyping Master Mix (Applied Biosystems) and 0.25 μl 20 × TaqMan assay. All genotyping reactions were performed in a MicroAmp Optical 384 well reaction plate of the ViiA7 Real-Time PCR system and analysed using the SDS program v. 2.3 (Applied Biosystems).

Sensitivity test of genotyping assays

To assess the sensitivity of the allele-specific expression assays, samples of gDNA homozygous for both alleles of the corresponding SNPs were mixed at nine ratios: 4:1, 2:1, 1.5:1, 1.25:1, 1:1, 1:1.25, 1:1.5, 1:2, and 1:4. A heterozygous gDNA sample was included as a control of the 1:1 ratio. DNA concentrations were quantified using Qubit High Sensitivity DNA spectroscopy (Invitrogen) to ensure equal sample input. The allele-specific signals of the ratios were quantified by real-time qPCR using either KASP or TaqMan genotyping assays. For KASP assays, the mixtures of ratios were analysed with 2.5 μl 2 × Master Mix with low ROX (LGC Biosearch Technologies) and 0.07 μl primer probe mix in a 5 μl total reaction. For TaqMan assays, the sample ratios were analysed in 2.5 μl TaqMan Universal Master Mix II without UNG (Applied Biosystems) with 0.25 μl 20 × primer probe mix in a 5 μl total reaction. Samples were run in triplicates on a MicroAmp Optical 384 well plate (Applied Biosystems) on the ViiA7 Real-Time PCR system (Applied Biosystems). Data was analysed by SDS v. 2.3 (Applied Biosystems).

Allelic expression assays

Allelic expression was evaluated in heterozygous samples by determining the ratio of transcript levels for each allele using cDNA samples, and then normalized to the allelic ratio for the corresponding gDNA samples. gDNA from the same heterozygous donors was included as a control because heterozygous gDNA samples should show a 1:1 allelic ratio and any unequal amplification of the alleles must therefore be corrected for. For each assay, cDNA from heterozygous samples was measured in five replicates. Allelic expression assays with KASP technology were done in a 5 μl total reaction with 2.5 μl 2 × Master Mix with low ROX (LGC Biosearch Technologies), 0.07 μl primer probes and 5 ng cDNA or with 10 ng gDNA. TaqMan reactions were carried out in a 5 μl total reaction with 2.5 μl TaqMan Universal Master Mix II without UNG (Applied Biosystems), 0.25 μl 20 × TaqMan assays and 5 ng cDNA or 10 ng gDNA. All assays included a negative control without DNA and a no-RT control and were performed in a MicroAmp optical 384 well plate on the ViiaA7 Real-Time PCR System. The SDS v.2.3 software (Applied Biosystems) determined the cycle thresholds (Ct) values.

Data analysis and statistics

To analyse the allelic expression measurements of heterozygous donors, the relative allelic expression of the two alleles was calculated as delta Ct (ΔCt) = Ct (FAM) − Ct (VIC). Then, the normalized ΔCt (nΔCt) was calculated by determining the difference between the allelic ratios (ΔCt) for cDNA and the mean ΔCt of all gDNA samples¹⁸. Samples with Ct values > 36 were excluded from further analysis as it indicates very low gene expression levels. A preliminary Shapiro–Wilk test was used to determine whether the allelic expression measurements from cDNA and gDNA were normally distributed. This preliminary test rejected the null hypothesis for normality for a selection of the assays, therefore a non-parametric test was chosen. A Mann–Whitney U test was used to identify significant differences between the ΔCt per individual cDNA sample against the mean of all gDNA samples, and to test for overall differences in the assays by combining all cDNA samples pooled against all gDNA samples. Error bars represent the standard error of the mean. Data was analysed using R Studio v. 1.1.463. P values < 0.05 were considered significant.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

References

Hernandez, D. G. et al. Integration of GWAS SNPs and tissue specific expression profiling reveal discrete eQTLs for human traits in blood and brain. Neurobiol. Dis. 47, 20–28 (2012).
Article CAS PubMed PubMed Central Google Scholar
Little, J. et al. Genetic variation associated with the occurrence and progression of neurological disorders. Neurotoxicology 61, 243–264 (2017).
Article CAS PubMed Google Scholar
Pihlstrøm, L., Morset, K. R., Grimstad, E., Vitelli, V. & Toft, M. A cumulative genetic risk score predicts progression in Parkinson’s disease. Mov. Disord. 31, 487–490 (2016).
Article PubMed Google Scholar
Chang, D. et al. A meta-analysis of genome-wide association studies identifies 17 new Parkinson’s disease risk loci. Nat. Genet. 49, 1511–1516 (2017).
Article CAS PubMed PubMed Central Google Scholar
Nalls, M. A. et al. Large-scale meta-analysis of genome-wide association data identifies six new risk loci for Parkinson’s disease. Nat. Genet. 46, 989–993 (2014).
Article CAS PubMed PubMed Central Google Scholar
Nalls, M. A. et al. Identification of novel risk loci, causal insights, and heritable risk for Parkinson’s disease: A meta-analysis of genome-wide association studies. Lancet Neurol. 18, 1091–1102 (2019).
Article CAS PubMed PubMed Central Google Scholar
Soldner, F. et al. Parkinson-associated risk variant in distal enhancer of alpha-synuclein modulates target gene expression. Nature 533, 95–99 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Farh, K. K. et al. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature 518, 337–343 (2015).
Article ADS CAS PubMed Google Scholar
Kundaje, A. et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
Article CAS PubMed PubMed Central Google Scholar
Maurano, M. T. et al. Systematic localization of common disease-associated variation in regulatory DNA. Science 337, 1190–1195 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Edwards, S. L., Beesley, J., French, J. D. & Dunning, A. M. Beyond GWASs: Illuminating the dark road from association to function. Am. J. Hum. Genet. 93, 779–797 (2013).
Article CAS PubMed PubMed Central Google Scholar
Gallagher, M. D. & Chen-Plotkin, A. S. The post-GWAS era: From association to function. Am. J. Hum. Genet. 102, 717–730 (2018).
Article CAS PubMed PubMed Central Google Scholar
Lowe, W. L. Jr. & Reddy, T. E. Genomic approaches for understanding the genetics of complex disease. Genome Res. 25, 1432–1441 (2015).
Article CAS PubMed PubMed Central Google Scholar
He, X. et al. Sherlock: Detecting gene-disease associations by matching patterns of expression QTL and GWAS. Am. J. Hum. Genet. 92, 667–680 (2013).
Article CAS PubMed PubMed Central Google Scholar
Albert, F. W. & Kruglyak, L. The role of regulatory variation in complex traits and disease. Nat. Rev. Genet. 16, 197–212 (2015).
Article CAS PubMed Google Scholar
Nicolae, D. L. et al. Trait-associated SNPs are more likely to be eQTLs: Annotation to enhance discovery from GWAS. PLoS Genet. 6, e1000888–e1000888 (2010).
Article PubMed PubMed Central CAS Google Scholar
Pastinen, T. Genome-wide allele-specific analysis: Insights into regulatory variation. Nat. Rev. Genet. 11, 533–538 (2010).
Article CAS PubMed Google Scholar
Keshari, P. K. et al. Allelic imbalance of multiple sclerosis susceptibility genes IKZF3 and IQGAP1 in human peripheral blood. BMC Genet. 17, 59 (2016).
Article PubMed PubMed Central CAS Google Scholar
Locke, J. M., Hysenaj, G., Wood, A. R., Weedon, M. N. & Harries, L. W. Targeted allelic expression profiling in human islets identifies cis-regulatory effects for multiple variants identified by type 2 diabetes genome-wide association studies. Diabetes 64, 1484–1491 (2015).
Article CAS PubMed Google Scholar
Rhinn, H. et al. Alternative alpha-synuclein transcript usage as a convergent mechanism in Parkinson’s disease pathology. Nat. Commun. 3, 1084 (2012).
Article ADS PubMed CAS Google Scholar
Haeussler, M. et al. The UCSC genome browser database: 2019 update. Nucleic Acids Res. 47, D853-d858 (2019).
Article CAS PubMed Google Scholar
Kent, W. J. et al. The human genome browser at UCSC. Genome Res. 12, 996–1006 (2002).
Article CAS PubMed PubMed Central Google Scholar
Moore, J. E. et al. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 583, 699–710 (2020).
Article ADS PubMed PubMed Central CAS Google Scholar
Fullard, J. F. et al. An atlas of chromatin accessibility in the adult human brain. Genome Res. 28, 1243–1252 (2018).
Article CAS PubMed PubMed Central Google Scholar
Emilsson, V. et al. Genetics of gene expression and its effect on disease. Nature 452, 423–428 (2008).
Article ADS CAS PubMed Google Scholar
De Gobbi, M. et al. A regulatory SNP causes a human genetic disease by creating a new transcriptional promoter. Science 312, 1215–1217 (2006).
Article ADS PubMed CAS Google Scholar
Coetzee, S. G. et al. Enrichment of risk SNPs in regulatory regions implicate diverse tissues in Parkinson’s disease etiology. Sci. Rep. 6, 30509 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Bell, C. G. & Beck, S. Advances in the identification and analysis of allele-specific expression. Genome Med. 1, 56 (2009).
Article PubMed PubMed Central CAS Google Scholar
Gaur, U., Li, K., Mei, S. & Liu, G. Research progress in allele-specific expression and its regulatory mechanisms. J. Appl. Genet. 54, 271–283 (2013).
Article CAS PubMed Google Scholar
Buonocore, F. et al. Effects of cis-regulatory variation differ across regions of the adult human brain. Hum. Mol. Genet. 19, 4490–4496 (2010).
Article CAS PubMed PubMed Central Google Scholar
Bandres-Ciga, S. et al. Large-scale pathway specific polygenic risk and transcriptomic community network analysis identifies novel functional pathways in Parkinson disease. Acta Neuropathol. 140, 341–358 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bandres-Ciga, S. et al. The endocytic membrane trafficking pathway plays a major role in the risk of Parkinson’s disease. Mov. Disord. 34, 460–468 (2019).
Article CAS PubMed PubMed Central Google Scholar
Billingsley, K. J. et al. Mitochondria function associated genes contribute to Parkinson’s Disease risk and later age at onset. NPJ Parkinson’s Dis. 5, 8 (2019).
Article Google Scholar
Park, J.-S., Davis, R. L. & Sue, C. M. Mitochondrial dysfunction in Parkinson’s disease: New mechanistic insights and therapeutic perspectives. Curr. Neurol. Neurosci. Rep. 18, 21 (2018).
Article PubMed PubMed Central CAS Google Scholar
Gan-Or, Z., Dion, P. A. & Rouleau, G. A. Genetic perspective on the role of the autophagy-lysosome pathway in Parkinson disease. Autophagy 11, 1443–1457 (2015).
Article CAS PubMed PubMed Central Google Scholar
Jinn, S. et al. TMEM175 deficiency impairs lysosomal and mitochondrial function and increases alpha-synuclein aggregation. Proc. Natl. Acad. Sci. USA 114, 2389–2394 (2017).
Article CAS PubMed PubMed Central Google Scholar
Mencacci, N. E. et al. Parkinson’s disease in GTP cyclohydrolase 1 mutation carriers. Brain 137, 2480–2492 (2014).
Article PubMed PubMed Central Google Scholar
Heap, G. A. et al. Genome-wide analysis of allelic expression imbalance in human primary cells by high-throughput transcriptome resequencing. Hum. Mol. Genet. 19, 122–134 (2010).
Article CAS PubMed Google Scholar
Lee, D. et al. A method to predict the impact of regulatory variants from DNA sequence. Nat. Genet. 47, 955–961 (2015).
Article CAS PubMed PubMed Central Google Scholar
Boyle, E. A., Li, Y. I. & Pritchard, J. K. An expanded view of complex traits: From polygenic to omnigenic. Cell 169, 1177–1186 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ramasamy, A. et al. Genetic variability in the regulation of gene expression in ten regions of the human brain. Nat. Neurosci. 17, 1418–1428 (2014).
Article CAS PubMed PubMed Central Google Scholar
Alafuzoff, I. et al. Staging/typing of Lewy body related alpha-synuclein pathology: A study of the BrainNet Europe Consortium. Acta Neuropathol. 117, 635–652 (2009).
Article CAS PubMed Google Scholar
Pastinen, T. et al. Mapping common regulatory variants to human haplotypes. Hum. Mol. Genet. 14, 3963–3971 (2005).
Article CAS PubMed Google Scholar
Ge, B. et al. Global patterns of cis variation in human cells revealed by high-density allelic expression analysis. Nat. Genet. 41, 1216–1222 (2009).
Article CAS PubMed Google Scholar
Jonkman, L. E. et al. Normal Aging Brain Collection Amsterdam (NABCA): A comprehensive collection of postmortem high-field imaging, neuropathological and morphometric datasets of non-neurological controls. Neuroimage Clin. 22, 101698 (2019).
Article PubMed PubMed Central Google Scholar
Klioueva, N. M. et al. BrainNet Europe’s code of conduct for brain banking. J. Neural Transm. 122, 937–940 (2015).
Article PubMed Google Scholar
Postuma, R. B. et al. MDS clinical diagnostic criteria for Parkinson’s disease. Mov. Disord. 30, 1591–1601 (2015).
Article PubMed Google Scholar
Dickson, D. W. et al. Neuropathological assessment of Parkinson’s disease: Refining the diagnostic criteria. Lancet Neurol. 8, 1150–1157 (2009).
Article CAS PubMed Google Scholar
Hyman, B. T. et al. National Institute on Aging-Alzheimer’s Association guidelines for the neuropathologic assessment of Alzheimer’s disease. Alzheimers Dement. 8, 1–13 (2012).
Article PubMed PubMed Central Google Scholar
Braak, H. & Braak, E. Diagnostic criteria for neuropathologic assessment of Alzheimer’s disease. Neurobiol. Aging 18, S85–S88 (1997).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank Tone Berge and Steffan Bos-Haugen for helpful advice. We thank Angela Ingrassia and Yvonne de Gier for processing the brain tissue samples for DNA and RNA isolation. This work was supported by grants from the South-Eastern Norway Regional Health Authority (Project No. 2016057 to MT), Research Council of Norway (Project No. 250597 to MT) and Norwegian Health Association (to LP).

Author information

Authors and Affiliations

Department of Neurology, Oslo University Hospital, Nydalen, P.O. Box 4956, 0424, Oslo, Norway
Margrete Langmyhr, Sandra Pilar Henriksen, Lasse Pihlstrøm & Mathias Toft
Institute of Clinical Medicine, University of Oslo, Oslo, Norway
Margrete Langmyhr & Mathias Toft
Department of Mechanical, Electronic and Chemical Engineering, OsloMet – Oslo Metropolitan University, Oslo, Norway
Chiara Cappelletti
Department of Anatomy and Neurosciences, Section Clinical Neuroanatomy and Biobanking, Amsterdam Neuroscience, Amsterdam UMC, Location VU Medical Center, Amsterdam, The Netherlands
Wilma D. J. van de Berg

Authors

Margrete Langmyhr
View author publications
You can also search for this author in PubMed Google Scholar
Sandra Pilar Henriksen
View author publications
You can also search for this author in PubMed Google Scholar
Chiara Cappelletti
View author publications
You can also search for this author in PubMed Google Scholar
Wilma D. J. van de Berg
View author publications
You can also search for this author in PubMed Google Scholar
Lasse Pihlstrøm
View author publications
You can also search for this author in PubMed Google Scholar
Mathias Toft
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.L., L.P., and M.T. contributed to conception and design of the study. M.L., L.P., C.C., S.P.H., W.v.d.B., and M.T. contributed to acquisition, analysis and/or interpretation of data. M.L. and M.T. drafted the manuscript. All authors contributed to editing and approval of the final draft.

Corresponding author

Correspondence to Mathias Toft.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Langmyhr, M., Henriksen, S.P., Cappelletti, C. et al. Allele-specific expression of Parkinson’s disease susceptibility genes in human brain. Sci Rep 11, 504 (2021). https://doi.org/10.1038/s41598-020-79990-9

Download citation

Received: 12 June 2020
Accepted: 10 December 2020
Published: 12 January 2021
DOI: https://doi.org/10.1038/s41598-020-79990-9

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Exploring the genetic and genomic connection underlying neurodegeneration with brain iron accumulation and the risk for Parkinson’s disease

A transcriptome-wide association study identifies susceptibility genes for Parkinson’s disease

Alternative polyadenylation transcriptome-wide association study identifies APA-linked susceptibility genes in brain disorders

Introduction

Results

Identifying transcribed variants for allelic expression analysis of Parkinson’s disease-associated risk loci

Validation of genotyping assays as a sensitive method for determining allelic expression

Identifying allele-specific expression at Parkinson’s disease GWAS risk loci in human brain tissue

Selected PD-associated loci are brain-specific eQTLs

Discussion

Methods

DNA and RNA collection from human prefrontal cortex tissue

Ethical approval and consent to participate

DNA and RNA collection from whole blood

cDNA synthesis

SNP selection for allelic expression analysis

SNP genotyping

Sensitivity test of genotyping assays

Allelic expression assays

Data analysis and statistics

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links