It has been almost two decades since the first genetic form of Parkinson’s disease (PD), a point mutation in the α-synuclein gene (SNCA), was identified.1 Remarkable technological advancements in recent times have accelerated genetic discovery for PD and nominated potential molecular pathways for therapeutic intervention strategies. To date, pathogenic mutations in six genes (SNCA, LRRK2, VPS35, PARKIN, PINK1 and DJ-1) have been confirmed to result in carriers presenting the clinical phenotype associated with PD, with the vast majority of these reporting a family history of the disease.1 Furthermore, a recent large-scale meta-analysis of population-based genome-wide association studies (GWAS) has identified 24 independent genomic loci (including the familial genes SNCA and LRRK2) that influence susceptibility to the common sporadic form of PD.2

The loci nominated through GWAS approaches tend to be large genomic regions defined by patterns of linkage disequilibrium (LD) and contain multiple candidate genes. Thus, identifying the variant (or gene) responsible for modifying disease susceptibility within these loci remains challenging. In the previous issue, Pihlstrøm et al.3 attempt to fine-map and resequence the PARK16 locus in patient populations from Scandinavia. PARK16 was one of the first loci nominated through GWAS efforts in both Asian and European cohorts.4, 5 Although replications in Asian populations were consistently supportive, studies in Caucasian populations were initially controversial.3 The recent large-scale meta-GWAS in Caucasian samples has replicated the association and would appear to confirm this disease locus across ethnicities.2 From these studies it would appear that PARK16 contains five candidate genes (SLC45A3, NUCKS1, RAB7L1, SLC41A1 and PM20D1) and it remains unclear which specific gene and/or variant(s) is responsible for the observed association.

The authors had previously attempted to replicate the association at the PARK16 locus using three single-nucleotide variants (rs947211, rs823128 and rs823156) in these populations and no association was observed.6 However, in the present study the authors report a different haplotype spanning the transcription start site for RAB7L1 (rs974211-rs1572931-rs1775143) associating with disease risk (P=0.046). Pihlstrom and colleagues7 also replicated a recently nominated epistatic interaction between this haplotype and a LRRK2 GWAS association with the variant rs1491942. Targeted resequencing of the coding regions of the candidate genes identified a low-frequency risk variant in PM20D1 (rs141605758); however, after further genotyping in a larger case–control cohort, there was no convincing evidence of pathogenicity. From these data the authors conclude that RAB7L1 is the most compelling candidate in the PARK16 locus in the Scandinavian population.

The thrust of the study and the level of allelic heterogeneity observed in replication attempts suggest the existence of different functional variants or one variant of independent origin within the PARK16 locus. The most recent large-scale meta-GWAS has indicated an association at the 3′ untranslated region of RAB7L1,2 whereas the strongest association in the initial Caucasian GWAS study was in NUCKS1 (rs823121), upstream of RAB7L1.5 Similarly, the strongest association in the Japanese population also lies upstream of RAB7L1 (rs947211),4 whereas in the Chinese population both downstream (rs823156) and upstream (rs823128) polymorphisms of RAB7L1 were most significantly associated with PD risk.8 These differences suggest that elucidating the functional variant for PARK16 may be particularly challenging as the associations appear dependent on population-specific LD structure and thus different populations may harbor separate risk factors. This scenario means that targeted resequencing of the entire genomic locus in homogenous populations such as described may prove fruitful in resolving the functional variant driving risk.

Differences in population structure are highlighted by Pihlstrom et al. within the Scandinavian population as the risk alleles in RAB7L1 (rs1775143) had a larger effect size in Sweden (odds ratio (OR) 0.76) compared with Norway (OR 0.91). Perhaps, conditional analysis with multiple common variants across PARK16 will help reveal independent association signals for each population. In addition to the genetic evidence, RAB7L1 is a very attractive biological candidate; the protein is part of the endosomal–lysosomal pathway and has been shown to interact with both LRRK2 and VPS35.7, 9 RAB7L1 localizes to the trans-Golgi network, and retromer/vesicular trafficking has become an emerging pathway since the implication of VPS35 and DNAJC13 in PD.10, 11

Although the causal variant in PARK16 is yet to be identified, it does not seem to be a simple amino-acid change: no mutations in the coding regions of the candidate genes that could account for the observed association were observed by Pihlstrom and colleagues. This suggests that the mutation is likely within a noncoding region of the locus and would support the application of next-generation sequencing approaches to tease out the functional variant(s). Noncoding variation can affect splice sites, transcription factor-binding sites or methylation sites, modulating splicing and gene expression, which is difficult to identify without appropriate tissue types and functional readouts. Identifying the truly functional variants at this locus and others may prove difficult. Nonetheless, at PARK16, the genetic association of RAB7L1 and the biological relevance of the protein in the pathogenesis of PD are becoming more convincing and further investigation to define the pathogenic mechanism is warranted.