The IPDGC/GP2 Hackathon - an open science event for training in data science, genomics, and collaboration using Parkinson’s disease data

Leonard, Hampton L.; Murtadha, Ruqaya; Martinez-Carrasco, Alejandro; Jama, Alina; Müller-Nedebock, Amica Corda; Gil-Martinez, Ana-Luisa; Illarionova, Anastasia; Moore, Anni; Bustos, Bernabe I.; Jadhav, Bharati; Huxford, Brook; Storm, Catherine; Towns, Clodagh; Vitale, Dan; Chetty, Devina; Yu, Eric; Grenn, Francis P.; Salazar, Gabriela; Rateau, Geoffrey; Iwaki, Hirotaka; Elsayed, Inas; Foote, Isabelle Francesca; Jansen van Rensburg, Zuné; Kim, Jonggeol Jeff; Yuan, Jie; Lake, Julie; Brolin, Kajsa; Senkevich, Konstantin; Wu, Lesley; Tan, Manuela M. X.; Periñán, María Teresa; Makarious, Mary B.; Ta, Michael; Pillay, Nikita Simone; Betancor, Oswaldo Lorenzo; Reyes-Pérez, Paula R.; Alvarez Jerez, Pilar; Saini, Prabhjyot; al-Ouran, Rami; Sivakumar, Ramiya; Real, Raquel; Reynolds, Regina H.; Hu, Ruifneg; Abrahams, Shameemah; Rao, Shilpa C.; Antar, Tarek; Leal, Thiago Peixoto; Iankova, Vassilena; Scotton, William J.; Song, Yeajin; Singleton, Andrew; Nalls, Mike A.; Dey, Sumit; Bandres-Ciga, Sara; Blauwendraat, Cornelis; Noyce, Alastair J.

doi:10.1038/s41531-023-00472-6

Download PDF

Article
Open access
Published: 04 March 2023

The IPDGC/GP2 Hackathon - an open science event for training in data science, genomics, and collaboration using Parkinson’s disease data

npj Parkinson's Disease volume 9, Article number: 33 (2023) Cite this article

2234 Accesses
3 Citations
23 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 24 May 2023

This article has been updated

Abstract

Open science and collaboration are necessary to facilitate the advancement of Parkinson’s disease (PD) research. Hackathons are collaborative events that bring together people with different skill sets and backgrounds to generate resources and creative solutions to problems. These events can be used as training and networking opportunities, thus we coordinated a virtual 3-day hackathon event, during which 49 early-career scientists from 12 countries built tools and pipelines with a focus on PD. Resources were created with the goal of helping scientists accelerate their own research by having access to the necessary code and tools. Each team was allocated one of nine different projects, each with a different goal. These included developing post-genome-wide association studies (GWAS) analysis pipelines, downstream analysis of genetic variation pipelines, and various visualization tools. Hackathons are a valuable approach to inspire creative thinking, supplement training in data science, and foster collaborative scientific relationships, which are foundational practices for early-career researchers. The resources generated can be used to accelerate research on the genetics of PD.

Alpha-synuclein research: defining strategic moves in the battle against Parkinson’s disease

Article Open access 26 July 2021

Fox Insight collects online, longitudinal patient-reported outcomes and genetic data on Parkinson’s disease

Article Open access 24 February 2020

Defining the causes of sporadic Parkinson’s disease in the global Parkinson’s genetics program (GP2)

Article Open access 12 September 2023

Introduction

An abundance of Parkinson’s disease (PD) data spanning many different modalities (genetic, transcriptomic, proteomic, epigenomic, clinical, and more) has been generated over the past few years. However, many challenges remain in effectively using and integrating this data to produce meaningful and impactful results. A lack of access to good training resources and limited connections to other researchers can delay progress. These problems are even more evident in underrepresented populations, where a lack of resources may hinder researchers and clinicians from performing necessary research on diverse ancestry populations and building local research capacity. To accelerate research, the wide adoption of open science practices is critical. Bringing researchers together to share data, code, tools, and pipelines will help reproducibility and create lower entry points to generate the results needed to make the necessary progress in PD research.

With the intention of creating pipelines and tools to facilitate PD genetics research, 49 early-career scientists from 12 countries collaborated in a virtual 3-day “Hackathon” event in May 2021. The event combined scientists from two initiatives, the International Parkinson’s Disease Genomics Consortium (IPDGC)¹ and the first resource project of the Aligning Science Across Parkinson’s (ASAP) initiative, the Global Parkinson’s Genetics Program (GP2)². IPDGC and GP2 exist to drive forward research into the genetic basis of PD, and for both initiatives, training and collaboration are vital aspects of accelerating and diversifying research efforts. Hackathons are a helpful tool that can help promote this mutual effort, providing a creative and engaging outlet to facilitate networking and team building. Hackathons have recently become popular in the health and biology domain, where the outcomes have focused on a variety of research disciplines and challenges. Some hackathons focus on the development of useful tools, such as the event hosted by DNAnexus and the Baylor College of Medicine in 2020, where participants created novel tools for structural variation and SARS-CoV-2 research³. Others focus on reaching a solution or creative idea to a research question through competition, such as the events using data from The Alzheimer’s Disease Neuroimaging Initiative (ADNI), where teams competed to predict mild cognitive impairment (MCI) conversion to Alzheimer’s Disease⁴. The first combined ‘GP2/IPDGC Hackathon 2021’ event provided teams with a choice of nine PD genetics and genomics analysis topics. The focus of the GP2/IPDGC Hackathon was not competition, but training, networking, collaboration, and the development of tools that would be useful to the broader research community.

As part of an open science initiative, the Accelerating Medicines Partnership Parkinson’s Disease Program (AMP PD, https://amp-pd.org/)⁵ aims to identify and validate biomarkers by providing researchers access to a large, harmonized dataset that includes clinical, genomic, transcriptomic, and proteomic data. GP2 has recently partnered with AMP PD to make one space where researchers can access multiple PD cohorts and a range of data with a single sign-on. Both AMP PD and GP2 use Terra⁶, a platform for researchers to access data and run analysis tools (Fig. 1). The Terra platform allows analysis to be done directly in the cloud, navigating many data sharing and research governance issues that arise from combining data from different geographical regions. Additionally, cloud analysis allows for ease in collaboration, replication of results, and addresses privacy and data use concerns by preventing the download of individual-level data. Using Terra is necessary for accessing the wealth of AMP PD and GP2 data, so many of the project topics for this hackathon involved gaining experience and building pipelines on Terra. This allowed hackathon participants to get comfortable using Terra and cloud computing and make available tools to help future researchers use Terra easily and quickly. Whether the teams employed Terra or another platform to create their tools, all were created with the idea to openly share whenever possible to any interested party to accelerate progress.

**Fig. 1: The AMP PD and GP2 collaboration framework.**

Results

Hackathon description

The “GP2/IPDGC Hackathon 2021” event included 49 participants from 12 countries: Brazil, Canada, France, Germany, India, Mexico, Norway, Russia, South Africa, Spain, Sudan, Sweden, the UK, and the United States. This event was open to all levels of experience and career stages, so participants had a range of coding and genetic analysis skills. Each of the nine teams were designed in order to spread participant experience evenly so that the skill levels were well-balanced. Participant demographics are further outlined in Table 1.

Table 1 Summary of hackathon participant demographics.

Full size table

Each of the 3 days of the event had scheduled teamwork sessions, but teams were allowed to create their own schedules outside of those sessions to accommodate the wide array of time-zone differences inherent to an international event. The event also provided scheduled training and seminar sessions. On the first day, participants were given a seminar on the use of machine learning in healthcare as well as a hands-on demonstration of Terra to prepare them for some of the hackathon project topics. On the second day, participants were given a seminar on code “health”. The first two days also included time for icebreakers and games, both within each team and across all hackathon participants, in order to build better connections and facilitate continued academic contact outside of the event. The third and final day was reserved for working to finish the projects and final presentations. Github was used to share code and applications were deployed publicly.

Hackathon outcomes

To simplify the description of the outcomes of the hackathon, the nine projects from this event have been grouped into one of three categories; genome-wide association study (GWAS)-level and post-GWAS analyses, downstream analyses of genetic variation, and data visualization. GWAS-level and post-GWAS projects were designed to be helpful to researchers looking for ways to follow up their GWAS analyses easily. The projects for this topic included efforts to create general post-GWAS and pathway and cell-type enrichment workflows and examples. Downstream analyses of genetic variation projects aimed at providing examples and pipelines for additional investigation of genetic variation, including colocalization, variant interaction, and network generation and visualization. Finally, the data visualization projects aimed to provide resources that other researchers may use to help further their research or generate hypotheses). These included a GP2 cohort browser, expansion of a PD locus browser, visualization of longitudinal PD clinical outcomes, and visualizing longitudinal and cross-variant effects. A summary of the projects and the topics they belong to, and links to available code and applications are found in Table 2. A glossary for any potentially unfamiliar terms can be found in Box 1.

Table 2 Summary of the goals and outcomes of each of the hackathon projects.

Full size table

Box 1 Glossary for unfamiliar terms

Accelerating medicines partnership Parkinson’s disease (AMP PD)	A program with the aim of deep molecular characterization and longitudinal clinical profiling of PD patient data and biosamples with the goal of identifying and validating diagnostic, prognostic, and/or disease progression biomarkers for PD.
ANNOVAR	A software tool to utilize update-to-date information to functionally annotate genetic variants.
BioFIND	An observational clinical study designed to discover and verify biomarkers of PD.
Broad-sense heritability	The proportion of phenotypic variance that can be attributed to genetic causes.
Colocalization	A process that determines whether a single variant is responsible for both the GWAS and eQTL signals in a locus.
Expressive quantitative trait locus (eQTL)	A locus that explains a fraction of the genetic variance of a gene expression phenotype.
Functional mapping and annotation (FUMA)	A platform that can be used to annotate, prioritize, visualize, and interpret GWAS results.
Genome-wide association study (GWAS)	An analysis used to identify inherited genetic variants associated with the risk of disease or a particular trait.
The Global Parkinson’s Genetics Program (GP2)	An ambitious program to genotype >150,000 volunteers around the world to further understand the genetic architecture of PD.
GREML-LDMS	Method to estimate heritability for human complex traits in unrelated individuals using whole-genome sequencing (WGS) data
Hoehn and Yahr (HY) scale	A scoring system that allows for the quantification of the different stages of PD.
The International Parkinson Disease Genomics Consortium (IPDGC)	A worldwide collaboration dedicated to the identification of both Mendelian and risk genes is important for PD.
Leiden algorithm	Network building algorithm with potential benefits over the Louvain algorithm.
Locus (plural: Loci)	The specific physical location of a gene or other DNA sequence on a chromosome.
Louvain	Network building algorithm.
Linkage disequilibrium (LD)	The nonrandom association of alleles at different loci.
Narrow-sense heritability	The fraction of phenotypic variance that can be attributed to additive genetic variation.
Parkinson’s progression markers initiative (PPMI)	A study is collaborating with partners around the world to create a robust open-access dataset and biosample library for PD.
The Parkinson’s Disease Biomarkers Program (PDBP)	A program dedicated to accelerating the pace of PD biomarkers research.
Polygenic risk score (PRS)	A measure of disease risk calculated by the cumulative effect of multiple risk variants.
Posterior probability (PP)	The statistical probability that a hypothesis is true when calculated in the light of relevant observations.
Single nucleotide polymorphism (SNP)	A variation in a single base pair in a DNA sequence.
Unified Parkinson’s disease rating scale (UPDRS)	A rating tool used to gauge the severity and progression of PD in patients.

Discussion

In the first IPDGC and GP2 joint hackathon, 49 early-career researchers from across the globe came together virtually to create the tools described here. The event lasted three days and was broken up into virtual team meetings, icebreakers and networking breaks, work sessions, and training opportunities. Each of the nine projects had teammates assigned according to their interests and skill level. At the end of the 3-day event, each team presented their progress and results. Within the following week, teams refined their Github content or deployed their apps and the final tools are now available to the public to aid future research.

As the amount of data available for disease research grows and becomes increasingly cloud-centric, public tools like these will help reduce the difficulty and time it takes to visualize, analyze, and understand this data. In order to do effective and prompt research, sharing tools and code for analyses must become the standard.

In addition to creating several helpful pipelines and visualization applications for the PD research community, this hackathon revealed the need for further documentation and training on cloud computing in the disease research field. Many of the tools created during this event are designed to be used in a cloud setting to assist new researchers in analyzing cloud-based data. However, more resources like these will be needed to ensure cloud resources can be used efficiently.

Hackathons are a valuable tool for prototyping new ideas and tools, but they are also helpful for creating new collaborative networks and working relationships among researchers. This event’s virtual setting allowed many people of different backgrounds and skill sets to work together on creative solutions. Encouraging this kind of creative thinking and creating opportunities for trainees to invest in new and necessary skills is integral to facilitating productive research.

The “GP2/IPDGC Hackathon 2021” event resulted in the creation of novel pipelines and applications designed to assist future genetics research. Many of these novel pipelines were designed for Terra, to make genetic analysis in the cloud more accessible to researchers inexperienced with cloud computing platforms. An added benefit of this event was that participants were given the opportunity to increase their analysis skills and build connections by working together on a new project. By providing training opportunities to scientists and producing original and novel applications, this event further demonstrated collaborative and international hackathons as an important tool for the scientific community.

Methods

GWAS-level and post-GWAS analyses

GWAS of PD have nominated 90 independent risk signals in individuals of European ancestry, explaining ~16–36% of the heritable risk⁷, as well as two additional risk signals in Asian populations⁸. Typically, published GWAS are accompanied by various follow-up analyses, but performing these analyses is not always straightforward. Common post-GWAS analyses include heritability estimation and polygenic risk score (PRS) calculation. Heritability analyses estimate the percentage of disease risk accounted for by common genetic variants, and PRS can be used to predict disease risk by aggregating the effects at multiple common risk loci⁹. Another follow-up approach to help interpret GWAS results is to combine Single Nucleotide Polymorphisms (SNPs) into a group of functionally related genes, such as genes belonging to a single biological pathway or cell type. This is called gene set enrichment analysis and is a widely used approach to examine the cumulative effect of SNPs in a particular biological process and determine whether there are particular pathways, processes, or cell types affected in disease.

Project 1 (post-GWAS analysis)

We used AMP PD version 1 release data to develop a Terra-based pipeline for assessing SNP-based heritability, as well as polygenic risk score calculation. Using the GREML-LDMS method¹⁰ applied to data from the AMP PD version 1 release, we estimated narrow-sense heritability (h²) to be roughly 52%, which is much higher than the typical estimate of 22%⁷, and is likely biased due to the recruitment of specific variant carriers present in the AMP PD cohort. AMP PD provides information regarding the recruitment arm. Researchers should investigate this information to determine if specific samples need to be removed from certain analyses in future work. We then used PLINK v1.9¹¹ and estimated risk effect sizes from the summary statistics of the latest PD GWAS⁷ to calculate the genetic risk scores of PD LRRK2 mutation carriers (n = 382), control LRRK2 mutation carriers (n = 275), and control individuals without PD causing mutations (n = 3435) from the AMP PD version 2.5 dataset. We tested normalized z-scores for association with LRRK2 carrier disease status. Mean ± standard deviation of unadjusted PRS scores were higher in PD LRRK2 mutation carriers (−0.0166 ± 0.004), compared to control LRRK2 mutation carriers (−0.0182 ± 0.004) and controls (−0.0180 ± 0.004), suggesting that PD LRRK2 mutation carriers share a common polygenic risk profile with idiopathic PD (iPD), contrary to control LRRK2 mutation carriers (Fig. 2a).

**Fig. 2: Results from the GWAS-level and post-GWAS analyses projects.**

Project 2 (pathway and cell enrichment pipeline)

We aimed to create a pipeline to annotate GWAS summary statistics to test the enrichment of biological pathways and cell types. Analyses were performed on the Terra platform using the most recent PD GWAS summary statistics⁷. We created a pipeline to correctly format the GWAS summary statistics for a common annotation tool Functional Mapping and Annotation of Genome-Wide Association Studies (FUMA)¹², then downloaded the formatted data and uploaded it to FUMA. We also ran the WEB-based GEne SeT AnaLysis Toolkit (WebGestalt) directly on Terra using the WebGestaltR package¹³. We selected the nearest genes to the GWAS significant SNPs (P < 5 × 10⁻⁸) from Nalls et al. 2019 GWAS summary statistics. Using WebGestaltR, we conducted an overrepresentation analysis and gene set enrichment analysis. We identified 97 unique genes from the genome-wide significant hits in the PD GWAS summary statistics. We generated summary data for these PD genes annotated by biological processes, cellular components, and molecular functions (Fig. 2b). There was no significant enrichment of any Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway gene sets in the overrepresentation analysis (FDR P < 0.05).

Downstream analyses of genetic variation

While GWAS have identified many common variants associated with complex diseases like PD, it is follow-up analyses that have started to decode GWAS results, and more downstream analysis is needed to unravel the implications of observed genetic variation in PD. Three types of analysis were the focus for the downstream analyses of the genetic variation topics, including colocalization, variant interaction, and network generation and visualization. Colocalization analysis allows the calculation and estimation of the correlation between a GWAS locus and an expression quantitative trait locus (eQTL). Variant interaction, or epistasis, is an interaction of genetic variation at two or more loci to produce a phenotypic outcome that is not predicted by the additive combination of effects attributable to the individual loci¹⁴. Its importance in humans continues to be a matter of debate^15,16, but it may explain some of the “missing heritability” underlying complex diseases such as PD^16,17,18. In addition to investigating individual variant effects with colocalization and epistasis, visualizing biological networks can help with understanding complex molecular relationships and interactions. In PD research, genetic and gene expression data has been used in community network analysis to nominate pathways and genes for drug target and functional prioritization^19,20.

Project 3 (colocalization pipeline)

Colocalization analysis takes into account five hypotheses: H0 (no association between the locus and either trait), H1 (locus has an association with first trait only), H2 (locus has an association with second trait only), H3 (locus has an association with both traits but driven by different SNPs which are not in linkage disequilibrium (LD)), H4 (locus has an association with both traits driven by same SNPs). For Project 3: Colocalization pipeline, we considered colocalization analysis with a posterior probability of colocalization in H4 (PPH4) greater than 0.8 to be significant. We utilized the coloc R package^21,22 and summary statistics from ref. ⁷. We used eQTL data from a cerebellar cortical meta-analysis of four cohorts²³, publicly available from the AMP-AD Knowledge Portal²⁴. As an example for our pipeline, we extracted the region ± 500 kb around DYRK1A, nominated in Nalls et al. 2019, from the GWAS summary statistics and eQTL data. To visualize the results, we employed the eQTpLot R package²⁵, which can generate different plots for GWAS and eQTL signal colocalization, as well as the correlation between their p values and enrichment of eQTLs among variants and LD of loci of interest, allowing efficient and intuitive visualization of gene expression and trait interaction. We used our previously generated results for DYRK1A and whole brain eQTL as an example for creating visualizations using this package. (Fig. 3a).

**Fig. 3: Results from the downstream analysis of genetic variation projects.**

Project 4 (network generations and visualization pipeline)

We sought to develop a Leiden network and subsequent visualization pipeline for transcriptomic and genomic data to identify and visualize both a priori and complex phenotype gene regulatory networks. The Leiden algorithm is one option for community detection of networks and can be faster and return more reliable results than the more well-known Louvain algorithm²⁶. We relied on the leidenalg²⁷ package in Python to produce weighted and unweighted networks on GWAS summary statistics and then visualized the resulting networks. (Fig. 3b). Data used consisted of AMP PD genomic, transcriptomic data, and public eQTL data from the eQTL catalog²⁸ and PD summary statistics from the most recent PD GWAS⁷. This project was designed as a proof-of-concept for a pipeline for detecting gene networks and relating them to PD phenotype information via GWAS summary stats.

Project 5 (variant interaction pipeline)

We developed a workflow that can be summarized as follows: (1) We utilized individual-level test data in binary format to perform data harmonization with PLINK v1.9¹¹ to ensure that the risk allele was consistent for all the variants; (2) We established a minor allele frequency (MAF) threshold >0.05 to subset variants, keeping only common genetic variation; (3) We annotated variants of interest using ANNOVAR²⁹, differentiating between coding and non-coding as well as annotated predicted gene consequence; (4) We carried out interaction analyses in R 3.6 using the glm() function and adjusting for age, gender, and the first five components. (Fig. 3c).

Data visualization

Visualization of clinical and genetic data plays an essential role in research. It can be used to inform the progress of initiatives like GP2, help researchers to view data in a meaningful way, and generate and corroborate hypotheses. As GWAS and other analyses nominate more PD risk loci, efforts to decode the role of these variants and how they interact with both longitudinal and cross-sectional phenotypes will be needed. Four projects focused on data visualization, including a GP2 cohort tracker, updates to the IPDGC locus browser, visualization of longitudinal clinical phenotypes, and visualization of longitudinal and cross-sectional variant effects.

Project 6 (GP2 cohort tracker visualization)

We designed the GP2 cohort tracker visualization to show essential information about cohorts recruited for GP2 and showcase their diversity, geographical location of enrollment, ancestry representation, and additional relevant metadata. We designed this visualization to inform progress and inspire others to contribute to this initiative. In the form of a one-page dashboard developed with the open-source Python software Streamlit, the visualization includes separate maps for complex and monogenic cohorts. It was critical to include easy-to-use search and discovery aspects built into the dashboard. If a user knows the name of a particular cohort, then they can pull up information for that cohort that populates the rest of the dashboard. The user can also filter by general methods such as cohort size or country. This design is used internally and externally on the GP2 website (https://gp2.org/cohort-dashboard/) to inform those interested in the progress of GP2’s cohort integration (Fig. 4a).

**Fig. 4: Results from the data visualization projects.**

Project 7 (IPDGC GWAS loci browser expansions)

To facilitate investigations of nominated risk variants, members of IPDGC have created a PD GWAS locus browser (https://pdgenetics.shinyapps.io/GWASBrowser/) that makes relevant statistics and datasets available to the public³⁰. Throughout the hackathon, our team continued the development of this browser through the addition of new datasets and features. To identify secondary association signals at each locus from the Nalls et al. 2019 study, we performed conditional analysis using the Genome-wide Complex Trait Analysis (GCTA) tool^31,32. Locus zoom plots were added to display the results of this conditional analysis (Fig. 4b)³³. Power calculations were done for each risk variant by Nalls et al. 2019 to determine if the findings were sufficiently powered. To do so, we followed methods used by the Genetic Association Study Power Calculator tool (https://csg.sph.umich.edu/abecasis/gas_power_calculator/), using summary statistics from Nalls et al. 2019, a disease prevalence of 0.01, and a significance level of 0.05 as input. We queried blood gene expression data included in the AMP PD version 2.5 release to measure expression levels in PD cases and controls. We obtained TPM expression at baseline for samples that had case or control status and no PD mutations in whole-genome sequencing data, leaving a total of 1710 samples. Expression data for each gene was displayed in a violin plot and added to the expression section of the browser (Fig. 4b). The literature section of the browser was updated to display a description, PubMed hit count, and word cloud plot for each gene within 1 MB of a PD risk variant. Our last addition to the browser was a display of user statistics. We used the googleAnalyticsR package³⁴ to record and visualize the number of visits for the browser and each risk variant within a period specified by the user (Fig. 4b).

Project 8 (visualization of longitudinal UPDRS/HY scores)

The Unified Parkinson’s Disease Rating Scale (UPDRS) and Hoehn-Yahr (HY) stage are two of the most common measures of the severity of PD. We set out to develop a user-friendly and adaptable app to display a diverse set of visualizations of longitudinal UPDRS/HY scores, based on data from GP2, utilizing the Streamlit library from Python. During the Hackathon, we successfully integrated data from three cohorts: Parkinson’s Progression Markers Initiative (PPMI), Parkinson’s Disease Biomarkers Program (PDBP), and BioFIND^35,36,37. We were also able to produce four different visualizations (Fig. 4c). First, we created bar graphs to visualize changes in scores from the data over time, and we added the option to include baseline patients only. Second, we created line graphs showing confidence intervals for longitudinal changes in HY and UPDRS scores. Our final visualizations were experimental, but we produced proof-of-concept visualizations with limited options. We created a Sankey graph visualization that better visualized how participants moved between different subsets of the population over time. Lastly, the fourth visualization is a Kaplan–Meier curve showing the time to reach a certain threshold within our progression scores.

Project 9 (Visualize longitudinal and cross-sectional variant effects)

We set out with the aim of creating an interactive and user-friendly web application that would allow users to (i) visualize the effect of a genetic variant across multiple cohorts using publicly available GWAS summary statistics and (ii) input their GWAS summary statistics for visualization and meta-analysis with existing data. As test data, we used a small subset of results from a study of amyloid-β levels in cerebrospinal fluid derived from healthy control individuals and individuals with PD from the PPMI dataset³⁵. Amyloid-β levels were measured at baseline and in follow-up visits; thus, results were available from both a cross-sectional and longitudinal GWAS. Using the R shiny³⁸ framework, we produced a skeleton framework for our web application, with several tabs, including (i) an “Upload data” tab where users could upload and query their data and (ii) a “Plots” tab where users could query the available test data and visualize it. In the “Plots” tab, we allowed users to query by a variant of interest, with the option to choose which biomarker(s) and cohort(s) they wished to visualize. The variant beta was visualized across cohorts and biomarkers using a forest plot, while the number of participants/observations was visualized using a bar plot (Fig. 4d). Tabs were available for cross-sectional and longitudinal plots, with the option to download the plots, and finally, data was also made available in a tabular format.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Both AMP PD and GP2 data were available for access through the AMP PD website (https://amp-pd.org/).

Code availability

All code, pipelines, and applications that were used and produced are available on GitHub via the provided links in this manuscript.

Change history

24 May 2023
A Correction to this paper has been published: https://doi.org/10.1038/s41531-023-00529-6

References

International Parkinson Disease Genomics Consortium (IPDGC). Ten years of the International Parkinson Disease Genomics Consortium: progress and next steps. J. Parkinsons. Dis. 10, 19–30 (2020).
Article Google Scholar
Global Parkinson’s Genetics Program. GP2: the global Parkinson’s genetics program. Mov. Disord. 36, 842–851 (2021).
Article Google Scholar
Mc Cartney, A. M. et al. An international virtual hackathon to build tools for the analysis of structural variants within species ranging from coronaviruses to vertebrates. F1000Res 10, 246 (2021).
Google Scholar
Toga, A. W. & Crawford, K. L. The Alzheimer’s disease neuroimaging initiative informatics core: a decade in review. Alzheimers Dement. 11, 832–839 (2015).
Article PubMed PubMed Central Google Scholar
Iwaki, H. et al. Accelerating medicines partnership: Parkinson’s disease. Genetic resource. Mov. Disord. 36, 1795–1804 (2021).
Article CAS PubMed PubMed Central Google Scholar
“Terra.” n.d. Accessed 1 March 2023. https://app.terra.bio/.
Nalls, M. A. et al. Identification of novel risk loci, causal insights, and heritable risk for Parkinson’s disease: a meta-analysis of genome-wide association studies. Lancet Neurol. 18, 1091–1102 (2019).
Article CAS PubMed PubMed Central Google Scholar
Foo, J. N. et al. Identification of risk loci for Parkinson disease in Asians and comparison of risk between Asians and Europeans: a genome-wide association study. JAMA Neurol. 77, 746–754 (2020).
Article PubMed Google Scholar
Khera, A. V. et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 50, 1219–1224 (2018).
Article CAS PubMed PubMed Central Google Scholar
Yang, J. et al. Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nat. Genet. 47, 1114–1120 (2015).
Article CAS PubMed PubMed Central Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017).
Article PubMed PubMed Central Google Scholar
Liao, Y., Wang, J., Jaehnig, E. J., Shi, Z. & Zhang, B. WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res. 47, W199–W205 (2019).
Article CAS PubMed PubMed Central Google Scholar
Fisher, R. A. The correlation between relatives on the supposition of Mendelian inheritance. Trans. R. Soc. Edin. 52, 399–433 (1918).
Article Google Scholar
Carlborg, O. & Haley, C. S. Epistasis: too often neglected in complex trait studies? Nat. Rev. Genet. 5, 618–625 (2004).
Article CAS PubMed Google Scholar
Ritchie, M. D. & Van Steen, K. The search for gene-gene interactions in genome-wide association studies: challenges in abundance of methods, practical considerations, and biological interpretation. Ann. Transl. Med. 6, 157 (2018).
Article PubMed PubMed Central Google Scholar
Ritchie, M. D., Holzinger, E. R., Li, R., Pendergrass, S. A. & Kim, D. Methods of integrating data to uncover genotype-phenotype interactions. Nat. Rev. Genet. 16, 85–97 (2015).
Article CAS PubMed Google Scholar
Bandres-Ciga, S., Diez-Fairen, M., Kim, J. J. & Singleton, A. B. Genetics of Parkinson’s disease: an introspection of its journey towards precision medicine. Neurobiol. Dis. 137, 104782 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bandres-Ciga, S. et al. Large-scale pathway specific polygenic risk and transcriptomic community network analysis identifies novel functional pathways in Parkinson disease. Acta Neuropathol. 140, 341–358 (2020).
Article CAS PubMed PubMed Central Google Scholar
Quan, P. et al. Integrated network analysis identifying potential novel drug candidates and targets for Parkinson’s disease. Sci. Rep. 11, 13154 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wallace, C. coloc: Repo for the R package coloc. (Github). (2022).
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014).
Article PubMed PubMed Central Google Scholar
Sieberts, S. K. et al. Large eQTL meta-analysis reveals differing patterns between cerebral cortical and cerebellar brain regions. Sci. Data 7, 340 (2020).
Article CAS PubMed PubMed Central Google Scholar
“AD Knowledge Portal.” n.d. Accessed 2 March 2023. https://adknowledgeportal.synapse.org/.
Drivas, T. G., Lucas, A. & Ritchie, M. D. eQTpLot: a user-friendly R package for the visualization of colocalization between eQTL and GWAS signals. BioData Min. 14, 32 (2021).
Article CAS PubMed PubMed Central Google Scholar
Traag, V. A., Waltman, L. & van Eck, N. J. From Louvain to Leiden: guaranteeing well-connected communities. Sci. Rep. 9, 1–12 (2019).
Article CAS Google Scholar
Traag, V. leidenalg: Implementation of the Leiden algorithm for various quality functions to be used with igraph in Python. (Github). (2020).
Kerimov, N. et al. A compendium of uniformly processed human gene expression and splicing quantitative trait loci. Nat. Genet. 53, 1290–1299 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
Article PubMed PubMed Central Google Scholar
Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369–375 (2012).
Article CAS PubMed PubMed Central Google Scholar
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Article CAS PubMed PubMed Central Google Scholar
Pruim, R. J. et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26, 2336–2337 (2010).
Article CAS PubMed PubMed Central Google Scholar
“Gas Power Calculator.” n.d. Accessed 2 March 2023. https://csg.sph.umich.edu/abecasis/cats/gas_power_calculator/index.html.
“Google Analytics API into R.” n.d. Accessed 2 March 2023. https://code.markedmondson.me/googleAnalyticsR/index.html.
Marek, K. et al. The Parkinson’s progression markers initiative (PPMI) - establishing a PD biomarker cohort. Ann. Clin. Transl. Neurol. 5, 1460–1477 (2018).
Article CAS PubMed PubMed Central Google Scholar
Rosenthal, L. S. et al. The NINDS Parkinson’s disease biomarkers program. Mov. Disord. 31, 915–923 (2016).
Article CAS PubMed Google Scholar
Kang, U. J. et al. The BioFIND study: characteristics of a clinically typical Parkinson’s disease biomarker cohort. Mov. Disord. 31, 924–932 (2016).
Article PubMed PubMed Central Google Scholar
Web Application Framework for R [R package shiny version 1.7.1]. (2021).

Download references

Acknowledgements

This project was supported by the Global Parkinson’s Genetics Program (GP2). GP2 is funded by the Aligning Science Against Parkinson’s (ASAP) initiative and implemented by The Michael J. Fox Foundation for Parkinson’s Research (https://gp2.org). For a complete list of GP2 members, see https://gp2.org. Data used in the preparation of this article were obtained from the AMP PD Knowledge Platform. For up-to-date information on the study, see https://www.amp-pd.org. AMP PD—a public-private partnership—is managed by the FNIH and funded by Celgene, GSK, the Michael J. Fox Foundation for Parkinson’s Research, the National Institute of Neurological Disorders and Stroke, Pfizer, Sanofi, and Verily. This research was supported in part by the Intramural Research Program of the NIH, National Institute on Aging (NIA), National Institutes of Health, Department of Health and Human Services; project numbers ZO1 AG000535 and ZO1 AG000949, as well as the National Institute of Neurological Disorders and Stroke. Panel c of Fig. 3 was created with BioRender.com. Please see the supplemental material for IPDGC and GP2 member acknowledgements.

Funding

Open Access funding provided by the National Institutes of Health (NIH).

Author information

Authors and Affiliations

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
Hampton L. Leonard, Anni Moore, Dan Vitale, Francis P. Grenn, Hirotaka Iwaki, Jonggeol Jeff Kim, Julie Lake, Mary B. Makarious, Michael Ta, Pilar Alvarez Jerez, Tarek Antar, Yeajin Song, Andrew Singleton, Mike A. Nalls, Sara Bandres-Ciga & Cornelis Blauwendraat
Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA
Hampton L. Leonard, Ruqaya Murtadha, Dan Vitale, Hirotaka Iwaki, Pilar Alvarez Jerez, Yeajin Song, Andrew Singleton, Mike A. Nalls, Sara Bandres-Ciga & Cornelis Blauwendraat
Data Tecnica International LLC, Washington, DC, USA
Hampton L. Leonard, Dan Vitale, Hirotaka Iwaki & Mike A. Nalls
German Center for Neurodegenerative Diseases (DZNE), Tübingen, Germany
Hampton L. Leonard & Anastasia Illarionova
Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, University College London, London, UK
Alejandro Martinez-Carrasco, Catherine Storm, Clodagh Towns, Lesley Wu, Manuela M. X. Tan, Mary B. Makarious & Raquel Real
Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, University College London, London, UK
Alina Jama, Devina Chetty, Zuné Jansen van Rensburg & Shameemah Abrahams
Division of Molecular Biology and Human Genetics, Department of Biomedical Sciences, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
Amica Corda Müller-Nedebock
South African Medical Research Council/Stellenbosch University Genomics of Brain Disorders Research Unit, Stellenbosch University, Cape Town, South Africa
Amica Corda Müller-Nedebock, Regina H. Reynolds & William J. Scotton
Department of Neurodegenerative Disease, University College London, London, UK
Ana-Luisa Gil-Martinez & Regina H. Reynolds
Great Ormond Street Institute of Child Health, Genetics and Genomic Medicine, University College London, London, UK
Ana-Luisa Gil-Martinez
The Ken & Ruth Davee Department of Neurology and Simpson Querrey Center of Neurogenetics, Feinberg Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
Bernabe I. Bustos
Department of Genetics and Genomic Sciences and Mindich Child Health and Development Institute, Icahn School of Medicine at Mount, Hess Center for Science and Medicine, New York, NY, 10029, USA
Bharati Jadhav
Preventive Neurology Unit, Wolfson Institute of Population Health, Queen Mary University of London, London, UK
Brook Huxford, Isabelle Francesca Foote, Jonggeol Jeff Kim, Sumit Dey & Alastair J. Noyce
Department of Human Genetics, McGill University, Montreal, QC, Canada
Eric Yu, Konstantin Senkevich & Prabhjyot Saini
The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, QC, Canada
Eric Yu & Prabhjyot Saini
INNCOSYS, Col. Morelos Second Section, 50120, Toluca de Lerdo, México
Gabriela Salazar
Institut du Cerveau - Institute of Brain and Spine (ICM), Hôpital Pitié, 47 Bd de l’Hôpital, 75013, Paris, France
Geoffrey Rateau
Faculty of pharmacy, University of Gezira, Wad Medani, P.O. Box 20, Sudan
Inas Elsayed
International Parkinson Disease Genomics Consortium (IPDGC)-Africa, University of Gezira, Wad Medani, P.O. Box 20, Sudan
Inas Elsayed
Unit for Psychological Medicine, Wolfson Institute of Population Health, Queen Mary University of London, London, UK
Isabelle Francesca Foote
Center for Advanced Parkinson Research, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02115, USA
Jie Yuan & Ruifneg Hu
Translational Neurogenetics Unit, Wallenberg Neuroscience Center, Department of Experimental Medical Science, Lund University, Lund, Sweden
Kajsa Brolin
Department of Neurology and Neurosurgery, McGill University, Montréal, QC, Canada
Konstantin Senkevich
Department of Neurology, Oslo University Hospital, Oslo, Norway
Manuela M. X. Tan
Unidad de Trastornos del Movimiento, Servicio de Neurología y Neurofisiología Clínica, Instituto de Biomedicina de Sevilla, Hospital Universitario Virgen del Rocío/CSIC/Universidad de Sevilla, Seville, Spain
María Teresa Periñán
CIBERNED, Madrid, Spain
María Teresa Periñán
South African National Bioinformatics Institute (SANBI), South African Medical Research Council Bioinformatics Unit, University of the Western Cape, Bellville, South Africa
Nikita Simone Pillay
Veterans Affairs Puget Sound Health Care System, Seattle, WA, USA
Oswaldo Lorenzo Betancor
Department of Neurology, University of Washington School of Medicine, Seattle, WA, USA
Oswaldo Lorenzo Betancor
Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de, México, Juriquilla, México
Paula R. Reyes-Pérez
Department of Pediatrics, Baylor College of Medicine, Houston, TX, 77030, USA
Rami al-Ouran
University of Southern California, Los Angeles, CA, 90007, USA
Ramiya Sivakumar
Department of Genomic Medicine, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, 44195, USA
Shilpa C. Rao & Thiago Peixoto Leal
Department of Molecular Medicine, Case Western Reserve University, Cleveland, OH, 44106, USA
Shilpa C. Rao
Department of Neurology With Friedrich Baur Institut, University Hospital of Ludwig-Maximilians-Universität München, Munich, Germany
Vassilena Iankova

Authors

Hampton L. Leonard
View author publications
You can also search for this author in PubMed Google Scholar
Ruqaya Murtadha
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro Martinez-Carrasco
View author publications
You can also search for this author in PubMed Google Scholar
Alina Jama
View author publications
You can also search for this author in PubMed Google Scholar
Amica Corda Müller-Nedebock
View author publications
You can also search for this author in PubMed Google Scholar
Ana-Luisa Gil-Martinez
View author publications
You can also search for this author in PubMed Google Scholar
Anastasia Illarionova
View author publications
You can also search for this author in PubMed Google Scholar
Anni Moore
View author publications
You can also search for this author in PubMed Google Scholar
Bernabe I. Bustos
View author publications
You can also search for this author in PubMed Google Scholar
Bharati Jadhav
View author publications
You can also search for this author in PubMed Google Scholar
Brook Huxford
View author publications
You can also search for this author in PubMed Google Scholar
Catherine Storm
View author publications
You can also search for this author in PubMed Google Scholar
Clodagh Towns
View author publications
You can also search for this author in PubMed Google Scholar
Dan Vitale
View author publications
You can also search for this author in PubMed Google Scholar
Devina Chetty
View author publications
You can also search for this author in PubMed Google Scholar
Eric Yu
View author publications
You can also search for this author in PubMed Google Scholar
Francis P. Grenn
View author publications
You can also search for this author in PubMed Google Scholar
Gabriela Salazar
View author publications
You can also search for this author in PubMed Google Scholar
Geoffrey Rateau
View author publications
You can also search for this author in PubMed Google Scholar
Hirotaka Iwaki
View author publications
You can also search for this author in PubMed Google Scholar
Inas Elsayed
View author publications
You can also search for this author in PubMed Google Scholar
Isabelle Francesca Foote
View author publications
You can also search for this author in PubMed Google Scholar
Zuné Jansen van Rensburg
View author publications
You can also search for this author in PubMed Google Scholar
Jonggeol Jeff Kim
View author publications
You can also search for this author in PubMed Google Scholar
Jie Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Julie Lake
View author publications
You can also search for this author in PubMed Google Scholar
Kajsa Brolin
View author publications
You can also search for this author in PubMed Google Scholar
Konstantin Senkevich
View author publications
You can also search for this author in PubMed Google Scholar
Lesley Wu
View author publications
You can also search for this author in PubMed Google Scholar
Manuela M. X. Tan
View author publications
You can also search for this author in PubMed Google Scholar
María Teresa Periñán
View author publications
You can also search for this author in PubMed Google Scholar
Mary B. Makarious
View author publications
You can also search for this author in PubMed Google Scholar
Michael Ta
View author publications
You can also search for this author in PubMed Google Scholar
Nikita Simone Pillay
View author publications
You can also search for this author in PubMed Google Scholar
Oswaldo Lorenzo Betancor
View author publications
You can also search for this author in PubMed Google Scholar
Paula R. Reyes-Pérez
View author publications
You can also search for this author in PubMed Google Scholar
Pilar Alvarez Jerez
View author publications
You can also search for this author in PubMed Google Scholar
Prabhjyot Saini
View author publications
You can also search for this author in PubMed Google Scholar
Rami al-Ouran
View author publications
You can also search for this author in PubMed Google Scholar
Ramiya Sivakumar
View author publications
You can also search for this author in PubMed Google Scholar
Raquel Real
View author publications
You can also search for this author in PubMed Google Scholar
Regina H. Reynolds
View author publications
You can also search for this author in PubMed Google Scholar
Ruifneg Hu
View author publications
You can also search for this author in PubMed Google Scholar
Shameemah Abrahams
View author publications
You can also search for this author in PubMed Google Scholar
Shilpa C. Rao
View author publications
You can also search for this author in PubMed Google Scholar
Tarek Antar
View author publications
You can also search for this author in PubMed Google Scholar
Thiago Peixoto Leal
View author publications
You can also search for this author in PubMed Google Scholar
Vassilena Iankova
View author publications
You can also search for this author in PubMed Google Scholar
William J. Scotton
View author publications
You can also search for this author in PubMed Google Scholar
Yeajin Song
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Singleton
View author publications
You can also search for this author in PubMed Google Scholar
Mike A. Nalls
View author publications
You can also search for this author in PubMed Google Scholar
Sumit Dey
View author publications
You can also search for this author in PubMed Google Scholar
Sara Bandres-Ciga
View author publications
You can also search for this author in PubMed Google Scholar
Cornelis Blauwendraat
View author publications
You can also search for this author in PubMed Google Scholar
Alastair J. Noyce
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

on behalf of The International Parkinson Disease Genomics Consortium (IPDGC) and The Global Parkinson’s Genetics Program (GP2)

Contributions

H.L.L., S.BC., A.J.N., M.A.N., S.D., and C.B. contributed to the concept and design of the study. H.L.L., R.M., A.MC., A.C.MN., AL.GM., A.I., A.M., B.I.B., B.J., B.H., C.S., C.T., D.V., D.C., E.Y., A.J., F.P.G., G.S., G.R., H.I., I.E., I.F.F., Z.JVR., J.J.K., J.Y., J.L., K.B., K.S., L.W., M.M.X.T., M.T.P., M.B.M., M.T., N.S.P., O.L.B., P.R.RP., P.A.J., P.S., R.AO., R.S., R.R., R.H.R., R.H., S.A., S.C.R., T.A., T.P.L., V.I., W.J.S., Y.S., A.S., M.A.N., S.D., S.BC., C.B., and A.J.N. were involved in the acquisition of data, data generation, and data cleaning. H.L.L., R.M., A.MC., A.C.MN., AL.GM., A.I., A.M., B.I.B., B.J., B.H., C.S., C.T., D.V., D.C., E.Y., A.J., F.P.G., G.S., G.R., H.I., I.E., I.F.F., Z.JVR., J.J.K., J.Y., J.L., K.B., K.S., L.W., M.M.X.T., M.T.P., M.B.M., M.T., N.S.P., O.L.B., P.R.RP., P.A.J., P.S., R.AO., R.S., R.R., R.H.R., R.H., S.A., S.C.R., T.A., T.P.L., V.I., W.J.S., Y.S., A.S., M.A.N., S.D., S.BC., C.B., and A.J.N. were involved in the creation of tools and pipelines. H.L.L., R.M., A.MC., A.C.MN., AL.GM., A.I., A.M., B.I.B., B.J., B.H., C.S., C.T., D.V., D.C., E.Y., A.J., F.P.G., G.S., G.R., H.I., I.E., I.F.F., Z.JVR., J.J.K., J.Y., J.L., K.B., K.S., L.W., M.M.X.T., M.T.P., M.B.M., M.T., N.S.P., O.L.B., P.R.RP., P.A.J., P.S., R.AO., R.S., R.R., R.H.R., R.H., S.A., S.C.R., T.A., T.P.L., V.I., W.J.S., Y.S., A.S., M.A.N., S.D., S.BC., C.B., and A.J.N. contributed to the drafting of the article and revising it critically.

Corresponding author

Correspondence to Hampton L. Leonard.

Ethics declarations

Competing interests

H.L.L., D.V., H.I., Y.S., and M.A.N. declare no competing non-financial interests, but the following competing financial interests: H.L.L., D.V., H.I., Y.S., and M.A.N.’s participation in this project was part of a competitive contract awarded to Data Tecnica International LLC by the National Institutes of Health to support open science research. M.A.N. also currently serves on the scientific advisory board for Clover Therapeutics and is an advisor to Neuron23 Inc. R.M., A.MC., A.C.MN., AL.GM., A.I., A.M., B.I.B., B.J., B.H., C.S., C.T., D.C., E.Y., A.J., F.P.G., G.S., G.R., I.E., I.F.F., Z.JVR., J.J.K., J.Y., J.L., K.B., K.S., L.W., M.M.X.T., M.T.P., M.B.M., M.T., N.S.P., O.L.B., P.R.RP., P.A.J., P.S., R.AO., R.S., R.R., R.H.R., R.H., S.A., S.C.R., T.A., T.P.L., V.I., W.J.S., A.S., S.D., S.BC., C.B., and A.J.N. declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental Material

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Leonard, H.L., Murtadha, R., Martinez-Carrasco, A. et al. The IPDGC/GP2 Hackathon - an open science event for training in data science, genomics, and collaboration using Parkinson’s disease data. npj Parkinsons Dis. 9, 33 (2023). https://doi.org/10.1038/s41531-023-00472-6

Download citation

Received: 07 July 2022
Accepted: 13 February 2023
Published: 04 March 2023
DOI: https://doi.org/10.1038/s41531-023-00472-6