Multimodal phenotypic axes of Parkinson’s disease

Markello, Ross D.; Shafiei, Golia; Tremblay, Christina; Postuma, Ronald B.; Dagher, Alain; Misic, Bratislav

doi:10.1038/s41531-020-00144-9

Download PDF

Article
Open access
Published: 05 January 2021

Multimodal phenotypic axes of Parkinson’s disease

npj Parkinson's Disease volume 7, Article number: 6 (2021) Cite this article

6340 Accesses
21 Citations
19 Altmetric
Metrics details

Subjects

Abstract

Individuals with Parkinson’s disease present with a complex clinical phenotype, encompassing sleep, motor, cognitive, and affective disturbances. However, characterizations of PD are typically made for the “average” patient, ignoring patient heterogeneity and obscuring important individual differences. Modern large-scale data sharing efforts provide a unique opportunity to precisely investigate individual patient characteristics, but there exists no analytic framework for comprehensively integrating data modalities. Here we apply an unsupervised learning method—similarity network fusion—to objectively integrate MRI morphometry, dopamine active transporter binding, protein assays, and clinical measurements from n = 186 individuals with de novo Parkinson’s disease from the Parkinson’s Progression Markers Initiative. We show that multimodal fusion captures inter-dependencies among data modalities that would otherwise be overlooked by field standard techniques like data concatenation. We then examine how patient subgroups derived from the fused data map onto clinical phenotypes, and how neuroimaging data is critical to this delineation. Finally, we identify a compact set of phenotypic axes that span the patient population, demonstrating that this continuous, low-dimensional projection of individual patients presents a more parsimonious representation of heterogeneity in the sample compared to discrete biotypes. Altogether, these findings showcase the potential of similarity network fusion for combining multimodal data in heterogeneous patient populations.

Comprehensive subtyping of Parkinson’s disease patients with similarity fusion: a case study with BioFIND data

Article Open access 17 September 2021

Matthew Brendel, Chang Su, … Fei Wang

Multi-modality machine learning predicting Parkinson’s disease

Article Open access 01 April 2022

Mary B. Makarious, Hampton L. Leonard, … Mike A. Nalls

Identification and prediction of Parkinson’s disease subtypes and progression using machine learning in two cohorts

Article Open access 16 December 2022

Anant Dadu, Vipul Satone, … Faraz Faghri

Introduction

Individuals with Parkinson’s disease (PD) present with a range of symptoms, including sleep, motor, cognitive, and affective disturbances¹. This heterogeneity is further complicated by individual differences in the age of disease onset and the rate of pathological progression^2,3. Most attempts to resolve heterogeneity in PD rely on clustering or subtyping of patients based solely on clinical-behavioral assessments. While these efforts have shown that it is possible to stratify patients into clinically meaningful categories with considerable predictive utility^4,5; cf.⁶, clinical measures do not directly measure the underlying pathophysiology of PD⁷. Given the prominence of synucleinopathy^8,9, dopamine neuron dysfunction¹⁰, and distributed grey matter atrophy in PD^11,12,13, it is increasingly necessary to develop biologically informed biotypes by integrating multiple sources of evidence in addition to clinical assessments.

Modern technological advances and data-sharing efforts increasingly permit deep phenotyping in large samples of patients, making simultaneous behavioral assessments, physiological measurements, genetic assays, and brain imaging available at unprecedented scales¹⁴. A principal challenge is to parsimoniously integrate these multi-view data in order to take full advantage of each source of information¹⁵. How to account for multiple sources of data to characterize heterogeneous patient samples is a topic of significant interest in computational medicine^16,17, with important applications for oncology^18,19, psychiatry^20,21,22, and neurology⁷. Indeed, recent advances in techniques like multiple kernel learning have yielded promising results for integrating disparate data modalities in the context of supervised and unsupervised problems^23,24. More broadly, there exist many families of techniques for investigating multi-view data, including multiple kernel learning, matrix factorization, and deep learning, that have been increasingly used in recent years to tackle issues of data integration²⁵. Yet, the most commonly employed technique—to simply concatenate data modalities—ignores the structure inherent in individual modalities, potentially yielding biased estimates^19,25.

In the present report, we seek to generate a comprehensive, multimodal characterization of PD. We apply an unsupervised learning technique, similarity network fusion (SNF¹⁹), to integrate data from four data modalities in n = 186 individuals with de novo PD from the Parkinson’s Progression Markers Initiative database¹⁴. Using structural T1-weighted magnetic resonance imaging (MRI), clinical-behavioral assessments, cerebrospinal fluid assays, and single-photon emission computed tomography (SPECT) data we generate patient similarity networks and combine them via an iterative, non-linear fusion process. We demonstrate that SNF yields a more balanced representation of multimodal patient data than standard techniques like data concatenation. We use the patient network generated by SNF to reveal putative PD biotypes, and examine how neuroimaging data contributes to cluster definition and patient discriminability. Finally, we explore how a continuous low-dimensional representation of the fused patient network yields individual estimates of PD patient pathology.

Results

Analytic overview

Complete data were obtained for n = 186 patients from the Parkinson’s Progression Markers Initiative database¹⁴. Data sources included (1) cortical thickness, (2) subcortical volume, (3) clinical-behavioral assessments, (4) dopamine activate transporter (DAT) binding scans, and (5) cerebrospinal fluid assays. Although cortical thickness and subcortical volume are both derived from the same data modality (i.e., anatomical, T1-weighted MRI scans), they are estimated using different algorithms so we retain them as separate sources. For more detailed information on data collection and estimation of derivatives please see “Methods”.

We combine these multimodal data using SNF, which first constructs patient similarity networks for each modality and then iteratively fuses the networks together (Fig. 1a). Both the construction of patient similarity networks and the fusion process of SNF are governed by two free hyperparameters. The first parameter, K, controls the size of the patient neighborhoods to consider when generating the similarity networks: smaller values of K will result in more sparsely connected networks, while larger values will generate denser networks. The second parameter, μ, determines the weighting of edges between patients in the similarity network: small values of μ will keep only the strongest edges in the patient networks, while larger values will retain a wider distribution of edge weights. We use the fused networks to generate categorical representations of patient data via spectral clustering and continuous representations via diffusion map embedding (Fig. 1b–d). The remainder of the reported results examines the utility of these representations in estimating patient pathology and disease severity.

**Fig. 1: Similarity network fusion analysis pipeline.**

Similarity network fusion provides a viable alternative to data concatenation

As the current field standard, data concatenation seems an intuitively appealing method for examining multimodal patient data. In a concatenation framework, features from all data modalities are joined together to create a single patient by feature matrix which is then converted into a patient similarity network using an affinity kernel (e.g., cosine similarity, a radial basis function). This approach is particularly convenient in that it is almost entirely data-driven, with no free parameters beyond the choice of affinity kernel. However, data concatenation tends to suffer from the curse of dimensionality²⁶. That is, modalities with many features—regardless of their relative importance—tend to dominate the resulting patient network, obscuring information in modalities with fewer features (Fig. 2a). This is of particular concern when integrating metrics derived from MRI images, which often have tens or hundreds of times more features than lower-dimensionality data sources.

**Fig. 2: Comparison of data concatenation and SNF for integrating data.**

Similarity network fusion, on the other hand, does not necessarily suffer from such issues. In an SNF framework, patient similarity networks are generated separately for each data modality and then iteratively fused together to create a single, multimodal patient network (Fig. 2a). Converting to patient networks prior to fusing across sources reduces the likelihood of biasing results towards higher-dimensionality data, potentially yielding more balanced representations of the input data¹⁹. However, SNF is governed by two free hyperparameters, K and μ, demanding greater computational complexity to avoid arbitrary selection.

To investigate the extent to which SNF is a plausible alternative to data concatenation we generated patient networks using both techniques, varying the dimensionality of cortical thickness data across five increasingly high-resolution subdivisons of the Desikan–Killiany parcellation (ranging from 68 to 1000 features; see “Methods: Cortical thickness”)²⁷.

Rather than selecting a single combination of parameters to combine data modalities in SNF we conducted an exhaustive parameter search, performing the fusion for 10,000 combinations of K and μ (100 values for each parameter; Fig. 2b). We subjected the resulting networks to spectral clustering with a two-, three-, and four-cluster solution^28,29. We then integrated the resulting 30,000 clustering assignments using an adapted consensus clustering approach, previously described in refs. ³⁰ and³¹ (see Fig. 1c or “Methods: Consensus clustering for more details”). Briefly, we assessed the stability of the clustering solutions in parameter space, retaining only those solutions above the 95th percentile of stability, and used the resulting solutions (n = 1262) to create a co-assignment probability matrix which was thresholded and partitioned to generate a set of “consensus” assignments^30,32.

To assess the relative contribution of the individual data modalities to the concatenation- and SNF-derived clustering assignments we also clustered unimodal patient networks, created separately for each data source (n = 5). We compared the similarity between uni- and multimodal clustering assignments using normalized mutual information scores (NMI³³), and the goodness-of-fit of clustering assignments to patient networks using modularity³⁴. To ensure that the results of the two data integration techniques were more directly comparable, we only used the clustering assignments from concatenated data that had the same number of clusters as those derived via the consensus SNF approach.

We find that at even the lowest dimensionality (i.e., 68 features) patient networks generated using concatenation are dominated by information from cortical thickness data (Fig. 2b). That is, NMI scores show high overlap between clustering assignments derived from only cortical thickness data and assignments generated using concatenated data (average NMI = 0.77 ± 0.25 [0.40–1.00]). Indeed, at the highest cortical thickness dimensionality (1000 features) the clustering assignments for the cortical thickness and concatenated data are identical, suggesting the concatenated patient networks are discarding information from all other lower-dimensional data sources. Estimates of modularity are slightly higher for patient networks derived from only cortical thickness data than from other datatypes, but remain low for all data modalities (average modularity = 0.02 ± 0.02 [0.00–0.06]).

On the other hand, SNF appears much more stable to changes in data dimensionality, with NMI scores and modularity estimates more evenly distributed across data modalities (NMI = 0.15 ± 0.18 [0.01–0.50]; modularity = 0.05 ± 0.06 [0.00–0.17]). While this does not necessarily imply that the generated clustering assignments are meaningful, it does suggest that SNF provides a more balanced representation of the input data than simple concatenation, opening the door for a more holistic assessment of their clinical relevance to patient pathology. Moreover, SNF achieves good discriminability between healthy controls and PD patients (Supplementary Fig. 1), suggesting relevant clinical utility. We chose to examine the fused patient networks generated from the highest (i.e., 1000 node) resolution cortical thickness data for further analyses.

Derived patient biotypes are clinically discriminable across modalities

Consensus clustering of the multimodal SNF-derived patient networks yielded three clusters—or “biotypes”—of n = 72, 69, and 45 individuals (Fig. 3a–c) with strong goodness-of-fit (modularity = 0.494, p < 0.001 by permutation). There were no significant inter-group differences for sex (p = 0.32), age (p = 0.30), education (p = 0.77), symptom duration (p = 0.83), recruitment site (p = 0.20), or MRI scanner strength (p = 0.38) (see Supplementary Table 1 for summary demographics of each cluster).

**Fig. 3: Patient biotypes discriminate multimodal disease severity.**

To assess the clinical relevance of these biotypes we performed a series of univariate one-way ANOVAs for all 1050 input features (see Supplementary Table 2 for a complete list). We found 31 features that significantly distinguished patient groups from one another (false discovery rate [FDR] corrected, q < 0.05; the most discriminating feature from each data modality are shown in Fig. 3d and a full list is available in Supplementary Table 3). Though biotypes showed limited differences in clinical-behavioral assessments using baseline data, longitudinal analyses of subgroup affiliation revealed differentiation over time in clinical measurements frequently used for PD prognosis (Fig. 3e; refer to Supplementary Table 4 for model estimates). Longitudinal differences were less pronounced when biotypes were defined with only baseline clinical data or concatenated data (Supplementary Fig. 2; Supplementary Table 5). Note that while increases in both tremor and postural instability/gait difficulty (PIGD) scores are indicative of clinical severity, higher PIGD (and lower tremor) scores are often found to be related to a more rapidly progressing and severe manifestation of PD³⁵.

Broad examination of the DAT binding, CSF, and clinical assessment data suggests that the three biotypes may separate along a single dimension of PD severity, where group one represents a more severe, group two an intermediate, and group three a more mild phenotype; however, differences in the neuroimaging data reveal this delineation is less straightforward. For instance, individuals in group two—the ostensible “intermediate” clinical biotype—also tend to have greater subcortical volume than individuals in the “mild” clinical group, especially in brain regions typically prone to degeneration in early PD such as the substantia nigra (F(2, 183) = 40.22, p = 8.70 x 10⁻¹³; Fig. 3d). That is, the assignment of cluster labels indicative of disease severity are incapable of capturing all phenotypic aspects of PD.

To further investigate the discrepancy between the clinical and neuroimaging data, we calculated patient scores for a putative neuroimaging biomarker—the “PD-ICA atrophy network”—recently shown to have relevance to both diagnostic PD disease severity and longitudinal prognosis³⁶ (Fig. 4c). While the PD-ICA biomarker was generated from the same dataset examined in the current study, it was derived via metrics which we excluded from our SNF analyses (i.e., whole-brain deformation-based morphometry [DBM] estimates). In line with results from the neuroimaging data included in SNF, we found significant differentiation of PD-ICA atrophy scores between biotypes (F(2, 183) = 5.70, p = 0.004), largely driven by lower atrophy in the “intermediate” compared to the “mild” group (post-hoc Tukey test, p < 0.05). These observed discrepancies between MRI-derived metrics and other modalities raise the question: how important is MRI brain imaging to characterizing PD pathology?

**Fig. 4: Embedded patient dimensions capture relevant clinical information.**

Neuroimaging data is critical to patient biotype characterization

Although one benefit of SNF is the ability to seamlessly integrate neuroimaging data with clinical assessments, high-resolution MRI scans can be both costly and inconvenient for PD patients. Most previous biotyping and patient classification studies in PD have focused solely on clinical-behavioral assessments^4,5, raising the possibility that an adequate solution can be identified without MRI scans. Thus, we investigated whether a similar patient characterization can be identified using a reduced dataset excluding MRI data.

Data from clinical-behavioral assessments, cerebrospinal fluid assays, and subcortical DAT binding scans were used to generate fused patient networks following the same procedures described above (see “Similarity network fusion provides a viable alternative to data concatenation”). Consensus clustering of the networks yielded three subgroups of n = 83, 59, and 44 individuals, showing moderate overlap with biotypes defined on the full dataset (NMI = 0.43).

Despite this similarity, it is possible these “no-MRI” biotypes would result in a different characterization of PD disease severity than the previously described biotypes. To assess this, we applied the same univariate one-way ANOVA framework to examine which of the original 1050 features were discriminable between the subgroups generated without MRI data. This revealed only five features (from two modalities) that significantly distinguished these patient subgroups (FDR-corrected, q < 0.05): DAT binding in bilateral caudate and putamen and clinical scores on the the Unified PD rating scale, part II (UPDRS-II³⁷). Comparing these features with the complementary set derived from all the data (Supplementary Table 3) highlights the reduced discriminability of these “no-MRI” subgroups.

Given that the reduced dataset contains relatively few (i.e., 34) features, it is possible that SNF may be “over-engineering” a solution that could be better achieved with alternative clustering techniques. To address this possibility, we reproduce a previously published clustering solution of PD patients that does not include features derived from MRI data and compare the results of this technique to SNF (see “Supplementary Results: Comparing clustering techniques”). We find that, as above, this clustering solution fails to yield subgroups with meaningful differences for any neuroimaging metrics.

Taken together, these results support the notion that excluding MRI data from PD clustering yields patient subgroups that fail to capture significant differences in PD pathophysiology. Nevertheless, the value of neuroimaging data in defining patient subgroups does not explain the inconsistencies previously observed between neuroimaging metrics and clinical data in the patient biotypes. In the early stages of PD, pathophysiology may not perfectly align with clinical symptomatology; that is, hard partitioning into a small number of discrete biotypes may be an over-simplification of the disease.

Patient biotypes separate along continuous dimensions of severity

To examine the possibility that PD pathology is multidimensional, we generated a continuous low-dimensional representation of our fused patient networks with diffusion embedding^38,39 (see Supplementary Fig. 3 for a comparison with PCA on concatenated data). Diffusion embedding attempts to find dimensions, often referred to as gradients or components, of a network that encode the dominant differences in patient similarity—akin to principal components analysis (PCA) or multidimensional scaling (MDS). The resulting components are unitless and represent the primary axes of inter-subject similarity. Critically, diffusion embedding is sensitive to complex, non-linear relationships and is relatively robust to noise perturbations compared to other dimensionality reduction methods^38,39.

Patient networks for all SNF hyperparameter combinations used in the consensus clustering (n = 1262 networks) were decomposed with diffusion embedding and realigned using a generalized orthogonal Procrustes analysis. The resulting aligned embeddings were then averaged to generate a single, embedded space (Fig. 1d). Examining the extent to which patient clusters differentiated in embedded space revealed limited overlap in cluster affiliation among the first two dimensions (Fig. 4a, b).

The first two dimensions of the embedded space correlated most strongly with patient variability in DAT binding of the left caudate (dimension one: r = 0.81; Fig. 4c, left panel) and volume of the subthalamic nucleus (dimension two: r = 0.63; Fig. 4c, right panel). Examining further dimensions revealed significant relationships with features from all data modalities (Fig. 4f, g). We also investigated the extent to which patient PD-ICA atrophy scores related to variation along estimated diffusion dimensions¹¹, finding significant associations between atrophy scores and dimensions two and four (r = 0.33 and −0.26, FDR-corrected, q < 0.05; Fig. 4d, e).

To examine whether this dimensional framework provided out-of-sample predictive utility we re-ran the SNF grid search excluding two variables frequently used in clinical settings to assess disease severity: the tremor dominant score and the postural instability/gait difficulty score (PIGD⁴⁰). We regenerated the embedded dimensions following the procedure depicted in Fig. 1d and used five-fold cross-validation to assess the extent to which embeddings predicted patient scores on the held-out clinical variables. A simple linear regression was performed using the first five embedding dimensions as predictors of each clinical score; betas were estimated from 80% of the data and applied to predict scores for the remaining 20%. Out-of-sample scores were predicted moderately well (average Pearson correlation between real and predicted scores: r_tremor = 0.20 [SD = 0.22] ; r_pigd = 0.22 [SD = 0.22]).

Discussion

The present report demonstrates how diverse modalities can be integrated to comprehensively characterize a range of patient characteristics. Using information about behavior, DAT binding, cerebrospinal fluid, cortical thickness, and subcortical tissue volume, we find evidence for distinct biological dimensions or phenotypic axes that span the patient sample. These biologically-informed dimensions provide a more nuanced interpretation of pathology than is permitted from discrete subgroups or biotypes, and hold potential for greater precision in diagnosis.

The present study demonstrates that even in a well-controlled sample of de novo PD patients, there exists considerable heterogeneity. This finding contributes to a rich literature on the diversity of PD symptoms^{5,7,41,42,43,44,45}. Despite the fact that the putative biotypes are based on the first patient visit, there are several important features where the biotypes are not initially different from each other but progressively diverge over time (e.g., tremor scores).

Does the observed heterogeneity imply the existence of fundamentally different diseases, or simply different rates of progression? Recent evidence from animal models suggests that PD originates from misfolding and trans-synaptic spreading of the endogenous protein α-synuclein⁹, with convergent results from human neuroimaging^13,46,47,48. The large-scale atrophy patterns and associated neurological deficits are thought to be mediated by the spread of these pathogenic protein aggregates⁴⁹, penetrating the cerebral hemispheres via the brain stem and subcortex⁵⁰. The heterogeneity of clinical symptoms may simply suggest that patients may differ in the rate and extent of neurodegeneration, leading to diverse symptoms. In other words, individual variation in neurological manifestations of the disease may depend on the spreading pattern and the affected networks⁵¹, as well as factors not directly related to PD pathophysiology (e.g., age, sex, comorbidities).

Hard partitioning methods yield non-overlapping clusters by definition, but is this the best way to characterize heterogeneity in PD? The patient biotypes we initially identified could largely be differentiated from each other on the basis of disease severity, leading us to pursue a dimensional approach. We find that much of the phenotypic variability in the patient sample could be parsimoniously captured by a smaller number of latent dimensions or phenotypic axes that relate to distributed patterns of grey matter atrophy, DAT binding, etc. This finding is reminiscent of previous reports, where even hard clustering solutions often sorted patients into broad disease severity categories^{5,41,42,43,44}. Indeed, recent studies have taken an explicitly dimensional approach, attempting to find low-dimensional projections of clinical-behavioral and neuroimaging data^45,52,53,54. In clinical practice, methods like SNF situate individual patients in a biologically-comprehensive feature space that can then guide more objective clinical decisions about diagnosis and prognosis. Our study is the first step in better understanding which measures are necessary and informative of PD severity, but more work needs to be done to continue to investigate how this will translate to clinical practice.

The notion that PD can be characterized by a smaller number of latent dimensions opens the question of where PD is situated relative to other diseases⁵⁵. Given the natural functional dependencies among the molecular components of a cell, distinct pathological perturbations (e.g. mutations) may affect overlapping gene modules, cell types, tissues, and organs, ultimately manifesting in similar phenotypes. For instance, do the present latent dimensions trace out a transdiagnostic continuum along which we can place other neurodegenerative diseases (e.g. Alzheimer’s disease, tauopathies). How PD fits into a global “disease-ome” remains an exciting question for future research.

More broadly, the present work builds on recent efforts to draw insight from multiple modalities. Modern technological advances and data sharing initiatives permit access to large patient samples with increasing detail and depth¹⁴. How best to integrate information from these multi-view datasets remains a fundamental question in computational medicine^15,19,25. Though there is an active discussion regarding what stage of analysis is most appropriate for data integration²⁵, our results demonstrate that important insights about PD patient heterogeneity can be drawn only by simultaneously considering multiple data modalities. Covariance among clinical, morphometric, and physiological measurements synergistically reveals dominant axes of variance that are not apparent in the individual data modalities. Moreover, we show that simple concatenation induces overfitting to the modality with the greatest dimensionality and autocorrelation, motivating further research on integrating diverse sources of information.

A corollary of the present work is that it is also possible to identify modalities that do not make a significant contribution towards differentiating patients and could potentially be excluded from future data collection efforts. This is an important concern because many modern biological assays—such as brain imaging—are expensive and difficult to administer for some types of clinical populations. As a result, samples may be biased and data may be incomplete for many individuals, limiting the application and utility of the subsequent statistical models. The present analysis can thus help to streamline future data collection efforts. By helping to reduce the feature set, the present analytic approach can also help increase the potential for overlap and interoperability among existing datasets.

Although we took steps to ensure that the reported results are robust to multiple methodological choices, there are several important limitations to consider. First, the present results are only demonstrated in a single patient sample; despite consistency with several other recent studies⁴⁵, formal replication in new longitudinal cohorts is necessary.

Second, the present statistical model leaves out two widely-available and potentially important data modalities: genetic variation and daily movement. Genetic variation is a recognized contributor to the clinical manifestations of PD⁵⁶, and previous work has shown that genetic risk can be used to meaningfully stratify patients⁴⁵. Likewise, objective measurements of daily movement with wearable sensors and smart devices are increasingly prevalent and add a fundamentally different source of information about individual patients⁵⁷. Whether and to what extent the present phenotypic axes reflect the underlying genetic determinants or movement characteristics of PD is an exciting question for future research.

Finally, there has been recent work highlighting the potential of diffusion-weighted imaging (DWI) as an alternative measure to structural, T1w images for predicting PD prognosis⁵⁸; however, the limited availability of DWI in longitudinal cohorts like those used in the current study restricts its more widespread use.

In summary, we report a flexible method to objectively integrate a diverse array of morphometric, molecular, and clinical information to characterize heterogeneity in PD. We find evidence for three biotypes, but show that the sample can alternatively be characterized in terms of continuous phenotypic dimensions. These phenotypic dimensions bring into focus complementary information from multiple modalities and lay the foundation for a more comprehensive understanding of PD.

Methods

Ethical compliance

Data used in this study were obtained from the Parkinson’s Progression Markers Initiative (PPMI) database (https://www.ppmi-info.org), accessed in March of 2018. Informed consent was obtained from all individuals prior to enrollment in the PPMI study by the participating sites. Formal approval for data re-use was issued by the Research Ethics Board at McGill University, Montreal, QC.

Neuroimaging data processing

Raw neuroimages were directly converted from DICOM to BIDS format⁵⁹ using heudiconv (https://github.com/nipy/heudiconv). Structural images for each subject were then independently processed with the Advanced Normalization Tools’ (ANTs) longitudinal cortical thickness pipeline⁶⁰.

Briefly, T1-weighted structural images were corrected for signal intensity non-uniformity with N4 bias correction⁶¹. Corrected images were then combined with other available neuroimages (T2-weighted, proton-density, and FLAIR images) across all timepoints to create a temporally unbiased, subject-specific template⁶². A standard template was non-linearly registered to the subject template and used to remove non-brain tissue; to maintain consistency with previous work on the PPMI dataset^11,13 we used the Montreal Neurological Institute (MNI) ICBM-152 2009c template for this purpose (https://www.bic.mni.mcgill.ca/ServicesAtlases/ICBM152NLin2009^63,64,65). The remaining subject template brain tissue was segmented into six classes (cerebrospinal fluid, cortical gray matter, white matter, subcortical gray matter, brain stem, and cerebellum) using ANTs joint label fusion⁶⁶ with a group of fifteen expertly annotated and labeled atlases from the OASIS dataset⁶⁷. Finally, the segmented subject template was used to performing the same registration, brain extraction, and segmentation procedures on the T1w images from each timepoint.

Three metrics were derived from the pre-processed neuroimaging data: subcortical volume, cortical thickness, and whole-brain deformation-based morphometry values.

Subcortical volume measures the absolute volume, in cubic millimeters, for pre-defined regions of interest. Here, we used the regions of interest from the high-resolution subcortical atlas generated by Pauli et al.⁶⁸. Analyses reported in the main text used the deterministic version of the atlas; however, using the probabilistic version returned comparable results. Where applicable the probabilistic atlas was thresholded at 40%; thresholding was performed after registration to subjects’ T1w MRIs.

Cortical thickness measures the distance, in millimeters, between the pial surface and the gray-white matter boundary of the brain. The ANTs diffeomorphic registration-based cortical thickness estimation (DiReCT) algorithm was used to measure cortical thickness in the volumetric space of each T1w image⁶⁹. The diffeomorphic constraint of this procedure ensures that the white matter topology cannot change during thickness estimations, permitting accurate recovery of cortical depth even in sulcal grooves. Previous work has found that this procedure compares favorably to surface-based approaches⁷⁰.

To account for subject-level neuroanatomical variance, we averaged cortical thickness values in each region of the five multi-scale parcellations (68, 114, 219, 448, and 1000 regions) generated by Cammoun et al.⁷¹ for every timepoint for every subject. The parcellations were transformed to the T1w MRI of each timepoint before averaging within regions to minimize bias.

Deformation-based morphometry (DBM) provides an approximate measure of local changes in brain tissue volume for a subject relative to a standard template⁷². DBM values are derived from the deformation maps generated during the non-linear registration process aligning each subject brain to the template space. In the present study, deformation maps were created by concatenating the non-linear warps described in Neuroimaging data processing that (1) mapped the T1w image for each timepoint to the relevant subject-specific template and (2) mapped the subject-specific template to the MNI152-2009c template. Local changes in tissue volume were then estimated from the derivative of these deformation maps, calculated as the determinant of the Jacobian matrix of displacement. To aid interpretability of these changes we calculated the natural log of the Jacobian determinant, such that a value of zero indicates no volume change compared to the MNI template, negative values indicate tissue expansion relative to the MNI template, and positive values indicate tissue loss relative to the MNI template.

Zeighami and colleagues initially used DBM values from the PPMI subjects to find an ICA component map, which they refer to as a PD-ICA map, highlighting regions of the brain with greater relative tissue loss in Parkinson’s patients than in age-matched healthy controls¹¹. Using this map, they generated an “atrophy score” for each patient, which they related to measures of PD disease severity and prognosis^11,36. Unfortunately, a precise calculation of this atrophy score on a different subset of subjects from the PPMI would require the original ICA component table; in order to approximate this score without the component table we employed an alternative procedure originally used in³⁶.

We downloaded the PD-ICA component map from NeuroVault (collection 860, image ID 12551, https://identifiers.org/neurovault.collection:860⁷³) and used the map as a weighted mask on the DBM images estimated for each subject, multiplying DBM values by the component weights in the map and then summing the resulting values to generate a single score for each time point for each subject. Reported atrophy scores are all normalized (i.e., zero mean and unit variance) with respect to the n = 186 patient sample.

Neuroimaging quality control

Neuroimaging data were visually inspected by two authors (RDM and CT) using tools adapted from niworkflows (https://github.com/poldracklab/niworkflows). Processed data from twenty individuals were jointly selected by both raters as anchors and used to guide independent rating of the remaining T1w structural images for all subjects⁷⁴. Quality of brain extraction, brain segmentation, and registration to the MNI152-2009c template was assessed and rated on a scale of 0–2, where 0 indicates a processing failure, 1 indicates a conditional pass, and 2 indicates a full pass. Discrepancies where one rater assigned a score of 0 and the other rater assigned a passing score (n = 11 instances) were jointly reconciled and new scores assigned (n = 9 revised to a score of 0, n = 2 revised to a score of 1).

Cohen’s kappa coefficient (κ) was calculated to compare the inter-rater reliability of the final quality control scores, yielding 84.8% agreement for segmentation and 84.5% agreement for registration ratings⁷⁵. Assessing scores for only the subject-specific template of each subject yielded comparable agreement (85.8% for segmentation, 85.5% for registration). The current study used subjects for whom both segmentation and registration scores were ≥1 across both raters.

Non-neuroimaging data processing

In addition to longitudinal neuroimages, the PPMI provides clinical-behavioral assessments, biospecimen analyses, and single-photon emission computed tomography (SPECT) dopamine active transporter (DAT) binding data for all of its participants. Item-level measures for clinical-behavioral assessments were combined into raw composite scores following instructions in the “Derived Variable Definitions and Score Calculations” guide supplied by the PPMI using the pypmi software package (https://github.com/netneurolab/pypmi). Biospecimen data and pre-computed region of interest metrics for SPECT data were used as provided.

Data cleaning

The current study used five data sources: (1) MRI cortical thickness, (2) MRI subcortical volume, (3) SPECT DAT binding ratios, (4) biological and cerebrospinal fluid assays, and (5) clinical-behavioral assessment scores. Prior to combining data sources we performed (i) data cleaning, (ii) outlier removal, (iii) missing data imputation, (iv-a) batch correction, (iv-b) covariate residualization, and (v) normalization for each modality.

Any features for which ≥20% of individuals were missing data were discarded; subsequently, individuals missing ≥20% of the remaining features were discarded. Putative outlier individuals were then identified using a median absolute deviation method and discarded⁷⁶. The remaining missing data values were imputed, substituting the median value across individuals for each feature. This procedure yielded n = 186 individuals with PD and n = 87 healthy individuals who had data from all five sources. A full list of the 1050 features retained for each data source after preprocessing can be found in Supplementary Table 2.

We used ComBat to correct for site differences in neuroimaging data modalities (i.e., cortical thickness and subcortical volume measurements; neurocombat, https://github.com/ncullen93/neurocombat)^77,78. Patient diagnostic status, family history of PD, sex, race, handedness, and education were included in the ComBat correction procedure.

We residualized the pre-processed data against age, sex, and age × sex interactions in PD patients based on the relationships estimated from healthy individuals; estimated total intracranial volume was also included in the residualization process for cortical thickness and subcortical volume features¹². Finally, PD patient data were z-scored (centered to zero mean and standardized to unit variance). Only PD patient data were used to estimate means and standard deviation for z-scoring.

Data concatenation

Concatenation provides a data-driven, parameter-free approach for integrating multimodal data, where data features from all sources are horizontally stacked into a single sample by feature matrix. Here, we joined patient data for all features from all data sources and converted the resulting matrix into a patient similarity network by applying a cosine similarity function from scikit-learn⁷⁹. As spectral clustering cannot handle negative values we scaled the values of the similarity networks between zero and two. The scaled patient networks were subjected to spectral clustering^28,29 for a two-, three-, and four-cluster solution, and the solutions were compared to clustering assignments generated from SNF, described below.

Similarity network fusion

Similarity network fusion is a method for combining disparate data sources from a group of samples into a single graph, representing the strength of relationships between samples¹⁹. SNF constructs independent similarity networks from each data source using a K-nearest neighbors weighted kernel and then iteratively combines them via a non-linear message passing protocol. The final network contains sample relationships representing information from all data sources and can be subjected to clustering or other graph techniques (see Fig. 1a).

SNF belongs to a broader family of techniques of multi-view learning algorithms (e.g., multiple kernel learning, multi-table matrix factorization). We opted to use SNF because (1) it is an unsupervised learning technique, (2) it is explicitly optimized to control for differing dimensionalities amongst input data modalities¹⁹, and (3) it has been shown to be effective at disentangling heterogeneity in psychiatric populations^21,22.

We constructed similarity networks for each data source using a Python implementation of the methods described in¹⁹ (snfpy; https://github.com/netneurolab/snfpy). A brief description of the main steps in SNF follows, adapted from its original presentation in¹⁹.

First, distance matrices are created from each feature matrix. We selected squared Euclidean distance for the analyses in the main text; however, other distance measures return comparable results (see Supplementary Results: Alternative distance metrics”). Next, distance matrices are converted to similarity networks using a scaled exponential kernel:

$${\bf{W}}(i,j)=\frac{1}{\sqrt{2\pi {\sigma }^{2}}}\,{{\rm{e}}}^{-\frac{{\rho }^{2}({x}_{i},{x}_{j})}{2{\sigma }^{2}}},$$

(1)

where ρ(x_i, x_j) is the Euclidean distance (or other distance metric, as appropriate) between patients x_i and x_j. The value σ is calculated with:

$$\sigma =\mu \frac{{\overline{\rho }}^{2}({x}_{i},{N}_{i})+{\overline{\rho }}^{2}({x}_{j},{N}_{j})+{\rho }^{2}({x}_{i},{x}_{j})}{3},$$

(2)

where $\overline{\rho }({x}_{i},{N}_{i})$ represents the average Euclidean distance between x_i and its neighbors N_1..K. Both K, controlling the number of neighbors, and μ, the scaling factor, are hyperparameters that must be pre-selected, where $K\in [1,2,...,j],j\in {\mathbb{Z}}$ and $\mu \in {{\mathbb{R}}}^{+}$.

In order to fuse the supplied similarity networks each must first be normalized. A traditional normalization performed on a similarity matrix would be unstable due to the self-similarity along the diagonal; thus, a modified normalization is used:

$${\bf{P}}(i,j)=\left\{\begin{array}{ll}\frac{{{\bf{W}}}_{(i,j)}}{2{\sum }_{k\ne i}{{\bf{W}}}_{(i,k)}},&j\,\ne\, i,\\ 1/2,&j=i.\end{array}\right.$$

(3)

Under the assumption that local similarities are more important or reliable than distant ones, a more sparse weight matrix is calculated:

$${\bf{S}}(i,j)=\left\{\begin{array}{ll}\frac{{{\bf{W}}}_{(i,j)}}{{\sum }_{k\in {N}_{i}}{{\bf{W}}}_{(i,k)}},&j\in {N}_{i},\\ 0,&\,\text{otherwise}.\,\end{array}\right.$$

(4)

The two weight matrices P and S thus provide information about a given patient’s similarity to all other patients and the patient’s K most similar neighbors, respectively.

The similarity networks are then iteratively fused. At each iteration, the matrices are made more similar to each other via:

$${{\bf{P}}}^{(v)}={{\bf{S}}}^{(v)}\times \frac{{\sum }_{k\ne v}{{\bf{P}}}^{(k)}}{m-1}\times {({{\bf{S}}}^{(v)})}^{{\rm{T}}},v=1,2,...,m.$$

(5)

After each iteration, the resulting matrices are re-normalized via the above equations. Fusion stops when the matrices have converged or after a pre-specified number of iterations (by default, 20).

Variation in SNF’s two parameters, K and μ, can highlight different aspects of the input data, yielding significantly different fused networks. In order to avoid biasing our results by selecting any specific hyperparameter combination we opted to perform SNF with different combinations of K and μ, using 100 unique values for each parameter (K = 5–105; μ = 0.3–10, logspace). The resulting set of 10,000 fused networks was subjected to consensus clustering and diffusion map embedding to generate both categorical and continuous representations of patient pathology.

For a more in-depth examination of how variation in hyperparameters impacted patient networks refer to “Supplementary Results: Hyperparameter variation”.

In order to find a single clustering solution from the 10,000 fused networks generated by SNF we employed a consensus approach inspired by Bassett et al.³⁰ and Lancichinetti and Fortunato³¹. First, each of the 10,000 networks was subjected to spectral clustering^28,29. As spectral clustering requires pre-specifying the desired number of clusters to be estimated, we chose to generated separate solutions for two, three, and four clusters each. Though it is possible there are more clusters in the dataset, previous work on PD patient data supports 2–4 clusters as a reasonable choice^4,5.

We wanted to consider regions of the parameter space that generated “stable” networks—that is, where small perturbations in either of the two hyperparameters did not appreciably change the topology or resulting clustering of the patient networks. To quantify this we assessed the pairwise z-Rand similarity index⁸⁰ of the clustering solution for each point in hyperparameter space and its four neighbors; local z-Rand values were calculated separately for two-, three-, and four-cluster solutions (Fig. 1c). The resulting cluster similarity matrices were thresholded at the 95th percentile of the values in all three matrices; clustering solutions from regions of parameter space surviving this threshold in any of the three matrices were retained for further analysis, resulting in 1262 × 3 = 3786 assignments. These solutions were used to generate a subject co-assignment matrix representing the normalized probability that two subjects were placed in the same cluster across all assignments. This “co-assignment” matrix was thresholded by generating an average probability from a permutation-based null model, as in³⁰. Briefly, we permuted the assignments for each clustering solution and regenerated the co-assignment probability matrix; the average probability of this permuted matrix was used as the threshold for the original. We clustered the resulting thresholded matrix using a modularity maximization procedure to generate a final “consensus” clustering partition, which was used in all subgroup analyses³².

Diffusion map embedding

While clustering is appealing for its intuitive clinical applications, recent work has shown that continuous representations of clinical dysfunction may provide a more accurate representation of the underlying diseases^12,81 As an alternative to clustering, we applied diffusion embedding to the fused PD patient network.

Diffusion map embedding is a non-linear dimensionality reduction technique that finds a low-dimensional representation of graph structures^38,39. Though closely related to other manifold learning techniques including e.g., Laplacian-based spectral embedding, diffusion map embedding typically uses a different normalization process that approximates a Fokker–Planck diffusion equation instead of traditional laplacian normalization⁸². Moreover, diffusion map embedding attempts to model a multi-scale view of the diffusion process via a diffusion time parameter, t, that allow for more nuanced investigations of the geometry of the input data than are achievable via comparable techniques like spectral embedding. In the current manuscript we used diffusion time t = 0, which reveals the most global relationships of the input dataset³⁸.

Prior to embedding we thresholded our fused graphs, removing edges below the 90th percentile of weights for each individual, and computed the cosine similarity of the resulting network⁸³. This network was decomposed using mapalign (https://github.com/satra/mapalign) to generate an embedded space for each of the 10,000 patient networks.

For our analyses we only considered embeddings for those networks in regions of parameter space deemed “stable” via the z-Rand thresholding procedure described in Consensus clustering, resulting in 1262 embeddings. These embeddings were aligned using a generalized orthogonal Procrustes analysis (rotations and reflections only) and then averaged to generate a single patient embedding which was carried forward to all analyses (Fig. 1d). We only considered the first N components yielding a cumulative variance explained of ≥10%.

To examine the out-of-sample predictive utility of the dimensional embedding framework we re-ran the SNF grid search excluding two variables: the tremor dominant score (tremor) and the postural instability/gait difficulty score (PIGD). Embedded dimensions were regenerated following the same procedures depicted in Fig. 1d. We used a five-fold cross-validation framework to assess the extent to which embeddings predicted patient scores on the held-out clinical variables. A simple linear regression was performed using the first five embedding dimensions as predictors of each clinical score; betas were estimated from 80% of the patients and applied to predict scores for the remaining 20% of the patients. Predicted scores were correlated with actual scores, and the average correlations across all folds were reported.

Statistical assessments

Quantitative assessments of the similarity between clustering assignments were calculated using normalized mutual information (NMI³³), a measure ranging from 0 to 1 where higher values indicate increased overlap between assignments. Alternative measures (e.g., adjusted mutual information) that correct for differing cluster numbers were not used as NMI was only calculated between assignments with the same number of clusters.

Modularity was used to measure the “goodness-of-fit” of clustering assignments, and was calculated by

$$Q={\sum \limits_{ij}}{B}_{ij}\delta ({\sigma }_{i},{\sigma }_{j}),$$

(6)

where δ() is the Kronecker delta function which returns 1 when patients i and j belong to the same cluster and 0 otherwise. Here, B_ij is a modularity matrix whose elements are given by B_ij = A_ij − P_ij, where A_ij and P_ij are the observed and expected similarity between patients i and j. Modularity ranges between −1 and 1, where higher values indicate increasingly well-defined assortative clustering assignments³⁴.

To assess whether identified clusters were discriminable we performed separate one-way ANOVAs across groups for all data features provided to SNF using scipy’s statistical computing module⁸⁴ (model: score ~ biotype). Reported results were FDR-corrected (q < 0.05) using the Benjamini–Hochberg procedure⁸⁵ from the statsmodels modules⁸⁶. Although longitudinal data were limited for many features, clinical-behavioral assessments were available for the majority of subjects up to 5 years post-baseline. Thus, we analyzed the impact of cluster affiliation on longitudinal feature scores using linear mixed effects models, including age, education, and sex as additional covariates.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Data used in this study were obtained from the Parkinson’s Progression Markers Initiative (PPMI) database (https://www.ppmi-info.org), accessed in March of 2018. PPMI data are freely accessible to researchers by signing a Data Use Agreement on the PPMI website.

Code availability

All code used for data processing, analysis, and figure generation is available on GitHub (https://github.com/netneurolab/markello_ppmisnf) and relies on the following open-source Python packages: h5py⁸⁷, IPython⁸⁸, Jupyter⁸⁹, Matplotlib⁹⁰, NiBabel⁹¹, Nilearn⁹², Numba⁹³, NumPy^94,95, Pandas⁹⁶, scikit-learn⁷⁹, SciPy⁸⁴, StatsModels⁸⁶, Seaborn⁹⁷, BCTPy⁹⁸, and mapalign⁹⁹. Code can also be accessed at https://doi.org/10.5281/zenodo.3731251.

References

Postuma, R. B. et al. Mds clinical diagnostic criteria for Parkinson’s disease. Mov. Disord. 30, 1591–1601 (2015).
Article PubMed Google Scholar
Kalia, L. V. & Lang, A. E. Parkinson’s disease. Lancet 386, 896–912 (2015).
Article CAS PubMed Google Scholar
Tysnes, O.-B. & Storstein, A. Epidemiology of Parkinson’s disease. J. Neural Transmission 124, 901–905 (2017).
Article Google Scholar
Faghri, F. et al. Predicting onset, progression, and clinical subtypes of Parkinson disease using machine learning. bioRxiv. https://doi.org/10.1101/338913 (2018).
Fereshtehnejad, S.-M., Zeighami, Y., Dagher, A. & Postuma, R. B. Clinical criteria for subtyping Parkinson’s disease: biomarkers and longitudinal progression. Brain 140, 1959–1976 (2017).
Article PubMed Google Scholar
Simuni, T. et al. How stable are Parkinson’s disease subtypes in de novo patients: analysis of the PPMI cohort? Parkinsonism Related Disord. 28, 62–67 (2016).
Article Google Scholar
Espay, A. J. et al. Biomarker-driven phenotyping in Parkinson’s disease: a translational missing link in disease-modifying clinical trials. Mov. Disord. 32, 319–324 (2017).
Article PubMed PubMed Central Google Scholar
Fujiwara, H. et al. α-synuclein is phosphorylated in synucleinopathy lesions. Nature Cell Biol. 4, 160 (2002).
Article CAS PubMed Google Scholar
Luk, K. C. et al. Pathological α-synuclein transmission initiates parkinson-like neurodegeneration in nontransgenic mice. Science 338, 949–953 (2012).
Article CAS PubMed PubMed Central Google Scholar
Fearnley, J. M. & Lees, A. J. Ageing and Parkinson’s disease: substantia nigra regional selectivity. Brain 114, 2283–2301 (1991).
Article PubMed Google Scholar
Zeighami, Y. et al. Network structure of brain atrophy in de novo Parkinson’s disease. Elife 4, e08440 (2015).
Article PubMed Central Google Scholar
Zeighami, Y. et al. A clinical-anatomical signature of Parkinsona’s disease identified with partial least squares and magnetic resonance imaging. Neuroimage 190, 69–78 (2019).
Article PubMed Google Scholar
Yau, Y. H. et al. Network connectivity determines cortical thinning in early Parkinson’s disease progression. Nat. Commun. 9, 12 (2018).
Article CAS PubMed PubMed Central Google Scholar
Marek, K. et al. The Parkinson Progression Marker Initiative (PPMI). Prog. Neurobiol. 95, 629–635 (2011).
Article PubMed Central Google Scholar
Mišić, B. & Sporns, O. From regions to connections and networks: new bridges between brain and behavior. Curr. Opin. Neurobiol. 40, 1–7 (2016).
Article PubMed PubMed Central CAS Google Scholar
Kirk, P., Griffin, J. E., Savage, R. S., Ghahramani, Z. & Wild, D. L. Bayesian correlated clustering to integrate multiple datasets. Bioinformatics 28, 3290–3297 (2012).
Article CAS PubMed PubMed Central Google Scholar
Monti, S., Tamayo, P., Mesirov, J. & Golub, T. Consensus clustering: a resampling-based method for class discovery and visualization of gene expression microarray data. Mach. Learning 52, 91–118 (2003).
Article Google Scholar
Nigro, J. M. et al. Integrated array-comparative genomic hybridization and expression array profiles identify clinically relevant molecular subtypes of glioblastoma. Cancer Res. 65, 1678–1686 (2005).
Article CAS PubMed Google Scholar
Wang, B. et al. Similarity network fusion for aggregating data types on a genomic scale. Nat. Methods 11, 333 (2014).
Article CAS PubMed Google Scholar
Marquand, A. F., Rezek, I., Buitelaar, J. & Beckmann, C. F. Understanding heterogeneity in clinical cohorts using normative models: beyond case-control studies. Biol. Psychiatry 80, 552–561 (2016).
Article PubMed PubMed Central Google Scholar
Stefanik, L. et al. Brain-behavior participant similarity networks among youth and emerging adults with schizophrenia spectrum, autism spectrum, or bipolar disorder and matched controls. Neuropsychopharmacology 43, 1180 (2018).
Article PubMed Google Scholar
Jacobs, G. R. et al. Integration of brain and behavior measures for identification of data-driven groups cutting across children with ASD, ADHD, or OCD. bioRxiv. https://doi.org/10.1101/2020.02.11.944744 (2020).
Zhuang, J., Wang, J., Hoi, S. C. & Lan, X. Unsupervised multiple kernel learning. In JMLR: Workshop and Conference Proceedings: 3rd Asian Conference on Machine Learning 2011 (eds Hsu, C.-N. & Lee, W.S.) (Research Collection School of Information Systems, 2011).
Donini, M. et al. Combining heterogeneous data sources for neuroimaging based diagnosis: re-weighting and selecting what is important. NeuroImage 195, 215–231 (2019).
Article PubMed Google Scholar
Zitnik, M. et al. Machine learning for integrating data in biology and medicine: principles, practice, and opportunities. Information Fusion 50, 71–91 (2019).
Article PubMed Google Scholar
Beyer, K., Goldstein, J., Ramakrishnan, R. & Shaft, U. When is “nearest neighbor” meaningful? In International Conference on Database Theory (eds Berri, C. & Buneman, P.) 217–235 (Springer, 1999).
Desikan, R. S. et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. NeuroImage 31, 968–980 (2006).
Article PubMed Google Scholar
Shi, J. & Malik, J. Normalized cuts and image segmentation. IEEE Trans Pattern Anal. Mach. Intell. 22, 888–905 (2000).
Article Google Scholar
Yu, S. X. & Shi, J. Multiclass spectral clustering. In Proc. 9th IEEE International Conference on Computer Vision (2003).
Bassett, D. S. et al. Robust detection of dynamic community structure in networks. Chaos: Interdisciplinary J. Nonlinear Sci. 23, 013142 (2013).
Article Google Scholar
Lancichinetti, A. & Fortunato, S. Consensus clustering in complex networks. Sci. Rep. 2, 336 (2012).
Article PubMed PubMed Central CAS Google Scholar
Blondel, V. D., Guillaume, J.-L., Lambiotte, R. & Lefebvre, E. Fast unfolding of communities in large networks. J. Statistical Mech.: Theory Exp. 2008, P10008 (2008).
Article Google Scholar
Strehl, A. & Ghosh, J. Cluster ensembles—a knowledge reuse framework for combining multiple partitions. J. Mach. Learning Res. 3, 583–617 (2002).
Google Scholar
Newman, M. E. Modularity and community structure in networks. Proc. Natl Acad. Sci. 103, 8577–8582 (2006).
Article CAS PubMed PubMed Central Google Scholar
Jankovic, J. & Kapadia, A. S. Functional decline in Parkinson disease. Arch. Neurol. 58, 1611–1615 (2001).
Article CAS PubMed Google Scholar
Zeighami, Y. et al. Assessment of a prognostic MRI biomarker in early de novo Parkinson’s disease. NeuroImage: Clinical 24, 101986 (2019).
Article Google Scholar
Goetz, C. G. et al. Movement disorder society-sponsored revision of the unified Parkinson’s disease rating scale (MDS-UPDRS): scale presentation and clinimetric testing results. Mov. Disord. 23, 2129–2170 (2008).
Article PubMed Google Scholar
Coifman, R. R. et al. Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps. Proc. Natl Acad Sci. 102, 7426–7431 (2005).
Article CAS PubMed PubMed Central Google Scholar
Lafon, S. & Lee, A. B. Diffusion maps and coarse-graining: a unified framework for dimensionality reduction, graph partitioning, and data set parameterization. IEEE Trans. Pattern Anal. Mach. Intell. 28, 1393–1403 (2006).
Article PubMed Google Scholar
Stebbins, G. T. et al. How to identify tremor dominant and postural instability/gait difficulty groups with the movement disorder society unified Parkinson’s disease rating scale: comparison with the unified Parkinson’s disease rating scale. Mov. Disord. 28, 668–670 (2013).
Article PubMed Google Scholar
Thenganatt, M. A. & Jankovic, J. Parkinson disease subtypes. JAMA Neurol. 71, 499–504 (2014).
Article PubMed Google Scholar
Erro, R. et al. Clinical clusters and dopaminergic dysfunction in de-novo parkinson disease. Parkinsonism Relat. Disord. 28, 137–140 (2016).
Article PubMed Google Scholar
Lawton, M. et al. Parkinson’s disease subtypes in the oxford parkinson disease centre (opdc) discovery cohort. J. Parkinson’s Dis. 5, 269–279 (2015).
Article Google Scholar
Lawton, M. et al. Developing and validating Parkinson’s disease subtypes and their motor and cognitive progression. J. Neurol. Neurosurg. Psychiatry 89, 1279–1287 (2018).
Article PubMed Google Scholar
Sandor, C. et al. Universal continuous severity traits underlying hundreds of Parkinson’s disease clinical features. bioRxiv. https://doi.org/10.1101/655217 (2019).
Freeze, B., Pandya, S., Zeighami, Y. & Raj, A. Regional transcriptional architecture of Parkinson’s disease pathogenesis and network spread. Brain 142, 3072–3085 (2019).
Article PubMed PubMed Central Google Scholar
Pandya, S. et al. Predictive model of spread of Parkinson’s pathology using network diffusion. NeuroImage 192, 178–194 (2019).
Article CAS PubMed Google Scholar
Zheng, Y.-Q. et al. Local vulnerability and global connectivity jointly shape neurodegenerative disease propagation. PLoS Biol. 17, e3000495 (2019).
Weickenmeier, J., Kuhl, E. & Goriely, A. Multiphysics of prionlike diseases: progression and atrophy. Phys. Rev. Lett. 121, 158101 (2018).
Article CAS PubMed Google Scholar
Warren, J. D. et al. Molecular nexopathies: a new paradigm of neurodegenerative disease. Trends Neurosci. 36, 561–569 (2013).
Article CAS PubMed PubMed Central Google Scholar
Maia, P. D. et al. Origins of atrophy in Parkinson linked to early onset and local transcription patterns. Brain Commun. 2, facaa065 (2020).
Kebets, V. et al. Somatosensory-motor dysconnectivity spans multiple transdiagnostic dimensions of psychopathology. Biol. Psychiatry 86, 779–791 (2019).
Article PubMed Google Scholar
Tang, S. et al. Reconciling dimensional and categorical models of autism heterogeneity: a brain connectomics & behavioral study. BioRxiv. https://doi.org/10.1101/692772 (2019).
Kirschner, M. et al. Latent clinical-anatomical dimensions of schizophrenia. Schizophrenia Bull. 46, 1426–1438 (2020).
Article Google Scholar
Barabási, A.-L., Gulbahce, N. & Loscalzo, J. Network medicine: a network-based approach to human disease. Nat. Rev. Genetics 12, 56 (2011).
Article PubMed CAS Google Scholar
Nalls, M. A. et al. Large-scale meta-analysis of genome-wide association data identifies six new risk loci for Parkinson’s disease. Nat. Genetics 46, 989 (2014).
Article CAS PubMed Google Scholar
Maetzler, W., Domingos, J., Srulijes, K., Ferreira, J. J. & Bloem, B. R. Quantitative wearable sensors for objective assessment of Parkinson’s disease. Mov. Disord. 28, 1628–1637 (2013).
Article PubMed Google Scholar
Abbasi, N. et al. Predicting severity and prognosis in Parkinsona’s disease from brain microstructure and connectivity. Neuroimage: Clin. 25, 102111 (2019).
Article Google Scholar
Gorgolewski, K. J. et al. The Brain Imaging Data Structure, a format for organizing and describing outputs of neuroimaging experiments. Sci. Data 3, 160044 (2016).
Article PubMed PubMed Central Google Scholar
Tustison, N. J. et al. The ANTs longitudinal cortical thickness pipeline. bioRxiv. https://doi.org/10.1101/170209 (2018).
Tustison, N. J. et al. N4ITK: improved N3 bias correction. IEEE Trans. Med. Imaging 29, 1310–1320 (2010).
Article PubMed PubMed Central Google Scholar
Avants, B. B. et al. The optimal template effect in hippocampus studies of diseased populations. Neuroimage 49, 2457–2466 (2010).
Article PubMed Google Scholar
Fonov, V. S., Evans, A. C., McKinstry, R. C., Almli, C. & Collins, D. Unbiased nonlinear average age-appropriate brain templates from birth to adulthood. Neuroimage 47, S102 (2009).
Article Google Scholar
Fonov, V. et al. Unbiased average age-appropriate atlases for pediatric studies. Neuroimage 54, 313–327 (2011).
Article PubMed Google Scholar
Collins, D. L., Zijdenbos, A. P., Baaré, W. F. & Evans, A. C. ANIMAL+INSECT: improved cortical structure segmentation. In Biennial International Conference on Information Processing in Medical Imaging (eds Kuba, A., Šáamal, M. & Todd-Pokropek, A.) 210–223 (Springer, 1999).
Wang, H. et al. Multi-atlas segmentation with joint label fusion. IEEE Trans. Pattern Anal. Mach. Intell. 35, 611–623 (2013).
Article PubMed Google Scholar
Klein, A. & Tourville, J. 101 labeled brain images and a consistent human cortical labeling protocol. Frontiers in Neuroscience 6, 171 (2012).
Article PubMed PubMed Central Google Scholar
Pauli, W. M., Nili, A. N. & Tyszka, J. M. A high-resolution probabilistic in vivo atlas of human subcortical brain nuclei. Sci. Data 5, 180063 (2018).
Article PubMed PubMed Central Google Scholar
Das, S. R., Avants, B. B., Grossman, M. & Gee, J. C. Registration based cortical thickness measurement. Neuroimage 45, 867–879 (2009).
Article PubMed Google Scholar
Tustison, N. J. et al. Large-scale evaluation of ANTs and FreeSurfer cortical thickness measurements. Neuroimage 99, 166–179 (2014).
Article PubMed Google Scholar
Cammoun, L. et al. Mapping the human connectome at multiple scales with diffusion spectrum MRI. J. Neurosci. Methods 203, 386–397 (2012).
Article PubMed Google Scholar
Ashburner, J. et al. Identifying global anatomical differences: Deformation-based morphometry. Human Brain Mapping 6, 348–357 (1998).
Article CAS PubMed PubMed Central Google Scholar
Gorgolewski, K. J. et al. NeuroVault.org: a web-based repository for collecting and sharing unthresholded statistical maps of the human brain. Front. Neuroinform. 9, 8 (2015).
Article PubMed PubMed Central Google Scholar
Rosen, A. F. et al. Quantitative assessment of structural image quality. NeuroImage 169, 407–418 (2018).
Article PubMed Google Scholar
Cohen, J. A coefficient of agreement for nominal scales. Educational Psychol. Meas. 20, 37–46 (1960).
Article Google Scholar
Iglewicz, B. & Hoaglin, D. C. How to Detect and Handle Outliers Vol. 16 (ASQ Press, 1993).
Johnson, W. E., Li, C. & Rabinovic, A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 8, 118–127 (2007).
Article PubMed Google Scholar
Fortin, J.-P. et al. Harmonization of cortical thickness measurements across scanners and sites. NeuroImage 167, 104–120 (2018).
Article PubMed Google Scholar
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learning Res. 12, 2825–2830 (2011).
Google Scholar
Traud, A. L., Kelsic, E. D., Mucha, P. J. & Porter, M. A. Comparing community structure to characteristics in online collegiate social networks. SIAM Rev. 53, 526–543 (2011).
Article Google Scholar
Xia, C. H. et al. Linked dimensions of psychopathology and connectivity in functional brain networks. Nat. Commun. 9, 3003 (2018).
Nadler, B., Lafon, S., Kevrekidis, I. & Coifman, R. R. in Advances in Neural Information Processing Systems (eds Weiss, Y., Schölkopf, B., Platt, J.) 955–962 (MIT Press, Cambridge, 2006).
Margulies, D. S. et al. Situating the default-mode network along a principal gradient of macroscale cortical organization. Proc. Natl Acad. Sci. 113, 12574–12579 (2016).
Article CAS PubMed PubMed Central Google Scholar
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
Article CAS PubMed PubMed Central Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Statistical Soc.: Ser. B (Methodological) 57, 289–300 (1995).
Google Scholar
Seabold, S. & Perktold, J. Statsmodels: econometric and statistical modeling with python. In 9th Python in Science Conference (eds van der Walt, S. & Millman, J.) (2010).
Collette, A. Python and HDF5: Unlocking Scientific Data (O’Reilly, 2013).
Pérez, F. & Granger, B. E. IPython: a system for interactive scientific computing. Comput. Sci. Eng. 9, 21–29 (2007).
Article Google Scholar
Kluyver, T. et al. In Positioning and Power in Academic Publishing: Players, Agents and Agendas (eds Loizides, F. & Scmidt, B.) 87–90 (IOS Press, 2016).
Hunter, J. D. Matplotlib: A 2D graphics environment. Comput. Sci. Eng. 9, 90–95 (2007).
Article Google Scholar
Brett, M. et al. nipy/nibabel. Zenodo. https://doi.org/10.5281/zenodo.591597 (2019).
Abraham, A. et al. Machine learning for neuroimaging with scikit-learn. Front. Neuroinform. 8, 14 (2014).
Article PubMed PubMed Central Google Scholar
Lam, S. K., Pitrou, A. & Seibert, S. Numba: A LLVM-based python JIT compiler. In Proc. 2nd Workshop on the LLVM Compiler Infrastructure in HPC 7 (ed Finkel, H.) (ACM, 2015).
Oliphant, T. E. A Guide to NumPy Vol. 1 (Trelgol Publishing USA, 2006).
Van Der Walt, S., Colbert, S. C. & Varoquaux, G. The NumPy array: a structure for efficient numerical computation. Comput. Sci. Eng. 13, 22 (2011).
Article Google Scholar
McKinney, W. et al. Data structures for statistical computing in Python. In Proc. 9th Python in Science Conference (eds van der Walt, S. & Millman, J.) Vol. 445, 51–56 (Austin, TX, 2010).
Waskom, M. et al. mwaskom/seaborn. Zenodo. https://doi.org/10.5281/zenodo.592845 (2018).
Rubinov, M. & Sporns, O. Complex network measures of brain connectivity: uses and interpretations. Neuroimage 52, 1059–1069 (2010).
Article PubMed Google Scholar
Langs, G., Golland, P. & Ghosh, S. S. Predicting activation across individuals with resting-state functional connectivity based multi-atlas label fusion. In International Conference on Medical Image Computing and Computer-Assisted Intervention (eds Navab, N., Hornegger, J., Wells, W. M. & Frangi, A.) 313–320 (Springer, 2015).

Download references

Acknowledgements

This research was undertaken thanks in part to funding from the Canada First Research Excellence Fund, awarded to McGill University for the Healthy Brains for Healthy Lives initiative. BM acknowledges support from the Canadian Insititutes of Health Research (CIHR-PJT) and from the Fonds de recherche du Québec - Santé (Chercheur Boursier). RDM acknowledges support from the Healthy Brains for Healthy Lives (HBHL) initiative at McGill University and the Fonds de recherche du Québec—Nature et technologies (FRQNT). Data used in the preparation of this article were obtained from the Parkinson’s Progression Markers Initiative (PPMI) database (https://www.ppmi-info.org/data). For up-to-date information on the study, visit https://www.ppmi-info.org. The PPMI—a public-private partnership—is funded by the Michael J. Fox Foundation for Parkinson’s Research and funding partners, including AbbVie, Avid Radiopharmaceuticals, Biogen, BioLegend, Bristol-Myers Squibb, GE Healthcare, Genentech, GlaxoSmithKline (GSK), Eli Lilly and Company, Lundbeck, Merck, Meso Scale Discovery (MSD), Pfizer, Piramal Imaging, Roche, Sanofi Genzyme, Servier, Takeda, Teva, and UCB (https://www.ppmi-info.org/fundingpartners).

Author information

Authors and Affiliations

McConnell Brain Imaging Centre, Montréal Neurological Institute, McGill University, Montréal, QC, Canada
Ross D. Markello, Golia Shafiei, Christina Tremblay, Ronald B. Postuma, Alain Dagher & Bratislav Misic

Authors

Ross D. Markello
View author publications
You can also search for this author in PubMed Google Scholar
Golia Shafiei
View author publications
You can also search for this author in PubMed Google Scholar
Christina Tremblay
View author publications
You can also search for this author in PubMed Google Scholar
Ronald B. Postuma
View author publications
You can also search for this author in PubMed Google Scholar
Alain Dagher
View author publications
You can also search for this author in PubMed Google Scholar
Bratislav Misic
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.D.M. and B.M. conceived the project. R.D.M. designed and performed all analyses with input from G.S. and C.T. R.D.M. and B.M. wrote the manuscript. A.D. and R.B.P. provided input on all analyses and edited the manuscript.

Corresponding authors

Correspondence to Ross D. Markello or Bratislav Misic.

Ethics declarations

Competing interests

The authors declare no competing interests. Full financial disclosure is provided below: RBP reports grants and personal fees from Fonds de la Recherche en Sante, the Canadian Institute of Health Research, Parkinson Canada, the Weston-Garfield Foundation, the Michael J. Fox Foundation, the Webster Foundation, and personal fees from Takeda, Roche/Prothena, Teva Neurosciences, Novartis, Biogen, Boehringer Ingelheim, Theranexus, GE HealthCare, Jazz Pharmaceuticals, Abbvie, Jannsen, Curasen and Otsuko, outside the submitted work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental Information

Reporting Summary FLAT

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Markello, R.D., Shafiei, G., Tremblay, C. et al. Multimodal phenotypic axes of Parkinson’s disease. npj Parkinsons Dis. 7, 6 (2021). https://doi.org/10.1038/s41531-020-00144-9

Download citation

Received: 06 March 2020
Accepted: 19 November 2020
Published: 05 January 2021
DOI: https://doi.org/10.1038/s41531-020-00144-9

This article is cited by

Machine learning and deep learning approach to Parkinson’s disease detection: present state-of-the-art and a bibliometric review
- Gauri Sabherwal
- Amandeep Kaur
Multimedia Tools and Applications (2024)
Effect of probiotic supplementation on gastrointestinal motility, inflammation, motor, non-motor symptoms and mental health in Parkinson’s disease: a meta-analysis of randomized controlled trials
- Jong Mi Park
- Sang Chul Lee
- Yong Wook Kim
Gut Pathogens (2023)
Domain adapted brain network fusion captures variance related to pubertal brain development and mental health
- Dominik Kraft
- Dag Alnæs
- Tobias Kaufmann
Nature Communications (2023)
Targeting Nrf2 signaling pathway and oxidative stress by resveratrol for Parkinson’s disease: an overview and update on new developments
- Mohammad Yasin Zamanian
- Rosario Mireya Romero Parra
- Gholamreza Bazmandegan
Molecular Biology Reports (2023)
Multivariate investigation of aging in mouse models expressing the Alzheimer’s protective APOE2 allele: integrating cognitive metrics, brain imaging, and blood transcriptomics
- Hae Sol Moon
- Ali Mahzarnia
- Alexandra Badea
Brain Structure and Function (2023)