Exploiting macro- and micro-structural brain changes for improved Parkinson’s disease classification from MRI data

Camacho, Milton; Wilms, Matthias; Almgren, Hannes; Amador, Kimberly; Camicioli, Richard; Ismail, Zahinoor; Monchi, Oury; Forkert, Nils D.

doi:10.1038/s41531-024-00647-9

Download PDF

Article
Open access
Published: 26 February 2024

Exploiting macro- and micro-structural brain changes for improved Parkinson’s disease classification from MRI data

npj Parkinson's Disease volume 10, Article number: 43 (2024) Cite this article

943 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Parkinson’s disease (PD) is the second most common neurodegenerative disease. Accurate PD diagnosis is crucial for effective treatment and prognosis but can be challenging, especially at early disease stages. This study aimed to develop and evaluate an explainable deep learning model for PD classification from multimodal neuroimaging data. The model was trained using one of the largest collections of T1-weighted and diffusion-tensor magnetic resonance imaging (MRI) datasets. A total of 1264 datasets from eight different studies were collected, including 611 PD patients and 653 healthy controls (HC). These datasets were pre-processed and non-linearly registered to the MNI PD25 atlas. Six imaging maps describing the macro- and micro-structural integrity of brain tissues complemented with age and sex parameters were used to train a convolutional neural network (CNN) to classify PD/HC subjects. Explainability of the model’s decision-making was achieved using SmoothGrad saliency maps, highlighting important brain regions. The CNN was trained using a 75%/10%/15% train/validation/test split stratified by diagnosis, sex, age, and study, achieving a ROC-AUC of 0.89, accuracy of 80.8%, specificity of 82.4%, and sensitivity of 79.1% on the test set. Saliency maps revealed that diffusion tensor imaging data, especially fractional anisotropy, was more important for the classification than T1-weighted data, highlighting subcortical regions such as the brainstem, thalamus, amygdala, hippocampus, and cortical areas. The proposed model, trained on a large multimodal MRI database, can classify PD patients and HC subjects with high accuracy and clinically reasonable explanations, suggesting that micro-structural brain changes play an essential role in the disease course.

Prasinezumab slows motor progression in rapidly progressing early-stage Parkinson’s disease

Article Open access 15 April 2024

Neurofilaments as biomarkers in neurological disorders — towards clinical application

Article 12 April 2024

Native-state proteomics of Parvalbumin interneurons identifies unique molecular signatures and vulnerabilities to early Alzheimer’s pathology

Article Open access 01 April 2024

Introduction

Parkinson’s disease (PD) is a severe and heterogeneous progressive disease recognized as the second-most common neurodegenerative disorder. It affects 2–3% of the population aged 65 years or older¹ with an estimate of seven to ten million people worldwide diagnosed with the disease². PD is pathophysiologically characterized by the loss of dopaminergic neurons in the substantia nigra and the accumulation of alpha-synuclein aggregates within cells³. As the disease progresses, this accumulation can spread to wider regions of the cortex⁴. Despite the improving understanding of the disease, the exact causes of PD are still not well understood. Currently, PD is mainly diagnosed following clinical guidelines that involve assessing the presence of bradykinesia (slowness of movement) as well as at least one of the other primary motor symptoms such as tremor, rigidity, and postural instability⁵. Additional motor and non-motor symptoms also contribute to the overall clinical picture⁶. However, diagnostic accuracy based on clinical guidelines can vary from 73.8 to 83.9%, depending on the neurologist’s experience^7,8. Therefore, ongoing research is aiming to identify accurate and reproducible biomarkers for PD diagnosis. Neuroimaging techniques, such as magnetic resonance imaging (MRI) and positron emission tomography, have been studied extensively for this purpose due to their ability to capture the structure and function of the brain with high sensitivity and resolution. For example, T1-weighted MRI has shown potential capturing macro-structural changes in the brain^{9,10,11,12,13,14}. In the context of micro-structural changes, diffusion-tensor MRI (DTI) allows for in-vivo characterization of diffusivity in brain tissues through its sensitivity to the Brownian motion of water molecules. Water tends to diffuse mainly along the axons so that DTI can be employed to visualize white matter tracts throughout the brain¹⁵. This property makes it particularly advantageous for studying white matter integrity in different neurological diseases¹⁶, but may also be used to uncover potential micro-structural changes of gray matter structures in neurodegenerative diseases like PD¹⁷. The main DTI parameters are mean diffusivity (MD), which measures the degree of tissue water diffusivity, fractional anisotropy (FA), axial diffusivity (AD), and radial diffusivity (RD), which are related to the axonal and myelin structure^18,19,20. Nevertheless, visual identification of such macro- and micro-structural changes associated with PD, especially in the early disease phase, can be challenging and time-consuming, even for the most experienced neuroradiologist⁷.

Supervised machine learning approaches seek to automatically detect relevant patterns within a high-dimensional feature space. They do this by learning from the training data provided and creating an association with the ground truth labels provided as a reference. These patterns can then be utilized to classify new, previously unseen cases²¹. Machine learning models have been created for many clinical tasks by leveraging high-dimensional data for individual-level classification²². Following this principle, machine learning models have also been developed for PD classification making use of macro-structural neuroimaging data with accuracy levels above 70%^{13,14,23,24,25,26,27,28,29,30,31}. Similarly, micro-structural information extracted from DTI has also been explored for this purpose (77.44–97.7% accuracy)^{13,30,31,32,33,34,35,36,37}. So far, only two previous studies investigated the benefit of using both, micro- and macro-structural MRI, for PD detection with machine learning^13,30, achieving accuracies of 92.3 and 95.1%. Most of the previous studies for both uni- and multimodal MRI only used small sample sizes and followed a traditional machine learning approach, which typically includes image preparation through hand-crafted preprocessing techniques, feature extraction, and training and testing a classical machine learning model like a support vector machine and elastic-net regression, which are typically evaluated using a cross-validation strategy. A significant drawback of these techniques is that the features must be explicitly designed before training, which could result in missing crucial features. Convolutional neural networks (CNN), a special type of deep learning network, overcome this problem by automatically learning a representation of the input data with multiple levels of abstraction through an iterative training process. However, CNNs typically need large amounts of data to optimize these models without overfitting, which may be one of the reasons that there are no CNN models for PD classification making use of multimodal data to date.

In summary, previous uni- and multimodal studies described have been only tested and evaluated using a rather small number of PD patients and healthy participants ranging from 40 to 374 subjects, using data that is imbalanced and collected within a single study. Additionally, most previous machine learning models failed to provide explanations for their model’s predictions, which is crucial for building trust in computer-aided diagnosis systems and also to aid biomarker discovery.

To overcome these limitations, we collected the largest balanced multimodal MRI dataset containing both T1-weighted and DTI scans from PD patients and matched healthy controls from multiple studies, resulting in a total of nearly 1300 subjects included. Briefly described, for morphological analysis of the brain tissue, the T1-weighted MRI data were affinely and non-linearly registered to an atlas. The non-linear transformation was consequently used to compute the corresponding log-Jacobian maps, which encode volumetric differences between the atlas and a subject’s brain at a voxel level. Parametric maps of MD, FA, AD, and RD were computed based on the DTI datasets and also non-linearly registered to an atlas. The resulting six imaging maps, namely the affinely registered T1-weighted images (Aff), the log-Jacobian maps (Jac), and the six DTI parametric maps (MD, FA, AD, and RD) together with age and sex as parameters were used to train and evaluate a 3D CNN model (CNN_COMBINED) for PD classification. For comparison purposes, six additional CNN models were trained using the single imaging maps only (CNN_AFF, CNN_JAC, CNN_MD, CNN_FA, CNN_AD, CNN_RD), one additional CNN model using all DTI parametric maps but none of the T1-derived imaging maps (CNN_DTI), as well as one classical random forest machine learning model using image-derived phenotypes from the T1-weighted and DTI data (RF_COMBINED). All machine learning models were trained using the same 75%/10%/15% train/validation/test split stratified by diagnosis, sex, age, and study. Saliency maps were used to explain the predictions by highlighting the regions that were most influential for the classification of an individual patient.

Results

Evaluation of models

The prediction metrics in terms of area under the receiver operating characteristic curve (ROC-AUC), specificity, and sensitivity for all trained models are provided in Table 1. The model using the combination of all input imaging maps (CNN_COMBINED) achieved the overall best results (0.89 ROC-AUC, 80.8% accuracy, 82.4% specificity, and 79.1% sensitivity) with respect to the ROC-AUC metric. Therefore, this model was chosen to generate the saliency maps for further post-hoc analysis using the cortical and subcortical Harvard-Oxford atlas brain regions^38,39,40,41 (HO). Overall, the CNN_COMBINED model performed significantly better than the models using only the information from the T1-weighted MRI data (CNN_{JAC and} CNN_AFF, p < 0.05). However, it did not perform significantly better than the individual DTI (CNN_{FA, MD, RD, and AD}) and random forest (RF_COMBINED) models.

Table 1 Table of results

Full size table

Explainability analysis

Figure 1 shows the saliency maps derived from CNN_COMBINED classifications for the correctly diagnosed PD patients in the test set. The highlighted regions across the multiple input maps represent the regions used by the model to produce a classification, whereas a higher saliency value represents higher relevance for the model.

**Fig. 1: Average saliency maps of the correctly classified PD patients overlayed on the PD25 atlas for the independent input maps.**

Figure 2 shows a heatmap of the average intensity and volume covered by the saliency maps for each atlas region across the multiple input images of the sub-cortical and cortical Harvard-Oxford brain atlases. Overall, FA was found to be the most salient image feature, followed by log-Jacobian morphological image feature. The brain regions identified as most important by the CNN_COMBINED model included subcortical regions like the brainstem, bilateral thalamus, right amygdala, right hippocampus, and cortical regions including the occipital cortex, supramarginal gyrus, postcentral and precentral gyrus, temporal cortex, parahippocampal gyrus, angular gyrus, parietal operculum cortex, superior parietal lobe, and subcallosal cortex. For the random forest machine learning model, the most important features selected by the RELIEFF feature selection method are generally in line with the results of the saliency maps computed from the CNN_COMBINED and are shown in Supplementary Table 1.

**Fig. 2: Heatmap of importance for each region of the Harvard-Oxford atlases. Intensity mean values are represented according to the color in each rectangle.**

Figure 3 shows the differences in mean intensity values when comparing early PD patients (Parkinson’s Progression Markers Initiative [PPMI] patients in the test set) and non-early PD patients (rest of the test set). Here, it is possible to identify a pattern with early PD patient’s saliency maps having slightly higher activations in the DTI maps when compared to the PD patients in later disease stages. The opposite can be observed for the later disease stage patients’ activations, which are slightly higher for morphological features derived from the T1-weighted scans when compared to the activations of early patients.

**Fig. 3: Heatmap of mean intensity importance difference for each region of the Harvard–Oxford atlases.**

Discussion

The main contribution of this study is the development of an accurate and explainable deep learning model for the image-guided diagnosis support of Parkinson’s disease based on high-resolution, multimodal MRI data. This model was trained and tested using one of the biggest multi-study databases comprising macro-structural (T1-weighted MRI) and micro-structural (DTI) information from PD patients and HC subjects. Saliency map explanations were then computed to identify the most important regions from each input map for the decision task.

In the present study, the proposed multimodal deep learning model (CNN_COMBINED) performed better than all uni-modal CNN models and the classical machine learning model, although not reaching significance in all cases. Deep learning outperforming traditional machine learning approaches is expected and in line with the current literature^{13,23,24,26,27,28,29,30,33,34,35,37}. The improved performance may be partly explained by the ability of CNN models to automatically extract the important features from the complete set of 3D input maps, and thus learning from every voxel in the image. In contrast, manual feature engineering may cause important information to get missed in the process. Although it is possible that more sophisticated feature engineering techniques could have benefitted the performance of the classical machine learning model, it is unlikely that adding more and other brain regions would have boosted the comparison method beyond the accuracy of the deep learning method. The significantly better results of the CNN_COMBINED model compared to the models trained using information from the T1-weighted data only are likely directly due to the inclusion of DTI maps, which were found to be more important for the task of PD prediction in the combined model. However, it is also possible that some participants are easier to differentiate with information from certain input maps but not from others. This could also explain how the model including all available maps achieved higher performance than the models trained with the best individual map FA maps (i.e., CNN_FA), highlighting the benefits of training a model that combines macro- and micro-structural imaging. Overall, the models trained using micro-structural information performed better than the models trained using macro-structural information only. This implies that the models can identify important micro-structural differences, which are not present in macro-structural data. Nevertheless, considering that T1-weighted images are part of virtually any MRI protocol, it appears reasonable to still include them as inputs of the model, even if the enhancements they offer are modest at a group level. Therefore, this study investigates the use of a combination of micro- and macro-structural information as a multi-channel input to train a CNN for PD classification.

In the context of deep learning, a simple and accurate model is also typically assumed to be more generalizable and easier to explain, which is particularly important for a multimodal approach where data complexity is increased. The proposed deep learning model is relatively lightweight, containing only 2,982,369 trainable parameters. In contrast, larger CNN models or ensembled approaches ranging from 16 to 89 million trainable parameters (not to mention trained with far smaller sample sizes) as previously described in literature^24,27,35,37 could lead to less interpretable results and are more vulnerable to overfitting.

The prediction performance of the proposed model on unseen data is in the range of previously reported accuracies based on macro-structural brain data, micro-structural brain imaging, and a combination of both (ranging from 71.5% to 100%; accuracy reported due to the lack of AUC in all works)^{13,23,24,26,27,28,29,30,33,34,35,37,42}. While accuracy levels close to 100% in previous works seem impressive, some of these studies have not been peer-reviewed, so that the results described should be taken with a grain of salt, while in other cases, such high accuracies were achieved on very small and imbalanced datasets. This result inflation is a well-known challenge in neuroimaging research, which should be considered^43,44. In contrast, the machine learning models created for this study were trained on a much richer and balanced database of Parkinson’s disease patients and healthy control subjects using well-established and common pipelines for image registration and DTI map calculation. The results achieved on the diverse test set (which also contained data from the eight different studies) with robust processing pipelines suggest that our model generalizes well to unseen data. Moreover, the complexity of the datasets used for this study resembles that of a current real clinical scenario where it is essentially impossible to collect highly homogeneous data. This data diversity can enhance the generalizability of our classifier when applied in various clinical contexts. Nevertheless, to address potential biases resulting from dataset imbalances between studies, we applied weights to the loss function during training. Thus, our model stands out as an important step towards a computer-aided diagnosis tool for PD because of its ability to identify micro- and macro-structural differences between the groups despite the heterogeneity of the data.

This study applies methods from the explainable artificial intelligence domain to complex CNN models for PD classification from multimodal data. CNN models are often considered “black box” models, which suggests that it is challenging to understand how they arrive at their predictions. Through explainable techniques, we can overcome the currently existing clinical adoption barrier in precision medicine. In this context, the developed model could be employed by medical professionals to evaluate the validity of the models’ predictions and use the generated saliency maps for multimodal biomarker identification.

In general, the salient brain areas found in this study are consistent with our existing understanding of Parkinson’s disease and how it affects the brain, both at a micro- and macro-structural level. More in detail, wide-spread morphological changes were identified in subcortical and cortical regions as important by our proposed combined model (CNN_COMBINED). The model also gave high importance to the FA DTI maps. This finding is in line with multiple previous studies that identified significant brain changes associated with Parkinson’s disease in FA maps^36,37,45,46. Jacobian maps were given the second highest importance, despite not leading to such high classification accuracies when used without the DTI data, emphasizing the importance of a multimodal classification approach. The associated subcortical regions included the brainstem, thalamus, amygdala, and hippocampus. The brainstem is considered a vulnerable structure, which is known to be affected very early in neurodegenerative diseases like PD⁴⁷. The thalamus is known to suffer a 40–50% neuronal loss in PD patients⁴⁸, which could explain its appearance in the saliency maps. Nevertheless, how this contributes to the pathology is not yet clearly understood⁴⁹. Changes in the amygdala and hippocampus have also been reported in the literature^{50,51,52,53,54} and have been linked to non-motor symptoms of Parkinson’s disease, which can present early in the disease. The occipital cortex (occipital pole, lateral inferior and superior divisions, and cuneus cortex), temporal cortex (temporal pole, temporal fusiform cortex, and parahippocampal gyrus anterior and posterior divisions), parietal cortex (supramarginal anterior and posterior gyrus, postcentral and precentral gyrus, angular gyrus, superior lobule, operculum and precuneus cortex), and subcallosal cortex were identified as the most important cortical regions by our model. These findings are also in line with current literature. For example, micro- and macro-structural differences of the temporal cortices in PD compared with HC have been reported⁵⁵. Moreover, macro-structural differences of the occipital and parietal cortices have been found in previous studies⁵⁶. Additionally, micro-structural changes of the subcallosal cortex have been linked to non-motor symptoms in PD⁵⁷. While the brain atlas regions used to analyze the saliency maps include two labels for the white matter in each hemisphere, aiding in the identification of important regions. However, these relatively large regions may not be specific enough for detailed post-hoc analysis of white matter tracts, an aspect that could be more thoroughly explored in future research. Nevertheless, due to the voxel-wise nature of the CNN model, it is able to use detailed data from white matter regions. More generally, the brain areas that our model identified as important are consistent with Braak’s staging model⁴, which proposes that alphasynuclein deposition moves in ascending manner, starting in the brainstem and moving through the subcortex (i.e., subcortical regions) and mesocortex (parahippocampal gyrus) to ultimately the neocortex (making up to 90% of the human cortex).

Model explanations were independently computed for early (i.e., PPMI study) and non-early patients in the test set to investigate how each imaging modality affected their predictions. We found a general increase in importance of the DTI maps, in particular FA, and a decrease of importance for the morphological T1-weighted image information for the early patients from the test set as depicted in Fig. 4. This is in line with the current clinical understanding that micro-structural changes precede macro-structural alterations in neurodegenerative diseases^58,59. The early participants were selected from PPMI, which is a multicenter study collected without a harmonized imaging protocol. Given the large number of centers contributing data to PPMI, it is unlikely that the differences found are simply due to site differences or biases.

Many of the brain regions identified by the CNN model were also found to be important by the RELIEFF feature selection technique (e.g., brainstem, thalamus, amygdala, subcallosal cortex, postcentral and precentral gyrus, as well as multiple regions from the temporal and occipital lobes). Nonetheless, the reduced performance when compared to the CNN_COMBINED, demonstrates how CNN models can extract more meaningful micro- and macro-structural changes than those extracted through feature engineering.

Some limitations of this work need to be mentioned. First, although using data from multiple centers generally improves the reliability of the findings and the generalizability of the model, the retrospective nature of the data collection also results in differences in the specific inclusion/exclusion criteria, psychiatric co-morbidities, cognitive impairment (including dementia), and overall PD diagnosis criteria. We mitigated these effects by including approximately the same proportion of subjects with respect to group, age, and sex from each of the included studies whenever possible as part of the training, validation, and testing splits. Furthermore, a weight was given to each subject during training to help mitigate negative effects of an imbalanced dataset by putting more weight on individual samples from underrepresented studies/groups (i.e., the PD patients from the study with less PD subjects would have a higher weight and vice versa). Nevertheless, additional thorough prospective research is needed to validate the proposed model placing special emphasis on the training weights and their effect on model performance. Additionally, while we rigorously and visually inspected all data for motion artifacts and excluded subjects with visible corruptions, we cannot fully rule out that more subtle motion artifacts survived our quality control. Due to the retrospective nature of this study, it was not possible to correct for motion prospectively or quantify any head motion during image acquisition so that we were limited to the manual quality control. Thus, the classifier developed may also make use of small motion artefacts that could affect the PD cohort more because of the tremor. It should also be mentioned that no advanced data harmonization approaches were utilized to remove potential site- or scanner-specific image data biases, which may still affect the otherwise quantitative DTI maps and log Jacobian maps to some extent. It is also important to highlight that the use of a combined multimodal approach can lead to scalability challenges since more time is required to preprocess and quality control the data for each imaging modality available to every individual subject. However, the pipeline used in this study is well-established and was found to be robust. Additionally, higher data dimensionality inherently results in more complex models, which can be harder to train and optimize, requiring additional computational resources. Thus, we selected a lightweight CNN, which was able to identify clinically reasonable important features and be explainable without compromising performance.

In conclusion, the present study demonstrates that multimodal MRI can be used and is beneficial to develop an explainable computer-aided diagnosis model based on a lightweight 3D CNN for Parkinson’s disease prediction. This model was trained using pre-processed, paired micro- and macro-structural MRI data from eight separate imaging studies, resembling real clinical scenario data. The model achieved performance in the range of literature values while providing clinically sound explanations for its predictions. The proposed model is not meant to replace but rather complement the decision-making process at the discretion and interpretation of experienced physicians²².

Methods

Datasets and subjects

Data were obtained from eight different studies comprising 632 PD patients and 668 control subjects (HC) (1300 participants in total). The studies included were the Parkinson’s Progression Markers Initiative (PPMI [www.ppmi-info.org]), the Canadian Consortium on Neurodegeneration in Aging (CCNA) COMPASS-ND⁶⁰, PD-MCI Calgary⁶¹, PD-MCI Montreal⁶² (Longitudinal Study on Mild Cognitive Impairment in Parkinson’s Disease), Montreal Neurological Institute’s Open Science Clinical Biological Imaging and Genetic Repository (C-BIG [https://www.mcgill.ca/neuro/open-science/c-big-repository]), Hamburg dataset⁶³, United Kingdom Biobank (UK Biobank [https://pan.ukbb.broadinstitute.org]), and healthy subjects only from the Alzheimer’s Disease Neuroimaging Initiative (ADNI [adni.loni.usc.edu]). For this secondary data use study, we included all participants that met the following criteria: (1) diagnosis of either PD or HC according to each individual study, (2) availability of both T1-weighted and DTI datasets from a 3T scanner (earliest available in case of longitudinal studies like PPMI, and only HC from ADNI with an age range of 65 ± 10 to balance the dataset), and (3) both preprocessed scans (T1-weighted and DTI scans) passing our quality control checks (See Preprocessing Section). The final database after preprocessing and quality controls included 1264 subjects (611 PD and 653 HC). Table 2 contains demographics, clinical, and diagnostic criteria information of the participants used for the present study. Supplementary Table 2 contains information regarding the inclusion and exclusion criteria of the different studies. All individual studies obtained ethical approval from their local ethics committee and ensured that all participants provided written informed consent following the guidelines set forth in the declaration of Helsinki. Moreover, the current study received approval from the Conjoint Health Research Ethics Board at the University of Calgary.

Table 2 Demographics of subjects included in the study

Full size table

MRI acquisition

MRI images were acquired using a variety of scanners and sequence parameters across the eight studies. All images were acquired from a range of MRI scanner manufacturers but all using a 3T magnetic field (Siemens [for seven studies], GE [for four studies], and Phillips [for two studies]). The T1-weighted images had varying slice and in-slice resolution, ranging from 0.94 to 2.0 mm, and most of the images were obtained in the sagittal plane. DTI scans had a slice thickness and in-slice resolution ranging between 2–5 and 1.875–2 mm, respectively; b values ranged from 0 to 2000; and number of directions of acquisition ranged from 15 to 114. More information about the imaging protocols used in each study is provided in Supplementary Table 3 in the Supplementary material.

Preprocessing

Stringent quality control checks were performed on the raw data prior to preprocessing, these were performed through visual inspection to exclude MRI scans affected by movement, aliasing, ghosting, and other effects.

The three steps of the data pre-processing are summarized in Fig. 4. First, the TractoFlow pipeline⁶⁴ was used for processing of the DTI data, including skull stripping, denoising, Eddy current correction, N4 bias correction, resampling to an isotropic resolution of 1 mm with linear interpolation, and computation of the four DTI metrics (i.e., FA, MD, RD, and AD). Similarly, individual T1-weighted images were brain extracted, denoised, N4 bias corrected, and resampled to have 1 mm isotropic voxels. Additionally, the extracted individual DTI metrics (using the B0 and FA together as the moving images) were non-linearly registered (Φ_x in Fig. 4) to the preprocessed T1-weighted MRI image (fixed image) of the same individual using ANTs⁶⁵ (Fig. 4a). Briefly explained, this process included aligning the DTI datasets to the patient’s preprocessed T1-weighted scan using an affine transformation in the first step. This transformation was then used to initialize a non-linear registration⁶⁵ as a second step. At this point, all transformed images were visually checked to identify any misaligned datasets. Next, the preprocessed T1-weighted image (moving image) was non-linearly registered (Φ_y in Fig. 4) to the Montreal Neurological Institute (MNI) PD25-T1-MPRAGE-1mm brain atlas (fixed image). This registration step was also subject to a visual inspection. Subsequently, both previous transformations (Φ_x and Φ_y) were used to non-linearly transform all four extracted DTI parameter maps from the patient space into the PD25 atlas space. The affine part of the second transformation (Φ_y aff) was used to affinely transform the preprocessed T1-weighted scan in the patient space to the PD25 atlas space. Additionally, the displacement fields that resulted from this registration (Φ_y) were used to compute the corresponding log-Jacobian maps using ANTs⁶⁶. Briefly explained, log-Jacobians maps represent how much the volume differs between the atlas and individual preprocessed T1-weighted datasets for each voxel in the atlas space. They are symmetrical around zero with negative values indicating a decrease in volume and positive values indicating an increase in volume with respect to the atlas, which makes them well-suited for training CNNs to capture macro-structural brain information as we have shown in our previous work¹⁴. The MNI PD25 brain mask was used to remove any undesired volume changes outside of the brain (background voxels) induced by ANTs’ regularizer. To finalize step two, the DTI metrics in the atlas space, the affinely registered T1-weighted scans, and the log-Jacobian maps were all cropped to 160 × 192 × 160 voxels to remove unnecessary background data and reduce the data size for the CNN setup. Lastly, the inverse transformation between the individual T1-weighted scans and the PD25 atlas (Inv Φ_y) was used to warp the cortical and subcortical HO brain regions^38,39,40,41 from the PD25 atlas space to the individual T1-weighted MRI space with the nearest-neighbor interpolation. The HO subcortical atlas includes 21 brain areas, such as the thalamus, hippocampus, and brainstem, and the HO cortical atlas is comprised of 48 brain regions like the subcallosal cortex, frontal cortex, and planum-polare. Each of these regions in subject space was used to compute the corresponding volume and to determine the median MD, FA, RD, and AD values in those regions. To account for different global head sizes, the volumes of the HO brain regions were normalized with the individual’s intracranial volumes (i.e., each structure’s volume/intracranial volume) calculated from every patient’s brain mask created during step two. Median DTI values were selected over mean values to account for potential non-normal distributions and partial volume effects at the border of brain structures. The cortical and subcortical Harvard-Oxford atlas brain regions were only used for the analysis of the saliency maps and for the baseline comparison model using a classical machine learning model but are not required for the training and testing of the deep learning model described below.

Deep learning and machine learning models

The simple fully convolutional neural (SFCN) network⁶⁷ served as the basis for the deep learning model trained and optimized in this study. This model has obtained state-of-the-art performance in multimodal adult brain age prediction and sex classification using T1-weighted MRI datasets^68,69. Additionally, the SFCN model has been effectively combined with techniques for computing saliency maps for a variety of applications^14,68,69. Shortly described, the proposed architecture contained two input layers, one 4D input layer with dimensions of 160 × 192 × 160 × 6 for the input imaging maps (all four 3D DTI parameter maps, the affinely registered 3D T1-weighted scan, the 3D log-Jacobian map) and a second input layer for age and sex; five convolutional blocks, each including a 3D convolutional layer with a 3 × 3 × 3 kernel (same padding), batch normalization, a 2 × 2 × 2 max pooling (same padding), and ReLU activation. A 3D convolutional layer with a 1 × 1 × 1 kernel (same padding), a batch normalization layer, a 3D average pooling layer (same padding), and ReLU activation made up the sixth convolutional block. The individual filter sizes for each convolutional layer were 32, 64, 128, 256, 256, and 64. The seventh and last block contained a dropout layer with a rate of 0.2, the output was flattened and concatenated with age and sex. The result was then passed to an additional dense layer with 16 units and ReLU activation before being transferred to a single classification node with sigmoid activation.

The data were split into 75% training, 10% validation, and 15% testing stratified by age, sex, group (HC and PD), and data origin (study) in each split. This was done to minimize the risk of adding data biases during training and to counteract data imbalances in the individual studies. The proposed model was trained from scratch in an end-to-end fashion and by utilizing all four computed 3D DTI maps (FA, MD, RD, AD), the affinely registered 3D T1-weighted scans, the 3D log-Jacobian map, and age and sex (CNN_COMBINED) with randomly initiated weights through 110 epochs following an early stopping condition determined by the model’s performance based on the validation set. The atlas region information was not used in the CNN at any time. The Adam optimizer⁷⁰ was used to optimize a binary cross entropy loss during training. The loss was weighted so that each study received equal weights in terms of data size disparities and patient distribution. These weights were specified as 1 minus the proportion of participants from a particular group (PD or HC) that each study contributed to the total number of datasets in that group (e.g., PPMI gave 222 PD patients, hence the weight was = 1 − (222/611)). In addition, the learning rate was empirically determined to be 0.01, with a beta1 parameter of 0.95, an epsilon parameter of 0.001 for the Adam optimizer, and a batch size of 4. All experiments in this study were implemented on a GeForce RTX 3090 using TensorFlow⁷¹.

Eight other models, including deep learning and classical machine learning methods, were created for comparison purposes. The first comparison model was trained to investigate how well a CNN model using the 3D DTI maps alone (CNN_DTI) but with age and sex as additional features can identify patients with PD. Moreover, we trained six more CNN models, one for each DTI map alone (CNN_{FA, MD, RD, and AD}), one using the preprocessed T1-weighted data only, and one using the log-Jacobian maps (CNN_AFF and CNN_JAC). Each of these models also included age and sex as additional features. These six models were trained using the same data splits, architecture, and validation techniques used for the CNN_COMBINED. The baseline models were tuned using independent hyperparameters, which can be found in Table 3. Finally, a classical random forest machine learning model⁷² was trained using the extracted tabulated data to enable a comparison with previously developed classical machine learning models^{13,23,26,28,29,30,33,34}. The random forest model was trained using volume and median DTI metric values for each anatomical region in the Harvard-Oxford atlases (RF_COMBINED). This model was optimized with the same data splits as the CNN models described above, a batch size of 100, and 200 trees. RELIEFF feature selection with ten nearest neighbors⁷³ was used to rank the features based on their importance using the training set to determine the most predictive features, thereby restricting the amount of redundant and non-informative parameters. Iteratively, the least significant feature was removed, the model was re-run, building a new model each time a feature was discarded until only two features remained. RELIEFF was chosen for this task because of its ability to detect non-linear correlations between features and the output class. The model with the best performance within the RF_COMBINED framework was selected to serve as an optimized baseline model representing the combination of traditional feature engineering and machine learning models.

Table 3 Hyperparameters for baseline models

Full size table

Saliency map calculation and region of importance identification

Saliency maps were created for the best-performing CNN model (based on the ROC-AUC) to determine which areas of the brain and imaging parameter contributed the most to the classification of HC and PD. These saliency maps were generated using the SmoothGrad method⁷⁴ for each correctly predicted participant from the test set using CNN_COMBINED, which performed best overall for the test set (see Results Section). Briefly explained, this method adds random noise from a Gaussian distribution to each test dataset and feeds the noisy images into the trained model. The approach then computes and backpropagates the loss function’s associated partial derivative with respect to the noisy input picture. Consequently, each voxel is allocated one value that indicates its relevance to the model’s output. To smoothen the saliency maps, this method is repeated 25 times with a noise standard deviation of 0.01. The intensities of the saliency maps were then min–max normalized to a range of 0 to 1, and the resulting maps were averaged. To acquire a better understanding of the model’s general behavior, a single population saliency map was constructed for each modality by averaging the subject-specific saliency maps across all correctly classified test subjects. The lower voxel intensity values of the average maps were removed with a 0.5 threshold to focus the analysis on the most important highlighted regions.

The Harvard–Oxford cortical and subcortical atlas brain areas (see Preprocessing Section) were utilized to quantify the computed average saliency maps and determine the most relevant regions for classifying Parkinson’s disease patients across the different MRI modalities. Aside from this post-hoc analysis, the atlas regions were not used in any stage of the model training or testing. The volume proportion of each Harvard-Oxford atlas region covered by the thresholded average saliency maps, as well as the corresponding mean saliency map values were computed and explored for each imaging input map for this purpose. Lastly, micro-structural changes of the brain are known to precede macro-structural changes¹³. Thus, the saliency maps of early patients (PPMI patients in the test set) and non-early patients (rest of the test set) were also computed and analyzed to investigate if DTI metrics are more important for early-onset PD patients.

Evaluation metrics

The area under the receiver operating characteristic curve was used to evaluate the performance of each trained classifier, which is sensitive to the trade-off between true and false positive rates²¹. Additional metrics including accuracy, specificity, and sensitivity based on a probability threshold (0.5) were also computed as secondary evaluation metrics. Additionally, a paired t-test was also conducted to establish statistical significance between the models’ predictions, whereas a p value of 0.05 was considered statistically significant.

Data availability

Openly available datasets that support the findings of this study were obtained from the Parkinson’s Progression Markers Initiative (PPMI) database (www.ppmi-info.org/data), the C-BIG repository (https://www.mcgill.ca/neuro/open-science/c-big-repository), and the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The ADNI was launched in 2003 as a public–private partnership, led by principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD). For up-to-date information, see www.adni-info.org. The other studies included in this work retain ownership of their scans and are thus not openly available.

Code availability

The underlying code for this study will be made publicly available to facilitate reproducibility (https://github.com/miltoncamachocamacho/PD-SFCNN).

References

Poewe, W. et al. Parkinson disease. Nat. Rev. Dis. Prim. 3, 1–21 (2017).
Google Scholar
Yang, J., Burciu, R. G. & Vaillancourt, D. E. Longitudinal progression markers of Parkinson’s disease: current view on structural imaging. Curr. Neurol. Neurosci. Rep. 18, 1–11 (2018).
Article CAS Google Scholar
Fearnley, J. M. & Lees, A. J. Ageing and Parkinson’s disease: substantia nigra regional selectivity. Brain 114, 2283–2301 (1991).
Article PubMed Google Scholar
Braak, H. et al. Staging of brain pathology related to sporadic Parkinson’s disease. Neurobiol. Aging 24, 197–211 (2003).
Article PubMed Google Scholar
Balestrino, R. & Schapira, A. H. V. Parkinson’s disease. Eur. J. Neurol. 27, 27–42 (2020).
Article CAS PubMed Google Scholar
Bloem, B. R., Okun, M. S. & Klein, C. Parkinson’s disease. Lancet 397, 2284–2303 (2021).
Article CAS PubMed Google Scholar
Rizzo, G. et al. Accuracy of clinical diagnosis of Parkinson disease. Neurology 86, 566–576 (2016).
Article PubMed Google Scholar
Beach, T. G. & Adler, C. H. Importance of low diagnostic accuracy for early Parkinson’s disease. Mov. Disord. 33, 1551–1554 (2018).
Article PubMed PubMed Central Google Scholar
Ibarretxe-Bilbao, N. et al. Progression of cortical thinning in early Parkinson’s disease. Mov. Disord. 27, 1746–1753 (2012).
Article PubMed Google Scholar
Sarasso, E., Agosta, F., Piramide, N. & Filippi, M. Progression of grey and white matter brain damage in Parkinson’s disease: a critical review of structural MRI literature. J. Neurol. 268, 3144–3179 (2021).
Article PubMed Google Scholar
Tessa, C. et al. Progression of brain atrophy in the early stages of Parkinson’s disease: a longitudinal tensor-based morphometry study in de novo patients without cognitive impairment. Hum. Brain Mapp. 35, 3932–3944 (2014).
Article PubMed PubMed Central Google Scholar
Tsiouris, S., Bougias, C., Konitsiotis, S., Papadopoulos, A. & Fotopoulos, A. Early-onset frontotemporal dementia-related semantic variant of primary progressive aphasia: multimodal evaluation with brain perfusion SPECT, SPECT/MRI coregistration, and MRI volumetry. Clin. Nucl. Med. 47, 260–264 (2022).
Article PubMed Google Scholar
Talai, A. S., Sedlacik, J., Boelmans, K. & Forkert, N. D. Utility of multi-modal MRI for differentiating of Parkinson’s disease and progressive supranuclear palsy using machine learning. Front Neurol. 12, 546 (2021).
Article Google Scholar
Camacho, M. et al. Explainable classification of Parkinson’s disease using deep learning trained on a large multi-center database of T1-weighted MRI datasets. Neuroimage Clin. 38, 103405 (2023).
Article PubMed PubMed Central Google Scholar
Hall, J. M. et al. Diffusion alterations associated with Parkinson’s disease symptomatology: a review of the literature. Parkinson Relat. Disord. 33, 12–26 (2016).
Article Google Scholar
Hess, C. W., Ofori, E., Akbar, U., Okun, M. S. & Vaillancourt, D. E. The evolving role of diffusion magnetic resonance imaging in movement disorders. Curr. Neurol. Neurosci. Rep. 13, 1–16 (2013).
Article Google Scholar
Cochrane, C. J. & Ebmeier, K. P. Diffusion tensor imaging in Parkinsonian syndromes. Neurology 80, 857–864 (2013).
Article PubMed PubMed Central Google Scholar
Talai, A. S., Sedlacik, J., Boelmans, K. & Forkert, N. D. Widespread diffusion changes differentiate Parkinson’s disease and progressive supranuclear palsy. Neuroimage Clin. 20, 1037–1043 (2018).
Article PubMed PubMed Central Google Scholar
Shih, Y.-C., Tseng, W.-Y. I. & Montaser-Kouhsari, L. Recent advances in using diffusion tensor imaging to study white matter alterations in Parkinson’s disease: a mini review. https://doi.org/10.3389/fnagi.2022.1018017. (2023)
Mishra, V. R. et al. Influence of analytic techniques on comparing DTI-derived measurements in early-stage Parkinson’s disease. Heliyon 5, e01481 (2019).
Article PubMed PubMed Central Google Scholar
Lo Vercio, L. et al. Supervised machine learning tools: a tutorial for clinicians. J. Neural Eng. 17, 062001 (2020).
Article ADS Google Scholar
Maceachern, S. J. & Forkert, N. D. Machine learning for precision medicine. Genome 64, 416–425 (2021).
Article PubMed Google Scholar
Adeli, E. et al. Kernel-based joint feature selection and max-margin classification for early diagnosis of Parkinson’s disease. Sci. Rep. 7, 1–14 (2017).
Article Google Scholar
Esmaeilzadeh, S., Yang, Y. & Adeli, E. End-to-end Parkinson disease diagnosis using brain MR-images by 3D-CNN. arXiv preprint arXiv:1806.05233 (2018).
Adeli, E. et al. Joint feature-sample selection and robust diagnosis of Parkinson’s disease from MRI data. Neuroimage 141, 206–219 (2016).
Article PubMed Google Scholar
Amoroso, N., La Rocca, M., Monaco, A., Bellotti, R. & Tangaro, S. Complex networks reveal early MRI markers of Parkinson’s disease. Med. Image Anal. 48, 12–24 (2018).
Article PubMed Google Scholar
Chakraborty, S., Aich, S. & Kim, H. C. Detection of Parkinson’s disease from 3T T1 weighted MRI scans using 3D convolutional neural network. Diagnostics 10, 402 (2020).
Article PubMed PubMed Central Google Scholar
Cigdem, O., Yilmaz, A., Beheshti, I. & Demirel, H. Comparing the performances of PDF and PCA on Parkinson’s disease classification using structural MRI images. In: Proc. 26th IEEE Signal Processing and Communications Applications Conference, SIU 2018 1–4 (Institute of Electrical and Electronics Engineers Inc.). https://doi.org/10.1109/SIU.2018.8404697 (2018).
Solana-Lavalle, G. & Rosas-Romero, R. Classification of PPMI MRI scans with voxel-based morphometry and machine learning to assist in the diagnosis of Parkinson’s disease. Comput. Methods Prog. Biomed. 198, 105793 (2021).
Article Google Scholar
Gu, Q. et al. Automatic classification on multi-modal MRI data for diagnosis of the postural instability and gait difficulty subtype of Parkinson’s disease. J. Parkinson Dis. 6, 545–556 (2016).
Article Google Scholar
Archer, D. B. et al. Development and validation of the automated imaging differentiation in parkinsonism (AID-P): a multicentre machine learning study. Lancet Digit Health 1, e222–e231 (2019).
Article PubMed PubMed Central Google Scholar
Huang, L., Ye, X., Yang, M., Pan, L. & Zheng, S. hua. MNC-Net: Multi-task graph structure learning based on node clustering for early Parkinson’s disease diagnosis. Comput Biol. Med. 152, 106308 (2023).
Article PubMed Google Scholar
Du, G. et al. Combined diffusion tensor imaging and apparent transverse relaxation rate differentiate Parkinson disease and atypical Parkinsonism. Am. J. Neuroradiol. 38, 966 (2017).
Article CAS PubMed PubMed Central Google Scholar
Liu, T. et al. Disrupted rich-club organization of brain structural networks in Parkinson’s disease. Brain Struct. Funct. 226, 2205–2217 (2021).
Article PubMed Google Scholar
Yasaka, K. et al. Parkinson’s disease: deep learning with a parameter-weighted structural connectome matrix for diagnosis and neural circuit disorder investigation. Neuroradiology 63, 1451–1462 (2021).
Article ADS PubMed PubMed Central Google Scholar
Muñoz-Ramírez, V. et al. Subtle anomaly detection: application to brain MRI analysis of de novo Parkinsonian patients. Artif. Intell. Med. 125, 102251 (2022).
Article PubMed Google Scholar
Zhao, H. et al. Deep learning-based diagnosis of Parkinson’s disease using diffusion magnetic resonance imaging. Brain Imaging Behav. 16, 1749–1760 (2022).
Article PubMed Google Scholar
Makris, N. et al. Decreased volume of left and total anterior insular lobule in schizophrenia. Schizophr. Res. 83, 155–171 (2006).
Article ADS PubMed Google Scholar
Frazier, J. A. et al. Structural brain magnetic resonance imaging of limbic and thalamic volumes in pediatric bipolar disorder. Am. J. Psychiatry 162, 1256–1265 (2005).
Article PubMed Google Scholar
Desikan, R. S. et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage 31, 968–980 (2006).
Article PubMed Google Scholar
Goldstein, J. M. et al. Hypothalamic abnormalities in schizophrenia: sex effects and genetic vulnerability. Biol. Psychiatry 61, 935–945 (2007).
Article CAS PubMed Google Scholar
Zhang, J. Mining imaging and clinical data with machine learning approaches for the diagnosis and early detection of Parkinson’s disease. NPJ Parkinsons Dis. 8, 13 (2022).
Article PubMed PubMed Central Google Scholar
Vabalas, A., Gowen, E., Poliakoff, E. & Casson, A. J. Machine learning algorithm validation with a limited sample size. PLoS One 14, e0224365 (2019).
Article CAS PubMed PubMed Central Google Scholar
Schnack, H. G. & Kahn, R. S. Detecting neuroimaging biomarkers for psychiatric disorders: sample size matters. Front Psychiatry 7, 180068 (2016).
Article Google Scholar
Salamanca, L., Vlassis, N., Diederich, N., Bernard, F. & Skupin, A. Improved Parkinson’s disease classification from diffusion MRI data by fisher vector descriptors. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 9350 pp. 119–126 (Springer Verlag, 2015).
Zhan, W. et al. Regional alterations of brain microstructure in Parkinson’s disease using diffusion tensor imaging. Mov. Disord. 27, 90–97 (2012).
Article PubMed Google Scholar
Arribarat, G., De Barros, A. & Péran, P. Modern brainstem MRI techniques for the diagnosis of Parkinson’s disease and Parkinsonisms. Front Neurol. 11, 791 (2020).
Article PubMed PubMed Central Google Scholar
Blesa, J., Foffani, G., Dehay, B., Bezard, E. & Obeso, J. A. Motor and non-motor circuit disturbances in early Parkinson disease: which happens first?. Nat. Rev. Neurosci. 23, 115–128 (2022).
Article CAS PubMed Google Scholar
Blesa, J., Trigo-Damas, I. & Obeso, J. A. Parkinson’s disease and thalamus: facts and fancy. Lancet Neurol. 15, e2 (2016).
Article PubMed Google Scholar
Churchyard, A. & Lees, A. J. The relationship between dementia and direct involvement of the hippocampus and amygdala in Parkinson’s disease. Neurology 49, 1570–1576 (1997).
Camicioli, R. et al. Parkinson’s disease is associated with hippocampal atrophy. Mov. Disord. 18, 784–790 (2003).
Article PubMed Google Scholar
Bertrand, E. et al. Degenerative axonal changes in the hippocampus and amygdala in Parkinson’s disease. Folia Neuropathol. 41, 197–207 (2003).
PubMed Google Scholar
Foo, H. et al. Associations of hippocampal subfields in the progression of cognitive decline related to Parkinson’s disease. Neuroimage Clin. 14, 37–42 (2016).
Article PubMed PubMed Central Google Scholar
van Mierlo, T. J., Chung, C., Foncke, E. M., Berendse, H. W. & van den Heuvel, O. A. Depressive symptoms in Parkinson’s disease are related to decreased hippocampus and amygdala volume. Mov. Disord. 30, 245–252 (2015).
Article PubMed Google Scholar
Sterling, N. W. et al. Cortical gray and subcortical white matter associations in Parkinson’s disease. Neurobiol. Aging 49, 100–108 (2017).
Article PubMed Google Scholar
Nürnberger, L. et al. Longitudinal changes of cortical microstructure in Parkinson’s disease assessed with T1 relaxometry. Neuroimage Clin. 13, 405–414 (2017).
Article PubMed Google Scholar
Uhr, L., Tsolaki, E. & Pouratian, N. Diffusion tensor imaging correlates of depressive symptoms in Parkinson disease. J. Comp. Neurol. 530, 1729–1738 (2022).
Article PubMed PubMed Central Google Scholar
Müller, M. J. et al. Functional implications of hippocampal volume and diffusivity in mild cognitive impairment. Neuroimage 28, 1033–1042 (2005).
Article PubMed Google Scholar
Jung, W. B., Lee, Y. M., Kim, Y. H. & Mun, C. W. Automated classification to predict the progression of Alzheimer’s disease using whole-brain volumetry and DTI. Psychiatry Investig. 12, 92–102 (2015).
Article PubMed PubMed Central Google Scholar
Duchesne, S. et al. The Canadian dementia imaging protocol: harmonizing national cohorts. J. Magn. Reson Imaging 49, 456–465 (2019).
Article PubMed Google Scholar
Lang, S. et al. Network basis of the dysexecutive and posterior cortical cognitive profiles in Parkinson’s disease. Mov. Disord. 34, 893–902 (2019).
Article PubMed Google Scholar
Hanganu, A. et al. Mild cognitive impairment is linked with faster rate of cortical thinning in patients with Parkinson’s disease longitudinally. Brain 137, 1120–1129 (2014).
Article PubMed Google Scholar
Boelmans, K. et al. Brain iron deposition fingerprints in Parkinson’s disease and progressive supranuclear palsy. Mov. Disord. 27, 421–427 (2012).
Article PubMed Google Scholar
Theaud, G. et al. TractoFlow: a robust, efficient and reproducible diffusion MRI pipeline leveraging Nextflow & Singularity. Neuroimage 218, 116889 (2020).
Article PubMed Google Scholar
Avants, B. B., Tustison, N. J. & Johnson, H. J. ANTs by stnava. http://stnava.github.io/ANTs/ (2009).
Leow, A. D. et al. Statistical properties of jacobian maps and the realization of unbiased large-deformation nonlinear image registration. IEEE Trans. Med Imaging 26, 822–832 (2007).
Article PubMed Google Scholar
Peng, H., Gong, W., Beckmann, C. F., Vedaldi, A. & Smith, S. M. Accurate brain age prediction with lightweight deep neural networks. Med. Image Anal. 68, 101871 (2021).
Article PubMed Google Scholar
Mouches, P., Wilms, M., Rajashekar, D., Langner, S. & Forkert, N. D. Multimodal biological brain age prediction using magnetic resonance imaging and angiography with the identification of predictive regions. Hum. Brain Mapp. https://doi.org/10.1002/HBM.25805. (2022)
Stanley, E. et al. A fully convolutional neural network for explainable classification of attention deficit hyperactivity disorder. 36 (SPIE-Intl Soc Optical Eng). https://doi.org/10.1117/12.2607509 (2022).
Kingma, D. P. & Ba, J. L. Adam: a method for stochastic optimization. In: Proc. 3rd International Conference on Learning Representations, ICLR 2015—Conference Track Proceedings (International Conference on Learning Representations, ICLR). https://doi.org/10.48550/arxiv.1412.6980 (2017).
Abadi, M. et al. TensorFlow: large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467 (2016).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Kononenko, I., Šimec, E. & Robnik-Šikonja, M. Overcoming the myopia of inductive learning algorithms with RELIEFF. Appl. Intell. 7, 39–55 (1997).
Article Google Scholar
Smilkov, D., Thorat, N., Kim, B., Viégas, F. & Wattenberg, M. SmoothGrad: removing noise by adding noise. In: Proc. Workshop on Visualization for Deep Learning https://doi.org/10.48550/arxiv.1706.03825 (2017).

Download references

Acknowledgements

Image data used were provided in part by the PPMI-a, public–private partnership funded by Michael J. Fox Foundation, and by the UKBiobank (application number 77508). Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California. This work was supported by the Canadian Consortium on Neurodegeneration in Aging (CCNA), the Canada Research Chairs program, the River Fund at Calgary Foundation, the Canadian Open Neuroscience Platform (CONP), the Canadian Institutes of Health Research, and the Tourmaline Chair in Parkinson disease.

Author information

Authors and Affiliations

Biomedical Engineering Graduate Program, University of Calgary, Calgary, AB, Canada
Milton Camacho & Kimberly Amador
Department of Radiology, University of Calgary, Calgary, AB, Canada
Milton Camacho, Matthias Wilms, Kimberly Amador & Nils D. Forkert
Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB, Canada
Matthias Wilms & Nils D. Forkert
Department of Pediatrics and Community Health Sciences, University of Calgary, Calgary, AB, Canada
Matthias Wilms & Nils D. Forkert
Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
Matthias Wilms, Hannes Almgren, Zahinoor Ismail, Oury Monchi & Nils D. Forkert
Department of Clinical Neurosciences, University of Calgary, Calgary, AB, Canada
Hannes Almgren, Zahinoor Ismail, Oury Monchi & Nils D. Forkert
Neuroscience and Mental Health Institute and Department of Medicine (Neurology), University of Alberta, Edmonton, AB, Canada
Richard Camicioli
Department of Psychiatry, University of Calgary, Calgary, AB, Canada
Zahinoor Ismail
College of Medicine and Health, University of Exeter, Exeter, UK
Zahinoor Ismail
Department of Radiology, Radio-oncology and Nuclear Medicine, Université de Montréal, Montréal, QC, Canada
Oury Monchi
Centre de Recherche, Institut Universitaire de Gériatrie de Montréal, Montréal, QC, Canada
Oury Monchi

Authors

Milton Camacho
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Wilms
View author publications
You can also search for this author in PubMed Google Scholar
Hannes Almgren
View author publications
You can also search for this author in PubMed Google Scholar
Kimberly Amador
View author publications
You can also search for this author in PubMed Google Scholar
Richard Camicioli
View author publications
You can also search for this author in PubMed Google Scholar
Zahinoor Ismail
View author publications
You can also search for this author in PubMed Google Scholar
Oury Monchi
View author publications
You can also search for this author in PubMed Google Scholar
Nils D. Forkert
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

For the Alzheimer’s Disease Neuroimaging Initiative

Contributions

(1) Study design; (2) data acquisition; (3) data processing; (4) data analysis; (5) manuscript drafting; (6) manuscript revision. Milton Camacho (1,2,3,4,5,6), Matthias Wilms (1,4,5,6), Hannes Almgren (1,5), Kimberly Amador (4), Richard Camicioli (2,5), Zahinoor Ismail (2,5), Oury Monchi (1,2,5,6), Nils D. Forkert (1,4,5,6).

Corresponding author

Correspondence to Milton Camacho.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in the analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf.

Supplementary information

Suplementary Material PDF

Related Manuscript File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Camacho, M., Wilms, M., Almgren, H. et al. Exploiting macro- and micro-structural brain changes for improved Parkinson’s disease classification from MRI data. npj Parkinsons Dis. 10, 43 (2024). https://doi.org/10.1038/s41531-024-00647-9

Download citation

Received: 14 July 2023
Accepted: 22 January 2024
Published: 26 February 2024
DOI: https://doi.org/10.1038/s41531-024-00647-9