Mining imaging and clinical data with machine learning approaches for the diagnosis and early detection of Parkinson’s disease

Zhang, Jing

doi:10.1038/s41531-021-00266-8

Download PDF

Review Article
Open access
Published: 21 January 2022

Mining imaging and clinical data with machine learning approaches for the diagnosis and early detection of Parkinson’s disease

Jing Zhang ORCID: orcid.org/0000-0003-0168-411X¹

npj Parkinson's Disease volume 8, Article number: 13 (2022) Cite this article

7674 Accesses
27 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Parkinson’s disease (PD) is a common, progressive, and currently incurable neurodegenerative movement disorder. The diagnosis of PD is challenging, especially in the differential diagnosis of parkinsonism and in early PD detection. Due to the advantages of machine learning such as learning complex data patterns and making inferences for individuals, machine-learning techniques have been increasingly applied to the diagnosis of PD, and have shown some promising results. Machine-learning-based imaging applications have made it possible to help differentiate parkinsonism and detect PD at early stages automatically in a number of neuroimaging studies. Comparative studies have shown that machine-learning-based SPECT image analysis applications in PD have outperformed conventional semi-quantitative analysis in detecting PD-associated dopaminergic degeneration, performed comparably well as experts’ visual inspection, and helped improve PD diagnostic accuracy of radiologists. Using combined multi-modal (imaging and clinical) data in these applications may further enhance PD diagnosis and early detection. To integrate machine-learning-based diagnostic applications into clinical systems, further validation and optimization of these applications are needed to make them accurate and reliable. It is anticipated that machine-learning techniques will further help improve differential diagnosis of parkinsonism and early detection of PD, which may reduce the error rate of PD diagnosis and help detect PD at pre-motor stage to make it possible for early treatments (e.g., neuroprotective treatment) to slow down PD progression, prevent severe motor symptoms from emerging, and relieve patients from suffering.

A replication study, systematic review and meta-analysis of automated image-based diagnosis in parkinsonism

Article Open access 17 February 2022

Machine learning based risk prediction for Parkinson's disease with nationwide health screening data

Article Open access 14 November 2022

Identification and prediction of Parkinson’s disease subtypes and progression using machine learning in two cohorts

Article Open access 16 December 2022

Introduction

Parkinson’s disease (PD) is a common, chronic, progressive neurodegenerative movement disorder associated with the aggregation of abnormal α-synuclein in Lewy bodies and the loss of nigrostariatal dopaminergic neurons. The mechanism of neurodegeneration in PD is unclear, and currently, there is no cure for PD. The most striking symptoms of PD are motor symptoms such as tremor, rigidity, bradykinesia, or postural instability and patients with severe motor symptoms often have difficulty using their hands, or have difficulty standing and walking due to tremor and stiff muscles, which severely affects their quality of life. In addition, non-motor symptoms, such as hyposmia/anosmia (smell/olfactory loss), autonomic dysfunction, and rapid eye movement (REM) sleep behavior disorder, usually emerge years before motor symptoms, but they may be mild and are often overlooked. The diagnosis of PD is challenging, e.g., in differentiating PD from essential tremor, drug-induced parkinsonism and atypical parkinsonian disorders such as progressive supranuclear palsy (PSP), multiple system atrophy (MSA), and corticobasal degeneration (CBD). The error rate of a clinical diagnosis of PD is high. A meta-analysis reported that the error rate was 26.2% by nonexperts, and from 16.1% (for initial diagnosis) to 20.4% (for follow up diagnosis) by experts¹. Using autopsy results to evaluate PD diagnoses, Hughes et al.² found the diagnostic error rate was around 24%. Further, using neuropathologic findings of PD as the gold standard, Adler et al.³ found that the accuracy of a clinical diagnosis of PD was only 26% in untreated or medication non-responsive subjects, 53% in medication-responsive early PD (duration shorter than 5 years), and >85% in medication-responsive and longer duration PD. The high error rate in PD diagnosis may be because: (1) Clinical diagnoses of PD are mainly based on results of clinical tests and response to antiparkinsonian medication. Neuroimaging is only used as an assistance in PD diagnosis, although the clinical utility of neuroimaging such as SPECT (single photon emission computed tomography) is high and results of dopamine transporter scan (DaTscan) lead to modified diagnosis in one-third of the patients⁴; (2) Currently, there are few reliable biomarkers for PD⁵, in particular, there is no in vivo imaging tool available to directly image the accumulation of α–synuclein aggregates or the spreading of Lewy bodies in the brain of a PD patient⁶.

Another challenge in the diagnosis of PD is early detection because at early stages of PD, brain changes and symptoms are subtle. The brain regions that are most affected by PD are the basal ganglia and substantia nigra. Neurodegeneration of the basal ganglia and loss of dopaminergic neurons in the substantia nigra begin long before the presence of motor symptoms, and by the time motor symptoms emerge, 40–60% of nigral dopaminergic neurons are lost and up to 80% synaptic function is reduced^7,8. The period between the onset of neurodegeneration and the emergence of motor symptoms is called prodromal (or pre-motor) stage, which might last from several years to decades⁹. Early neuroprotective treatment can slow down neurodegeneration progression and potentially prevent clinical PD symptoms from emerging⁹. Therefore, it is important to detect PD at early stages so that early neuroprotective treatment can be effective.

Clinical assessment and analysis of PD imaging is crucial for the diagnosis of PD. At early stages of PD, loss of neurons in the brain first occurs in the ventrolateral substantia nigra pars compacta, then projects to the posterior putamen, and then to more regions in the striatum. Progressive brain atrophy has been detected on structural MRI in PD, even at early PD stages^10,11. In addition, due to the accumulation of abnormal α-synuclein aggregates in Lewy bodies and the spreading of Lewy bodies in the brain of PD patients over time (from brain stem and olfactory system, to the substantia nigra and then to neocortical regions)¹², PD may be viewed as a progressive brain network disruption¹³. Dopaminergic radiotracer imaging with SPECT or positron emission tomography (PET), a biomarker of early PD, can detect dopaminergic denervation in PD at pre-motor stage^9,14. However, dopaminergic denervation may elude from visual analysis or semi-quantitative image analysis of SPECT or PET images. Consequently, there is variability in dopamine transporter SPECT imaging interpretation between radiologists, which leads to inconsistent diagnosis. Thus, computer-aided diagnosis (CAD) based on machine-learning methods has been developed to help detect dopaminergic denervation on SPECT^{15,16,17,18,19,20} and PET^21,22,23,24 images, and to identify PD-related structural changes on MRI^25,26,27,28 for early detection of PD. Moreover, resting-state functional MRI (rs-fMRI)^{29,30,31,32,33} and diffusion tenser imaging (DTI)^34,35 have been used to identify abnormal functional and structural connectivity in PD. Further, machine-learning-based multi-modal data (including imaging and/or clinical data) analysis has been found helpful in the detection of brain abnormalities in PD^17,30,36,37.

Machine learning (ML), a group of multivariate analytic methods that learn from data, identify data patterns and classify the data, is often used in data mining and artificial intelligence. Machine learning can be either supervised (using training data labeled by humans for data classification) or unsupervised (which does not use training data, but identify data patterns on its own). Supervised learning includes methods such as linear discriminant analysis (LDA)³⁸, support vector machine (SVM)³⁹, artificial neural networks (ANNs)⁴⁰ and random forest⁴¹, while unsupervised learning includes approaches such as cluster analysis⁴². Among these methods, SVM and ANNs are frequently used machine-learning models. SVM creates a line (or a hyperplane) that best separates data into classes and provides linear (or non-linear) mapping between inputs and outputs, while ANNs (consisting of multiple layers) work in a complex and non-linear way, which do not provide direct mapping between inputs and outputs. In addition, based on ANNs, newly developed deep-learning techniques or deep-neural networks are a new set of powerful tools for data classification in PD^18,24,43.

Since each PD case is unique, diagnosis and therapy need to be tailored for individual patients to achieve the best clinical outcome. Machine-learning (ML) techniques have the potential to identify complex data patterns, automate data analysis, and make inferences/classifications for data of individual patients, which may be useful for precision medicine in PD. In recent years, machine learning has been increasingly used in the diagnosis of PD. This paper reviewed the studies that applied machine-learning methods to the diagnosis and early detection of PD in order to provide an overview of this field.

An overview of machine-learning-based studies for the diagnosis of PD

Studies applied machine-learning-based approaches to the diagnosis of PD mainly fall into three categories: 1. Discrimination between PD and Healthy control (HC); 2. Differential diagnosis; 3. Early PD detection. Machine-learning-based imaging studies using SPECT, PET, structural MRI, and functional MRI (fMRI) were summarized in Table 1, Table 2, Table 3, and Table 4, respectively (among them, Table 4 is a Supplementary Table).

Table 1 Machine-learning-based SPECT dopaminergic imaging studies for PD diagnosis and early detection.

Full size table

Table 2 Machine-learning-based PET imaging studies for PD diagnosis and early detection.

Full size table

Table 3 Machine-learning-based structural MRI studies for PD diagnosis and early detection.

Full size table

Table 4 Machine-learning-based fMRI studies for PD diagnosis and early detection.

Full size table

Machine-learning-based studies for the discrimination between PD and HC

Dopaminergic imaging

Reduced uptake of a dopamine transporter radiotracer in the striatum of a PD patient on dopaminergic imaging (SPECT or PET) indicates neuronal degeneration and dopaminergic deficit in PD. In particular, reduced uptake of [¹²³I]FP-CIT ([¹²³I]-ioflupane) (the most widely used dopamine transporter radiotracer in SPECT DaTSCAN imaging) in the striatum (putamen and caudate) helps confirm PD and exclude other disorders such as drug-induced Parkinsonism and essential tremor. Measurements of dopamine transporter binding in the striatum and the distribution of the radiotracer uptake are important to characterize dopaminergic functional deficit in PD. Semi-quantitative analysis computes measurements of dopamine transporter binding in the striatum such as striatal uptake and striatal-binding ratios, but can not capture the distribution of the radiotracer uptake (which is often perceived by experienced experts), while machine-learning methods such as artificial neural network (ANN) and support vector machine (SVM) can identify data patterns in the distribution of the radiotracer uptake on SPECT imaging.

To test the ability whether a machine-learning method can mimic expert pattern recognition skills, Acton and Newberg applied ANN to striatum images obtained from dopaminergic SPECT imaging and obtained an overall diagnostic accuracy of 94.4% (n = 81)¹⁵. Illan et al.⁴⁴ further developed an automatic computer-aided diagnostic system based on SVM (and other classifiers) for PD detection, and found that classification with SVM on striatum images performed the best (the area under the receiver-operating characteristics curve (AUC) was 0.968) (n = 108). In addition, Segovia et al.⁴⁵ used partial least square (PLS) for data dimension reduction of striatum images, classified the imaging features with SVM and obtained a classification accuracy of 94.7% (n = 95). Further, Palumbo et al.⁴⁶ used SVM to classify the uptake values in the striatal regions (accuracy: 90.6–90.7%) and reported that uptake values in the putamen are the most discriminative predictor for PD diagnosis, and adding patient age to data classification improved classification accuracy (95.6%) (n = 56).

Comparative studies between machine-learning-based analysis and semi-quantitative analysis of SPECT images have shown that computer-aided diagnosis (CAD) based on machine-learning methods such as SVM and ANN has outperformed conventional semi-quantitative analysis, reduced interpretation variability of dopaminergic transporter SPECT imaging, and improved diagnostic accuracy of PD and consistency of radiologists^15,19,20.

Further, new imaging features such as texture features have improved classification accuracy (e.g., 97.4%, n = 158)⁴⁷, and recently developed machine-learning techniques such as deep-learning convolutional neural networks (CNNs) have begun to show some promising results. For example, Choi et al.¹⁸ developed an automatic deep-learning system that applied CNNs to SPECT imaging analysis and obtained high detection rates of 96% (PPMI data, n = 431, early PD) and 98.8% (local data, n = 72, advanced PD), which was comparable to that of experts’ visual analysis and semi-quantitative analysis. The deep-learning system could also reclassify patients who were clinically diagnosed as PD, but had scans without evidence of dopaminergic deficit (SWEDD)¹⁸. Further, it has been reported that new classifiers such as enhanced probabilistic neural network and a semi-supervised-learning classifier graph-based transductive learning detected PD more accurately than SVM^37,48. In addition, to overcome the limitations of institution-specific ML software implementations, Zhang and Kagen explored the widely available Google^TM TensorFlow machine-learning software library and applied Artificial Neural network to SPECT image classification for a large sample of PD patients (n = 1171), which yielded a classification accuracy of 93.8 ± 4.7%⁴⁹. Further, Glaab et al.²¹ performed voxel-based whole-brain analysis on FDOPA PET (n = 60 PD) and FDG PET (n = 44 PD) images, classified them with SVM and random forest models, and found that FDOPA PET had (~10%) higher diagnostic performances than FDG PET. Using uptake features or texture features extracted from FDG PET data, classification yielded 70–91% accuracy^21,22,23. Further research is warranted to validate and optimize these new data features and/or new machine-learning methods to improve classification accuracy and reliability.

Structural magnetic resonance imaging (MRI)

Morphometric measurements such as brain gray matter (GM) and white matter (WM) volumes, shapes, cortical thickness, and cortical surface area in regions of interest (ROIs) such as striatum have been used as imaging features to detect progressive brain atrophy in machine-learning-based MRI imaging analysis to aid in the diagnosis of PD.

Subcortical nuclei shape analysis has revealed volume differences in the putamen and shape differences in the striatum (putamen and caudate nucleus) between PD patient group and control group, and discriminant analysis using a combination of these imaging features discriminated individual patients from controls with an accuracy of 75–83% (n = 21)⁵⁰. In another study, SVM was applied to combined MRI imaging features (GM, WM, cerebrospinal fluid (CSF) volumes, cortical thickness, cortical surface area, correlation index of cortical thickness of 78 ROIs), which distinguished patients from controls with an accuracy of 85.8% (n = 69)⁵¹. In addition, new MRI imaging features such as cerebellum shape index³⁶ or GM density feature of the cerebellum⁵² and proper classifiers are promising to improve classification accuracy. For example, classifying GM density decrease in the Crus and Vermis of the cerebellum with SVM improved the classification accuracy to 97%⁵², while classifying neuroimaging biomarkers such as cerebellum shape index, surface area, and volume of regions of interest (ROIs), as well as clinical data (e.g., UPDRS scores) with AdaBoost classifier, yielded a classification accuracy up to 98.9%³⁶.

Functional MRI (fMRI)

Reduced functional connectivity (FC) and brain activity (measured by amplitude of low-frequency fluctuation (ALFF)) in the basal ganglia network (BGN) and sensorimotor network in PD have been reported^53,54,55,56. These findings are consistent across multiple patient samples^54,57, and robust to variations in image processing methods, which suggests that resting-state fMRI (rs-fMRI) might be a biomarker for PD.

However, there are some inconsistent findings across rs-fMRI studies in PD^{58,59,60,61,62}. Machine-learning methods may help reveal the diagnostic value of rs-fMRI in PD and clarify some of the inconsistencies. rs-fMRI measurements such as FC, ALFF, and regional homogeneity (ReHo) have been used as imaging features for PD classification. For example, classifying FC, ALFF, and ReHo features with SVM yielded a classification accuracy of 74% (n = 19)³⁰; using FC features from 12 brain networks, FC classification with SVM yielded 70% accuracy (n = 80)⁶³, while using whole-brain FC features, classification with SVM achieved 93.6% accuracy (n = 21)⁶⁴. Apart from variations across data samples, these results revealed the importance of feature selection and optimization in machine-learning-based rs-fMRI image analysis.

Multi-modal data

Since data from a single source or modality (e.g., SPECT) can not fully capture all the key characteristics of the abnormalities of PD, multi-modal data (e.g., combined SPECT imaging and clinical data such as motor test score) may help improve PD detection. Multi-modal data refers to data from different sources (such as imaging modalities: SPECT, PET, MRI, etc.; and/or clinical tests: motor test, cognitive test, etc.) measured on different scales. Clinical data that are often used for PD diagnosis include motor data and non-motor data of clinical examinations such as motor disorder society-sponsored revision of the Unified Parkinson’s Disease Rating Scale I, II, and III (MDS-UPDRS I,II, and III), Montreal Cognitive Assessment (MoCA), Scales for Outcomes in Parkinson’s Disease—Autonomic (SCOPA-AUT), and University of Pennsylvania Smell Identification Test. In addition, to identify biomarkers of PD progression, the Parkinson Progression Marker Initiative (PPMI)⁵, a comprehensive international multi-center study collected multi-modal clinical data (motor data included MDS-UPDRS; non-motor data included cognitive testing such as MoCA, autonomic testing such as SCOPA-AUT total autonomic score, sleep disorder assessment, and olfactory assessment), imaging data (DaTSCAN and MRI), biospecimen data (blood, CSF, urine) and genetic data (DNA, RNA) of 400 PD patients and 200 health controls over 5 years, and made the data available online, which is a great data resource in the field⁵.

Studies have shown that the combination of imaging and clinical data have improved the detection of brain abnormalities in PD^17,21,36,37. For instance, Hirschauer et al.³⁷ found that PD detection rate of a single-modal feature extracted from SPECT (Ioflupane (¹²³I) striatal-binding ratios in the caudate and putamen) was 66–97%, but the combined multi-modal data features (SPECT + clinical data) yielded a detection rate of 98.6%. Glaab et al.²¹ also reported that combining imaging data features (PET data) with metabolomics data enhanced the discrimination power and diagnostic performance of the machine-learning systems in their study.

In addition, studies have shown that combined genetic and clinical data (such as rapid eye movement (REM) sleep behavior disorder, olfactory loss, and CSF measurements) improved PD diagnosis and early detection^17,36,37. Genetic data (e.g., whether sibling with PD with age of onset <50 years) and clinical data such as abnormal quantitative motor test results are biomarkers of early PD⁹. For a recent review on machine learning using genetic data in PD, see ref. ⁶⁵.

Machine-learning-based studies for the differential diagnosis of PD

Dopaminergic imaging

The difference in striatal uptake or striatal uptake ratios between PD and other parkinsonism has been identified by machine-learning-based dopaminergic imaging analysis. To differentiate between PD and vascular parkinsonism (VP), Huertas-Fernández et al.⁶⁶ developed diagnostic models to classify the [(123)I]FP-CIT uptake in the region of interest (ROI) striatum and the whole brain, and reported that discrimination accuracy between VP and PD reached 90.3 ± 5.8% (using logistic regression for ROI approach), and 90.4 ± 5.9% (using SVM for voxel-based whole-brain approach) (n = 164). Further, to differentiate between PD and atypical parkinsonian syndromes such as MSA or PSP, SVM has been applied to (¹⁸)F-DMFP PET image classification (with imaging features such as striatal uptake and uptake in the thalamus) and has yielded moderate (>70%) classification accuracy (n = 39)^67,68,69. A recent study has shown that using deep-learning method and saliency features (extracted from FDG-PET images) significantly improved the differentiation between PD, MSA, and PSM (n = 502)²⁴.

In addition, attempts have been made to differentiate between PD and essential tremor with machine-learning-based dopaminergic imaging analysis. Using striatal uptake ratios as input data for ANN, Hamilton et al.⁷⁰ distinguished PD from essential tremor (n = 18) with 100% diagnostic accuracy. Further, Palumbo et al.⁷¹ classified striatal uptake ratios with probabilistic neural network (PNN) (n = 261), and confirmed that PNN achieved valid classification accuracy to differentiate between PD and essential tremor (accuracy: 81.9 ± 8.1% for early PD; 78.9 ± 8.1% for advanced PD; 96.6 ± 2.6% for essential tremor).

Structural MRI

Atrophy in the midbrain, basal ganglia, and cerebellar peduncles helps distinguish PD from atypical parkinsonian disorders such as progressive supranuclear palsy (PSP) and multiple system atrophy (MSA). PD has subtle volume reduction in cerebral gray matter (GM) and the basal ganglia^50,72,73, while major brain atrophy of PSP is in the midbrain and superior cerebellar peduncles², and for MSA, major abnormalities are in pons, middle cerebellar peduncles, and cerebellum^2,74. Although challenging, attempts to use machine-learning approach have been made to differentiate between PD and other parkinsonian types based on these structural MRI imaging features.

To distinguish PD from atypical parkinsonian disorders (such as PSP or MSA), Duchesne et al.⁷⁵ developed an automated computer classification system that extracted brain tissue composition and deformation features in the hindbrain region from MRI images, applied SVM to feature classification and obtained a classification accuracy of 91% (PD vs. non-PD (PSP or MSA)) (n = 16 PD). Further, to differentiate PD from atypical parkinsonian disorders, Focke et al.⁷⁶ used GM and WM volumes obtained by voxel-based morphometry (VBM) and found that classification with SVM yielded up to 96.8% accuracy for differentiation between PD and PSP, and 71.9% between PD and MSA, but it failed to differentiate between PD and healthy controls (n = 21 PD). On the other hand, Salvatore et al.⁷⁷ extracted MRI imaging features by principal components analysis, generated voxel-based pattern distribution map of structural differences for identification of voxel-based morphological biomarkers of PD and PSP, and obtained >90% accuracy in differentiating between PD and PSP, or between PSP and healthy control (n = 28 PD). To further distinguish PD from PSP and MSA, Huppertz et al.⁷⁴ developed an automated MRI analysis method that computed atlas-based volumetric measures and classified the imaging features with SVM, reported the majority of classification accuracy of >80%, and found the largest atrophy in PD, PSP, and MSA (compared with controls) (n = 204 PD). To differentiate between PD and scans without evidence of dopaminergic deficit (SWEDD) or healthy controls, Singh et al.^26,27 extracted discretized voxel intensity changes from MRI using unsupervised self-organizing maps, classified the imaging features with SVM (n = 408 PD) and achieved accurate classification performances (>90%).

In addition, abnormalities in the substantia nigra in PD (due to dopaminergic neuronal loss) revealed by T2-weighted MRI, neuromelanin-sensitive MRI or iron-sensitive MRI at high field strength (such as 7 T) or by 3 T susceptibility weighted imaging (SWI) can be used in the diagnosis of PD¹⁴. For example, Haller et al.⁷⁸ examined PD patients with SWI and found that they had increased SWI in the bilateral thalamus and left substantia nigra, which had diagnostic value in differentiating between PD and other parkinsonism (classification accuracy for SVM: 86.92 ± 16.59%) (n = 20 PD).

Functional MRI (fMRI)

Machine-learning methods have also been applied to rs-fMRI analysis for differentiation of PD subtypes such as tremor-PD vs. non-tremor-PD⁷⁹, and postural instability and gait difficulty subtype (PIGD) vs. non-PIGD⁸⁰. Zhang et al.⁷⁹ used rs-fMRI measurement regional network efficiencies as imaging feature, and classified them with linear discriminant analysis, which yielded an accuracy of 92% in differentiating tremor-PD vs. non-tremor-PD⁷⁹. Further, to differentiate between PD with Levodopa-induced dyskinesias (LID) and PD without LID, Herz et al.⁸¹ extracted seed-based FC in cortico-striatal network from rs-fMRI images, classified them with SVM and achieved a differentiation accuracy of 95.8%.

Diffusion tenser imaging (DTI)

Reduced substantia nigra fractional anisotropy (FA) has been identified and regarded as a PD biomarker for over a decade, but recent meta-analyses have found that substantia nigra fractional anisotropy had a very large variation in results across studies⁸², had low pooled sensitivity and specificity, and was not a diagnostic biomarker of Parkinson’s disease⁸³.

However, since DTI reflects the disruption of microstructure (e.g., neuron myelin) integrity, DTI has shown promise in differentiating PD from atypical parkinsonism. Haller et al.⁸⁴ examined DTI images of PD patients and other Parkinsonism with tract-based spatial statistics (TBSS) analysis and found that compared with other parkinsonism, PD patients had an increased FA and a decreased MD in the right frontal white matter, and classification of DTI imaging features using SVM yielded an accuracy of 97.5 ± 7.54% (n = 17 PD vs. 23 other Parkinsonism). Further, Cherubini et al.⁸⁵ combined DTI and MRI voxel-based morphometry features to distinguish PD patients from PSP patients using SVM, which yielded an improved accuracy (100%) (n = 57 PD vs. 21 PSP). Combining with MRI voxel-based morphometry and rs-fMRI imaging features, DTI has also aided in the differentiation between PD subtypes (PIGD vs. non-PIGD)⁸⁰. In addition, using combined DTI and apparent transverse relaxation rate (R2*) imaging, Du et al.⁸⁶ found that MSA has a decreased FA and an increased apparent transverse relaxation rate (R2*) in the subthalamic nucleus, whereas PSP has an increased MD in the hippocampus. Classification of imaging features with Elastic-Net machine-learning technique yielded high differentiation accuracy (>90%) (n = 35 PD vs. 16 MSA vs. 19 PSP)⁸⁶.

Multi-modal data

Compared with single-modal data features, higher classification accuracy or detection rate using combined multi-modal data features has been achieved in differentiation between PD and atypical parkinsonian disorder. For instance, to differentiate between PD and PSP, Cherubini et al.⁸⁵ used combined MRI and DTI imaging features, and achieved a classification accuracy of 100%, which was higher than using either MRI or DTI features alone⁸⁵. In addition, to differentiate between PD and MSA or PSP, Du et al.⁸⁶ reported high classification accuracy (98–99%) using DTI and apparent transverse relaxation rate R2* imaging features, higher than DTI or R2* features alone.

Machine-learning-based studies for the early detection of PD

Dopaminergic imaging

Machine learning has been found useful in dopaminergic imaging analysis for early PD detection. As a pioneering study, Prashanth et al.¹⁶ investigated the value of different SVM methods in classifying SPECT images for early PD detection. Using striatal-binding ratios in the striatal regions from data obtained from the PPMI database, they found that SVM was valuable in early PD detection and SVM with non-linear kernel achieved higher detection rate (96.14 ± 1.89%) than SVM with linear kernel. Oliveira et al.⁸⁷, applied SVM and other classifiers to classification of the binding potential at each voxel in the striatum of SPECT images for early PD detection and reported that SVM achieved the highest detection rate (97.86%). Later, Prashanth et al.¹⁷ added non-motor clinical data features such as cerebrospinal fluid (CSF) measurements to further improve the detection rate of early PD. Prashanth et al.⁸⁸ further found that shape and surface-fitting-based features showed higher importance than striatal-binding ratios for early PD detection and feature classification with SVM yielded a classification accuracy of 97.29 ± 0.11% (n = 427)⁸⁸. In addition, Oliveira et al.⁸⁹ found that the length of the striatal region uptake (detection rate: 96.5%) performed better than uptake ratio-based based features for early PD detection (n = 443). These findings have demonstrated the value of machine-learning approach in dopaminergic image analysis for early detection of PD and the importance of imaging feature and classifier selection/optimization in machine-learning-based imaging analysis.

Structural MRI

Machine-learning methods have helped improve the diagnostic gain of MRI. Classification of combined MRI measurements (gray matter (GM) + white matter (WM) + cerebrospinal fluid (CSF) volumes) with SVM could detect early PD with an accuracy of 80% (n = 19)³⁰. A robust linear discriminant analysis (LDA) classifier with an optimal set of imaging features (GM and WM volumes of 98 ROIs from MRI) that applied to MRI images of early PD patients (99% in early stages) (n = 374) yielded a classification accuracy of 81.9%, which was higher than 69.1% yielded by SVM⁹⁰. In addition, Singh and Samavedham²⁶ demonstrated that the structural changes of early PD could be detected by using MRI features alone with an unsupervised self-organizing map approach (n = 518) and high classification accuracy (>95%) was achieved. This approach was later applied to a larger dataset (n = 1316) from PPMI (Parkinson’s Progression Markers Initiative) and ADNI (Alzheimer’s disease neuroimaging initiative), and yielded high classification performance (95.37 ± 0.02%) for distinguishing patients with PD (or Alzheimer’s disease) from healthy subjects²⁷, which confirmed the value of the machine-learning approach in aiding the diagnosis of neurodegenerative disorders such as PD and Alzheimer’s disease. In a recent study, Amoroso et al.²⁵ used an unsupervised approach for MRI image classification, and they extracted structure regional connectivity features from MRI images (n = 374), applied SVM to imaging feature classification, and obtained a classification accuracy of 88 ± 6% (using MRI features alone) or 93 ± 4% (using MRI features and clinical data).

Functional MRI (fMRI)

Studies have shown that rs-fMRI can detect PD at early stages. Wu et al.²⁹ used effective connectivity extracted from rs-fMRI to examine patients with early PD (n = 16) and reported that the substantia nigra pars compacta in early PD had decreased effective connectivity with regions such as the striatum, thalamus, supplementary motor area and cerebellum, which negatively correlated with the Unified Parkinson’s Disease Rating Scale (UPDRS) scores. However, the detection rate of early PD using rs-fMRI imaging features alone (classified by SVM) is not high (e.g., 74%³⁰) and needs to be improved. In addition, the findings of two rs-fMRI studies in asymptomatic LRRK2 mutation carriers suggested that functional connectivity disruptions precede the presence of PD motor symptoms^32,33. Further, using rs-fMRI, Rolinski et al.³¹ examined patients with rapid eye movement (REM) sleep behavior disorder (RBD) (n = 26), and PD patients (n = 10), and found that functional connectivity measures of basal ganglia network (BGN) dysfunction differentiated RBD and PD from HC with high sensitivity (96%) and specificity (74% for RBD, 78% for PD), suggesting that rs-fMRI may be a biomarker in identifying early functional connectivity changes in the BGN in subjects at high risk of PD and patients with PD. However, confirmative studies are warranted.

Multi-modal data

Combining multi-modal imaging and/or clinical data have improved early PD detection. Long et al.³⁰ found that combined multi-modal features improved early PD detection and multi-modal imaging (MRI and rs-fMRI) with combined multi-modal features (GM + WM + CSF + ReHo+ALFF + FC) yielded higher classification accuracy (87%) than single-modal features (MRI: 80%; rs-fMRI:74%). In addition, Oliveira et al.⁸⁹ examined SPECT images of early PD patients and found that several data features had high classification accuracy including the length of the striatal region (96.5%), the putaminal binding potential (95.4%) and the striatal-binding potential (93.9%), while the combined imaging features had the highest classification accuracy (97.9%). Furthermore, Prashanth et al.¹⁷ classified non-motor clinical data features, such as rapid eye movement (REM) sleep behavior disorder (RBD) and olfactory loss, and CSF measurements in addition to SPECT imaging markers (striatal-binding ratios) with classifiers such as SVM and random forests, and found that a combination of these data features with SVM classification performed the best in early PD detection (detection rate: 96.40 ± 1.08%) (n = 401).

Discussion

The studies reviewed in this paper have demonstrated that machine-learning automated data analysis, identified data patterns (e.g., in the distribution of the radiotracer uptake on SPECT images) and improved the accuracy of imaging quantification in the diagnosis of PD. A recent comprehensive review has confirmed the value of machine learning in assisting the diagnosis of PD, and has further pointed out the potential of these machine-learning applications to enhance clinical decision-making in PD diagnosis⁹¹. Particularly, the review by Mei et al.⁹¹ provided statistical analysis for the machine-learning studies in PD diagnosis, and reported that (1) on average, the classification accuracy of the machine-learning applications was ~94% for SPECT imaging, ~86% for PET imaging, and ~87% for MRI (including fMRI) imaging; (2) SVM and NN (neural network) were the most frequently used methods in the imaging studies, the usage for SVM (50%–70% used for SPECT or PET imaging, ~60% for MRI imaging) was higher than that of NN (22%–53% used for SPECT or PET imaging, ~23% for MRI imaging); and (3) SVM and NN had higher classification accuracy than other machine-learning methods in the imaging studies.

The value and role of machine learning in the diagnosis and early detection of PD

The value and potential of machine learning (ML) in PD diagnosis have been clearly demonstrated by comparative studies that compared ML methods with conventional techniques such as semi-quantitative methods or visual analysis in the diagnosis of PD. For example, it has been shown that computer-aided diagnosis (CAD) system based on machine-learning methods has outperformed semi-quantitative methods of SPECT image analysis^15,19 and improved PD diagnostic accuracy of radiologists²⁰. Further, Choi et al.¹⁸ demonstrated the value of recently developed deep-learning techniques (convolutional neural networks) in SPECT image analysis, and obtained high classification performance that is comparable to experts’ visual analysis and semi-quantitative analysis¹⁸. In addition, ML methods have been shown useful in differential diagnosis^{65,66,67,68,69,85,86} and early PD detection^17,26,30,31.

Machine-learning applications can automatically analyze and classify imaging and clinical data in PD, but machine-learning applications are still in infancy and subject to errors, pitfalls and biases. For example, clinician-dependent class/group labeling of the training data in the machine-learning models may be prone to errors because of the high error rate in the clinical diagnosis of PD. As another example, newly developed deep-learning models may have new challenges such as overfitting, low generalizability and data insufficiency.

The role of machine-learning applications is not to substitute clinicians, but to assist them in clinical decision-making, to relieve them from tedious data preprocessing, to save them from time-consuming manual raw data inspection or processing (e.g., draw regions of interest and perform measurements on images), and to help them focus on important clinical decision-making questions in order to reduce medical errors, and improve the clinical diagnosis of PD. On the other hand, machine-learning applications are far from perfect and still need to be improved. Being aware of the potential errors and problems in machine-learning applications in radiology, Geis et al.⁹² pointed out that clinicians who use the applications are ultimately responsible for clinical decision-making and patient care.

Current limitations and challenges in machine-learning applications

First, prone-to-error labeling of classes/groups in supervised learning

Due to the high error rate of a clinical diagnosis of PD, clinician-dependent labeling for the classes or groups (e.g., PD patients or healthy subjects) of the training data (that are used in supervised machine-learning applications) may be prone to error. To overcome this problem, training data labeling in supervised learning need to be confirmed by pathological (biopsy or post-mortem) data. On the other hand, when the clinical diagnosis of a PD dataset is uncertain and pathological data is not available, unsupervised-learning approaches may be considered. Without the need for training data, unsupervised learning seeks to identify hidden data patterns, which may overcome the problem of mislabeling diagnostic categories in the training data in supervised learning. However, there are some technical challenges in applying unsupervised-learning methods to imaging analysis in PD diagnosis, e.g., unsupervised-learning methods are not good at accurately extracting imaging features⁴³. Attempts have been made to overcome such difficulties, e.g., by using semi-supervised-learning clustering method that combines a small amount of labeled data with a large amount of unlabeled data in the training dataset⁴³. In addition, unsupervised learning has been applied to MRI feature selection in early PD detection^25,26,27. To detect early PD using supervised-learning methods on structural MRI features is challenging, but Singh and Samavedham have demonstrated that the structural changes of early PD could be detected by integrating Kohonen unsupervised self-organizing map and least-squares support vector machine (n = 518)²⁶. This approach was later applied to a larger dataset (n = 1316)²⁷, which confirmed the value and robustness of the method.

Second, machine-learning “black-box”

Since machine-learning applications identify data patterns (e.g., abnormal structural or functional changes in imaging data) that could be invisible (or unrecognizable) to humans, the mechanisms and results of the machine-learning applications (especially for neural network-based models) may be difficult to interpret due to lack of direct “evidence” supporting classification results. This could be against the principle of evidence-based medicine and results in reluctance to accept machine-learning applications in clinical practice. For example, new deep-learning (or deep-neural network) models often have millions of parameters which make them like a “black-box” incomprehensible to clinicians, and make it hard to interpret how the classification results have been obtained. However, just like a microscope allows people to see at cellular level (which is invisible to human eyes), machine-learning methods identify abstract imaging features (that reveal brain signal differences between groups/classes) such as distributions of a radiotracer uptake, image voxel intensity changes, and texture features, and amplify these signal differences between classes/groups at a resolution that the signal differences between classes/groups can be detected by “machines” or machine-learning models to best separate the data into classes or groups. Clinicians do not have to understand the details of the inner workings (e.g., the parameters) of machine-learning methods in order to use these application tools for PD diagnosis, but it is beneficial to have some basic knowledge of the mechanism of a machine-learning method and statistical pitfalls in order to avoid errors. Nevertheless, although there is an abstraction in the mechanism of machine-learning methods, machine-learning applications shall follow the principle of evidence-based medicine, use best evidence in the field of PD diagnosis (e.g., to guide feature selection or check classification results), and provide as much “evidence” as possible to support clinical decision-making. For instance, in addition to imaging feature classification, Singh et al.²⁷ identified disease-specific biomarkers (i.e., significant brain regions affected by PD) with a machine-learning method, and these biomarkers, serving as “evidence”, could be used to decipher disease progression. Further, efforts have been made to interpret classification results of deep-learning models. For example, Magesh et al.⁹³ reported a newly developed deep-learning model CNN based on transfer learning to analyze and classify SPECT DaTSCAN images that could distinguish PD patients from healthy controls with an accuracy of 95.2%, and used Local Interpretable Model-Agnostic Explainer methods to interpret classification results.

Third, overfitting problem in machine learning

Overfitting problem often occurs in machine-learning applications, which refers to a machine-learning method or model performs very well on a training dataset, but not on a test dataset or other datasets⁹⁴. This might be because the machine-learning method is over-trained by the training dataset and the noise in the training data is also modeled which makes the model ungeneralizable to other datasets. A recent rs-fMRI study showed that PD-related functional connectivity changes were not reproducible across the 3 PD samples used in the study⁶². The classification performance (PD vs. HC) was low (50–60%) even in the datasets from a single data sample and the lack of generalizability in these data samples may be mainly due to high PD heterogeneity⁶². In addition, there are new challenges of overfitting in machine-learning applications using deep-learning models⁹⁵. To overcome the overfitting problem in machine-learning applications for PD diagnosis, it is necessary to improve data quality, reduce data heterogeneity, use large data samples and validate machine-learning models with proper validation methods (such as N-fold cross-validation) in order to make machine-learning models generalizable. Further, to avoid overfitting in deep-learning models, some methods such as implicit regulation, proper initiation, adjusting learning rates and reducing model complexity may help the models generalize well⁹⁵.

Future directions

First, improve and validate the machine-learning applications

Despite the progress made in machine-learning applications in the diagnosis and early detection of PD, there is still much room for improvement. 1) more research is needed to address the problem of prone-to-error class labeling in supervised learning. 2) it is necessary to optimize multi-modal data features for an optimal feature set, and choose and optimize machine-learning classifiers for an optimal classifier to improve classification accuracy. This is because: (1) several studies have shown that combined multi-modal data features (such as SPECT + clinical data) had higher detection rate than single-modal features^30,37; (2) comparative studies using different classifiers have demonstrated the differences in classification accuracy between different classifiers^36,37,89. 3) thorough validation is needed before the machine-learning applications can be used in clinical settings. In addition, newly developed deep-learning techniques have shown promising results^18,93,96, but also face new challenges and obstacles such as overfitting, low generalizability and data insufficiency. For a recent review of deep-learning applications for the diagnosis of PD, see ref. ⁹⁷. Research in explainable machine-learning models is needed to address the “black-box” problem in the neural network models. To overcome the new challenges in deep-learning models, further research is needed to avoid overfitting in deep-learning models, improve these new deep-neural network applications and make them more accurate, reliable, generalizable and explainable.

Second, improve modeling longitudinal multi-modal data

Since Parkinson’s disease is a progressive disorder, it is necessary to model multi-modal data over time in order to identify biomarkers for PD progression. Some efforts have been made to tackle this difficult problem in recent years^{98,99,100,101,102}. Due to the complexity of longitudinal multi-modal (imaging and clinical) data, methods such as embedding learning and sparse regression have been proposed, which have obtained promising results¹⁰². Further research is needed to improve modeling these longitudinal multi-modal data so that reliable biomarkers can be identified to enhance the diagnosis and management of PD.

Third, integrate ML-based applications into clinical decision support system to aid in PD diagnosis

It has been demonstrated that the performance of machine-learning-based computer-aided diagnostic (CAD) system generally exceeded that of semi-quantitative analysis on SPECT imaging in distinguishing PD patients from healthy controls^12,16 and improved PD diagnostic accuracy of radiologists¹⁷. More comparative and confirmative studies are needed to further reveal the advantages and weaknesses of these machine-learning applications. Since semi-quantitative imaging analysis software is commercially available at clinics, such software may be upgraded to incorporate mature machine-learning algorisms to further assist clinicians in the diagnosis of PD. However, a framework that facilitates the development, deployment, validation and regulation of such machine-learning-based clinical applications is needed. For example, benchmark data and metrics (e.g., the PPMI database) need to be established to test the optimized and standardized applications. Further, before these ML-based clinical applications are deployed in clinical settings, it is necessary to run clinical trials to assess the diagnostic gain and clinical benefits of such applications over conventional semi-quantitative analysis (or visual analysis). Consequently, rules and regulations are needed to facilitate this process in order to make such ML-based systems available to clinics.

Conclusions

In summary, encouraging progress has been made in applying machine-learning techniques to the diagnosis and early detection of PD. Although machine-learning applications in PD diagnosis are still in their infancy, machine-learning methods have automated imaging data analysis, outperformed conventional semi-quantitative analysis and performed comparably well as experts’ visual inspection in detecting PD-associated dopaminergic degeneration on SPECT imaging, reduced interpretation variability of imaging, improved PD diagnostic accuracy of radiologists and aided in differential diagnosis and early PD detection. Using combined multi-modal imaging and clinical data (in these applications) may further enhance the diagnosis and early detection of PD. To integrate these machine-learning applications into clinical systems, further validation and optimization are needed to make them accurate and reliable. Despite the challenges in translating machine-learning applications into clinical practice, machine-learning techniques are promising to assist clinicians in improving differential diagnosis of parkinsonism and early diagnosis of PD, which may reduce the error rate of PD diagnosis, and help detect PD at pre-motor stage so that early treatments (e.g., neuroprotective treatment) may be applied to slow down PD progression, prevent severe motor symptoms from emerging, and relieve patients from suffering.

Data availability

Data sharing is not applicable to this article because this article is a literature review and no new data were created or analyzed in this study.

References

Rizzo, G. et al. Accuracy of clinical diagnosis of Parkinson disease: a systematic review and meta-analysis. Neurology 86, 566–576 (2016).
Article PubMed Google Scholar
Hughes, A. J., Daniel, S. E., Ben-Shlomo, Y. & Lees, A. J. The accuracy of diagnosis of parkinsonian syndromes in a specialist movement disorder service. Brain 125, 861–870 (2002).
Article PubMed Google Scholar
Adler, C. H. et al. Low clinical diagnostic accuracy of early vs. advance Parkinson disease (Clinicopathologic study). Neurology 83, 406–412 (2014).
Article PubMed PubMed Central Google Scholar
Bega, D. et al. Clinical utility of DaTscan in patients with suspected Parkinsonia syndrome: a systematic review and meta-analysis. NPJ Parkinson’s Dis. 7, 43 (2021).
Article Google Scholar
Marek et al. Parkinson Progression Marker Initiative. The Parkinson Progression Marker Initiative (PPMI). Prog. Neurobiol. 95, 629–635 (2011).
Article Google Scholar
Politis, M., Pagano, G. & Niccolini, F. Imaging in Parkinson’s Disease. Int Rev. Neurobiol. 132, 233–274 (2017).
Article CAS PubMed Google Scholar
Fearnley, J. M. & Lees, A. J. Ageing and Parkinson’s disease: substantia nigra regional selectivity. Brain 114, 2283–2301 (1991).
Article PubMed Google Scholar
Fuente-Fernandez, R. et al. Age-specific progression of nigrostriatal dysfunction in Parkinson’s disease. Ann. Neurol. 69, 803–810 (2011).
Article PubMed Google Scholar
Postuma, R. B. & Berg, D. Advances in markers of prodromal Parkinson disease. Nat. Rev. Neurol. 12, 622–634 (2016).
Article CAS PubMed Google Scholar
Beyer, M. K., Janvin, C. C., Larsen, J. P. & Aarsland, D. A magnetic resonance imaging study of patients with Parkinson’s disease with mild cognitive impairment and dementia using voxel-based morphometry. J. Neurol. Neurosurg. Psychiatry 78, 254–259 (2007).
Article PubMed Google Scholar
Tessa, C. et al. Progression of brain atrophy in the early stages of Parkinson’s disease: a longitudinal tensor-based morphometry study in de novo patients without cognitive impairment. Hum. Brain Mapp. 35, 3932–3944 (2014).
Article PubMed PubMed Central Google Scholar
Rietdijk, C. D., Perez-Pardo, P., Garssen, J., van Wezel, R. J. A. & Kraneveld, A. D. Exploring Braak’s Hypothesis of Parkinson’s Disease. Front. Neurol. 8, 37 (2017).
Article PubMed PubMed Central Google Scholar
Nobili, F. et al. Clinical utility and research frontiers of neuroimaging in movement disorders. Q J. Nucl. Med. Mol. Imaging 61, 372–385 (2017).
Article PubMed Google Scholar
Barber, T. R., Klein, J. C., Mackay, C. E. & Hua, M. T. M. Neuroimaging in pre-motor Parkinson’s disease. NeuroImage Clin. 15, 215–227 (2017).
Article PubMed PubMed Central Google Scholar
Acton, P. D. & Newberg, A. Artificial neural network classifier for the diagnosis of Parkinson’s disease using [99mTc]TRODAT-1 and SPECT. Phys. Med Biol. 51, 3057–3066 (2006).
Article PubMed Google Scholar
Prashanth, R., Dutta Roy, S., Mandal, P. K. & Ghosh, S. Automatic classification and prediction models for early Parkinson’s disease diagnosis from SPECT imaging. Expert Syst. Appl. 41, 3333–3342 (2014).
Article Google Scholar
Prashanth, R., Dutta Roy, S., Mandal, P. K. & Ghosh, S. High-Accuracy detection of early Parkinson’s disease through multimodal features and machine learning. Int. J. Med. Inform. 90, 13–21 (2016).
Article CAS PubMed Google Scholar
Choi, H., Ha, S., Im, H. J., Paek, S. H. & Lee, D. S. Refining diagnosis of Parkinson’s disease with deep learning-based interpretation of dopamine transporter imaging. Neuroimage Clin. 16, 586–594 (2017).
Article PubMed PubMed Central Google Scholar
Taylor, J. C. & Fenner, J. W. Comparison of machine learning and semiquantification algorithms for (I123)FP-CIT classification: the beginning of the end for semi-quantification? EJNMMI Phys. 4, 29 (2017).
Article PubMed PubMed Central Google Scholar
Taylor, J. C. et al. Computer-aided diagnosis for (123I)FP-CIT imaging: impact on clinical reporting. EJNMMI Res. 8, 36 (2018).
Article PubMed PubMed Central Google Scholar
Glaab, E. et al. Integrative analysis of blood metabolomics and PET brain neuroimaging data for Parkinson’s disease. Neurobiol. Dis. 124, 555–562 (2019).
Article CAS PubMed Google Scholar
Shen, T. et al. Use of overlapping group LASSO sparse deep belief network to discriminate Parkinson’s disease and normal control. Front. Neurosci. 13, 396 (2019).
Article PubMed PubMed Central Google Scholar
Wu, Y. et al. Use of radiomic features and support vector machine to distinguish Parkinson’s disease cases from normal controls. Ann. Transl. Med. 7, 773 (2019).
Article PubMed PubMed Central Google Scholar
Zhao, Y. et al. A 3D deep residual convolutional neural network for differential diagnosis of Parkinsonian syndromes on 18F-FDG PET images. 2019 IEEE 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). 3531–3534 (Berlin, Germany, 2019).
Amoroso, N., La Rocca, M., Monaco, A., Bellotti, R. & Tangaro, S. Complex networks reveal early MRI markers of Parkinson’s disease. Med. Image Anal. 48, 12–24 (2018).
Article PubMed Google Scholar
Singh, G. & Samavedham, L. Unsupervised learning based feature extraction for differential diagnosis of neurodegenerative diseases: A case study on early-stage diagnosis of Parkinson disease. J. Neurosci. Methods 256, 30–40 (2015).
Article PubMed Google Scholar
Singh, G., Samavedham, L. & Lim, E. C. Alzheimer’s disease neuroimaging initiative; Parkinson progression marker initiative. Determination of imaging biomarkers to decipher disease trajectories and differential diagnosis of neurodegenerative diseases (DIsease TreND). J. Neurosci. Methods 305, 105–116 (2018).
Article PubMed Google Scholar
Rahayel, S. et al. Patterns of cortical thinning in idiopathic rapid eye movement sleep behavior disorder. Mov. Disord. 30, 680–687 (2015).
Article PubMed Google Scholar
Wu, T. et al. Basal ganglia circuits changes in Parkinson’s disease patients. Neurosci. Lett. 524, 55e59 (2012).
Article Google Scholar
Long, D. et al. Automatic classification of early Parkinson’s disease with multi-modal MR imaging. PLoS ONE 7, e47714 (2012).
Article CAS PubMed PubMed Central Google Scholar
Rolinski, M., Szewczyk-Krolikowski, K., Tomlinson, P. R. & Nithi, K. REM sleep behaviour disorder is associated with worse quality of life and other non-motor features in early Parkinson’s disease. J. Neurol. Neurosurg. Psychiatry 85, 560–566 (2014).
Article PubMed Google Scholar
Helmich, R. C. et al. Reorganization of corticostriatal circuits in healthy G2019S LRRK2 carriers. Neurology 84, 399–406 (2015).
Article CAS PubMed PubMed Central Google Scholar
Vilas, D. et al. Nigral and striatal connectivity alterations in asymptomatic LRRK2 mutation carriers: a magnetic resonance imaging study. Mov. Disord. https://doi.org/10.1002/mds.26799 (2016)
Unger, M. M. et al. Diffusion tensor imaging in idiopathic REM sleep behavior disorder reveals microstructural changes in the brainstem, substantia nigra, olfactory region, and other brain regions. Sleep 33, 767–773 (2010).
Article PubMed PubMed Central Google Scholar
Scherfler, C. et al. White and gray matter abnormalities in idiopathic rapid eye movement sleep behavior disorder: a diffusion-tensor imaging and voxel-based morphometry study. Ann. Neurol. 69, 400–407 (2010).
Article PubMed Google Scholar
Dinov, I. D. et al. Predictive big data analytics: a study of parkinson’s disease using large, complex, heterogeneous, incongruent, multi-source and incomplete observations. PLoS ONE 11, e0157077 (2016).
Article PubMed PubMed Central Google Scholar
Hirschauer, T. J., Adeli, H. & Buford, J. A. Computer-aided diagnosis of Parkinson’s disease using enhanced probabilistic neural network. J. Med. Syst. 39, 179 (2015).
Article PubMed Google Scholar
McLachlan, G. J. Discriminant analysis and statistical pattern recognition (Wiley Interscience, 2004).
Vapnik, V. N. The nature of statistical learning theory. (Springer, 1995).
Yegnanarayana, B. Artificial neural networks. (PHI Learning Pvt. Ltd., 2009).
Breiman, L. Random forests. Mach Learn. 45, 5 32 (2001).
Manton, K. G., Lowrimore, G., Yashin, A., Kovtun, M. Cluster analysis: overview. (Wiley Stats Ref: Statistics Reference Online, 2014).
Zhang, X., Zhai, D., Yang, Y., Zhang, Y. & Wang, C. A novel semi-supervised multi-view clustering framework for screening Parkinson’s disease. Maths Biosci. Eng. 17, 3395–3411 (2020).
Article Google Scholar
Illan, I. A. et al. Automatic assistance to Parkinson’s disease diagnosis in DaTSCAN SPECT imaging. Med. Phys. 39, 5971–5980 (2012).
Article CAS PubMed Google Scholar
Segovia, F. et al. Improved parkinsonism diagnosis using a partial least squares-based approach. Med. Phys. 39, 4395–4403 (2012).
Article CAS PubMed Google Scholar
Palumbo, B. et al. Diagnostic accuracy of Parkinson disease by support vector machine (SVM) analysis of 123I-FP-CIT brain SPECT data: implications of putaminal findings and age. Med. (Baltim.) 93, e228 (2014).
Article Google Scholar
Martinez-Murcia, F., G´orriz, J., Ram´ırez, J., Moreno-Caballero, M. & G´omez-R´ıo, M. Parametrization of textural patterns in 123I-ioflupane imaging for the automatic detection of Parkinsonism. Med. Phys. 41, 012502 (2014).
Article CAS PubMed Google Scholar
Wang, Z. et al. ADNI and PPMI. Multi-modal classification of neurodegenerative disease by progressive graph-based transductive learning. Med. Image Anal. 39, 218–230 (2017).
Article PubMed PubMed Central Google Scholar
Zhang, Y. C. & Kagen, A. C. Machine learning interface for medical image analysis. J. Digit Imaging 30, 615–621 (2017).
Article PubMed Google Scholar
Nemmi, F., Sabatini, U., Rascol, O. & Peran, P. Parkinson’s disease and local atrophy in subcortical nuclei: insight from shape analysis. Neurobiol. Aging 36, 424–433 (2015).
Article PubMed Google Scholar
Peng, B. et al. A multilevel-ROI-features-based machine learning method for detection of morphometric biomarkers in Parkinson’s disease. Neurosci. Lett. 651, 88–94 (2017).
Article CAS PubMed Google Scholar
Zeng, L. L. et al. Differentiating patients with Parkinson’s disease from normal controls using gray matter in the cerebellum. Cerebellum 16, 151–157 (2017).
Article PubMed Google Scholar
Hacker, C. D., Perlmutter, J. S., Criswell, S. R., Ances, B. M. & Snyder, A. Z. Resting state functional connectivity of the striatum in Parkinson’s disease. Brain 135, 3699–3711 (2012).
Article PubMed PubMed Central Google Scholar
Szewczyk-Krolikowski, K. et al. Functional connectivity in the basal ganglia network differentiates PD patients from controls. Neurology 83, 208–214 (2014).
Article PubMed PubMed Central Google Scholar
Skidmore, F. M. et al. Reliability analysis of the resting state can sensitively and specifically identify the presence of Parkinson disease. Neuroimage 75, 249–261 (2013).
Article CAS PubMed Google Scholar
Tang, Y. et al. Identifying the presence of Parkinson’s disease using low-frequency fluctuations in BOLD signals. Neurosci. Lett. 645, 1–6 (2017).
Article CAS PubMed Google Scholar
Wu, T. et al. Parkinson’s disease-related spatial covariance pattern identified with resting-state functional MRI. J. Cereb. Blood Flow. Metab. 1, 1–7 (2015).
Google Scholar
Griffanti, L., Rolinski, M., Szewczyk-Krolikowski, K., Menke, R. A. & Filippini, N. Challenges in the reproducibility of clinical studies with resting state fMRI: An example in early Parkinson’s disease. Neuroimage 124, 704–713 (2016).
Article PubMed Google Scholar
Helmich, R. C. et al. Spatial remapping of cortico-striatal connectivity in Parkinson’s disease. Cereb. Cortex. 20, 1175–1186 (2010).
Article PubMed Google Scholar
Luo, C. et al. Reduced functional connectivity in early-stage drug-naive Parkinson’s disease: a resting-state fMRI study. Neurobiol. Aging 35, 431–441 (2014).
Article PubMed Google Scholar
Wu, T. et al. Functional connectivity of cortical motor areas in the resting state in Parkinson’s disease. Hum. Brain Mapp. 32, 1443–1457 (2011).
Article PubMed Google Scholar
Badea, L., Onu, M., Wu, T., Roceanu, A. & Bajenaru, O. Exploring the reproducibility of functional connectivity alterations in Parkinson’s disease. PLoS ONE 12, e0188196 (2017).
Article PubMed PubMed Central Google Scholar
Pläschke, R. N. et al. On the integrity of functional brain networks in schizophrenia, Parkinson’s disease, and advanced age: Evidence from connectivity-based single-subject classification. Hum. Brain Mapp. 38, 5845–5858 (2017).
Article PubMed PubMed Central Google Scholar
Chen, Y. et al. Discriminative analysis of Parkinson’s disease based on whole brain functional connectivity. PLoS ONE 10, 1–16 (2015).
Google Scholar
Su, C., Tong, J. & Wang, F. Mining genetic and transcriptomic data using machine learning approaches in Parkinson’s disease. NPJ Parkinson’s Dis. 6, 1 (2020).
Google Scholar
Huertas-Fernández, I. et al. Machine learning models for the differential diagnosis of vascular parkinsonism and Parkinson’s disease using [(123)I]FP-CIT SPECT. Eur. J. Nucl. Med Mol. Imaging 42, 112–119 (2015).
Article PubMed Google Scholar
Segovia, F. et al. Distinguishing Parkinson’s disease from atypical parkinsonian syndromes using PET data and a computer system based on support vector machines and Bayesian networks. Front Comput Neurosci. 9, 137 (2015).
Article PubMed PubMed Central Google Scholar
Segovia, F. et al. Multivariate analysis of 18F-DMFP PET data to assist the diagnosis of Parkinsonism. Front Neuroinform. 11, 23 (2017a).
Article PubMed PubMed Central Google Scholar
Segovia, F., Górriz, J. M., Ramírez, J., Martínez-Murcia, F. J. & Salas-Gonzalez, D. Preprocessing of 18F-DMFP-PET data based on hidden Markov random fields and the Gaussian distribution. Front Aging Neurosci. 9, 326 (2017b).
Article PubMed PubMed Central Google Scholar
Hamilton, D., List, A., Butler, T., Hogg, S. & Cawley, M. Discrimination between parkinsonian syndrome and essential tremor using artificial neural network classification of quantified DaTSCAN data. Nucl. Med. Commun. 27, 939–944 (2006).
Article PubMed Google Scholar
Palumbo, B. et al. Comparison of two neural network classifiers in the differential diagnosis of essential tremor and Parkinson’s disease by 123I-FP-CIT brain SPECT. Eur. J. Nucl. Med. Mol. Imaging 37, 2146–2153 (2010).
Article PubMed Google Scholar
Sterling, N. W. et al. Striatal shape in Parkinson’s disease. Neurobiol. Aging 34, 2510–2516 (2013).
Article PubMed PubMed Central Google Scholar
Menke, R. A. et al. Comprehensive morphometry of subcortical grey matter structures in early-stage Parkinson’s disease. Hum. Brain Mapp. 35, 1681–1690 (2014).
Article PubMed Google Scholar
Huppertz, H. J. et al. Differentiation of neurodegenerative parkinsonian syndromes by volumetric magnetic resonance imaging analysis and support vector machine classification. Mov. Disord. 31, 1506–1517 (2016).
Article PubMed Google Scholar
Duchesne, S., Rolland, Y. & Verin, M. Automated computer differential classification in Parkinsonian syndromes via pattern analysis on MRI. Acad. Radiol. 16, 61–70 (2009).
Article PubMed Google Scholar
Focke, N. K. et al. Individual voxel-based subtype prediction can differentiate progressive supranuclear palsyfrom idiopathic Parkinson syndrome and healthy controls. Hum. Brain Mapp. 32, 1905–1915 (2011).
Article PubMed PubMed Central Google Scholar
Salvatore, C. et al. Machine learning on brain MRI data for differential diagnosis of Parkinson’s disease and progressive supranuclear palsy. J. Neurosci. Methods 222, 230–237 (2014).
Article CAS PubMed Google Scholar
Haller, S. et al. Differentiation between Parkinson disease and other forms of Parkinsonism using support vector machine analysis of susceptibility-weighted imaging (SWI): initial results. Eur. Radiol. 23, 12–19 (2013).
Article CAS PubMed Google Scholar
Zhang, D., Liu, X., Chen, J. & Liu, B. Distinguishing patients with Parkinson’s disease subtypes from normal controls based on functional network regional efficiencies. PLoS ONE 9, e115131 (2014).
Article PubMed PubMed Central Google Scholar
Gu, Q. et al. Automatic classification on Multi-Modal MRI data for diagnosis of the postural instability and gait difficulty subtype of Parkinson’s disease. J. Parkinsons Dis. 6, 545–556 (2016).
Article PubMed Google Scholar
Herz, D. M. et al. Resting-state connectivity predicts levodopa induced dyskinesias in Parkinson’s disease. Mov. Disord. 31, 521–529 (2016).
Article CAS PubMed PubMed Central Google Scholar
Schwarz, S. T. et al. Diffusion tensor imaging of nigral degeneration in Parkinson’s disease: a region-of-interest and voxel-based study at 3 T and systematic review with meta-analysis. Neuroimage Clin. 3, 481–488 (2013).
Article PubMed PubMed Central Google Scholar
Hirata, F. C. C. et al. Substantia nigra fractional anisotropy is not a diagnostic biomarker of Parkinson’s disease: a diagnostic performance study and meta-analysis. Eur. Radiol. 27, 2640–2648 (2017).
Article PubMed Google Scholar
Haller, S. et al. Individual detection of patients with Parkinson disease using support vector machine anal-ysis of diffusion tensor imaging data: initial results. AJNR Am. J. Neuroradiol. 33, 2123–2128 (2012).
Article CAS PubMed PubMed Central Google Scholar
Cherubini, A. et al. Magnetic resonance support vector machine discriminates between Parkinson disease and progressive supranuclear palsy. Mov. Disord. 29, 266–269 (2014).
Article PubMed Google Scholar
Du, G. et al. Combined diffusion tensor imaging and apparent transverse relaxation rate differentiate Parkinson Disease and Atypical Parkinsonism. AJNR Am. J. Neuroradiol. 38, 966–972 (2017).
Article CAS PubMed PubMed Central Google Scholar
Oliveira, F. P. & Castelo-Branco, M. Computer-aided diagnosis of Parkinson’s disease based on [(123)I]FP-CIT SPECT binding potential images, using the voxels-as-features approach and support vector machines. J. Neural Eng. 12, 026008 (2015).
Article PubMed Google Scholar
Prashanth, R., Roy, S. D., Mandal, P. K. & Ghosh, S. High-accuracy classification of parkinson’s disease through shape analysis and surface fitting in 123I-Ioflupane SPECT imaging. IEEE J. Biomed. Health Inform. 21, 794–802 (2017).
Article CAS PubMed Google Scholar
Oliveira, F. P. M., Faria, D. B., Costa, D. C., Castelo-Branco, M. & Tavares, J. M. R. S. Extraction, selection and comparison of features for an effective automated computer-aided diagnosis of Parkinson’s disease based on [123I]FP-CIT SPECT images. Eur. J. Nucl. Med Mol. Imaging 45, 1052–1062 (2018).
Article PubMed Google Scholar
Adeli, E. et al. Joint feature-sample selection and robust diagnosis of Parkinson’s disease from MRI data. Neuroimage 141, 206–219 (2016).
Article PubMed Google Scholar
Mei, J., Desrosiers, C. & Frasnelli, J. Machine learning for the diagnosis of parkinson’s disease: a review of literature. Front. Aging Neurosci. 13, 184 (2021).
Article Google Scholar
Geis, J. R. et al. Ethics of artificial intelligence in radiology: summary of the Joint European and North American Multisociety Statement. Radiology 293, 436–440 (2019).
Article PubMed Google Scholar
Magesh, P. R., Myloth, R. D. & Tom, R. J. An explainable machine learning model for early detection of Parkinson’s disease using LIME on DaTSCAN imagery. Computers Biol. Med. 126, 104041 (2020).
Article CAS Google Scholar
Lee, E. J., Kim, Y. H., Kim, N. & Kang, D. W. Deep into the brain: artificial intelligence in stroke imaging. J. Stroke 19, 277–285 (2017).
Article PubMed PubMed Central Google Scholar
Fang, C., Ding, J., Huang, Q., Tong, T. & Sun, Y. The overfitting iceberg. https://blog.ml.cmu.edu/2020/08/31/4-overfitting/ (2021)
Kiryu, S. et al. Deep learning to differentiate parkinsonian disorders separately using single midsagittal MR imaging: a proof of concept study. Eur. Radiol. 29, 6891–6899 (2019).
Article PubMed Google Scholar
Alzubaidi, M. S. et al. The role of neural network for the detection of Parkinson’s disease: a scoping review. Healthcare 9, 740–760 (2021).
Article PubMed PubMed Central Google Scholar
Li, S., Lei, H., Zhou, F., Gardezi, J. & Lei, B. Longitudinal and Multimodal Data Learning for Parkinson’s Disease Diagnosis via Stacked Sparse Auto-encoder. 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), 384–387 (Venice, Italy, 2019).
Lei, H. et al. Joint detection and clinical score prediction in Parkinson’s disease via multi-modal sparse learning. Expert Syst. Appl. 80, 284–296 (2017).
Article Google Scholar
Huang, Z. et al. Longitudinal and multimodal data learning for Parkinson’s disease diagnosis. 2018. 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI, Washington DC, USA, 2018).
Lei, H., Huang, Z., Elazab, A., Li, H. & Lei, B. Longitudinal and Multi-modal Data Learning via Joint Embedding and Sparse Regression for Parkinson’s Disease Diagnosis. Machine Learning in Medical Imaging (published by Springer International Publishing). pp 310–318 (2018).
Huang, Z. et al. Parkinson’s disease classification and clinical score regression via united embedding and sparse learning from longitudinal data. IEEE Trans Neural Netw Learn Syst. pp(99):1–15 (2021).

Download references

Acknowledgements

The author would like to thank Drs. Steven Petersen and Bradley Schlaggar for their support for machine-learning research in Parkinson’s Disease in the Department of Neurology at the Washington University in St. Louis.

Author information

Authors and Affiliations

Department of Neurology, School of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
Jing Zhang

Authors

Jing Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.Z. contributed to the research design, data collection and analysis, and drafting of the manuscript.

Corresponding author

Correspondence to Jing Zhang.

Ethics declarations

Competing interests

The author declares no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, J. Mining imaging and clinical data with machine learning approaches for the diagnosis and early detection of Parkinson’s disease. npj Parkinsons Dis. 8, 13 (2022). https://doi.org/10.1038/s41531-021-00266-8

Download citation

Received: 31 July 2021
Accepted: 10 December 2021
Published: 21 January 2022
DOI: https://doi.org/10.1038/s41531-021-00266-8

This article is cited by

Exploiting macro- and micro-structural brain changes for improved Parkinson’s disease classification from MRI data
- Milton Camacho
- Matthias Wilms
- Nils D. Forkert
npj Parkinson's Disease (2024)
A new hybrid approach based on AOA, CNN and feature fusion that can automatically diagnose Parkinson's disease from sound signals: PDD-AOA-CNN
- Muhammed Yildirim
- Soner Kiziloluk
- Eser Sert
Signal, Image and Video Processing (2024)
Comparative analysis of machine learning techniques for Parkinson’s detection: A review
- Ketna Khanna
- Sapna Gambhir
- Mohit Gambhir
Multimedia Tools and Applications (2023)
The emerging role of furin in neurodegenerative and neuropsychiatric diseases
- Yi Zhang
- Xiaoqin Gao
- Guofen Gao
Translational Neurodegeneration (2022)

Subjects

Abstract

Similar content being viewed by others

A replication study, systematic review and meta-analysis of automated image-based diagnosis in parkinsonism

Machine learning based risk prediction for Parkinson's disease with nationwide health screening data

Identification and prediction of Parkinson’s disease subtypes and progression using machine learning in two cohorts

Introduction

An overview of machine-learning-based studies for the diagnosis of PD

Machine-learning-based studies for the discrimination between PD and HC

Dopaminergic imaging

Structural magnetic resonance imaging (MRI)

Functional MRI (fMRI)

Multi-modal data

Machine-learning-based studies for the differential diagnosis of PD

Dopaminergic imaging

Structural MRI

Functional MRI (fMRI)

Diffusion tenser imaging (DTI)

Multi-modal data

Machine-learning-based studies for the early detection of PD

Dopaminergic imaging

Structural MRI

Functional MRI (fMRI)

Multi-modal data

Discussion

The value and role of machine learning in the diagnosis and early detection of PD

Current limitations and challenges in machine-learning applications

First, prone-to-error labeling of classes/groups in supervised learning

Second, machine-learning “black-box”

Third, overfitting problem in machine learning

Future directions

First, improve and validate the machine-learning applications

Second, improve modeling longitudinal multi-modal data

Third, integrate ML-based applications into clinical decision support system to aid in PD diagnosis

Conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Exploiting macro- and micro-structural brain changes for improved Parkinson’s disease classification from MRI data

A new hybrid approach based on AOA, CNN and feature fusion that can automatically diagnose Parkinson's disease from sound signals: PDD-AOA-CNN

Comparative analysis of machine learning techniques for Parkinson’s detection: A review

The emerging role of furin in neurodegenerative and neuropsychiatric diseases

Search

Quick links