Utilizing a tablet-based artificial intelligence system to assess movement disorders in a prospective study

Purk, Maximilian; Fujarski, Michael; Becker, Marlon; Warnecke, Tobias; Varghese, Julian

doi:10.1038/s41598-023-37388-3

Article
Open access
Published: 26 June 2023

Scientific Reports volume 13, Article number: 10362 (2023)

Abstract

Spiral drawings on paper are used as routine measures in hospitals to assess Parkinson’s Disease motor deficiencies. In the age of emerging mobile health tools and Artificial Intelligence, a comprehensive digital setup enables granular biomarker analyses and improved differential diagnoses in movement disorders. This study aims to evaluate discriminatory features among Parkinson’s Disease patients, healthy subjects and patients with diverse movement disorders. Overall, 24 Parkinson’s Disease patients, 27 healthy controls and 26 patients with similar differential diagnoses were assessed with a novel tablet-based system. It provides an integrative assessment by combining a structured symptoms questionnaire (the Parkinson’s Disease Non-Motor Scale) with two-handed spiral drawing captured on a tablet device. Three different classification tasks were evaluated: Parkinson’s Disease patients versus healthy control group (Task 1), all movement disorders versus healthy control group (Task 2) and Parkinson’s Disease patients versus diverse other movement disorder patients (Task 3). To systematically study the feature importances of digital biomarkers, a Machine Learning classifier is cross-validated and interpreted with SHapley Additive exPlanations (SHAP) values. The number of non-motor symptoms differed significantly for Tasks 1 and 2 but not for Task 3. The proposed drawing features partially differed significantly for all three tasks. The diagnostic accuracy was on average 94.0% in Task 1, 89.4% in Task 2, and 72% in Task 3. While the accuracy in Task 3 using only the symptom questionnaire was close to the baseline, it improved from 60 to 72% when the tablet-based features were included. The accuracies for all three tasks were significantly improved by integrating the two modalities.
These results show that tablet-based drawing features can not only be captured by consumer-grade devices but also capture features specific to Parkinson’s Disease that significantly improve the diagnostic accuracy compared to the symptom questionnaire alone. Therefore, the proposed system provides an objective type of disease characterization of movement disorders, which could be utilized for home-based assessments as well.

Clinicaltrials.gov Study-ID: NCT03638479.

Introduction

Parkinson’s Disease (PD) is a widespread neurodegenerative disorder. The prevalence of PD in people over 60 years is about one percent¹.

PD presents with complex, heterogeneous symptoms. It is primarily characterized by its motor symptoms², for instance a 4–6 Hz rest tremor, bradykinesia and muscular rigidity³,⁴. Because of variable manifestations, subtypes such as tremor-dominant or hypokinetic types are described⁵. However, in early stages of Parkinson’s Disease, several non-motor symptoms exist, such as loss of smell, constipation, depression and sleep disturbances⁶. Together, these symptoms result in a loss of quality of life and can reduce life expectancy⁷. Improving early diagnostics can improve quality of life through the timely introduction of stage-appropriate therapy⁸. However, the heterogeneous manifestation makes diagnosis difficult. The diagnosis of PD is a clinical diagnosis primarily based on the medical history and the clinical examination⁹. For staging and description, the clinical presentation of the disease is often classified according to the Hoehn and Yahr scale or the more comprehensive Unified Parkinson’s Disease Rating Scale (UPDRS)¹⁰,¹¹.

The high prevalence, combined with the low rate of correctly diagnosed Parkinson’s Disease (PD) patients of 73.8% by general practitioners and 79.6% by movement disorder experts, shows the importance of research in diagnosing movement disorders¹². Furthermore, there is a need for objective and easy-to-use tools in times of fast-track care and telemedicine, as there is no reliable biochemical marker in daily use. Potential biomarkers, however, are currently being researched at many different levels¹³.

In current research, digitalization is playing an increasingly important role. Some research groups are studying the voice of people with Parkinson’s as a subject of exploration¹⁴,¹⁵. In addition, there are experiments with wearables and handwriting analysis¹⁶,¹⁷,¹⁸.

The idea of analyzing tremors with a digitizing tablet was published as early as 1990 by Elble et al.¹⁹, refined in several studies and used to classify between PD and healthy controls or between PD and essential tremor²⁰,²¹,²². Different types of tasks have been analyzed, such as writing letters or drawing a simple line²³,²⁴.

Memedi et al. built a machine learning classifier based on motor features that distinguishes between bradykinesia and dyskinesia²⁰. San Luciano et al. used a spiral drawing examination to search for an early biomarker in the diagnosis of PD, reporting a sensitivity of 86% and a specificity of 81%²⁵.

Most previous studies found significant differences in the distributions of feature values between the different classes or promising results in machine learning. Nevertheless, there is no uniform device-based system recommended in diagnostics of Parkinson’s Disease⁹.

The research to date has tended to focus on the motor symptoms rather than the non-motor component²⁶. Connecting both important feature types, motor symptoms (spiral drawing) and non-motor symptoms (questionnaire) in one integrative assessment provides promising potential for deeper disease characterization²⁷.

The comprehensive approach of the Smart Device System with the inclusion of wearables and the studies of speech was extended to include spiral drawings²⁸. The new data include the time, position and force values collected during the digitized spiral drawing for both arms and the Parkinson’s Disease Non-Motor Scale questionnaire (PD-NMS). As Chen et al. published, the best results in distinguishing different tremors are obtained if the participant follows a given spiral²². In order to include such an assessment in telemedicine, the examination procedure must be easy to implement²⁹.

The first classification task, Task 1, aims to distinguish PD patients from a healthy control group with no known history of movement disorders (CG). Task 2 aims to separate all movement disorders (MD) from the CG. Task 3 contains the most complex differentiation, between PD and diverse movement disorders (DD), and is currently understudied¹⁷, as most previous work focuses on PD versus CG. Task 3 is of high clinical relevance because the medical expert or neurologist cannot assume that the patient is either healthy or has PD. Therefore, disease features or classification models for potential diagnosis should also be evaluated against differential diagnoses to provide added medical value. In the era of digital transformation and mHealth, a tablet-based system enables simple integration of highly PD-relevant input, which is currently captured in hospital-based settings or assessment centers. The goal of the study is to capture spiral-drawing and questionnaire features and to evaluate the applied features in the context of these three tasks.

Material and methods

Study

The prospective study started in 2018 and was extended until the end of 2021. The methods were performed in accordance with relevant guidelines and regulations and approved by the ethical board of the University of Münster and the physician’s chamber of Westphalia–Lippe (Reference number: 2018.328.f-S). It was conducted at the outpatient clinic of movement disorders at the University Hospital Münster in Germany. The details of the study design and the protocol have been published previously²⁸. Study registration ID on ClinicalTrials.gov: NCT03638479. All participants provided written informed consent before participation. An overview of the study design can be found in Fig. 1. As a tertiary care center for movement disorders, the outpatient clinic of the Department of Neurology at the University Hospital Münster has broad access to patients affected by movement disorders. The DD class contains eight patients with essential tremor³⁰, six patients with multiple sclerosis³¹, one patient with lithium-associated tremor³², three patients with atypical parkinsonism³³ (two with multiple system atrophy and one with progressive supranuclear palsy), one patient with tremor of unknown origin, one patient with hand tremor associated with dystonia³⁴ and two ataxia patients³⁵. An overview of the demographic data can be found in Table 1. The severity of PD can be assessed from the distribution of the Hoehn and Yahr classification¹⁰.

Table 1 Demographic data.

Data acquisition

Details about the study design, procedures and preliminary results were published previously²⁸,³⁶. The examination was split into two parts. Information about the non-motor symptoms of the participants was captured with the Parkinson’s Disease Non-Motor Scale (PDNMS) as a patient-reported outcome. The 30 yes–no items included in the PDNMS check for typical non-motor symptoms of PD, among them questions about sleep, mood, sexual function and cognition²⁷.

The tablet-based assessment was designed together with movement disorder specialists who have more than a decade of experience in diagnosis and treatment at the outpatient clinic for movement disorders. Participants were instructed to draw an Archimedean spiral twice with each hand. The spiral is drawn with a stylus on a tablet, starting in the middle and following the given lines. The maximal radius was 3.75 cm, and one spiral contains four loops (Fig. 2). The instructions to the participant did not involve further restrictions such as time limits. This is intended to make drawings made at home comparable and to decrease the dependency on the rater. As Kotsavasiloglou et al. propose, basic assessments offer an advantage²³. However, the loosely defined instructions lead to common errors that need to be addressed in later data processing. The system checked whether each recorded spiral was drawn from the central point outwards; if not, the time series was reversed.
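The direction check described above can be sketched as follows. This is only an illustration under stated assumptions: the function name `ensure_center_outward`, the column layout `(t, x, y, force)`, the comparison of the mean radius of the first and last tenth of the samples, and the way timestamps are mirrored are all hypothetical choices and are not taken from the study's implementation.

```python
import numpy as np


def ensure_center_outward(samples, center=(0.0, 0.0)):
    """Return the recording ordered from the spiral center outwards.

    samples: array of shape (n, 4) with columns t, x, y, force (assumed layout).
    center:  assumed coordinates of the spiral's central point.
    """
    radius = np.hypot(samples[:, 1] - center[0], samples[:, 2] - center[1])
    k = max(1, len(samples) // 10)
    # Assumption: a drawing started in the middle has a smaller mean radius
    # at its beginning than at its end.
    if radius[:k].mean() <= radius[-k:].mean():
        return samples
    reversed_samples = samples[::-1].copy()
    # Mirror the timestamps so that time still increases after the reversal.
    t = samples[:, 0]
    reversed_samples[:, 0] = t[0] + (t[-1] - t[::-1])
    return reversed_samples
```

A recording that already runs outwards is returned unchanged, so the function can be applied to every drawing before feature extraction.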

PD tremors are typically described as a unilateral tremor in the early stages³⁷. By examination of only one hand, the one-sided tremor would not be captured for all cases. The two-handed approach observes the laterality of the disease. Executing the task for each side twice offers a more stable system against execution errors.

Data were acquired on an Apple iPad. The stylus was an Apple Pencil, which recorded the drawing at a sampling rate of 240 Hz. The raw data form a multivariate time series with a timestamp, x-coordinate, y-coordinate and force value for each data point.

Features

Apart from the PDNMS questionnaire, 13 variables representing motor symptoms were identified by reviewing comparable studies after searching for spiral drawing and PD assessment on PubMed and Google Scholar, including grey literature, from 2000 to 2022²²,²⁴,³⁸,³⁹. The features can be split into four categories. The first category contains the information of the PDNMS questionnaire and covers the non-motor symptoms²⁷. The other three categories quantify the motor symptoms with metrics related to precision, force or time. The features that address the motor symptoms are numbered consecutively from F1 to F13. A list of all features along with their category can be found in the supplements (S1).

The two corresponding feature values per arm were combined by taking their mean. Since lifting the pen during the assessment is a common error, some outliers in the raw data needed to be filtered. For time-independent features such as the mean distance to the spiral, the five percent of data points with the highest values were removed as outliers. For the time-dependent features, the first and last 10 percent of the data points were clipped off because of the high variance when the pen is set down and lifted.
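The two filtering rules can be illustrated with a short NumPy sketch. The helper names `drop_top_fraction` and `clip_ends` are hypothetical, and the use of a quantile threshold for the top five percent is an assumption; the text does not state how the cut-off was implemented.

```python
import numpy as np


def drop_top_fraction(values, fraction=0.05):
    """Remove the highest `fraction` of values (time-independent features)."""
    values = np.asarray(values, dtype=float)
    # Assumption: the cut-off is the (1 - fraction) quantile of the series.
    threshold = np.quantile(values, 1.0 - fraction)
    return values[values <= threshold]


def clip_ends(series, fraction=0.10):
    """Drop the first and last `fraction` of samples (time-dependent features)."""
    series = np.asarray(series)
    k = int(len(series) * fraction)
    return series[k:len(series) - k]
```

For a drawing with 1000 samples, `clip_ends` keeps samples 100 to 899, which removes the segments where the pen is set down and lifted.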

The further analysis and feature extraction were performed on the participant’s arm with the more prominent tremor. In preparation, the x- and y-positions of the stylus were converted to a time series describing the distance to the given spiral. The side with the larger standard deviation of the distance to the perfect spiral was considered the more strongly affected side, and its values were used for further analysis (Fig. 2).
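A minimal sketch of the distance series and the side selection is given below. The text does not specify how the distance to the template was computed, so the nearest-point search on a densely sampled reference spiral is an assumption, as are the function names, the coordinate unit (cm, with the spiral centered at the origin) and the number of reference points. Only the maximal radius of 3.75 cm and the four loops come from the description of the task.

```python
import numpy as np

MAX_RADIUS_CM = 3.75  # maximal radius of the template spiral
LOOPS = 4             # number of loops of the template spiral


def reference_spiral(n_points=2000):
    """Sample an Archimedean spiral (radius grows linearly with the angle)."""
    theta = np.linspace(0.0, 2.0 * np.pi * LOOPS, n_points)
    r = MAX_RADIUS_CM * theta / theta[-1]
    return np.column_stack((r * np.cos(theta), r * np.sin(theta)))


def distance_to_spiral(x, y, spiral=None):
    """Distance of each drawn point to its nearest reference point (assumed metric)."""
    spiral = reference_spiral() if spiral is None else spiral
    points = np.column_stack((x, y))
    # Pairwise differences with shape (n_samples, n_reference, 2).
    diffs = points[:, None, :] - spiral[None, :, :]
    return np.sqrt((diffs ** 2).sum(axis=2)).min(axis=1)


def more_affected_side(dist_left, dist_right):
    """Pick the side whose distance series has the larger standard deviation."""
    return "left" if np.std(dist_left) > np.std(dist_right) else "right"
```

The pairwise search needs memory proportional to the number of samples times the number of reference points, which is acceptable for single drawings but would call for a spatial index on larger data.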

Non-motor symptoms features

QYes is the count of positively answered questions in the PDNMS described above. The PDNMS quantifies non-motor impairments, such as cardiovascular or memory problems. The score covers a total of nine dimensions, including gastrointestinal symptoms and a subjective assessment of fatigue. Thus, the possible range is between 0 (all questions answered with “no”) and 30 (all questions answered with "yes")²⁷.

Precision features

The second group contains the features that quantify the precision of the spirals. They are all derived from the time series of the distance to the given spiral. For F1c DistanceFFT, a discrete fast Fourier transform (FFT) was applied to the distance-time series with the Python 3.8 NumPy package (version 1.20.2)⁴⁰. The resulting spectrum was reduced to the frequency range between 3 and 15 Hz and quantized into 20 bins. On these bins, the standard deviation was calculated to detect the presence of a dominant frequency. A dominant frequency implies an overall low variation across the bins with a single outlier, and therefore a low standard deviation. For further analyses, the absolute value of the difference between both sides is calculated as information on laterality.
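The spectral feature can be sketched with `numpy.fft` as shown below. Several details are assumptions because the text does not state them: uniform sampling at 240 Hz, the use of the magnitude spectrum, and averaging the spectral values inside each of the 20 bins. Depending on how the bins are aggregated and normalized, the absolute values (and even the direction of the effect) of this sketch may differ from the authors' implementation. The function names are hypothetical.

```python
import numpy as np


def distance_fft_feature(distance, fs=240.0, f_lo=3.0, f_hi=15.0, n_bins=20):
    """Standard deviation over 20 bins of the 3-15 Hz magnitude spectrum."""
    distance = np.asarray(distance, dtype=float)
    spectrum = np.abs(np.fft.rfft(distance))
    freqs = np.fft.rfftfreq(len(distance), d=1.0 / fs)
    band = (freqs >= f_lo) & (freqs <= f_hi)
    if band.sum() < n_bins:
        raise ValueError("recording too short to fill all frequency bins")
    # Assumption: each bin is summarized by the mean magnitude it contains.
    bins = np.array_split(spectrum[band], n_bins)
    return float(np.std([b.mean() for b in bins]))


def laterality(feature_left, feature_right):
    """Absolute difference of the feature between both sides."""
    return abs(feature_left - feature_right)
```

At 240 Hz the 3 to 15 Hz band contains roughly one spectral value per 20 samples, so a recording needs on the order of 400 samples (about 1.7 s) before all 20 bins can be filled.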

Feature F2 MaxDistance calculates the maximal distance for each drawn spiral to the given spiral. Features F3 MeanDistance and F4 StdDevDistance calculate the mean and the standard deviation of the given distance series respectively.

Feature F5 ChangeOfRadiusDirection counts the shifts of the radius from increasing to decreasing and vice versa. A perfect spiral has a steadily increasing radius. Introducing irregularities into the spiral, such as by tremor, increases the number of shifts. Features F6 ChangesOfDirectionX and F7 ChangesOfDirectionY are like F5 but only consider the X or Y axis respectively.
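Counting direction changes amounts to counting sign changes in the first difference of a series. The following sketch shows one way to do this for F5 to F7; the function names, the spiral center at the origin and the decision to ignore samples where the value does not change are assumptions made for illustration.

```python
import numpy as np


def count_direction_changes(series):
    """Count switches between increasing and decreasing in a 1-D series."""
    steps = np.diff(np.asarray(series, dtype=float))
    signs = np.sign(steps)
    signs = signs[signs != 0]  # assumption: flat segments are ignored
    return int(np.count_nonzero(signs[1:] != signs[:-1]))


def direction_change_features(x, y, center=(0.0, 0.0)):
    """F5 uses the radius, F6 and F7 use the x and y coordinate respectively."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    radius = np.hypot(x - center[0], y - center[1])
    return {
        "F5_ChangeOfRadiusDirection": count_direction_changes(radius),
        "F6_ChangesOfDirectionX": count_direction_changes(x),
        "F7_ChangesOfDirectionY": count_direction_changes(y),
    }
```

For an ideal spiral the radius only increases, so F5 is zero, while F6 and F7 are non-zero even without tremor because the x and y coordinates oscillate with every loop.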

Force features

The third group contains the force-related features. The force value provided through the Apple Development Framework is in relation to a predefined value with 1 equal to an average user input⁴¹. Features F8 MeanForce, F9 StDevForce and F10 MedianForce correspond to the mean, standard deviation and median of the applied force respectively.

Time-related features

Because bradykinesia and rigidity are potential motor symptoms in PD, the temporal dynamics of the drawing have to be analyzed⁹. The time-related aspect is considered in the last feature group. Feature F11 TimeOfDrawing calculates the total drawing time.

The x- and y-coordinates and the timestamps are transformed into a time series of velocities. The velocity at each time instance is calculated according to Eq. (1), where f is the velocity function, x the x-coordinate, y the y-coordinate and t the time value.

$$f_{i}(x_{i}, y_{i}, t_{i}) = \frac{\sqrt{(x_{i} - x_{i-1})^{2} + (y_{i} - y_{i-1})^{2}}}{t_{i} - t_{i-1}}$$

(1)

Equation (1) is the discrete derivative of the position with respect to time; applying the Euclidean norm yields a one-dimensional scalar for the absolute velocity at time instance t_i. f₀ is assumed to be 0.

Features F12 MeanVelocity and F13 StdDevVelocity are calculated as the mean and the standard deviation of the velocity time series, respectively. The variance is given by Eq. (2), where N is the total number of data points, σ² the variance, x the x-coordinate, y the y-coordinate, t the time value and μ the mean value of $f_{i}(x_{i}, y_{i}, t_{i})$.

$$\sigma^{2}(x_{i}, y_{i}, t_{i}) = \frac{\sum_{i}^{N} \left( f_{i}(x_{i}, y_{i}, t_{i}) - \mu \right)^{2}}{N}$$

(2)

Feature F13 equals the square root of this variance.
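Equations (1) and (2) translate directly into NumPy. The sketch below is illustrative only: the function names and the dictionary keys are hypothetical, and it assumes strictly increasing timestamps (duplicate timestamps would lead to a division by zero). `numpy.std` divides by N by default, which matches Eq. (2).

```python
import numpy as np


def velocity_series(x, y, t):
    """Absolute velocity per sample following Eq. (1); the first value is 0."""
    x, y, t = (np.asarray(a, dtype=float) for a in (x, y, t))
    v = np.zeros(len(t))
    v[1:] = np.hypot(np.diff(x), np.diff(y)) / np.diff(t)
    return v


def time_features(x, y, t):
    """F11 (total drawing time), F12 (mean velocity), F13 (std of velocity)."""
    t = np.asarray(t, dtype=float)
    v = velocity_series(x, y, t)
    return {
        "F11_TimeOfDrawing": float(t[-1] - t[0]),
        "F12_MeanVelocity": float(v.mean()),
        "F13_StdDevVelocity": float(v.std()),  # population std, as in Eq. (2)
    }
```

As a worked example, the points (0, 0), (3, 4) and (6, 8) sampled at t = 0, 1, 2 give the velocities 0, 5 and 5, hence a mean velocity of 10/3 and a variance of 50/9.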

To improve the visualization, the logarithm of the drawing-task-related features is used for further investigations, so that erroneous extreme values have less influence on the axis scaling.

Feature statistics and visualization

The feature statistics are split into two approaches. The Spearman correlation coefficient is used to determine correlations between the age, the feature values and the tasks (Table 2), and the Mann–Whitney U test is used to determine significant differences in the distributions among the groups of Tasks 1, 2 and 3. P-values below 0.0036 (0.05/14) were considered significant after applying Bonferroni correction. The correlations were calculated in Python 3.9.9 with the pandas 1.2.5 package⁴², while the Mann–Whitney U test was calculated using the Python package SciPy (version 1.6.3). The three features with the highest correlation with a classification target were chosen for testing for statistically significant differences between the classes (Fig. 3). A list of all tests is available in Supplements S4. For the Machine Learning cross-validation, all features were used for training.
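The significance threshold follows from dividing the conventional level of 0.05 by the 14 comparisons (c = 14 in the caption of Table 2): 0.05/14 ≈ 0.0036. A minimal sketch of both tests with SciPy is shown below. The helper names are hypothetical, the two-sided alternative is an assumption, and the study computed the correlations with pandas rather than with `scipy.stats.spearmanr`.

```python
import numpy as np
from scipy.stats import mannwhitneyu, spearmanr

ALPHA = 0.05
N_TESTS = 14  # 13 drawing features plus the questionnaire score
BONFERRONI_THRESHOLD = ALPHA / N_TESTS  # approximately 0.0036


def compare_groups(values_a, values_b, threshold=BONFERRONI_THRESHOLD):
    """Mann-Whitney U test between two groups (two-sided is an assumption)."""
    _, p_value = mannwhitneyu(values_a, values_b, alternative="two-sided")
    return float(p_value), bool(p_value < threshold)


def correlation_with_target(feature_values, target):
    """Spearman rank correlation between a feature and a binary task label."""
    rho, p_value = spearmanr(feature_values, target)
    return float(rho), float(p_value)
```

A feature would be reported as significantly different between two classes only if the second return value of `compare_groups` is `True`.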

Table 2 Spearman correlation coefficient of the feature values, the age, and the classification task if the correlation is significant (*: p ≤ 0.05/c, **: p ≤ 0.01/c, ***: p ≤ 0.001/c, after Bonferroni-Correction c = 14).

To examine the influence of the age distribution, we calculated the correlation between the age and both the feature values and the target variables, as shown in Table 2. The correlation coefficients are listed for combinations with significant outcomes.

Machine learning and SHAP analysis

A CatBoost classifier is implemented, and the accuracy, precision, recall and F1 score for all three tasks are calculated to evaluate the potential of the classifier and features (Table 3). Furthermore, we included the accuracies for all three tasks for a majority class voter (dummy), a model based only on the PDNMS score, and a model based only on tablet data to evaluate the benefits of integration. CatBoost was chosen because of its suitability for limited data and its high accuracy as an ensemble method⁴³. The tablet features are normalized using a standard scaler and reduced in dimensionality with principal component analysis during the training phase. The default values are used as hyperparameters. To prevent overfitting, a stratified fivefold cross-validation was applied. For interpretability analyses, the SHapley Additive exPlanations (SHAP) values are calculated on the 5 resulting models (Fig. 4). The SHAP values describe the impact of a feature on the classifier outcome. Features whose SHAP values vary widely have a high impact on the model output⁴⁴. Further information can be found in the original paper by Lundberg⁴⁵.
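The evaluation scheme (scaling and PCA on the tablet features, stratified fivefold cross-validation, comparison with a majority class voter) can be sketched with scikit-learn. This is not the authors' pipeline: `GradientBoostingClassifier` is used here as a stand-in for CatBoost so that the example depends on scikit-learn only, and the function name `evaluate`, the number of principal components and the shuffling seed are assumptions that the text does not specify.

```python
import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.decomposition import PCA
from sklearn.dummy import DummyClassifier
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def evaluate(X, y, tablet_columns, n_components=5, seed=0):
    """Mean cross-validated accuracy of the model and of the dummy baseline."""
    # Scale and reduce only the tablet features; other columns (for example
    # the questionnaire score) are passed through unchanged.
    preprocess = ColumnTransformer(
        [("tablet", make_pipeline(StandardScaler(), PCA(n_components=n_components)), tablet_columns)],
        remainder="passthrough",
    )
    # Stand-in classifier (assumption): the study used CatBoost with defaults.
    model = make_pipeline(preprocess, GradientBoostingClassifier(random_state=seed))
    baseline = DummyClassifier(strategy="most_frequent")
    cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=seed)
    return {
        "model": float(cross_val_score(model, X, y, cv=cv, scoring="accuracy").mean()),
        "dummy": float(cross_val_score(baseline, X, y, cv=cv, scoring="accuracy").mean()),
    }
```

Because the scaler and the PCA are part of the pipeline, they are refitted on the training folds of every split, which keeps information from the test fold out of the preprocessing. With 24 PD patients and 27 controls, the dummy baseline for Task 1 would be expected to land near 27/51 ≈ 0.53.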

Table 3 Dummy: accuracy of simple majority class voting estimator.

Results

Feature statistics and visualization

Table 2 shows the correlation between the target variable, the age and the features. Some of the tablet features correlate strongly with the separation of the control group from PD as a distinct group (Task 1) and from the movement disorders combined (Task 2). The features correlate less with the differentiation of PD and DD (Task 3). The features F1c DistanceFFT, F2 MaxDistance, F3 MeanDistance and F4 StdDevDistance strongly correlate with Tasks 1 and 2 but are not significantly correlated with Task 3. Feature F5 ChangeOfRadiusDirection has a high anti-correlation with Task 3, a low correlation with Task 2, and does not correlate with Task 1 at all.

There is no significant correlation between the features and the age distribution (Table 2). A full list of correlation coefficients regardless of the level of significance is available in Supplements S2. The strongest correlation with age was found for feature F3 MeanDistance, with a p-value of 0.024, which is not significant after Bonferroni correction. In addition, age and Task 3 show a positive correlation of 0.53.

The results of the statistical tests are depicted above the boxes in Fig. 3 and indicate the level of significance. Similar to the correlation coefficients, the PDNMS score and the distance-based tablet features differ significantly for Tasks 1 and 2, indicating potentially well-suited markers for the detection of movement disorders. The Mann–Whitney U test did not show a significant difference in the distribution of any of the features between PD and DD.

CatBoost score

The model was trained for three different feature sets for all three tasks as depicted in Table 3. The baseline dummy corresponds to a majority class voter. QYes Only and Tablet Only show the mean accuracy of the model trained on solely the PDNMS score and the tablet-based features respectively. Both subsets show an improvement in accuracy compared to the baseline. The PDNMS score performs better in Task 1 and 2 while the tablet features outperform the PDNMS score in Task 3. The integration of both data modalities greatly improves the performances for Task 1 and 2, while the PDNMS score only slightly improves the accuracy in Task 3.

Feature importance

The high SHAP feature importance of the PDNMS questionnaire in Tasks 1 and 2 (Fig. 4) is consistent with the correlations found in Table 2. For Task 3 the questionnaire ranks last, indicating only little impact on the differentiation of the considered movement disorders.

Higher absolute SHAP values suggest a higher influence on the model’s output. In Fig. 4a,b the QYes feature has the highest importance for the classifier: the more questions of the PDNMS were answered with "Yes", the more likely a disease becomes. For distinguishing MD and CG (Fig. 4), F5 ChangeOfRadiusDirection has the second highest impact, followed by the mean and median force during the drawing (F8 and F10). In classification Task 3, PD versus DD (Fig. 4), feature F5 ChangeOfRadiusDirection shows a high feature importance.

Error analysis

To study the cause of misclassifications, we calculated the descriptive statistics of the erroneously classified subgroups for all tasks on the test sets of the cross-validation (Table 4). The PD patients that were not recognized in Tasks 1 and 3 were on average older than 70 years. The false positives in Task 3 (DD identified as PD) were on average 52 years old and therefore younger than the false negatives (PD identified as DD).

Table 4 Error analysis descriptive statistics of false classifications.

The age and gender distributions of the false positives and false negatives in Task 2 were about the same. In Task 1, the false positives had a mean PDNMS score of 3.6, which is higher than the CG average, while the false negatives had a considerably lower PDNMS score of 1 on average. The same applies to the distribution in Task 2 with MD and CG. No PD patients were misclassified in Task 2.

Discussion

Technology-based objective measurement is emerging in the era of smart devices, and this study shows the feasibility of using objective drawings on a tablet together with patient-reported outcomes to distinguish Parkinson’s Disease from healthy individuals and from other movement disorders. While related work focuses on the former task of differentiating PD from healthy individuals²⁰,²⁵, our cohort includes a multitude of other movement disorders.

The Machine Learning classifier achieved an accuracy of 94.0% in Task 1 (PD vs. CG) and 89.4% in Task 2 (MD vs. CG). Limiting the feature set to the tablet-based features results in an accuracy of 84.5% for Task 1, which indicates a slight improvement compared to similar work²⁰,²⁵. Task 3 (PD vs. DD) reaches an accuracy of 72%, which is comparable to the accuracy of non-expert physicians of 73.8% according to Rizzo et al.¹². Cross-validation was employed as a rigorous method to assess the performance of the model on the available dataset and to estimate its generalization performance. However, the performance of a model is influenced by the specific dataset used for training and testing, which could limit the generalizability of the findings. Therefore, a future study with a larger dataset is necessary to enhance the transferability of the outcomes.

The novelty of the study is the combination and integrated analysis of motor and non-motor symptom assessments. The analysis of the motor symptoms based on drawing and of the non-motor symptoms with the PDNMS resulted in a noticeable information gain. The PDNMS shows high classification performance for Tasks 1 and 2, and a low performance close to the baseline in Task 3 (PD vs. DD). Therefore, the questionnaire is able to separate non-healthy participants from a healthy control group. This result is to be expected since the recruited MD patients include several neurodegenerative diseases that cause multiple non-motor symptoms. Compared to the PDNMS, the tablet data perform slightly worse for Task 1, worse for Task 2, but considerably better for Task 3. The integration of the PDNMS and the tablet features improved the performance for all three tasks. The features regarding global properties of the resulting spiral, such as the number of directional changes of the drawing, proved to be of no relevance for the classification tasks, except for F5 ChangeOfRadiusDirection for Tasks 2 and 3. It describes the number of directional changes along the radial axis and indirectly measures the frequency of a possible tremor. The standard deviation of several properties has shown a significant impact on the classification in Tasks 1 and 2 and should be considered for further analyses of MD detection.

Including the tablet data significantly improves the performance for Task 3 compared to a simple questionnaire. Here, the proposed Feature F5 ChangeOfRadiusDirection proved to be of high feature importance. PD patients have significantly lower values, indicating that tremors for PD patients tend to have a lower frequency compared to patients of the other movement disorders group regardless of the amplitude.

However, the differentiation between PD and DD in Task 3 remains difficult. The variety of symptoms of the recruited patients causes the PDNMS questionnaire to underperform³⁷. Medication also has a high impact on the symptoms⁴⁶. The high variance in symptoms of the PD patients further makes it difficult to distinguish Parkinson’s Disease from other diseases⁴⁷. The broad range in the Hoehn and Yahr Scale or the Unified Parkinson’s Disease Rating Scale demonstrates the high variability in manifestations¹⁰,³⁷. Treatment with L-Dopa cannot prevent the switch between On and Off states in later disease stages, which represent good motor function (On) and immobility (Off)⁹,⁴⁸. In the light of similar characteristics or pathogenesis of PD and diverse differential diagnoses such as essential tremor, it remains challenging for the provided assessments to distinguish all these entities without further information, e.g., through clinical examination, imaging, or further biomarkers.

Due to the aging population, an increase in the prevalence of movement disorders is likely, and the demand for objective diagnostic criteria based on cost-effective technology is rising⁴⁹. The results show the potential of the system in health care, especially in settings that lack movement disorder experts. Telemedicine offers opportunities to improve healthcare and to maintain adequate health care in sparsely populated regions⁵⁰. Moreover, the promising results of Task 2 in detecting general movement disorders can inform individuals with motor abnormalities about whether they need to visit a physician.

Next to the field of diagnosis, there is a broad area of usage of objective movement disorder assessments for monitoring the progress of symptoms. As mentioned in the introduction, there is a high disparity in the examination results between the physicians¹². This tool makes individuals and their symptoms comparable, even over a long time and between different physicians.

In summary, there is potential in applying device-based tools that utilize sensors and patient reported outcomes for surveillance and diagnostics of movement disorders. Further research is necessary for finding disease-specific biomarkers to improve the classification of specific entities like Parkinson’s Disease from further differential diagnoses.

Data availability

The data supporting our findings are openly available in our Git repository: https://imigitlab.uni-muenster.de/published/sds-tablet-based-ai.

References

Tysnes, O.-B. & Storstein, A. Epidemiology of Parkinson’s disease. J. Neural Transm. 124, 901–905 (2017).
Article PubMed Google Scholar
Gelb, D. J., Oliver, E. & Gilman, S. Diagnostic criteria for Parkinson disease. Arch. Neurol. 56, 33–39 (1999).
Article CAS PubMed Google Scholar
Findley, L. J., Gresty, M. A. & Halmagyi, G. M. Tremor, the cogwheel phenomenon and clonus in Parkinson’s disease. J. Neurol. Neurosurg. Psychiatry 44, 534–546 (1981).
Article CAS PubMed PubMed Central Google Scholar
Pagano, G., Ferrara, N., Brooks, D. J. & Pavese, N. Age at onset and Parkinson disease phenotype. Neurology 86, 1400–1407 (2016).
Article CAS PubMed PubMed Central Google Scholar
Thenganatt, M. A. & Jankovic, J. Parkinson disease subtypes. JAMA Neurol. 71, 499–504 (2014).
Article PubMed Google Scholar
Barone, P. et al. The PRIAMO study: A multicenter assessment of nonmotor symptoms and their impact on quality of life in Parkinson’s disease. Mov. Disord. 24, 1641–1649 (2009).
Article PubMed Google Scholar
Jagadeesan, A. J. et al. Current trends in etiology, prognosis and therapeutic aspects of Parkinson’s disease: A review. Acta Bio Med. Atenei Parm. 88, 249–262 (2017).
CAS Google Scholar
Postuma, R. B. Prodromal Parkinson disease: Do we miss the signs?. Nat. Rev. Neurol. 15, 437–438 (2019).
Article PubMed Google Scholar
Kalia, L. V. & Lang, A. E. Parkinson’s disease. The Lancet 386, 896–912 (2015).
Article CAS Google Scholar
Hoehn, M. M. & Yahr, M. D. Parkinsonism: Onset, progression, and mortality. Neurology 17, 427–427 (1967).
Article CAS PubMed Google Scholar
Martínez-Martín, P. et al. Unified Parkinson’s disease rating scale characteristics and structure. Mov. Disord. 9, 76–83 (1994).
Article PubMed Google Scholar
Rizzo, G. et al. Accuracy of clinical diagnosis of Parkinson disease: A systematic review and meta-analysis. Neurology 86, 566–576 (2016).
Article PubMed Google Scholar
Lotankar, S., Prabhavalkar, K. S. & Bhatt, L. K. Biomarkers for Parkinson’s disease: Recent advancement. Neurosci. Bull. 33, 585–597 (2017).
Article CAS PubMed PubMed Central Google Scholar
Klucken, J. et al. Mobile biometrische Ganganalyse: Potenzial für Diagnose und Therapiemonitoring beim Parkinson-Syndrom. Nervenarzt 82, 1604–1611 (2011).
Article CAS PubMed Google Scholar
Srulijes, K. et al. Association between vestibulo-ocular reflex suppression, balance, gait, and fall risk in ageing and neurodegenerative disease: Protocol of a one-year prospective follow-up study. BMC Neurol. 15, 192 (2015).
Klucken, J. et al. „Wearables“ in der Behandlung neurologischer Erkrankungen—wo stehen wir heute?. Nervenarzt 90, 787–795 (2019).
Varghese, J. et al. Sensor validation and diagnostic potential of smartwatches in movement disorders. Sensors 21, 3139 (2021).
Parziale, A., Senatore, R., Della Cioppa, A. & Marcelli, A. Cartesian genetic programming for diagnosis of Parkinson disease through handwriting analysis: Performance vs. interpretability issues. Artif. Intell. Med. 111, 101984 (2021).
Elble, R. J., Sinha, R. & Higgins, C. Quantification of tremor with a digitizing tablet. J. Neurosci. Methods 32, 193–198 (1990).
Memedi, M. et al. Automatic spiral analysis for objective assessment of motor symptoms in Parkinson’s disease. Sensors 15, 23727–23744 (2015).
Sisti, J. A. et al. Computerized spiral analysis using the iPad. J. Neurosci. Methods 275, 50–54 (2017).
Chen, K.-H., Yang, B.-S. & Chen, Y.-J. A digital assessment system for evaluating kinetic tremor in essential tremor and Parkinson’s disease. BMC Neurol. 18, 25 (2018).
Kotsavasiloglou, C., Kostikis, N., Hristu-Varsakelis, D. & Arnaoutoglou, M. Machine learning-based classification of simple drawing movements in Parkinson’s disease. Biomed. Signal Process. Control 31, 174–180 (2017).
Zham, P., Arjunan, S. P., Raghav, S. & Kumar, D. K. Efficacy of guided spiral drawing in the classification of Parkinson’s disease. IEEE J. Biomed. Health Inform. 22, 1648–1652 (2018).
San Luciano, M. et al. Digitized spiral drawing: A possible biomarker for early Parkinson’s disease. PLoS ONE 11, e0162799 (2016).
Bain, P. G. et al. Assessing tremor severity. J. Neurol. Neurosurg. Psychiatry 56, 868–873 (1993).
Chaudhuri, K. R. et al. The metric properties of a novel non-motor symptoms scale for Parkinson’s disease: Results from an international pilot study. Mov. Disord. 22, 1901–1911 (2007).
Varghese, J. et al. A smart device system to identify new phenotypical characteristics in movement disorders. Front. Neurol. 10, 48 (2019).
Klaassen, B., van Beijnum, B. J. F. & Hermens, H. J. Usability in telemedicine systems—A literature survey. Int. J. Med. Inform. 93, 57–69 (2016).
Haubenberger, D. & Hallett, M. Essential tremor. N. Engl. J. Med. 378, 1802–1810 (2018).
Oh, J., Vidal-Jordana, A. & Montalban, X. Multiple sclerosis: Clinical aspects. Curr. Opin. Neurol. 31, 752–759 (2018).
Baek, J. H., Kinrys, G. & Nierenberg, A. A. Lithium tremor revisited: Pathophysiology and treatment. Acta Psychiatr. Scand. 129, 17–23 (2014).
Levin, J., Kurz, A., Arzberger, T., Giese, A. & Höglinger, G. U. The differential diagnosis and treatment of atypical Parkinsonism. Dtsch. Ärztebl. Int. 113, 61–69 (2016).
Pandey, S. & Sarma, N. Tremor in dystonia. Parkinsonism Relat. Disord. 29, 3–9 (2016).
Kuo, S.-H. Ataxia. Contin. Minneap. Minn 25, 1036–1054 (2019).
Varghese, J. et al. Smartwatch-based examination of movement disorders: Early implementation and measurement accuracy. 64 Jahrestag. Dtsch. Ges. Für Med. Inform. Biometrie und Epidemiologie e.V. (GMDS). https://doi.org/10.3205/19GMDS136 (2019).
Jankovic, J. Parkinson’s disease: Clinical features and diagnosis. J. Neurol. Neurosurg. Psychiatry 79, 368–376 (2008).
Chen, K.-H., Lin, P.-C., Yang, B.-S. & Chen, Y.-J. The difference in visuomotor feedback velocity control during spiral drawing between Parkinson’s disease and essential tremor. Neurol. Sci. 39, 1057–1063 (2018).
Sadikov, A. et al. ParkinsonCheck A Decision Support System for Tremor Detection. /paper/ParkinsonCheck-A-Decision-Support-System-for-Tremor-Sadikov-Zabkar/bccacf91ac7ffbd4fef14ef1066e93daff74373f (2015).
Harris, C. R. et al. Array programming with NumPy. Nature 585, 357–362 (2020).
force | Apple Developer Documentation. https://developer.apple.com/documentation/uikit/uitouch/1618110-force (2022).
Reback, J. et al. pandas-dev/pandas: Pandas 1.3.5. https://doi.org/10.5281/zenodo.3509134 (2021).
Dorogush, A. V., Ershov, V. & Gulin, A. CatBoost: Gradient boosting with categorical features support. Preprint at https://arxiv.org/abs/1810.11363 (2018).
Genuer, R., Poggi, J.-M. & Tuleau-Malot, C. VSURF: An R package for variable selection using random forests. R J. 7, 19 (2015).
Lundberg, S. & Lee, S.-I. A unified approach to interpreting model predictions. Preprint at https://arxiv.org/abs/1705.07874 (2017).
Suppa, A., Bologna, M., Conte, A., Berardelli, A. & Fabbrini, G. The effect of L-dopa in Parkinson’s disease as revealed by neurophysiological studies of motor and sensory functions. Expert Rev. Neurother. 17, 181–192 (2017).
Caproni, S. & Colosimo, C. Diagnosis and differential diagnosis of Parkinson disease. Clin. Geriatr. Med. 36, 13–24 (2020).
Chou, K. L. et al. The spectrum of “off” in Parkinson’s disease: What have we learned over 40 years? Parkinsonism Relat. Disord. 51, 9–16 (2018).
Feigin, V. L. et al. The global burden of neurological disorders: Translating evidence into policy. Lancet Neurol. 19, 255–265 (2020).
Schneider, R. B. & Biglan, K. M. The promise of telemedicine for chronic neurological disorders: The example of Parkinson’s disease. Lancet Neurol. 16, 541–551 (2017).

Funding

Open Access funding enabled and organized by Projekt DEAL. The study is funded by the Innovative Medical Research for Young Scientists (IMF) with project number VA111809.

Author information

Authors and Affiliations

Institute of Medical Informatics, University of Münster, Münster, Germany
Maximilian Purk, Michael Fujarski, Marlon Becker & Julian Varghese
Department of Neurology and Neurorehabilitation, Klinikum Osnabrück–Academic Teaching Hospital of the University of Münster, Osnabrück, Germany
Tobias Warnecke

Contributions

Conceptualization: J.V. Data curation: M.P. Formal analysis: M.P., M.F., M.B., J.V. Funding acquisition: J.V. Investigation: M.P., J.V. Methodology: M.P., J.V. Project administration: M.P., J.V. Resources: J.V., T.W. Supervision: J.V., T.W. Validation: T.W., J.V. Visualization: M.P. Writing – original draft: M.P. Writing – review & editing: M.P., J.V., M.F., M.B., T.W.

Corresponding author

Correspondence to Michael Fujarski.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Purk, M., Fujarski, M., Becker, M. et al. Utilizing a tablet-based artificial intelligence system to assess movement disorders in a prospective study. Sci Rep 13, 10362 (2023). https://doi.org/10.1038/s41598-023-37388-3

Received: 28 September 2022
Accepted: 21 June 2023
Published: 26 June 2023
DOI: https://doi.org/10.1038/s41598-023-37388-3
