Abstract
Emergent tremor in Parkinson’s disease (PD) can occur during sustained postures or movements that are different from action tremor. Tremor can contaminate the clinical rating of bradykinesia during finger tapping. Currently, there is no reliable way of isolating emergent tremor and measuring the cardinal motor symptoms based on voluntary movements only. In this study, we investigated whether emergent tremor during repetitive alternating finger tapping (RAFT) on a quantitative digitography (QDG) device could be reliably identified and distinguished from voluntary tapping. Ninety-six individuals with PD and forty-two healthy controls performed a thirty-second QDG-RAFT task and the Movement Disorders Society – Unified Parkinson’s Disease Rating Scale Part III (MDS-UPDRS III). Visual identification of tremor during QDG-RAFT was labeled by an experienced movement disorders specialist. Two methods of identifying tremor were investigated: 1) physiologically informed temporal thresholds 2) XGBoost model using temporal and amplitude features of tapping. The XGBoost model showed high accuracy for identifying tremor (area under the precision-recall curve of 0.981) and outperformed temporal-based thresholds. Percent time duration of classifier-identified tremor showed significant correlations with MDS-UPDRS III tremor subscores (r = 0.50, p < 0.0001). There was a significant change in QDG metrics for bradykinesia, rigidity, and arrhythmicity after tremor strikes were excluded (p < 0.01). The results demonstrate that emergent tremor during QDG-RAFT has a unique digital signature and the duration of tremor correlated with the MDS-UPDRS III tremor items. When involuntary tremor strikes were excluded, the QDG metrics of bradykinesia and rigidity were significantly worse, demonstrating the importance of distinguishing tremor from voluntary movement when rating bradykinesia.
Similar content being viewed by others
Introduction
The cardinal motor symptoms of Parkinson’s Disease (PD) are bradykinesia, rigidity, tremor, gait impairment, and postural instability. Rest tremor is present in over 70% of people with PD (PWP)1 and occurs at frequencies between 4 and 7 Hz. Rest tremor is usually suppressed with the onset of action but can emerge during sustained movement (such as walking) and when a posture of the hand or limb is sustained2,3,4,5,6,7.
The emergence of tremor can confound the clinical assessment of bradykinesia using the Movement Disorders Society Parkinson’s Disease Rating Scale Part III (MDS-UPDRS III). For instance, during the finger-tapping task, the thumb may be held in a fixed posture, and regular amplitude, high-frequency finger movement that is actually tremor can be interpreted as voluntary movement, leading to an underestimated bradykinesia score. Conversely, bradykinesia may be overestimated when intermittent tremor interrupts voluntary movement and cause irregularities in tapping, which may be interpreted as hesitations. On the clinical scale, there is no way to isolate involuntary emergent tremor from voluntary movements which is crucial to ensure a more accurate and dependable measurement of Parkinsonian symptom severity.
A potential solution to this dilemma is to utilize high-resolution sensors to quantify motor behavior. One example of this approach is quantitative digitography (QDG), which measures repetitive alternating finger tapping (RAFT) on adjacent tensioned levers, where the hand is held in a sustained posture. QDG provides objective and validated measures of the cardinal motor signs of PD, including rigidity, from a simple 30-s RAFT task performed on a digitography device8,9,10,11,12. QDG metrics for bradykinesia, rigidity, and gait impairment are significantly different in PWP compared to age-matched, healthy controls, are correlated with the UPDRS III, the MDS-UPDRS III, and respective upper and lower extremity sub-scores, and track the effect of medication, pallidotomy, and deep brain stimulation8,9,12. Additionally, QDG is effective in tracking symptom progression over time9,10,12,13. Critically, initial observations from QDG during periods of tremor suggested that it could capture the unique temporal characteristics of tremor8. Therefore, QDG may offer a solution by accurately identifying emergent tremor that could occur in PWP whose hands are held in posture as they perform RAFT and measure bradykinesia and rigidity more reliably based on voluntary tapping only.
In this study, we demonstrate that emergent tremor has a unique digital signature during QDG-RAFT, which could be reliably identified and could distinguish involuntary from voluntary strikes, enabling more representative measurement of voluntary motor signs such as bradykinesia and rigidity based on voluntary strikes only. We used labeled datasets that were created based on visual identification of tremor during QDG-RAFT by a blinded movement disorders specialist to develop and investigate the efficacy of two methods for identifying tremor on a strike-by-strike basis: (1) Using physiologically informed thresholds for temporal features of tapping. (2) Supervised-learning-based XGBoost classifier that uses both temporal and amplitude features of tapping as input features. We performed correlations of tremor percentage detected during RAFT and clinical ratings of tremor severity and persistence from the MDS-UPDRS III. Additionally, we evaluated whether metrics of bradykinesia, rigidity, and arrhythmicity were significantly different when measured based on voluntary strikes only, after the exclusion of tremor. Lastly, we assessed whether the change in metrics was associated with the percent duration of tremor in a trial.
Results
A cohort of ninety-six individuals with PD and forty-two healthy controls performed a self-paced thirty-second rapid alternating finger tapping task (RAFT) on a digitography device. The mean age of the PD participants was (65.1 ± 9.1 years), the disease duration was 8.7 ± 5.7 years, and the mean off-therapy MDS-UPDRS III (N = 83) was 25.1 ± 13.4. The healthy controls were not significantly different in age (60.0 ± 9.0 years, p = 0.127).
Participants were instructed to press and release the keys completely with their index and middle fingers in an alternating fashion while tapping as fast and consistently as possible (see Methods for additional details). Eight metrics of tapping were extracted from the amplitude and timing of the press and release phases of RAFT, as shown in Fig. 1a: press and release amplitudes, inter-strike interval (ISI), dwell time (DT), press and release durations, peak duration, and release slope.
a Raw QDG Trace with highlighted features of tapping. Example traces of QDG-RAFT by a b Healthy Control and c an individual with Parkinson’s Disease. In panels b, c, the first row depicts the key displacement by the index and middle fingers over the 30 s trial, while the second row displays press amplitude and the last two rows depict the inter-strike interval (ISI) and dwell time (DT) of each press by the middle finger, respectively.
RAFT performed by a representative, healthy control was characterized by consistent full amplitude and low (short) ISI key strikes with low (short) dwell times as shown in Fig. 1b. On the other hand, RAFT performed by a representative PWP, shown in Fig. 1c, demonstrated fewer taps per thirty seconds, lower amplitudes, greater (longer) ISIs, longer dwell times and more variability in all metrics than those demonstrated by the healthy control.
Labeled datasets were created based on the visual identification of tremor emergence during QDG-RAFT by an experienced movement disorders specialist. A model that classified tremor based on physiologically determined temporal thresholds was developed and tested using a dataset with the positive class comprising strikes from five trials, which were rated to have tremor for the entire duration of the trial, and the negative class consisting of strikes from 75 healthy control trials (Dataset 1). Tremor strikes from three additional trials, for which the rater provided a timestamp for tremor onset and the emergent tremor persisted for the remainder of the trial, were added to the positive class of the labeled dataset to reduce class imbalance (Dataset 2) for the development of an XGBoost model that used both, temporal and amplitude metrics, to classify tremor (see Methods for additional details).
Characteristics of tremor captured on the digitography device
Figure 2 (see Supplementary Movie 1) demonstrates the emergence of tremor in a PWP during RAFT that eventually replaces any voluntary tapping. The RAFT trace initially demonstrates voluntary alternating strikes, characterized by inconsistent amplitudes, irregular, long ISIs, and variable, long-duration dwell times.
More consistent, non-alternating, high frequency, short dwell time, and high amplitude strikes appeared after twenty-two seconds; the synchronized video demonstrated regular, rhythmic tremor of both fingers, which were striking the device at the same time. Most of these strikes had ISIs of ~225 ms and dwell times of ~55 ms.
Using temporal characteristics to distinguish tremor from voluntary strikes
The characteristic frequency of PD resting tremor is 4–7 Hz, which is equivalent to an ISI between 143 and 250 ms. Four combinations of physiologically informed ISI and dwell time thresholds were iteratively applied to classify tremor in Dataset 1. The sensitivity rate and false-positive rates were evaluated to determine the efficacy of each combination of ISI and dwell time thresholds. An ISI threshold of 216 ms and dwell time threshold of 70 ms had the lowest sensitivity rate of 50.00% and a false-positive rate of 4.30%. Increasing the ISI threshold to 225 ms and the dwell time threshold to 85 ms increased the sensitivity rate to 73.45% and the false-positive rate to 7.49%. We found that further increasing the ISI threshold from 225 to 230 ms, while keeping the dwell time constant at 85 ms, increased the sensitivity rate by 2.26% and the false-positive rate by 1.23%, whereas increasing the dwell time threshold from 80 to 85 ms, while keeping the ISI threshold constant at 225 ms, increased the sensitivity by 5.26% and the false-positive rate by 0.67%. Since increasing the ISI above 225 ms leads to an increase in false-positive rate without a substantial increase in the sensitivity rate, an ISI threshold of 225 ms and a dwell time threshold of 85 ms were chosen for further evaluation (Table 1).
Applying the determined ISI threshold of 225 ms and dwell time threshold of 85 ms to classify strikes in the test set (25% of Dataset 2) had a sensitivity rate of 79%, precision rate of 63%, but a high false-positive rate of 6.7%. The high false-positive rate was largely due to the ability of some healthy controls to perform voluntary RAFT at a high frequency with a short dwell time.
Post-hoc assessment of classification performance was performed by iteratively establishing the ISI threshold to be every ISI value between 143 and 250 ms (corresponding to tapping frequencies between 4 and 7 Hz which is the characteristic frequency of Parkinsonian rest tremor) at each of the identified candidate dwell time thresholds (70, 75, 80, and 85 ms) and evaluating the sensitivity, specificity, and precision for each iteration. The results of this analysis showed similar results to that described above, with the XGBoost Model having a greater sensitivity rate and lower false-positive rate than the best ISI and dwell time combination at each candidate dwell time threshold which were determined based on calculating the highest Youden Index and F1 score. (See Supplementary Fig. 1).
Leveraging temporal and amplitude features of tapping increased the accuracy of tremor identification
An XGBoost classifier model was used to identify tremor on a strike-by-strike basis, which included the press and release amplitude as features, in addition to five temporal metrics of tapping. Assessing the XGBoost model on the test set revealed that the classifier had a high sensitivity of 98%, a precision rate of 84%, an area under the precision-recall curve (AUC) of 0.981, and a lower false-positive rate of 2.7% (Fig. 3a).
The classifier was further validated by comparing the model’s identification of tremor in a trial to the ground truth labels provided by the blinded movement disorders specialist. The classifier correctly identified the presence of tremor in 31 out of 31 trials visually rated as having tremor by the movement disorders specialist and did not identify any instances of tremor in 4 out 5 trials marked as having no tremor. Only one trial visually rated to have no tremor was identified to have a tremor for 0.37% of the trial by the classifier.
Temporal and amplitude metrics are important features of the XGBoost classifier’s performance
Figure 3b demonstrates that the ISI had the highest contribution to the gain of the model when used in decision trees (58.39). The press amplitude had the second-greatest contribution to the gain of the model (14.84) which was closely followed by the dwell time (14.28) and release amplitude (11.46). The release duration (5.52), press duration (5.48), and release slope (4.76) exhibited a comparatively lesser impact on the model’s gain when included in decision trees.
Evaluating the importance of features based on permutation importance revealed that the four features of tapping that had the highest impact on the F1 score were the ISI (49.2), dwell time (8.2), press amplitude (6.6), and release amplitude (5.2). These were followed by the press duration (4.8), release slope (3.8), and release duration (2.2, Fig. 3c).
Digital signatures of tapping are different between tremor and voluntary strikes
After applying the classifier to identify tremor in the cohort of RAFT trials performed by 96 PWP, we found that key strikes classified as tremor had a significantly different distribution of all eight metrics when compared to voluntary strikes; the largest difference was for ISI (D = 0.88) followed by dwell time (D = 0.63), peak duration (D = 0.45), release duration (D = 0.44), release slope (D = 0.30), press duration (D = 0.19), press amplitude (D = 0.18), and release amplitude (D = 0.18), all with P < 0.001 (Table 2). This finding demonstrated that tremor during RAFT had a unique signature that could be measured using QDG technology, which included both temporal and amplitude features.
Classifier-identified tremor correlated with clinical ratings of tremor
The duration of tremor during RAFT (expressed as a percentage of the time of the trial, %T) was compared to tremor-related items in the MDS-UPDRS III from the corresponding upper extremity, in cases where the summed rest tremor and postural tremor items were greater than 1 (n = 59).
The strongest correlation was seen between %T and the weighted overall MDS-UPDRS III tremor score (postural + kinetic tremor + 0.5(rest tremor severity + constancy of rest tremor); rho = 0.50, p = 2.34e−4; see Supplementary Fig. 2 for individual scores). %T was also significantly correlated with the sum of rest tremor severity + constancy scores (rho = 0.45, p = 0.0012), with the constancy score itself (rho = 0.43, p = 0.0019), and with the sum of rest + postural tremor scores (rho = 0.35, p = 0.0068).
A significant change in QDG metrics after the exclusion of tremor strikes
In trials performed by TD individuals (n = 85), measurement of QDG metrics based on voluntary strikes only showed that there was a significant change in the metrics when tremor strikes were excluded: bradykinesia (ISI (z = 6.27, p = 5.17e−12), press amplitude (z = 3.92, p = 9.00e−05) and press amplitude CV (z = −3.12, p = 0.0018)), rigidity (release slope (z = −3.35, p = 8.15e−04)) and arrhythmicity (ISI CV (z = −6.04, p = 1.56e−09)). The corrected values for all of the QDG metrics demonstrated that the inclusion of tremor strikes would have masked or underestimated the severity of bradykinesia and rigidity (Table 3). Two trials that had less than five voluntary strikes remaining after the exclusion of tremor were not included in these analyses.
Change in QDG metrics correlated with severity of tremor in trial
The change in QDG Metrics after the exclusion of tremor was significantly correlated to %T for trials performed by TD PWP (n = 85): average dwell time (rho = 0.90, p = 3.46e−31), ISI (rho = 0.87, p = 3.6e−28), ISI CV (rho = −0.55, p = 6.03e−08) and release slope (rho = −0.46, p = 8.84e−06). There was no significant correlation between press amplitude, press amplitude CV, and %T (See Supplementary Table 1).
However, we found that the effect of tremor on QDG metrics (press amplitude, press Amplitude CV, and release slope in particular) was non-uniform and depended on the prevalence of tremor in the trial. Out of the 85 TD trials evaluated, 52 trials showed an increase in press amplitude after the exclusion of tremor strikes, which suggested that tremor strikes were lower amplitude compared to voluntary strikes in these trials. We found that the majority of these trials (36 out of 52 trials) had <5% %T in the trial. In contrast, 11 out of the 85 TD trials had a decrease in press amplitude after the exclusion of tremor strikes, which indicated higher amplitude tremor strikes; these trials had >20%T. Therefore, we also looked at the correlation between change in QDG metrics and % tremor specifically in trials with more than 5% tremor present. When the trials with <5% %T were excluded, there was a significant correlation between the %T and the change in press amplitude (rho = −0.69, p = 1.17e−4) and press amp CV (rho = 0.67, p = 2.80e−4) (n = 26), Fig. 4; the direction of change showed that including trials with >5% tremor underestimated the hypometria and variability of the amplitude of voluntary tapping. There was still a significant correlation with the change in release slope (rho = −0.63, p = 6.53e−4), while there was no significant correlation between the change in ISI, dwell time, and ISI CV (see Supplementary Table 1).
a–f Scatter plot of percent duration of tremor in trial (%T) and change in metrics measuring bradykinesia (ISI, press amplitude, and press amplitude CV), rigidity (release slope), arrhythmicity (ISI CV) and dwell time after the exclusion of tremor in TD Trials (n = 85). Blue markers are indicative of the trials with >5%T (n = 26) and the grey markers indicate trials with <5%T (n = 59). The Rho and p-values reported in panels a–f indicates the correlation between %T and change in metrics after the exclusion of tremor strikes in TD trials with >5% %T (n = 26). We found that the change in metrics (press amplitude, press amplitude CV, and release slope, in particular) was non-uniform and dependent on the prevalence of tremor in the trial. Although there was no significant correlation between the change in press amplitude, press amplitude CV and %T when all TD trials were included (n = 85), there was a significant correlation between the change in press amplitude (Rho = −0.69, p = 1.17e−4) and press amplitude CV (Rho = 0.63, p = 6.53e−4) when trials with <5% %T were excluded from the analysis (n = 26) (see Supplementary Table 1).
Discussion
Emergence of rest tremor of the upper extremity was reliably detected and characterized when the hand was in a sustained posture and the index and middle fingers were in contact with tensioned, engineered levers of a digitography device (QDG) in people with Parkinson’s disease (PWP). A model incorporating both temporal and amplitude features of lever displacement accurately differentiated involuntary tremor strikes from voluntary finger-tapping strikes during repetitive alternating finger-tapping (RAFT). The characteristics of the tremor strikes were significantly different from voluntary strikes in all eight QDG metrics of tapping. The inclusion of amplitude as well as temporal features to distinguish tremor from voluntary strikes resulted in a higher sensitivity (98%) compared to the best model using only temporal features (79%) and lowered the false-positive rate from 6.7% to 2.7%; the area under the precision-recall curve (AUC) was 0.981. The higher false positivity rate seen in the comparisons using only temporal features was due to the performance of a few healthy controls, who could voluntarily tap at high frequencies with short dwell times for the duration of the task. The percent time that tremor was detected in PWP during QDG-RAFT was significantly correlated with the clinical assessment of tremor using the movement disorders-unified Parkinson’s disease rating scale (MDS-UPDRS) III tremor sub-scores. The analysis of the QDG-RAFT trace from tremor dominant PWP after the tremor strikes had been excluded showed that there was a significant change in QDG metrics for bradykinesia, rigidity, and arrhythmicity when measured based on voluntary tapping only and that the inclusion of tremor strikes would have contaminated the measurement of the motor signs. There was a significant correlation between the percent time of tremor in the trial and the change in the metrics after the exclusion of tremor strikes. This suggests that a more accurate and precise measure of the disorders of voluntary motion in PD (bradykinesia, rigidity, freezing behavior, and freezing episodes) can be achieved when tremor strikes are isolated and excluded.
QDG technology was originally introduced as part of a suite of objective motor assessments that were developed to supplement the clinical assessment of PWP, who were evaluated before and after deep brain stimulation along the CAPSIT protocol8,14,15,16. QDG-RAFT metrics have been validated with the UPDRS III and MDS-UPDRS III scores and sub-scores of bradykinesia (upper and lower extremity), rigidity, gait impairment, and freezing of gait (FOG)10,11,12,13. This study has shown that tremor can be reliably measured using QDG technology and correlates with the MDS-UPDRS III tremor sub-scores.
The clinical characterization of tremor is usually divided into rest, postural, or action tremor. However, it is well known that emergent (rest) tremor may occur after the limbs have been held in a position of posture and/or when walking2,3,4,5,6,7. Action tremor is largely defined as the oscillatory motion of the limb during a voluntary task that is superimposed on a linear or smooth trajectory, such as performing the finger-to-nose-to-finger test or when drawing spirals. Emergent rest tremor may occur when a person with PD has their arm held in a posture while they perform a voluntary movement, such as the finger tapping task in the MDS-UPDRS III; the emergent tremor may interrupt or overtake the voluntary tapping. This phenomenon is demonstrated in the video in this study, where an emergent rest tremor of the wrist overtook the voluntary finger tapping on the digitography device (Fig. 2; Supplementary Movie 1). The fact that the fingers were resting on tensioned engineered levers that sensed the amplitude and timing of lever displacement allowed the algorithm to distinguish involuntary tremor strikes from voluntary strikes and to determine that there was a different digital signature of tremor strikes. Separating tremor strikes from voluntary strikes provided a measure of the amplitude, frequency of tremor, and constancy of tremor during the 30-s trial that correlated with the MDS-UPDRDS tremor sub-scores.
The emergence of tremor poses the risk of contaminating estimates of bradykinesia, such as during the finger tapping task of the MDS-UPDRS III. Bradykinesia can be underestimated when tremor takes over voluntary tapping at a frequency that is higher than that seen in voluntary movement; tremor frequency is 4–7 Hz, while voluntary tapping in PWP is usually between 1 and 3 Hz. If the PWP is asked to extend both finger and thumb when tapping, the execution of the movement is usually slower and more representative of their bradykinesia.
Bradykinesia can be overestimated when intermittent tremor can cause irregularities in voluntary finger tapping that may be graded as hesitations, which are designed to be reflective of the sequence effect of bradykinesia. This was confirmed by the results of this study: tremor strikes were found to artificially reduce ISI (i.e., tapping appears faster) when not accounted for. Tremor also was found to affect other aspects of bradykinesia, such as amplitude and rhythmicity. Tremor strikes in trials with intermittent emergent tremor (<%5%T) had a smaller amplitude in relation to the voluntary strikes, thus artificially decreasing the observed amplitude, if not excluded. Whereas, in trials where emergent tremor overtook voluntary tapping for substantial portions of the task, tremor was characterized by high amplitude strikes which artificially increased the observed amplitude, if not excluded. This finding suggested that the severity of bradykinesia would have been overestimated without the removal of tremor strikes in trials with minor intermittent emergent tremor. On the other hand, bradykinesia would have been underestimated in trials with prolonged emergent tremor if the tremor was included in the analysis. Currently, there is also no way to distinguish emergent tremor that overtakes voluntary tapping on smartphones, which will lead to underestimating bradykinesia from smartphone apps that only count the number of taps per minute17. This study demonstrated that the inclusion of tremor strikes in the analysis of the QDG-RAFT trial would have significantly under-represented the severity of bradykinesia and rigidity.
Complimentary technologies that enable remote continuous passive measure of resting tremor in PD are available using inertial measurement units (IMUs) and/or accelerometers embedded in commercial smartwatches, which are usually worn on the more affected wrist18,19,20,21,22,23,24. Either tremor acceleration or the mean daily time spent with tremor was correlated with certain aspects of the MDS-UPDRS III tremor sub-scores.
The accelerometer in smartphones has also been used to measure rest and postural tremor frequency by holding the phone in one hand, and the tremor measured by a smartphone was correlated to that measured by needle EMG25. Lipsmeier et al. reported on the first clinical trial in which the smartphone was used in a variety of active tests and passive monitoring over a six-month period26. Rest tremor was measured when the participant held the smartphone daily for thirty seconds at a time. The average of 2 weeks of daily testing demonstrated that the skewness of rest tremor moderately correlated with the MDS-UPDRS III rest tremor constancy score. Other features of tremor were not reported. In the clinic setting a tremor stability index measured from a wearable accelerometer differentiated essential tremor from Parkinson’s disease tremor but no comparison to the clinical rating scale of tremor was provided27.
Passive monitoring relies on distinguishing tremor from voluntary movement by spectral (frequency) analysis and usually long periods of recordings (6 days to 2 years) are used to generate a mean daily tremor occurrence22,23,26,28. While useful for measuring tremor over time, passive monitoring studies have also suffered from limitations in compliance of use, especially as the period of monitoring became longer29,30,31,32. This may be solved by the use of the Apple Watch, which also provides many other functions thus perhaps making it more desirable to wear. In addition to traditional spectral analysis, machine learning and deep learning techniques, which are capable of capturing more complex patterns in time and frequency domain features extracted from inertial signals, have been implemented to improve the accuracy of tremor classification in real-world settings33. While these approaches have shown promise, discriminating certain voluntary movements from tremor was found to be challenging and makes them susceptible to false positives22.
The current study demonstrated that QDG technology distinguished tremor from voluntary movement using only a 30 s task, and the percent time with tremor significantly correlated with the MDS-UPDRS tremor sub-scores. The short period of time to perform the RAFT task may improve compliance compared to passive monitoring devices, which is supported by a previous study showing a compliance rate of 91% when participants with early-stage untreated PD were asked to perform RAFT among other tasks once a week for six months34. QDG serves as a platform to provide clinicians and patients with high-resolution quantitative evaluation of all cardinal motor signs in PD remotely that is delivered in real time and will be accessible in the patient’s electronic health record35. This may be useful in several clinical scenarios: (1) when therapy is initiated, it is very difficult to know whether and how the patient is responding to medication or deep brain stimulation (DBS): remote QDG testing will allow the clinician to monitor the effect of initiating therapy and make adjustments based on real-time data which can lead to faster optimization of therapy without adversely affecting the clinician’s workflow. (2) Up to 50% of people with PD do not get access to neurological care due to the lack of proximity to a neurology clinic. The implementation of QDG can facilitate remote monitoring by neurologists, thereby enhancing access to specialized care in underserved regions. (3) Deep brain stimulation candidacy requires an on-and-off medication evaluation—a barrier to accessing DBS has been the difficulty for people to come to neurology centers off medication; QDG remote testing will enable neurologists to prescribe on and off testing from home as adjunctive testing to an in-person evaluation. In addition, the reliable distinction between voluntary movement and tremor, shown by the results of this study, can provide at a glance when the person may have medication refractory tremor but improvement of voluntary movement from medication. Such patients are excellent candidates for DBS as it will improve tremor (and voluntary movement); however, this is not separated in the MDS-UPDRS III scale and may appear as if there is little overall improvement from medication thus denying the patient a valuable therapy. Additionally, QDG’s ability to measure rigidity, alongside each of the other cardinal signs of PD, could be utilized for evaluation of clinical trials, which often rely on blinded video assessments of MDS-UPDRS III, but must disregard items related to rigidity. Therefore, QDG can profoundly aid clinicians in making complex care management decisions through high-resolution, quantitative insights that would otherwise have to be made based on subjective descriptions of symptoms by patients or through clinical rating scales at in-frequent timepoints.
When discussing the results of this study, it is important to acknowledge the limitations associated with model development and data analysis. There is a risk of overfitting for the XGBoost model due to the somewhat small dataset in the current study. We used a 75%/25% train/test split to minimize the potential of overfitting. A greater sample size of labeled tremor strikes from more participants would reduce potential overfitting by adding variability to the dataset and improving the generalizability of the classifier. More raters to identify the emergence of tremor during the task would also improve the validity of ground truth labels used to develop the classifiers. The reported sensitivity and false-positive rate of the temporal thresholds may be overestimated due to data leakage. Rather than using a train/test split, these thresholds were determined by comparing the sensitivity and false-positive rates across various ISI and dwell time thresholds using the same dataset. This only further supports the advantage of the XGBoost model approach for identifying tremor. The observed correlations between MDS-UPDRS tremor sub-scores and the model-identified tremor characteristics may have been slightly deflated due to having multiple certified UPDRS raters in this cohort, thus introducing inter-rater variability in the MDS-UPDRS scores, a known limitation even among certified raters36. Despite this potential variability, significant correlations were still observed. Lastly, although it may be argued that passive monitoring over days or weeks may be more likely to capture tremor compared to a 30-s task, we demonstrated that tremor was reliably captured and measured. Indeed, the performance of RAFT was able to elicit tremor even in participants with an MDS-UPDRS III tremor score of 1. This may be due to the known influence of cognitive load on tremor amplitude37,38.
In conclusion, the paper demonstrated that emergent tremor has a distinct digital profile that can be distinguished from voluntary movements when fingers are in contact with tensioned levers. Static temporal thresholds were less effective in differentiating high-frequency voluntary tapping by healthy controls from tremor in PWP, leading to a high false-positive rate. On the other hand, an XGBoost classifier, which uses both temporal and amplitude metrics of tapping as input features reliably identified tremor and discerned voluntary tapping. The percent duration of tremor identified by the classifier in 30-s QDG-RAFT trials correlated with the MDS-UPDRS III items of tremor. When tremor strikes were excluded from the analysis of the disorder of voluntary tapping, there were significant changes in the QDG metrics of bradykinesia, rigidity, and arrhythmicity, demonstrating that the inclusion of tremor when assessing voluntary movement will underestimate the severity of Parkinsonism. These findings highlight that tremor can be reliably measured with QDG and completes the evidence that QDG technology produces validated metrics of all the motor signs in PD in real time from a simple 30-s RAFT task on a portable digitography device that can be used remotely.
Methods
Human subjects
Ninety-six individuals (52 males) with clinically established PD and 42 healthy controls (20 males) performed thirty seconds of self-paced, repetitive alternating finger tapping (RAFT) on a digitography device in the off-therapy state. Long-acting medications were withdrawn over 24–48 h and short-acting medications were withdrawn for at least 12 h prior to testing. For patients with DBS, stimulation was turned off for at least 15 min before testing10. Data was collected in the Stanford Human Motor Control and Neuromodulation Laboratory. All participants provided written informed consent to participate in the study, which was approved by the Stanford University Institutional Review Board.
Experimental protocol
The RAFT involved pressing and releasing tensioned, engineered levers on a digitography device with the index and middle finger in an alternating pattern for thirty seconds, as described previously9,10,12,13,34. Participants were instructed to press and release the keys completely while going as fast and as consistently as possible for the duration of the trial. The task was performed without auditory or visual feedback: participants closed their eyes and white noise was played through headphones. They were instructed to start and stop tapping only when they heard an auditory cue. The task was performed on each hand. No external cueing was given during the task. Participants also performed the Unified Parkinson’s Disease Rating Scale (UPDRS III). Thirteen individuals performed the original UPDRS III and 83 individuals performed the Movement Disorder Society (MDS)-UPDRS III.
Kinematic data acquisition and analysis
The digitography device used an optical encoder that produced a voltage signal as the lever was pressed and released, which was linearly related to the displacement of the lever and had a resolution of 62.5 µm per 40 mV10.
Visual identification of tremor
An experienced movement disorders specialist (HBS) was shown video recordings of 36 RAFT trials from thirty-one tremor-dominant (TD) participants and five akinetic rigid (AR) participants. The rater was blinded to the phenotype of the participant and trials were randomly presented. The rater identified any occurrence of tremor in the trial and provided timestamps for periods of sustained tremor. A labeled dataset was created based on the movement disorders specialist’s ratings, with the positive class (tremor = 1) comprising strikes from five trials in which the rater identified tremor for the entire duration of the trial, and the negative class (tremor = 0) consisting of strikes from 75 trials performed by control participants (Dataset 1).
An algorithm to identify tremor strikes using temporal characteristics and based on video identification
Visual observation between trials with tremor and trials with no tremor suggested that tremor strikes could be distinguished from voluntary tapping by temporal characteristics of a shorter ISI (higher frequency) and shorter dwell time, Fig. 2, which has been previously observed8. Four combinations of physiologically informed ISI and dwell time thresholds were systematically applied to classify tremor in strikes from Dataset 1. The efficacy of each ISI and dwell time threshold combination was evaluated based on the sensitivity (True Positive/True Positive + False Negative) and false-positive rate (False Positive/False Positive + True Negative). The most suitable ISI and dwell time thresholds were identified by comparing the resulting sensitivity and false-positive rates of each combination of thresholds.
An XGBoost classifier to identify tremor
An XGBoost classifier was developed to identify emergent tremor in the RAFT trial on a strike-by-strike basis. For each strike, eight features of tapping were extracted as potential input features for the model, which included temporal metrics of tapping: ISI, dwell time, release duration, press duration, peak duration, release slope, and amplitude metrics: press and release amplitude. For the development of the model, tremor strikes from three additional trials, for which the rater provided a timestamp for tremor onset and the emergent tremor persisted for the remainder of the trial, were added to the positive class of the labeled dataset to reduce the class imbalance (Dataset 2). The dataset was split into a training (75% of the dataset) and test set (25% of the dataset) with stratification to account for the class imbalance and prevent biased representation of classes in either the training or test set, which could lead to misleading evaluation of model performance. Features were normalized such that the data had zero mean and unit variance to mitigate the effects of features having different scales on the model and facilitate better convergence. Variance inflation factor (VIF) analysis was performed to evaluate multicollinearity between input features and showed that there was high multicollinearity between features when all eight metrics of tapping were included as input features. To eliminate multicollinearity, we manually removed features, retrained the model, and evaluated model performance. We found that removing Peak Duration as an input feature successfully eliminated multicollinearity between features while having the lowest impact on model performance. Bayesian optimization was implemented to perform hyperparameter tuning, and stratified K-fold cross-validation was used to evaluate model performance on the training set and further adjust hyperparameters. Early stopping was applied while training to prevent overfitting and improve the generalization of the model. A prediction probability threshold of 0.35 was used, which implies that a strike was classified as a tremor if the probability of a particular strike being a tremor was determined to be greater than 0.35 by the classifier since it was found to increase sensitivity while not increasing the false-positive rate substantially. Model performance was evaluated on the test set using sensitivity, precision (True Positive/True Positive + False Positive), F1 score (harmonic mean of precision and recall) and the false-positive rate as measures of performance, instead of the general accuracy score to mitigate the imbalanced class distribution from biasing the interpretation of model performance. Feature importance was evaluated using two methods: permutation importance and gain importance. Permutation feature importance analysis, using the F-1 score as the scoring metric, computes the decrease in the F1 score when the values of each feature are randomly shuffled. Gain importance measures the gain (reduction in the model’s loss function) achieved by splitting on a particular feature during the construction of decision trees, providing quantitative insight into the contribution of each feature to the model’s overall performance.
Statistical analysis
Statistical analyses were performed on MATLAB (version R2022a, MathWorks Inc., Natick, MA, USA). A two-sample t-test was used to evaluate the difference in ages between the control participants and PWP. A Kolmogorov–Smirnov test was used to assess the difference between the distribution of the eight metrics of tapping for tremor and voluntary strikes. A significant threshold of p < 0.006 (0.05/8) was established to account for the evaluation of eight metrics based on a Bonferroni correction. Spearman correlations were used to assess correlations between four calculated tremor sub-scores from the UPDRS and the percent duration of tremor in each trial identified by the XGBoost classifier. An overall weighted tremor score was derived by combining the postural (item 3.15) and kinetic sub-scores (item 3.16) of the MDS-UPDRS III assessment, along with an equally weighted sum of rest tremor severity (upper extremity sub-scores of item 3.17) and constancy items (item 3.18) (i.e., postural tremor + kinetic tremor + 0.5(rest tremor severity + constancy of rest tremor)). A weight of 0.5 was used to give equal weighting to rest tremor severity and constancy scores to prevent rest tremor items from biasing the scores. The significance threshold was set at p < 0.0125 (0.05/4), applying a Bonferroni correction to account for the evaluation of four tremor sub-scores. A Wilcoxon signed-rank test was used to evaluate whether there was a significant change in QDG metrics for bradykinesia (ISI, press amplitude, and press amplitude CV), rigidity (release slope), arrhythmicity (ISI CV) and dwell time after the exclusion of tremor in 85 trials performed by TD individuals. Spearman correlations were used to evaluate the correlation between the percent duration of tremor in a trial and the change in metrics after tremor strikes were excluded. The significance threshold for this analysis was set at p < 0.008 (0.05/6) due to a Bonferroni correction to account for the evaluation of the six metrics.
Data availability
The datasets used and/or analyzed during the current study available from the corresponding author [HBS] on reasonable request for research studies with a defined scientific question and plan pertaining to the use of the data.
Code availability
The underlying code for this study [and training/validation datasets] is not publicly available but may be made available to qualified researchers on reasonable request from the corresponding author [HBS] for research studies with a defined scientific question and plan pertaining to the use of the code.
References
Baumann, C. Epidemiology, diagnosis and differential diagnosis in Parkinson’s disease tremor. Parkinsonism Relat. Disord. 18, S90–S92 (2012).
Jankovic, J., Schwartz, K. S. & Ondo, W. Re-emergent tremor of Parkinson’s disease. J. Neurol. Neurosurg. Psychiatry 67, 646–650 (1999).
Shahed, J. & Jankovic, J. Motor symptoms in Parkinson’s disease. Parkinson’s Dis. Relat. Disord. https://doi.org/10.1016/s0072-9752(07)83013-2 (2007).
Papengut, F., Raethjen, J., Binder, A. & Deuschl, G. Rest tremor suppression may separate essential from Parkinsonian rest tremor. Parkinsonism Relat. Disord. 19, 693–697 (2013).
Belvisi, D. et al. Re-emergent tremor in Parkinson’s disease. Parkinsonism Relat. Disord. 36, 41–46 (2017).
Wilken, M., Rossi, M., Rivero, A. D., Hallett, M. & Merello, M. Re-emergent tremor provocation. Parkinsonism Relat. Disord. 66, 241–244 (2019).
Leodori, G. et al. Re‐emergent tremor in Parkinson’s disease: the role of the Motor Cortex. Mov. Disord. 35, 1002–1011 (2020).
Bronte-Stewart, H. M., Ding, L., Alexander, C., Zhou, Y. & Moore, G. P. Quantitative digitography (QDG): a sensitive measure of digital motor control in idiopathic Parkinson’s disease. Mov. Disord. 15, 36–47 (2000).
Taylor Tavares, A. L. et al. Quantitative measurements of alternating finger tapping in Parkinson’s disease correlate with updrs motor disability and reveal the improvement in fine motor control from medication and deep brain stimulation. Mov. Disord. 20, 1286–1298 (2005).
Trager, M. H. et al. Arrhythmokinesis is evident during unimanual not bimanual finger tapping in Parkinson’s disease. J. Clin. Mov. Disord. 2, 1–7 (2015).
Trager, M. H., Wilkins, K. B., Koop, M. M. & Bronte-Stewart, H. A validated measure of rigidity in Parkinson’s disease using alternating finger tapping on an engineered keyboard. Parkinsonism Relat. Disord. 81, 161–164 (2020).
Wilkins, K. B. et al. Quantitative digitography measures motor symptoms and disease progression in Parkinson’s disease. J. Parkinson’s Dis. 12, 1979–1990 (2022).
Prabhakar, V. et al. Quantitative digitography measures fine motor disturbances in chronically treated HIV similar to Parkinson’s disease. Front. Aging Neurosci. 12, 1–9 (2020).
Moore, G. P., Ding, L. & Bronte‐Stewart, H. M. Concurrent Parkinson tremors. J. Physiol. 529, 273–281 (2000).
Bronte-Stewart, H. M. Postural instability in idiopathic Parkinson’s disease: the role of medication and unilateral pallidotomy. Brain 125, 2100–2114 (2002).
Koop, M. M., Andrzejewski, A., Hill, B. C., Heit, G. & Bronte-Stewart, H. M. Improvement in a quantitative measure of Bradykinesia after microelectrode recording in patients with Parkinson’s disease during deep brain stimulation surgery. Mov. Disord. 21, 673–678 (2006).
Omberg, L. et al. Remote smartphone monitoring of Parkinson’s disease and individual response to therapy. Nat. Biotechnol. 40, 480–487 (2021).
Van Someren, E. J. W. et al. New actigraph for long-term tremor recording. Mov. Disord. 21, 1136–1143 (2006).
Braybrook, M. et al. An ambulatory tremor score for Parkinson’s disease. J. Parkinson’s. Dis. 6, 723–731 (2016).
López-Blanco, R. et al. Smartwatch for the analysis of rest tremor in patients with Parkinson’s disease. J. Neurol. Sci. 401, 37–42 (2019).
Mahadevan, N. et al. Development of digital measures for nighttime scratch and sleep using wrist-worn wearable devices. npj Dig. Med. 4, 1–10 (2021).
Powers, R. et al. Smartwatch inertial sensors continuously monitor real-world motor fluctuations in Parkinson’s disease. Sci. Transl. Med. 13, 1–11 (2021).
Burq, M. et al. Virtual exam for Parkinson’s disease enables frequent and reliable remote measurements of motor function. npj Digit. Med. 5, 65 (2022).
Adams, J. L. et al. Using a smartwatch and smartphone to assess early Parkinson’s disease in the watch-PD study. npj Parkinson’s Dis. 9, 64 (2023).
Araújo, R. et al. Tremor frequency assessment by iPhone® Applications: correlation with EMG Analysis. J. Parkinson’s Dis. 6, 717–721 (2016).
Lipsmeier, F. et al. Evaluation of smartphone‐based testing to generate exploratory outcome measures in a phase 1 Parkinson’s Disease Clinical Trial. Mov. Disord. 33, 1287–1297 (2018).
di Biase, L. et al. Tremor stability index: a new tool for differential diagnosis in tremor syndromes. Brain 140, 1977–1986 (2017).
Bloem, B. R. et al. The personalized Parkinson project: examining disease progression through broad biomarkers in early Parkinson’s disease. BMC Neurol. 19, 1–10 (2019).
Bot, B. M. et al. The mPower study, Parkinson disease mobile data collected using Researchkit. Sci. Data 3, 160011 (2016).
Silva de Lima, A. L. et al. Feasibility of large-scale deployment of multiple wearable sensors in Parkinson’s disease. PLoS ONE 12, e0189161 (2017).
Elm, J. J. et al. Feasibility and utility of a clinician dashboard from wearable and mobile application Parkinson’s disease data. npj Digit. Med. 2, 95 (2019).
Cohen, S. et al. Characterizing patient compliance over six months in remote digital trials of Parkinson’s and Huntington Disease. BMC Med. Inform. Decis. Mak. 18, 138 (2018).
Sigcha, L. et al. Automatic resting tremor assessment in Parkinson’s disease using smartwatches and multitask convolutional neural networks. Sensors 21, 291 (2021).
Goetz, C. G. et al. Testing objective measures of motor impairment in early Parkinson’s disease: feasibility study of an AT-home testing device. Mov. Disord. 24, 551–556 (2009).
Evers, L. J. W., Krijthe, J. H., Meinders, M. J., Bloem, B. R. & Heskes, T. M. Measuring Parkinson’s disease over time: the real‐world within‐subject reliability of the MDS‐UPDRS. Mov. Disord. 34, 1480–1487 (2019).
Hoffman, S. et al. Comprehensive real time remote monitoring for Parkinson’s disease using Quantitative Digitography. npj Parkinson’s Dis. 10, 1–13 (2024).
Raethjen, J. et al. Provocation of Parkinsonian tremor. Mov. Disord. 23, 1019–1023 (2008).
Dirkx, M. F. et al. Cognitive load amplifies Parkinson’s tremor through excitatory network influences onto the thalamus. Brain 143, 1498–1511 (2020).
Acknowledgements
We would like to thank the members of the Human Motor Control and Neuromodulation lab, and, most importantly, the participants who dedicated their time to this study. This work was supported in part by the following: NINDS UH3NS107709 and UG3NS128150, NINDS R21 NS096398-02, Michael J. Fox Foundation (9605), Robert and Ruth Halperin Foundation, John E Cahill Family Foundation, Parkinson’s Foundation Summer Student Fellowship (PDF-SFW-1330), Stanford Neuroscience: Translate, and Apple Inc.
Author information
Authors and Affiliations
Contributions
A.G.: Conceptualization, Methodology, Software, Validation, Formal Analysis, Writing-Original Draft, Writing-Review&Editing. K.W.: Conceptualization, Methodology, Formal Analysis, Writing-Original Draft, Writing-Review&Editing. M.P.: Conceptualization, Methodology, Software, Validation, Formal Analysis, Writing-Review&Editing. Y.K.: Conceptualization, Methodology, Software, Validation, Formal Analysis, Writing-Review&Editing. A.V.: Methodology, Software, Validation, Writing-Review&Editing. M.T.: Conceptualization, Formal Analysis, Data curation, Writing-Original Draft, Writing-Review&Editing. H.B.S.: Conceptualization, Methodology, Writing-Original Draft, Writing-Review&Editing.
Corresponding author
Ethics declarations
Competing interests
Dr. Bronte-Stewart and Dr. Trager have a patent (PCT/US2021/043787) for objective measurement of PD symptoms. The remaining authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Gala, A.S., Wilkins, K.B., Petrucci, M.N. et al. The digital signature of emergent tremor in Parkinson’s disease. npj Parkinsons Dis. 10, 147 (2024). https://doi.org/10.1038/s41531-024-00754-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41531-024-00754-7






