Speech-based characterization of dopamine replacement therapy in people with Parkinson’s disease

Norel, R.; Agurto, C.; Heisig, S.; Rice, J. J.; Zhang, H.; Ostrand, R.; Wacnik, P. W.; Ho, B. K.; Ramos, V. L.; Cecchi, G. A.

doi:10.1038/s41531-020-0113-5

Download PDF

Article
Open access
Published: 12 June 2020

Speech-based characterization of dopamine replacement therapy in people with Parkinson’s disease

R. Norel ORCID: orcid.org/0000-0001-7737-4172¹,
C. Agurto¹,
S. Heisig¹,
J. J. Rice¹^na1,
H. Zhang²,
R. Ostrand¹,
P. W. Wacnik²,
B. K. Ho³,
V. L. Ramos² &
…
G. A. Cecchi¹

npj Parkinson's Disease volume 6, Article number: 12 (2020) Cite this article

4178 Accesses
26 Citations
14 Altmetric
Metrics details

Subjects

Abstract

People with Parkinson’s (PWP) disease are under constant tension with respect to their dopamine replacement therapy (DRT) regimen. Waiting too long between doses results in more prominent symptoms, loss of motor function, and greater risk of falling per step. Shortened pill cycles can lead to accelerated habituation and faster development of disabling dyskinesias. The Unified Parkinson’s Disease Rating Scale (MDS-UPDRS) is the gold standard for monitoring Parkinson’s disease progression but requires a neurologist to administer and therefore is not an ideal instrument to continuously evaluate short-term disease fluctuations. We investigated the feasibility of using speech to detect changes in medication states, based on expectations of subtle changes in voice and content related to dopaminergic levels. We calculated acoustic and prosodic features for three speech tasks (picture description, reverse counting, and diadochokinetic rate) for 25 PWP, each evaluated “ON” and “OFF” DRT. Additionally, we generated semantic features for the picture description task. Classification of ON/OFF medication states using features generated from picture description, reverse counting and diadochokinetic rate tasks resulted in cross-validated accuracy rates of 0.89, 0.84, and 0.60, respectively. The most discriminating task was picture description which provided evidence that participants are more likely to use action words in ON than in OFF state. We also found that speech tempo was modified by DRT. Our results suggest that automatic speech assessment can capture changes associated with the DRT cycle. Given the ease of acquiring speech data, this method shows promise to remotely monitor DRT effects.

A systematic review and narrative analysis of digital speech biomarkers in Motor Neuron Disease

Article Open access 07 December 2023

Speech acoustic indices for differential diagnosis between Parkinson’s disease, multiple system atrophy and progressive supranuclear palsy

Article Open access 27 October 2022

Speech's syllabic rhythm and articulatory features produced under different auditory feedback conditions identify Parkinsonism

Article Open access 09 July 2024

Introduction

Parkinson’s disease (PD) is the second most common neurodegenerative disease, with an estimated prevalence of 0.3% in industrialized countries, 1.0% in people over 60, and 3.0% in people over 80¹. Roughly 10 million people worldwide live with PD, and ~60,000 Americans are diagnosed with PD each year². Balance and gait disturbances in people with Parkinson’s (PWP) lead to falls, mobility loss, serious injuries, and reduced independence^3,4. Close to 90% of PWP develop speech disorders, leading to significant decline in quality of life due to substantial deterioration in functional communication^5,6.

At present, the most common treatments for PD contain L-DOPA and there is evidence that dopamine replacement therapy (DRT) improves functional balance^4,7. Unfortunately, prolonged use of DRT often results in habituation, leading to reduced symptom control⁸, fluctuations of symptom relief known as “ON” (symptom relief) and “OFF” (reduced symptom relief) states^9,10, and dyskinesias. To characterize the progression of PD, the most widely-used clinical rating scale is the Unified Parkinson’s Disease Rating Scale (MDS-UPDRS)¹¹ of which part III characterizes motor activities. The MDS-UPDRS was designed for occasional in-clinic evaluation, rather than frequent monitoring. Administering the rating scale requires training and certification from the Movement Disorder Society. The MDS-UPDRS part III includes a section for scoring speech in five levels; 0: normal (no problems); 1: slight (speech is soft, slurred or uneven); 2: mild (occasionally parts of the speech are unintelligible); 3: moderate (frequently parts of the speech are unintelligible); and 4: severe (speech cannot be understood). It has been noted that the inter-rater reliability of speech scores on the UPDRS (rather than the MDS-UPDRS) is inconsistent^12,13, and recently Nöth et al.¹⁴ showed that the MDS-UPDRS does not accurately capture deterioration in communication. The current gold standard for at-home continuous (every half hour) recording of a patient’s “ON” and “OFF” state is the self-reported Hauser¹⁵ diary. This technique places the burden of monitoring on the patient and the results may be confounded by other factors, such as the patient’s mood and lack of sleep, and definitionally relies on the patient’s subjective self-assessment. Additionally, differences in self-assessment and objective assessment of speech deficiencies in PWP have been reported, probably due to adaptation to changes¹⁶. For these reasons, alternative methods reducing the burden on patients and at the same time providing consistent assessments of medication state at home are of great importance for the PD community. These methods would not only infer current medication state, but also address a currently unmet need for the information necessary to determine optimal pill cycle timing on an individual level.

Speech has been shown to be different between PWP and controls^17,18,19 and to be affected by dopamine levels in PWP²⁰. Features of speech applicable to monitoring a patient’s long-term disease progression include: reduced loudness, decreased variability in pitch and intensity, reduced stress, breathiness and hoarseness, and imprecise articulation¹⁷. For three recent reviews of speech features applicable in monitoring PD progression see refs. ^18,19,21. Previous studies have also shown that speech features can differentiate healthy controls from PWP, most notably by using acoustic measurements of sustained phonations^22,23,24,25. Recently, an automatic evaluation of dysarthria in PWP was performed²⁶ by analyzing six types of diadochokinetic (DDK) exercises. However, studies on the effectiveness of using changes in speech production to differentiate PWP in the “ON” versus the “OFF” states have generated mixed or contradictory results. Okada and colleagues²⁷ studied vowel articulation in PWP and reported that vowel space was significantly expanded after DRT, contrary to previous findings²⁸ which found no change in vowel space before and after treatment. Fabbri and colleagues analyzed a cohort of late-stage PD patients²⁹ and similarly did not find significant changes in speech as assessed by clinical evaluation and automated analysis of voice stability/variability following an otherwise positive L-DOPA response. In a recent meta-analysis Pinho and colleagues³⁰ report that DRT modifies F0 (fundamental frequency) and jitter, but does not have an impact on vocal intensity. Smith et al. ¹⁸ showed differences in PWP and aged matched controls in word-finding-difficulties by analysis of the semi-structured Cookie Theft description test. There is also evidence that cognitive impairment, which can be a prominent feature of advanced/late stage of PD, either affects or is reflected in language production¹⁹. Embodied cognition postulates that the motor system influences cognition. In particular, it has been suggested that action words and motor representation of those actions activate the same network in the brain^31,32. One difference between PWP and healthy controls, and PWP in “ON” vs. “OFF” state is the use of action verbs^{33,34,35,36,37}. These are verbs that describe actions such as “run” or “swim,” as compared with verbs that describe mental states or emotions such as “think” or “hope”. PWP typically produce fewer action verbs than healthy controls^33,38,39.

Based on these findings, we evaluated the speech of PD participants during two medication states (ON vs. OFF) on three different speech tasks: picture description, diadochokinetic rate test, and reverse counting. These tasks were characterized with acoustic (cepstral analysis), prosodic (speech tempo), and linguistic (semantic embedding) features. In this study, we aim to test the following hypotheses: (i) DRT causes changes in speech that can be detected using are reflected in acoustic, prosodic and semantic features, which can be detected using automatic methods, (ii) tasks involving semi-structured free speech can provide more information than structured tasks to assess medication states, and (iii) the use of speech features associated with action verbs, which are relevant for the discrimination of PWP and controls, would be equally applicable for differentiating ON and OFF medication states.

Results

Statistical analysis

Table 1 shows the top 5 ON/OFF statistically significant discriminating features after applying a Bonferroni correction for multiple comparisons (α = 0.05) for each of the three speech tasks. Note that reverse counting and diadochokinetic rate are dominated by features of low-frequency energy (MFCC1) and high-frequency energy (MFCC11), respectively. High-frequency energy captures changes in perceived hoarseness^40,41 in PWP. Features from the picture description task captured significant changes both in low and high frequency in addition to speech rate and semantic content. This conformed to the expected increased richness of its feature set. The top 5 variables were: the robust minimum (10th percentile) for the concepts of “act” and “play”, which confirmed the differential use of action verbs as a result of medication, in concordance with the reported differential use of action verbs between PWP and^{33,34,35,36,37,38,39,42}; the mode (most frequent value) of a low-frequency MFCC spectral energy, the skewness (asymmetry) of a very high-frequency MFCC spectral energy, and the robust maximum (90th percentile) of the distribution of inter-syllable time intervals, which indicates longer tails of the distribution in ON state, suggesting more control on speech production. We evaluated patterns of co-variation among the five top-ranked features for the picture description task using partial correlation (see Fig. 1). The “OFF” state was characterized by strong positive partial correlations between SF (play) and acoustic (MFCC #2) and SF (act and play) and NS (pct90) features.

Table 1 Top ranked features.

Full size table

**Fig. 1: Comparison of Partial correlations for top features for both states.**

Classification

Binary classification was performed by subtracting feature values for one medication state from the other. Table 2 presents the highest accuracy achieved for each combination of acoustic, prosodic, and content features. Figure 2 shows the best performance for each of the three speech tasks, picture description, reverse counting, and diadochokinetic rate. For picture description, the combination of acoustic (MFCC), prosodic (NS) and semantic feature types resulted in a top classification accuracy of 0.89. For reverse counting, acoustic features alone provided for a classification accuracy of 0.84. Finally, for diadochokinetic rate, the best result was obtained using also acoustic features alone, resulting in an accuracy rate of 0.60.

Table 2 Classification performance.

Full size table

Discussion

We combine acoustic, prosodic, and semantic features of speech to predict medication state in PWP. High accuracy rates (see Fig. 2) were achieved with all speech tasks, in particular for picture description (0.89) and reverse counting (0.84). Both of these tasks have a cognitive component that can be captured by our features, which suggests that cognition may be also contributing to the differentiation of ON/OFF states on top of the speech degradation. Specifically, in our analysis we found that features obtained for the picture description task (free speech) could successfully differentiate L-DOPA “ON/OFF” states. Although it has been reported that dopamine replacement does not significantly improve speech in PWP^43,44,45,46, there is evidence that dopamine affects motor skills which affect speech production⁴⁵. Given that neurologists usually give the same score in ON/OFF states to speech part of MDS-UPDRS, we suggest that the effect of dopamine on speech in PWD is located in features that are undetectable for human perception (e.g., high frequency content) which can be captured with our methods. In addition, the use of better recording equipment in comparison with past decades also allowed us to detect enough subtle differences to drive the classification. The low granularity of MDS-UPDRS sub-scores contributes as well to lack of differentiation on ON/OFF state scores by humans.

In particular, our results suggest that the main difference between medication states is characterized by changes in the speech energy. This speech energy variation is characteristic of hypokinetic dysarthria found in PD⁴⁷. MFCC #11 and MFCC #12 capture high frequency information [MFCC #11: 9.5 kHz–12.6 kHz and MFCC #12: 12.6 kHz–16.7 kHz], likely perceived by listeners as the difference in hoarseness between “ON” and “OFF” states⁴⁸. High frequency components of speech affect intelligibility^49,50 and differentiate between patients with dysphonia and controls^48,49. Two rat models of PD⁵¹ showed rats with a damped dopaminergic system had a lower maximum frequency than controls for both simple and frequency-modulated calls. The other set of important features are the tails (10^th percentile or robust minimum) of the distributions of words uttered by the participants related to the seed words act and play, in particular, we observe that participants show a higher robust minimum of the similarity to these concepts when they are in ON than when they are in OFF. This is consistent with the hypothesis that L-DOPA brings participants closer to a normative state, given that as already mentioned there is strong evidence of a bias against action-oriented verbs in PWP’s speech production^{33,34,35,36,37,38}. To further assess the robustness of MFCCs in our analysis, we performed an experiment to evaluate the change in classification performance when the duration of the recording was reduced. The results showed that there were not significant changes when the duration of the recording was reduced down to 10 s. Our relatively high classification accuracy even with this short duration suggests brief ambient or prompted speech samples captured outside the clinic could be used to monitor PD patients. More information on the experiment can be found in the Supplementary material.

The results of our semantic analysis indicate that action verbs like play and act have a higher association (higher similarity distance) with descriptions from participants in ON-state vs. OFF-state (t-statistic of 5.76 and 4.63 reported in Table 1). This indicates that PWP in OFF state have more difficulty producing action verbs. This recapitulates findings in the previous work^33,38,39 where the speech of PWP is compared with heathy controls. These studies show that PWP typically produce fewer action verbs such as “run” or “swim,” compared with verbs that describe mental states or emotions such as “think” or “hope”. Given that motor function influences cognition^31,32 and motor responses are less affected in ON-state, we think language production is less affected than in OFF-state. We also think that while the production of action verbs is affected in PWP with respect to healthy participants there is also a difference in impairment within each PWP produced by medication state that can be captured by our analysis. We speculate that this difference may also be present between subjects. However, it would be necessary to include a larger cohort with healthy participants to perform a better assessment of language production.

In the picture description task, three types of features—acoustic, semantic, and prosodic—were informative and complementary to each other.

We showed (see Fig. 1) that a very high positive correlation between different features (red lines) occurred only in the “OFF” state between SF (play) and MFCC #2 and NS. This differential relationship in the two states, after combining the three categories of features, helped achieve better discrimination between the medication states. Specifically, we found an improvement of 35% with respect to using only MFCC features (see Table 2). Classification accuracy may have been enhanced compared to the non-free speech tasks since subjects can express emotions while describing the picture, captured with MFCCs^52,53,54,55 and possibly by the differences in the NS distribution^40,41,56,57.

The effect of DRT on speech identified by our multivariate approach is significant but evidently subtle. As per the neurologists’ assessment, 64% of the subjects presented an MDS-UPDRS speech score difference of 0, with one participant actually improving the score from the ON to the OFF state. Improvement in speech induced by DRT may only be overtly manifested longitudinally. Even so, this should result from the cumulative effect of weak positive changes. Comparative neuroanatomy studies of songbirds suggest a possible mechanism for these effects, based on the significant homology between cortico-basal-thalamo-cortical loops in humans and pallial (i.e. cortex-like) loops in songbirds responsible for speech and song production, respectively⁴⁵. Dopaminergic activity is involved in song production both in its conspecific-directed and undirected forms, with the former interpretable as communication. Auditory feedback is required for learning song in juveniles^58,59; moreover, dopamine neurons encode the error between expected performance and auditory feedback during singing, suggesting that dopamine signaling underlies song stability even in adults. To the extent that the homology is valid, a plausible hypothesis is that dopamine is also involved in maintaining speech stability through feedback monitoring⁶⁰, so that replacement therapy may induce subtle effects. Therefore, the neurologists’ limitations to detect medication-state changes in speech may be due to the coarse-grained nature of the score categories, or the inability of human raters to distinguish differences in speech with the current assessment protocol.

Finally, we would like to mention that the present method meets the need for a quantitative metric to monitor patients’ speech, and potentially correlates with disease progression and the effect of dopamine replacement therapy. In addition, objective mathematical and computational analysis of speech can increase the granularity in the assessment and avoid human biases that result in inconsistencies between raters^12,13. It has previously been documented that perceptual analysis of speech on PD is outperformed by acoustic analysis^61,62. An objective and easy method to monitor disease progression can have important effects on research in at least two ways. First, continuous monitoring can help gain insights into the progression of the disease and identify factors contributing to the stabilization of prognosis such as medicines, other types of treatments, and interventions. Second, when testing a new treatment in a clinical trial, continuous, unbiased monitoring is more powerful than self-reporting with all its associated biases and burden on the participant.

We combine acoustic and semantic features of speech to characterize PD medication state. Our study explored different, easily implementable speech tasks for monitoring PWP, and obtained high accuracy rates for differentiating medication states. Our best results were obtained using the picture description task, which collected participants’ free speech. Our accuracy results in this preliminary study, which ranged from 0.60 to 0.89, demonstrate the feasibility of this method to monitor PD patients and assess dopamine replacement therapy effects. These results support the potential of this approach to be used as a complementary tool to aid neurologists in monitoring PD patients. We acknowledge that one of the main limitations in this study is the small cohort of PD participants and that a larger cohort (with participant enrollment at different sites) is necessary to further validate the approach. In addition, the effects seen in this study will be best validated with participant groups which include greater variability among MDS-UPDRS speech scores as well as greater differences in overall MDS-UPDRS scores between medication states. This type of analysis is only expected to work in persons still able to communicate and understand directions. The onset of PD-related dementia or even mild cognitive impairment would be expected to impact the results. An important limitation of the current method is that the two states we characterized were pre-determined – one being “ON” and the other being “OFF”. Based on prior clinical literature, we expected there would be a change in speech quality between the two states and our method was set up to differentiate these states. When applied to two unknown states for an individual PWP, this expectation does not necessarily hold true, and a threshold for determining “no change” needs to be designed/produced to avoid finding artifactual “change” between two equivalent states. Conversely, the smaller this threshold can be, the more sensitive this method can be used to quantify changes between the two states in question. Finally, there is a continuum of states in the transition between ON and OFF that would need to be classified. Further work is necessary to extend this work to automated speech assessment in continuous, minimally obtrusive, remote patient monitoring in PWP.

Methods

Participants

Twenty-five participants (6 females age 67 ± 6 years; 19 males age 69 ± 7.5 years) with idiopathic Parkinson’s disease were enrolled. The average disease duration was 5.8 + 3 years. All participants provided written informed consent to take part in the study. They were the first cohort recruited in a larger study for Project BlueSky (Pfizer-IBM Research collaboration)⁶³. Participants were recruited and the protocol was run at Tufts Medical Center, Boston, Massachusetts. The study was approved by the Tufts Health Sciences Campus Institutional Review Board, IRB # 12371. Inclusion criteria consisted of response to L-DOPA treatment, ability to recognize “wearing off” symptoms, participant confirmation of improvement after L-DOPA dose, and assessment of stage 3 or lower on the Hoehn and Yahr scale. Exclusion criteria were a current history of neurological disease besides PD, psychiatric illness that would interfere with participation, treatment with an investigational drug within 30 days or 5 half- lives (whichever is longer) preceding the enrollment in this study, alcohol consumption exceeding 7 drinks/week for females or 14 drinks/week for males, and use of a cardiac pacemaker, electronic pump or any other implanted medical devices (including deep brain stimulation devices). Each participant was evaluated by one of two neurologists during each visit, using the MDS-UPDRS-III protocol. PD participants were diagnosed in stages 1 (N = 2), 2 (N = 22), or 3 (N = 1) of the Hoehn and Yahr scale. UPDRS total scores were 35.6 ± 14.6 and 48.6 ± 13.6 for ON and OFF state respectively. Table 3 summarizes demographic information, clinical variables, and the improvement of several symptoms after L-DOPA intake for the analyzed subjects. In addition, Supplementary Table 1 provides the list of current medications for each participant.

Table 3 Subjects demographics.

Full size table

Design and protocol

This paper describes the analysis of three speech tasks from the “Observational Study in Parkinson’s Patient Volunteers to Characterize Digital Signatures Associated with Motor Portion of the MDS-UPDRS, Daily Living Activities and Speech” conducted at Tufts Medical Center. The speech tasks were performed by each participant at each of two sessions, one before and one after L-DOPA administration, to capture behavior in the “OFF” and “ON” medication states respectively. The order of medication state per session was randomized to counterbalance possible practice effects. As a result, 13 participants were in the “ON” state for session 1 and “OFF” state for session 2, and 12 participants were in the “OFF” state for session 1 and “ON” state for session 2. To ensure that we correctly acquired the data to reflect both medication states, medication dosage and timing was dictated by the participant’s normal daily regimen of dopamine replacement therapy. Following IRB guidelines, we did not have participants take any other dosage. Both medication states were self-reported by the patients and confirmed by the neurologist. To ensure that we captured peak medication effects in the ON-state, all patients arrived in OFF-state to the clinic. This was confirmed by the neurologist. When the session was ON-state, the patient took his/her scheduled L-DOPA dose and the evaluation began after both the participant and neurologist confirmed the ON-state (state ON/OFF questioning was performed every 0.5 h until ON or 1.5 h post-dose, whatever was earlier). If the first session was in ON-state, the second session (OFF-state) began between 0.5 to 1 h before their next scheduled L-DOPA dose the same day or up to 14 days later.

In the first speech task, participants described the “Cookie Theft” picture from the Boston Diagnostic Aphasia Exam⁶⁴ and a second, similar picture, the “Lightbulb Changing”⁶⁵ (both pictures are shown in Supplementary Fig. 1). Participants were asked to provide a verbal description of the picture. The “Cookie Theft” picture was presented to all participants in session 1 and the “Lightbulb Changing” picture in session 2. The objective of this task was to evaluate cognitive skills and communication ability, as well as to check for changes in action verb use^{33,34,35,36,37,38,39}^,42. The second task was reverse counting, a modification of the classic test for mental state evaluation^66,67 where participants count backward by three, starting from a different (experimenter-provided) number in each session, to maintain the level of cognitive difficulty. This cognitive assessment test evaluates concentration in the participant. The third speech task was a diadochokinetic rate test widely used for assessing dysarthria^42,68 to measure speech production. In this test, participants were asked to pronounce the syllable sequence “pa-ta-ka” as rapidly as possible for 10 s.

Data acquisition

To record the speech tasks, participant wore a Shure SM10A, a head-mounted, low-impedance, dynamic cardioid microphone. Audacity software⁶⁹ was used to record the speech task using 16-bits at 44.1 kHz. All audio recordings were saved in the uncompressed ‘.wav’ format.

Feature extraction

Processing of the speech recordings was performed using Python⁷⁰ and Praat^71,72. Acoustic and prosodic (speech tempo) features were computed in all three speech tasks, and semantic features were computed for picture description, as explained below.

Speech production in humans is the result of modulating the source of sound energy that comes from the larynx with different parts of the vocal tract (e.g., the oral cavity). Speech degradation such as the one presented in Parkinson’s disease is usually a consequence of anomalies in the functionality of the parts of the vocal tract (e.g. imprecise articulation) or in the source (e.g., breathy voice). Cepstral analysis is useful for speech analysis as it can separate the sound source from its modulation. MFCCs is a technique that not only incorporates cepstral analysis, but also uses a non-linear scale (Mel scale) that approximates the human auditory system’s response. Due to these advantages, MFCCs have been used in speaker identification methods, speech quality assessment⁷³, and in classification of neurological diseases^23,26,74 with great accuracy. Thirteen MFCCs were calculated using the “python-speech-features” package⁷⁵. Following common practice⁷⁶, the first coefficient was replaced by the log of the total frame energy in order to analyze the overall energy in the speech. To calculate the coefficients, a window size of 25 milliseconds (ms) and window overlap of 10 ms were used, and pauses were automatically removed from the recording. A pause was defined by a silence threshold of −25 dB and minimum duration of 100 ms⁷⁷. To represent the distribution of each coefficient, we computed 10 statistical descriptors: mean (mn), variance (vn), kurtosis (kur), skewness (sk), mode (mod), percentiles 10th (pct10), 25th (pct25), 50th (pct50), 75th (pct75), and 90th (pct90). The rationale for using these statistical descriptors is that most of these distributions are not Gaussian, and therefore mean and variance do not fully characterize them. For any distribution, kurtosis is a measure of how “wide” or “narrow” it is, skewness is a measure of its asymmetry, the 50th percentile (or median) is a robust (to outliers) measure of the central tendency of the distribution, and similarly the 10th and 90th percentiles are robust versions of the minimum and maximum. These 130 features (10 descriptors for each of 13 coefficient distributions) were calculated on all speech tasks.

To characterize speech tempo, a prosodic feature, we used nuclei syllable (NS)⁷⁸. This feature estimates the temporal location of syllables within the speech stream. This analysis detects individual syllables in speech by identifying peaks in intensity (i.e., loudness) that are preceded and followed by dips in intensity. We then computed the elapsed time between syllables in the speech recording both with and without pauses removed. To represent the distribution of syllable duration, we computed 8 statistical descriptors (percentiles 10 and 90, mode, mean, variance, skewness, kurtosis plus interquartile range—IQR, a measure of variability).

As the picture description task elicited narrative speech, we also analyzed the semantic content of the description (semantic features; SF). The theory of embodied cognition posits that the motor (action) and sensory (perception) systems influence overall cognitive processing, and in particular, linguistic processing of words that are related to motor and sensory processes, and that they activate the same brain networks. Thus, the strong form of the theory predicts that people who have impaired movement abilities, such as those with PD, should show deficits in linguistic processing of words related to actions and motor activities^79,80. For this reason, we calculated semantic similarity to evaluate the relationship between the descriptions of the pictures by the participants and action or non-action words. To compute the similarity, we first isolated the nouns and verbs from the manually-transcribed recordings using the Stanford parser⁸¹. Based on the previous literature^{33,34,35,36,37,38,39,42}, the following seed words were chosen as action or non-action base words for calculating semantic distance: action, act, move, play, energetic, inaction, sleep, rest, sit and wait. Next, we obtained a numerical representation of all the words using Global Vectors for word representations (GloVe)^82,83. Finally, similarity distance (where a larger value indicates “more similar” not “farther away”) was computed between each verb and noun spoken by the participant and each of the seed words. To represent the distribution obtained for each seed word, the following statistical descriptors were calculated from the distances of the participant’s words: median, 10th percentile, 90th percentile, skewness, kurtosis, IQR (Inter Quartile Range). The total number of words (nw) was also computed in the analysis.

Statistical analysis

To evaluate whether these speech production features were sensitive to the differences between the two medication states, we performed two-sample paired t-tests for each feature, comparing their values across participants in the “ON” versus “OFF” state. To investigate how the features interacted in each medication state, we also computed the partial correlations among the top features for each speech task. Partial correlation captured the pattern of covariation between a pair of features by removing the effect of the other analyzed features.

Classification

We evaluated whether our features could differentiate one medication state from another by applying four general-purpose classifiers: elastic net (EN), logistic regression (LR) with l1-norm regularization, naive Bayes (NB) and random forest (RF). Since it has been demonstrated by Rusz et al.⁸⁴ that age and gender can bias the results in PD vs. control classification tasks, we focused our analysis on subject-based changes. For this, we calculated the difference of the speech features between the two states (“ON”/”OFF”) for each participant. Features were standardized (mean = 0 and standard deviation = 1) before being input to the classifiers. We used 10-fold cross-validation, leaving entire participants out in the test folds. We chose a fixed number of features to provide the classifiers, selecting the top-ranked 5 features. Other parameters, including those specific to a classifier type, were selected through a double-nested approach to avoid over-fitting. To determine which features were the highest-ranked and should be included in the classifier (feature selection), all features were rank-ordered based on the p-values from the paired t-tests on the training folds to order the set by how well each feature individually discriminated between the two classes (“ON” and “OFF”). After feature selection, we ran each of the four classifiers using the 5 top-ranked features, for each category of features (NS only, MFCCs only, MFCCs + SF, etc., further details below). Finally, accuracy rates were calculated over 50 instantiations of the 10-fold partition; we provide mean and standard deviation and report the best classifier. For comparison, we also run the same classifiers using all of the features (see Supplementary Table 2).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data used in this study are available on request from the corresponding author R.N. and agreement with Tufts University through B.K.H. The data are not publicly available due to voice being potentially identifiable, which could compromise research participant privacy.

Code availability

Our methodology is implemented using open-source libraries from Python and Praat only. Here, we specify the names and versions of the libraries used for each section of our approach. MFCC features were extracted using python-speech-features v5.0 library. For prosody, specifically nuclei syllable, information was obtained using SyllableNuclei v.Sep2010 Praat script⁷⁸ and pauses were removed using PraatVocalToolKit⁸⁵. For semantic features, the transcripts were processed using the Stanford Parser⁸¹ lexparser v.2017 and the semantic embedding was obtained using GloVe v1.2^82,83. All statistical analyses were performed using numpy v.1.12.0 and scipy v.0.18.1 python libraries. Finally, for classification, we trained our models using scikit-learn 0.18.1 and sklearn-contrib-lightning v0.5.0 python libraries. For each algorithm, we fixed the values of some parameters and selected the optimal values of other ones from a pool of values using a cross validation approach. For random forests, we used a fixed number of trees (20) and we optimized the maximum depth from these values: [2, 3, 5, 7, 10, 16]. Similarly, logistic regression with l1-norm regularization had a fixed tradeoff parameter of C = 1, and we optimized the amount of regularization from this set of values: [1E−5, 2.5E−4, 6.3E−3, 1.6E−1,4, 1E2]. For elastic net, we optimized the amount of regularization and the l1/l2 ratio using the set of values [1E−4, 1E−2, 1, 1E2, 1E4] and [0.001, 0.01, 0.1, 0.5, 0.9], respectively. Finally, naive Bayes was used with the default parameters.

References

Lee, A. & Gilbert, R. M. Epidemiology of Parkinson disease. Neurol. Clin. 34, 955–965 (2016).
PubMed Google Scholar
Parkinson’s Foundation. Available at: https://www.parkinson.org/Understanding-Parkinsons/Statistics (Accessed: 4th February 2019).
Schaafsma, J. D. et al. Gait dynamics in Parkinson’s disease: relationship to Parkinsonian features, falls and response to levodopa. J. Neurol. Sci. 212, 47–53 (2003).
PubMed Google Scholar
McNeely, M. E., Duncan, R. P. & Earhart, G. M. Medication improves balance and complex gait performance in Parkinson disease. Gait Posture 36, 144–148 (2012).
PubMed PubMed Central Google Scholar
Ramig, L., Halpern, A., Spielman, J., Fox, C. & Freeman, K. Speech treatment in Parkinson’s disease: Randomized controlled trial (RCT). Mov. Disord. 33, 1777–1791 (2018).
CAS PubMed PubMed Central Google Scholar
Polychronis, S., Niccolini, F., Pagano, G., Yousaf, T. & Politis, M. Speech difficulties in early de novo patients with Parkinson's disease. Parkinsonism Relat. Disord. 64, 256–261 (2019).
PubMed Google Scholar
McNeely, M. E. & Earhart, G. M. Medication and subthalamic nucleus deep brain stimulation similarly improve balance and complex gait in Parkinson disease. Parkinsonism Relat. Disord. 19, 86–91 (2013).
PubMed Google Scholar
Hametner, E., Seppi, K. & Poewe, W. The clinical spectrum of levodopa-induced motor complications. J. Neurol. 257, 268–275 (2010).
CAS Google Scholar
Lees, A. J. The on-off phenomenon. J. Neurol. Neurosurg. Psychiatry 52, 29–37 (1989).
PubMed Central Google Scholar
wearingoff. http://www.wearingoff.eu/wearing-off/describing-wearing-off (2017).
Goetz, C. G. et al. Movement disorder society-sponsored revision of the unified Parkinson’s disease rating scale (MDS-UPDRS): scale presentation and clinimetric testing results. Mov. Disord. 23, 2129–2170 (2008).
PubMed Google Scholar
Martinez-Martin, P. et al. Unified Parkinson’s disease rating scale characteristics and structure. Mov. Disord. 9, 76–83 (1994).
CAS PubMed Google Scholar
Richards, M., Marder, K., Cote, L. & Mayeux, R. Interrater reliability of the Unified Parkinson’s Disease Rating Scale motor examination. Mov. Disord. 9, 89–91 (1994).
CAS PubMed Google Scholar
Nöth, E., Rudzicz, F., Christensen, H., Orozco-Arroyave, J. R. & Chinaei, H. Remote monitoring of neurodegeneration through speech. In Final Presentation of the Third Frederick Jelinek Memorial Summer Workshop (JSALT) (2016).
Hauser, R. A. et al. A home diary to assess functional status in patients with Parkinson’s disease with motor fluctuations and dyskinesia. Clin. Neuropharmacol. 23, 75–81 (2000).
CAS PubMed Google Scholar
Pawlukowska, W., Szylińska, A., Kotlęga, D., Rotter, I. & Nowacki, P. Differences between subjective and objective assessment of speech deficiency in parkinson disease. J. Voice 32, 715–722 (2018).
PubMed Google Scholar
Logemann, J. A., Fisher, H. B., Boshes, B. & Blonsky, E. R. Frequency and cooccurrence of vocal tract dysfunctions in the speech of a large sample of Parkinson patients. J. Speech Hear. Disord. 43, 47–57 (1978).
CAS PubMed Google Scholar
Smith, K. M. & Caplan, D. N. Communication impairment in Parkinson’s disease: Impact of motor and cognitive symptoms on speech and language. Brain Lang. 185, 38–46 (2018).
PubMed Google Scholar
Auclair-Ouellet, N., Lieberman, P. & Monchi, O. Contribution of language studies to the understanding of cognitive impairment and its progression over time in Parkinson's disease. Neurosci. Biobehav. Rev. 80, 657–672 (2017).
PubMed Google Scholar
Im, H. et al. Effect of levodopa on speech dysfluency in Parkinson’s disease. Mov. Disord. Clin. Pr. 6, 150–154 (2019).
Google Scholar
Magee, M., Copland, D. & Vogel, A. P. Motor speech and non-motor language endophenotypes of Parkinsonas disease. Expert Rev. Neurother. 19, 1191–1200 (2019).
CAS PubMed Google Scholar
Little, M. A. et al. Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. IEEE Trans. Biomed. Eng. 56, 1015–1022 (2009).
PubMed PubMed Central Google Scholar
Tsanas, A., Little, M. A., McSharry, P. E., Spielman, J. & Ramig, L. O. Novel speech signal processing algorithms for high-accuracy classification of Parkinson’s disease. IEEE Trans. Biomed. Eng. 59, 1264–1271 (2012).
PubMed Google Scholar
Yang, S. et al. Effective dysphonia detection using feature dimension reduction and kernel density estimation for patients with Parkinson’s disease. PLoS ONE 9, e88825 (2014).
PubMed PubMed Central Google Scholar
Belalcázar-Bolaños, E. A., Orozco-Arroyave, J. R., Vargas-Bonilla, J. F., Haderlein, T. & Nöth, E. Glottal flow patterns analyses for Parkinson’s disease detection: acoustic and nonlinear approaches. In International Conference on Text, Speech, and Dialogue 400–407 (2016).
Vásquez-Correa, J. C., Orozco-Arroyave, J. R., Bocklet, T. & Nöth, E. Towards an automatic evaluation of the dysarthria level of patients with Parkinson’s disease. J. Commun. Disord. 76, 21–36 (2018).
PubMed Google Scholar
Okada, Y., Murata, M. & Toda, T. Effects of levodopa on vowel articulation in patients with Parkinson’s disease. Kobe J. Med. Sci. 61, E144–E154 (2015).
CAS PubMed Google Scholar
Fabbri, M. et al. Speech and voice response to a levodopa challenge in late-stage Parkinson’s disease. Front. Neurol. 8, 432 (2017).
PubMed PubMed Central Google Scholar
Fabbri, M. et al. Do patients with late-stage Parkinson’s disease still respond to levodopa? Parkinsonism Relat. Disord. 26, 10–16 (2016).
PubMed Google Scholar
Pinho, P. et al. Impact of levodopa treatment in the voice pattern of Parkinson’s disease patients: a systematic review and meta-analysis1. Pinho P, Monteiro L, Soares MF de P, Tourinho L, Melo A, Nóbrega AC. Impact of levodopa treatment in the voice pattern of Parkinson’. in CoDas 30, e20170200 (2018).
Boulenger, V., Hauk, O. & Pulvermüller, F. Grasping ideas with the motor system: semantic somatotopy in idiom comprehension. Cereb. Cortex 19, 1905–1914 (2008).
PubMed PubMed Central Google Scholar
Péran, P. et al. Mental representations of action: the neural correlates of the verbal and motor components. Brain Res. 1328, 89–103 (2010).
PubMed Google Scholar
Garcia, A. M. et al. How language flows when movements don’t: an automated analysis of spontaneous discourse in Parkinson’s disease. Brain Lang. 162, 19–28 (2016).
PubMed Google Scholar
Rodriguez-Ferreiro, J., Menéndez, M., Ribacoba, R. & Cuetos, F. Action naming is impaired in Parkinson disease patients. Neuropsychologia 47(14), 3271–3274 (2009).
PubMed Google Scholar
Fernandino, L. et al. Parkinson’s disease disrupts both automatic and controlled processing of action verbs. Brain Lang. 127, 65–74 (2013).
PubMed Google Scholar
Fernandino, L. et al. Where is the action? Action sentence processing in Parkinson’s disease. Neuropsychologia 51, 1510–1517 (2013).
PubMed PubMed Central Google Scholar
Herrera, E. & Cuetos, F. Semantic disturbance for verbs in Parkinson’s disease patients off medication. J. Neurolinguist. 26, 737–744 (2013).
Google Scholar
Garcia, A. M. & Ibáñez, A. Words in motion: Motor-language coupling in Parkinson’s disease. Transl. Neurosci. 5, 152–159 (2014).
Google Scholar
Cotelli, M., Manenti, R., Brambilla, M. & Borroni, B. The role of the motor system in action naming in patients with neurodegenerative extrapyramidal syndromes. Cortex 100, 191–214 (2017).
PubMed Google Scholar
Oudeyer, P. Novel useful features and algorithms for the recognition of emotions in human speech. In Speech Prosody 2002, International Conference (2002).
Alonso, J. B., Cabrera, J., Medina, M. & Travieso, C. M. New approach in quantification of emotional intensity from the speech signal: emotional temperature. Expert Syst. Appl. 42, 9554–9564 (2015).
Google Scholar
Hauk, O., Johnsrude, I. & Pulvermüller, F. Somatotopic representation of action words in human motor and premotor cortex. Neuron 41, 301–307 (2004).
CAS PubMed Google Scholar
De Letter, M., Santens, P., De Bodt, M., Boon, P. & Van Borsel, J. Levodopa-induced alterations in speech rate in advanced Parkinson’s disease. Acta Neurol. Belg. 106, 19 (2006).
PubMed Google Scholar
Skodda, S., Flasskamp, A. & Schlegel, U. Instability of syllable repetition in Parkinson’s disease—influence of levodopa and deep brain stimulation. Mov. Disord. 26, 728–730 (2011).
PubMed Google Scholar
Simonyan, K., Horwitz, B. & Jarvis, E. D. Dopamine regulation of human speech and bird song: a critical review. Brain Lang. 122, 142–150 (2012).
PubMed PubMed Central Google Scholar
Brabenec, L., Mekyska, J., Galaz, Z. & Rektorova, I. Speech disorders in Parkinson’s disease: early diagnostics and effects of medication and brain stimulation. J. Neural Transm. 124, 303–334 (2017).
CAS PubMed Google Scholar
Gomez-Vilda, P. et al. Parkinson’s disease monitoring by biomechanical instability of phonation. Neurocomputing 255, 3–16 (2017).
Google Scholar
Naranjo, N. V., Lara, E. M., Rodriguez, I. M. & Garcia, G. C. High-frequency components of normal and dysphonic voices. J. Voice 8, 157–162 (1994).
Google Scholar
Monson, B. B., Hunter, E. J., Lotto, A. J. & Story, B. H. The perceptual significance of high-frequency energy in the human voice. Front. Psychol. 5, 587 (2014).
PubMed PubMed Central Google Scholar
Vitela, A. D., Monson, B. B. & Lotto, A. J. Phoneme categorization relying solely on high-frequency energy. J. Acoust. Soc. Am. 137, EL65–EL70 (2015).
PubMed Google Scholar
Ciucci, M. R. et al. Reduction of dopamine synaptic activity: degradation of 50-kHz ultrasonic vocalization in rats. Behav. Neurosci. 123, 328 (2009).
PubMed PubMed Central Google Scholar
Nwe, T. L., Wei, F. S. & De Silva, L. C. Speech based emotion classification. In TENCON 2001. Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology vol. 1, 297–301 (2001).
Koolagudi, S. G. & Rao, K. S. Emotion recognition from speech: a review. Int. J. speech Technol. 15, 99–117 (2012).
Google Scholar
Rawat, A. & Mishra, P. K. Emotion recognition through speech using neural network. Int. J. 5, 422–428 (2015).
Google Scholar
Zhu, L., Chen, L., Zhao, D., Zhou, J. & Zhang, W. Emotion recognition from Chinese speech for smart affective services using a combination of SVM and DBN. Sensors 17, 1694 (2017).
PubMed Central Google Scholar
Swain, M., Routray, A. & Kabisatpathy, P. Databases, features and classifiers for speech emotion recognition: a review. Int. J. Speech Technol. 21, 93–120 (2018).
Google Scholar
Ang, J., Dhillon, R., Krupski, A., Shriberg, E. & Stolcke, A. Prosody-based automatic detection of annoyance and frustration in human-computer dialog. In Seventh International Conference on Spoken Language Processing (2002).
Nordeen, K. W. & Nordeen, E. J. Auditory feedback is necessary for the maintenance of stereotyped song in adult zebra finches. Behav. Neural Biol. 57, 58–66 (1992).
CAS PubMed Google Scholar
Leonardo, A. & Konishi, M. Decrystallization of adult birdsong by perturbation of auditory feedback. Nature 399, 466 (1999).
CAS PubMed Google Scholar
Tourville, J. A., Reilly, K. J. & Guenther, F. H. Neural mechanisms underlying auditory feedback control of speech. Neuroimage 39, 1429–1443 (2008).
PubMed Google Scholar
Eliasova, I. et al. Acoustic evaluation of short-term effects of repetitive transcranial magnetic stimulation on motor aspects of speech in Parkinson’s disease. J. Neural Transm. 120, 597–605 (2013).
CAS PubMed Google Scholar
Huh, Y. E. et al. Differences in early speech patterns between Parkinson variant of multiple system atrophy and Parkinson’s disease. Brain Lang. 147, 14–20 (2015).
PubMed Google Scholar
Erb, M. K., et al. The BlueSky Project: monitoring motor and non-motor characteristics of people with Parkinson’s disease in the laboratory, a simulated apartment, and home and community settings. In 22nd International Congress of Parkinson’sDisease and Movement Disorders (2018).
Goodglass, H., Kaplan, E. & Barresi, B. The Assessment of Aphasia and Related DIsorders. (Lippincott Williams & Wilkins, 2001).
Marshall, R. C. & Wright, H. H. Developing a clinician-friendly aphasia test. Am. J. Speech Lang. Pathol. 16, 295–315 (2007).
PubMed Google Scholar
Hayman, M. A. X. Two minute clinical test for measurement of intellectual impairment in psychiatric disorders. Arch. Neurol. Psychiatry 47, 454–464 (1942).
Google Scholar
Smith, A. The serial sevens subtraction test. Arch. Neurol. 17, 78–80 (1967).
CAS PubMed Google Scholar
Ackermann, H., Konczak, J. & Hertrich, I. The temporal control of repetitive articulatory movements in Parkinson’s disease. Brain Lang. 56, 312–319 (1997).
CAS PubMed Google Scholar
Audacity.
PythonSoftwareFoundation. Welcome to Python.org. Available at: https://www.python.org/. (Accessed: 15th March 2018)
Boersma, P. Praat, a system for doing phonetics by computer. Glot Int 5, 341–347 (2001).
Google Scholar
Boersma, P. & Weenink, D. Praat: doing phonetics by computer (2017).
Kapoor, T. & Sharma, R. K. Parkinson’s disease diagnosis using Mel-frequency cepstral coefficients and vector quantization. Int. J. Comput. Appl 14, 43–46 (2011).
Google Scholar
Norel, R., Pietrowicz, M., Agurto, C., Rishoni, S. & Cecchi, G. Detection of amyotrophic lateral sclerosis (ALS) via acoustic analysis. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2018-September (2018).
Lyons, J. python-speech-features (2016).
Jurafsky, D. & Martin, J. H. Speech and Language Processing, 2nd edn (Prentice-Hall, Inc., 2009).
Griffiths, R. Pausological research in an L2 context: a rationale, and review of selected studies. Appl. Linguist. 12, 345–364 (1991).
Google Scholar
de Jong, N. H. & Wempe, T. Praat script to detect syllable nuclei and measure speech rate automatically. Behav. Res. Methods 41, 385–390 (2009).
PubMed Google Scholar
Boulenger, V. et al. Word processing in Parkinson’s disease is impaired for action verbs but not for concrete nouns. Neuropsychologia 46, 743–756 (2008).
PubMed Google Scholar
Gallese, V. & Cuccio, V. The neural exploitation hypothesis and its implications for an embodied approach to language and cognition: insights from the study of action verbs processing and motor disorders in Parkinson’s disease. Cortex 100, 215–225 (2018).
PubMed Google Scholar
Klein, D. & Manning, C. D. Accurate unlexicalized parsing. In Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, vol. 1 423–430 (2003).
Pennington, J., Socher, R. & Manning, C. D. GloVe: Global Vectors for Word Representation. In Empirical Methods in Natural Language Processing (EMNLP) 1532–15431 (2014).
Pennington, J., Socher, R. & Manning, C. D. GloVe.6B. https://nlp.stanford.edu/projects/glove (2017).
Rusz, J., Novotny, M., Hlavnicka, J., Tykalova, T. & Ruzicka, E. High-accuracy voice-based classification between patients with Parkinson’s disease and other neurological diseases may be an easy task with inappropriate experimental design. IEEE Trans. Neural Syst. Rehabil. Eng. 25, 1319–1321 (2017).
PubMed Google Scholar
Corretge, R. Praat Vocal Toolkit (2019).

Download references

Acknowledgements

The authors are deeply thankful to the volunteer participants without whose active involvement the present study would not have been possible. Tufts University School of Medicine was the sponsor. IBM Inc. and Pfizer Inc. funded this work.

Author information

Deceased: J. J. Rice.

Authors and Affiliations

IBM T.J. Watson Research Center, Yorktown Heights, NY, 10598, USA
R. Norel, C. Agurto, S. Heisig, J. J. Rice, R. Ostrand & G. A. Cecchi
Pfizer Digital Medicine & Translational Imaging: Early Clinical Development, Cambridge, MA, 02139, USA
H. Zhang, P. W. Wacnik & V. L. Ramos
Department of Neurology, Tufts University School of Medicine and Tufts Medical Center, 800 Washington St, Boston, MA, 02111, USA
B. K. Ho

Authors

R. Norel
View author publications
You can also search for this author in PubMed Google Scholar
C. Agurto
View author publications
You can also search for this author in PubMed Google Scholar
S. Heisig
View author publications
You can also search for this author in PubMed Google Scholar
J. J. Rice
View author publications
You can also search for this author in PubMed Google Scholar
H. Zhang
View author publications
You can also search for this author in PubMed Google Scholar
R. Ostrand
View author publications
You can also search for this author in PubMed Google Scholar
P. W. Wacnik
View author publications
You can also search for this author in PubMed Google Scholar
B. K. Ho
View author publications
You can also search for this author in PubMed Google Scholar
V. L. Ramos
View author publications
You can also search for this author in PubMed Google Scholar
G. A. Cecchi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Research project: (A) Conception: J.J.R., G.A.C., R.N., and PWW. (B) Organization: P.W.W. and S.H. (C) Execution: R.N., H.Z., S.H., R.O., and B.K.H.; Statistical analysis: R.N. and C.A. (A) Design: G.A.C., R.N., and C.A. (B) Execution: R.N. and C.A. (C) Review and critique; Manuscript preparation: R.N. (A) Writing of the first draft, R.N. and C.A. (B) Review and critique: G.A.C., J.J.R., H.Z., P.W.W., S.H., R.O., and B.K.H.

Corresponding author

Correspondence to R. Norel.

Ethics declarations

Competing interests

R. Norel, C. Agurto, S. Heisig, J.J. Rice, R. Ostrand, G.A. Cecchi disclose that their employer, IBM Research, is the research branch of IBM Corporation. R. Norel, G.A. Cecchi, S. Heisig own stock in IBM Corporation. H. Zhang, P.W. Wacnik, V.L. Ramos disclose that their employer, Pfizer Digital Medicine & Translational Imaging: Early Clinical Development, is a research branch of Pfizer Corporation. B.K. Ho is employed by the Department of Neurology, Tufts University School of Medicine and Tufts Medical Center.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

reporting summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Norel, R., Agurto, C., Heisig, S. et al. Speech-based characterization of dopamine replacement therapy in people with Parkinson’s disease. npj Parkinsons Dis. 6, 12 (2020). https://doi.org/10.1038/s41531-020-0113-5

Download citation

Received: 26 September 2019
Accepted: 19 May 2020
Published: 12 June 2020
DOI: https://doi.org/10.1038/s41531-020-0113-5

This article is cited by

SpEx: a German-language dataset of speech and executive function performance
- Julia A. Camilleri
- Julia Volkening
- Susanne Weis
Scientific Reports (2024)
Neurocognitive correlates of semantic memory navigation in Parkinson’s disease
- Felipe Diego Toro-Hernández
- Joaquín Migeot
- Adolfo M. García
npj Parkinson's Disease (2024)
Detecting Parkinson’s disease and its cognitive phenotypes via automated semantic analyses of action stories
- Adolfo M. García
- Daniel Escobar-Grisales
- Juan Rafael Orozco-Arroyave
npj Parkinson's Disease (2022)
Short-term effect of dopaminergic medication on speech in early-stage Parkinson’s disease
- Tereza Tykalova
- Michal Novotny
- Jan Rusz
npj Parkinson's Disease (2022)
Parkinson's disease patients with freezing of gait have more severe voice impairment than non-freezers during “ON state”
- Qian Yu
- Xiaoya Zou
- Oumei Cheng
Journal of Neural Transmission (2022)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Statistical analysis

Classification

Discussion

Methods

Participants

Design and protocol

Data acquisition

Feature extraction

Statistical analysis

Classification

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links