Introduction

Bilateral subthalamic nucleus deep brain stimulation (STN-DBS) represents a short and long-term effective treatment in advanced Parkinson’s Disease (PD)1,2. However, the long-term effects of bilateral STN-DBS on axial features3,4 and different speech variables are still debated. After surgery, PD patients may develop heterogeneous profiles of dysarthria related to the possible spreading of current to cerebellothalamic, cortico-bulbar, cortico-spinal and pallido-fugal levels5,6. Moreover, it has been previously reported that speech intelligibility may worsen 1 year after surgery when compared with those under a control group under optimal medical treatment7. The majority of the studies regarding the effects of STN-DBS on speech focused on short-term follow-up7,8,9,10,11 while few studies have assessed the long-term effects2,12,13,14. In particular, a previous study reported a worsening of speech intelligibility at five and eight years after surgery in the off-medication condition12, while another acoustic study reported variation of the long-term averaged spectrum (LTAS) descriptors for reading and monologue in different stimulation conditions in the long-term after surgery14. The objective of this study was to evaluate the long-term (five years) effects of bilateral STN-DBS on speech in advanced PD patients using a standardized perceptual-acoustic analysis of speech. In the following paragraphs we will report the results of the study, followed by the discussion of these findings before finally describing the methodology of the study.

Results

Patient population

From 2012 to 2017, 40 PD patients underwent STN-DBS. Of these, fifteen subjects were excluded from the study because of missing data (eight patients), lack of consent to participate (four patients), and non-native Italian speakers (three patients). The remaining 25 PD patients with a median follow-up of five years after surgery (range 3–7 years) were included (males: 18; disease duration at surgery: 10.44 [± 4.62] years; age at surgery: 58.40 [± 5.73] years; age at PD onset 47.76 [± 5.63] years). Nineteen patients were included in the PIGD subtype, five in the indeterminate, and one in the TD subtype. Genetic assessment revealed heterozygous mutation in the GBA gene in three patients (12%). Preoperative brain-MRI revealed the presence of white matter hyperintensities of vascular origin in four patients (16%). The mean preoperative levodopa responsiveness was 62.24% (± 16.38%). A detailed description of stimulation parameters and settings is reported in Table 1, while the changes of the different speech and clinical variables in the different conditions tested are shown in Table 2. Concerning the perceptual assessment of dysarthria severity, a significant reduction of the score was found in all the three postoperative conditions tested when compared with the preoperative ones. On the contrary, no significant differences were found by comparing the different postoperative conditions with each other.

Table 1 Stimulation parameters at postoperative evaluation.
Table 2 Changes of speech and clinical variables over time.

Primary outcomes

In the long-term and at the group level, speech intelligibility did not significantly worsen with respect to preoperative values when comparing the postoperative on-stimulation/off-medication condition with the preoperative off-medication condition (z =  − 0.371, p = 0.710). The shutdown of the stimulation led to an acute significant worsening of speech intelligibility in the postoperative assessment (z =  − 3.500, p < 0.001) when comparing the on-stimulation/off-medication and off-stimulation/off-medication conditions. Furthermore, the assessment of disease progression effects on speech intelligibility obtained when comparing the postoperative off-stimulation/off-medication condition with the preoperative off-medication condition showed a significant worsening with respect to preoperative values (z =  − 2.92, p < 0.005). Analyzing the long-term postoperative changes of speech intelligibility at the individual level for each patient, sixteen patients were classified as “stable” while the remaining nine patients integrated a “worsened” subgroup (Fig. 1). Compared to a “stable” subgroup, “worsened” patients showed a greater preoperative and postoperative disease motor severity quantified by: (i) higher scores in the UPDRS part III total score, H&Y staging, and akinesia subscore in the off-medication condition; (ii) lower loudness of spontaneous speech and lower speech intelligibility in all three postoperative conditions; (iii) lower intensity of prolonged phonation in on-stimulation/off-medication and off-stimulation/off-medication conditions (Table 3). Furthermore, they showed a significantly worse performance in the preoperative Stroop test (errors) and postoperative Trail Making test B. In addition, all the GBA1-PD patients were included in the “worsened” subgroup with a trend toward significance (p = 0.06) at the chi-square independence test. Disease duration, age at surgery, follow-up duration and stimulation parameters were not significantly different between the two subgroups.

Figure 1
figure 1

Changes of speech intelligibility in the stable and worsened groups.

Table 3 Differences in clinical and speech variables between “worsened” and “stable” subgroups.

Secondary outcomes

Regarding the comparison between the different speech variables in the conditions tested, a statistically significant reduction of the duration of sustained phonation was found in the off-stimulation/off-medication condition when compared with the on-stimulation/off-medication condition (p < 0.005) and with the preoperative on-medication condition (p < 0.005). In addition, the mean intensity of sustained phonation was significantly reduced in the off-stimulation/off-medication condition if compared with the preoperative off-medication condition (p < 0.005) highlighting the negative effects of disease progression on this acoustic parameter.

A direct correlation between the variation of speech intelligibility and tremor (p = 0.026) and PIGD subscores (p = 0.040) by comparing the off-stimulation/off-medication with the on-stimulation/off-medication one was also found, highlighting that the rebound of tremor and axial symptoms can worsen speech in the acute testing situation. On the contrary, no correlations were found between the variation of speech intelligibility and the changes in the different motor scores and subscores when comparing the on-stimulation/off-medication and preoperative off-medication condition. Furthermore, a negative correlation was also found in the on-stimulation/off-medication condition between speech intelligibility and akinesia subscore (p = 0.025) meaning that patients with a more severely decreased movements amplitude and speed have less intelligibility. The negative correlation between speech intelligibility and akinesia subscore was also confirmed in the on-stimulation/on-medication condition (p = 0.013).

Discussion

Our results add to further observations regarding the long-term effects of bilateral STN-DBS on speech in PD patients7,12. Globally, the studied cohort showed a long-term maintenance of speech intelligibility after surgery, highlighting some possible beneficial long-term effects of STN-DBS when comparing off stimulation and on stimulation conditions. This is consistent with previous studies that showed no worsening of speech following STN-DBS when assessed either with the UPDRS speech item15 or dedicated perceptual assessments16, which was reduced with the introduction of L-DOPA17, in line with our findings. Nevertheless, when using a proper perceptual assessment of speech intelligibility, it is commonly accepted that a significant worsening of speech intelligibility should be expected one year after surgery, when comparing postoperative on-stimulation/off-medication condition with the preoperative off-medication condition9,18. This has also been reported in the long-term by Aviles-Olmos et al.12, who reported a significant worsening of speech intelligibility at five (− 43.7%) and eight years (− 21.4%) in the off-medication conditions if compared with preoperative values. The different results obtained in our cohort may be due to the different evaluation of speech intelligibility (i.e., sentence task of the Assessment of Intelligibility for Dysarthric Speech and single word intelligibility). However, by looking at the individual cases of our cohort in detail, two distinct subgroups could be identified based on the long-term changes of speech intelligibility, with different clinical characteristics. This is in line with similar approaches that tried to disentangle different possible speech outcomes following STN-DBS18,19. Patients in the “worsened” subgroup showed a worse preoperative and postoperative motor disease severity together with a worse cognitive function particularly regarding executive-frontal domain. These results are in line with a recent 2-year follow-up study that underlined the potential predictor role of preoperative cognitive function for speech deterioration after STN-DBS19. The trend toward the significance of PIGD subscores between the two “worsened” and “stable” subgroups suggests a possible link between speech deterioration and axial features, confirming the correlation between hypokinetic dysarthria and PD axial symptoms8. A similar tendency towards significance was also found for the over-distribution of GBA1-PD patients in the “worsened” group. GBA1-PD patients complain about higher axial and cognitive burden when compared with wild-type PD patients20 leading to the assumption that GBA1-PD patients may be at higher risk of developing post-operatively not only gait and cognitive worsening but also speech deterioration. Interestingly, disease duration, sex, duration of follow-up and the total electrical energy delivered by STN-DBS21 did not significantly differ between the two groups. This result is in line with a previous study that did not find any association between speech intelligibility deterioration and disease duration7 even if another study by Tripoliti et al.9 found that disease duration was a predictive factor for speech outcome. In addition, concerning preoperative speech variables we did not find differences between the two groups, while postoperatively, both speech intelligibility and mean intensity of spontaneous speech were significantly lower in the worsened group if compared with the stable one. With regards to the comparison between the different speech variables, a statistically significant reduction of the intensity of sustained phonation emerged in the off-stimulation/off-medication condition if compared with the on-stimulation/off-medication condition highlighting the possible positive effect of STN-DBS on voice intensity, as already confirmed in previous studies7,18. The improvement of speech intensity due to STN-DBS has been linked to its effects on hypokinesia and rigidity of language-related organs18,22 which represent one of the major pathophysiological bases of hypokinetic dysarthria, characterized by hypophonia, monotony, hypoarticulation of consonants and inappropriate silences23.

A further result that emerged in our sample was the significant reduction of the maximum phonation time (MPT) in the off-stimulation/off-medication condition when compared to the on-stimulation/off-medication condition. This result shows how stimulation, even 5 years after surgery, can positively influence MPT that is significantly reduced following the device’s shutdown. This finding is in line with previous studies that reported a beneficial effect of stimulation on MPT in the short-term after surgery24. Even in this case, the reduction of MPT can be traced back to the underlying laryngeal dysfunction and the reduced respiratory volume mainly related to the rigidity and hypo-bradykinesia of the laryngeal and diaphragmatic muscles23.

This study also allows for evaluation of the possible effects of dopaminergic therapy alone on speech parameters by comparing the preoperative off-medication and on-medication conditions. The results obtained did not show statistically significant differences between these two conditions, suggesting that dopaminergic therapy alone did not significantly improve or maintain speech postoperatively. These findings were consistent with previous studies that showed little or no effects of levodopa alone on acoustic speech parameters8,25,26,27. In particular, no significant changes following levodopa intake were found in phonatory, articulatory or prosodic parameters, as previously reported27 confirming the possible involvement of non-dopaminergic pathways in the pathophysiology of hypokinetic dysarthria in PD8.

We also found a direct correlation between the variation of speech intelligibility and tremor and PIGD subscores highlighting that the rebound of tremor and axial symptoms may worsen speech in the acute testing situation when stimulation is turned off. In addition, speech intelligibility also correlated with the UPDRS akinesia subscore in two out of the three postoperative conditions tested, highlighting a possible link between the decrease in limb movement speed and amplitude and speech. This is in line with other studies that have reported a correlation between speech variables and limb bradykinesia28,29. Our study has several limitations, including a lack of definition for the position of the electrodes, the small sample size, the lack of a control group and the lack of the assessment of speech intelligibility also at the sentence level. Nevertheless, our findings highlight the possibility of positive effects of STN-DBS on speech intelligibility after surgery in the long-term. Interestingly, it also gives a better understanding of PD characteristics associated with long-term speech worsening after STN-DBS. This information may allow clinicians to improve candidates’ selection for DBS and refine prognostic accuracy (e.g., GBA1 genetic status influence on speech) and that, if necessary, early speech interventions should be used after surgery in PD patients treated with STN-DBS at a higher risk of speech deterioration.

Methods

Participants

Patients treated with bilateral STN-DBS from 2012 to 2017 at the Neurological Unit of the OCB University Hospital were included. All patients fulfilled the diagnosis of PD according to the UK Brain Bank criteria30 and suffered from disabling motor complications. Data from non-native Italian speakers were excluded. This study was approved by the ethics committee of the Area Vasta Emilia Nord, Italy (protocol number: 2019/0,056,629), and written informed consent was obtained from participants. The study was performed in accordance with the Declaration of Helsinki.

Clinical assessment

The clinical evaluation was performed in accordance with the CAPSIT-PD protocol31. Levodopa responsiveness was evaluated through an acute levodopa challenge. Hoehn and Yahr scale (H&Y) and the Unified Parkinson’s Disease Rating Scale (UPDRS)32 were applied to quantify disease severity both in the “off-medication” condition (obtained after a 12-h antiparkinsonian medication withdrawal) and “on-medication” condition (obtained after 60 min and the administration of a 30% higher dose of the usual levodopa morning intake)31. Different subscores were extrapolated from the UPDRS including tremor, postural instability/gait disorders (PIGD), akinesia and UPDRS item 18 (speech) subscores. PD motor phenotype (tremor dominant [TD], indeterminate and PIGD)33 was also extrapolated and patients were screened for the presence of mutations in the glucocerebrosidase-1 (GBA-1), leucine-rich repeat kinase-2, α-synuclein and parkin genes34,35. The total amount of dopaminergic medications was calculated as levodopa equivalent daily dose (LEDD) milligrams36. All patients underwent 3-Tesla brain-MRI to evaluate the presence/absence of white matter hyperintensities of vascular origin. All subjects were re-evaluated with a median five-years follow-up (range 3–7 years) after surgery. Neurological evaluation (superimposable with the preoperative one) and the speech assessment were carried out on the same day and in the following conditions: on-stimulation/off-medication (washout of at least 12-h of dopaminergic medications); off-stimulation/off-medication (stimulation was temporarily turned off for at least 1-h); on-stimulation/on-medication (stimulation was turned on and dopaminergic therapy was administered [early morning LEDD plus 30%]). Each patient underwent a complete neuropsychological assessment before surgery and at long-term evaluation, including phonemic fluency, spatial perception (localization of numbers), Raven’s progressive matrices, Stroop test and Trail making test part B.

Speech evaluation

Speech evaluation was performed both preoperatively during the acute levodopa challenge and at postoperative assessment. Each evaluation was carried out in a silent environment with conversation voice intensity and was recorded with a digital voice recorder (model SONY ICDPX240) kept 20 cm from the patient’s lips25. The perceptual-acoustic analysis was performed using Praat software37 and blinded to the patient’s condition. The following tasks were included: word intelligibility (calculated as the percentage of words correctly transcribed by the examiner among a set of 25 recorded words)25; oral diadochokinesis task in which participants produced the syllables /pa/, /ta/, /ka/ and the pseudoword /pataka/, as fast as they could with habitual pitch and loudness (irregular rhythm [presence of absence], uncontrolled acceleration [presence of absence]); sustained production of the phoneme /a/ for as long as possible, performed three times (duration [sec], intensity [dB]); counting from 1 to 20 (speech rate [syllables/second]). Single words intelligibility was selected due to its advantage of eliminating a number of other variables that can affect intelligibility, such as sentence level syntactic and prosodic variables. Furthermore, the use of single words to assess intelligibility is a much less difficult task for dysarthric participants than sentence level productions. As such, if an intelligibility impairment is noted at the single word level, intelligibility deficits are more than likely at higher/more complicated levels of speech productions, such as the sentence level38. A calibration tone (80 SPL dB, 1 kHz) was included at the beginning of each recording to serve as a reference in the determination of speaking amplitude. According to recent guidelines39, these parameters have been selected because they represent acoustic characteristics previously reported as altered in hypokinetic dysarthria23,27. The presence and severity of hypokinetic dysarthria were perceptually determined by two speech language pathologists, both Italian native speakers. Speakers’ severity of dysarthria was categorized on a coarse scale ranging from none, mild, moderate to severe (1: severe, 2: moderate; 3: mild: 4: none)40,41.

Statistical analysis

Descriptive statistics were used for describing demographic and clinical data. The primary objective of the study was to evaluate the long-term effects of bilateral STN-DBS on speech intelligibility in advanced PD patients. As primary outcomes, we selected the change of speech intelligibility between postoperative on-stimulation/off-medication condition and preoperative off-medication condition; postoperative on-stimulation/off-medication and off-stimulation/off-medication condition. Positive changes represented improvement of speech intelligibility while negative changes represented speech worsening. The presence of significant differences in speech intelligibility in the different conditions tested was calculated using the Friedman test with subsequent post-hoc Wilcoxon signed rank test with Bonferroni correction for multiple comparisons because of the use of multiple tests (statistical adjusted significance was set at p-value < 0.005). Based on the presence/absence of long-term postoperative worsening of speech intelligibility, patients were divided into two groups (“stable/imp” [absent or positive variation] and “worsened” [negative variation]) that were compared to find significant differences in demographic, clinical and speech variables. With regards to continuous and ordinal variables, the Mann–Whitney test was used, while for categorical variables the chi-square independence test was applied (statistical significance was set at p-value < 0.05). Secondary outcomes included: the changes of the other speech variables in the different conditions tested; the correlation between the variation of speech intelligibility and the variation in the different motor scores and subscores in the on-stimulation/off-medication condition compared with the off-stimulation/off-medication condition and in the on-stimulation/off-medication compared with preoperative off-medication condition; the correlation between the different motor scores and subscores and speech intelligibility in the postoperative conditions tested. Correlation analyses were performed by using the Spearman Correlation analysis. Statistical analysis was performed using the IBM SPSS Statistics for Windows version 20.0 (IBM, Armonk, NY, USA).