Spasmodic dysphonia (SD) is an incurable focal dystonia of the larynx that impairs speech and communication. Vibro-tactile stimulation (VTS) alters afferent proprioceptive input to sensorimotor cortex that controls speech. This proof-of-concept study examined the effect of laryngeal VTS on speech quality and cortical activity in 13 SD participants who vocalized the vowel /a/ while receiving VTS for 29 minutes. In response to VTS, 9 participants (69%) exhibited a reduction of voice breaks and/or a meaningful increase in smoothed cepstral peak prominence, an acoustic measure of voice/speech quality. Symptom improvements persisted for 20 minutes past VTS. Application of VTS induced a significant suppression of theta band power over the left somatosensory-motor cortex and a significant rise of gamma rhythm over right somatosensory-motor cortex. Such suppression of theta oscillations is observed in patients with cervical dystonia who apply effective sensory tricks, suggesting that VTS in SD may activate a similar neurophysiological mechanism. Results of this feasibility study indicate that laryngeal VTS modulates neuronal synchronization over sensorimotor cortex, which can induce short-term improvements in voice quality. The effects of long-term VTS and its optimal dosage for treating voice symptoms in SD are still unknown and require further systematic study.
Spasmodic dysphonia (SD) is a speech-specific focal dystonia of the larynx that typically develops in middle adulthood between 40–50 years of age1. SD leads to the formation of voice breaks and/or a strained or choked speech2. Two major classes of SD have been identified: adductor (ADSD), characterized by involuntary vocal fold closure; and abductor (ABSD), exhibiting excessive vocal fold opening1. Currently, SD is primarily treated with Botulinum neurotoxin injection (BoNT), which despite its effectiveness, is an invasive method and only temporarily relieves voice symptoms3.
The underlying neural mechanism of SD is not entirely understood, but it is known to involve structural and functional alterations in the basal ganglia–thalamo-cortical circuitry, the brainstem, and the cerebellum4,5,6,7,8. Several forms of focal dystonia, including cervical dystonia (CD), blepharospasm, and spasmodic dysphonia present with somatosensory system abnormalities even in non-dystonic muscles9,10,11,12. This suggests that while the motor symptoms of dystonia are focal, the corresponding somatosensory impairments are general.
Furthermore, it is known that in cervical dystonia successful sensory tricks (geste antagoniste) can ease dystonic symptoms by touching areas over or near the dystonic musculature; a phenomenon that sheds light on the link between abnormal somatosensation and the dystonic motor manifestations13,14. In addition, vibro-tactile stimulation (VTS) has been shown to reduce the severity of dystonic postures. For example, vibration over dystonic neck muscles induced immediate head righting and temporarily restored upright head posture in people with torticollis15. It has long been established that VTS in the range of 40–100 Hz stimulates the mechanoreceptors and muscle spindles that affect motor behavior16,17,18,19. However, the knowledge on the distribution and function of somatosensory receptors within the laryngeal musculature is still inconclusive. A series of neuroanatomical, histochemical, and electron microscopic studies supported the existence of muscle spindles in the larynx20,21,22,23,24, while others failed to provide support for the existence of intrafusal muscle fibers in cricothyroid and thyroarytenoid muscles25 or could not elicit stretch responses in these areas26.
Moreover, the mucosa of the epiglottis is known to have an array of mechanoreceptors responsive to mechanical stimulation between 10–70 Hz with a depression amplitude of <100μm27,28. It was further demonstrated that the sensory basis for the laryngeal adductor response is dependent on the stimulation of mechanoreceptors in the laryngeal mucosa in the cat and humans26,29. Moreover, the discovery of Krause type sensory corpuscles at the free edge of vocal cords is consistent with the notion that signals from these mechanoreceptors are not only used for the control of swallowing but also for voice control28.
Recent work from our group demonstrated upper limb proprioceptive deficits in SD12. Moreover, studies on the effects of sensory stimulation of the larynx have indicated a reduced inhibition of laryngeal muscle responses to sensory stimulation, i.e., a reduced suppression of motor responses to laryngeal sensory stimulation5. Functional neuroimaging in SD during phonation has also shown increased central activation in the laryngeal somatosensory cortex in patients with SD8. These studies highlight different forms of somatosensory abnormalities in SD, which may contribute to the pathomechanism of the disease. This notion opens an avenue for a potential behavioral treatment for SD that seeks to modulate the somatosensory information of the laryngeal musculature to improve the speech motor output. In particular, the vibro-tactile stimulation of laryngeal muscles might be a suitable tool for this purpose, given that it is shown to alter the afferent proprioceptive signals produced by the vibrated muscle mechanoreceptors and muscle spindles16,17.
This study sought to establish the feasibility of laryngeal VTS as a non-invasive neuromodulation treatment for spasmodic dysphonia. We pursued two specific aims: First, to demonstrate that prolonged VTS can induce short-term acute changes in speech quality. Second, to document the changes in cortical activity in SD that are associated with the application of laryngeal VTS. To evaluate the effectiveness of laryngeal VTS we had people with SD vocalize the vowel /a/ while receiving VTS for a total of 29.4 minutes. We derived markers of speech quality and analyzed the neuronal response to VTS over sensorimotor cortex.
Improvement of measures of speech quality in response to laryngeal VTS
We recorded the voice of 13 SD participants as they read a list of sentences devised for the speech evaluation of SD30 at 4 different time stamps along the experimental protocol: Prior to VTS (Pretest), after 14.7 minutes of VTS (Post-set 1), after 29.4 minutes of VTS (Post-set 2) and 20 minutes past the cessation of VTS (Retention) (see Fig. 1B for an overview). Subsequently, we derived the number of voice breaks and smoothed cepstral peak prominence (CPPS) as measures of speech quality from the acoustic signal. CPPS is based on the acoustic signal’s power spectrum and correlates strongly with the severity of SD voice symptoms31 (for details, see Method: Measures of speech quality).
Nine out of 13 participants (69%) responded to VTS and showed a reduction of the number of voice breaks and/or a rise of CPPS (>+1 dB) at Post-set 1 and/or the Post-set 2 as compared to Pretest. The remaining four participants did not show a consistent response to VTS as quantified by a rise in CPPS. It is noteworthy that none of the non-responding patients exhibited voice breaks at Pretest (see Fig. 2; bottom panels). Improvements in both speech quality measures for the responders were preserved at the retention stage (see Fig. 2; top panels). As a group, participants showed a significant rise of CPPS after 14.7 minutes of laryngeal VTS in comparison to their Pretest (p = 0.02, d = 1.06), which was retained at 20 minutes past the last application of VTS (p = 0.006, d = 1.15; see Fig. 2). The corresponding effect sizes were large (Cohen’s d >0.8) at both time stamps. In addition, 6 out of 7 SD participants (86%) who exhibited voice breaks at Pretest, showed a reduction of voice breaks in response to laryngeal VTS with 4 patients having no voice breaks after 14.7 minutes of VTS (see Fig. 2 and Table 1).
Change in the cortical oscillatory behavior in response to laryngeal VTS
The effect of VTS on cortical oscillatory activity over somatosensory and premotor cortex resulted in an almost immediate suppression of low-frequency oscillations as illustrated in an exemplar time-frequency plot of one SD participant in Fig. 3. For the complete patient sample, the application of VTS during the first 14.7 minutes (Set 1; see Fig. 1B) induced a significant event-related desynchronization of cortical theta-band oscillations over the left somatosensory, motor, and premotor cortex (C5: p = 0.049, d = 0.82; CP5: p = 0.049, d = 0.47; FC5: p = 0.049, d = 0.65; see Fig. 4, top panels), and a significant immediate rise of the somatosensory and motor cortical gamma power over the right hemisphere: (C6: p = 0.037, d = 0.48; CP6: p = 0.037, d = 0.39; FC6: p = 0.029, d = 0.51). After participants had received VTS in Set 2, a similar pattern of cortical activity emerged. It again resulted in a significant desynchronization of theta oscillations over the left motor cortical area (C5: p = 0.012, d = 0.83), and a significant rise of gamma oscillations over the right somatosensory-motor cortical regions: (C6: p = 0.015, d = 0.38; CP6: p = 0.027, d = 0.36; FC6: p = 0.015, d = 0.47; see Fig. 4, bottom panels).
There were no significant changes of theta spectral power over the right hemisphere, or of gamma band power over the left hemisphere. Similarly, assessment of ERSP in other frequency bands (alpha and beta) did not reveal any significant changes pre- versus post-VTS for any of the electrodes over left/right hemispheres (all p’s > 0.05).
We also performed a Pearson’s correlation analysis between the change in behavioral markers of voice quality (CPPS or the number of voice breaks) and theta/gamma ERSP for all participants collectively (responders and non-responders). No significant correlational relationships were observed. We then repeated the same analysis only on the responder group for whom either a rise in the CPPS or a decline in the number of voice breaks was observed (SD1, SD2, SD4, SD5, SD6, SD7, SD8, SD9, SD10). Again, no significant relationships were found.
A subsequent coherence analysis examined potential differences in the spectral characteristics of somatosensory-motor cortical interactions in each hemisphere. This analysis found no evidence that laryngeal VTS significantly affected the inter-regional spectral coherence between somatosensory and motor cortical areas within each hemisphere (all p’s > 0.05).
This pilot-feasibility study explored whether laryngeal vibro-tactile stimulation can provide benefits for patients with spasmodic dysphonia by monitoring its short-term effects on voice quality and the associated activity over laryngeal somatosensory-motor cortical areas. The main findings of this research are as follows: First, a one-time application of laryngeal VTS resulted in the significant improvement of two standard measures of voice/speech quality in 69% of the patients. The effect persisted for at least 20 minutes after the cessation of VTS. What seemed to discriminate the “responders” from those participants, who received little to no benefits from VTS, was that “non-responders” were more mildly affected and had no voice breaks prior to receiving VTS (see Fig. 2). Second, the application of laryngeal VTS induced an immediate significant suppression of theta band synchronization over the left somatosensory-motor cortex and the immediate significant rise of gamma band synchronization over the right somatosensory-motor cortical region.
Possible mechanisms behind the effectiveness of laryngeal VTS for improving speech in SD
Abnormal kinaesthetic function has been reported in non-dystonic limbs and muscle systems in SD12 and other forms of focal dystonia such as blepharospasm and cervical dystonia10. This implies that a more generalized somatosensory deficit underlies or is associated with the focal motor dysfunction in dystonia. We here explored if modulating somatosensory inputs could provide an avenue for a missing behavioral treatment for SD. Our approach of applying vibro-tactile stimulation constitutes a form of non-invasive neuromodulation that alters the output of afferent proprioceptive and tactile mechanoreceptors16,17, which is then centrally processed. Among the prominent neuropathological features of dystonia are reduced neuronal discharge rates and altered discharge patterns within the basal ganglia-thalamo-cortical motor circuitry32. Invasive neuromodulation techniques, such as deep brain stimulation, attempt to normalize the irregular neuronal discharge patterns by applying high-frequency impulses to targeted subcortical nuclei33,34 with the aim to restore the activity of upstream motor cortical networks. Here we suggest that a non-invasive high-frequency peripheral stimulation via laryngeal VTS may similarly modulate the discharge patterns of neurons in the somatosensory-motor speech network35, which can positively affect the speech motor output in SD.
Modulation of cortical oscillations in response to laryngeal VTS
We recorded EEG signals to understand how laryngeal VTS affects cortical activity in SD. We found that in our sample of SD participants applying laryngeal VTS was associated with a significant suppression of theta band power oscillations over the left somatosensory-motor cortex and a significant rise of gamma rhythm over right somatosensory-motor cortex (see Fig. 4). Theta oscillations are detectable in a number of brain nuclei, including the striatum36,37. Previous research identified abnormal theta oscillations at subcortical and cortical levels in other forms of focal dystonia such as cervical dystonia38,39. These abnormal theta oscillations in globus pallidus internus significantly correlate with the severity of symptoms in cervical dystonia40.
The susceptibility of focal dystonia to somatosensory stimulation has long been recognized as patients with task-specific dystonia may use sensory tricks (geste antagoniste) to alleviate dystonic symptoms temporarily by touching or pressing areas of or near the dystonic musculature. The neurophysiological correlate of an effective sensory trick is the suppression of abnormal cortical theta oscillations in CD41. The similarity between our EEG finding of suppressed theta band power in SD and the one reported for patients with CD41, suggests that the improvement of abnormal speech motor output in SD via laryngeal VTS may activate the same neurophysiological mechanism underlying an effective sensory trick in CD.
Another identified feature of modulated sensorimotor cortical processing due to VTS was the rise of gamma rhythm over right somatosensory-motor cortex. Gamma band oscillations are believed to form through the activation of excitatory pyramidal neurons and inhibitory interneurons regulated by the GABA-mediated synaptic current42. The synchronization of gamma oscillations underlies task-specific functions such as somatosensory processing43 and motor preparation42,44. Gamma activity in the 40 Hz range has been detected during speech45. Movement-induced changes in gamma amplitude seem to reflect the processing of afferent proprioceptive feedback in motor cortex46,47. Moreover, a rise of subcortical gamma-band synchronization correlates with the amplitude and velocity of hand movements, highlighting its involvement in the neural control of movement48. Given the empirical evidence showing that cortical gamma band activity underlies volitional motor control, our finding of a VTS-induced rise of gamma oscillations cortical areas involved in voice and speech motor control, indicates that laryngeal VTS alters information processing within speech cortical networks, which positively influences the voice quality of people with SD.
Limitations of the study
This proof-of-concept study yielded initial evidence that laryngeal VTS can improve voice symptoms in SD. A main limitation of this study is the lack of a control SD group that would allow for the systematic examination of possible confounding placebo or practice effects. Although we cannot exclude the possibility that the observed improvements in voice symptoms constitute a placebo effect, we do know from our pilot work that attaching the vibrators to the skin above the voice box (without being turned on) does not improve voice quality in SD. That is, it is unlikely that mere tactile stimulation would suffice in reducing voice symptoms. Moreover, there are no reports indicating that touching the neck constitutes a widely used and effective sensory trick in SD. In addition, the observed improvements in voice quality are not explained as a Hawthorne or special attention effect. On the contrary, as these patients were tested in their symptomatic stage when speech production is exhausting and effortful, one would expect that repeated vocalization and speech over more than 30 minutes results in a decline of speech, which we did not observe. Participants had not practiced the relevant test sentences prior to testing, nor is there evidence that voice symptoms in SD subside with repeated and prolonged speech. Finally, the effects on speech were observed when VTS was not applied. We recorded speech always after the end of each set of VTS (see Fig. 1B). In addition, the positive effects on markers of speech lasted for 20 minutes after the cessation of VTS.
A different drawback concerns the lack of an objective established clinical scale to classify disease severity. Understanding why and how disease severity interacts with laryngeal VTS could be very useful in predicting who would respond well and would likely be a non-responder to VTS. We choose CPPS and the number of voice breaks as prominent predictors of SD severity49. The inclusion of other outcome measures such as the consensus auditory-perceptual evaluation of voice (CAPE-V)50 may provide additional markers for examining the effectiveness of laryngeal VTS. In summary, obtaining additional outcome measures to characterize disease severity in SD and then testing the effects of VTS in a larger sample of SD patients would be clinically meaningful in understanding who responds well to laryngeal VTS and who will likely not benefit from this treatment.
This is the first study that investigated the effect of laryngeal VTS on SD voice symptoms. Its results lay the scientific foundation for a randomized clinical trial to examine the usefulness of the approach in a larger patient sample and to document the longitudinal changes in voice quality and the underlying cortical responses to laryngeal VTS in SD. Such clinical trial must address the shortcomings of this feasibility study. In a first step towards translating this knowledge into a clinical application, we are currently conducting a clinical trial, in which people with adductor SD undergo an 8-week training, in which they apply laryngeal VTS in-home (ClinicalTrials.gov Identifier: NCT03746509). Its results should solidify our knowledge on the effectiveness of VTS for treating the voice symptoms in SD.
The current study showed that the application of laryngeal VTS can result in meaningful improvements of speech quality in SD. Laryngeal VTS induced a significant suppression of theta band power over the left somatosensory-motor. A similar suppression of theta oscillations is observable in cervical dystonia patients applying effective sensory tricks, suggesting that VTS in SD may activate a similar neurophysiological mechanism.
Thirteen people with SD (8 female, 5 male; mean age ± SD: 58.6 ± 12.5 years) were recruited through the University of Minnesota Fairview Lion’s Voice Clinic. Patients receiving Botulinum neurotoxin were tested toward the end of their injection cycle when they are most symptomatic (see Tables 2 and 3 for clinical characteristics). This study was approved by Institutional Review Board of the University of Minnesota. All study participants gave written informed consent prior to study begin. No human participants under the age of 18 years were recruited for this study. The experiment was conducted in line with the relevant guidelines and regulations. The clinical trial related to this work is registered with clinicaltrials.gov (Study identifier: NCT03746509; first posted on 19 November 2018).
A potential concern when examining SD patients medicated with BoNT is vocal fold immobility, often occurring at higher dosage, one-sided BoNT injections (>5 units). Nevertheless, in this study, patients were given bilateral low-dose injections (0.2–2 units per vocal fold), while the dosage of injection was determined according to the severity of voice symptoms. This technique reduces the possibility of occurrence of vocal fold immobility. Another concern might be the bowing of vocal folds that occasionally appears shortly after BoNT injections. However, this condition also disappears with the improvement of voice/re-emergence of vocal spasms51. Accordingly, because the experimental session was held only after the recurrence of the symptoms, it was unlikely that vocal fold immobility/bowing occurred in our sample of study participants.
As stimulators, we used a pair of lightweight encapsulated vibro-motors (Model 307–100, Pico Vibe, Precision Microdrives Ltd., London, UK; diameter: 8.8. mm, length 25 mm). The vibro-motors were attached bilaterally on the lateral area of the thyroid cartilage at the height of vocal folds. Preliminary work in our laboratory with healthy human volunteers showed that a vibration frequency of 100 Hz with these vibrators generates peaks in the power spectrum of the voice signal that are within the frequency range known to stimulate laryngeal mechanoreceptors in animals27 or induce kinaesthetic illusions in humans which are known to be based on muscle spindle input18,19. Accordingly, the vibration frequency for VTS was set at100Hz in this study. Thus, we could reasonably assume that besides the tactile receptors of the skin above the voice box, laryngeal mechanoreceptors were also stimulated. At 100Hz, vibration frequency the vibration amplitude of the vibro-motors was ~1.7 G (1 G = 9.81 m/s2).
Electroencephalographic (EEG) data were recorded with the ActiveTwo data acquisition system (Biosemi B.V. Ltd, Amsterdam, Netherlands). The sampling rate was set at 512 Hz. Brain potentials were captured via Biosemi’s 64-channel EEG cap with an equiradial system of electrode placement. A series of 250 ms long auditory cues (1000 Hz, 98 dB) generated by RPvdsEx software (Tucker-Davis Technologies Ltd., Alachua, USA) guided the study participants throughout the experiment. The auditory stimuli were presented via a pair of sound delivery tubes embedded in the left and right ear canals. The tubes were surrounded by disposable foam earplugs, which masked any auditory inputs to the ears except for the presented auditory stimuli. The same system was used to control the activation of the vibro-motors. The time-stamp of auditory cues and vibration onset/endpoint were captured simultaneously.
The experiment took place in a chamber that was electrically and acoustically isolated. Participants were seated on a comfortable chair, asked to avoid extra movements, and to focus their attention at a fixed point on the front wall. A pair of vibro-motors was attached to the skin over the participant’s laryngeal area (see Fig. 1A). Prior to the experiment, the severity of speech symptoms was evaluated by (1) reading aloud a series of standard SD symptom-eliciting sentences30; and (2) pronouncing the vowel /a/ three times, each lasting four seconds. Participants pronounced the vowels and read the sentences at their habitual pitch and loudness. All speech and voice signals were recorded for later offline analysis.
The experimental protocol comprised two blocks: (1) laryngeal vibration (VTS Only), and (2) vowel vocalization accompanied by laryngeal VTS (Vocalization + VTS). During the VTS Only condition, the laryngeal vibrators were alternately turned on and off (3 seconds ON following 3 seconds OFF), for 50 repetitions and then stayed ON continuously for the final 3 minutes. During the Vocalization + VTS condition, participants received an auditory cue (1000 Hz, 98 dB) for 250 ms, and then vocalized the vowel /a/ continuously for 4 seconds. During the second half of this vocalization period, laryngeal VTS was applied (see Fig. 3). Participants stopped vocalization with the cessation of laryngeal VTS. This procedure was repeated 50 times with 4-second long resting intervals in between trials. Participants received VTS in two sets with each set lasting 14.7 minutes. Between sets, at the end of set 2, and 20 minutes after the cessation of VTS (Retention), we evaluated voice/speech quality using the same assessment tasks given at Pretest (see Fig. 1B). The duration of the retention period was arbitrarily picked between the minimum and maximum duration of VTS application (>14.7 min and <29.4 min). For further details, see S2. Supplementary Notes.
Measures of speech quality
Participants read two sets of standard sentences30 devised for the speech evaluation of people with adductor and abductor spasmodic dysphonia in their normal conversational style (see S1 Supplementary Methods). Assessment of these recorded voice data was performed offline. Two voice measures were obtained: (1) the number of voice breaks (VB), and (2) the change in the cepstral peak prominence (CPP) of voice52. CPP is an acoustic measure of speech quality defined as the logarithm of the Fourier Transform of the signal’s power spectrum. CPP is the difference between the amplitude of the cepstral peak and the estimated value on the regression line right below the cepstral peak. The higher the relative amplitude of the cepstral peak of a voice signal, the more a well-defined harmonic structure of the voice exists. Subsequently, the CPP signal was smoothed by averaging the cepstral magnitude across frequencies and time31. The smoothed measure of CPP referred to as the CPPS is strongly correlated with the overall dysphonia severity53. In our analyses, speech signals were broken into ‘voiced’ and ‘voiceless’ segments and CPPS values were derived only for the ‘voiced’ periods.
The PRAAT software54 was implemented for the acoustic analysis of the voice data and the derivation of CPPS values. A certified speech-language pathologist identified voice breaks by analyzing the continuous speech of the spoken sentences. Voice tremor was identified in the sustained vowels by examining the pitch tracing and the upper harmonics in narrow-band spectrograms using the PRAAT.
EEG signal processing and electrocortical measures
The EEGLab toolbox of MATLAB (The MathWorks, Natick, MA) was used for exploring the EEG data55. The averaged signal of the two external electrodes embedded over bilateral mastoid bones was used to reference all electrodes. The data were high-passed filtered at the cut-off frequency of 1 Hz to address possible baseline drifts. A zero-phase notch filter was used to remove power line noise. Next, in order to weaken the potential effect of non-cortical sources that might have been commonly captured by electrodes, each channels was re-referenced to the common average of all electrodes. Segments of EEG recordings from 1000 ms before vocalization to 4000 ms after the onset of vocalization were extracted as data epochs. We subsequently used the ‘runica’ algorithm to perform independent component analysis (ICA) on all data channels. This was followed by the implementation of an automated multiple artifact rejection algorithm ‘SASICA’56 on the resultant components to identify and remove the contaminated ICs. This algorithm recruits spatiotemporal criteria to distinguish the artifactual components. This is critically important for the identification and removal of muscle artifacts that may have contaminated the EEG data during vowel vocalization. At the end, the remaining ICs were linearly summed up and the output dataset was used for extracting the features.
As primary EEG measure, we obtained the event-related spectral perturbation (ERSP) of somatosensory-motor cortical electrodes in response to VTS. ERSP presents the logarithm of the mean event-related alteration in spectral power relative to the resting state at each frequency bin57. Band-specific features were extracted for the physiologically-relevant frequency ranges (i.e. <50 Hz): theta (4–8 Hz), alpha (8–13 Hz), beta (13–30 Hz), and the low gamma (30–49 Hz). ERSP was extracted for six sites: CP5, C5, FC5, CP6, C6, and FC6. CP5 and CP6 were nearby bilateral somatosensory cortical areas. C5 and C6 were nearby bilateral motor cortical areas. FC5 and FC6 were nearby bilateral premotor cortical regions. Indices ‘5’ and ‘6’ reflect the cortical areas close to bilateral vocalization regions over somatosensory and motor homunculi58,59.
As secondary EEG measure, we obtained the event-related coherence (ERCOH) between pairs of somatosensory-motor cortical electrodes as an indicator of the level of synchrony between the two electrodes60. ERCOH was derived for CP5-FC5 and CP6-FC6 electrode pairs. Before the computation of ERCOH, EEG epochs were pre-whitened to exclude possible autocorrelations/trends that might interfere with the data.
EEG features were extracted from the Vocalization + VTS conditions of both sets (see Fig. 1B) to investigate the immediate cortical response to VTS. For each condition, the 4000 ms long trials were divided into two segments: (1) VTS-off (before the onset of laryngeal vibration), and (2) VTS-on (after the onset of laryngeal vibration). For each participant, the ERSP measure of the average of the 50 recorded epochs was derived separately for the VTS-off and VTS-on segments. Because the first 500 ms of the VTS-off period additionally contain cortical auditory evoked potentials61 or be influenced by the reaction time of the study participants62, the first 500 ms of vocalization were excluded from further EEG analysis (i.e. the VTS-off interval was defined between 500–2000 ms after the presentation of the auditory cue).
For SD1 to SD3 no EEG data were available. Statistical comparisons of the pre- versus post-VTS cortical potentials were performed on the available EEG data of 10 participants. The Kolmogorov-Smirnov test was implemented to examine the normality of the data. Since the distribution of the data was not normal, the non-parametric Wilcoxon sign rank test was used for statistical assessments. For each frequency band and for the group of electrodes covering each hemisphere, p-values were adjusted for multiple comparisons using the Benjamini-Hochberg method63. The significance level was set at p-value = 0.05. The effect size was calculated using Cohen’s d.
The datasets generated under this study are available from the corresponding author on a reasonable request.
Castelon Konkiewitz, E. et al. Service-based survey of dystonia in Munich. Neuroepidemiology 21, 202–206 (2002).
Ludlow, C. L. Spasmodic dysphonia: a laryngeal control disorder specific to speech. J Neurosci 31, 793–393 (2011).
Watts, C., Whurr, R. & Nye, C. Botulinum toxin injections for the treatment of spasmodic dysphonia. Cochrance Database Sys Rev. https://doi.org/10.1002/14651858.CD004327.pub2 (2004).
Simonyan, K., Berman, B. D., Herscovitch, P. & Hallett, M. Abnormal striatal dopaminergic neurotransmission during rest and task production in spasmodic dysphonia. J Neurosci 33, 14705–14714, https://doi.org/10.1523/JNEUROSCI.0407-13.2013 (2013).
Ludlow, C. L., Yamashita, T., Schulz, G. M. & Deleyiannis, F. W. B. Abnormalities in long latency responses to superior laryngeal nerve stimulation in adductor spasmodic dysphonia. Ann Otol Rhinol Laryngol 104, 928–935, https://doi.org/10.1177/000348949510401203 (1995).
Samargia, S., Schmidt, R. & Kimberley, T. J. Shortened cortical silent period in adductor spasmodic dysphonia: Evidence for widespread cortical excitability. Neurosci Lett 560, 12–15, https://doi.org/10.1016/j.neulet.2013.12.007 (2014).
Simonyan, K. et al. Focal white matter changes in spasmodic dysphonia: a combined diffusion tensor imaging and neuropathological study. Brain 131, 447–459, https://doi.org/10.1093/brain/awm303 (2008).
Simonyan, K. & Ludlow, C. L. Abnormal activation of the primary somatosensory cortex in spasmodic dysphonia: an fMRI study. Cereb Cortex 20, 2749–2759, https://doi.org/10.1093/cercor/bhq023 (2010).
Maschke, M., Gomez, C. M., Tuite, P. J. & Konczak, J. Dysfunction of the basal ganglia, but not the cerebellum, impairs kinaesthesia. Brain 126, 2312–2322, https://doi.org/10.1093/brain/awg230 (2003).
Putzki, N. et al. Kinesthesia is impaired in focal dystonia. Mov Dis 21, 754–760, https://doi.org/10.1002/mds.20799 (2006).
Patel, N., Hanfelt, J., Marsh, L. & Jankovic, J. Alleviating manoeuvres (sensory tricks) in cervical dystonia. J Neurol Neurosurg Psychiatry 85, 882–884, https://doi.org/10.1136/jnnp-2013-307316 (2014).
Konczak, J., Aman, J. E. & Chen, Y.-W. Li, K.-y. & Watson, P. J. Impaired limb proprioception in adults with spasmodic dysphonia. J Voice 29, 777.e717–777.e723, https://doi.org/10.1016/j.jvoice.2014.12.010 (2015).
Kägi, G. et al. Sensory tricks in primary cervical dystonia depend on visuotactile temporal discrimination. Mov Dis 28, 356–361, https://doi.org/10.1002/mds.25305 (2013).
Poisson, A. et al. History of the ‘geste antagoniste’ sign in cervical dystonia. J Neurol 259, 1580–1584, https://doi.org/10.1007/s00415-011-6380-7 (2012).
Karnath, H., Konczak, J. & Dichgans, J. Effect of prolonged neck muscle vibration on lateral head tilt in severe spasmodic torticollis. J Neurol Neurosurg Psychiatry 69, 658–660, https://doi.org/10.1136/jnnp.69.5.658 (2000).
Bianconi, R. & Van Der Meulen, J. P. The response to vibraion of the end organs of mammalian muscle spindels. J Neurophysiol 26, 177–190, https://doi.org/10.1152/jn.19126.96.36.199 (1963).
Brown, M. C. & Matthews, E. I. PB. The use of vibration as a selective repetitive stimulus for Ia afferent fibres. J Physiol 191, 31P–32P (1967).
Cordo, P., Gurfinkel, V. S., Bevan, L. & Kerr, G. K. Proprioceptive consequences of tendon vibration during movement. J Neurophysiol 74, 1675–1688, https://doi.org/10.1152/jn.19188.8.131.525 (1995).
Cordo, P. J., Gurfinkel, V. S., Brumagne, S. & Flores-Vieira, C. Effect of slow, small movement on the vibration-evoked kinesthetic illusion. Exp Brain Res 167, 324–334, https://doi.org/10.1007/s00221-005-0034-x (2005).
Baken, R. J. & Noback, C. R. Neuromuscular spindles in intrinsic muscles of a human larynx. J Speech Lang Hearing Res 14, 513–518, https://doi.org/10.1044/jshr.1403.513 (1971).
Grim, M. Muscle spindles in the posterior cricoarytenoid muscle of the human larynx. Folia Morphologia (Praha) 15, 124–131 (1967).
Hirayama, M., Matsui, T., Tachibana, M., Ibata, Y. & Mizukoshi, O. An electron microscopic study of the muscle spindle in the arytenoid muscle of the human larynx. Eur Arch Otorhinolarynigol 244, 249–252 (1987).
Paulsen, K. Occurrence & number of muscle spindles in internal laryngeal muscles of humans (m. cricoarytenoideus & m. cricothyreoideus)]. Z Zellforsch Mikro Anat 48, 349–355 (1958).
Tellis, C. M., Rosen, C., Thekdi, A. & Sciote, J. J. Anatomy and fiber type composition of human interarytenoid muscle. Ann Otol Rhinol Laryngol 113, 97–107 (2004).
Brandon, C. A. et al. Staining of human thyroarytenoid muscle with myosin antibodies reveals some unique extrafusal fibers, but no muscle spindles. J Voice 17, 245–254 (2003).
Loucks, T. M. J., Poletto, C. J., Saxon, K. G. & Ludlow, C. L. Laryngeal muscle responses to mechanical displacement of the thyroid cartilage in humans. J App Physiol 99, 922–930, https://doi.org/10.1152/japplphysiol.00402.2004 (2005).
Davis, P. J. & Nail, B. S. Quantitative analysis of laryngeal mechanosensitivity in the cat and rabbit. J Physiol 388, 467–485 (1987).
Nagai, T. Encapsulated sensory corpuscle in the mucosa of human vocal cord: An electron microscope study. Arch Histol Jap 45, 145–153, https://doi.org/10.1679/aohc.45.145 (1982).
Andreatta, R. D., Mann, E. A., Poletto, C. J. & Ludlow, C. L. Mucosal afferents mediate laryngeal adductor responses in the cat. J Appl Physiol (1985) 93, 1622–1629, https://doi.org/10.1152/japplphysiol.00417.2002 (2002).
Woodson, G. E. In Laryngeal Evaluation: Indirect Laryngoscopy to High-Speed Digital Imaging (ed Katherine A. Kendall; Rebecca J. Leonard) Ch. 25, (Thieme, 2010).
Maryn, Y., Roy, N., De Bodt, M., Van Cauwenberge, P. & Corthals, P. Acoustic measurement of overall voice quality: a meta-analysis. J Acoust Soc Am 126, 2619–2634, https://doi.org/10.1121/1.3224706 (2009).
Vitek, J. L. Pathophysiology of dystonia: A neuronal model. Mov Dis 17, S49–S62, https://doi.org/10.1002/mds.10142 (2002).
Hendrix, C. M. & Vitek, J. L. Toward a network model of dystonia. Ann New York Acad Sci 1265, 46–55, https://doi.org/10.1111/j.1749-6632.2012.06692.x (2012).
Johnson, M. D., Miocinovic, S., McIntyre, C. C. & Vitek, J. L. Mechanisms and targets of deep brain stimulation in movement disorders. Neurotherapeutics 5, 294–308, https://doi.org/10.1016/j.nurt.2008.01.010 (2008).
Ludlow, C. L. Central nervous system control of voice and swallowing. J Clin Neurophysiol 32, 294–303, https://doi.org/10.1097/WNP.0000000000000186 (2015).
Berke, J. D., Okatan, M., Skurski, J. & Eichenbaum, H. B. Oscillatory entrainment of striatal neurons in freely moving rats. Neuron 43, 883–896, https://doi.org/10.1016/j.neuron.2004.08.035 (2004).
Goto, Y. & O’Donnell, P. Synchronous activity in the hippocampus and nucleus accumbens in vivo. J Neurosci 21, RC131–RC131, https://doi.org/10.1523/JNEUROSCI.21-04-j0003.2001 (2001).
Liu, X. et al. Involvement of the medial pallidum in focal myoclonic dystonia: A clinical and neurophysiological case study. Mov Dis 17, 346–353 (2002).
Liu, X. et al. The sensory and motor representation of synchronized oscillations in the globus pallidus in patients with primary dystonia. Brain 131, 1562–1573, https://doi.org/10.1093/brain/awn083 (2008).
Neumann, W.-J. et al. A localized pallidal physiomarker in cervical dystonia. Ann. Neurol. 82, 912–924, https://doi.org/10.1002/ana.25095 (2017).
Tang, J. K. et al. Changes in cortical and pallidal oscillatory activity during the execution of a sensory trick in patients with cervical dystonia. Exp Neurol 204, 845–848 (2007).
Nowak, M., Zich, C. & Stagg, C. J. Motor cortical gamma oscillations: what have we learnt and where are we headed? Curr Behav Neurosci Rep 5, 136–142, https://doi.org/10.1007/s40473-018-0151-z (2018).
Bauer, M., Oostenveld, R., Peeters, M. & Fries, P. Tactile spatial attention enhances gamma-band activity in somatosensory cortex and reduces low-frequency activity in parieto-occipital areas. J Neurosci 26, 490, https://doi.org/10.1523/JNEUROSCI.5228-04.2006 (2006).
Engel, A. K., Fries, P. & Singer, W. Dynamic predictions: oscillations and synchrony in top–down processing. Nat Rev Neurosci 2, 704–716, https://doi.org/10.1038/35094565 (2001).
Palva, S. et al. Distinct gamma-band evoked responses to speech and non-speech sounds in humans. J Neurosci 22, RC211–RC211, https://doi.org/10.1523/JNEUROSCI.22-04-j0003.2002 (2002).
Miller, K. J. et al. Cortical activity during motor execution, motor imagery, and imagery-based online feedback. Proc Natl Acad Sci USA 107, 4430–4435, https://doi.org/10.1073/pnas.0913697107 (2010).
Muthukumaraswamy, S. D. Functional properties of human primary motor cortex gamma oscillations. J Neurophysiol 104, 2873–2885, https://doi.org/10.1152/jn.00607.2010 (2010).
Brücke, C. et al. Scaling of movement is related to pallidal γ oscillations in patients with dystonia. J Neurosci 32, 1008, https://doi.org/10.1523/JNEUROSCI.3860-11.2012 (2012).
Peterson, E. A. et al. Toward validation of the cepstral spectral index of dysphonia (CSID) as an objective treatment outcomes measure. J Voice 27, 401–410, https://doi.org/10.1016/j.jvoice.2013.04.002 (2013).
Kempster, G. B., Gerratt, B. R., Abbott, K. V., Barkmeier-Kraemer, J. & Hillman, R. E. Consensus auditory-perceptual evaluation of voice: development of a standardized clinical protocol. Am J Speech Lang Pathol 18, 124–132, https://doi.org/10.1044/1058-0360(2008/08-0017) (2009).
Ludlow, C. L. et al. Consensus-Based Attributes for Identifying Patients With Spasmodic Dysphonia and Other Voice Disorders. JAMA. Otolaryngol Head Neck Surg 144, 657–665, https://doi.org/10.1001/jamaoto.2018.0644 (2018).
Fraile, R. & Godino-Llorente, J. I. Cepstral peak prominence: a comprehensive analysis. Biomed Signal Process Control 14, 42–54, https://doi.org/10.1016/j.bspc.2014.07.001 (2014).
Hillenbrand, J. & Houde, R. A. Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Lang Hearing Res 39, 311–321, https://doi.org/10.1044/jshr.3902.311 (1996).
Boersma, P. & van Heuven, V. Speak and unspeak with PRAAT. Glot International 5, 341–347 (2001).
Delorme, A. & Makeig, S. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Method 134, 9–21, https://doi.org/10.1016/j.jneumeth.2003.10.009 (2004).
Chaumon, M., Bishop, D. V. M. & Busch, N. A. A practical guide to the selection of independent components of the electroencephalogram for artifact correction. J Neurosci Method 250, 47–63, https://doi.org/10.1016/j.jneumeth.2015.02.025 (2015).
Makeig, S. Auditory event-related dynamics of the EEG spectrum and effects of exposure to tones. Electroencephalog. Clin Neurophysiol 86, 283–293, https://doi.org/10.1016/0013-4694(93)90110-H (1993).
Caviness, J. N., Liss, J. M., Adler, C. & Evidente, V. Analysis of high-frequency electroencephalographic-electromyographic coherence elicited by speech and oral nonspeech tasks in parkinson’s disease. J Speech Lang Hearing Res 49, 424–438, https://doi.org/10.1044/1092-4388(2006/033) (2006).
Mor, N., Simonyan, K. & Blitzer, A. Central voice production and pathophysiology of spasmodic dysphonia. Laryngoscope 128, 177–183, https://doi.org/10.1002/lary.26655 (2018).
Pfurtscheller, G. & Andrew, C. Event-related changes of band power and coherence: methodology and interpretation. J Clin Neurophysiol 16 (1999).
Alvarenga, K. F. et al. The influence of speech stimuli contrast in cortical auditory evoked potentials. Brazil J Otorhinolaryngol 79, 336–341 (2013).
Santee, J. L. & Kohfeld, D. L. Auditory reaction time as a function of stimulus intensity, frequency, and rise time. Bull Psychonom Soc 10, 393–396, https://doi.org/10.3758/BF03329370 (1977).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc Series B 57, 289–300 (1995).
We sincerely thank all participants who attended the study, as well as the Center for Applied and Translational Sensory Sciences at the University of Minnesota for providing the recording space and facility. This research was supported by grants from the U.S. National Institutes of Health NIH 1R21DC011841 to PW and JK and by NIH 1 R01 DC016315-01A1 to JK.
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Khosravani, S., Mahnan, A., Yeh, I. et al. Laryngeal vibration as a non-invasive neuromodulation therapy for spasmodic dysphonia. Sci Rep 9, 17955 (2019). https://doi.org/10.1038/s41598-019-54396-4
Scientific Reports (2020)
What Is New in Laryngeal Dystonia: Review of Novel Findings of Pathophysiology and Novel Treatment Options
Current Otorhinolaryngology Reports (2020)
Validation of the Communicative Participation Item Bank as an Outcome Measure for Spasmodic Dysphonia
The Laryngoscope (2020)