Laryngeal vibration as a non-invasive neuromodulation therapy for spasmodic dysphonia

Khosravani, Sanaz; Mahnan, Arash; Yeh, I-Ling; Aman, Joshua E.; Watson, Peter J.; Zhang, Yang; Goding, George; Konczak, Jürgen

doi:10.1038/s41598-019-54396-4

Download PDF

Article
Open access
Published: 29 November 2019

Laryngeal vibration as a non-invasive neuromodulation therapy for spasmodic dysphonia

Sanaz Khosravani¹,
Arash Mahnan ORCID: orcid.org/0000-0003-4097-9308¹,
I-Ling Yeh^1,2,
Joshua E. Aman³,
Peter J. Watson⁴,
Yang Zhang⁴,
George Goding⁵ &
…
Jürgen Konczak ORCID: orcid.org/0000-0002-4422-5610¹

Scientific Reports volume 9, Article number: 17955 (2019) Cite this article

7117 Accesses
16 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Spasmodic dysphonia (SD) is an incurable focal dystonia of the larynx that impairs speech and communication. Vibro-tactile stimulation (VTS) alters afferent proprioceptive input to sensorimotor cortex that controls speech. This proof-of-concept study examined the effect of laryngeal VTS on speech quality and cortical activity in 13 SD participants who vocalized the vowel /a/ while receiving VTS for 29 minutes. In response to VTS, 9 participants (69%) exhibited a reduction of voice breaks and/or a meaningful increase in smoothed cepstral peak prominence, an acoustic measure of voice/speech quality. Symptom improvements persisted for 20 minutes past VTS. Application of VTS induced a significant suppression of theta band power over the left somatosensory-motor cortex and a significant rise of gamma rhythm over right somatosensory-motor cortex. Such suppression of theta oscillations is observed in patients with cervical dystonia who apply effective sensory tricks, suggesting that VTS in SD may activate a similar neurophysiological mechanism. Results of this feasibility study indicate that laryngeal VTS modulates neuronal synchronization over sensorimotor cortex, which can induce short-term improvements in voice quality. The effects of long-term VTS and its optimal dosage for treating voice symptoms in SD are still unknown and require further systematic study.

Hyperactive sensorimotor cortex during voice perception in spasmodic dysphonia

Article Open access 14 October 2020

Impaired auditory discrimination and auditory-motor integration in hyperfunctional voice disorders

Article Open access 23 June 2021

A novel theta-controlled vibrotactile brain–computer interface to treat chronic pain: a pilot study

Article Open access 10 February 2024

Introduction

Spasmodic dysphonia (SD) is a speech-specific focal dystonia of the larynx that typically develops in middle adulthood between 40–50 years of age¹. SD leads to the formation of voice breaks and/or a strained or choked speech². Two major classes of SD have been identified: adductor (ADSD), characterized by involuntary vocal fold closure; and abductor (ABSD), exhibiting excessive vocal fold opening¹. Currently, SD is primarily treated with Botulinum neurotoxin injection (BoNT), which despite its effectiveness, is an invasive method and only temporarily relieves voice symptoms³.

The underlying neural mechanism of SD is not entirely understood, but it is known to involve structural and functional alterations in the basal ganglia–thalamo-cortical circuitry, the brainstem, and the cerebellum^4,5,6,7,8. Several forms of focal dystonia, including cervical dystonia (CD), blepharospasm, and spasmodic dysphonia present with somatosensory system abnormalities even in non-dystonic muscles^9,10,11,12. This suggests that while the motor symptoms of dystonia are focal, the corresponding somatosensory impairments are general.

Furthermore, it is known that in cervical dystonia successful sensory tricks (geste antagoniste) can ease dystonic symptoms by touching areas over or near the dystonic musculature; a phenomenon that sheds light on the link between abnormal somatosensation and the dystonic motor manifestations^13,14. In addition, vibro-tactile stimulation (VTS) has been shown to reduce the severity of dystonic postures. For example, vibration over dystonic neck muscles induced immediate head righting and temporarily restored upright head posture in people with torticollis¹⁵. It has long been established that VTS in the range of 40–100 Hz stimulates the mechanoreceptors and muscle spindles that affect motor behavior^16,17,18,19. However, the knowledge on the distribution and function of somatosensory receptors within the laryngeal musculature is still inconclusive. A series of neuroanatomical, histochemical, and electron microscopic studies supported the existence of muscle spindles in the larynx^{20,21,22,23,24}, while others failed to provide support for the existence of intrafusal muscle fibers in cricothyroid and thyroarytenoid muscles²⁵ or could not elicit stretch responses in these areas²⁶.

Moreover, the mucosa of the epiglottis is known to have an array of mechanoreceptors responsive to mechanical stimulation between 10–70 Hz with a depression amplitude of <100μm^27,28. It was further demonstrated that the sensory basis for the laryngeal adductor response is dependent on the stimulation of mechanoreceptors in the laryngeal mucosa in the cat and humans^26,29. Moreover, the discovery of Krause type sensory corpuscles at the free edge of vocal cords is consistent with the notion that signals from these mechanoreceptors are not only used for the control of swallowing but also for voice control²⁸.

Recent work from our group demonstrated upper limb proprioceptive deficits in SD¹². Moreover, studies on the effects of sensory stimulation of the larynx have indicated a reduced inhibition of laryngeal muscle responses to sensory stimulation, i.e., a reduced suppression of motor responses to laryngeal sensory stimulation⁵. Functional neuroimaging in SD during phonation has also shown increased central activation in the laryngeal somatosensory cortex in patients with SD⁸. These studies highlight different forms of somatosensory abnormalities in SD, which may contribute to the pathomechanism of the disease. This notion opens an avenue for a potential behavioral treatment for SD that seeks to modulate the somatosensory information of the laryngeal musculature to improve the speech motor output. In particular, the vibro-tactile stimulation of laryngeal muscles might be a suitable tool for this purpose, given that it is shown to alter the afferent proprioceptive signals produced by the vibrated muscle mechanoreceptors and muscle spindles^16,17.

This study sought to establish the feasibility of laryngeal VTS as a non-invasive neuromodulation treatment for spasmodic dysphonia. We pursued two specific aims: First, to demonstrate that prolonged VTS can induce short-term acute changes in speech quality. Second, to document the changes in cortical activity in SD that are associated with the application of laryngeal VTS. To evaluate the effectiveness of laryngeal VTS we had people with SD vocalize the vowel /a/ while receiving VTS for a total of 29.4 minutes. We derived markers of speech quality and analyzed the neuronal response to VTS over sensorimotor cortex.

Results

Improvement of measures of speech quality in response to laryngeal VTS

We recorded the voice of 13 SD participants as they read a list of sentences devised for the speech evaluation of SD³⁰ at 4 different time stamps along the experimental protocol: Prior to VTS (Pretest), after 14.7 minutes of VTS (Post-set 1), after 29.4 minutes of VTS (Post-set 2) and 20 minutes past the cessation of VTS (Retention) (see Fig. 1B for an overview). Subsequently, we derived the number of voice breaks and smoothed cepstral peak prominence (CPPS) as measures of speech quality from the acoustic signal. CPPS is based on the acoustic signal’s power spectrum and correlates strongly with the severity of SD voice symptoms³¹ (for details, see Method: Measures of speech quality).

Nine out of 13 participants (69%) responded to VTS and showed a reduction of the number of voice breaks and/or a rise of CPPS (>+1 dB) at Post-set 1 and/or the Post-set 2 as compared to Pretest. The remaining four participants did not show a consistent response to VTS as quantified by a rise in CPPS. It is noteworthy that none of the non-responding patients exhibited voice breaks at Pretest (see Fig. 2; bottom panels). Improvements in both speech quality measures for the responders were preserved at the retention stage (see Fig. 2; top panels). As a group, participants showed a significant rise of CPPS after 14.7 minutes of laryngeal VTS in comparison to their Pretest (p = 0.02, d = 1.06), which was retained at 20 minutes past the last application of VTS (p = 0.006, d = 1.15; see Fig. 2). The corresponding effect sizes were large (Cohen’s d >0.8) at both time stamps. In addition, 6 out of 7 SD participants (86%) who exhibited voice breaks at Pretest, showed a reduction of voice breaks in response to laryngeal VTS with 4 patients having no voice breaks after 14.7 minutes of VTS (see Fig. 2 and Table 1).

Table 1 VTS induced change in smoothed cepstral peak prominence (ΔCPPS) and the number of voice breaks (ΔVB) relative to baseline (Pretest). Unit for ΔCPPS is dB.

Full size table

Change in the cortical oscillatory behavior in response to laryngeal VTS

The effect of VTS on cortical oscillatory activity over somatosensory and premotor cortex resulted in an almost immediate suppression of low-frequency oscillations as illustrated in an exemplar time-frequency plot of one SD participant in Fig. 3. For the complete patient sample, the application of VTS during the first 14.7 minutes (Set 1; see Fig. 1B) induced a significant event-related desynchronization of cortical theta-band oscillations over the left somatosensory, motor, and premotor cortex (C5: p = 0.049, d = 0.82; CP5: p = 0.049, d = 0.47; FC5: p = 0.049, d = 0.65; see Fig. 4, top panels), and a significant immediate rise of the somatosensory and motor cortical gamma power over the right hemisphere: (C6: p = 0.037, d = 0.48; CP6: p = 0.037, d = 0.39; FC6: p = 0.029, d = 0.51). After participants had received VTS in Set 2, a similar pattern of cortical activity emerged. It again resulted in a significant desynchronization of theta oscillations over the left motor cortical area (C5: p = 0.012, d = 0.83), and a significant rise of gamma oscillations over the right somatosensory-motor cortical regions: (C6: p = 0.015, d = 0.38; CP6: p = 0.027, d = 0.36; FC6: p = 0.015, d = 0.47; see Fig. 4, bottom panels).

There were no significant changes of theta spectral power over the right hemisphere, or of gamma band power over the left hemisphere. Similarly, assessment of ERSP in other frequency bands (alpha and beta) did not reveal any significant changes pre- versus post-VTS for any of the electrodes over left/right hemispheres (all p’s > 0.05).

We also performed a Pearson’s correlation analysis between the change in behavioral markers of voice quality (CPPS or the number of voice breaks) and theta/gamma ERSP for all participants collectively (responders and non-responders). No significant correlational relationships were observed. We then repeated the same analysis only on the responder group for whom either a rise in the CPPS or a decline in the number of voice breaks was observed (SD1, SD2, SD4, SD5, SD6, SD7, SD8, SD9, SD10). Again, no significant relationships were found.

A subsequent coherence analysis examined potential differences in the spectral characteristics of somatosensory-motor cortical interactions in each hemisphere. This analysis found no evidence that laryngeal VTS significantly affected the inter-regional spectral coherence between somatosensory and motor cortical areas within each hemisphere (all p’s > 0.05).

Discussion

This pilot-feasibility study explored whether laryngeal vibro-tactile stimulation can provide benefits for patients with spasmodic dysphonia by monitoring its short-term effects on voice quality and the associated activity over laryngeal somatosensory-motor cortical areas. The main findings of this research are as follows: First, a one-time application of laryngeal VTS resulted in the significant improvement of two standard measures of voice/speech quality in 69% of the patients. The effect persisted for at least 20 minutes after the cessation of VTS. What seemed to discriminate the “responders” from those participants, who received little to no benefits from VTS, was that “non-responders” were more mildly affected and had no voice breaks prior to receiving VTS (see Fig. 2). Second, the application of laryngeal VTS induced an immediate significant suppression of theta band synchronization over the left somatosensory-motor cortex and the immediate significant rise of gamma band synchronization over the right somatosensory-motor cortical region.

Possible mechanisms behind the effectiveness of laryngeal VTS for improving speech in SD

Abnormal kinaesthetic function has been reported in non-dystonic limbs and muscle systems in SD¹² and other forms of focal dystonia such as blepharospasm and cervical dystonia¹⁰. This implies that a more generalized somatosensory deficit underlies or is associated with the focal motor dysfunction in dystonia. We here explored if modulating somatosensory inputs could provide an avenue for a missing behavioral treatment for SD. Our approach of applying vibro-tactile stimulation constitutes a form of non-invasive neuromodulation that alters the output of afferent proprioceptive and tactile mechanoreceptors^16,17, which is then centrally processed. Among the prominent neuropathological features of dystonia are reduced neuronal discharge rates and altered discharge patterns within the basal ganglia-thalamo-cortical motor circuitry³². Invasive neuromodulation techniques, such as deep brain stimulation, attempt to normalize the irregular neuronal discharge patterns by applying high-frequency impulses to targeted subcortical nuclei^33,34 with the aim to restore the activity of upstream motor cortical networks. Here we suggest that a non-invasive high-frequency peripheral stimulation via laryngeal VTS may similarly modulate the discharge patterns of neurons in the somatosensory-motor speech network³⁵, which can positively affect the speech motor output in SD.

Modulation of cortical oscillations in response to laryngeal VTS

We recorded EEG signals to understand how laryngeal VTS affects cortical activity in SD. We found that in our sample of SD participants applying laryngeal VTS was associated with a significant suppression of theta band power oscillations over the left somatosensory-motor cortex and a significant rise of gamma rhythm over right somatosensory-motor cortex (see Fig. 4). Theta oscillations are detectable in a number of brain nuclei, including the striatum^36,37. Previous research identified abnormal theta oscillations at subcortical and cortical levels in other forms of focal dystonia such as cervical dystonia^38,39. These abnormal theta oscillations in globus pallidus internus significantly correlate with the severity of symptoms in cervical dystonia⁴⁰.

The susceptibility of focal dystonia to somatosensory stimulation has long been recognized as patients with task-specific dystonia may use sensory tricks (geste antagoniste) to alleviate dystonic symptoms temporarily by touching or pressing areas of or near the dystonic musculature. The neurophysiological correlate of an effective sensory trick is the suppression of abnormal cortical theta oscillations in CD⁴¹. The similarity between our EEG finding of suppressed theta band power in SD and the one reported for patients with CD⁴¹, suggests that the improvement of abnormal speech motor output in SD via laryngeal VTS may activate the same neurophysiological mechanism underlying an effective sensory trick in CD.

Another identified feature of modulated sensorimotor cortical processing due to VTS was the rise of gamma rhythm over right somatosensory-motor cortex. Gamma band oscillations are believed to form through the activation of excitatory pyramidal neurons and inhibitory interneurons regulated by the GABA-mediated synaptic current⁴². The synchronization of gamma oscillations underlies task-specific functions such as somatosensory processing⁴³ and motor preparation^42,44. Gamma activity in the 40 Hz range has been detected during speech⁴⁵. Movement-induced changes in gamma amplitude seem to reflect the processing of afferent proprioceptive feedback in motor cortex^46,47. Moreover, a rise of subcortical gamma-band synchronization correlates with the amplitude and velocity of hand movements, highlighting its involvement in the neural control of movement⁴⁸. Given the empirical evidence showing that cortical gamma band activity underlies volitional motor control, our finding of a VTS-induced rise of gamma oscillations cortical areas involved in voice and speech motor control, indicates that laryngeal VTS alters information processing within speech cortical networks, which positively influences the voice quality of people with SD.

Limitations of the study

This proof-of-concept study yielded initial evidence that laryngeal VTS can improve voice symptoms in SD. A main limitation of this study is the lack of a control SD group that would allow for the systematic examination of possible confounding placebo or practice effects. Although we cannot exclude the possibility that the observed improvements in voice symptoms constitute a placebo effect, we do know from our pilot work that attaching the vibrators to the skin above the voice box (without being turned on) does not improve voice quality in SD. That is, it is unlikely that mere tactile stimulation would suffice in reducing voice symptoms. Moreover, there are no reports indicating that touching the neck constitutes a widely used and effective sensory trick in SD. In addition, the observed improvements in voice quality are not explained as a Hawthorne or special attention effect. On the contrary, as these patients were tested in their symptomatic stage when speech production is exhausting and effortful, one would expect that repeated vocalization and speech over more than 30 minutes results in a decline of speech, which we did not observe. Participants had not practiced the relevant test sentences prior to testing, nor is there evidence that voice symptoms in SD subside with repeated and prolonged speech. Finally, the effects on speech were observed when VTS was not applied. We recorded speech always after the end of each set of VTS (see Fig. 1B). In addition, the positive effects on markers of speech lasted for 20 minutes after the cessation of VTS.

A different drawback concerns the lack of an objective established clinical scale to classify disease severity. Understanding why and how disease severity interacts with laryngeal VTS could be very useful in predicting who would respond well and would likely be a non-responder to VTS. We choose CPPS and the number of voice breaks as prominent predictors of SD severity⁴⁹. The inclusion of other outcome measures such as the consensus auditory-perceptual evaluation of voice (CAPE-V)⁵⁰ may provide additional markers for examining the effectiveness of laryngeal VTS. In summary, obtaining additional outcome measures to characterize disease severity in SD and then testing the effects of VTS in a larger sample of SD patients would be clinically meaningful in understanding who responds well to laryngeal VTS and who will likely not benefit from this treatment.

Conclusions

This is the first study that investigated the effect of laryngeal VTS on SD voice symptoms. Its results lay the scientific foundation for a randomized clinical trial to examine the usefulness of the approach in a larger patient sample and to document the longitudinal changes in voice quality and the underlying cortical responses to laryngeal VTS in SD. Such clinical trial must address the shortcomings of this feasibility study. In a first step towards translating this knowledge into a clinical application, we are currently conducting a clinical trial, in which people with adductor SD undergo an 8-week training, in which they apply laryngeal VTS in-home (ClinicalTrials.gov Identifier: NCT03746509). Its results should solidify our knowledge on the effectiveness of VTS for treating the voice symptoms in SD.

The current study showed that the application of laryngeal VTS can result in meaningful improvements of speech quality in SD. Laryngeal VTS induced a significant suppression of theta band power over the left somatosensory-motor. A similar suppression of theta oscillations is observable in cervical dystonia patients applying effective sensory tricks, suggesting that VTS in SD may activate a similar neurophysiological mechanism.

Methods

Participants

Thirteen people with SD (8 female, 5 male; mean age ± SD: 58.6 ± 12.5 years) were recruited through the University of Minnesota Fairview Lion’s Voice Clinic. Patients receiving Botulinum neurotoxin were tested toward the end of their injection cycle when they are most symptomatic (see Tables 2 and 3 for clinical characteristics). This study was approved by Institutional Review Board of the University of Minnesota. All study participants gave written informed consent prior to study begin. No human participants under the age of 18 years were recruited for this study. The experiment was conducted in line with the relevant guidelines and regulations. The clinical trial related to this work is registered with clinicaltrials.gov (Study identifier: NCT03746509; first posted on 19 November 2018).

Table 2 Clinical characteristics of study participants.

Full size table

Table 3 Clinical and self-perceived markers of speech and voice symptom severity for study participants.

Full size table

A potential concern when examining SD patients medicated with BoNT is vocal fold immobility, often occurring at higher dosage, one-sided BoNT injections (>5 units). Nevertheless, in this study, patients were given bilateral low-dose injections (0.2–2 units per vocal fold), while the dosage of injection was determined according to the severity of voice symptoms. This technique reduces the possibility of occurrence of vocal fold immobility. Another concern might be the bowing of vocal folds that occasionally appears shortly after BoNT injections. However, this condition also disappears with the improvement of voice/re-emergence of vocal spasms⁵¹. Accordingly, because the experimental session was held only after the recurrence of the symptoms, it was unlikely that vocal fold immobility/bowing occurred in our sample of study participants.

Apparatus

As stimulators, we used a pair of lightweight encapsulated vibro-motors (Model 307–100, Pico Vibe, Precision Microdrives Ltd., London, UK; diameter: 8.8. mm, length 25 mm). The vibro-motors were attached bilaterally on the lateral area of the thyroid cartilage at the height of vocal folds. Preliminary work in our laboratory with healthy human volunteers showed that a vibration frequency of 100 Hz with these vibrators generates peaks in the power spectrum of the voice signal that are within the frequency range known to stimulate laryngeal mechanoreceptors in animals²⁷ or induce kinaesthetic illusions in humans which are known to be based on muscle spindle input^18,19. Accordingly, the vibration frequency for VTS was set at100Hz in this study. Thus, we could reasonably assume that besides the tactile receptors of the skin above the voice box, laryngeal mechanoreceptors were also stimulated. At 100Hz, vibration frequency the vibration amplitude of the vibro-motors was ~1.7 G (1 G = 9.81 m/s²).

Electroencephalographic (EEG) data were recorded with the ActiveTwo data acquisition system (Biosemi B.V. Ltd, Amsterdam, Netherlands). The sampling rate was set at 512 Hz. Brain potentials were captured via Biosemi’s 64-channel EEG cap with an equiradial system of electrode placement. A series of 250 ms long auditory cues (1000 Hz, 98 dB) generated by RPvdsEx software (Tucker-Davis Technologies Ltd., Alachua, USA) guided the study participants throughout the experiment. The auditory stimuli were presented via a pair of sound delivery tubes embedded in the left and right ear canals. The tubes were surrounded by disposable foam earplugs, which masked any auditory inputs to the ears except for the presented auditory stimuli. The same system was used to control the activation of the vibro-motors. The time-stamp of auditory cues and vibration onset/endpoint were captured simultaneously.

Experimental procedure

The experiment took place in a chamber that was electrically and acoustically isolated. Participants were seated on a comfortable chair, asked to avoid extra movements, and to focus their attention at a fixed point on the front wall. A pair of vibro-motors was attached to the skin over the participant’s laryngeal area (see Fig. 1A). Prior to the experiment, the severity of speech symptoms was evaluated by (1) reading aloud a series of standard SD symptom-eliciting sentences³⁰; and (2) pronouncing the vowel /a/ three times, each lasting four seconds. Participants pronounced the vowels and read the sentences at their habitual pitch and loudness. All speech and voice signals were recorded for later offline analysis.

The experimental protocol comprised two blocks: (1) laryngeal vibration (VTS Only), and (2) vowel vocalization accompanied by laryngeal VTS (Vocalization + VTS). During the VTS Only condition, the laryngeal vibrators were alternately turned on and off (3 seconds ON following 3 seconds OFF), for 50 repetitions and then stayed ON continuously for the final 3 minutes. During the Vocalization + VTS condition, participants received an auditory cue (1000 Hz, 98 dB) for 250 ms, and then vocalized the vowel /a/ continuously for 4 seconds. During the second half of this vocalization period, laryngeal VTS was applied (see Fig. 3). Participants stopped vocalization with the cessation of laryngeal VTS. This procedure was repeated 50 times with 4-second long resting intervals in between trials. Participants received VTS in two sets with each set lasting 14.7 minutes. Between sets, at the end of set 2, and 20 minutes after the cessation of VTS (Retention), we evaluated voice/speech quality using the same assessment tasks given at Pretest (see Fig. 1B). The duration of the retention period was arbitrarily picked between the minimum and maximum duration of VTS application (>14.7 min and <29.4 min). For further details, see S2. Supplementary Notes.

Measures of speech quality

Participants read two sets of standard sentences³⁰ devised for the speech evaluation of people with adductor and abductor spasmodic dysphonia in their normal conversational style (see S1 Supplementary Methods). Assessment of these recorded voice data was performed offline. Two voice measures were obtained: (1) the number of voice breaks (VB), and (2) the change in the cepstral peak prominence (CPP) of voice⁵². CPP is an acoustic measure of speech quality defined as the logarithm of the Fourier Transform of the signal’s power spectrum. CPP is the difference between the amplitude of the cepstral peak and the estimated value on the regression line right below the cepstral peak. The higher the relative amplitude of the cepstral peak of a voice signal, the more a well-defined harmonic structure of the voice exists. Subsequently, the CPP signal was smoothed by averaging the cepstral magnitude across frequencies and time³¹. The smoothed measure of CPP referred to as the CPPS is strongly correlated with the overall dysphonia severity⁵³. In our analyses, speech signals were broken into ‘voiced’ and ‘voiceless’ segments and CPPS values were derived only for the ‘voiced’ periods.

The PRAAT software⁵⁴ was implemented for the acoustic analysis of the voice data and the derivation of CPPS values. A certified speech-language pathologist identified voice breaks by analyzing the continuous speech of the spoken sentences. Voice tremor was identified in the sustained vowels by examining the pitch tracing and the upper harmonics in narrow-band spectrograms using the PRAAT.

EEG signal processing and electrocortical measures

The EEGLab toolbox of MATLAB (The MathWorks, Natick, MA) was used for exploring the EEG data⁵⁵. The averaged signal of the two external electrodes embedded over bilateral mastoid bones was used to reference all electrodes. The data were high-passed filtered at the cut-off frequency of 1 Hz to address possible baseline drifts. A zero-phase notch filter was used to remove power line noise. Next, in order to weaken the potential effect of non-cortical sources that might have been commonly captured by electrodes, each channels was re-referenced to the common average of all electrodes. Segments of EEG recordings from 1000 ms before vocalization to 4000 ms after the onset of vocalization were extracted as data epochs. We subsequently used the ‘runica’ algorithm to perform independent component analysis (ICA) on all data channels. This was followed by the implementation of an automated multiple artifact rejection algorithm ‘SASICA’⁵⁶ on the resultant components to identify and remove the contaminated ICs. This algorithm recruits spatiotemporal criteria to distinguish the artifactual components. This is critically important for the identification and removal of muscle artifacts that may have contaminated the EEG data during vowel vocalization. At the end, the remaining ICs were linearly summed up and the output dataset was used for extracting the features.

As primary EEG measure, we obtained the event-related spectral perturbation (ERSP) of somatosensory-motor cortical electrodes in response to VTS. ERSP presents the logarithm of the mean event-related alteration in spectral power relative to the resting state at each frequency bin⁵⁷. Band-specific features were extracted for the physiologically-relevant frequency ranges (i.e. <50 Hz): theta (4–8 Hz), alpha (8–13 Hz), beta (13–30 Hz), and the low gamma (30–49 Hz). ERSP was extracted for six sites: CP5, C5, FC5, CP6, C6, and FC6. CP5 and CP6 were nearby bilateral somatosensory cortical areas. C5 and C6 were nearby bilateral motor cortical areas. FC5 and FC6 were nearby bilateral premotor cortical regions. Indices ‘5’ and ‘6’ reflect the cortical areas close to bilateral vocalization regions over somatosensory and motor homunculi^58,59.

As secondary EEG measure, we obtained the event-related coherence (ERCOH) between pairs of somatosensory-motor cortical electrodes as an indicator of the level of synchrony between the two electrodes⁶⁰. ERCOH was derived for CP5-FC5 and CP6-FC6 electrode pairs. Before the computation of ERCOH, EEG epochs were pre-whitened to exclude possible autocorrelations/trends that might interfere with the data.

EEG features were extracted from the Vocalization + VTS conditions of both sets (see Fig. 1B) to investigate the immediate cortical response to VTS. For each condition, the 4000 ms long trials were divided into two segments: (1) VTS-off (before the onset of laryngeal vibration), and (2) VTS-on (after the onset of laryngeal vibration). For each participant, the ERSP measure of the average of the 50 recorded epochs was derived separately for the VTS-off and VTS-on segments. Because the first 500 ms of the VTS-off period additionally contain cortical auditory evoked potentials⁶¹ or be influenced by the reaction time of the study participants⁶², the first 500 ms of vocalization were excluded from further EEG analysis (i.e. the VTS-off interval was defined between 500–2000 ms after the presentation of the auditory cue).

Statistical analysis

For SD1 to SD3 no EEG data were available. Statistical comparisons of the pre- versus post-VTS cortical potentials were performed on the available EEG data of 10 participants. The Kolmogorov-Smirnov test was implemented to examine the normality of the data. Since the distribution of the data was not normal, the non-parametric Wilcoxon sign rank test was used for statistical assessments. For each frequency band and for the group of electrodes covering each hemisphere, p-values were adjusted for multiple comparisons using the Benjamini-Hochberg method⁶³. The significance level was set at p-value = 0.05. The effect size was calculated using Cohen’s d.

Data availability

The datasets generated under this study are available from the corresponding author on a reasonable request.

References

Castelon Konkiewitz, E. et al. Service-based survey of dystonia in Munich. Neuroepidemiology 21, 202–206 (2002).
Article CAS Google Scholar
Ludlow, C. L. Spasmodic dysphonia: a laryngeal control disorder specific to speech. J Neurosci 31, 793–393 (2011).
Article CAS Google Scholar
Watts, C., Whurr, R. & Nye, C. Botulinum toxin injections for the treatment of spasmodic dysphonia. Cochrance Database Sys Rev. https://doi.org/10.1002/14651858.CD004327.pub2 (2004).
Article Google Scholar
Simonyan, K., Berman, B. D., Herscovitch, P. & Hallett, M. Abnormal striatal dopaminergic neurotransmission during rest and task production in spasmodic dysphonia. J Neurosci 33, 14705–14714, https://doi.org/10.1523/JNEUROSCI.0407-13.2013 (2013).
Article CAS PubMed PubMed Central Google Scholar
Ludlow, C. L., Yamashita, T., Schulz, G. M. & Deleyiannis, F. W. B. Abnormalities in long latency responses to superior laryngeal nerve stimulation in adductor spasmodic dysphonia. Ann Otol Rhinol Laryngol 104, 928–935, https://doi.org/10.1177/000348949510401203 (1995).
Article CAS PubMed Google Scholar
Samargia, S., Schmidt, R. & Kimberley, T. J. Shortened cortical silent period in adductor spasmodic dysphonia: Evidence for widespread cortical excitability. Neurosci Lett 560, 12–15, https://doi.org/10.1016/j.neulet.2013.12.007 (2014).
Article CAS PubMed Google Scholar
Simonyan, K. et al. Focal white matter changes in spasmodic dysphonia: a combined diffusion tensor imaging and neuropathological study. Brain 131, 447–459, https://doi.org/10.1093/brain/awm303 (2008).
Article PubMed Google Scholar
Simonyan, K. & Ludlow, C. L. Abnormal activation of the primary somatosensory cortex in spasmodic dysphonia: an fMRI study. Cereb Cortex 20, 2749–2759, https://doi.org/10.1093/cercor/bhq023 (2010).
Article PubMed PubMed Central Google Scholar
Maschke, M., Gomez, C. M., Tuite, P. J. & Konczak, J. Dysfunction of the basal ganglia, but not the cerebellum, impairs kinaesthesia. Brain 126, 2312–2322, https://doi.org/10.1093/brain/awg230 (2003).
Article PubMed Google Scholar
Putzki, N. et al. Kinesthesia is impaired in focal dystonia. Mov Dis 21, 754–760, https://doi.org/10.1002/mds.20799 (2006).
Article Google Scholar
Patel, N., Hanfelt, J., Marsh, L. & Jankovic, J. Alleviating manoeuvres (sensory tricks) in cervical dystonia. J Neurol Neurosurg Psychiatry 85, 882–884, https://doi.org/10.1136/jnnp-2013-307316 (2014).
Article PubMed PubMed Central Google Scholar
Konczak, J., Aman, J. E. & Chen, Y.-W. Li, K.-y. & Watson, P. J. Impaired limb proprioception in adults with spasmodic dysphonia. J Voice 29, 777.e717–777.e723, https://doi.org/10.1016/j.jvoice.2014.12.010 (2015).
Article Google Scholar
Kägi, G. et al. Sensory tricks in primary cervical dystonia depend on visuotactile temporal discrimination. Mov Dis 28, 356–361, https://doi.org/10.1002/mds.25305 (2013).
Article Google Scholar
Poisson, A. et al. History of the ‘geste antagoniste’ sign in cervical dystonia. J Neurol 259, 1580–1584, https://doi.org/10.1007/s00415-011-6380-7 (2012).
Article CAS PubMed Google Scholar
Karnath, H., Konczak, J. & Dichgans, J. Effect of prolonged neck muscle vibration on lateral head tilt in severe spasmodic torticollis. J Neurol Neurosurg Psychiatry 69, 658–660, https://doi.org/10.1136/jnnp.69.5.658 (2000).
Article CAS PubMed PubMed Central Google Scholar
Bianconi, R. & Van Der Meulen, J. P. The response to vibraion of the end organs of mammalian muscle spindels. J Neurophysiol 26, 177–190, https://doi.org/10.1152/jn.1963.26.1.177 (1963).
Article CAS PubMed Google Scholar
Brown, M. C. & Matthews, E. I. PB. The use of vibration as a selective repetitive stimulus for Ia afferent fibres. J Physiol 191, 31P–32P (1967).
Article CAS Google Scholar
Cordo, P., Gurfinkel, V. S., Bevan, L. & Kerr, G. K. Proprioceptive consequences of tendon vibration during movement. J Neurophysiol 74, 1675–1688, https://doi.org/10.1152/jn.1995.74.4.1675 (1995).
Article CAS PubMed Google Scholar
Cordo, P. J., Gurfinkel, V. S., Brumagne, S. & Flores-Vieira, C. Effect of slow, small movement on the vibration-evoked kinesthetic illusion. Exp Brain Res 167, 324–334, https://doi.org/10.1007/s00221-005-0034-x (2005).
Article CAS PubMed Google Scholar
Baken, R. J. & Noback, C. R. Neuromuscular spindles in intrinsic muscles of a human larynx. J Speech Lang Hearing Res 14, 513–518, https://doi.org/10.1044/jshr.1403.513 (1971).
Article CAS Google Scholar
Grim, M. Muscle spindles in the posterior cricoarytenoid muscle of the human larynx. Folia Morphologia (Praha) 15, 124–131 (1967).
CAS Google Scholar
Hirayama, M., Matsui, T., Tachibana, M., Ibata, Y. & Mizukoshi, O. An electron microscopic study of the muscle spindle in the arytenoid muscle of the human larynx. Eur Arch Otorhinolarynigol 244, 249–252 (1987).
Article CAS Google Scholar
Paulsen, K. Occurrence & number of muscle spindles in internal laryngeal muscles of humans (m. cricoarytenoideus & m. cricothyreoideus)]. Z Zellforsch Mikro Anat 48, 349–355 (1958).
Article CAS Google Scholar
Tellis, C. M., Rosen, C., Thekdi, A. & Sciote, J. J. Anatomy and fiber type composition of human interarytenoid muscle. Ann Otol Rhinol Laryngol 113, 97–107 (2004).
Article Google Scholar
Brandon, C. A. et al. Staining of human thyroarytenoid muscle with myosin antibodies reveals some unique extrafusal fibers, but no muscle spindles. J Voice 17, 245–254 (2003).
Article Google Scholar
Loucks, T. M. J., Poletto, C. J., Saxon, K. G. & Ludlow, C. L. Laryngeal muscle responses to mechanical displacement of the thyroid cartilage in humans. J App Physiol 99, 922–930, https://doi.org/10.1152/japplphysiol.00402.2004 (2005).
Article Google Scholar
Davis, P. J. & Nail, B. S. Quantitative analysis of laryngeal mechanosensitivity in the cat and rabbit. J Physiol 388, 467–485 (1987).
Article CAS Google Scholar
Nagai, T. Encapsulated sensory corpuscle in the mucosa of human vocal cord: An electron microscope study. Arch Histol Jap 45, 145–153, https://doi.org/10.1679/aohc.45.145 (1982).
Article CAS PubMed Google Scholar
Andreatta, R. D., Mann, E. A., Poletto, C. J. & Ludlow, C. L. Mucosal afferents mediate laryngeal adductor responses in the cat. J Appl Physiol (1985) 93, 1622–1629, https://doi.org/10.1152/japplphysiol.00417.2002 (2002).
Article Google Scholar
Woodson, G. E. In Laryngeal Evaluation: Indirect Laryngoscopy to High-Speed Digital Imaging (ed Katherine A. Kendall; Rebecca J. Leonard) Ch. 25, (Thieme, 2010).
Maryn, Y., Roy, N., De Bodt, M., Van Cauwenberge, P. & Corthals, P. Acoustic measurement of overall voice quality: a meta-analysis. J Acoust Soc Am 126, 2619–2634, https://doi.org/10.1121/1.3224706 (2009).
Article ADS PubMed Google Scholar
Vitek, J. L. Pathophysiology of dystonia: A neuronal model. Mov Dis 17, S49–S62, https://doi.org/10.1002/mds.10142 (2002).
Article Google Scholar
Hendrix, C. M. & Vitek, J. L. Toward a network model of dystonia. Ann New York Acad Sci 1265, 46–55, https://doi.org/10.1111/j.1749-6632.2012.06692.x (2012).
Article ADS Google Scholar
Johnson, M. D., Miocinovic, S., McIntyre, C. C. & Vitek, J. L. Mechanisms and targets of deep brain stimulation in movement disorders. Neurotherapeutics 5, 294–308, https://doi.org/10.1016/j.nurt.2008.01.010 (2008).
Article PubMed PubMed Central Google Scholar
Ludlow, C. L. Central nervous system control of voice and swallowing. J Clin Neurophysiol 32, 294–303, https://doi.org/10.1097/WNP.0000000000000186 (2015).
Article PubMed PubMed Central Google Scholar
Berke, J. D., Okatan, M., Skurski, J. & Eichenbaum, H. B. Oscillatory entrainment of striatal neurons in freely moving rats. Neuron 43, 883–896, https://doi.org/10.1016/j.neuron.2004.08.035 (2004).
Article CAS PubMed Google Scholar
Goto, Y. & O’Donnell, P. Synchronous activity in the hippocampus and nucleus accumbens in vivo. J Neurosci 21, RC131–RC131, https://doi.org/10.1523/JNEUROSCI.21-04-j0003.2001 (2001).
Article CAS PubMed PubMed Central Google Scholar
Liu, X. et al. Involvement of the medial pallidum in focal myoclonic dystonia: A clinical and neurophysiological case study. Mov Dis 17, 346–353 (2002).
Article Google Scholar
Liu, X. et al. The sensory and motor representation of synchronized oscillations in the globus pallidus in patients with primary dystonia. Brain 131, 1562–1573, https://doi.org/10.1093/brain/awn083 (2008).
Article PubMed Google Scholar
Neumann, W.-J. et al. A localized pallidal physiomarker in cervical dystonia. Ann. Neurol. 82, 912–924, https://doi.org/10.1002/ana.25095 (2017).
Article CAS PubMed Google Scholar
Tang, J. K. et al. Changes in cortical and pallidal oscillatory activity during the execution of a sensory trick in patients with cervical dystonia. Exp Neurol 204, 845–848 (2007).
Article Google Scholar
Nowak, M., Zich, C. & Stagg, C. J. Motor cortical gamma oscillations: what have we learnt and where are we headed? Curr Behav Neurosci Rep 5, 136–142, https://doi.org/10.1007/s40473-018-0151-z (2018).
Article PubMed PubMed Central Google Scholar
Bauer, M., Oostenveld, R., Peeters, M. & Fries, P. Tactile spatial attention enhances gamma-band activity in somatosensory cortex and reduces low-frequency activity in parieto-occipital areas. J Neurosci 26, 490, https://doi.org/10.1523/JNEUROSCI.5228-04.2006 (2006).
Article CAS PubMed PubMed Central Google Scholar
Engel, A. K., Fries, P. & Singer, W. Dynamic predictions: oscillations and synchrony in top–down processing. Nat Rev Neurosci 2, 704–716, https://doi.org/10.1038/35094565 (2001).
Article CAS PubMed Google Scholar
Palva, S. et al. Distinct gamma-band evoked responses to speech and non-speech sounds in humans. J Neurosci 22, RC211–RC211, https://doi.org/10.1523/JNEUROSCI.22-04-j0003.2002 (2002).
Article PubMed PubMed Central Google Scholar
Miller, K. J. et al. Cortical activity during motor execution, motor imagery, and imagery-based online feedback. Proc Natl Acad Sci USA 107, 4430–4435, https://doi.org/10.1073/pnas.0913697107 (2010).
Article ADS PubMed Google Scholar
Muthukumaraswamy, S. D. Functional properties of human primary motor cortex gamma oscillations. J Neurophysiol 104, 2873–2885, https://doi.org/10.1152/jn.00607.2010 (2010).
Article PubMed Google Scholar
Brücke, C. et al. Scaling of movement is related to pallidal γ oscillations in patients with dystonia. J Neurosci 32, 1008, https://doi.org/10.1523/JNEUROSCI.3860-11.2012 (2012).
Article CAS PubMed PubMed Central Google Scholar
Peterson, E. A. et al. Toward validation of the cepstral spectral index of dysphonia (CSID) as an objective treatment outcomes measure. J Voice 27, 401–410, https://doi.org/10.1016/j.jvoice.2013.04.002 (2013).
Article PubMed Google Scholar
Kempster, G. B., Gerratt, B. R., Abbott, K. V., Barkmeier-Kraemer, J. & Hillman, R. E. Consensus auditory-perceptual evaluation of voice: development of a standardized clinical protocol. Am J Speech Lang Pathol 18, 124–132, https://doi.org/10.1044/1058-0360(2008/08-0017) (2009).
Article PubMed Google Scholar
Ludlow, C. L. et al. Consensus-Based Attributes for Identifying Patients With Spasmodic Dysphonia and Other Voice Disorders. JAMA. Otolaryngol Head Neck Surg 144, 657–665, https://doi.org/10.1001/jamaoto.2018.0644 (2018).
Article Google Scholar
Fraile, R. & Godino-Llorente, J. I. Cepstral peak prominence: a comprehensive analysis. Biomed Signal Process Control 14, 42–54, https://doi.org/10.1016/j.bspc.2014.07.001 (2014).
Article Google Scholar
Hillenbrand, J. & Houde, R. A. Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Lang Hearing Res 39, 311–321, https://doi.org/10.1044/jshr.3902.311 (1996).
Article CAS Google Scholar
Boersma, P. & van Heuven, V. Speak and unspeak with PRAAT. Glot International 5, 341–347 (2001).
Google Scholar
Delorme, A. & Makeig, S. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Method 134, 9–21, https://doi.org/10.1016/j.jneumeth.2003.10.009 (2004).
Article Google Scholar
Chaumon, M., Bishop, D. V. M. & Busch, N. A. A practical guide to the selection of independent components of the electroencephalogram for artifact correction. J Neurosci Method 250, 47–63, https://doi.org/10.1016/j.jneumeth.2015.02.025 (2015).
Article Google Scholar
Makeig, S. Auditory event-related dynamics of the EEG spectrum and effects of exposure to tones. Electroencephalog. Clin Neurophysiol 86, 283–293, https://doi.org/10.1016/0013-4694(93)90110-H (1993).
Article CAS Google Scholar
Caviness, J. N., Liss, J. M., Adler, C. & Evidente, V. Analysis of high-frequency electroencephalographic-electromyographic coherence elicited by speech and oral nonspeech tasks in parkinson’s disease. J Speech Lang Hearing Res 49, 424–438, https://doi.org/10.1044/1092-4388(2006/033) (2006).
Article Google Scholar
Mor, N., Simonyan, K. & Blitzer, A. Central voice production and pathophysiology of spasmodic dysphonia. Laryngoscope 128, 177–183, https://doi.org/10.1002/lary.26655 (2018).
Article PubMed Google Scholar
Pfurtscheller, G. & Andrew, C. Event-related changes of band power and coherence: methodology and interpretation. J Clin Neurophysiol 16 (1999).
Alvarenga, K. F. et al. The influence of speech stimuli contrast in cortical auditory evoked potentials. Brazil J Otorhinolaryngol 79, 336–341 (2013).
Article Google Scholar
Santee, J. L. & Kohfeld, D. L. Auditory reaction time as a function of stimulus intensity, frequency, and rise time. Bull Psychonom Soc 10, 393–396, https://doi.org/10.3758/BF03329370 (1977).
Article Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc Series B 57, 289–300 (1995).
MathSciNet MATH Google Scholar

Download references

Acknowledgements

We sincerely thank all participants who attended the study, as well as the Center for Applied and Translational Sensory Sciences at the University of Minnesota for providing the recording space and facility. This research was supported by grants from the U.S. National Institutes of Health NIH 1R21DC011841 to PW and JK and by NIH 1 R01 DC016315-01A1 to JK.

Author information

Authors and Affiliations

Human Sensorimotor Control Laboratory, School of Kinesiology, University of Minnesota, Minnesota, USA
Sanaz Khosravani, Arash Mahnan, I-Ling Yeh & Jürgen Konczak
Department of Occupational Therapy, Singapore Institute of Technology, Singapore, Singapore
I-Ling Yeh
Department of Neurology, University of Minnesota, Minnesota, USA
Joshua E. Aman
Department of Speech, Language, and Hearing Sciences, University of Minnesota, Minnesota, USA
Peter J. Watson & Yang Zhang
Department of Otolaryngology, University of Minnesota, Minnesota, USA
George Goding

Authors

Sanaz Khosravani
View author publications
You can also search for this author in PubMed Google Scholar
Arash Mahnan
View author publications
You can also search for this author in PubMed Google Scholar
I-Ling Yeh
View author publications
You can also search for this author in PubMed Google Scholar
Joshua E. Aman
View author publications
You can also search for this author in PubMed Google Scholar
Peter J. Watson
View author publications
You can also search for this author in PubMed Google Scholar
Yang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
George Goding
View author publications
You can also search for this author in PubMed Google Scholar
Jürgen Konczak
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.K., J.A., P.W. designed the experiment. J.K., S.K., I.-L.Y., and J.A. set up the hardware/software for VTS application. S.K., I.-L.Y. and A.M. collected the data. S.K., I.-L.Y., J.A., A.M., P.W. and Y.Z. analyzed the data. G.G. served as clinical liaison. S.K. and J.K. wrote the manuscript with input from P.W., Y.Z. and G.G. J.K. supervised the work.

Corresponding author

Correspondence to Jürgen Konczak.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Materials

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Khosravani, S., Mahnan, A., Yeh, IL. et al. Laryngeal vibration as a non-invasive neuromodulation therapy for spasmodic dysphonia. Sci Rep 9, 17955 (2019). https://doi.org/10.1038/s41598-019-54396-4

Download citation

Received: 01 August 2019
Accepted: 09 November 2019
Published: 29 November 2019
DOI: https://doi.org/10.1038/s41598-019-54396-4

This article is cited by

Objective Assessment of Covid-19 Severity Affecting the Vocal and Respiratory System Using a Wearable, Autonomous Sound Collar
- D. Ishac
- S. Matta
- G. Nassar
Cellular and Molecular Bioengineering (2022)
Effects of low-frequency repetitive transcranial magnetic stimulation in adductor laryngeal dystonia: a safety, feasibility, and pilot study
- Cecília N. Prudente
- Mo Chen
- Teresa J. Kimberley
Experimental Brain Research (2022)
Altered sensory system activity and connectivity patterns in adductor spasmodic dysphonia
- Tobias Mantel
- Christian Dresel
- Bernhard Haslinger
Scientific Reports (2020)
What Is New in Laryngeal Dystonia: Review of Novel Findings of Pathophysiology and Novel Treatment Options
- Necati Enver
- Michael J. Pitman
Current Otorhinolaryngology Reports (2020)
Treatment of Dystonia: Medications, Neurotoxins, Neuromodulation, and Rehabilitation
- Ian O. Bledsoe
- Aaron C. Viser
- Marta San Luciano
Neurotherapeutics (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Improvement of measures of speech quality in response to laryngeal VTS

Change in the cortical oscillatory behavior in response to laryngeal VTS

Discussion

Possible mechanisms behind the effectiveness of laryngeal VTS for improving speech in SD

Modulation of cortical oscillations in response to laryngeal VTS

Limitations of the study

Conclusions

Methods

Participants

Apparatus

Experimental procedure

Measures of speech quality

EEG signal processing and electrocortical measures

Statistical analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links