Introduction

Isolated rapid eye movement sleep disorder (iRBD) is a parasomnia characterized by dream-enactment behavior and loss of physiologic muscle atonia during the rapid eye movement sleep phase. iRBD is considered a prodromal stage of neurodegeneration as more than 80% of diagnosed patients developed alfa-synuclein-aggregation disorders such as Parkinson’s disease (PD), Lewy body dementia, or multiple system atrophy1,2. Considering the development of Parkinson’s disease-modifying treatment3, a multicentre study including 1280 iRBD patients identified quantitative fine motor skill testing as the strongest predictor for conversion4. Another study by Postuma et al. 5 revealed that voice and face akinesia represent the earliest prodromal motor manifestations in iRBD subjects preceding the onset of parkinsonism by a mean 9.8 years. This is likely a consequence of motor speech complexity and its sensitiveness to neural damage6.

Hypokinetic dysarthria of PD, which is mainly characterised by articulatory, phonatory and prosodic alterations, occurs in up to 90% of patients over the course of the disease6,7. Moreover, speech impairment is present in a majority of newly diagnosed PD patients8,9. Considering that patients with iRBD are at high risk of developing PD, the speech behavior assessment in iRBD is subjected to thorough investigation. Recent multilanguage research based on fully automated analysis of seven distinctive speech dimensions of hypokinetic dysarthria10, including harsh voice, slow sequential motion rates, imprecise consonants, monoloudness, monopitch, prolonged pauses, and articulation rate performed on 150 iRBD patients revealed that only monopitch was able to significantly differentiate iRBD patients from controls11. Interestingly, monopitch was found in iRBD subjects with impaired olfactory function before the nigrostriatal dopaminergic transmission is affected12, that is, in Braak stage 2 before the substantia nigra is affected by synucleinopathy13. Among monopitch, vowel articulation impairment represents one of the core deficits contributing to dysarthric speech, as it reflects the range of articulatory movements and strongly correlates with overall intelligibility14,15,16. The potential of imprecise vowel articulation to serve as an early biomarker can also be supported by a previous pilot study where deficits in vowel articulation were detected in a small sample of 20 patients with de novo PD17. However, potential changes of vowel articulation in iRBD have never been investigated. Also, no previous research independently related articulation impairment to other essential prodromal features of synucleinopathy, such as olfactory dysfunction.

The purpose of this study is to investigate vowel articulation in iRBD and early-stage PD patients compared to healthy controls in order (i) to verify the prospect of using measurement of vowel articulation as a biomarker for the detection of prodromal PD and (ii) to investigate the relationship between articulation measures and the degree of motor and smell dysfunction.

Results

Group differences

Normative values of the first two formants for iRBD, PD and healthy control (HC) groups are summarized in Table 1. Vowel Space Area (VSA) was found to be the best parameter for differentiating between groups [F(2,177) = 7.4, p = 0.001, η2 = 0.08] (Fig. 1). Post hoc comparisons revealed significantly smaller VSA in both iRBD (p = 0.01) and PD (p = 0.001) compared to HC individuals. In addition, group differences were also detected for Vowel Articulation Index (VAI) [F(2,177) = 6.3, p = 0.002, η2 = 0.07], as the PD group manifested significantly smaller VAI (p = 0.002) compared to HC group. Slight vowel duration differentiation was also observed across groups [F(2,177) = 3.2, p = 0.04, η2 = 0.04), associated with differences between PD and iRBD groups (p = 0.04). The sub-experiment concerning olfactory function in iRBD showed that iRBD group with preserved olfactory function (iRBD-POF) had greater VSA than iRBD group with abnormal olfactory function (iRBD-AOF) [F(1,54) = 5.4, p = 0.024, η2 = 0.094] (Fig. 2a). In addition, iRBD-AOF with normal dopamine transporter single-photon emission computed tomography (DAT-SPECT) showed greater VSA than iRBD-AOF with abnormal DAT-SPECT [F(1,31) = 4.2, p = 0.049, η2 = 0.140] (Fig. 2b). No significant differences for VAI and vowel duration were found.

Table 1 Normative values of formant frequencies in male groups of iRBD, PD and HC.
Fig. 1: Comparison of vowel measurements including VSA, VAI and vowel duration between HC, iRBD and PD using boxplots.
figure 1

The centre line indicates the median, and the bounds of the box indicate the 25th and 75th percentiles. The whiskers extend to the most extreme data points not considered outliers, and the outliers are plotted individually using the ‘x’ symbol. VSA, vowel space area, VAI vowel articulation index, “Asterisks” indicate significant differences after Bonferroni correction: *p < 0.05; **p < 0.01.

Fig. 2: Mean F1 and F2 values and vowel space area for a) iRBD-POF compared iRBD-AOF subgroups and b) iRBD-AOF with normal DAT-SPECT compared to iRBD-AOF with abnormal DAT-SPECT.
figure 2

VSA vowel space area, VAI vowel articulation index, VD vowel duration, iRBD-POF isolated rapid eye movement sleep behaviour disorder patients with preserved olfactory function, iRBD-AOF isolated rapid eye movement sleep behaviour disorder patients with abnormal olfactory function, DAT-SPECT dopamine transporter single-photon emission computed tomography; “Asterisks” indicate significant differences: *p < 0.05.

Correlations between speech and motor variables

Movement Disorder Society Unified Parkinson’s Disease Rating Scale motor part (MDS-UPDRS III) total in PD patients showed negative correlation with VSA (r = −0.29, p = 0.03) and VAI (r = −0.29, p = 0.03). In addition, bradykinesia and rigidity subscore in PD patients showed negative correlation with VSA (r = −0.33, p = 0.01) and VAI (r = −0.34, p < 0.01) while neither correlation between postural instability and gait difficulty (PIGD) subscore and VSA (r = −0.04, p = 0.75) or VAI (r = −0.12, p = 0.75) nor between tremor subscore and VSA (r = 0.01, p = 0.96) or VAI (r = 0.06, p = 0.64) was detected (Fig. 3).

Fig. 3: Significant correlations between clinical motor scales and acoustic data plotted to 2D space with the trend of averaged data (black line) in the PD group.
figure 3

VSA Vowel space area, VAI Vowel articulation index, MDS–UPDRS III Movement Disorder Society – Unified Parkinson’s Disease Rating Scale part III.

Regarding brain imaging, the putamen binding ratio in iRBD showed positive correlation with VSA (r = 0.35, p = 0.01). No other significant correlations were found between vowel articulation parameters and clinical scales in PD or iRBD.

Discussion

Results of our study revealed that subtle impairment in vowel articulation due to the reduced articulatory range of motion is already evident in prodromal synucleinopathy. Articulatory impairment in iRBD was detectable through objective acoustic analysis despite almost no perceptual dysarthria severity was noted during clinical examination. Since the extent of articulatory undershoot was related to bradykinesia and rigidity in our PD cohort, we may hypothesize that vowel articulation abnormalities in parkinsonism result mainly as a consequence of nigrostriatal degeneration. The strength of this study is that vowel articulation features were evaluated in a large sample of iRBD and de-novo PD patients. Examining drug-naïve patients is especially important as dopaminergic treatment may improve certain aspects of speech disorder such as vowel articulation18. Since the vowel articulation performance of iRBD subjects intermediated between controls and de-novo PD patients and since it was more severe in iRBD with hyposmia compared to iRBD with preserved smell function, acoustic assessment of vowels could be potentially useful as a diagnostic and prognostic biomarker in α-synuclein-aggregation disorders.

Our findings of lowered vowel space in iRBD demonstrate that vowel articulation is already altered in the prodromal stages of synucleinopathy. In particular, vowel articulation was more impaired in iRBD subgroup with severe hyposmia, which is one of the most common and earliest non-motor prodromal features to emerge in PD2,4. Thus, we might assume that vowel articulation impairment is an early prodromal symptom that progresses along with olfactory dysfunction in the disease process of synucleinopathy. Accordingly, recent research demonstrated that dysprosody is already present in iRBD subjects with impaired olfactory function but still intact nigrostriatal pathway12. Together, these findings might indicate that speech production is already slightly affected in Braak’s stage 2, which is associated with Lewy pathology within brainstem nuclei13, a brain region crucial for controlling vocal fold tension19.

Furthermore, our findings in iRBD subgroup with impaired olfactory function implies that articulatory undershoot is also result of nigrostriatal neurodegeneration, as greater vowel deficits were found in the subgroup of patients with abnormal compared to those with normal DAT-SPECT. This assumption can be further supported by the observed link between the extent of vowel articulation decline and bradykinesia and rigidity but not axial gait symptoms in our de-novo PD group. Also, the previous pilot study discovered a positive correlation between amelioration of vowel articulation and dopaminergic treatment-related improvement in bradykinesia and rigidity18. On the other hand, degeneration in non-dopaminergic brain regions may further contribute to the worsening of vowel articulation performance as PD progresses. This is in agreement with a previous longitudinal study reporting a further decline of vowel articulation in the course of the disease in PD patients with an average disease duration of 6 years after the diagnosis20. Indeed, it is well known that vowel articulation impairments strongly correlate with overall intelligibility14,15,16, which tends to be affected in the later stages of PD. A recent study showed that articulatory deficits in de-novo PD patients are indicative of more widespread brain damage affecting extranigral cortical or subcortical regions8. In accordance with this assumption, a former study reported that aggravation of dysarthria during PD progression results particularly from the increasing severity of cerebral non-dopaminergic lesions21. Finally, in PD patients treated with bilateral subthalamic nucleus deep brain stimulation, the severity of the residual parkinsonian speech score was predictive of a poor postoperative outcome, likely due to the presence of non-dopaminergic lesions within the brain22. However, we cannot exclude that vowel articulation impairment in advanced PD may be also related at least in part to levodopa-induced dyskinesia23, together with the neurodegeneration of dopaminergic and non-dopaminergic brain areas. In summary, given the existing evidence in literature, we may hypothesize that vowel articulation deficits, along with limb bradykinesia, are primarily related to dopaminergic involvement in the early stages, whereas nondopaminergic lesions further contribute to the worsening of vowel articulation in the later stages of PD. Therefore, acoustic assessment of vowel articulation may provide a surrogate marker of neurodegeneration from prodromal to more advanced synucleinopathy for scenarios where a single robust feature to monitor the dysarthria progression is desired.

In our study, both VSA and VAI reflected speech impairment in drug-naive PD patients, which is in accordance with a previous pilot study investigating a small sample of 20 de-novo PD patients17. Contrary to our results on de-novo PD patients, several authors suggested VAI is superior over VSA in moderate to advanced stages of PD20,24. The principle of VAI construction focuses on formant centralization in order to minimize the effect of interspeaker variability; as a consequence, VAI may not reflect subtle articulatory spatial modifications25. On the other hand, VSA is calculated out of the maximal extent of vowel working space, which might better mirror subtle speech changes18,26. In other words, we may hypothesize that PD vowel articulatory impairment is tongue-dominant and emerge initially in the posterior parts of an articulatory organ26,27. Since the individual vowel /u/, which is characterised with tongue positioning high and backward28, seem to be the most affected vowel in early stages of PD17, VSA construction shall significantly acknowledge single-vowel frequency shift, whereas the sensitivity of VAI might be lowered as it accounts for centralization of all corner vowel that does not need to be affected in the early disease process.

Previous studies assumed that speaking rate may influence vowel articulation performance in dysarthrias29. However, we did not find differences for vowel duration at the group level between HC and both patient cohorts, likely as a consequence of the very early stages of synucleinopathy investigated. Despite inconclusive results regarding articulation rate in later stages of synucleinopathies, including no changes in the speech rate, decreased speech rate, or even an accelerated speech rate30,31,32, our findings are in agreement with a recent study showing no speech rate changes in de-novo PD and only a trend toward slower speech rate in iRBD33. Faster speech observed in advanced PD is likely to reflect a physiological tendency to accelerate speech due to the impaired motor planning (oral festination)30,31 whereas the tendency toward a slower speech rate may be theoretically attributed to the degeneration of non-dopaminergic pathways8.

This study has certain limitations. We enrolled exclusively male participants primarily because of the strong predominance of male subjects within iRBD patients34. Anatomical dispositions of the vocal tract reflect sexual dimorphism35,36, and we therefore cannot exclude a potential sex effect on our results. On the other hand, no sex-specific speech dysfunction in de-novo PD was found9. The findings of current study are based on a cross-sectional design. Future research is needed to estimate the sensitivity of vowel articulation features in long-term follow-up and in predicting phenoconversion from iRBD to established parkinsonism.

In conclusion, the vowel articulation deficits in male subjects significantly differentiated both iRBD and de-novo PD patients from controls. Future studies should elaborate our findings in the female population, and vowel articulation analysis might then have the potential to serve as a speech biomarker indicating the prodromal stage of synucleinopathy. Introducing novel acoustic biomarkers enable to design a speech assessment battery allowing for valid and easy evaluation of a large number of subjects in clinical trials as speech assessment is inexpensive, noninvasive and digitally storable. Additionally, vowel articulation analysis may facilitate better speech phenotype classification of PD and has, thus, the potential utility for personalized medicine.

Methods

Participants

From 2015 to 2021, a total of 180 male Czech participants, including a separate group of iRBD, PD, and HC participants, were recruited. Each participant provided written informed consent. The study received approval from the Ethics Committee of the General University Hospital in Prague, Czech Republic, and has been performed in accordance with the ethical standards established in the 1964 Declaration of Helsinki.

The iRBD group consisted of 60 male patients aged 65.6 (SD 7.1) years diagnosed according to the third edition of the International Classification of Sleep Disorders37. None of the iRBD patients suffered a significant communication disorder or had a history of treatment with antiparkinsonian medication, nor were they taking any medication affecting sleep, cognition or motor function. The PD group consisted of 60 untreated drug-naïve male patients aged 61.8 (SD 11.6) years, fulfilling the Movement Disorder Society clinical diagnostic criteria for PD38. At the time of examination, none of the PD patients had a history of communication disorder unrelated to PD, nor were they taking any medication affecting sleep or cognition function.

Both iRBD and PD subjects were examined by a movement disorder specialist (P.D.) using the MDS-UPDRS III39. Symptom duration was assigned based on the self-reported occurrence of the first motor symptoms. To define specific movement disorder manifestations that may influence speech disorder, three subscores from the MDS-UPDRS III scale were calculated as follows: (i) PIGD subscore (MDS-UPDRS part III, 3.9 Arising from the chair, 3.10 Gait, 3.11 Freezing of gait, 3.12 Postural stability, 3.13 Posture), (ii) Bradykinesia and Rigidity subscore (MDS-UPDRS part III, 3.3 Rigidity, 3.4 Finger tapping, 3.5 Hand movements, 3.6 Pronation-supination movements of hands, 3.7 Toe-tapping, 3.8 Leg agility, 3.14 Body bradykinesia) and Tremor subscore (MDS-UPDRS part III, 3.15 Postural tremor of the hands, 3.16 Kinetic tremor of the hands, 3.17 Rest tremor amplitude, 3.18 Constancy of rest tremor). Item 3.1 Speech of the MDS-UPDRS III was used for the perceptual description of overall dysarthria severity. In addition, patients were examined using the University of Pennsylvania Smell Identification Test (UPSIT)40, Montreal Cognitive Assessment and Scales for Outcomes in Parkinson’s Disease - Autonomic Dysfunction41. iRBD patients were categorized into two groups based on their olfactory function: iRBD group with preserved olfactory function (iRBD-POF) consisted of patients with UPSIT > 25 (i.e. normosmic to moderately microsmic) and the iRBD group with abnormal olfactory function (iRBD-AOF) of patients with UPSIT ≤ 25 (i.e. severely microsmic or anosmic). Patient clinical and demographic characteristics are summarised in Table 2.

Table 2 Patient clinical and demographic characteristics.

The HC group consisted of 60 male volunteers of comparable aged 64.1 (SD 12.8) years, with no history of significant neurological or communication disorder. No group differences in age distribution were revealed between iRBD, PD and HC groups (ANOVA, p = 0.15).

Speech examination

The audio data were recorded in a quiet room with a low level of ambient noise using a head-mounted condenser microphone (Beyerdynamic Opus 55, Heilbronn, Germany) placed approximately 5 cm from the subject’s lips. The speech signals were sampled at 48 kHz with 16–bit resolution. The recordings were obtained during one session with a speech specialist (D.S., J.R., M.N. or T.T.) who conveyed instructions to the participants. Each participant completed a series of speaking tasks, including reading twice a standardised passage composed of 80 words as part of a longer protocol lasting about 20 minutes. A reading passage (Fig. 4), written by famous Czech writer Karel Capek, was chosen as it is a standardised speaking task representing a natural condition of connected speech that was further shown to possess the highest accuracy for discriminating iRBD from HC11. The second realization of the reading passage was utilized to minimize effect of patient’s natural ability to read aloud42. Neither signs of fatigue nor any changes in the quality of voice from the beginning to the end of the session were observed in any participant.

Fig. 4: The text of the reading passage.
figure 4

Ten occurrences of each corner vowel used for the acoustic analysis are depicted by red (vowel /a/), green (vowel /i/) and blue (vowel /u/) colour.

Speech analyses

Acoustic measures were performed using the widely used speech-analysis software PRAAT (available at www.praat.org). Using PRAAT, both the combined wideband spectrographic display and the power spectral density were used to determine the first (F1) and the second (F2) formant frequencies in Hz. A total of 30 vowels per passage were studied, including 10 occurrences of /a/, 10 occurrences of /i/, and 10 occurrences of /u/ (Fig. 4). The formant frequencies of vowels /a/, /i/, and /u/ were extracted from a 30-ms segment at the temporal midpoint of the stable part of each vowel (to avoid the influence of vowels preceding or following); this method has been previously validated43 and proved to be reliable when applied to different languages17,44,45. The values of F1 and F2 frequencies were separately averaged for the individual corner vowel of each participant. The measurement of VSA and VAI were used. VSA is traditional and probably the most used articulatory-acoustic measure46,47, and can be easily calculated using the following formula:48

$$VSA = 0.5 \times \left( {\left[ {F2_{/u/} + F2_{/i/}} \right] \times \left[ {F1_{/u/} - F1_{/i/}} \right] - \left[ {F2_{/a/} + F2_{/u/}} \right] \times \left[ {F1_{/a/} - F1_{/u/}} \right] - \left[ {F2_{/a/} + F2_{/i/}} \right] \times \left[ {F1_{/a/} - F1_{/i/}} \right]} \right).$$
(1)

The measurement of VAI is another commonly used measure46 that was introduced by Roy et al 49. and can be expressed using the following formula49:

$$VAI = \left( {F2_{/i/} + F1_{/a/}} \right)/\left( {F1_{/i/} + F1_{/u/} + F2_{/u/} + F2_{/a/}} \right).$$
(2)

In addition, vowel duration was measured as the difference between the onset and offset of each vowel according to previously published methodology50. The final value used for statistical analysis was averaged across all 30 occurrences available.

Analysis of inter- and intra-judge reliability was not performed as the stability of methodology used for both healthy as well as dysarthric speech was thoroughly validated in previous studies17,50.

Dopamine transporter imaging

In PD and RBD patients, we performed DAT-SPECT using the [123I]-2-b-carbomethoxy-3b-(4-iodophenyl)-N-(3-fluoropropyl) nortropane (DaTscan®, GE Healthcare) tracer according to European Association of Nuclear Medicine procedure guidelines51, using common acquisition and reconstruction parameters described in detail previously52. Automated semi-quantitative analysis was applied using the BasGan V2 software53, and specific binding ratios in both caudate nuclei and putamina relative to background binding were calculated; the lower value from both hemispheres was used for further analyses. Specific binding ratio values below the 95% reference prediction interval were considered abnormal.

Statistical analyses

As the Kolmogorov-Smirnov test for independent samples showed that all acoustic variables were normally distributed, we used analysis of covariance (ANCOVA) with post hoc Bonferroni adjustment to assess group differences. Since there was wide variability in age among PD participants, age was considered as a covariate. The Pearson’s partial correlation analysis controlled for age was performed to test for significant relationships between the clinical and acoustic data. The level of significance was set to p < 0.05. All statistical analyses were performed in MATLAB (MathWorks, Natick, MA, USA).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.