Introduction

It is well-established that throughout the course of development and into adulthood, females generally outperform males in verbal language abilities. However, the sources of these differences are not well understood. At least in children, the development of many linguistic skills is associated with basic speech perception abilities1,2, yet it remains unclear if males and females exhibit basic neural differences for processing speech sounds. In addition to activating auditory regions, speech perception tasks are well-known to recruit portions of the dorsal stream (i.e. premotor/motor cortex) that may provide dynamic sensorimotor-based cognitive (e.g., attention and working memory) scaffolding for audition3,4,5. Anterior dorsal activity has shown high intra- and inter-individual variability that typically is related to cognitive demands6, stimulus context6 and cognitive abilities7. There are a number of reasons to suspect that a listener’s sex may be an additional source of variability. First, males have been shown to have higher grey matter density in the precentral gyrus8. Second, ample behavioral and neurophysiological data demonste sex differences in cognitive tasks, especially attention9,10,11,12,13,14,15,16. Third, behavioral17,18 and neuroimaging studies using various techniques12,19,20,21,22,23,24,25 have revealed sex-based differences in tasks with different attentional demands that also may apply to speech perception. Lastly, a number of communication disorders with sensorimotor deficits are more often diagnosed in males26,27. Despite these observations, it remains unclear if sensorimotor processing for classic speech discrimination tasks displays sexual dimorphism.

Electroencephalography (EEG) offers strong temporal resolution for precisely capturing event-related changes in neural activity across the time course of speech perception tasks. Power in both alpha (8–13 Hz) and beta (15–25 Hz) rhythms is sensitive to capturing sensorimotor function28,29 across the time course of a perception event, including early attentional and later working memory processes. In particular, alpha bands are well known for producing patterns of both event-related desynchronization (ERD) and synchronization (ERS); depicting evidence of both cortical activation and inhibition, respectively30,31. In cognitive tasks, alpha ERD commonly is linked to active stimulus processing and associated with cognitive functions including attention and working memory32,33,34. Alpha ERS is thought to reflect sensory selection35,36. That is, alpha ERS may demonstrate inhibition of irrelevant information during cognitive processing within a particular region of interest31 or an inhibitory process that allows for reallocation of resources from non-relevant to relevant cortical regions37. Beta (15–25 Hz) ERD most commonly is associated with sensorimotor forward modeling in execution38,39, observation40, or even imagination of movement41. However, beta ERD has been found in cognitive tasks that require attention or working memory42,43,44. In a number of tasks with both cognitive and motor components, time-frequency decompositions of EEG data reveal sex-based differences in alpha and beta (in addition to theta) rhythm activity recorded across frontal and central electrodes45.

The sensorimotor mu rhythm emanates from the anterior dorsal stream and, when identified via magnetoecncaphalography (MEG)46 or independent component analysis of EEG data34,43,47,48, is characterized by peaks in both alpha and beta frequencies34,43,49,50,51. Consistent with alpha and beta sensitivity described above, mu rhythm oscillations are responsive to cognitive processes28,29,34,43,47,52,53. In addition, oscillatory activity in the alpha band of the mu rhythm previously has revealed sex differences related to empathy54 and attentional allocation55. A number of studies have temporally decomposed mu rhythm oscillations to characterize sensorimotor-based cognitive activity during the time course of speech discrimination events34,48,50. A well-replicated finding is that following stimulus offset, both alpha and beta bands of mu rthythms desynchronize, suggesting rehearsal while stimuli are held in working memory34,43,47,48. Across studies, however, mu oscillatory activity prior to and during the onset of stimulus presentation (i.e., early attentional mechanisms) has not been consistent. Of interest to the current investigation, Jenson et al.43 tested predominantly female participants and found mu beta ERD alongside mu-alpha ERS during onset of the speech stimulus that became stronger in the presence of noise, suggesting inhibition of sensorimotor activity via auditory gating of noise. Using similar methods and tasks in a predominantly male control group, Saltuklaroglu et al.47 showed markedly less robust early mu-alpha ERS, suggesting inhibition of sensorimotor activity is weaker in males than females prior to the onset of the auditory speech signal. Along with previous evidence of sex differences in behavioral and neural responses during the performance of cognitive and linguistic tasks, the data from the Jenson et al.43 and Saltuklaroglu et al.47 provide some support for mu rhythm oscillations being sensitive to sexual dimorphism in speech discrimination and additional impetus for the current examination.

In the current study, it is hypothesized that sex differences will be observed in oscillations of mu rhythms in speech discrimination tasks. Based on previous findings, sexual dimorphism is most likely to be revealed in the alpha band of the mu rhythm prior to speech perception, which would highlight differences in the contributions of early inhibitory processes related to attention and auditory gating. Time frequency analyses of anterior dorsal activity during speech perception may shed further light on sources of its variability. Additionally, results will contribute to the understanding of why certain communication disorders linked to sensorimotor deficits primarily impact males.

Methods

Participants

42 English-speaking adults (21 females, 21 males) with a mean age of 25.73 years (range 18–43 years, SD = 5.441) were recruited from the greater Knoxville area to participate in this study. Data from all female participants and 2 of the males were collected and analyzed in Jenson et al.43. For the purposes of this study, an additional 19 age-matched males were recruited. The average ages of male and female contributors were 26.06 and 26.25 years respectively, which were not significantly different (t (15) = −0.613, p = 0.549). Participants reported no history of cognitive, communicative, or attentional disorders. Though hearing acuity was not measured, all participants indicated they could clearly hear and attend to the stimuli. The Edinburgh Handedness Inventory56 was used to assess handedness dominance, with all participants indicating a right-handed preference. This study was approved by the University of Tennessee Health Science Center Institutional Review Board. In accordance with the Declaration of Helsinki, all participants gave written, informed consent prior to participation.

Stimuli

Similar to previous studies43,48, syllable stimuli /ba/ and /da/ were generated using AT&T naturally speaking text-to-speech software, which employs synthetic analogs of a human male speaker to create syllable utterances. To reduce potential for discrimination response bias57, syllable pairs were constructed so that half of the stimuli contained identical syllables (e.g. /ba//ba/) and half were different syllables (e.g. /ba/ /da/). Each syllable was 200 ms in duration, and each pair was divided by 200 ms of silence, creating a total duration of 600 ms from onset of the first syllable to offset of the second. Stimuli were low-pass filtered below 5 KHz and normalized for root-mean-square (RMS) amplitude. Syllable pairs were placed in both a quiet (quiet discrimination; QD) and noisy (noisy discrimination; ND) background for the purposes of manipulating cognitive load. For noisy discrimination, the signal-to-noise ratio (SNR) was +4 dB. A timeline depicting how syllables were presented within discrimination events is shown in Fig. 1.

Figure 1
figure 1

Single-trial timelines. Timelines for passive noise (PN) and the active discrimination conditions (QD and ND).

Design

Three conditions were used in a within-subject and between-subject design:

  1. 1.

    Control condition - Passive listening to white noise (PN).

  2. 2.

    Discriminate (same or different) between pairs of syllables (QD).

  3. 3.

    Discriminate (same or different) between pairs of syllables embedded in background noise (ND).

Procedure

Participants sat in a reclining armchair in an electronically and magnetically shielded, double-walled, sound-treated booth for the duration of the study. Stimuli were presented at 75 dB SPL through insert earphones and responses were recorded using the Compumedics NeuroScan Stim 2 4.3.3 software system.

In the control condition, participants were asked to passively listen to white noise in each trial. This control condition has been used in previous studies and is effective for stimulating auditory attention without eliciting sensorimotor activity in neurotypical speakers34,43,48. In discrimination conditions, participants were asked to listen to the stimuli and indicate if the syllables in each pair were the same or different via button-press response. The cue to respond was a 100 ms, 1 KHz ‘chirp’ that occurred at the end of each trial (+2000 ms – see Fig. 1). A button-press was required in the PN as well as the discrimination conditions to help maintain attention and control for any sensorimotor activity occurring in anticipation of a motor response (i.e., button-press)29,58. To control for lateralization of anticipatory motor responses, participants alternated between left- and right-handed responses for the two blocks of trials presented in each condition. As such, each condition consisted of 80 trials, presented in two blocks of 40 trials, each with a left- and right-handed response. The order of conditions was randomized for each participant.

EEG Acquisition

EEG data were collected using 68 electrode channels on an unlinked, sintered NeuroScan Quik Cap59. The electrode configuration was based on the extended international 10–2060 system, with two electromyography (EMG) and two electrocardiogram (ECG) channels. Mastoid electrodes (M1 and M2) were used as a reference for all recording electrodes. Eye movement artifact was recorded by placing electrodes on the lateral canthi of each eye (HEOG) along with the superior and inferior orbit (VEOG) of the left eye. The EMG surface electrodes were placed above and below the lips to monitor for any perilabial movement artifact.

EEG data were collected with a Synamps 2 system and Compumedics NeuroScan 4.3.3 software. Raw files were band-pass filtered (from 0.15–100 Hz) and digitized by a 24-bit analog to digital converter, sampling at 500 Hz. Continuous EEG data contained event markers representing stimulus onset and participant responses.

EEG Data Processing

EEGLAB 2014a61, an open source MATLAB toolbox, was used to process continuous EEG data. The following details the steps used to process the EEG data at both the individual and group levels. Additional details regarding data processing steps and analysis methods are described below.

Individual processing/analysis of EEG data

Data from the two 40-trial blocks for each condition underwent a series of preprocessing steps. Specifically, data were: (1) appended in order to create one, 80 trial set for each of the conditions; (2) downsampled to 256 Hz in order to reduce computational load for ICA processing; (3) filtered from 3–34 Hz to isolate mu related activity. This band pass filter was used as it captures mu frequencies of interest and eliminates other extraneous low- and high-frequency information; (4) epoched into 5000 ms segments around time point zero (stimulus onset), creating epochs that extended from −3000 to +2000 ms; (5) re-referenced to M1 and M2 (mastoid electrodes); (6) inspected for gross artifact (>200 uV), visually; and (7) cleaned of incorrect or late responses (responses entered greater than 2 s following the response cue). At least 40 epochs per participant in each condition were required in order to include the participant in the experiment. The overall average of usable trials exceeded this inclusion criterion comfortably.

Independent component analysis (ICA)

A single set of ICA weights were obtained by concatenating the three pre-processed EEG data sets. This method ensured that an accurate comparison was made across conditions within the revealed ICs. The “extended runica” algorithm in EEGLAB 2014a provided the ICA training, with an initial learning rate of 0.001 with a stopping weight of 10e−7. Each participant returned 66 ICs following ICA decomposition, reflecting the number of recording electrodes with the mastoid reference electrodes removed (68 – 2 reference electrodes). The inverse weight matrix (W−1) was projected onto the spatial EEG channel configuration in order to obtain scalp maps for each IC from each participant.

Equivalent current dipole (ECD) source localization

Equivalent current dipole (ECD) models for each component were computed following ICA decomposition. Using the brain electrical source analysis (BESA) model, dipoles were localized in the DIPFIT toolbox, a free extension of the EEGLAB toolbox62. Dipole processing produced a single dipole for each of the 2772 ICs (66 IC x 42 participants). Localization of produced dipoles to a cortical area requires back-projection to a signal source that likely generated the scalp potential distribution for each IC. The best forward model is computed with the goal of explaining the highest amount of scalp map variance63. Remaining unexplained variance is termed “residual variance” (RV) and used as a determiner of “goodness of fit” between the scalp map and projected ECD model.

Current source density (CSD) Source Localization

Current source density (CSD) measures using standardized low-resolution brain electromagnetic tomography (sLORETA) were conducted as a reliability check for ECD sources. sLORETA relies on current source density (CSD) from recorded signals to estimate the source of activity within the brain and address the inverse problem64. The base head model for sLORETA is a Talairach cortical probability brain atlas that is digitized at the Montreal Neurological Institute. EEG scalp electrodes are co-registered between realistic head geometry and the BESA spherical head model59. This model divides the brain space into 6239 voxels, yielding a spatial resolution of 5 mm. EEG channels and the inverse weight projections were exported to sLORETA for each identified mu cluster IC. Estimates of CSD for the components within each mu cluster were revealed by mapping cross-spectra and co-registering with the coordinates from the head model. Paired t-tests were used to assess the statistical significance of the CSD estimates by comparing relative activity to no activation65. Paired (Student) t-tests using a smoothing parameter of 1, the common variance for all components, assessed all frequencies between 3–34 Hz. Voxels that were significant at p < 0.001, after establishing a corrected threshold using 5000 random permutations, were considered active across participants

Group data analyses

Group data were analyzed using the EEGLAB STUDY module. This module is used to analyze ICA data from multiple participants across experimental conditions under different designs. STUDY allows for inclusion parameters to be implemented. Any dipole with a residual variance of 20% or below was included at this step.

Assignment of components to mu clusters

Principal component analysis (PCA) was used to identify clusters of neural components common to all participants. Using the K-means statistical toolbox, PCA identified 35 initial clusters of EEG activity on the bases of common dipoles, spectra, and scalp maps. One left and one right mu cluster were identified. However, previous analyses34,43,47,50 show that PCA clustering can sometimes include components that do not fit the mu inclusion criteria. The criteria used were: (a) characteristic mu spectra, with peaks in both the alpha (8–13 Hz) and beta (15–25 Hz) frequency bands, (b) appropriate source localization to BA 2, 3, 4, 6 and (c) RV of <20%34,43,47,48. Components that did not fit these criteria were removed. Additionally, PCA can sometimes assign mu components that fit the criteria to neighboring (non-mu) clusters. Thus, visual inspection of the mu clusters and neighboring clusters was necessary to identify and re-assign misplaced mu components. At least two of the authors agreed on cluster membership for all components included in left and right mu clusters.

Time-frequency analyses

Event-related spectral perturbations (ERSPs) were used to investigate changes in mu rhythm oscillatory activity. Time-frequency transforms were extracted using a Morlet sinusoidal wavelet set at 3 cycles at 3 Hz and 34 cycles at 34 Hz with a linear rise. Each trial was referenced to a 1000 ms baseline which was selected from the inter-trial interval. A surrogate distribution was constructed using 200 randomly selected latencies from within the selected inter-trial interval61. Changes from baseline in ERSPs were computed using a bootstrap resampling method (p < 0.05 uncorrected).

Within-subject and between-group differences across time and frequency were identified using permutation statistics. The significance threshold of p < 0.05 was implemented and, in order to control for type one error, a false discovery rate (FDR) correction was used66. Following an initial omnibus analysis that showed differences related to condition and group, a quiet versus noisy discrimination contrast was conducted to assess within-subject sensorimotor differences related to presence of noise.

Results

Discrimination Accuracy

All tasks produced over 86% accuracy in participants who contributed to mu clusters, with no significant differences in either test condition between males and females (QD: F(1, 30) = 0.153, p = 0.698, ND: F(1,30) = 0.038, p = 0.847). Only trials with correct responses within 2000 ms of stimulus offset were included in further analysis. Mean reaction times to respond are 1217 ms and 1201 ms for males for QD and ND, respectively. Mean female reaction times are 1165 ms and 1250 ms for QD and ND, respectively. These times are not statistically different (QD: F(1,30) = 0.192, p = 0.664, ND: F(1,30) = 1.088, p = 0.305). The average number of trials that were both accurately discriminated and clean, and therefore could be used in ICA were: PN = 68.26 (SD = 7.47); QD = 69.36 (SD = 7.78); ND = 66.84 (SD = 7.07).

Mu cluster characteristics

Of the 21 females, 20 contributed to either the left or the right mu cluster. Specifically, 12 contributed to both clusters while 4 contributed only to the left and 4 only to the right. All 21 of the matched males contributed to at least one of the mu clusters. Specifically, 11 contributed to both clusters while 5 contributed only to the left and 5 only to the right.

The same proportion (16/21) of members of each sex contributed to both mu clusters. As is common with high density electrode arrays, it was possible for sensorimotor activity to be spread across multiple components, allowing participants to sometimes contribute more than one component to each cluster. The total number of mu components per cluster was 44 (22 from each sex) in the left and 60 (30 from each sex) in the right. Data from components were used for statistical spectral and time-frequency analyses in each hemisphere. Information regarding the source localization of mu clusters using the two different methods is shown in Table 1.

Table 1 Source information for each mu cluster.

Spectral analysis

Average mu spectra across conditions are shown for both males and females in Fig. 2. No differences in spectral power were observed across conditions or between groups.

Figure 2
figure 2

Cluster results for left and right µ components. (A) ECD localized dipoles for the left (1) and right (2) mu clusters (red = female, blue = male), (B) maximum current source density voxels (t-values), (C) mean spectra for left (1) and right (2) cluster components in each condition (Red = females; Blue = males; Solid line = quiet discrimination; QD); Dashed line = noisy discrimination; ND). Notes: ECD = Equivalent Current Dipole.

Time-frequency analyses

For all conditions, patterns of mu oscillatory activity are similar in both hemispheres, suggesting bilateral sensorimotor activity. However, across conditions, patterns of significant differences in the left hemisphere are more robust.

Within- and between-group differences

Within-subject differences (pFDR < 0.05) were found before, during, and after stimulus presentation for both sexes when compared to the control condition (PN). Figures 3 and 4 show time-frequency decompositions of left and right mu rhythm oscillatory activity across test conditions and between groups. In the left hemisphere, within-subject differences were only observed in males. Between-group differences were found only in the QD condition, where females show stronger early mu-alpha ERS and males show stronger mu-beta ERD. In the right hemisphere, patterns of ERSP activity were similar to those found in the left hemisphere. Within-subject differences were again only found in males, though the difference was less robust than in the left hemisphere (i.e., fewer significant time-frequency voxels). No between-group differences were observed in the right hemisphere. Figures 5 and 6 display directional differences in mu-alpha and mu-beta spectral power across the time-course of the stimulus in each test condition for the left and right mu clusters, respectively.

Figure 3
figure 3

sLoreta and mean ERSPs for test conditions in the left hemisphere. sLoreta solutions for the left mu cluster, shown on a 3D Van Essen template highlighting significantly activated voxels associated by mu components, with mean ERSPs across all test conditions (warm colors indicate ERS whereas cool colors indicate ERD). For each condition, ERSPs display changes from a recorded baseline before stimulus onset (−3000 to −2000 ms before stimulus onset for each trial). The final panel in each row indicates statistical differences within each group across test conditions, while the final panel in each column indicates differences between groups within each test condition (pFDR < 0.05). Note: QD = Quiet Discrimination; ND = Noisy Discrimination; FDR = Corrections for False Discovery Rate; ERSP = Event-related Spectral Perturbations.

Figure 4
figure 4

sLoreta and mean ERSPs for test conditions in the right hemisphere. sLoreta solutions for the left mu cluster, shown on a 3D Van Essen template highlighting significantly activated voxels associated by mu components, with mean ERSPs across all test conditions (warm colors indicate ERS whereas cool colors indicate ERD). For each condition, ERSPs display changes from a recorded baseline before stimulus onset (−3000 to −2000 ms before stimulus onset for each trial). The final panel in each row indicates statistical differences within each group across test conditions, while the final panel in each column indicates differences between groups within each test condition (pFDR < 0.05). Note: QD = Quiet Discrimination; ND = Noisy Discrimination; FDR = Corrections for False Discovery Rate; ERSP = Event-related Spectral Perturbations.

Figure 5
figure 5

Alpha and beta contour lines showing power across time for the left hemisphere mu cluster. The middle red (female) and blue (male) lines represent mean alpha (8–13 Hz) and beta (15–25 Hz) value deviations from baseline, while shaded regions indicate the upper and lower quartile limits. Values above zero are ERS, while values below zero are ERD. Significant differences are noted by the grey regions.

Figure 6
figure 6

Alpha and beta contour lines showing power across time for the right hemisphere mu cluster. The middle red (female) and blue (male) lines represent mean alpha (8–13 Hz) and beta (15–25 Hz) value deviations from baseline, while shaded regions indicate the upper and lower quartile limits. Values above zero are ERS, while values below zero are ERD. Significant differences are noted by grey regions.

Discussion

The goal of this study was to investigate sexual dimorphism in sensorimotor-based cognitive activity in speech discrimination. Matched male and female cohorts demonstrate similar high levels of discrimination accuracy. In addition, they contribute similar numbers of components to left and right mu clusters with similar spectral amplitudes and cortical source localization (BA 6/4). Current mu rhythm sources are consistent with those previously identified using ICA43,48,51, known generators of mu rhythms28,29 and with regions of the anterior dorsal stream shown to activate during speech perception67,68,69. Basic groups similarities in accuracy and mu spectral amplitudes allow for time-frequency decompositons to test the research hypothesis that, across the time course of the stimulus presentation, males and females employ different cognitive-based sensorimotor strategies in speech discrimination tasks.

For both groups, the two speech discrimination conditions yield bilateral oscillatory changes compared to the control condition, indicating task-related sensorimotor activity prior to, during, and following syllable perception. Overall, mu activity in the left (Figs 3 and 5) and right (Figs 4 and 6) hemisphere are characterized by similar oscillatory patterns, though weaker in the right hemisphere. This is consistent with findings showing that sensorimotor transformations for speech occur bilaterally70 but are left-hemisphere dominant5,71. Within-subject differences related to experimental condition are found in both left and right mu rhythm activity (though more significant voxels were found in the left). However, between-group differences and within-subject differences between the two experimental conditions (QD and ND) are only found in the left hemisphere. As such, left hemisphere findings are the current focus.

Is sensorimotor activity during speech perception/discrimination sexually dimorophic?

Group differences occur in the quiet discrimination (QD) condition. Prior to and during syllable presentation, females display stronger mu-alpha ERS than males, whereas males display stronger mu-beta ERD than females (Figs 3 and 5). Hence, the two sexes appear to employ different early attentional strategies when accurately discriminating in quiet (i.e., optimal listening) backgrounds. According to models of speech perception such as analysis-by-synthesis, early attentional mechanisms provide top-down predictive codes in the form of sensorimotor forward models that help constrain the forthcoming sensory analysis72,73,74,75. As beta ERD is considered an index of forward modeling, current data suggest that males are more prone to use this early sensorimotor strategy when repeatedly discriminating. Similar early predictive mu-beta ERD has been observed in other speech perception studies43,48 and during auditory beat tracking76,77 or tracking task relevant information78. In contrast to early mu-beta ERD observed in males, females exhibit reduced mu-beta ERD alongside robust mu-alpha ERS. Thus, in lieu of using motor-based projections to predict forthcoming syllables, females appear to inhibit early sensorimotor activity. In cognitive tasks, alpha ERS conveying cortical inhibition is often associated with gating of task irrelevant information79,80 or the re-allocation of cognitive resources to cortical regions involved in processing37. Though it may not be possible to completely disentangle these two functions, in quiet listening conditions (without noise to gate) the latter explanation seems more plausible. This inhibitory activity in anterior dorsal regions perhaps allows cortical resources to be reallocated to regions involved in sensory analysis (e.g., auditory regions). Inhibition of sensorimotor cortex related to cognitive resource reallocation has been observed in working memory load81,82 and attention-based tasks31,36,79,83. Together, data clearly highlight sexual dimorphism in early sensorimotor-based cognitive strategies for repeated speech discrimination in optimal conditons.

In the noisy background (ND) versus in quiet, males and females both exhibit increases in early mu-alpha oscillatory activity when discriminating speech. For males, this pattern constitutes a significant shift in sensorimotor processing strategy from predictive (mu-beta ERD) in the quiet background to inhibitory (mu-alpha ERS) in the noisy background (Figs 3 and 5). In contrast, females appear to use the same sensorimotor processing strategy in noise as they do in quiet, showing slightly stronger early inhibitory activity during discrimination in noise, though the difference does not reach significance (Figs 3 and 5). Thus, in the presence of noise, a general increase in mu-alpha ERS may be related to inhibitory gating of the noise84. Sensory gating of irrelevant stimuli is thought to aid performance and often observed with cognitive load increases35. The fact that it occurs early is consistent with the inhibitory effect helping to provide top-down active sensing, as described when overcoming the ‘cocktail party problem’85. Saltuklaroglu et al. (2017)47 recently reported a similar shift in early mu oscillatory activity in speech discrimination when (predominantly) males switched from quiet to noisy listening. Interestingly, a matched group of (predominantly) males who stutter did not demonstrate this shift, failing to show inhibitory activity in the presence of noise. This finding was linked to auditory gating86 and sensorimotor adaptation deficits that are known to characterize stuttering87,88.

Though findings demonstrate generally stronger early inhibitory activity (mu-alpha ERS) in females and stronger early predictions (mu-beta ERD) in males, the absence of sexual dimorphism in discrimination accuracy suggests that neither of the early sensorimotor processing strategies offer a functional sex-related advantage. Both groups display similar levels of strong mu-alpha and mu-beta ERD after stimulus offset, activity often attributed to later processing (e.g., covert replay)43,89,90. It is possible that this late activity provides the functional support for accurate speech discrimination although additional comparisons of data from correct versus incorrect trials are necessary to verify. Considering ongoing debates6,75,91 regarding the functional role of sensorimotor processing in speech perception, it also is necessary to consider that all sensorimotor activity is epiphenomenal and that auditory discrimination is accomplished through sensory analysis in temporal regions. Regardless, of functionality, the finding of sexual dimorphism in sensoriomotor processing is consistent with a recent systematic review of sex differences in child language development, indicating sexual dimorphism in linguistic strategies that do not produce a functional advantage92. Thus, it is not clear how these findings are related to language abilities in neurotypical populations, though there may be implications for those with neurophysiological disorders.

Sensorimotor inhibitory activity links to developmental neurophysiological disorders

Mu rhythm oscillatory activity also is under the influence of the basal ganglia as the sensorimotor regions from which they emanate are sites of integration for two main basal ganglia based loops93,94,95. Basal ganglia help provide inhibitory function critical to selection and processes that modulate and stabilize sensory experience and fluid movements31,37,96. Bonstrup et al.97 revealed the efficacy of measuring alpha ERS from motor regions (similar to our mu-alpha) to demonstrate reduced inhibitory function found with an age-related decline in basal ganglia function, suggesting that inhibitory power decreases with age and leads to reduced inhibitory mechanisms97. Interestingly, in addition to stuttering47,98, inhibitory deficits also are linked to a number of neurodevelopmental disorders, such as autism99, Tourette’s syndrome100,101,102, and attention deficit/hyperactivity disorders103. All are documented to have a higher prevalence in males than females104. Therefore, sex-based differences in inhibitory activity examined in simple cognitive tasks, such as those herein, may in some way be related to the male prevalence of developmental neurophysiological disorders. If so, measurement of mu-alpha power in tasks known to recruit inhibitory processes (e.g., speech discrimination in noise) may prove useful for identifying or measuring severity of inhibitory dysfunction. However, to accomplish this, appropriate control data separated by sex are critical, as grouped control data that include both males and females may be confounded by sex differences.

Relevance to neuroimaging findings in speech perception

The current findings may be indicative of an additional source of variability in the activity of the anterior dorsal stream during speech perception. Previously uncovered sources of variability (e.g. context, individual cognitive ability, and task difficulty) mainly have been identified using fMRI7,105,106; an imaging technique with excellent spatial resolution. The detection of sexual dimorphism in sensorimotor processing was made possible by the superior temporal resolution of EEG. In contrast to fMRI, this EEG technique allows the cognitive contributions to sensorimotor activity surrounding speech perception to be delineated clearly in syllable discrimination. It will be of further interest to examine whether similar sexual dimorphism exists in passive speech perception tasks where cognitive demands are drastically reduced. Passive speech perception tasks have produced a number of conflicting results with regards to the presence or absence of sensorimotor activity, spurring ongoing debates regarding the nature and functional role of sensorimotor activity in speech perception107. The current data suggest that sex of the participant may be an additional variable to consider in such debates.

It remains somewhat unclear how the current findings compare to those from fMRI. Several studies have noted that increased BOLD responses seems to be correlated negatively with alpha power108,109, which seems logical as both alpha ERD and fMRI BOLD signals are indicators of cortical activity. There also is evidence to suggest a similar relationship in the beta band110. As such, fMRI findings of increased sensorimotor activity in discrimination tasks versus passive listening might be due to the strong late activity when stimuli are held in working memory; the time period that corresponds with the current findings of strong mu-alpha and mu-beta ERD. The relationship between mu-alpha and BOLD activity also has been found when mu-alpha becomes synchronized. That is, spikes in mu-alpha power suggesting cortical inhibition have been found in the presence of a negative BOLD signal111,112. However, it is unclear if the negative BOLD signal represents a true inhibitory response112,113. As such it remains an open question as to whether observing mu-alpha ERS across the time course of an EEG event would be associated with reduced overall BOLD activity or, as some have suggested, have an additive effect114.

Limitations

Though 41/42 participants contributed at least one mu component in the left or right hemisphere, bilateral components were not identified in many participants. The main reason is the strict inclusion criteria (correct localization, low RV, and characteristic spectra) used in this and previous studies34,43 to ensure the veracity of all mu components. Thus, though nearly all particpants produced bilateral ‘mu-like’ components, some were not included because they did not meet the criteria. The main reasons for this are (1) signal noise, that can be an issue even after ICA is applied, and compromise the spectal integrity of the mu rhythm and (2) failure of the mu-like rhythms to localize to sensorimotor regions (i.e., known mu generators). That said, data from the participants included in each cluster were sufficient to reveal within- and between-group differences, even with the conservative criteria for data inclusion. Lastly, this study only examined independent activity from left and right anterior aspects of the dorsal stream. The left hemisphere appeared to be more involved in syllable discrimination as indicated by increased oscillatory power in time-frequency analysis and significant findings in the left hemisphere only. However, statistical comparisions were not made of activity between conditions, mainly due to the relative few participants that contributed to both left and right mu clusters. Future studies that seek to make such comparisons and extract other (e.g., auditory components) to measure functional connectivity should apply individualized head models that enable more accurate source localization

Conclusions

By identifying EEG mu components using ICA and adding a temporal dimension in measurement, sex-based differences in sensorimotor activity during speech discrimination are revealed. Decomposition of the mu rhythm over time makes it possible to obtain rich measures of sensorimotor processing that consider how oscillations in alpha and beta frequencies align and dissociate across an event115. Here, we show that males and females display differences in early sensorimotor activation during speech discrimination in quiet backgrounds, suggesting the use of different attentional mechanisms in optimal conditions. The early inhibitory versus predictive patterns in females and males during optimal listening conditions, respectively, may have implications for understanding (1) previously described sexual dimorphism in various verbal cognitive functions and (2) the male prevalence of neurodevelopmental disorders with inhibitory dysfunction. In noisy conditions, data reveal evidence of sensory gating in both sexes. Together, findings suggest that males and females exhibit early sensorimotor processing for speech discrimination that is fundamentally different in quiet, yet similarly adaptable to adverse conditions. Further, sex differences appear unrelated to discrimination accuracy, suggesting neither early sensorimotor-based cognitive strategy provides a sex-related behavioral advantage. Finally, the nature of the mu rhythm and the precise temporal resolution afforded by this EEG technique provides an excellent means of mapping neural activity to behavior in light of cognitive requirements.