A hemispheric two-channel code accounts for binaural unmasking in humans

Encke, Jörg; Dietz, Mathias

doi:10.1038/s42003-022-04098-x

Download PDF

Article
Open access
Published: 22 October 2022

A hemispheric two-channel code accounts for binaural unmasking in humans

Communications Biology volume 5, Article number: 1122 (2022) Cite this article

1250 Accesses
6 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Sound in noise is better detected or understood if target and masking sources originate from different locations. Mammalian physiology suggests that the neurocomputational process that underlies this binaural unmasking is based on two hemispheric channels that encode interaural differences in their relative neuronal activity. Here, we introduce a mathematical formulation of the two-channel model – the complex-valued correlation coefficient. We show that this formulation quantifies the amount of temporal fluctuations in interaural differences, which we suggest underlie binaural unmasking. We applied this model to an extensive library of psychoacoustic experiments, accounting for 98% of the variance across eight studies. Combining physiological plausibility with its success in explaining behavioral data, the proposed mechanism is a significant step towards a unified understanding of binaural unmasking and the encoding of interaural differences in general.

Binaural summation of amplitude modulation involves weak interaural suppression

Article Open access 26 February 2020

D. H. Baker, G. Vilidaite, … R. E. Millman

Cortical temporal integration can account for limits of temporal perception: investigations in the binaural system

Article Open access 26 September 2023

Ravinderjit Singh & Hari M. Bharadwaj

Modulated stimuli demonstrate asymmetric interactions between hearing and vision

Article Open access 20 May 2019

Quoc C. Vuong, Mark Laing, … Adrian Rees

Introduction

The auditory system has the challenging task of restoring the spatial properties of an acoustic scene based solely on the signals arriving at the two ears. A critical source of information in this process is the difference in arrival time between the signals at the two ears. The delay line or Jeffress model¹, one of the longest-standing models of sensory-neuronal computation, suggests that an array of coincidence-detecting neurons compares neuronal signals from the two cochleae. According to this model, each neuron in the array is associated with a best delay τ compensating for a specific interaural time difference (ITD); the neuron that compensates best for the ITD would then show the strongest response. This concept corresponds to cross-correlation and postulates a place code for ITD. The delay-line concept was supported by the success of quantitative cross-correlation-based models that were able to predict a variety of human psychoacoustic data^2,3,4,5. Equally compelling, the predicted arrangement of axonal delay lines has been found in the nucleus laminaris of the barn owl⁶, a spatial hearing specialist. In mammals, however, no such structure has been found. Instead of a nearly frequency-independent distribution of best-delays centered around τ = 0 as ideal for the Jeffress model⁷, studies found the best delay of neurons in each hemisphere of the brain to be centered around 1/8th of the cycle duration^8,9,10 (see the visualization in Fig. 1c). Mammals thus seem to lack the topographical map of ITDs, as postulated by Jeffress. These findings resulted in the formulation of an alternative coding hypothesis: The two-channel model. Instead of the large number of systematically tuned coincidence detectors used by the Jeffress model, this model relies on the activity within only two broad hemispheric channels⁸. Instead of the Jeffress place code, the ITD is encoded by the relative firing-rate change within both channels. The two-channel code thus represents a rate code. The two-channel approach has been incorporated into several quantitative models dealing with various aspects of binaural hearing^11,12,13,14, but between them, these models still lack the predictive power of cross-correlation-based approaches. As a consequence, and despite the apparent lack of systematic delay lines in mammals, Jeffress-type models are still widely used to account for experimental data in humans, especially when dealing with phenomenons beyond sound localization^15,16.

In addition to sound localization, listening with two ears also provides a benefit in complex environments in which a target sound is masked by sounds from another location¹⁷. This binaural unmasking has been studied extensively using tone-in-noise detection experiments, resulting in a large body of highly reproducible data^3,18,19,20.

These studies consistently found that tone-in-noise detection improves considerably when an interaural difference is introduced into either the tone or the masker. If the masker is identical in both ears, anti-phasic 500-Hz tones can be detected at a sound level 15 dB lower than for in-phase tones¹⁸. This benefit is purely binaural: monaural detection thresholds are unaffected by changes in the tone phase, even though the waveform of the noise signal changes depending on the phase of the added tone (see Fig. 1a).

**Fig. 1: Visualization of the modeling approach.**

Tone-in-noise detection thresholds depend on several stimulus features, including noise correlation, noise ITD, interaural phase relation of noise and target, and noise bandwidth. Models based on the cross-correlation function account for these dependencies by detecting changes in the maximum of the cross-correlation function ρ(τ) which are caused by the tone^2,3. These models benefit from the large array of differently tuned coincidence detectors which enables them to use the most informative delay element, ρ(τ_best), for the respective task. Models that lack the array of delay-tuned detectors, such as the two-channel model, fall short in terms of both accuracy and comprehensiveness^11,12,21.

Adding a tone with an interaural phase difference (IPD) to a correlated masker not only reduces the correlation but also introduces fluctuations in the stimulus IPD^22,23. While the reduction in correlation and the fluctuation strength are closely connected, this relation does not always hold. This is illustrated by comparing two cases: (A) two noise tokens from independent sources; (B) two noise tokens from the same source where one token is phase-shifted by π/2. In both cases, the correlation between the tokens is zero. In the case of the two independent tokens, however, the IPD fluctuates randomly, whereas, by definition, (B) has a constant IPD. The amount of IPD fluctuation thus offers additional information about the underlying binaural statistics and has been proposed as an alternative metric for binaural detection^10,22,24. Yet, this approach has received relatively little attention in quantitative modeling.

This study aims to remedy the current limitations of the two-channel model in accounting for binaural unmasking experiments. By introducing a new mathematical representation of the two-channel model, we reveal a direct connection to the amount of IPD fluctuation proposed to underlie binaural detection. The proposed representation of the two-channel model also creates the theoretical foundation for a new understanding of IPD encoding in the mammalian brainstem. We evaluate this new approach to the two-channel model by comparing predictions of a signal-detection model against an extensive library of binaural detection experiments.

Results

A mathematical representation of the two-channel model

The two-channel model assumes that ITDs are encoded in the activity within two broad hemispheric channels. These channels are represented by the mean activity of neurons in the left and right brain hemispheres⁸. ITD-sensitive neurons in the midbrain have been found respond strongest to ITDs around 1/8th of the cycle duration^8,9,10 which is equivalent to an IPD of π/4. If we assume that each channel is represented by the mean activity of all units within the hemisphere, we can represent them as one correlator each. These correlators show best-IPDs of ±π/4. As a consequence, their relative phase difference is π/2 so that the two channels are orthogonal¹⁰ (see Fig. 1d).

An equivalent but mathematically more convenient method of representing the two ±π/4 channels is to use one correlation with zero best-IPD and a second correlation with π/2 interaural phase offset. This phase offset can be obtained by Hilbert transformation ${{{{{{{\mathcal{H}}}}}}}}$ of either the left ear signal l(t) or the right ear signal r(t). Both real-valued correlations can then be expressed by a single complex-valued correlation coefficient γ:

$$\gamma =\frac{ < l(t)r(t) > }{\sqrt{ < | l(t){| }^{2} > < | r(t){| }^{2} > }}+i\frac{ < {{{{{{{\mathcal{H}}}}}}}}[l(t)]r(t) > }{\sqrt{ < | l(t){| }^{2} > < | r(t){| }^{2} > }},$$

(1)

where i indicates the imaginary unit. Figure 1e shows noise-delay functions for the two correlators (blue and red line). The benefit of using this complex-valued representation is that it is equivalent to directly calculating a single correlation coefficient for the complex-valued or analytic representations of the two signals ${l}_{a}(t)=l(t)+i\,{{{{{{{\mathcal{H}}}}}}}}[l(t)]$ and ${r}_{a}(t)=r(t)+i\,{{{{{{{\mathcal{H}}}}}}}}[r(t)]$:

$$\gamma =\frac{ < {l}_{a}^{* }(t){r}_{a}(t) > }{\sqrt{ < | {l}_{a}(t){| }^{2} > < | {r}_{a}(t){| }^{2} > }},$$

(2)

where the asterisk indicates the complex conjugate, and the angular brackets the ensemble average. This expression is also called the complex-valued correlation coefficient. It is well known in other fields of physics that deal with waves such as optics^25,26, allowing to inherit its established properties and advantages.

Figure 1f represents γ as an arrow in the complex plane where the x- and y axis equal its real and imaginary parts. The example was calculated for an ITD of Δt = 2.3 ms which is also indicated by the dashed line in Fig. 1d. Instead of using the real and imaginary part, γ can also be described by the angle (argument $\arg (\gamma )$) and the length (modulus ∣γ∣) of the arrow. These two values have some interesting properties. The argument $\arg (\gamma )$ equals the expected value of the distribution of instantaneous IPDs, or in other words, the time-averaged IPD (see Fig. 1g). The modulus ∣γ∣ is a measure for the consistency of left and right instantaneous phases and thus for the IPD-fluctuation strength. We will refer to ∣γ∣ as the interaural coherence¹⁰.

Note that there are several differing definitions of coherence. Our use of coherence as ∣γ∣ is a typical time-domain definition²⁵. In general signal processing, the coherence function is instead defined in the frequency domain and calculated as the normalized absolute value of the cross-spectral power density (CSPD)²⁷. The two definitions are closely related, as the time-domain coherence can also be defined by using a Fourier transform of the CSPD (see “Methods” for details). In binaural research, a third definition exists, where interaural coherence is sometimes used to refer to the maximum of the real-valued cross-correlation function²⁸. It is similar but not equivalent to the more general definitions.

The derivation of γ as a mathematical representation of the two-channel model highlights two essential properties of the model: Firstly, the two-channel model can act as a perfect encoder for IPDs, and secondly, the two-channel model also encodes information about the amount of fluctuation in IPD. If these fluctuations can indeed be used for explaining binaural unmasking, as proposed in refs. ^10,22,24, then this should also be possible by using γ. The following section will thus develop a signal-detection model to predict binaural detection based on the quantity γ.

A model of binaural detection

Tone-in-noise detection is usually performed as an alternative forced-choice task with one or more reference intervals containing only the reference noise and a target interval in which the tone is added to the noise. These studies aim to determine the signal-to-noise ratio (SNR) at which the subject can identify the target stimulus with a predefined sensitivity.

A computational model was used to test the hypothesis that binaural tone-in-noise detection can be explained based on the complex correlation coefficient γ. In the model, threshold SNRs are calculated directly from the absolute difference between the complex correlation coefficients of the masker and target stimuli. Correlation coefficients were calculated based on the spectral properties of the two input stimuli, assuming only a peripheral bandpass filter. In addition to this binaural detection path, a monaural pathway provides an SNR-based detection cue in stimuli with few or no binaural cues. A mathematical description of the model is given in “Methods”.

Four stimulus parameters were necessary to define the stimuli used in this study: The IPD of the noise as a function of angular frequency Δφ(ω), the noise correlation ρ_N, the IPD of the target tone Δϕ, and the noise bandwidth Δω. For example, for an out-of-phase tone in 900 Hz wide, correlated noise with 2.3 ms ITD (as used in Fig. 1e–g), the parameters would be set to Δφ(ω) = ω × 2.3 ms, ρ_N = 1, Δψ = π, and Δω = 2π × 900 Hz. Table 1 summarizes the stimulus parameters of all experiments that are discussed below. Three parameters define the model itself: parameters σ_bin and σ_mon, which directly determine the binaural and the monaural detection sensitivity, and a third parameter $\hat{\rho }$, which limits the maximal sensitivity to changes in coherence. Model parameters were optimized separately for each experiment because detection thresholds for identical stimuli were not always identical across studies. This finding is unsurprising since most experiments were conducted with few subjects. Table 1 summarizes the resulting model parameters.

Table 1 Aggregation of stimulus and model parameters.

Full size table

Simulated datasets

The first dataset by Pollack & Trittipoe²⁹ is not from a tone-in-noise experiment but directly quantified the sensitivity to changes in coherence. For the model, this sensitivity was directly calculated using Eq. (8). Experimental data and model results are shown in Fig. 2a.

**Fig. 2: Experimental data and model results for all experiments.**

In the next two experiments, the reference also consisted of noise with predefined interaural correlations. However, the target correlation was not manipulated directly but was changed by adding a tone to the partly correlated noise. Robinson & Jeffress³⁰ collected results for both in-phase and anti-phasic tones (see Fig. 2b). The experiment with anti-phasic tones was repeated by Bernstein & Trahiotis¹⁶, who also collected data at different noise bandwidths (see Fig. 2c). The change in coherence that arises from adding a tone at a given SNR depends on the initial noise correlation ρ_N and on the difference between the tone IPD and the noise IPD³¹. The coherence change is greatest when the two IPDs are out of phase, while there is no influence when the IPDs are the same. In Fig. 2b, this is reflected in the large difference between the threshold SNRs of the in-phase and anti-phasic conditions when ρ_N = −1 and ρ_N = 1. The improvement in threshold SNR with increasing bandwidth, as seen in Fig. 2c, can be explained solely by the filter property of the auditory periphery. Only noise energy that falls within the peripheral filter interacts with the tone, thus determining the coherence. This peripheral filtering improves the SNR and, thus also the nominal threshold SNR. For large stimulus bandwidths far exceeding the peripheral bandwidth, this improvement equals 3 dB/octave.

Instead of directly changing the masking noise correlation, the next set of studies by Langford & Jeffress¹⁹, and van der Heijden & Trahiotis³ applied an ITD to the noise before adding the target tone (Fig. 2d, e). The added ITD results in both a reduction in coherence ∣γ∣ and a shift in the noise IPD (see Fig. 1e, f for an example). The change in noise IPD results in periodic oscillations of the threshold SNR, as the effectiveness of the added phasic or anti-phasic tone changes with noise IPD. The periodic oscillation is superimposed by an overall increase in threshold SNR with ITD. Rabiner et al.²⁰ conducted a slightly different but related experiment: instead of applying the ITD to the whole noise, it was applied only to the envelope of the masking noise, which keeps the noise IPD fixed at zero (Fig. 2f). This removes the oscillations in the threshold SNR while resulting in the same increase in threshold SNR as when using the regular ITD.

Yet another stimulus variation was introduced by Bernstein & Trahiotis¹⁵ who added anti-phasic tones to noises of different interaural correlations and applied the ITD to the whole signal. This modification keeps the phase relation between noise IPD and tone IPD fixed at π so that the ITD only influences the stimulus coherence (Fig. 2g). As in the study in ref. ²⁰, this results in an increase in the threshold SNR without oscillations. The increase is less pronounced at low noise correlations because the effect of the ITD on coherence diminishes with decreasing noise correlation.

Figure 2h shows experimental results from van de Par & Kohlrausch³² as well as simulated threshold SNRs for a large range of bandwidths from 5 Hz to 1 kHz in two configurations with binaural cues and one configuration with monaural cues only. The model accounts for the bandwidth dependence of detection thresholds because of its bandpass filter. The filter does not considerably affect the noise energy at very low bandwidths, so the threshold SNR remains constant. The predicted threshold SNR improves by 3 dB per octave at large bandwidths.

With parameters that were optimized individually for each experiment, the model could account for 91 to 98% of the respective variance (see Table 1). For all datasets together, keeping the individual parameters, the model accounted for 98% of the total variance. Figure 2i visualizes this high correlation between modeled and experimental thresholds by plotting one against the other. A single set of parameters, optimized to reduce the variance across all datasets, still accounted for 93% of the total variance, despite the deviations in the experimental threshold for identical stimulus parameters mentioned above.

Discussion

The complex-valued correlation coefficient model proposed here accounted for nearly all aspects of the psychophysical datasets examined in this study, with limitations discussed below. The modeled binaural sensitivity is directly proportional to the difference in the z-transformed complex correlation coefficient γ between target and reference. The bandpass filter is the only pre-processing stage necessary to account for these datasets. The following discussion will thus focus on these two components of the model.

In the first three experiments (Fig. 2a–c), no phase or time shifts were added to the signal. Consequently, the imaginary part of γ was always 0 so that γ equaled the real-valued correlation coefficient ρ. This means that the model would show identical results if it were based solely on ρ. The comprehensiveness and accuracy of ρ-based models for this kind of stimuli have been demonstrated previously².

The experiments shown in Fig. 2d–i, included ITDs or IPDs, so that γ was generally not real-valued. In these cases, models based on real-valued correlation alone need to include larger parts of the cross-correlation function ρ(τ)^2,3. Alternatively, as shown in this study, this kind of data can be explained entirely by the complex correlation coefficient. To better understand the underlying mechanism, Fig. 3a visualizes the z-transformed coefficients for threshold SNRs for the data of ref. ³. The illustrated complex space can be interpreted as a binaural feature space, where the distance between the reference and target is directly proportional to the binaural sensitivity ${d}_{{{{{{{{\rm{bin}}}}}}}}}^{\prime}$. It is apparent that this distance is determined by both the coherence, reflected in the length of the vector, and the mean IPD reflected in its angle. The relative contributions of length and angle depend on the specific stimulus condition. For conditions where the difference between the mean IPD of the noise and the tone equals 0 or π, the added tone cannot influence the mean IPD of the resulting signal. In these cases, binaural detection was based solely on a change in coherence. If the difference is π, the coherence change caused by the target is large, so the binaural cue is generally large as well. When the difference is 0, detection relied mostly on monaural cues, as visible in Fig. 3a: the vectors of reference and target are nearly equal at Δt = 1 ms. In this situation, the availability of binaural cues increases with decreasing noise coherence, as the added tone can then increase the coherence of the target relative to the reference (see Δt = 3 ms in Fig. 3b). The decrease in coherence with ITD is visible in the decreasing length of the reference vectors.

**Fig. 3: Vector visualization of the z-transformed correlation coefficients.**

Of the data simulated in the present study, only that of van der Heijden & Trahiotis³ and Langford & Jeffress¹⁹ include stimuli with mean IPD differences between noise and tone other than 0 or π. Only these intermediate differences cause a change in both coherence and angle of the target stimulus (Fig. 3). In these intermediate cases, it is particularly advantageous to use the complex plane of the z-transformed correlation coefficient z[γ] as a two-dimensional acoustic stimulus feature space. A key finding of the present study is that the acoustic feature space can be used directly as a perceptual feature space so that the distance between two stimuli in this space is proportional to the binaural sensitivity index ${d}_{{{{{{{{\rm{bin}}}}}}}}}^{\prime}$. The complex plane of z[γ] can thus be interpreted as a perceptually uniform space like, for example, the CIELAB color space that is commonly used to represent color difference sensitivity³³.

If z[γ] is interpreted as a perceptually uniform space, it should be possible to use the same space to explain related phenomena, such as ITD discrimination. For tones, ITDs are equivalent to IPDs. Since IPDs are reflected in the argument of γ, IPD discrimination sensitivity can be directly derived from Eq. (8):

$${{\Delta }}{\sigma }_{thr}=2\arcsin \left(\frac{{d}_{{{{{{{{\rm{bin}}}}}}}}}^{\prime}{\sigma }_{{{{{{{{\rm{bin}}}}}}}}}}{2{{{{{{\mathrm{arctanh}}}}}}}\,\left(\hat{\rho }\right)}\right).$$

(3)

Using the set of parameters summarized in Table 1. with ${d}_{{{{{{{{\rm{bin}}}}}}}}}^{\prime}=1$ resulted in IPD thresholds equivalent to ITDs in the range of 41 μs to 117 μs (median 60 μs); this is within the range of experimentally obtained thresholds at 500 Hz. For discrimination around zero ITD, experimental thresholds are on average a little lower³⁴, but for discrimination around π thresholds are above the model median³⁵.

As elaborated above, the proposed measure γ is equivalent to calculating two normalized correlation coefficients. Naturally, this assumption implies the existence of some form of neuronal normalization process, an assumption that is shared with the majority of the cross-correlation-based models. Mathematically, this normalization could take place using monaural information such as the activity of the auditory nerve¹³. It has, however, been noted that this process would have to be extremely precise³⁶. Alternatively, normalization could also be based on the activity in “anti-coincidence” detecting neurons such as those found in the lateral superior olive³⁷. The firing rate in these neurons behaves inversely to those that act as coincident detectors and increases with decreasing correlation. Comparison between anti-coincidence and coincidence-detecting neurons could thus be used for normalization. A third method would be to directly use the time course of IPD fluctuation. Instead of encoding the real and imaginary part of γ, this approach would encode the time-averaged IPD and the coherence. The ability of the auditory system to encode for the former is well established³⁸. To encode information equivalent to coherence, the auditory system could directly rely on the amount the IPD fluctuates around its mean. This IPD-fluctuation code would have the benefit of only requiring information about a single quantity—the IPD.

The neuronal substrate necessary to extract γ based on the two channels would depend on the underlying mechanism. Representing γ directly based on the values of coherence and mean IPD would require a long-term integration mechanism with two subsequent processing stages implementing neuronal equivalents to calculating modulus and argument. If γ would instead be represented indirectly via IPD fluctuations, neurons would have to be fast enough to follow these fluctuations. Neurons that do just this have indeed been described: IPD-sensitive neurons can encode the fast fluctuations by means of fast changes in their response rate^39,40,41. This second mechanism would also be in-line with a recent neuroimaging study which reported an elevated cortical load when subjects were presented with low-coherence stimuli⁴². The increased load could result from the brain having to deal with localizing a sound source based on increasingly fluctuating IPDs.

In cases where the noise bandwidth is considerably larger than the peripheral filter bandwidth, the coherence function ∣γ(τ)∣ is fully determined by the power spectrum of the peripheral filter (see Eqs. (4) and (5)). By substituting τ with the ITD, the same function can describe the ITD dependence of ∣γ∣. In the absence of a delay line, the decline of the binaural benefit with masker ITD is therefore a direct cause of the bandwidth after filtering. Langford & Jeffress were the first to describe this relation, and coarsely estimated that a 100-Hz filter bandwidth explains the ITD dependence of their data (see Fig. 2d)¹⁹. With more quantitative analysis, and assuming a triangular filter, Rabiner et al.²⁰ found that their data (see Fig. 2f) was best accounted for by a filter with 85 Hz equivalent rectangular bandwidth (ERB). This value is close to the 79 Hz ERB of the 4th-order gammatone filter that was used in the present study (see “Methods”). The ERB was fixed at 79 Hz to reduce the number of free model parameters. This bandwidth is a typical estimation for the bandwidth of the monaural periphery⁴³ and has also been employed in other binaural detection models^2,44. The same filter bandwidth is also responsible for the change in threshold SNR with stimulus bandwidth, as seen in Fig. 2c, where the point at which the simulated threshold SNR starts to decrease is determined primarily by the bandwidth.

While the proposed model was able to account for nearly all characteristics of the datasets shown in Fig. 2, some limitations do remain.

The experiment shown in Fig. 2h revealed differences between the threshold SNRs for in-phase tones in anti-phasic noise (N_πS₀) and anti-phasic tones in in-phase noise (N₀S_π): thresholds in the (N_πS₀) condition were about 4 dB higher than for N₀S_π. The same trend can be seen in the data shown in Fig. 2b and is consistent across other studies^18,30. This 4 dB difference between the N_πS₀ and the N₀S_π condition can not be accounted for by the presented model. In the model, the two conditions differ only in their mean IPD (that is, in the argument of the correlation coefficient $\arg (\gamma )$), which is zero for the N₀S_π condition and π for N_πS₀. By using γ to predict thresholds directly, the model assumes that the sensitivity to IPDs does not depend on the mean IPD, so predictions for the two conditions are the same. The mean IPD-dependent difference in the experimental thresholds also reflects the previously mentioned difference in sensitivity to changes in IPD, which is lower around π than around 0³⁵. Differences in neuronal coding precision can explain both changes in sensitivity. The responses of IPD-sensitive neurons in the auditory brainstem and midbrain usually show their strongest change around IPDs of zero; this has been suggested to facilitate the accurate encoding of IPDs in this region^8,45. The shape of the IPD-rate functions of these neurons has also been shown to increase IPD sensitivity near zero IPD⁴⁶. The influence of non-sinusoidal IPD-rate functions on the proposed coding mechanism is visualized in Fig. 4. Figure 4a shows asymmetric IPD-rate functions that exhibit a steeper slope towards Δφ = 0 than towards Δφ = ±π. Visualizing these functions in the complex plane (Fig. 4b) results in IPDs that are unevenly distributed with IPDs being more spread out around 0 than around ±π. Consequently, the angle of the complex pointer within this circle will change faster for IPDs around 0 than around π. This is also visualized in Fig. 4c, which shows the sensitivity to changes in IPD calculating as the normalized derivative of the pointers angle with respect to IPD. Including these asymmetric IPD-rate functions would result in different sensitivities to changes in IPD as a function of reference-IPD and in γ to be less sensitive to IPD fluctuations around γ = π than around 0. Differences between N₀S_π and N_πS₀ are larger at low frequencies and get smaller with increasing frequency¹⁸. If these differences result from non-sinusoidal IPD-rate functions, then one would also expect the IPD-rate functions to become more sinusoidal with increasing frequency. This assumption is indeed supported by physiological data^47,48 which shows increasingly harmonic ITD-rate functions with increasing frequency.

**Fig. 4: Visualization of the impact of non-sinusoidal IPD-rate functions.**

Other limitations in explaining experimental data arose from the decision to minimize the model’s complexity. In the N₀S₀ condition shown in Fig. 2h, the model deviates considerably from the data. This deviation results from the sample-to-sample variability of the noise energy, which changes with stimulus bandwidth³². This effect cannot be accounted for by the current model implementation, which is based on infinitely long signals. However, a numeric implementation based on finite signal waveforms should account for this phenomenon. Another dataset that has historically been difficult to account for, employs binaural unmasking in reproducible noise token^49,50. These experiments tested tone-in-noise detection using the same reproducible token across subjects. Like other models based on the average stimulus, we expect the presented model only to explain the average performance but not the variability between individual tokens.

The current model also neglects peripheral processing apart from bandpass filtering. Without this pre-processing, the model cannot account for effects that are associated with the periphery^51,52,53. The lack of realistic peripheral processing also limits the ability of the model to account for differences in binaural detection with frequency. IPD and thus ITD sensitivity rely on the auditory nerves’ activity to lock onto the stimulus phase (so-called phase-locking). Phase locking declines with increasing frequency⁵⁴ which is one of the reasons why sensitivity to ITDs and binaural detection thresholds worsen with increasing frequency^18,34. The loss of phase-locking could be implemented by a low-pass filter included in a model of peripheral transduction. This filter would remove phase information at higher frequencies while keeping the waveform envelope. Extending the current model with a more detailed periphery should thus help to account for the effects associated with the periphery and the reduction in phase sensitivity at higher frequencies. The current implementation also uses only one single frequency channel. Other experiments, however, such as those employing spectrally complex maskers or maskers constructed from two noise sources with different ITDs, might require a multi-channel implementation of the model. The success of this kind of model extension has recently been demonstrated⁵⁵.

Our goal was to test whether the two-channel model that was proposed to underlie ITD sensitivity in mammals could also be used to account for binaural unmasking. By introducing a new mathematical representation of the two-channel model, the complex correlation coefficient, we revealed a direct connection to the amount of IPD fluctuation previously proposed to underlie binaural detection. Using a computational model, we demonstrated that the complex correlation coefficient, and thus the two-channel model, is indeed able to accurately account for many central aspects of tone-in-noise detection. Compared to the previously best-performing model class, the Jeffress model, our approach is better in line with mammalian physiologic data and represents a considerable simplification as well as a reduction of degrees of freedom.

Methods

The following will introduce the underlying mathematics that were used for the model implementation. A Python implementation of the model, as well as the scripts for deriving and plotting predictions for all experiments, are openly available⁵⁶.

Deriving the complex correlation coefficient for tone-in-noise detection experiments

Throughout this study, the complex correlation coefficient was calculated from the cross-spectral power density (CSPD) S(ω) of the signals. Following the Wiener–Khinchine theorem, the cross-correlation function of two signals is equivalent to the inverse Fourier transform of their CSPD⁵⁷. Consequently, the normalized complex correlation coefficient can be calculated as:

$$\gamma =\frac{\int\nolimits_{-\infty }^{\infty }S(\omega ){{{{{{{\rm{d}}}}}}}}\omega }{\sqrt{\int\nolimits_{-\infty }^{\infty }| {S}_{ll}(\omega )| {{{{{{{\rm{d}}}}}}}}\omega \,\int\nolimits_{-\infty }^{\infty }| {S}_{rr}(\omega )| {{{{{{{\rm{d}}}}}}}}\omega }}.$$

(4)

Where S_ll(ω) and S_rr(ω) are the power spectral densities of l(t) and r(t). This CSPD-based approach has the benefit of directly resulting in the expected value of γ as opposed to a waveform-based implementation where the coherence would have to be estimated as the mean of several instances of the signal waveform.

Given the analytical representations of the left and right signals l(t) and r(t), the effective CSPD S(ω) was composed of two parts: the CSPD S_lr of l(t), r(t) and a transfer function H(ω) used to account for the bandpass properties of the auditory periphery:

$$S(\omega )={S}_{lr}| H(\omega ){| }^{2}.$$

(5)

The CSPD S_lr is directly determined by the stimulus used in the respective experiment and can be formalized as:

$${S}_{lr}(\omega )=\left\{\begin{array}{l}\frac{{\rho }_{{{{{{{{\rm{N}}}}}}}}}}{{{\Delta }}\omega }{e}^{i{{\Delta }}\varphi (\omega )},\,\,{\omega }_{0}-\frac{{{\Delta }}\omega }{2}\le \omega \le {\omega }_{0}+\frac{{{\Delta }}\omega }{2}\quad \\ \frac{{\rho }_{{{{{{{{\rm{N}}}}}}}}}}{{{\Delta }}\omega }{e}^{i{{\Delta }}\varphi ({\omega }_{0})}+{{{{{{{\rm{SNR}}}}}}}}{e}^{i{{\Delta }}\psi },\hfill\omega ={\omega }_{0}\quad \\ 0,\hfill{{{{{{{\rm{otherwise}}}}}}}}\hfill\qquad\quad\end{array}\right.$$

(6)

where Δω is the bandwidth of a rectangular noise band centered around ω₀ which, in all cases was set to ω₀/2π = 500 Hz. Δφ(ω) is the IPD spectrum of the noise while Δψ is the IPD of the tone. Both were set according to the conditions used in the respective experiment, as summarized in Table 1. For tone-in-noise detection experiments, γ is independent of the absolute level and only depends on the SNR. Consequently, the noise energy was set to one, so that the energy of the tone equals the SNR. Some experiments also made use of noises with different interaural correlations ρ_N. The CSPD only contains interaurally coherent energy so that, in these cases, the CSPD of the noise is scaled by ρ_N. Assuming the power spectrum of a gammatone filters to account for the bandpass characteristics of the auditory periphery, ∣H(ω)∣² was approximated by:

$$| H(\omega ){| }^{2}={\left[1+\frac{{(\omega -{\omega }_{0})}^{2}}{4{\pi }^{2}b}\right]}^{-n},b=\frac{{{{{{{{\rm{ERB}}}}}}}}(n-1){!}^{2}}{\pi (2n-2)!\,{2}^{(2-2n)}}$$

(7)

where n is the order of the filter and ERB its equivalent rectangular bandwidth⁵⁸. In this study, the filter was centered at 500 Hz with the filter order set to n = 4 and the ERB to 79 Hz.

Modeling the detection performance

A signal-detection model with two branches, one binaural and one monaural, was used to derive tone-in-noise detection thresholds. The first branch calculates the binaural sensitivity index ${d}_{{{{{{{{\rm{bin}}}}}}}}}^{\prime}$ based on the difference of the complex correlation coefficients γ_r of a reference signal and of a target signal γ_t:

$${d}_{{{{{{{{\rm{bin}}}}}}}}}^{\prime}=\frac{\left|z[\hat{\rho }\,{\gamma }_{{{{{{{{\rm{r}}}}}}}}}]-z[\hat{\rho }\,{\gamma }_{{{{{{{{\rm{t}}}}}}}}}]\right|}{{\sigma }_{{{{{{{{\rm{bin}}}}}}}}}}.$$

(8)

Here, z[•] symbolizes the Fisher’s z-transform applied to the modulus of the input while leaving the argument unchanged. This transform normalizes the sampling distribution of the coherence⁵⁹. Direct use of this transformation would result in infinite sensitivity to changes from a coherence of one so that the model parameter $\hat{\rho }$ was introduced. Functionally, this is equivalent to adding uncorrelated noise to the two input signals to account for processing errors on the auditory pathway⁶⁰. The sensitivity of the binaural path is adjusted by the model parameter σ_bin.

The monaural branch offers sensitivity to increases in stimulus energy between reference and target. As the power of the noise is held constant, this increase is directly proportional to the SNR so that the monaural sensitivity index ${d}_{{{{{{{{\rm{mon}}}}}}}}}^{\prime}$ is calculated as:

$${d}_{{{{{{{{\rm{mon}}}}}}}}}^{\prime}=\frac{{{{{{{{{\rm{SNR}}}}}}}}}_{{{{{{{{\rm{eff}}}}}}}}}}{{\sigma }_{{{{{{{{\rm{mon}}}}}}}}}},$$

(9)

where SNR_eff is the effective SNR after peripheral bandpass filtering and σ_mon is used to adjust the sensitivity of the monaural pathway.

Assuming a linear independent combination of the monaural and the binaural information, the monaural and binaural sensitivity indices are then combined to the overall sensitivity index:

$$d^{\prime} =\sqrt{{d^{\prime} }_{{{{{{{{\rm{bin}}}}}}}}}^{2}+{d^{\prime} }_{{{{{{{{\rm{mon}}}}}}}}}^{2}}.$$

(10)

All signals that were used in this study are defined by the noise-IPD-spectrum Δφ(ω), the tone IPD Δψ, the noise correlation ρ_N and the noise bandwidth Δω. With these parameters defined, ${d}_{{{{{{{{\rm{mon}}}}}}}}}^{\prime}$ and ${d}_{{{{{{{{\rm{bin}}}}}}}}}^{\prime}$ and thus $d^{\prime}$ only depend on the SNR. Finding the SNR that results in the $d^{\prime}$ value corresponding to the experimentally defined threshold was solved by using the SLSQP minimization algorithm as implemented in the Python package scipy⁶¹.

Statistics and reproducibility

The model performance in accounting for experimental data was evaluated using the coefficient of determination R² calculated as:

$${R}^{2}=1-\frac{\mathop{\sum}\nolimits_{i}{({y}_{i}-{f}_{i})}^{2}}{\mathop{\sum}\nolimits_{i}{({y}_{i}-\bar{y})}^{2}},$$

(11)

where y_i is the ith experimental data point and f_i the associated model result. $\bar{y}$ is the mean overall experimental data points.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All modeling results are available as CSV files from zenodo with the identifier https://doi.org/10.5281/zenodo.7084922⁶².

Code availability

The Python source code for the model, including scripts for deriving all results, are available on zenodo with the identifier https://doi.org/10.5281/zenodo.5643429⁵⁶.

References

Jeffress, L. A. A place theory of sound localization. J. Comparative Physiol. Psychol. 41, 35–39 (1948).
Article CAS Google Scholar
Bernstein, L. R. & Trahiotis, C. An interaural-correlation-based approach that accounts for a wide variety of binaural detection data. J. Acoust. Soc. Am. 141, 1150–1160 (2017).
Article PubMed Google Scholar
van der Heijden, M. & Trahiotis, C. Masking with interaurally delayed stimuli: The use of “internal” delays in binaural detection. J. Acoust. Soc. Am. 105, 388–399 (1999).
Article PubMed Google Scholar
Colburn, H. S. Theory of binaural interaction based on auditory-nerve data. i. general strategy and preliminary results on interaural discrimination. J. Acoust. Soc. Am. 54, 1458–1470 (1973).
Article CAS PubMed Google Scholar
Colburn, H. S. Theory of binaural interaction based on auditory-nerve data. ii. detection of tones in noise. J. Acoust. Soc. Am. 61, 525–533 (1977).
Article CAS PubMed Google Scholar
Carr, C. E. & Konishi, M. Axonal delay lines for time measurement in the owl’s brainstem. Proc. Natl Acad. Sci. USA 85, 8311–8315 (1988).
Article CAS PubMed PubMed Central Google Scholar
Stern, R. M. & Shear, G. D. Lateralization and detection of low-frequency binaural stimuli: effects of distribution of internal delay. J. Acoust. Soc. Am. 100, 2278–2288 (1996).
Article Google Scholar
McAlpine, D., Jiang, D. & Palmer, A. R. A neural code for low-frequency sound localization in mammals. Nat. Neurosci. 4, 396–401 (2001).
Article CAS PubMed Google Scholar
Joris, P. X., de Sande, B. V., Louage, D. H. & van der Heijden, M. Binaural and cochlear disparities. Proc. Natl Acad. Sci. USA 103, 12917–12922 (2006).
Article CAS PubMed PubMed Central Google Scholar
Marquardt, T. & Mcalpine, D. A π-limit for coding itds: Implications for binaural models. in Hearing—From Sensory Processing to Perception (eds Kollmeier, B. et al.) Ch. 44, 407–416 (Springer, 2007).
Dietz, M., Ewert, S. D., Hohmann, V. & Kollmeier, B. Coding of temporally fluctuating interaural timing disparities in a binaural processing model based on phase differences. Brain Res. 1220, 234–245 (2008).
Article CAS PubMed Google Scholar
Takanen, M., Santala, O. & Pulkki, V. Visualization of functional count-comparison-based binaural auditory model output. Hearing Res. 309, 147–163 (2014).
Article Google Scholar
Encke, J. & Hemmert, W. Extraction of inter-aural time differences using a spiking neuron network model of the medial superior olive. Front. Neurosci. 12. https://doi.org/10.3389/fnins.2018.00140 (2018).
Bouse, J., Vencovský, V., Rund, F. & Marsalek, P. Functional rate-code models of the auditory brainstem for predicting lateralization and discrimination data of human binaural perception. J. Acoust. Soc. Am. 145, 1–15 (2019).
Article PubMed Google Scholar
Bernstein, L. R. & Trahiotis, C. Binaural detection as a joint function of masker bandwidth, masker interaural correlation, and interaural time delay: empirical data and modeling. J. Acoust. Soc. Am. 148, 3481–3488 (2020).
Article PubMed Google Scholar
Bernstein, L. R. & Trahiotis, C. Accounting for binaural detection as a function of masker interaural correlation: Effects of center frequency and bandwidth. J. Acoust. Soc. Am. 136, 3211–3220 (2014).
Article PubMed Google Scholar
Cherry, E. C. Some experiments on the recognition of speech, with one and with two ears. J. Acoust. Soc. Am. 25, 975–979 (1953).
Article Google Scholar
Hirsh, I. J. The influence of interaural phase on interaural summation and inhibition. J. Acoust. Soc. Am. 20, 536–544 (1948).
Article Google Scholar
Langford, T. L. & Jeffress, L. A. Effect of noise crosscorrelation on binaural signal detection. J. Acoust. Soc. Am. 36, 1455–1458 (1964).
Article Google Scholar
Rabiner, L. R., Laurence, C. L. & Durlach, N. I. Further results on binaural unmasking and the EC model. J. Acoust. Soc. Am. 40, 62–70 (1966).
Article CAS PubMed Google Scholar
Marquardt, T. & McAlpine, D. Masking with interaurally "double-delayed” stimuli: the range of internal delays in the human brain. J. Acoust. Soc. Am. 126, EL177–EL182 (2009).
Article PubMed Google Scholar
Zwicker, E. & Henning, G. The four factors leading to binaural masking-level differences. Hearing Res. 19, 29–47 (1985).
Article CAS Google Scholar
Zurek, P. M. Probability distributions of interaural phase and level differences in binaural detection stimuli. J. Acoust. Soc. Am. 90, 1927–1932 (1991).
Article CAS PubMed Google Scholar
Goupell, M. J. & Hartmann, W. M. Interaural fluctuations and the detection of interaural incoherence: bandwidth effects. J. Acoust. Soc. Am. 119, 3971–3986 (2006).
Article PubMed Google Scholar
Saleh, B. Fundamentals of Photonics, Ch. 11 (Wiley-Interscience, 2007).
Just, D. & Bamler, R. Phase statistics of interferograms with applications to synthetic aperture radar. Appl. Optics 33, 4361 (1994).
Article CAS Google Scholar
Shin, K. Fundamentals of Signal Processing for Sound and Vibration Engineers (John Wiley & Sons, 2008).
Google Scholar
Blauert, J. Spatial Hearing : the Psychophysics of Human Sound Localization (MIT Press, 1983).
Google Scholar
Pollack, I. & Trittipoe, W. J. Binaural listening and interaural noise cross correlation. J. Acoust. Soc. Am. 31, 1250–1252 (1959).
Article Google Scholar
Robinson, D. E. & Jeffress, L. A. Effect of varying the interaural noise correlation on the detectability of tonal signals. J. Acoust. Soc. Am. 35, 1947–1952 (1963).
Article Google Scholar
Domnitz, R. H. & Colburn, H. S. Analysis of binaural detection models for dependence on interaural target parameters. J. Acoust. Soc. Am. 59, 598–601 (1976).
Article CAS PubMed Google Scholar
van de Par, S. & Kohlrausch, A. Dependence of binaural masking level differences on center frequency, masker bandwidth, and interaural parameters. J. Acoust. Soc. Am. 106, 1940–1947 (1999).
Article PubMed Google Scholar
International Organization for Standardization. Colorimetry – Part 4: CIE 1976 L*a*b* Colour Space (Standard, International Organization for Standardization, Geneva, CH, 2019).
Brughera, A., Dunai, L. & Hartmann, W. M. Human interaural time difference thresholds for sine tones: the high-frequency limit. J. Acoust. Soc. Am. 133, 2839 (2013).
Article PubMed PubMed Central Google Scholar
Yost, W. A. Discriminations of interaural phase differences. J. Acoust. Soc. Am. 55, 1299 (1974).
Article CAS PubMed Google Scholar
van de Par, S., Trahiotis, C. & Bernstein, L. R. A consideration of the normalization that is typically included in correlation-based models of binaural detection. J. Acoust. Soc. Am. 109, 830–833 (2001).
Article PubMed Google Scholar
Tollin, D. J. & Yin, T. C. T. Interaural phase and level difference sensitivity in low-frequency neurons in the lateral superior olive. J. Neurosci. 25, 10648–10657 (2005).
Article CAS PubMed PubMed Central Google Scholar
Grothe, B., Pecka, M. & McAlpine, D. Mechanisms of sound localization in mammals. Physiol. Rev. 90, 983–1012 (2010).
Article CAS PubMed Google Scholar
Joris, P. X., van de Sande, B., Recio-Spinoso, A. & van der Heijden, M. Auditory midbrain and nerve responses to sinusoidal variations in interaural correlation. J. Neurosci. 26, 279–289 (2006).
Article CAS PubMed PubMed Central Google Scholar
Joris, P. X. Neural binaural sensitivity at high sound speeds: single cell responses in cat midbrain to fast-changing interaural time differences of broadband sounds. J. Acoust. Soc. Am. 145, EL45–EL51 (2019).
Article PubMed PubMed Central Google Scholar
Siveke, I., Ewert, S. D., Grothe, B. & Wiegrebe, L. Psychophysical and physiological evidence for fast binaural processing. J. Neurosci. 28, 2043–2052 (2008).
Article CAS PubMed PubMed Central Google Scholar
Luke, R., Innes-Brown, H., Undurraga, J. A. & McAlpine, D. Human cortical processing of interaural coherence. iScience 25, 104181 (2022).
Article PubMed PubMed Central Google Scholar
Glasberg, B. R. & Moore, B. C. Derivation of auditory filter shapes from notched-noise data. Hearing Res. 47, 103–138 (1990).
Article CAS Google Scholar
Breebaart, J., van de Par, S. & Kohlrausch, A. Binaural processing model based on contralateral inhibition. i. model structure. J. Acoust. Soc. Am. 110, 1074–1088 (2001).
Article CAS PubMed Google Scholar
Brand, A., Behrend, O., Marquardt, T., McAlpine, D. & Grothe, B. Precise inhibition is essential for microsecond interaural time difference coding. Nature 417, 543–7 (2002).
Article CAS PubMed Google Scholar
Shackleton, T. M., Skottun, B. C., Arnott, R. H. & Palmer, A. R. Interaural time difference discrimination thresholds for single neurons in the inferior colliculus of guinea pigs. J. Neurosci. 23, 716–724 (2003).
Article CAS PubMed PubMed Central Google Scholar
Yin, T. C. & Kuwada, S. Binaural interaction in low-frequency neurons in inferior colliculus of the cat. iii. effects of changing frequency. J. Neurophysiol. 50, 1020–1042 (1983).
Article CAS PubMed Google Scholar
McAlpine, D. Creating a sense of auditory space. J. Physiol. 566, 21–28 (2005).
Article CAS PubMed PubMed Central Google Scholar
Mao, J. & Carney, L. H. Binaural detection with narrowband and wideband reproducible noise maskers. iv. models using interaural time, level, and envelope differences. J. Acoust. Soc. Am. 135, 824–837 (2014).
Article PubMed PubMed Central Google Scholar
Davidson, S. A., Gilkey, R. H., Colburn, H. S. & Carney, L. H. An evaluation of models for diotic and dichotic detection in reproducible noises. J. Acoust. Soc. Am. 126, 1906 (2009).
Article PubMed PubMed Central Google Scholar
Eddins, D. A. & Barber, L. E. The influence of stimulus envelope and fine structure on the binaural masking level difference. J. Acoust. Soc. Am. 103, 2578–2589 (1998).
Article CAS PubMed Google Scholar
Hall, J. W., Grose, J. H. & Hartmann, W. M. The masking-level difference in low-noise noise. J. Acoust. Soc. Am. 103, 2573–2577 (1998).
Article PubMed Google Scholar
Bernstein, L. R., van de Par, S. & Trahiotis, C. The normalized interaural correlation: accounting for nosπ thresholds obtained with gaussian and "low-noise” masking noise. J. Acoust. Soc. Am. 106, 870–876 (1999).
Article CAS PubMed Google Scholar
Johnson, D. H. The relationship between spike rate and synchrony in responses of auditory-nerve fibers to single tones. J. Acoust. Soc. Am. 68, 1115–1122 (1980).
Article CAS PubMed Google Scholar
Eurich, B., Encke, J., Ewert, S. D. & Dietz, M. Lower interaural coherence in off-signal bands impairs binaural detection. J. Acoust. Soc. Am. 151, 3927–3936 (2022).
Article PubMed Google Scholar
Encke, J. & Dietz, M. Model code for: a hemispheric two-channel code accounts for binaural unmasking in humans. https://doi.org/10.5281/zenodo.5643429 (2022).
Khintchine, A. Korrelationstheorie der stationären stochastischen prozesse. Mathematische Annalen 109, 604–615 (1934).
Article Google Scholar
Darling, A. Properties and Implementation of the Gammatone Filter: A Tutorial (Speech Hearing and Language, Work in Progress, University College London, Department of Phonetics and Linguistics, 1991).
McNemar, Q. Psychological statistics. In Factors Which Affect the Correlation Coefficent, 4th edition, 137–139 (John Whiley and Sons inc., 1966).
Lüddemann, H., Riedel, H. & Kollmeier, B. Electrophysiological and psychophysical asymmetries in sensitivity to interaural correlation steps. Hearing Res. 256, 39–57 (2009).
Article Google Scholar
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
Article CAS PubMed PubMed Central Google Scholar
Encke, J. & Dietz, M. Dataset for: a hemispheric two-channel code accounts for binaural unmasking in humans. https://doi.org/10.5281/zenodo.7084922 (2022).

Download references

Acknowledgements

This work was supported by the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Programme grant agreement No. 716800 (ERC Starting Grant to M.D.)

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Medical Physics and Acoustics, University of Oldenburg, Oldenburg, Germany
Jörg Encke & Mathias Dietz
Cluster of Excellence ‘Hearing4all’, University of Oldenburg, Oldenburg, Germany
Jörg Encke & Mathias Dietz

Authors

Jörg Encke
View author publications
You can also search for this author in PubMed Google Scholar
Mathias Dietz
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.E. and M.D. designed the research; J.E. conducted calculations, analyzed the data, and produced figures; J.E. and M.D wrote the paper.

Corresponding author

Correspondence to Jörg Encke.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks Rodrigo Laje and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Enzo Tagliazucchi, Joao Manuel de Sousa Valente and Luke R. Grinham. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Reporting Summary-New

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Encke, J., Dietz, M. A hemispheric two-channel code accounts for binaural unmasking in humans. Commun Biol 5, 1122 (2022). https://doi.org/10.1038/s42003-022-04098-x

Download citation

Received: 25 May 2022
Accepted: 06 October 2022
Published: 22 October 2022
DOI: https://doi.org/10.1038/s42003-022-04098-x

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.