Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# Robustness of cortical and subcortical processing in the presence of natural masking sounds

## Abstract

Processing of ethologically relevant stimuli could be interfered by non-relevant stimuli. Animals have behavioral adaptations to reduce signal interference. It is largely unexplored whether the behavioral adaptations facilitate neuronal processing of relevant stimuli. Here, we characterize behavioral adaptations in the presence of biotic noise in the echolocating bat Carollia perspicillata and we show that the behavioral adaptations could facilitate neuronal processing of biosonar information. According to the echolocation behavior, bats need to extract their own signals in the presence of vocalizations from conspecifics. With playback experiments, we demonstrate that C. perspicillata increases the sensory acquisition rate by emitting groups of echolocation calls when flying in noisy environments. Our neurophysiological results from the auditory midbrain and cortex show that the high sensory acquisition rate does not vastly increase neuronal suppression and that the response to an echolocation sequence is partially preserved in the presence of biosonar signals from conspecifics.

## Introduction

For understanding how nervous systems work, it is necessary to analyze neuronal processing of natural and ethologically relevant stimuli1,2. Since animals usually encounter a mixture of different stimuli, neuronal processing of ethologically relevant stimuli could be masked by the presence of irrelevant stimuli. Throughout this manuscript, we will refer to the relevant stimuli as “targets” and the irrelevant stimuli as “maskers”. Although, many behavioral adaptations, from here on called jamming avoidance responses (JARs) have been reported to reduce or even avoid masking3,4,5,6,7,8,9, it remains largely unexplored if neuronal processing of targets profit from the JARs. The present study reports JARs in the frugivorous bat Carollia perspicillata and tested if the JARs could facilitate neuronal processing of echolocation sequences in the presence of masking sounds.

Bats orientate acoustically by emitting biosonar calls that are reflected in surrounding objects10,11,12. For computing the distance to objects, bats measure the time interval from call emission to echo arrival, also called echo delay. The echo delay decreases linearly with decreasing distances. Since bats need to integrate call and echo information, the computation of distances could be interfered by acoustic maskers from call onset to echo arrival13. Neurons involved in distance processing, also called delay tuned neurons, integrate call and echo information to respond to certain echo delays14,15,16,17,18,19,20. Thus, bats represent good animal models to test if JARs could facilitate neuronal processing of distance information. Neurophysiologically, we focused on the inferior colliculus (IC) and the auditory cortex (AC) because most data from delay tuned neurons were obtained by recording neuronal signals from these two brain regions.

Our behavioral results show that C. perspicillata adapts its call emission pattern in the presence of maskers. The bats increase the tendency of emitting grouped calls and they reduce the call intervals (CIs) within call groups. High stimulus rates usually evoke neuronal suppression21,22,23 which has also been shown in C. perspicillata24,25,26. Thus, we tested if the increase in stimulus rate may also increase neuronal suppression which could impede target detection at the neuronal level. Our results show that in the IC, the neuronal suppression did not vastly increase and the neuronal tuning to the target was barely affected by maskers. In contrast, the suppression increased in the AC. But, target detection was higher in the AC than in the IC due to a higher neuronal selectivity in the AC. In summary, our results indicate that the JARs, reported here for C. perspicillata, could facilitate neuronal processing of the targets in the presence of maskers.

## Results

In a flight room (Fig. 1A), the echolocation behavior from eight bats was monitored in the absence (training trials) and presence (test trials) of maskers. A wall, made out of foam, separated the room into two acoustically isolated sides. In training trials, under non-masking conditions, the hand released bats had to land on one out of two platforms at the end of the room (Movie S1). Two ultrasound sensitive microphones, each positioned behind a platform, recorded the call emissions. In test trials, under masking conditions, a masker was presented from one side of the room, whereas the contralateral side was silent. The masker represents a sequence of echolocation calls that was recorded during the first test trial from the tested bat (for details see methods). The bats preferred to land on the platform of the non-masking side (Paired t-test: p = 0.01; Fig. 1B; Movie S2). But, they could still echolocate and land on the platform at the masking side (Movie S3).

### Masker effect on single-unit responses in the inferior colliculus and auditory cortex

Acoustic rates higher than 40 Hz usually evoke neuronal suppression which is also the case for the IC25 and AC26,41 of C. perspicillata. Thus, the bat may only profit from the previously described JAR if the neuronal response to the target is not suppressed in the presence of the masker. The target was an echolocation sequence that mimicked a stimulus scenario in which the bat flies towards an object. Robustness of neuronal tuning to the target was tested by mixing the target with two different maskers (for details see methods) which leads to “mixture conditions”. The “single animal mixture” simulates a situation where two bats fly in the same room. “The colony mixture” simulates a situation in which many bats fly together. Note that the single animal masker condition resembles the situation that bats had to encounter in the behavioral experiments.

To quantify the effects of the masker on distance processing, we compared the “median echo-delays” in response to the target and the mixture. Based on the spike times, each spike was associated to a particular echo-delay of the target. The median of those delays represent the “median delay” of the unit. In response to the single animal mixture, the median delay did not change for 28.6% and 33% of the collicular and cortical units, respectively (black bars in Fig. 3A and B at Δ median delay = 0). In the AC, the colony masker evoked a stronger median delay shift than the single animal masker (Wilcoxon signed rank test: p < 10−5; Fig. 3C). Similar results were obtained by considering the “best delay”, represented by the echo delay evoking the peak in the PSTH, instead of the median delay (Fig. S4CE). Note that the median delay usually shifted towards longer delays in the mixture situation. The median delay was more robust and shifted less in the IC than in the AC (Kruskal-Wallis test and Dunn’s multiple comparison post-hoc test: p < 10−5; Fig. 3C).

To quantify whether the target is encoded in the responses to the mixtures, we calculated d′ values with equations from the signal detection theory for each unit (IC Fig. 3D; AC Fig. 3E42,43). D’ values represent differences of the spike rate obtained in response to the masker and the mixture. High d′ values either arise from a response to the target or from a response caused by an integration of masker and target. The time course of d′ values was obtained by calculating d′ values for each call-echo element of the target (for details see methods and Fig. S3). To exclude spike rate differences that putatively come from integration of masker and target, we considered only d′ values from call-echo elements that evoke a spike rate of at least 50% of the maximum spike rate in response to the target. The sum of these d′ values represent the cumulative d’ for each unit. High cumulative d′ values indicate strong responses to the target in the mixture condition. The data showed that cumulative d′ values were higher for the single animal than for the colony masker (Wilcoxon signed rank test: p < 10−5; Fig. 3F) and they were also higher in the AC than in the IC (Kruskal-Wallis test and Dunn’s multiple comparison post-hoc test: p < 10−5; Fig. 3F). We got similar results when considering all bins of the PSTH for calculating cumulative d′ values (Fig. S4FH). To summarize, our data shows that there exist different strategies for information representation in noisy environments along the colliculo-cortical axis. Masker induced median delay shifts are weaker in the IC than in the AC. At the same time, target representation, as calculated by d′ values, is higher in the AC than in the IC.

### Masker effect on cortical target-distance map

In C. perspicillata’s AC, delay tuned neurons are topographically organized along the rostro-caudal axis11,44. Positioning multi-electrodes along the topographic gradient allows simultaneous characterization of long and short delay tuned neurons at caudal and rostral positions, respectively26,45, (Fig. 4A). We analyzed neuronal tuning in twelve cortical maps, eight from the left and five from the right cortex. Response patterns from two representative cortical maps are shown in Figs 4B–F and S5A–E. The topography was more distinct in response to the target (Figs 4B and S5A) than to the single animal (Figs 4D and S5C) and the colony mixture (Figs 4F and S5E).

The median delays plotted against the electrode position give an overview of the topographic changes induced by the masker (Figs 4G and S5F). In the presence of the colony masker, the topographic maps shifted towards longer delays than in the absence of the masker. Changes in neuronal tuning indicated by changes in delay tuning, response strengths at best delay, and cumulative d’ were independent of the electrode position (Fig. S5G–J).

The direction of the topographic gradient can be read out by the sum of the median delay slopes (Σ slopes; Fig. 4H). Negative Σ slopes signal a decrease of median delay from caudal to rostral and vice versa. In response to the target, the median delays decrease from caudal to rostral 9 out of 12 maps (target maps). Median delays from the three remaining target maps did not change along the rostro-caudal axis. Five single animal and three colony mixture maps, changed their direction of the gradient and had positive Σ slopes.

The smoothness of a topographic map is represented by the absolute slopes between adjacent electrodes. Abrupt delay shifts result in high absolute slopes, or rough topographic maps. Subtle delay shifts result in low absolute slopes, or smooth topographic maps. To identify whether mixture maps were rougher or smoother than target maps, the sum of the absolute slopes (Σ abs slopes) in response to the target was subtracted from Σ abs slopes in response to the mixture (Fig. 4I). Positive values, along the y-axis, indicate a smoother map in response to the mixture than to the target and vice versa. Map roughness increased in 58% (black numbers in Fig. 4I) and 67% (green numbers in Fig. 4I) of the maps in the single animal and colony mixture, respectively. Rougher maps are partially explained by an increase in the delay range that the map covers. The latter is indicated by a linear correlation between the delay range and the absolute slope (Pearson: r = 0.79, p < 10−5, Fig. 4I). In most cases, an increase of the delay range is a result of increasing the maximum delay instead of a decrease in the minimum delay (Fig. 4J). Cumulative d′ values were calculated and plotted for each map (Fig. 4K). The d′ values for the maps were significantly higher in the single animal masker than in the colony masker, indicating a higher target detection ability at the map level in the single animal mixture condition (Paired t-test: p = 0.005).

## Discussion

Many JARs have been reported in different species3,7,8,9. Generally spoken, acoustic signals are modified by the animals to make them distinct from the maskers. The validity of the JARs with respect to a putative facilitation of neuronal processing of targets remains largely unexplored. In the present study, C. perspicillata shows a JAR that is based on controlling the time points of call emission. Such JARs have been discussed in different animals, like crickets46, frogs47,48, electric fish49, birds50,51, bats29,39,52,53, and monkeys54. In response to the maskers, C. perspicillata emits tight groups of calls. The fact that 60% of the calls were emitted in groups, independent from the presence of the masker, shows the importance of emitting call groups during echolocation. In bats, the emission of call groups has been associated with a high complexity of orientation tasks30,33,34,35,36,39,40. Behavioral results in the frugivorous bat Phyllostomus discolor also show an increase of call grouping in the presence of maskers29, corresponding to the present results from C. perspicillata.

How could call grouping facilitate echolocation in the presence of maskers? For illustration, we consider the detection of two different target sequences (Fig. 5). Echolocation calls from the first target are equally distributed (non-grouped sequence; Fig. 5A) and the calls of the second target are grouped into triplets (grouped sequence; Fig. 5B). According to a median CI of 22 ms (red boxplot in Fig. 1F) and an average flight speed of 2.5 m/s28, a triplet carries distance information from ±11 cm. Thus, the call-echo elements, or functional units of a triplet, encode echo delays in the range of ±1 ms. If no functional unit gets masked, then the non-grouped sequence conveys higher spatial resolution (1 ms) than the grouped sequence (3 ms). However, if functional units get masked then two functional units from a triplet of the grouped sequence could get masked without losing much delay information. In contrast, the loss of delay information might be more dramatic if functional units get masked in the non-grouped sequence. Accordingly, the maximum distance that the bat must cover without delay information (maximum blind spot) is longer in the non-grouped than in the grouped sequence (in our example, 63 cm for the non-grouped and 51 cm for the grouped sequence). In summary, grouping the calls more tightly increases the discontinuity of echo information but it could increase the continuity of preserved delays (compare the continuity of preserved delays in (A) and (B)).

At the neuronal level, call groups and maskers increase the acoustic rate which could potentiate neuronal suppression26,45. Neuronal suppression is usually weaker at subcortical than at cortical areas21,25,55. This may explain, why the maskers tend to induce stronger delay tuning shifts in the AC than in the IC. In the AC, delay tuning was usually shifted towards longer delays which could be based on two aspects: i) In noisy environments, the cortex is biased to process echo information from distant objects, like conspecifics. The latter is especially important for tracking flight trajectories of conspecifics for avoiding collisions. ii) The risk of interference is highest at long delays because the functional units span a longer duration than the ones at short delays (Fig. 5). Thus, delay tuning shifts towards long delays could reflect jamming effects.

The target representation in response to the mixture was computed by d′ values that have been used in previous studies42,43. Corresponding to the high d′ values in the AC, target detection in the mixture situation was higher in the AC than in the IC. This result could be explained by the differences of suppression strengths that affect the neuronal selectivity in both brain areas56. In contrast to collicular neurons that usually respond to each acoustic event of an echolocation sequence25, cortical neurons respond selectively to certain acoustic events26. The increased selectivity makes the AC less sensitive to the maskers which results in higher d′ values than in the IC (compare responses to colony masker in Fig. 2D). The fact that the increase of neuronal selectivity along the ascending auditory pathway could be correlated with an improvement of target detection in a noisy situation is supported by findings in frogs. The ability of target detection57 and the neuronal selectivity4,58 increases from the auditory nerve to the IC of frogs.

Note that during the neuronal recordings, the animals were passively listening to echolocation sequences. Attentional effects, binaural effects, directionality of the stimuli and of hearing could further facilitate signal processing under noisy conditions4,31,56,59,60. A rhythmic sampling of the environment that is induced by emitting call groups might be beneficial, because the bats could attend and predict the temporal pattern of echo arrival. Neuronal activity usually oscillates over time which is often associated with attentional phenomena61,62. If the rhythmic sampling of the environment occurs in phase with ongoing brain oscillations, then it might happen that the animal’s attention is high at the expected time points of echo perception61. The latter is conceptualized by the “temporal binding hypothesis”62,63 which states that stimulus features coming from one source like a single object are combined through a synchronized neuronal activity. Stimulus features that do not belong together are not temporally bound or synchronously processed. Thus, although interference signals might evoke neuronal activity, this activity is likely not associated with the ethologically relevant stimuli when the information is not synchronously processed.

In summary, the present results indicate that in noisy environments, C. perspicillata increases the discontinuity of echo information by emitting tightly grouped calls. Each functional unit of the group encodes a similar echo delay which makes some echoes expendable for the bat. Erroneously encoded distance information could get updated at neuronal level. At the neuronal level, moderate delay tuning shifts in the IC and high d′ values in the AC indicate that target information is still encoded in the presence of the masker. Thus, the JARs, found in C. perspicillata, could potentially facilitate neuronal processing of natural and ethologically relevant stimuli.

## Methods

### Animals

Behavioral experiments were conducted in eight (1 female, 7 males) bats of the species Carollia perspicillata and the electrophysiological experiments were conducted in 15 female bats (9 for auditory cortex and 6 for inferior colliculus). All bats were bred in a colony at the Institute for Cell Biology and Neuroscience (Frankfurt University). The experiments comply with all current German laws on animal experimentation and they are in accordance with the Declaration of Helsinki. All experimental protocols were approved by the Regierungspräsidium Darmstadt (experimental permit # F104/57).

### Behavior

Behavioral experiments were performed in a wooden flight chamber (Movie 13; Fig. 1A; length: 4 m; width: 1.4 m; height: 2 m). A wall, made from foam, separated the room into two sides, each side measuring 2.5 m × 0.65 m × 2 m. A landing platform (20 × 20 cm), made of metal mesh, was positioned in one of the walls in each side of the room. Behind each metal mesh, one speaker (Neo CD 1.0 Ribbon Tweeter; Fountek Eelctronics, China) and one ultrasound sensitive microphone (Avisoft Bioacoustics, Germany) were installed. The speaker could produce sounds above 90 dB SPL in the frequency range from 5–130 kHz (calibration curve obtained with a ¼-in. microphone (model 4135; Brüel & Kjaer) + a custom-made microphone amplifier). In the calibration curve, SPL decreased at a rate of 0.13 dB/kHz as sound frequency increased from 5 to 130 kHz. The microphones had a sensitivity of 50 mV/Pa and an input-referred self-noise level of 18 dB SPL. Their frequency response curves were flat (+/−3 dB, as specified by the manufacturer) in the range from 30–130 kHz. Each microphone was connected to a sound acquisition system (one microphone to an UltraSoundGate 116 Hm mobile recording interface and the second microphone to an UltraSoundGate 116 Hb mobile recording interface, + Recorder Software, Avisoft Bioacoustics, Germany) for sound digitalization at 333 kHz (16 bit precision).

During the training trials, call emissions of hand released bats were acoustically recorded with the microphones. The flight behavior was monitored with a webcam (500 SX, Manhattan, USA) placed above the starting point and with a frame rate of 30 Hz. During test trials, one speaker produced playback stimuli resulting in a potentially masking side. The speaker of the contralateral side remained silent (non-masking side). Masking and non-masking sides were randomly selected. The playback stimuli represented repetitions of a representative biosonar call that was recorded during a training trial from the tested animal. The biosonar call was repeated in groups of five (one animal), ten (four animals) or twenty calls (three animals) with within group call intervals of 15 ms and in between group intervals of 35 ms. The within group call intervals of 15 ms lies in the range of the minimum pulse intervals produced by C. perspicillata28. Behavioral data from bats that were stimulated with call groups consisting of five, ten, or twenty calls were not different and therefore were grouped together for data analysis. Acoustic stimuli were generated with a sampling rate of 384 kHz with an Exasound E18 sound card (ExaSound Audio Design, Canada), and sent to an audio amplifier (Rotel power amplifier, RB-850, USA). The stimuli were played with a sound pressure level of 80–90 dB re 20 µPascal (dB SPL).

The call emissions were recorded by the microphones and a segment of two seconds was analyzed from each trial. The segment was chosen such that the echolocation calls with the highest amplitude during the trial were considered. This reduced the risk of missing echolocation calls whose amplitude are too low to be recorded. The analysis segments from test trials were chosen from recordings conducted while the animals flew at the masking side. In total, 48 segments from eight animals were analyzed. Twenty-four segments (3 per animal) were recorded during the training and another 24 were recorded during the test trials. Call emission patterns during test and training trials were compared pairwise, meaning that three pairs of “test” and “training” trials were compared for each animal.

For data analysis, the time points of call emissions were manually tagged in the software Avisoft SAS Lab Pro (Avisoft Bioacoustics, Germany). These time points were later used for the remaining analysis done in Matlab 2014 (MathWorks, USA). Call groups were defined according to the criterions formulated by30. First an “island criterion” defines temporally isolated call groups. The island criterion is fulfilled when the preceding and following call intervals of a call group are 20% longer than the call intervals within call groups. If the island criterion is fulfilled a second criterion, the so called “stability criterion”, defines the size of the call groups indicated by the number of calls belonging to a group. The stability criterion is fulfilled if the call intervals within call groups are stable with a 5% tolerance. Note that doublets, i.e. call groups containing two calls, can only be defined according to the island criterion. For determining triplets, quartets, quintets or sextets, both criteria had to be fulfilled. The behavioral analysis considered doublets to sextets, because sextets represent the longest call groups reported in C. perspicillata27.

### Stimuli for electrophysiology

To study neuronal responses, we used two types of “maskers” and one “target” stimuli. The target was an echolocation sequence recorded from a pendulum flight simulating an approach flight26. In a previous article, we had already shown that the target evoked reliable neuronal responses in high-frequency tuned neurons of the IC and AC of C. perspicillata25,26. Note that in the target, the echo delays decreased from 23 ms at the first, to 1 ms at the last call-echo element. The recording and preparation of the echolocation sequence was explained in detail in a recent study26. Briefly, the acoustic signals were recorded using a pendulum paradigm64, in which the bat was positioned in a pendulum mass and it was swung towards an acrylic glass wall26. During the swing, the bat emitted sequences of echolocation calls that were recorded, together with their echoes, with the aid of an ultrasound sensitive microphone attached to the pendulum (Avisoft Bioacoustics, Germany). The SPL of the acoustic events of the target echolocation sequence varied between 36–77 dB SPL.

In comparison to the single animal masker, the colony masker contained more acoustic events, including calls, echoes, and communication calls from a colony of C. perspicillata with 150 animals. The colony masker was recorded with an ultrasound sensitive microphone that was held for 30 seconds inside of the colony room of the facility. A segment of 1.34 seconds of the recording was used as colony masker. The masker contained more than 200 acoustic events that partially overlapped in time and the SPL of the acoustic events ranged from 43 to 81 dB (median = 63 dB SPL). Due to temporal overlap, it was impossible to measure the exact number of acoustic events in the colony masker. The colony masker reflects a natural acoustic environment that C. perspicillata has to face in the roosts. The “single animal mixture” and the “colony mixture” represent stimuli that were obtained by adding the target stimulus to the single animal and colony maskers, respectively. Stimuli were presented fifteen times.

### Electrophysiological recordings

For cortical recordings, two electrode types were used. (i) Commercially available micro-electrode arrays with 16 recording electrodes organized in 2 × 8 (MicroTargets for Life Science, USA). A reference electrode with an impedance of 10 kΩ was placed adjacent to the recording electrodes. Reference and recording electrodes were made out of tungsten whereas a silver wire placed on the cortical surface of the frontal cortex was used as ground. Each recording electrode had an impedance of 2 MΩ (as reported by the manufacturer). The arrays had an electrode and row spacing of 250 µm. (ii) Custom-built glass electrode arrays of up to 8 channels organized in a single row. Glass electrodes (resistance 1–10 MΩ when filled with 3 mol/L KCl) were pulled from borosilicate capillaries (GB120F-10, Science Products, Germany) with a Flaming/Brown horizontal puller (P97, Sutter, USA) and they were glued together in a fan-shape pattern, ensuring an electrode tip spacing of 250 µm. For IC recordings, single glass electrodes were used with the same specifications as for the glass electrodes of the custom-built glass electrode arrays.

A wireless multichannel recording system (Multi Channel Systems MCS GmbH, Germany) was used for data acquisition at a sampling rate of 20 kHz per channel and 16 bit precision. Action potentials were filtered using a 2nd order Butterworth band-pass filter, with cutoffs between 300–3000 Hz.

### Spike-Data Analysis

Spike detection was based on spike amplitude relative to recording noise level. The spikes were sorted based on the first three principal components of the spike waveforms and they were clustered automatically using the “KlustaKwik” algorithm65. Only the cluster containing the largest amount of spikes was used for analysis.

Neuronal data from the AC comprised 72 spike-sorted single-units that were recorded from twelve cortical maps. Between three and twelve units were recorded simultaneously (median = 6). Twenty units were recorded with the commercially available micro electrode arrays from Microprobes and 52 units were recorded with the custom-made glass electrode arrays. Neuronal data from the IC comprised 49 spike-sorted single-units that were sensitive to high frequencies (>40 kHz).

Data analysis was based on post-stimulus time histograms (PSTHs) constructed with a binsize of 5 ms and 2 ms for cortical and collicular data, respectively. Different binsizes between collicular and cortical units were used because collicular neurons fire temporally more precise than cortical ones19. The initial 150 ms of the cortical response were not considered because of strong stimulus independent onset responses26. Delay tuning to the target stimulus was assessed by assigning each spike, according to its occurrence, to a specific echo delay of the echolocation sequence used as stimulus. The assignment of each spike to a specific delay allowed us to reconstruct delay tuning curves. The best delay is defined based on the call-echo element eliciting the strongest response i.e. the largest number of spikes. The median delay was calculated by measuring the median time of occurrence of the evoked spikes. The median time point was then assigned to the preceding call-echo element whose echo delay represents the median delay. In comparison to the best delay, which reflects the maximum response only, the median delay calculation considers each elicited spike. A response to certain echo-delays was considered to occur if the neuronal response to the target sequence was at least as strong as 50% of the maximum response observed (i.e. the response strength at the best delay).

For quantifying the preservation of the response to the target in the mixture situation (target detection), we calculated unit specific cumulative d′ values (Fig. S3). First we determined median d′ values from the response window to each call-echo element of the target.

$${d^{\prime} }_{i}^{2}=\frac{{(mi{x}_{i}-{m}_{i})}^{2}}{0.5\,[\sigma {(mi{x}_{i})}^{2}+\sigma {({m}_{i})}^{2}]\,}$$
(1)

In the equation (1), mix i and m i represent the median spike rate in response to the call-echo element in the mixture and masker situation, respectively. Thus, high spike rate differences between the response to the mixture and masker result in high d′ values. These differences are due to the presence of the target in the mixture and may either arise from responses to the target or from an integration of the target and masker stimulus. A neuronal response arising from stimulus integration represents a neuronal correlate of jamming. To determine the target detection in the mixture situation it is necessary to exclude high d′ values arising from jamming. Therefore, we calculated a cumulative d’ for each unit by considering only d′ values from call-echo elements that elicited a neuronal response in response to the target stimulus i. Cumulative d′ values were calculated according to the following equation (2):

$$cumulative\,d^{\prime} =\sqrt{{\sum }_{i}{d^{\prime} }_{i}^{2}}$$
(2)

The higher the neuron’s cumulative d’, the higher is its target detection in the mixture situation. The quantification of target detection based on d’ have been successfully performed in previous studies42,43. Data analysis was done in Matlab 2014 and statistics in GraphPad Prism 5 (GraphPad Software, USA; *p < 0.05; **p < 0.01; ***p < 0.0001).

## References

1. 1.

Lewicki, M. S., Olshausen, B. A., Surlykke, A. & Moss, C. F. Scene analysis in the natural environment. Front Psychol 5, https://doi.org/10.3389/fpsyg.2014.00199 (2014).

2. 2.

Theunissen, F. E. & Elie, J. E. Neural processing of natural sounds. Nat Rev Neurosci 15, 355–366, https://doi.org/10.1038/nrn3731 (2014).

3. 3.

Ulanovsky, N. & Moss, C. F. What the bat’s voice tells the bat’s brain. Proc Natl Acad Sci USA 105, 8491–8498, https://doi.org/10.1073/pnas.0703550105 (2008).

4. 4.

Feng, A. S. & Ratnam, R. Neural basis of hearing in real-world situations. Annu Rev Psychol 51, 699–725, https://doi.org/10.1146/annurev.psych.51.1.699 (2000).

5. 5.

Bradbury, J. W. & Vehrencamp, S. L. Principles of animal communication. Vol. 2 pp 81–83 (Sinauer Associates, Inc.; USA 2011).

6. 6.

Watanabe, A. & Takeda, K. The Change of Discharge Frequency by A.C. Stimulus in a Weak Electric Fish. J Exp Biol 40, 57–66 (1963).

7. 7.

Zelick, R. Jamming Avoidance in Electric Fish and Frogs - Strategies of Signal Oscillator Timing. Brain Behav Evolut 28, 60–69, https://doi.org/10.1159/000118692 (1986).

8. 8.

Brumm, H. & Zollinger, S. A. The evolution of the Lombard effect: 100 years of psychoacoustic research. Behaviour 148, 1173–1198, https://doi.org/10.1163/000579511605759 (2011).

9. 9.

Corcoran, A. J. & Moss, C. F. Sensing in a noisy world: lessons from auditory specialists, echolocating bats. J Exp Biol 220, 4554–4566, https://doi.org/10.1242/jeb.163063 (2017).

10. 10.

Simmons, J. A. Bats use a neuronally implemented computational acoustic model to form sonar images. Curr Opin Neurobiol 22, 311–319, https://doi.org/10.1016/j.conb.2012.02.007 (2012).

11. 11.

Kössl, M. et al. Neural maps for target range in the auditory cortex of echolocating bats. Curr Opin Neurobiol 24, 68–75, https://doi.org/10.1016/j.conb.2013.08.016 (2014).

12. 12.

Moss, C. F. & Surlykke, A. Probing the natural scene by echolocation in bats. Front Behav Neurosci 4, https://doi.org/10.3389/fnbeh.2010.00033 (2010).

13. 13.

Neuweiler, G. Auditory Adaptations for Prey Capture in Echolocating Bats. Physiol Rev 70, 615–641 (1990).

14. 14.

Feng, A. S., Simmons, J. A. & Kick, S. A. Echo Detection and Target-Ranging Neurons in Auditory-System of Bat Eptesicus-Fuscus. Science 202, 645–648, https://doi.org/10.1126/science.705350 (1978).

15. 15.

Grinnell, A. D. Neurophysiology of Audition in Bats - Temporal Parameters. J Physiol-London 167, 67–96 (1963).

16. 16.

Dear, S. P. & Suga, N. Delay-Tuned Neurons in the Midbrain of the Big Brown Bat. J Neurophysiol 73, 1084–1100 (1995).

17. 17.

O’Neill, W. E. & Suga, N. Target Range Sensitive Neurons in Auditory-Cortex of Mustache Bat. Science 203, 69–73, https://doi.org/10.1126/science.758681 (1979).

18. 18.

Hechavarría, J. C. et al. Blurry topography for precise target-distance computations in the auditory cortex of echolocating bats. Nat Commun 4, https://doi.org/10.1038/ncomms3587 (2013).

19. 19.

Macías, S., Hechavarría, J. C. & Kössl, M. Temporal encoding precision of bat auditory neurons tuned to target distance deteriorates on the way to the cortex. J Comp Physiol A 202, 195–202, https://doi.org/10.1007/s00359-016-1067-2 (2016).

20. 20.

Wenstrup, J. J. & Portfors, C. V. Neural processing of target distance by echolocating bats: Functional roles of the auditory midbrain. Neurosci Biobehav R 35, 2073–2083, https://doi.org/10.1016/j.neubiorev.2010.12.015 (2011).

21. 21.

Wehr, M. & Zador, A. M. Synaptic mechanisms of forward suppression in rat auditory cortex. Neuron 47, 437–445, https://doi.org/10.1016/j.neuron.2005.06.009 (2005).

22. 22.

Wang, X., Lu, T., Bendor, D. & Bartlett, E. Neural coding of temporal information in auditory thalamus and cortex. Neuroscience 157, 483–493, https://doi.org/10.1016/j.neuroscience.2008.06.041 (2008).

23. 23.

Joris, P. X., Schreiner, C. E. & Rees, A. Neural processing of amplitude-modulated sounds. Physiol Rev 84, 541–577, https://doi.org/10.1152/physrev.00029.2003 (2004).

24. 24.

Hechavarría, J. C., Beetz, M. J., Macías, S. & Kössl, M. Vocal sequences suppress spiking in the bat auditory cortex while evoking concomitant steady-state local field potentials. Sci Rep 6, 39226, https://doi.org/10.1038/srep39226 (2016).

25. 25.

Beetz, M. J., Kordes, S., García-Rosales, F., Kössl, M. & Hechavarría, J. C. Processing of Natural Echolocation Sequences in the Inferior Colliculus of Seba’s Fruit Eating Bat, Carollia perspicillata. eNeuro 4, https://doi.org/10.1523/ENEURO.0314-17.2017 (2017).

26. 26.

Beetz, M. J., Hechavarría, J. C. & Kössl, M. Temporal tuning in the bat auditory cortex is sharper when studied with natural echolocation sequences. Sci Rep-Uk 6, https://doi.org/10.1038/srep29102 (2016).

27. 27.

Brinklov, S., Jakobsen, L., Ratcliffe, J. M., Kalko, E. K. V. & Surlykke, A. Echolocation call intensity and directionality in flying short-tailed fruit bats, Carollia perspicillata (Phyllostomidae). J Acoust Soc Am 129, 427–435, https://doi.org/10.1121/1.3519396 (2011).

28. 28.

Thies, W., Kalko, E. K. V. & Schnitzler, H. U. The roles of echolocation and olfaction in two Neotropical fruit-eating bats, Carollia perspicillata and C-castanea, feeding on Piper. Behav Ecol Sociobiol 42, 397–409, https://doi.org/10.1007/s002650050454 (1998).

29. 29.

Luo, J. H., Goerlitz, H. R., Brumm, H. & Wiegrebe, L. Linking the sender to the receiver: vocal adjustments by bats to maintain signal detection in noise. Sci Rep-Uk 5, https://doi.org/10.1038/srep18556 (2015).

30. 30.

Kothari, N. B., Wohlgemuth, M. J., Hulgard, K., Surlykke, A. & Moss, C. F. Timing matters: sonar call groups facilitate target localization in bats. Front Physiol 5, https://doi.org/10.3389/fphys.2014.00168 (2014).

31. 31.

Wohlgemuth, M. J., Kothari, N. B. & Moss, C. F. Action Enhances Acoustic Cues for 3-D Target Localization by Echolocating Bats. Plos Biol 14, https://doi.org/10.1371/journal.pbio.1002544 (2016).

32. 32.

Wheeler, A. R. et al. Echolocating Big Brown Bats, Eptesicus fuscus, Modulate Pulse Intervals to Overcome Range Ambiguity in Cluttered Surroundings. Front Behav Neurosci 10, https://doi.org/10.3389/fnbeh.2016.00125 (2016).

33. 33.

Moss, C. F., Bohn, K., Gilkenson, H. & Surlykke, A. Active listening for spatial orientation in a complex auditory scene. Plos Biol 4, 615–626, https://doi.org/10.1371/journal.pbio.0040079 (2006).

34. 34.

Falk, B., Jakobsen, L., Surlykke, A. & Moss, C. F. Bats coordinate sonar and flight behavior as they forage in open and cluttered environments. J Exp Biol 217, 4356–4364, https://doi.org/10.1242/jeb.114132 (2014).

35. 35.

Petrites, A. E., Eng, O. S., Mowlds, D. S., Simmons, J. A. & DeLong, C. M. Interpulse interval modulation by echolocating big brown bats (Eptesicus fuscus) in different densities of obstacle clutter. J Comp Physiol A 195, 603–617, https://doi.org/10.1007/s00359-009-0435-6 (2009).

36. 36.

Sändig, S., Schnitzler, H. U. & Denzinger, A. Echolocation behaviour of the big brown bat (Eptesicus fuscus) in an obstacle avoidance task of increasing difficulty. J Exp Biol 217, 2876–2884, https://doi.org/10.1242/jeb.099614 (2014).

37. 37.

Hiryu, S., Bates, M. E., Simmons, J. A. & Riquimaroux, H. FM echolocating bats shift frequencies to avoid broadcast-echo ambiguity in clutter. Proc Natl Acad Sci USA 107, 7048–7053, https://doi.org/10.1073/pnas.1000429107 (2010).

38. 38.

Grinnell, A. D. & Griffin, D. R. The Sensitivity of Echolocation in Bats. Biol Bull 114, 10–22, https://doi.org/10.2307/1538961 (1958).

39. 39.

Adams, A. M., Davis, K. & Smotherman, M. Suppression of emission rates improves sonar performance by flying bats. Sci Rep-Uk 7, https://doi.org/10.1038/srep41641 (2017).

40. 40.

Roverud, R. C. & Grinnell, A. D. Discrimination Performance and Echolocation Signal Integration Requirements for Target Detection and Distance Determination in the CF FM Bat. Noctilio-Albiventris. J Comp Physiol A 156, 447–456, https://doi.org/10.1007/Bf00613969 (1985).

41. 41.

Martin, L. M., García-Rosales, F., Beetz, M. J. & Hechavarría, J. C. Processing of temporally patterned sounds in the auditory cortex of Seba’s short-tailed bat,Carollia perspicillata. Eur J Neurosci 46, 2365–2379, https://doi.org/10.1111/ejn.13702 (2017).

42. 42.

Pressnitzer, D., Meddis, R., Delahaye, R. & Winter, I. M. Physiological correlates of comodulation masking release in the mammalian ventral cochlear nucleus. J Neurosci 21, 6377–6386 (2001).

43. 43.

Mott, J. B., Mcdonald, L. P. & Sinex, D. G. Neural Correlates of Psychophysical Release from Masking. J Acoust Soc Am 88, 2682–2691, https://doi.org/10.1121/1.399987 (1990).

44. 44.

Hagemann, C., Esser, K. H. & Kössl, M. Chronotopically Organized Target-Distance Map in the Auditory Cortex of the Short-Tailed Fruit Bat. J Neurophysiol 103, 322–333, https://doi.org/10.1152/jn.00595.2009 (2010).

45. 45.

Beetz, M. J., Hechavarría, J. C. & Kössl, M. Cortical neurons of bats respond best to echoes from nearest targets when listening to natural biosonar multi-echo streams. Sci Rep-Uk 6, https://doi.org/10.1038/srep35991 (2016).

46. 46.

Cade, W. H. & Otte, D. Alternation Calling and Spacing Patterns in the Field Cricket Acanthogryllus-Fortipes (Orthoptera, Gryllidae). Can J Zool 60, 2916–2920, https://doi.org/10.1139/z82-368 (1982).

47. 47.

Zelick, R. D. & Narins, P. M. Intensity Discrimination and the Precision of Call Timing in 2 Species of Neotropical Treefrogs. J Comp Physiol 153, 403–412 (1983).

48. 48.

Littlejohn, M. J. & Martin, A. A. Acoustic Interaction between 2 Species of Leptodactylid Frogs. Anim Behav 17, 785–791, https://doi.org/10.1016/S0003-3472(69)80027-8 (1969).

49. 49.

Heiligenberg, W., Baker, C. & Bastian, J. Jamming Avoidance-Response in Gymnotoid Pulse-Species - Mechanism to Minimize Probability of Pulse-Train Coincidence. J Comp Physiol 124, 211–224 (1978).

50. 50.

Brumm, H. Signalling through acoustic windows: nightingales avoid interspecific competition by short-term adjustment of song timing. J Comp Physiol A 192, 1279–1285, https://doi.org/10.1007/s00359-006-0158-x (2006).

51. 51.

Ficken, R. W., Ficken, M. S. & Hailman, J. P. Temporal Pattern Shifts to Avoid Acoustic Interference in Singing Birds. Science 183, 762–763, https://doi.org/10.1126/science.183.4126.762 (1974).

52. 52.

Takahashi, E. et al. Adaptive changes in echolocation sounds by Pipistrellus abramus in response to artificial jamming sounds. J Exp Biol 217, 2885–2891, https://doi.org/10.1242/jeb.101139 (2014).

53. 53.

Lin, Y., Abaid, N. & Muller, R. Bats adjust their pulse emission rates with swarm size in the field. J Acoust Soc Am 140, 4318–4325, https://doi.org/10.1121/1.4971331 (2016).

54. 54.

Egnor, S. E. R., Wickelgren, J. G. & Hauser, M. D. Tracking silence: adjusting vocal production to avoid acoustic interference. J Comp Physiol A 193, 477–483, https://doi.org/10.1007/s00359-006-0205-7 (2007).

55. 55.

Creutzfeldt, O., Hellweg, F. C. & Schreiner, C. Thalamocortical Transformation of Responses to Complex Auditory-Stimuli. Exp Brain Res 39, 87–104 (1980).

56. 56.

Bee, M. A. & Micheyl, C. The cocktail party problem: What is it? How can it be solved? And why should animal behaviorists study it? J Comp Psychol 122, 235–251, https://doi.org/10.1037/0735-7036.122.3.235 (2008).

57. 57.

Lin, W. Y. & Feng, A. S. Free-field unmasking response characteristics of frog auditory nerve fibers: comparison with the responses of midbrain auditory neurons. J Comp Physiol A 187, 699–712, https://doi.org/10.1007/s00359-001-0241-2 (2001).

58. 58.

Feng, A. S., Hall, J. C. & Gooler, D. M. Neural Basis of Sound Pattern-Recognition in Anurans. Prog Neurobiol 34, 313–329, https://doi.org/10.1016/0301-0082(90)90008-5 (1990).

59. 59.

Maddox, R. K., Billimoria, C. P., Perrone, B. P., Shinn-Cunningham, B. G. & Sen, K. Competing Sound Sources Reveal Spatial Effects in Cortical Processing. Plos Biol 10, https://doi.org/10.1371/journal.pbio.1001319 (2012).

60. 60.

Suga, N., O’Neill, W. E., Kujirai, K. & Manabe, T. Specificity of Combination-Sensitive Neurons for Processing of Complex Biosonar Signals in Auditory-Cortex of the Mustached Bat. J Neurophysiol 49, 1573–1626 (1983).

61. 61.

Schroeder, C. E., Wilson, D. A., Radman, T., Scharfman, H. & Lakatos, P. Dynamics of Active Sensing and perceptual selection. Curr Opin Neurobiol 20, 172–176, https://doi.org/10.1016/j.conb.2010.02.010 (2010).

62. 62.

Simmons, J. A. Temporal binding of neural responses for focused attention in biosonar. J Exp Biol 217, 2834–2843, https://doi.org/10.1242/jeb.104380 (2014).

63. 63.

Engel, A. K. & Singer, W. Temporal binding and the neural correlates of sensory awareness. Trends in Cognitive Sciences 5, 16–25, https://doi.org/10.1016/S1364-6613(00)01568-0 (2001).

64. 64.

Henson, O. W., Pollak, G. D., Kobler, J. B., Henson, M. M. & Goldman, L. J. Cochlear Microphonic Potentials Elicited by Biosonar Signals in Flying Bats, Pteronotus-P-Parnellii. Hearing Res 7, 127–147, https://doi.org/10.1016/0378-5955(82)90010-7 (1982).

65. 65.

Lewicki, M. S. A review of methods for spike sorting: the detection and classification of neural action potentials. Network-Comp Neural 9, R53–R78, https://doi.org/10.1088/0954-898/9/4/001 (1998).

## Acknowledgements

The authors thank Sebastian Kordes for his help in recordings from the inferior colliculus. This research was funded by the DFG.

## Author information

Authors

### Contributions

M.J.B. performed all experiments, analyzed the data, and wrote the manuscript. M.J.B., F.G.R., M.K., and J.C.H. conceived and designed the study. All authors discussed the results, commented on the manuscript. All authors gave final approval for publication.

### Corresponding author

Correspondence to M. Jerome Beetz.

## Ethics declarations

### Competing Interests

The authors declare no competing interests.

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Beetz, M.J., García-Rosales, F., Kössl, M. et al. Robustness of cortical and subcortical processing in the presence of natural masking sounds. Sci Rep 8, 6863 (2018). https://doi.org/10.1038/s41598-018-25241-x

• Accepted:

• Published:

• ### The frugivorous bat Carollia perspicillata dynamically changes echolocation parameters in response to acoustic playback

• M. Jerome Beetz
• , Manfred Kössl
•  & Julio C. Hechavarría

Journal of Experimental Biology (2021)

• ### The effect of input noises on the activity of auditory neurons using GLM-based metrics *

• Maryam Hosseini
• , Gerardo Rodriguez
• , Hongsun Guo
• , Hubert H Lim
•  & Éric Plourde

Journal of Neural Engineering (2021)

• ### The second harmonic neurons in auditory midbrain of Hipposideros pratti are more tolerant to background white noise

• Zhongdan Cui
• , Guimin Zhang
• , Dandan Zhou
• , Jing Wu
• , Long Liu
• , Jia Tang
• , Qicai Chen
•  & Ziying Fu

Hearing Research (2021)

• ### Neural oscillations in the fronto-striatal network predict vocal output in bats

• Kristin Weineck
• , Francisco García-Rosales
• , Julio C. Hechavarría
•  & Manuel S. Malmierca

PLOS Biology (2020)

• ### Adaptations in the call emission pattern of frugivorous bats when orienting under challenging conditions

• M. Jerome Beetz
• , Manfred Kössl
•  & Julio C. Hechavarría

Journal of Comparative Physiology A (2019)