Abstract
Vocal communicators discriminate conspecific vocalizations from other sounds and recognize the vocalizations of individuals. To identify neural mechanisms for the discrimination of such natural sounds, we compared the linear spectro-temporal tuning properties of auditory midbrain and forebrain neurons in zebra finches with the statistics of natural sounds, including song. Here, we demonstrate that ensembles of auditory neurons are tuned to auditory features that enhance the acoustic differences between classes of natural sounds, and among the songs of individual birds. Tuning specifically avoids the spectro-temporal modulations that are redundant across natural sounds and therefore provide little information; rather, it overlaps with the temporal modulations that differ most across sounds. By comparing the real tuning and a less selective model of spectro-temporal tuning, we found that the real modulation tuning increases the neural discrimination of different sounds. Additionally, auditory neurons discriminate among zebra finch song segments better than among synthetic sound segments.
This is a preview of subscription content, access via your institution
Relevant articles
Open Access articles citing this article.
-
Machine learning and statistical classification of birdsong link vocal acoustic features with phylogeny
Scientific Reports Open Access 01 May 2023
-
Tracing development of song memory with fMRI in zebra finches after a second tutoring experience
Communications Biology Open Access 30 March 2023
-
Stimulus dependent transformations between synaptic and spiking receptive fields in auditory cortex
Nature Communications Open Access 27 February 2020
Access options
Subscribe to this journal
Receive 12 print issues and online access
$189.00 per year
only $15.75 per issue
Rent or buy this article
Get just this article for as long as you need it
$39.95
Prices may be subject to local taxes which are calculated during checkout






References
Kuhl, P.K. & Meltzoff, A.N. The bimodal perception of speech in infancy. Science 218, 1138–1141 (1982).
Marler, P. & Peters, S. Selective vocal learning in a sparrow. Science 198, 519–521 (1977).
Braaten, R.F. & Reynolds, K. Auditory preference for conspecific song in isolation-reared zebra finches. Anim. Behav. 58, 105–111 (1999).
Doupe, A.J. & Kuhl, P.K. Birdsong and human speech: common themes and mechanisms. Annu. Rev. Neurosci. 22, 567–631 (1999).
Suga, N. Functional properties of auditory neurones in the cortex of echo-locating bats. J. Physiol. (Lond.) 181, 671–700 (1965).
Pollak, G.D. & Bodenhamer, B.D. Specialized characteristics of single units in inferior colliculus of mustache bat: frequency representation, tuning, and discharge patterns. J. Neurophysiol 46, 605–620 (1981).
Rose, G. & Capranica, R.R. Temporal selectivity in the central auditory system of the leopard frog. Science 219, 1087–1089 (1983).
Singh, N.C. & Theunissen, F.E. Modulation spectra of natural sounds and ethological theories of auditory processing. J. Acoust. Soc. Am. 114, 3394–3411 (2003).
Calhoun, B.M. & Schreiner, C.E. Spectral envelope coding in cat primary auditory cortex: linear and non-linear effects of stimulus characteristics. Eur. J. Neurosci. 10, 926–940 (1998).
Eggermont, J.J. Temporal modulation transfer functions in cat primary auditory cortex: separating stimulus effects from neural mechanisms. J. Neurophysiol. 87, 305–321 (2002).
Creutzfeldt, O., Hellweg, F.C. & Schreiner, C. Thalamocortical transformation of responses to complex auditory stimuli. Exp. Brain Res. 39, 87–104 (1980).
Narins, P.M. & Capranica, R.R. Neural adaptations for processing the two-note call of the Puerto Rican treefrog, Eleutherodactylus coqui. Brain Behav. Evol. 17, 48–66 (1980).
Woolley, S.M. & Rubel, E.W. High-frequency auditory feedback is not required for adult song maintenance in Bengalese finches. J. Neurosci. 19, 358–371 (1999).
Escabi, M.A. & Schreiner, C.E. Nonlinear spectrotemporal sound analysis by neurons in the auditory midbrain. J. Neurosci. 22, 4114–4131 (2002).
Shannon, R.V., Zeng, F.G., Kamath, V., Wygonski, J. & Ekelid, M. Speech recognition with primarily temporal cues. Science 270, 303–304 (1995).
Chi, T., Gao, Y., Guyton, M.C., Ru, P. & Shamma, S. Spectro-temporal modulation transfer functions and speech intelligibility. J. Acoust. Soc. Am. 106, 2719–2732 (1999).
Gentner, T.Q. & Margoliash, D. Neuronal populations and single cells representing learned auditory objects. Nature 424, 669–674 (2003).
Theunissen, F.E., Sen, K. & Doupe, A.J. Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural stimuli. J. Neurosci. 20, 2315–2331 (2000).
Theunissen, F.E. et al. Estimating spatio-temporal receptive fields of auditory and visual neurons from their responses to natural stimuli. Network 12, 289–316 (2001).
Simoncelli, E.P. & Olshausen, B.A. Natural image statistics and neural representation. Annu. Rev. Neurosci. 24, 1193–1216 (2001).
Hsu, A., Woolley, S.M.N., Fremouw, T.E. & Theunissen, F.E. Modulation power and phase spectrum of natural sounds enhance neural encoding performed by single auditory neurons. J. Neurosci. 24, 9201–9211 (2004).
Assmann, P.F. Tracking and glimpsing speech in noise: Role of fundamental frequency. J. Acoust. Soc. Am. 100, 2680 (1996).
Miller, L.M., Escabi, M.A., Read, H.L. & Schreiner, C.E. Spectrotemporal receptive fields in the lemniscal auditory thalamus and cortex. J. Neurophysiol. 87, 516–527 (2002).
Lu, T., Liang, L. & Wang, X. Temporal and rate representations of time-varying signals in the auditory cortex of awake primates. Nat. Neurosci. 4, 1131–1138 (2001).
Lu, T. & Wang, X. Information content of auditory cortical responses to time-varying acoustic stimuli. J. Neurophysiol. 91, 301–313 (2004).
Woolley, S.M. & Casseday, J.H. Response properties of single neurons in the zebra finch auditory midbrain: response patterns, frequency coding, intensity coding, and spike latencies. J. Neurophysiol. 91, 136–151 (2004).
Woolley, S.M. & Casseday, J.H. Processing of modulated sounds in the zebra finch auditory midbrain: responses to noise, frequency sweeps and sinusoidal amplitude modulations. J. Neurophysiol. 94, 1143–1157 (2005).
Grace, J.A., Amin, N., Singh, N.C. & Theunissen, F.E. Selectivity for conspecific song in the zebra finch auditory forebrain. J. Neurophysiol. 89, 472–487 (2003).
Gehr, D.D., Capsius, B., Grabner, P., Gahr, M. & Leppelsack, H.J. Functional organisation of the field-L-complex of adult male zebra finches. Neuroreport 10, 375–380 (1999).
Sen, K., Theunissen, F.E. & Doupe, A.J. Feature analysis of natural sounds in the songbird auditory forebrain. J. Neurophysiol. 86, 1445–1458 (2001).
Ringach, D.L. Mapping receptive fields in primary visual cortex. J. Physiol. (Lond.) 558, 717–728 (2004).
Suga, N., Xiao, Z., Ma, X. & Ji, W. Plasticity and corticofugal modulation for hearing in adult animals. Neuron 36, 9–18 (2002).
Machens, C.K. et al. Representation of acoustic communication signals by insect auditory receptor neurons. J. Neurosci. 21, 3215–3227 (2001).
Rieke, F., Bodnar, D.A. & Bialek, W. Naturalistic stimuli increase the rate and efficiency of information transmission by primary auditory afferents. Proc. Biol. Sci. 262, 259–265 (1995).
Klein, D.J., Depireux, D.A., Simon, J.Z. & Shamma, S.A. Robust spectrotemporal reverse correlation for the auditory system: optimizing stimulus design. J. Comput. Neurosci. 9, 85–111 (2000).
Tyler, R.S., Preece, J.P. & Tye-Murray, K. in Department of Otolaryngology (University of Iowa, Iowa City, Iowa, 1990).
Lewicki, M.S. Efficient coding of natural sounds. Nat. Neurosci. 5, 356–363 (2002).
Hsu, A., Borst, A. & Theunissen, F.E. Quantifying variability in neural responses and its application for the validation of model predictions. Network 15, 91–109 (2004).
Acknowledgements
We thank P. Gill for discussion and assistance in the data analysis. We thank J. Mazer for insightful comments on an earlier version of this manuscript. This work was supported by US National Institute of Deafness and Communication Disorders grants to S.M.N.W. and F.E.T. and US National Institute of Mental Health grants to F.E.T. and T.E.F.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Fig. 1
Contribution of the non-linear response to the neurogram distance. (PDF 152 kb)
Supplementary Fig. 2
Correlation between spectrogram distance and neurogram distance. (PDF 154 kb)
Rights and permissions
About this article
Cite this article
Woolley, S., Fremouw, T., Hsu, A. et al. Tuning for spectro-temporal modulations as a mechanism for auditory discrimination of natural sounds. Nat Neurosci 8, 1371–1379 (2005). https://doi.org/10.1038/nn1536
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nn1536
This article is cited by
-
Tracing development of song memory with fMRI in zebra finches after a second tutoring experience
Communications Biology (2023)
-
Machine learning and statistical classification of birdsong link vocal acoustic features with phylogeny
Scientific Reports (2023)
-
Speech signal analysis of alzheimer’s diseases in farsi using auditory model system
Cognitive Neurodynamics (2021)
-
Stimulus dependent transformations between synaptic and spiking receptive fields in auditory cortex
Nature Communications (2020)
-
Spectrotemporal modulation provides a unifying framework for auditory cortical asymmetries
Nature Human Behaviour (2019)