Evolving perspectives on the sources of the frequency-following response

Coffey, Emily B. J.; Nicol, Trent; White-Schwoch, Travis; Chandrasekaran, Bharath; Krizman, Jennifer; Skoe, Erika; Zatorre, Robert J.; Kraus, Nina

doi:10.1038/s41467-019-13003-w

Download PDF

Perspective
Open access
Published: 06 November 2019

Evolving perspectives on the sources of the frequency-following response

Nature Communications volume 10, Article number: 5036 (2019) Cite this article

18k Accesses
109 Citations
36 Altmetric
Metrics details

Subjects

Abstract

The auditory frequency-following response (FFR) is a non-invasive index of the fidelity of sound encoding in the brain, and is used to study the integrity, plasticity, and behavioral relevance of the neural encoding of sound. In this Perspective, we review recent evidence suggesting that, in humans, the FFR arises from multiple cortical and subcortical sources, not just subcortically as previously believed, and we illustrate how the FFR to complex sounds can enhance the wider field of auditory neuroscience. Far from being of use only to study basic auditory processes, the FFR is an uncommonly multifaceted response yielding a wealth of information, with much yet to be tapped.

On the generalization of tones: A detailed exploration of non-speech auditory perception stimuli

Article Open access 12 June 2020

Subcortical responses to music and speech are alike while cortical responses diverge

Article Open access 08 January 2024

Sound level context modulates neural activity in the human brainstem

Article Open access 19 November 2021

Introduction

The auditory system must faithfully encode and process rapid variations in acoustic signals and precisely extract important features, such as frequency, amplitude modulation, and sound onsets and offsets. This task is accomplished by a complex, interconnected, and parallel system. Auditory information enters the brainstem from the cochlea via the auditory nerve and ascends via both lemniscal and nonlemniscal auditory pathways¹. Neurons in the lemniscal (or “primary/classical”) pathway are thought to be the main bearers of temporally varying information, with synapses in the brainstem (cochlear nucleus and superior olivary complex), midbrain (central nucleus of the inferior colliculus), thalamus (ventral division of the medial geniculate body), and the primary auditory cortex. The fidelity of sound encoding in these ascending pathways affects all cognitive processes that use the information—and in turn, these ascending pathways are affected by cognitive processes via the vast efferent system. Consequently, sound encoding is relevant to the study of many higher-level functions central to human communication, including speech and music.

Frequency-following responses (FFRs) are recordings of phase-locked neural activity that is synchronized to periodic and transient aspects of sound. Traditionally, FFRs have been measured in humans as electrophysiological potentials to sound, recorded from the scalp. For guidance on collecting FFRs, see Skoe and Kraus for a tutorial in EEG-FFR collection², Krizman and Kraus for a tutorial on EEG-FFR analysis³, and Coffey et al. for technical details on the MEG-FFR⁴ (see Box 1 for key points).

Human FFRs were first measured in the 1970s⁵. Identified as subcortical in origin, they were viewed as a potential supplement to behavioral audiometry. Over the years, the field has moved away from treating the subcortical auditory system as a bottom-up, hardwired conduit for sound, and is increasingly recognizing the contribution of top-down influences within the context of distributed neural networks. Studies using the FFR have played an instrumental role in this evolution of thinking.

The FFR is a noninvasive means of reliably measuring the fidelity and precision with which the brain encodes sound. Measures derived from the FFR (e.g. timing, amplitude, consistency, and pitch tracking, see Fig. 1) reveal an individual’s mapping between a stimulus and the brain’s activity, which may be impaired in disease or enhanced through expertize. The FFR has proven essential to answering basic questions about how our auditory system manages complex acoustic information, how it integrates with other senses, and how both tasks are shaped by experience^6,7,8. FFR measures are related to the ability to differentiate sounds, hear targets in noise, and to experience with music, tonal languages, or multilingualism^{8,9,10,11,12,13}. The FFR can reveal the plastic nature of the human auditory system, including its potential to change over short-time scales, and its sensitivity to enriched and impoverished experiences with sound^{13,14,15,16,17,18,19,20,21,22}.

The FFR is useful to address questions concerning impaired auditory processing in populations with impaired cochlear function^23,24,25,26, and in neurodevelopmental speech and language disorders^{27,28,29,30,31,32} or autism^33,34. It can also be used to study maturational^35,36 and aging-related changes^37,38, sex differences in auditory functions³⁹, and improvement caused by interventions^15,40,41,42. More broadly, the FFR can provide an index of neurological health, for instance, in populations with acquired neurological disorders (e.g. concussion)⁴³. For a comprehensive review of FFR and its role in indexing the effects of experience on the auditory brain, see refs. ^44,45.

A fundamental question is what source(s) underlie the FFR in humans. This is important for basic scientific knowledge for its own sake and also because a greater understanding of the FFR’s sources can inform its translation and deployment in medicine. Methods have emerged that allow for some spatial separation of FFR sources in humans (i.e., brainstem, thalamus, cortex^4,46,47). These studies have reopened questions about the degree to which activity in different subcortical and cortical centres contributes to the well-studied scalp-recorded FFR and whether sources identified using other methods generalize to the traditional, scalp-recorded response. To be clear: while many questions remain to be answered, we do not think the FFR is solely generated in the auditory cortex, nor do we exclude the possibility of cortical contributions under certain circumstances.

Here we aim to update our evolving understanding of the FFR in a way that is accessible to an interdisciplinary audience; and, we wish to outline a roadmap that promotes a more integrative understanding of the FFR and its potential to study human auditory function.

Historical roots and changing views

To our knowledge, the term “frequency-following response” was dubbed in the late 1960s by Worden and Marsh⁴⁸, where it was described in an animal model. Initially investigated with low-frequency pure tones (<500 Hz), FFRs were an appealing alternative/adjunct to other objective measures of auditory function available at the time (e.g., auditory brainstem responses, electrocochleograms) because the latter have poor frequency specificity and are less effective at evoking responses to stimulus frequencies below 500 Hz.

By the 1990s, however, evidence began to emerge that the FFR reflected more than mere stimulus audibility. Gary Galbraith, a pioneer in the use of richer FFR stimuli such as two-tone complexes, missing fundamental stimuli, and speech, reported that the FFR was affected by attention⁴⁹ and by how a particular speech stimulus was perceived by the listener⁵⁰. Galbraith’s insight that “the FFR is a unique tool for understanding the most important of all auditory capacities: the coding and processing of human language” has proven prescient as the 21st century has seen a dramatic increase in investigations into speech-evoked FFR and how response properties relate to human communication. With these discoveries has come a renewed interest in the investigation of the FFR above and beyond its ability to signal sound detection. Instead, as we detail below, the FFR is now seen as a powerful tool to understand the neurophysiological bases of complex auditory behaviors in humans, including speech and music.

Evoked responses, which are also derived from EEG recordings but typically using a low-pass frequency filter (<40 Hz, often referred to as “cortical auditory evoked potentials” or “late-latency responses” and their variants, such as the mismatch negativity or P300), generally reflect a response to stimulus onset and later processing stages. Distinguishing the FFR is the precision with which it retains the morphological features of the waveform of the stimulus, therefore revealing how the auditory system responds to its acoustic elements. An uncommon wealth of analysis strategies accompanies interpretation of this multifaceted response (see Fig. 1 and Box 2). The past 10 years have seen refinements of FFR analyses that capitalize on the richness of the response³.

Evidence for multiple sources in human scalp-recorded FFR

The biological sources of the FFR have been a topic of debate since the early days of the technique^51,52,53. Efforts to clarify the sources of far-field responses have yielded greater understanding of how and where auditory information is integrated across auditory and non-auditory regions and timescales, and the degree to which auditory centres are subject to neuroplasticity^54,55.

Our view of the FFR’s origins relies on three axioms about the auditory system.

1.
The central auditory system is a network of intertwined structures that extend across medulla, pons, midbrain, thalamus, and temporal lobes of cortex. This network is intrinsically connected to other sensory systems and motor, cognitive, and reward systems. To be sure, cells and circuits within each of the nuclei have specialized functions and properties; but, none of these cells or circuits operates in a vacuum. The interactivity of the system means that even something as simple as a primary auditory cortex neuron’s tuning curve has to be considered within the broader context of an integrative and plastic system (reviewed in Kraus and White-Schwoch⁴⁴). Thus, any consideration of one or more sources of the FFR also has to consider how those sources interact with each other and with non-auditory brain circuits. It is also important to bear in mind that the same auditory structure can yield different neural activity depending on the sound’s context^29,56,57,58.
2.
Phase-locking, the phenomenon by which neurons discharge at a particular phase within the stimulus cycle, is a common feature throughout the auditory system. Through this action the recurring, periodic elements of the stimulus (e.g., the period of the fundamental frequency, the period of the amplitude modulation frequency) are encoded in the synchronous activity of a neuronal population. As you ascend the lemniscal pathway the rate of phase-locking decreases. (For more on auditory system phase-locking see Box 3 and Fig. 2).
Fig. 2
Schematic of frequency ranges of speech and music and the relative activation of subcortical and cortical phase-locking to the frequency-following response. Phase-locking limitations of neurons and neuronal assemblies in the human auditory system are not yet known, but can be partly inferred from animal models. Despite phase-locking limitations, the frequency-following response is predictive of the functionality of the entire auditory system
Full size image
3.
The auditory system is plastic. Neurons throughout the auditory axis exhibit rapid plasticity based on stimulus context (e.g., Carbajal and Malmierca⁵⁹) and the interactive nature of the auditory system makes each centre subject to non-auditory input, whether by changes in overall brain physiology or metabolism, changes in environmental input, and/or changes in top-down cognitive input to refine sensory representation. Thus, while an FFR might measure the current functional state of stimulus representation in the auditory brain, that functional state reflects the legacy of this plasticity.

What supports the conventional wisdom that the FFR has a subcortical origin?

Our current understanding of sources of human scalp-recorded FFR is the culmination of non-invasive studies in humans and invasive studies in animal models, each of which has advantages and limitations. The inferior colliculus has often been considered as the dominant source of the FFR derived from EEG scalp-recordings (EEG-FFR) (reviewed in Chandrasekaran and Kraus⁶⁰), based on the auditory system’s reduced capacity for high-frequency phase-locking at higher centres. Additional evidence comes from direct recordings in animal models, in which the neural sources of the FFR have been studied by selectively taking different auditory structures offline by cooling, lesioning, or pharmacological manipulation. For example, the scalp-recorded FFR was abolished or strongly reduced by cryogenic blockade of the IC in cats⁵¹, and in human patients with focal IC lesions⁵², confirming that the IC is an important FFR signal generator. While these experiments ruled out more peripheral sources, they cannot rule out thalamic or cortical sources—since the IC is an obligatory station of the afferent pathway, blocking IC activity fails to disambiguate IC vs. thalamocortical contributions. Approaching this question from the other direction, studies in cats and rabbits showed that FFRs close to 100 Hz remained largely unaffected by decreased auditory cortex function, but were influenced by lesions to the inferior colliculus⁶¹. Also noteworthy is that speech-evoked FFRs and evoked responses to amplitude-modulated tones recorded directly from subcortical structures in animals strongly resemble those recorded from the brain’s surface and those recorded to the same stimuli in humans^62,63.

The FFR’s short stimulus-to-response latency of ~5–9 ms is often quoted as evidence of a subcortical origin (e.g. ref. ⁶⁴), as the IC has a latency of 5–7 ms. However, latency-based arguments are difficult to defend as FFR latencies vary considerably according to stimulus characteristics such as sound pressure level, frequency, and amplitude envelope, and stimulus-to-response latencies much longer than 7 ms have been reported between the stimulus and EEG-FFR in some studies (e.g. 14.6 ms⁶⁵). Furthermore, intracranial recordings from Heschl’s gyrus show that the first responses to sound in the cortex can occur as early as ~9 ms post stimulus onset⁶⁶.

Rethinking FFR sources: The multiple generator hypothesis

There have long been hints of the idea that the FFR comprises multiple generators. We advance the hypothesis that the EEG-FFR is an aggregate response reflecting multiple auditory stations, including the auditory nerve, cochlear nucleus, inferior colliculus, thalamus, and cortex, and that the specific mixture of sources may vary depending on the recording techniques, stimulus, and participant demographic. This hypothesis motivates several predictions.

Prediction 1: Decomposition of a multichannel EEG signal should indicate multiple, independent components. In 1978, Stillman et al. recorded FFRs to tones with various fundamental frequencies using only two EEG channels, and concluded that the human FFR is a composite of several waveforms whose relative influence differs as a function of frequency⁵³. Kuwada et al. recorded human EEG and electrophysiology in rabbits and concluded that surface recordings are composite responses from multiple brain generators⁶². Two-channel recordings and principal component analysis on multichannel EEG data have demonstrated separable FFR components that relate to stimulus properties, such as the presence or absence of energy at the fundamental frequency^64,67,68.
Prediction 2: Multimodal source modeling should indicate multiple generators of the scalp-recorded signal. Coffey et al. reported that FFRs to speech (with f0 ~100 Hz) could be non-invasively recorded using MEG, which allows spatial source localization. MEG-FFR contributions included not only subcortical sources—the cochlear nucleus, inferior colliculus, and medial geniculate body (thalamus)—but also the auditory cortices (with a right-hemisphere predominance)⁴. Using a combination of EEG and functional magnetic resonance imaging (fMRI), a subsequent study confirmed that hemodynamic activity in the right auditory cortex was related to individual differences in the EEG-based FFR f0 strength, consistent with the hypothesis that phase-locked activity in auditory cortex has a hemodynamic signature⁶⁹. Bidelman found corroborating evidence of multiple sources to the FFR, including a cortical one, using distributed source modeling techniques on multichannel EEG recordings and a speech stimulus (with f0 in the same range as in Coffey et al.). This EEG approach revealed subcortical sources contributing more than the auditory cortex⁴⁶ (note that thalamic sources did not appear to be included in the analysis).
Prediction 3: Individual differences in FFR components should correlate with behavior if they are functionally relevant. Zhang and Gong used principal component analysis on multichannel EEG data, and found multiple, separable components with different scalp topographies, only one of which correlated with pitch perception; they concluded that phase-locked activity at different sources differentially relates to behavior⁶⁸. Coffey et al. observed significant correlations between the magnitude of the right auditory cortical MEG-FFR response and pitch perception thresholds, as well as with musical training, suggesting that phase-locked activity in this region provides behaviorally–relevant information⁴. Separately, while the MEG-FFR strength at subcortical and cortical sources was predictive of speech-in-noise (SIN) perception, the strongest correlations were observed with the right auditory cortex⁷⁰. In a cross-modal attention task, Hartmann and Weisz confirmed the strong contribution of cortical regions to the MEG-FFR and found that only the right auditory cortex was significantly affected by attention⁷¹.
Prediction 4: Different stimulus frequencies will bias certain generators. Tichko and Skoe conducted an extensive investigation that measured EEG-FFR amplitude to complex tones as a function of fundamental frequency⁷². EEG-FFRs to stimuli with frequencies between 16.35 and 880 Hz showed generally decreasing amplitude with increasing frequency, but with local maxima at ~44, 87, 208, and 415 Hz. The local maxima suggest an EEG-FFR with multiple underlying generators whose activity interacts constructively or destructively at the scalp depending on the stimulus frequency (Fig. 3a). The EEG-FFR interference pattern that produced these local maxima was modeled by the authors as the summation of multiple phase-locked signals, all phase-locked to the stimulus frequencies but with different latencies (i.e., neural conduction times). The authors suggested that recording protocol, electrode montage, recording quality (i.e. signal-to-noise ratio), and subject demographics influence the EEG-FFR interference patterns because each one of these manipulations alters the strength of phase-locking or the degree to which this phase-locking can be detected at the scalp.
Fig. 3
a Scalp-recorded frequency-following responses (FFRs) may reflect, in part, the summation of phase-locked activity from different sources, each with a characteristic lag relative to the onset of the stimulus. The putative sources of the FFRs include the cochlea, auditory nerve (AN), cochlear nucleus (CN), superior olive (SOC), inferior colliculus (IC), medial geniculate body (MGB), and auditory cortex (AC). b Electrode montage influences the relative contribution of sources in the scalp-recorded signal: for example, the montages shown on the left and central panels which include an electrode at the mastoid likely include a greater contribution from peripheral sources than does the montage illustrated on the right, which references a single vertex channel to the average of other scalp electrodes
Full size image
Prediction 5: Different recording techniques will differ in their sensitivity to different sources. Source-localized EEG-FFR and MEG-FFR do not show identical patterns of source strength^4,46. Results from MEG should not be directly applied to EEG due to their differing sensitivities to radial vs. tangential currents, and to superficials vs. deep sources (discussed with reference to FFR in ref. ⁴); although they both are sensitive to the electrochemical current flows within and between brain cells, they provide partly overlapping and partly complementary information^73,74,75. Still, even using only EEG-FFR, electrode placement and referencing appears to affect signal content. Coffey et al. compared two common electrode montages and found only a moderate correlation in their sensitivity to behavioral measures⁷⁶; these montages, often used interchangeably, may thus differ in the combination of sources to which they are sensitive (Fig. 3b). Likewise, reaction times on an auditory task were noted to track with amplitude of the EEG-FFR in an electrode montage that favors more central subcortical sources, but not in responses from a simultaneously recorded montage that was more peripherally biased⁷⁷.

A thread through this work is that recording modalities, stimuli, and stimulus presentation paradigms all may influence the mix of sources underlying the recorded signal. One must therefore exercise caution in extrapolating conclusions from one modality or paradigm to the results of another.

In summary, the extent of contributions of sources to the scalp-recorded EEG-FFR under different experimental conditions and in different populations is an unsettled topic. Yet the discovery that different recording techniques implicate different underlying generators increases the richness of what FFR can tell us. We find ourselves sympathetic to the view that the EEG-FFR signal can represent a mixture of sources including the auditory nerve, CN, IC, MGB, and cortex, and that the contribution of each source could differ depending on where and how the response is recorded. Regardless of the “real-time” sources of an FFR, and the possibility that one source may dominate the response, we want to reemphasize that each of those potential sources operates in concert with each other (and non-auditory systems) to shape its function.

Approaches to test hypotheses about FFR origins

To make further progress on these concepts, it will be useful to employ methods whereby FFR data are collected simultaneously with other data that unambiguously reflect cortical and network activity^70,78,79,80. Functional connectivity measures that allow for quantification of the strength and direction of information transfer may also prove useful when applied to spatially resolved signals such as EEG/MEG in source space⁸¹. Combinations of different methods could be especially valuable, such as EEG-based FFR together with fMRI or functional near-infrared spectroscopy (fNIRS)^69,82. fMRI or fNIRS provides a means of quantifying functional networks throughout the brain which could be used to relate to FFR variables.

Recent animal neurophysiology studies have demonstrated that an FFR similar to that of humans can be recorded in awake monkeys⁸³, confirming previous demonstrated analogs between humans and anesthetized non-human animals^37,38,84,85. Awake animal preparations could be particularly enlightening because of the possibility of recording simultaneously from multiple sites in behaving animals. Neurophysiological studies in animals and humans⁸⁶ could provide a ground truth comparison for FFR strength estimates, establishing cellular-level correlates of observable EEG signals and their changes with plasticity. Another approach would be to combine FFR measurements with brain stimulation of the auditory cortex.

There is also still more to learn about the “old-fashioned” scalp-recorded FFR. Much work to date has focused on the lower-frequency components of the response relating to the fundamental frequency of the stimulus, even though there are approaches that bias responses to high-frequency cues such as speech formants. A wealth of analysis techniques accompanies the interpretation of the FFR; see Krizman and Kraus³. A deeper understanding of these FFR components can enrich our understanding of complex auditory behaviors. And, when applied in tandem with animal research and other techniques, these techniques can further our understanding of generators underlying these relatively simple paradigms.

Finally, new methods to collect FFR offer many interesting possibilities for future research. For example, an exciting future direction is to record FFR to continuous, natural speech or other signals, instead of the traditional repeated singles stimulus paradigm^87,88,89,90. Combined with free-field recordings⁹¹, portable FFR systems⁹², and/or wearable technologies⁹³, these methods open opportunities to examine FFR in real-world settings. On the analytical front, machine learning algorithms have recently been developed allowing single-trial FFR classification^94,95 which could have many applications, including for instance as neurofeedback in training paradigms.

Network dynamics and the “functional view” of the FFR

Contemporary approaches in systems and cognitive neuroscience emphasize the concept that the nervous system functions as an integrated set of complex networks, comprising various interconnected nodes and hubs at which distinct operations take place⁹⁶, and from whose interactions complex cognition emerges. This perspective strongly informs our view that the auditory nervous system exhibits extensive bidirectional cortical–subcortical and ipsilateral–contralateral connectivity (in addition to bidirectional connectivity with other sensory and cognitive systems). In turn, auditory cortex may itself be considered a hub⁹⁷ for the ventral and dorsal corticocortical loops that are known to underwrite auditory cognition including auditory object recognition, localization, speech, and music^{98,99,100,101}. Thus, we may consider the entire auditory system as consisting of a number of conjoined complex networks, each of which is of course far from fully characterized at this point.

Taking this idea of a highly interconnected nervous system as a framework, we suggest that the FFR serves as an index of the functional properties of the subcortical and early cortical parts of the auditory system. By virtue of the interconnectedness of networks, the FFR is a snapshot of auditory processing. It also seems that the FFR would be influenced by, and hence be relevant to, the corticocortical loops as well. Although direct evidence for such network-level influences remains sparse, the modulation of FFR parameters associated with training-induced plasticity or with cortical dysfunction, as mentioned above, may be one instantiation of this phenomenon^15,102. Similarly, the proposals that the FFR may be influenced by attention^{71,103,104,105} (but see ref. Varghese et al.¹⁰⁶), arousal state¹⁰⁷, or task demands^76,86,108, may constitute another example. Conceptually similar is the idea that stimulus-specific adaptation (and mismatch negativity) were originally considered cortical¹⁰⁹, but we now know that they reflect an integrated auditory change detection response^{56,57,110,111}.

It is our view that the FFR should be thought of as an aggregate measure of the response of the auditory system, reflecting its cumulative prior history. Specific auditory brain centres may contribute differently to a measured response, but those centres function jointly, and in the context of broader neural networks. This gives us the “functional view” of the FFR—we see it as a measure of how well the entire brain is coding sound features much more than as a reflection of activity within any single nucleus, because the nuclei are embedded in complex functional networks. Distinct computations may happen at local nodes, but the functional metrics can be considered as an emergent property of the interactions between nodes. Considering the FFR in this way leads to the development of systems-level hypotheses that should encourage understanding of the relationships between the FFR and other neural features. For example, combining FFR measures with functional MRI may prove useful in delineating the interactions between auditory representations and higher-order cognitive functions (e.g., attention, memory, and even visual and motor operations) and how these interactions change with experience. Similarly, functional and structural connectivity metrics offer opportunities to explore individual differences in network properties and how they affect auditory encoding. All of these approaches can also inform questions relating to development and maturation, as well as to aging and disorders.

Box 1 FFR collection and analysis

The FFR may be measured in its most simple form using one-channel EEG, with a reasonably high sampling rate (>2000 Hz) and open filters, as subjects are presented with a high number of repetitions (500+) of an auditory stimulus such as a complex tone or speech syllable. FFRs have commonly been measured to stimuli with fundamental frequencies above 80 Hz. Although frequency tracking occurs in the brain at much lower frequencies (e.g. ~2–4 Hz¹¹²) it remains to be seen whether these lower frequencies are tracked by identical neural mechanisms. The most commonly used electrode montages are single or averaged earlobes (or mastoids) to vertex (Cz), or hairline (~FCz) to 7th cranial vertebra (see Fig. 3b). More recently, multichannel EEG, and MEG, and complementary measures of whole-brain hemodynamic response (BOLD fMRI)⁶⁹ have been used to provide spatial information. In most studies to date, the stimulus has a fundamental frequency between 80 and 500 Hz. Stimulus duration is usually between 40 and 250 ms, representing a trade-off between ecological validity and expedience, as high numbers of repetitions are needed. However, even sentence-length stimulation has been used⁸⁸.

Insert tube earphones are used to avoid electromagnetic stimulus artifacts, and stimulation is generally suprathreshold (70–80 dB SPL). Stimuli may be presented monaurally (e.g. to study lateralization¹¹³) or binaurally (to maximize the amplitude of the FFR). They are often presented in alternating polarity (i.e. the digital signal is multiplied by −1 for half of the presentations), which allows for analysis of both envelope and spectral responses.

Experimental approaches should be matched to the research questions and constraints. For research questions that concern the function and contribution of specific structures or localize individual differences, methods that provide spatio-temporal separation are necessary (e.g. MEG). For research questions concerning identification of biomarkers, relation to behavioral performance, or tracking longitudinal changes, simpler equipment with a lower cost is desirable. The single-EEG channel FFR has repeatedly proven its worth as a sensitive measure of the fidelity with which the auditory system as a whole preserves useful sound information^6,45. Intermediate approaches that make use of information from more than one channel^64,114 or stimulus are also possible. For example, results from several studies measuring FFRs to a range of frequencies suggested frequency-dependent cortical contributions, thus generating hypotheses for future studies with more definitive localization methodology^47,72,103.

Box 2 Terminology

A source of confusion in the field stems from the many terms and acronyms used to refer to the FFR (e.g. ABR, sABR, cABR, AMFR, SS-EP, SSR). The auditory brainstem response has been invoked to help ground FFR in a known clinical technique. Thus, speech-evoked ABR (or “sABR”) will turn up in some circles. Kraus and colleagues promoted “cABR” for some time. The “c” stands for complex, referring to the evoking stimulus, to contrast it with the clicks and pure tones of conventional audiometric ABR. It was also employed to avoid limiting the discussion of the response to phase-locking to the stimulus periodicity (i.e., the following of the stimulus frequency); cABR was meant to be inclusive of the transient onsets and offsets present in the recording. However, we no longer use the term—in part due to its potentially misleading emphasis on “brainstem,” in addition to the implication that it has identical sources and generating mechanisms as the ABR. Another term used is “envelopefollowing response”¹¹⁵. For speech stimuli, however, the term is suboptimal because it incorrectly implies that temporal fine structure (TFS, the counterpart to the acoustic envelope) cannot be observed in the response. Although the acoustic envelope, which imparts important perceptual attributes of sound (e.g., voice pitch), is the most widely studied facet of the FFR, other facets that have received attention include the TFS and responses to distortion products^3,116,117. We propose that terms such as envelope and temporal fine structure be subordinated as modifiers: FFR_ENV and FFR_TFS. The rarely used “amplitude modulation following response”, likewise, is too limiting, suggesting a narrowly defined stimulus type⁶². SS-EP, steady-state evoked potential⁷, and SSR, steady-state response¹¹⁴ have also emerged recently, adding to the alphabet soup of terminology. The lack of standardized terminology frustrates literature searches, creates false impressions, and at times leads to unnecessary balkanization among researchers. We advocate for simply calling it “FFR,” which avoids presupposing a single source, emphasizing one aspect of the response over another, and limiting the stimulus set. FFR, while imperfect, is a transparent name because it emphasizes that it is a neural response that reflects the acoustical properties of the inducing stimulus, including its periodic and non-periodic components. From a practical standpoint, the term FFR lends itself to easy modification based on collection technique (EEG-FFR, MEG-FFR), analysis technique (FFR_ENV, FFR_TFS) or response subcomponent of interest (FFR-F0, FFR-harmonics). And, it has been the modal term for the past 50 years.

Box 3 Phase-locking in the auditory system

As a general principle, as you ascend the auditory system, neural response latency increases, phase-locking (i.e. neurons discharging at a particular phase within the stimulus cycle) becomes weaker at higher stimulus frequencies, and the frequency range over which phase-locking occurs becomes narrower. It is known from single-neuron recordings that although the upper limit of stimulus synchronization declines progressively in the ascending pathway from 5000 Hz in the auditory nerve, it is still present up to 800 Hz in the medial geniculate body^118,119,120, with some reports suggesting thalamic phase-locking as high as 1000 Hz in cats¹²¹. While the upper limit of phaselocking of individual cortical neurons is often quoted as ~100 Hz, this limit may be an underestimate as it has not clearly been established in humans. Intracranial recording directly from the auditory cortex is possible in neurosurgical epilepsy patients, and using this method, clear phase-locked near-field responses have been reported in response to click trains up to 200 Hz^122,123 and to speech stimuli with fundamental frequencies within the range of 120 Hz⁸⁶. The volley principle¹²⁴, that multiple neurons each contribute to a subdivision of a period, in theory, could enable even higher frequencies to be coded by neural populations. Indeed, multiple-neuron activity recordings from the monkey primary auditory cortex show phase-locking capability at 250 Hz frequencies (but not at 500 Hz)¹²⁵, and in the guinea pig some auditory cortex units phase locked up to 500 Hz (with some units tuned specifically to the fundamental frequency range of the species’ 270 Hz alarm calls, suggesting behavioral importance¹²⁶). The overlap in the phaselocking ranges observed across the auditory neuraxis together with the contribution of the response to transient stimulus components create the physiological conditions for the FFR for any given stimulus frequency to have multiple potential underlying generators. The presence of multiple sources creates complex far-field waveforms that, because of the phase relationships of the sources, can lead to false conclusions about the number and latency of the sources^52,72,127.

Box 4 Questions for future research

Why do FFRs, which cover only a relatively narrow band of frequencies compared to the entire hearing range (Fig. 2a), nonetheless predict such a wide range of auditory behaviors?
How does the FFR relate to other measures that are sensitive to quasi-periodic or aperiodic fluctuations in the signal that allow for reconstruction of other aspects of sound content (refs. ^55,128,129)?
What accounts for large inter-individual differences in several parameters of FFR sound encoding even in a homogeneous population of healthy, normal-hearing adults⁷⁶? What are the neural origins and behavioral correlates of these differences?
How does lateralization of cortical and subcortical FFR, observed in several studies^4,69,113, emerge in the system, and what functional relevance does it have?
What kinds of circuit dynamics do we expect as different structures interact, and can these be observed using FFR methods that allow for source separation? Can bottom-up and top down influences be definitively separated using the FFR?
To what extent do FFR fundamental frequencies and harmonics reflect activity from different cortical and subcortical sources, and under which conditions of measurement (e.g. MEG vs. EEG, stimulus properties)? How and where is this information integrated in the brain?
What are the neural origins of group differences and experience-related changes in the FFR? How does the relative contribution of brainstem, thalamic, and cortical sources vary as a function of age or experience with sound? Do some pathologies affect certain FFR generators more than others?
Can we integrate predictions stemming from theoretical models of oscillatory activity (ref. ¹¹⁶) with physiological FFR data to achieve a better understanding of the functional properties of the whole system?
Is the FFR the product of feed-forward transmission of phase-locked spiking, or alternatively, active oscillatory circuits?

Conclusions

Auditory neuroscience is now more attuned to the significance of top-down influences and the role of neuroplasticity in auditory processing; the auditory system is correctly viewed as part of interconnected circuitry that involves cognitive, sensorimotor, and limbic systems. In many ways, the FFR is an ideal way to access this complex circuit precisely because it is not a monolithic response reflecting only a single stimulus component or single source. Rather, the FFR reveals how the auditory system responds to multiple acoustic elements throughout an entire sound, enabling a wealth of analysis strategies. Germane to this perspective article, the FFR can be measured with a number of different techniques, each of which provides a distinct window into auditory processing. Because the FFR is so rich and complex, much more is to be learned from it (Box 4). There needs to be agreement on terminology, a concerted effort against over-generalization vis-à-vis its generation, and careful harmonization between techniques and research questions to fully understand and successfully harness its potential. We hope this perspective piece serves to both inform readers and to inspire them to embrace the complexity of the FFR while remaining grounded in best practices and interpretation as research into the brain mechanisms underlying this response proceeds.

References

Schnupp, J., Nelken, I. & King, A. Auditory Neuroscience Making Sense of Sound (MIT Press, 2011).
Skoe, E. & Kraus, N. Auditory brain stem response to complex sounds: a tutorial. Ear Hear. 31, 302–324 (2010).
Article PubMed PubMed Central Google Scholar
Krizman, J. & Kraus, N. Analyzing the FFR: a tutorial for decoding the richness of auditory function. Hear. Res. 107779 (2019). https://doi.org/10.1016/j.heares.2019.107779
Article PubMed PubMed Central Google Scholar
Coffey, E. B. J. J., Herholz, S. C., Chepesiuk, A. M. P. P., Baillet, S. & Zatorre, R. J. Cortical contributions to the auditory frequency-following response revealed by MEG. Nat. Commun. 7, 11070 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Moushegian, G., Rupert, A. L. & Stillman, R. D. Scalp-recorded early responses in man to frequencies in the speech range. Electroencephalogr. Clin. Neurophysiol. 35, 665–667 (1973).
Article CAS PubMed Google Scholar
Kraus, N. & Nicol, T. The power of sound for brain health. Nat. Hum. Behav. 1, 700–702 (2017).
Article PubMed Google Scholar
Nozaradan, S., Schönwiesner, M., Caron-Desrochers, L. & Lehmann, A. Enhanced brainstem and cortical encoding of sound during synchronized movement. Neuroimage 142, 231–240 (2016).
Article PubMed Google Scholar
Musacchia, G., Sams, M., Skoe, E. & Kraus, N. Musicians have enhanced subcortical auditory and audiovisual processing of speech and music. Proc. Natl. Acad. Sci. USA 104, 15894–15898 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Thompson, E. C., Woodruff Carr, K., White-Schwoch, T., Otto-Meyer, S. & Kraus, N. Individual differences in speech-in-noise perception parallel neural speech processing and attention in preschoolers. Hear. Res. 344, 148–157 (2017).
Article PubMed Google Scholar
Marmel, F. et al. Subcortical neural synchrony and absolute thresholds predict frequency discrimination independently. J. Assoc. Res. Otolaryngol. 14, 757–766 (2013).
Article CAS PubMed PubMed Central Google Scholar
Omote, A., Jasmin, K. & Tierney, A. Successful non-native speech perception is linked to frequency following response phase consistency. Cortex 93, 146–154 (2017).
Article PubMed PubMed Central Google Scholar
Zhao, T. C. & Kuhl, P. K. Linguistic effect on speech perception observed at the brainstem. Proc. Natl. Acad. Sci. USA 115, 8716–8721 (2018).
Article CAS PubMed PubMed Central Google Scholar
Krishnan, A., Xu, Y., Gandour, J., Carianib, P. & Cariani, P. Encoding of pitch in the human brainstem is sensitive to language experience. Cogn. Brain Res. 25, 161–168 (2005).
Article Google Scholar
Wong, P. C. M., Skoe, E., Russo, N. M., Dees, T. & Kraus, N. Musical experience shapes human brainstem encoding of linguistic pitch patterns. Nat. Neurosci. 10, 420–422 (2007).
Article CAS PubMed PubMed Central Google Scholar
Reetzke, R., Xie, Z., Llanos, F. & Chandrasekaran, B. Tracing the trajectory of sensory plasticity across different stages of speech learning in adulthood. Curr. Biol. 28, 1419–1427.e4 (2018).
Article CAS PubMed PubMed Central Google Scholar
Skoe, E., Krizman, J., Spitzer, E. & Kraus, N. The auditory brainstem is a barometer of rapid auditory learning. Neuroscience 243, 104–114 (2013).
Article CAS PubMed Google Scholar
Parbery-Clark, A., Anderson, S., Hittner, E. & Kraus, N. Musical experience offsets age-related delays in neural timing. Neurobiol. Aging 33, 1483.e1–1483.e4 (2012).
Article Google Scholar
Krizman, J., Marian, V., Shook, A., Skoe, E. & Kraus, N. Subcortical encoding of sound is enhanced in bilinguals and relates to executive function advantages. Proc. Natl. Acad. Sci. USA 109, 7877–7881 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Colella-Santos, M. F., Donadon, C., Sanfins, M. D. & Borges, L. R. Otitis media: long-term effect on central auditory nervous system. Biomed. Res. Int. 2019, 1–10 (2019).
Article Google Scholar
Elmer, S., Hausheer, M., Albrecht, J. & Kühnis, J. Human brainstem exhibits higher sensitivity and specificity than auditory-related cortex to short-term phonetic discrimination learning. Sci. Rep. 7, 7455 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Jafari, Z. & Malayeri, S. Effects of congenital blindness on the subcortical representation of speech cues. Neuroscience 258, 401–409 (2014).
Article CAS PubMed Google Scholar
Jeng, F. C. et al. Cross-linguistic comparison of frequency-following responses to voice pitch in american and chinese neonates and adults. Ear Hear. 32, 699–707 (2011).
Article PubMed Google Scholar
Presacco, A., Simon, J. Z. & Anderson, S. Speech-in-noise representation in the aging midbrain and cortex: effects of hearing loss. PLoS One 14, e0213899 (2019).
Article CAS PubMed PubMed Central Google Scholar
Daly, D. M. D., Roeser, R. J. R., Moushegian, G. & clinical, G. M.-E. and & 1976, undefined. The frequency-following response in subjects with profound unilateral hearing loss. Electronencephalogr. Clin. Neurophysiol. 40, 132–142 (1976).
Article CAS Google Scholar
Zhong, Z., Henry, K. S. & Heinz, M. G. Sensorineural hearing loss amplifies neural coding of envelope information in the central auditory system of chinchillas. Hear. Res. 309, 55–62 (2014).
Article PubMed Google Scholar
Shaheen, L. A., Valero, M. D. & Liberman, M. C. Towards a diagnosis of cochlear neuropathy with envelope following responses. J. Assoc. Res. Otolaryngol. 16, 727–745 (2015).
Article PubMed PubMed Central Google Scholar
Hornickel, J. & Kraus, N. Unstable representation of sound: a biological marker of dyslexia. J. Neurosci. 33, 3500–3504 (2013).
Article CAS PubMed PubMed Central Google Scholar
White-Schwoch, T. et al. Auditory processing in noise: a preschool biomarker for literacy. PLoS Biol. 13, 1–17 (2015).
Article CAS Google Scholar
Chandrasekaran, B., Hornickel, J., Skoe, E., Nicol, T. & Kraus, N. Context-dependent encoding in the human auditory brainstem relates to hearing speech in noise: implications for developmental dyslexia. Neuron 64, 311–319 (2009).
Article CAS PubMed PubMed Central Google Scholar
Basu, M., Krishnan, A. & Weber-Fox, C. Brainstem correlates of temporal auditory processing in children with specific language impairment. Dev. Sci. 13, 77–91 (2010).
Article PubMed Google Scholar
Billiet, C. R. & Bellis, T. J. The relationship between brainstem temporal processing and performance on tests of central auditory function in children with reading disorders. J. Speech Lang. Hear. Res. 54, 228–242 (2010).
Article PubMed Google Scholar
Rocha-Muniz, C. N., Befi-Lopes, D. M. & Schochat, E. Investigation of auditory processing disorder and language impairment using the speech-evoked auditory brainstem response. Hear. Res. 294, 143–152 (2012).
Article PubMed Google Scholar
Otto-Meyer, S., Krizman, J., White-Schwoch, T. & Kraus, N. Children with autism spectrum disorder have unstable neural responses to sound. Exp. Brain Res. 236, 733–743 (2018).
Article PubMed Google Scholar
Russo, N., Nicol, T., Trommer, B., Zecker, S. & Kraus, N. Brainstem transcription of speech is disrupted in children with autism spectrum disorders. Dev. Sci. 12, 557–567 (2009).
Article PubMed PubMed Central Google Scholar
Musacchia, G. et al. Effects of noise and age on the infant brainstem response to speech. Clin. Neurophysiol. 129, 2623–2634 (2018).
Article PubMed Google Scholar
Ribas-Prats, T. et al. The frequency-following response (FFR) to speech stimuli: a normative dataset in healthy newborns. Hear. Res. 371, 28–39 (2019).
Article PubMed Google Scholar
Lai, J. & Bartlett, E. L. Masking differentially affects envelope-following responses in young and aged animals. Neuroscience 386, 150–165 (2018).
Article CAS PubMed Google Scholar
Parthasarathy, A., Datta, J., Torres, J. A. L., Hopkins, C. & Bartlett, E. L. Age-related changes in the relationship between auditory brainstem responses and envelope-following responses. J. Assoc. Res. Otolaryngol. 15, 649–661 (2014).
Article PubMed PubMed Central Google Scholar
Krizman, J., Bonacina, S. & Kraus, N. Sex differences in subcortical auditory processing emerge across development. Hear. Res. 380, 166–174 (2019).
Article PubMed Google Scholar
Anderson, S., White-Schwoch, T., Parbery-Clark, A. & Kraus, N. Reversal of age-related neural timing delays with training. Proc. Natl. Acad. Sci. USA 110, 4357–4362 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Song, J., Skoe, E., Wong, P. & Kraus, N. Plasticity in the adult human auditory brainstem following short-term linguistic training. J. Cogn. Neurosci. 20, 1892–1902 (2008).
Article PubMed PubMed Central Google Scholar
Tierney, A. T., Krizman, J. & Kraus, N. Music training alters the course of adolescent auditory development. Proc. Natl. Acad. Sci. USA 112, 1–6 (2015).
Article CAS Google Scholar
Kraus, N. et al. The neural legacy of a single concussion. Neurosci. Lett. 646, 21–23 (2017).
Article CAS PubMed Google Scholar
Kraus, N. & White-Schwoch, T. Unraveling the biology of auditory learning: a cognitive-sensorimotor-reward framework. Trends Cogn. Sci. 19, 642–654 (2015).
Article PubMed PubMed Central Google Scholar
Kraus, N., Anderson, S., & White-Schwoch, T. The Frequency-Following Response: A Window into Human Communication. Springer Handbook of Auditory Research (eds N. Kraus et al.) 61, 1–15. https://doi.org/10.1007/978-3-319-47944-6_1 (2017).
Chapter Google Scholar
Bidelman, G. M. Subcortical sources dominate the neuroelectric auditory frequency-following response to speech. Neuroimage 175, 56–69 (2018).
Article PubMed Google Scholar
Zhang, X. & Gong, Q. Frequency-following responses to complex tones at different frequencies reflect different source configurations. Front. Neurosci. 13, 130 (2019).
Article PubMed PubMed Central Google Scholar
Worden, F. & Marsh, J. Frequency-following (microphonic-like) neural responses evoked by sound. Electroencephalogr. Clin. Neurophysiol. 25, 42–52 (1968).
Article CAS Google Scholar
Galbraith, G. & Doan, B. Brainstem frequency-following and behavioral responses during selective attention to pure tone and missing fundamental stimuli. Int. J. Psychophysiol. 19, 203–214 (1995).
Galbraith, G. C., Jhaveri, S. P. & Kuo, J. Speech-evoked brainstem frequency-following responses during verbal transformations due to word repetition. Electroencephalogr. Clin. Neurophysiol. 102, 46–53 (1997).
Article CAS PubMed Google Scholar
Smith, J. C., Marsh, J. T. & Brown, W. S. Far-field recorded frequency-following responses: evidence for the locus of brainstem sources. Electroencephalogr. Clin. Neurophysiol. 39, 465–472 (1975).
Article CAS PubMed Google Scholar
Sohmer, H., Pratt, H. & Kinarti, R. Sources of frequency following responses (FFR) in man. Electroencephalogr. Clin. Neurophysiol. 42, 656–664 (1977).
Article CAS PubMed Google Scholar
Stillman, R. D., Crow, G. & Moushegian, G. Components of the frequency-following potential in man. Electroencephalogr. Clin. Neurophysiol. 44, 438–446 (1978).
Article CAS PubMed Google Scholar
Herdman, A. T. et al. Intracerebral sources of human auditory steady-state responses. Brain Topogr. 15, 69–86 (2002).
Article PubMed Google Scholar
Dean Linden, R., Picton, T. W., Hamel, G. & Campbell, K. B. Human auditory steady-state evoked potentials during selective attention. Electroencephalogr. Clin. Neurophysiol. 66, 145–159 (1987).
Article Google Scholar
Pérez-González, D., Malmierca, M. S. & Covey, E. Novelty detector neurons in the mammalian auditory midbrain. Eur. J. Neurosci. 22, 2879–2885 (2005).
Article PubMed Google Scholar
Shiga, T. et al. Deviance-related responses along the auditory hierarchy: combined FFR, MLR and MMN evidence. PLoS One 10, e0136794 (2015).
Article PubMed PubMed Central CAS Google Scholar
Skoe, E., Krizman, J., Spitzer, E. & Kraus, N. Prior experience biases subcortical sensitivity to sound patterns. J. Cogn. Neurosci. 27, 124–140 (2015).
Article PubMed Google Scholar
Carbajal, G. V. & Malmierca, M. S. The neuronal basis of predictive coding along the auditory pathway: from the subcortical roots to cortical deviance detection. Trends Hear. 22, 233121651878482 (2018).
Article Google Scholar
Chandrasekaran, B. & Kraus, N. The scalp-recorded brainstem response to speech: neural origins and plasticity. Psychophysiology 47, 236–246 (2010).
Article PubMed Google Scholar
Kiren, T., Aoyagi, M., Furuse, H. & Koike, Y. An experimental study on the generator of amplitude-modulation following response. Acta Otolaryngol. Suppl. 511, 28–33 (1994).
CAS PubMed Google Scholar
Kuwada, S. et al. Sources of the scalp-recorded amplitude-modulation following response. J. Am. Acad. Audiol. 13, 188–204 (2002).
ADS PubMed Google Scholar
White-Schwoch, T., Nicol, T., Warrier, C. M., Abrams, D. A. & Kraus, N. Individual differences in human auditory processing: insights from single-trial auditory midbrain activity in an animal model. Cereb. Cortex 27, 5095–5115 (2017).
Article PubMed Google Scholar
King, A., Hopkins, K. & Plack, C. J. Differential group delay of the frequency following response measured vertically and horizontally. J. Assoc. Res. Otolaryngol. 17, 133–143 (2016).
Article PubMed PubMed Central Google Scholar
Akhoun, I. et al. The temporal relationship between speech auditory brainstem responses and the acoustic pattern of the phoneme/ba/in normal-hearing adults. Clin. Neurophysiol. 119, 922–933 (2008).
Article CAS PubMed Google Scholar
Brugge, J. F. et al. Functional localization of auditory cortical fields of human: click-train stimulation. Hear. Res. 238, 12–24 (2008).
Article PubMed Google Scholar
Galbraith, G. C. Two-channel brain-stem frequency-following responses to pure tone and missing fundamental stimuli. Electroencephalogr Clin. Neurophysiol. Potentials Sect. 92, 321–330 (1994).
Article CAS Google Scholar
Zhang, X. & Gong, Q. Correlation between the frequency difference limen and an index based on principal component analysis of the frequency-following response of normal hearing listeners. Hear. Res. https://doi.org/10.1016/j.heares.2016.12.004 (2016).
Article PubMed Google Scholar
Coffey, E. B. J., Musacchia, G. & Zatorre, R. J. Cortical correlates of the auditory frequency-following and onset responses: EEG and fMRI evidence. J. Neurosci. 37, 830–838 (2016).
Article Google Scholar
Coffey, E. B. J., Chepesiuk, A. M. P., Herholz, S. C., Baillet, S. & Zatorre, R. J. Neural correlates of early sound encoding and their relationship to speech-in-noise perception. Front. Neurosci. 11, 479 (2017).
Article PubMed PubMed Central Google Scholar
Hartmann, T. & Weisz, N. Auditory cortical generators of the frequency following response are modulated by intermodal attention. Neuroimage 203, 116185 (2019). https://doi.org/10.1016/j.neuroimage.2019.116185
Article PubMed Google Scholar
Tichko, P. & Skoe, E. Frequency-dependent fine structure in the frequency-following response: the byproduct of multiple generators. Hear. Res. 348, 1–15 (2017).
Article PubMed Google Scholar
Lin, F.-H. et al. Assessing and improving the spatial accuracy in MEG source localization by depth-weighted minimum-norm estimates. Neuroimage 31, 160–171 (2006).
Article PubMed Google Scholar
Baillet, S. Magnetoencephalography for brain electrophysiology and imaging. Nat. Neurosci. 20, 327–339 (2017).
Article CAS PubMed Google Scholar
Gross, J. et al. Good practice for conducting and reporting MEG research. Neuroimage 65, 349–363 (2013).
Article PubMed Google Scholar
Coffey, E. B. J., Colagrosso, E. M. G., Lehmann, A., Schönwiesner, M. & Zatorre, R. J. Individual differences in the frequency-following response: relation to pitch perception. PLoS One 11, e0152374 (2016).
Article PubMed PubMed Central CAS Google Scholar
Galbraith, G. C. et al. Putative measure of peripheral and brainstem frequency-following in humans. Neurosci. Lett. 292, 123–127 (2000).
Article CAS PubMed Google Scholar
Bidelman, G. M., Davis, M. K. & Pridgen, M. H. Brainstem-cortical functional connectivity for speech is differentially challenged by noise and reverberation. Hear. Res. 367, 149–160 (2018).
Article PubMed PubMed Central Google Scholar
Musacchia, G., Strait, D. L. & Kraus, N. Relationships between behavior, brainstem and cortical encoding of seen and heard speech in musicians and non-musicians. Hear. Res. 241, 34–42 (2008).
Article PubMed PubMed Central Google Scholar
Presacco, A., Simon, J. Z. & Anderson, S. Effect of informational content of noise on speech representation in the aging midbrain and cortex. J. Neurophysiol. https://doi.org/10.1152/jn.00373.2016 (2016).
Article PubMed PubMed Central Google Scholar
Bastos, A. M. & Schoffelen, J.-M. A tutorial review of functional connectivity analysis methods and their interpretational pitfalls. Front. Syst. Neurosci. 9, 1–23 (2016).
Article Google Scholar
Chandrasekaran, B., Kraus, N. & Wong, P. C. M. Human inferior colliculus activity relates to individual differences in spoken language learning. J. Neurophysiol. 107, 1325–1336 (2012).
Article PubMed Google Scholar
Ayala, Y. A., Lehmann, A. & Merchant, H. Monkeys share the neurophysiological basis for encoding sound periodicities captured by the frequency-following response with humans. Sci. Rep. 7, 16687 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Warrier, C. M., Abrams, D. A., Nicol, T. G. & Kraus, N. Inferior colliculus contributions to phase encoding of stop consonants in an animal model. Hear. Res. 282, 108–118 (2011).
Article PubMed PubMed Central Google Scholar
Abrams, D. A., Nicol, T., White-Schwoch, T., Zecker, S. & Kraus, N. Population responses in primary auditory cortex simultaneously represent the temporal envelope and periodicity features in natural speech. Hear. Res. 348, 31–43 (2017).
Article PubMed Google Scholar
Behroozmand, R. et al. Neural correlates of vocal production and motor control in human Heschl’s gyrus. J. Neurosci. 36, 2302–2315 (2016).
Article CAS PubMed PubMed Central Google Scholar
Puschmann, S., Baillet, S. & Zatorre, R. J. Musicians at the cocktail party: neural substrates of musical training during selective listening in multispeaker situations. Cereb. Cortex https://doi.org/10.1093/cercor/bhy193 (2018).
Article PubMed Central Google Scholar
Forte, A. E., Etard, O. & Reichenbach, T. The human auditory brainstem response to running speech reveals a subcortical mechanism for selective attention. Elife 6, 1–12 (2017).
Maddox, R. K. & Lee, A. K. C. Auditory brainstem responses to continuous natural speech in human listeners. eNeuro 5, ENEURO.0441-17.2018 5, 1–13 (2018).
Etard, O., Kegler, M., Braiman, C., Forte, A. E. & Reichenbach, T. Decoding of selective attention to continuous speech from the human auditory brainstem response. Neuroimage 200, 1–11 (2019).
Article PubMed Google Scholar
Gama, N., Peretz, I. & Lehmann, A. Recording the human brainstem frequency-following-response in the free-field. J. Neurosci. Methods 280, 47–53 (2016).
Article Google Scholar
Kraus, N., Hornickel, J., Strait, D. L., Slater, J. & Thompson, E. Engagement in community music classes sparks neuroplasticity and language development in children from disadvantaged backgrounds. Front. Psychol. 5, 1403 (2014).
PubMed PubMed Central Google Scholar
Wiegers, J. S., Bielefeld, E. C. & Whitelaw, G. M. Utility of the Vivosonic IntegrityTM auditory brainstem response system as a hearing screening device for difficult-to-test children. Int. J. Audiol. 54, 282–288 (2015).
Article PubMed Google Scholar
Yi, H. G., Xie, Z., Reetzke, R., Dimakis, A. G. & Chandrasekaran, B. Vowel decoding from single-trial speech-evoked electrophysiological responses: a feature-based machine learning approach. Brain Behav. 7, e00665 (2017).
Article PubMed PubMed Central Google Scholar
Xie, Z., Reetzke, R. & Chandrasekaran, B. Machine learning approaches to analyze speech-evoked neurophysiological responses. J. Speech Lang. Hear. Res. 62, 587–601 (2019).
Article PubMed PubMed Central Google Scholar
Mišić, B. & Sporns, O. From regions to connections and networks: New bridges between brain and behavior. Curr. Opin. Neurobiol. 40, 1–7 (2016).
Article PubMed PubMed Central CAS Google Scholar
Griffiths, T. D. & Warren, J. D. The planum temporale as a computational hub. Trends Neurosci. 25, 348–353 (2002).
Article CAS PubMed Google Scholar
Hickok, G. & Poeppel, D. The cortical organization of speech processing. Nat. Rev. Neurosci. 8, 393–402 (2007).
Article CAS PubMed Google Scholar
Rauschecker, J. & Scott, S. Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat. Neurosci. 12, 718–724 (2009).
Article CAS PubMed PubMed Central Google Scholar
Zatorre, R. J., Chen, J. & Penhune, V. When the brain plays music: auditory–motor interactions in music perception and production. Nat. Rev. Neurosci. 8, 547–558 (2007).
Article CAS PubMed Google Scholar
Feng, G., Yi, H. G. & Chandrasekaran, B. The role of the human auditory corticostriatal network in speech learning. Cereb. Cortex https://doi.org/10.1093/cercor/bhy289 (2018).
Article PubMed Central Google Scholar
Bidelman, G. M., Villafuerte, J. W., Moreno, S. & Alain, C. Age-related changes in the subcortical–cortical encoding and categorical perception of speech. Neurobiol. Aging 35, 2526–2540 (2014).
Article PubMed Google Scholar
Holmes, E., Purcell, D. W., Carlyon, R. P., Gockel, H. E. & Johnsrude, I. S. Attentional modulation of envelope-following responses at lower (93–109 Hz) but not higher (217–233 Hz) modulation rates. J. Assoc. Res. Otolaryngol. 19, 83–97 (2018).
Article PubMed Google Scholar
Hoormann, J., Falkenstein, M. & Hohnsbein, J. Effects of spatial attention on the brain stem frequency-following potential. Neuroreport 15, 1539–1542 (2004).
Article PubMed Google Scholar
Lehmann, A. & Schönwiesner, M. Selective attention modulates human auditory brainstem responses: relative contributions of frequency and spatial cues. PLoS One 9, 1–9 (2014).
Google Scholar
Varghese, L., Bharadwaj, H. M. & Shinn-Cunningham, B. G. Evidence against attentional state modulating scalp-recorded auditory brainstem steady-state responses. Brain Res. 1626, 146–164 (2015).
Article CAS PubMed PubMed Central Google Scholar
Mai, G., Schoof, T. & Howell, P. Modulation of phase-locked neural responses to speech during different arousal states is age-dependent. Neuroimage 189, 734–744 (2019).
Article PubMed Google Scholar
Hairston, W. D., Letowski, T. R. & McDowell, K. Task-related suppression of the brainstem frequency following response. PLoS One 8, 1–9 (2013).
Article CAS Google Scholar
Ulanovsky, N., Las, L. & Nelken, I. Processing of low-probability sounds by cortical neurons. Nat. Neurosci. 6, 391–398 (2003).
Article CAS PubMed Google Scholar
King, C., McGee, T., Rubel, E. W., Nicol, T. & Kraus, N. Acoustic features and acoustic change are represented by different central pathways. Hear. Res. 85, 45–52 (1995).
Article CAS PubMed Google Scholar
Parras, G. G. et al. Neurons along the auditory pathway exhibit a hierarchical organization of prediction error. Nat. Commun. 8, 2148 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Nozaradan, S. Exploring how musical rhythm entrains brain activity with electroencephalogram frequency-tagging. Philos. Trans. R. Soc. B 369, 20130393–20130393 (2014).
Article Google Scholar
Hornickel, J., Skoe, E. & Kraus, N. Subcortical laterality of speech encoding. Audiol. Neurootol. 14, 198–207 (2009).
Article PubMed Google Scholar
Bharadwaj, H. M. & Shinn-Cunningham, B. G. Rapid acquisition of auditory subcortical steady state responses using multichannel recordings. Clin. Neurophysiol. 125, 1878–1888 (2014).
Article PubMed PubMed Central Google Scholar
Aiken, S. J. & Picton, T. W. Envelope and spectral frequency-following responses to vowel sounds. Hear. Res. 245, 35–47 (2008).
Article PubMed Google Scholar
Lerud, K. D., Almonte, F. V., Kim, J. C. & Large, E. W. Mode-locking neurodynamics predict human auditory brainstem responses to musical intervals. Hear. Res. 308, 41–49 (2014).
Article PubMed Google Scholar
Luo, L., Wang, Q. & Li, L. Neural representations of concurrent sounds with overlapping spectra in rat inferior colliculus: comparisons between temporal-fine structure and envelope. Hear. Res. 353, 87–96 (2017).
Article PubMed Google Scholar
Joris, P. X., Schreiner, C. E. & Rees, A. Neural processing of amplitude-modulated sounds. Physiol. Rev. 84, 541–577 (2004).
Article CAS PubMed Google Scholar
Moller, H. J., Devins, G. M., Shen, J. & Shapiro, C. M. Sleepiness is not the inverse of alertness: evidence from four sleep disorder patient groups. Exp. Brain Res. 173, 258–266 (2006).
Article PubMed Google Scholar
Wang, X., Lu, T., Bendor, D. & Bartlett, E. Neural coding of temporal information in auditory thalamus and cortex. Neuroscience 154, 294–303 (2008).
Article CAS PubMed Google Scholar
Rouiller, E., de Ribaupierre, Y. & de Ribaupierre, F. Phase-locked responses to low frequency tones in the medial geniculate body. Hear. Res. 1, 213–226 (1979).
Article Google Scholar
Brugge, J. F. et al. Coding of repetitive transients by auditory cortex on Heschl’s gyrus. J. Neurophysiol. 102, 2358–2374 (2009).
Article PubMed PubMed Central Google Scholar
Nourski, K. V. et al. Coding of repetitive transients by auditory cortex on posterolateral superior temporal gyrus in humans: an intracranial electrophysiology study. J. Neurophysiol. 109, 1283–1295 (2013).
Article PubMed Google Scholar
Irvine, D. R. F. The auditory brainstem: a review of the structure and function of auditory brainstem processing mechanisms. In Progress in Sensory Physiology, Vol. 7 (ed Ottoson, D.) (Springer-Verlag, Berlin, 1986).
Steinschneider, M., Arezzo, J. & Vaughan, H. G. Phase-locked cortical responses to a human speech sound and low-frequency tones in the monkey. Brain Res. 198, 75–84 (1980).
Article CAS PubMed Google Scholar
Wallace, M. N., Shackleton, T. M. & Palmer, A. R. Phase-locked responses to pure tones in the primary auditory cortex. Hear. Res. 172, 160–171 (2002).
Article PubMed Google Scholar
Batra, R., Kuwada, S. & Maher, V. L. The frequency-following response to continuous tones in humans. Hear. Res. 21, 167–177 (1986).
Article CAS PubMed Google Scholar
Anumanchipalli, G. K., Chartier, J. & Chang, E. F. Speech synthesis from neural decoding of spoken sentences. Nature 568, 493–498 (2019).
Article ADS CAS PubMed Google Scholar
Ding, N. & Simon, J. Z. Emergence of neural encoding of auditory objects while listening to competing speakers. Proc. Natl Acad. Sci. USA 109, 11854–11859 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors gratefully acknowledge the support of their respective institutions.

Author information

Authors and Affiliations

Department of Psychology, Concordia University, 1455 Boulevard de Maisonneuve Ouest, Montréal, QC, H3G 1M8, Canada
Emily B. J. Coffey
International Laboratory for Brain, Music, and Sound Research (BRAMS), Montréal, QC, Canada
Emily B. J. Coffey & Robert J. Zatorre
Centre for Research on Brain, Language and Music (CRBLM), McGill University, 3640 de la Montagne, Montréal, QC, H3G 2A8, Canada
Emily B. J. Coffey & Robert J. Zatorre
Auditory Neuroscience Laboratory, Department of Communication Sciences, Northwestern University, 2240 Campus Dr., Evanston, IL, 60208, USA
Trent Nicol, Travis White-Schwoch, Jennifer Krizman & Nina Kraus
Communication Sciences and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, Forbes Tower, 3600 Atwood St, Pittsburgh, PA, 15260, USA
Bharath Chandrasekaran
Department of Speech, Language, and Hearing Sciences, The Connecticut Institute for the Brain and Cognitive Sciences, University of Connecticut, 2 Alethia Drive, Unit 1085, Storrs, CT, 06269, USA
Erika Skoe
Montreal Neurological Institute, McGill University, 3801 rue Université, Montréal, QC, H3A 2B4, Canada
Robert J. Zatorre
Department of Neurobiology, Northwestern University, 2205 Tech Dr., Evanston, IL, 60208, USA
Nina Kraus
Department of Otolaryngology, Northwestern University, 420 E Superior St., Chicago, IL, 6011, USA
Nina Kraus

Authors

Emily B. J. Coffey
View author publications
You can also search for this author in PubMed Google Scholar
Trent Nicol
View author publications
You can also search for this author in PubMed Google Scholar
Travis White-Schwoch
View author publications
You can also search for this author in PubMed Google Scholar
Bharath Chandrasekaran
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Krizman
View author publications
You can also search for this author in PubMed Google Scholar
Erika Skoe
View author publications
You can also search for this author in PubMed Google Scholar
Robert J. Zatorre
View author publications
You can also search for this author in PubMed Google Scholar
Nina Kraus
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.B.J.C., T.N., T.W.-S., B.C., J.K., E.S., R.J.Z., and N.K. contributed equally to this work.

Corresponding author

Correspondence to Emily B. J. Coffey.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Manuel Malmierca the other, anonymous reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Coffey, E.B.J., Nicol, T., White-Schwoch, T. et al. Evolving perspectives on the sources of the frequency-following response. Nat Commun 10, 5036 (2019). https://doi.org/10.1038/s41467-019-13003-w

Download citation

Received: 18 February 2019
Accepted: 14 October 2019
Published: 06 November 2019
DOI: https://doi.org/10.1038/s41467-019-13003-w

This article is cited by

Artifact removal by template subtraction enables recordings of the frequency following response in cochlear-implant users
- Robin Gransier
- Robert P. Carlyon
- Jan Wouters
Scientific Reports (2024)
Frequency-Following Responses in Sensorineural Hearing Loss: A Systematic Review
- Laura Jacxsens
- Lana Biot
- Marc J. W. Lammers
Journal of the Association for Research in Otolaryngology (2024)
Encoding of speech in convolutional layers and the brain stem based on language experience
- Gašper Beguš
- Alan Zhou
- T. Christina Zhao
Scientific Reports (2023)
Frequency Following Responses to Tone Glides: Effects of Age and Hearing Loss
- Michelle R. Molis
- William J. Bologna
- Curtis J. Billings
Journal of the Association for Research in Otolaryngology (2023)
Cortical representation of musical pitch in event-related potentials
- Taehyoung Kim
- Miyoung Chung
- Sung-Phil Kim
Biomedical Engineering Letters (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.