Vocal learning promotes patterned inhibitory connectivity

Miller, Mark N.; Cheung, Chung Yan J.; Brainard, Michael S.

doi:10.1038/s41467-017-01914-5

Download PDF

Article
Open access
Published: 13 December 2017

Vocal learning promotes patterned inhibitory connectivity

Mark N. Miller¹,
Chung Yan J. Cheung² &
Michael S. Brainard¹

Nature Communications volume 8, Article number: 2105 (2017) Cite this article

2666 Accesses
20 Citations
8 Altmetric
Metrics details

Subjects

Abstract

Skill learning is instantiated by changes to functional connectivity within premotor circuits, but whether the specificity of learning depends on structured changes to inhibitory circuitry remains unclear. We used slice electrophysiology to measure connectivity changes associated with song learning in the avian analog of primary motor cortex (robust nucleus of the arcopallium, RA) in Bengalese Finches. Before song learning, fast-spiking interneurons (FSIs) densely innervated glutamatergic projection neurons (PNs) with apparently random connectivity. After learning, there was a profound reduction in the overall strength and number of inhibitory connections, but this was accompanied by a more than two-fold enrichment in reciprocal FSI–PN connections. Moreover, in singing birds, we found that pharmacological manipulations of RA's inhibitory circuitry drove large shifts in learned vocal features, such as pitch and amplitude, without grossly disrupting the song. Our results indicate that skill learning establishes nonrandom inhibitory connectivity, and implicates this patterning in encoding specific features of learned movements.

Neural dynamics underlying birdsong practice and performance

Article 20 October 2021

Thalamus drives vocal onsets in the zebra finch courtship song

Article 22 March 2023

Intrinsic neuronal properties represent song and error in zebra finch vocal learning

Article Open access 19 February 2020

Introduction

Skilled motor behaviors including vocalizations are characterized by high degrees of precision, stereotypy, and adaptability, and are learned through practice that improves initially poor performance to a more expert level¹. The precision and reliability of skilled behaviors is ultimately driven by highly structured neural activity in central premotor circuits^2,3,4, and connectivity patterns within premotor circuitry are critical for generating appropriate behavior. Nonrandom patterns of connectivity among excitatory neurons are a feature of many systems, and plasticity of specific excitatory connections is considered central to the capacity of networks to produce appropriate output^5,6,7,8. However, whether learning shapes inhibitory connectivity to achieve comparable specificity^9,10 or instead promotes diffuse, nonspecific inhibition¹¹ is unclear. The development of temporally precise activation of a diffuse inhibitory network may be sufficient to structure premotor activity during vocal learning¹², yet formation of specific inhibitory connectivity in simulated networks is also sufficient to stably encode complex activity patterns¹³. This motivated us to ask how learning shapes inhibitory connectivity in songbirds, where robust vocal learning is subserved by a well-characterized premotor network.

We used a slice preparation of the avian vocal premotor nucleus RA (Fig. 1a) to examine changes to motor circuitry over the course of vocal learning. Glutamatergic RA projection neurons (PNs) that innervate vocal and respiratory motoneurons¹⁴ produce highly structured activity^2,3 that emerges during learning¹⁵ and ultimately controls the acoustic features of the learned song⁴. Previous studies that manipulated local RA circuitry reported minimal effects on production of learned song, and raised the possibility that moment-by-moment activity in RA projection neurons is largely determined by excitatory input from RA’s afferent regions, HVC and LMAN^16,17. However, within RA, GABAergic FSIs innervate PNs and can coordinate ongoing activity across PNs in acute slices¹⁸. Moreover, in other systems, FSIs can potently modulate ongoing neural activity patterns^{19,20,21,22,23}. These findings raise the possibility that FSIs might be critical modulators of RA activity that undergo plasticity during song acquisition and contribute to the encoding of learned song features. We therefore carried out experiments to test whether connectivity between FSIs and PNs within RA is shaped during vocal learning, and how inhibitory circuitry within RA contributes to the control of learned vocalizations.

Results

We first established that RA cell types in BFs can be differentiated by their spontaneous and evoked firing properties in slices¹⁸ (Fig. 1). PNs are spontaneously pacemaking, whereas FSIs are only sporadically active, and PNs produce adapting trains of action potentials in response to current injection, whereas FSIs produce high-frequency (>200 Hz) trains of short, narrow spikes with large afterhyperpolarizations and minimal spike-frequency adaptation (Fig. 1b–d, Supplementary Fig. 1). PNs and FSIs are thought to form both homotypic and heterotypic synapses and also receive excitatory inputs from HVC and LMAN^24,25,26. Because the prevalence, pattern, and strength of these different connections determines how RA transforms its input into activity patterns that drive structured vocal output during song, we next sought a general description of synaptic activity patterns within RA that might contribute to RA function.

Due to PN pacemaking (Fig. 1b), RA slices are highly spontaneously active even without imposing conditions that enhance excitability²⁴ (Fig. 1e). This activity state mirrors RA activity in vivo under anesthesia and during awake non-singing states^2,3,4. We exploited the correspondence between in vivo and in vitro activity states to monitor ongoing synaptic inputs to PNs in acute RA slices from adult (p90-180) male BFs. Spontaneous excitatory and inhibitory synaptic currents (sEPSCs and sIPSCs) on PNs were recorded in voltage clamp with cesium and QX-314 in the pipette solution to permit isolation of excitatory and inhibitory synaptic currents via manipulation of the holding potential. PNs received both spontaneous excitatory and inhibitory synaptic currents (Fig. 1e) under these conditions. Despite the strong pacemaking activity of RA PNs, we found that PNs receive much more spontaneous inhibition than excitation (Fig. 1e, f), consistent with previous reports¹⁸. Inhibitory input to PNs was 6.18-fold greater than excitatory input (SEM = 0.53-fold, p < 1⁻¹²), and we found no examples of PNs receiving more excitation than inhibition (Fig. 1f).

Since inhibition dominates synaptic activity within RA (Fig. 1f) and is important for the production of structured activity in other systems, we asked whether the inhibitory circuit in RA is a locus of plasticity during vocal learning by comparing RA inhibitory synapses between a group of untutored birds and a group of birds that had completed song learning. We used a computerized tutoring paradigm²⁷ to teach p40–45 BFs an identical tutor song, and maintained other age-matched birds as an untutored comparison group. The ages of tutored and untutored birds were not significantly different (untutored age = 66.1 ± 8.6 SD days, tutored age = 69.7 ± 9.2 SD days, p = 0.37, Supplementary Fig. 2). Tutored birds learned to copy the tutor song within 2–3 weeks, while untutored birds continued to produce unstructured juvenile vocalizations (Fig. 2a, Supplementary Fig. 3). To characterize changes to inhibitory circuitry during learning, we measured synaptic connections between PNs and FSIs using simultaneous whole-cell recordings in acute RA slices (Fig. 2b). Tutoring had a profound effect on RA inhibitory synapses: unitary FSI→PN IPSCs were on average 3.23-fold weaker in tutored than in untutored birds of the same age range (IPSC_untut = 21 ± 3.5 pA; IPSC_tut = 6.5 ± 0.5 pA; p < 0.01, Fig. 2c, d). Furthermore, the probability of FSI→PN connections (P _C) was reduced by nearly 50% after tutoring (untutored P _C = 0.76, 47 connections out of 62 tested; tutored P _C = 0.4, 24 connections out of 60 tested; binomial test p < 0.01, Fig. 2d). Overall, these data indicate that FSI→PN connections are a major feature of RA circuitry that undergo dramatic pruning and weakening during song learning.

Reduction of FSI→PN P _C during song learning could reflect indiscriminate loss of random FSI connections, or it could reflect selective rewiring that preserves or creates functionally important subcircuits within RA. To investigate these possibilities, we separately examined changes to the proportion of pairs in which there was a unidirectional connection from an FSI to PN (FSI→PN) and the proportion of pairs in which there were reciprocal connections between an FSI and PN (FSI ↔ PN). Random loss of FSI connections would produce a decrease in the proportion of both of these patterns. We indeed found that the proportion of unidirectional FSI→PN pairs decreased by 69% between untutored and tutored birds (binomial test p < 0.001 Fig. 3a, left). In contrast, the proportion of reciprocally connected FSI ↔ PN pairs remained constant over tutoring (Fig. 3a, right). This increase in the relative proportion of reciprocal FSI ↔ PN pairs despite an overall pruning of inhibitory connections suggests a nonrandom process that preferentially preserves or creates reciprocal connectivity between FSIs and PNs while eliminating the majority of unidirectional FSI→PN connections.

We further tested this possibility by investigating whether the frequency of different patterns of connectivity between FSIs and PNs exhibited any deviations from random in our tutored and untutored paired-recording data sets (Fig. 3b). We considered the four possible patterns of connections between a given FSI and PN: (1) no connection, (2) unidirectional connection from FSI to PN, (3) unidirectional connection from PN to FSI, or (4) reciprocal connection between FSI and PN. To test whether these connectivity patterns deviated from random, we created separate null models (see Methods) for tutored and untutored data sets that established how prevalent each connection pattern between FSIs and PNs would be if there were no specific patterning beyond that arising from the measured probabilities of unidirectional connections (P _C for FSI→PN and P _C for PN→FSI). In untutored birds, all connection patterns were observed at chance levels, consistent with an initially random patterning of connections between FSIs and PNs (Fig. 3b, left). However, in tutored birds, reciprocal FSI ↔ PN patterns were present at more than double the probability expected by chance (multinomial test p < 0.01, Fig. 3b, right), and the proportion of FSI ↔ PN connections among all connections in tutored birds was significantly greater than in untutored birds (multinomial test p < 0.005).

This indicates that learning promotes specific, nonrandom rewiring of RA circuitry by sparing or creating reciprocal FSI ↔ PN connections, even as overall inhibitory connectivity is reduced, resulting in a network that is enriched in reciprocal connections between FSIs and PNs.

In contrast to the strong effects of tutoring on FSI→PN connection probability, strength, and patterning (Figs. 2 and 3), tutoring had no detectable effect on PN excitatory connections within RA (Fig. 4). We encountered excitatory PN→FSI connections less frequently than FSI→PN connections, and tutoring did not alter PN→FSI EPSC amplitude (Fig. 4a, EPSC_untut = 17.1 ± 3.1pA, EPSC_tut = 20.1 ± 3.7pA, p = 0.63) or connection probability (Fig. 4b, untutored P _C = 0.17, tutored P _C = 0.19, binomial test p = 0.73). Unlike PNs, FSIs receive high levels of spontaneous excitatory input (Fig. 4a, overlaid gray traces), presumably from presynaptic PN pacemaking activity. To confirm PN→FSI connections in this background activity, we used the spike-triggered average EPSC evoked by 100–250 PN spikes (Fig. 4a, Supplementary Fig. 4) to evaluate synaptic connections and calculate PN→FSI P _C. In addition to PN→FSI connections, we also used spike-triggered average EPSCs to search for PN–PN synapses, because they are suggested to play important roles in song patterning and learning²⁵. However, we were unable to detect any PN–PN connections in 262 attempts, indicating that under our conditions these synapses are either very rare (P _C < 0.02, Fig. 4d), very weak (g _syn < 6.25pS, Fig. 4c), or both.

Our observations that FSI connectivity is a primary substrate for intrinsic interactions within RA (Figs. 1, 2, and 4) and that FSI→PN synapses are a major locus of plasticity during song learning (Figs. 2 and 4) led us to examine the functional contribution of FSIs to song production. Because we found that FSI connectivity gains specificity in parallel with the acquisition of highly structured learned vocal output (Fig. 3), we were specifically interested in the possibility that FSI activity is critical for producing learned acoustic features during singing. Patterned bursts across the RA PN population are thought to drive acoustic features including the fundamental frequency (FF) and amplitude of each vocalization^2,3,4, and prevailing models of song production hold that RA PN burst patterns are inherited from afferent HVC inputs to PNs^3,16,28. However, potential roles for RA inhibitory circuitry in shaping song production have not been examined, even as inhibitory circuits are known to critically shape patterned activity in other systems²⁹, including songbird HVC¹². We tested whether RA FSIs contribute to the control of learned song features by pharmacologically manipulating RA inhibitory circuitry in singing birds and measuring the effects on the acoustic structure of learned song.

To decrease RA inhibitory function, we used 1-Naphthyl acetyl spermine (NASPM) to block glutamatergic excitatory inputs to FSIs. In many systems, glutamatergic inputs to FSIs are primarily mediated by AMPARs that lack the gluA2 subunit^30,31, and NASPM specifically antagonizes these gluA2-lacking receptors³². NASPM can therefore reduce recruitment of FSIs and decrease inhibitory gain without completely blocking GABAergic transmission, which might produce pathological activity states. We confirmed in RA slices from adult BFs that bath application of 0.1 mM NASPM attenuated the overall level of spontaneous inhibition received by PNs by 42% (p = 0.0009, Fig. 5a, b), indicating that NASPM is an effective tool to reduce RA inhibitory function.

We next asked if RA inhibitory circuitry contributes to the production of learned song features by delivering NASPM (1–2 mM) into RA with reverse-microdialysis in vivo during singing^33,34. We measured NASPM’s effect on the FF and amplitude of song syllables, because these are features that are learned from the tutor song that are subsequently maintained within a narrow range for the lifetime of the bird^27,35,36. NASPM robustly increased both FF (4.4 ± 0.7% SEM, n = 12, p = 0.0014, Fig. 5d, e) and syllable amplitude (74.4 ± 18.9% SEM, n = 12, p = 0.0008, Fig. 5f, g) without altering overall syllable structure or otherwise disrupting the song (Fig. 5c), indicating that RA inhibitory circuitry can potently regulate the magnitude of specific learned syllable features during singing.

If inhibitory gain directly regulates learned vocal features and suppressing inhibitory function with NASPM increases syllable FF and amplitude, enhancing RA inhibitory function should produce the opposite effects. We tested this prediction by pharmacologically enhancing RA GABAergic function by reverse-microdialysis of the benzodiazepine midazolam, which allosterically increases the open probability of ligand-bound GABA_AR. In contrast to NASPM, midazolam (2.5 mM) reliably and significantly reduced both FF (−2.9 ± 0.7%, n = 5, p = 0.021, Fig. 5d, e) and syllable amplitude (−34.4 ± 4.5%, n = 5, p = 0.004 Fig. 5f, g). Like NASPM, however, midazolam dialysis specifically altered FF and amplitude without altering syllable structure (Fig. 5c), indicating that neither drug grossly disrupted the overall pattern of RA activity, but instead modulated RA activity in a specific fashion that shifted FF and amplitude.

Discussion

Our results show that RA inhibitory circuitry is a major locus of plasticity during song learning and that inhibition is a dominant component of RA circuitry that controls the production of learned vocal features. RA is a cortical analog that projects to brainstem premotor nuclei innervating vocal and respiratory musculature³⁷, and patterned activity in RA projection neurons is widely presumed to participate in the moment-by-moment control of learned features of song^2,3,4,38. Previous work has focused on the excitatory inputs to RA from HVC and LMAN as primary sites of synaptic plasticity responsible for establishing patterned activity within RA during song learning, and thereby encoding learned features of song^26,28,39. Despite the importance of inhibitory function within the upstream vocal motor region HVC for song learning and production^12,23 and indications that interneurons in RA are capable of potently controlling circuit activity in vitro¹⁸, potential roles for RA inhibitory circuits in song production and learning have received little attention. Here, we show that song learning is associated with dramatic pruning of local inhibitory circuitry within RA (Fig. 2), and that this pruning remodels initially random inhibitory connections to selectively preserve reciprocal projections between fast-spiking interneurons and projection neurons (Fig. 3). Moreover, we demonstrate that manipulating inhibitory function within RA of singing birds can drive bidirectional changes to learned acoustic features of song (Fig. 5). Together, these data indicate that encoding of learned song features depends on inhibitory function in RA that is sculpted during song learning, and is not simply inherited from patterns of HVC afferent activity.

Our finding that RA FSI→PN connectivity is initially widespread and nonspecific and then becomes enriched in specific reciprocal patterns during vocal learning (Fig. 3) indicates that acquisition of learned skills may rely in part on the formation of specific patterns of inhibitory connections in addition to plasticity of excitatory connections. This result raises the possibility that diffuse inhibitory connectivity found in neocortex¹¹ may reflect a substrate for learning that has not yet occurred, and that specific inhibitory connectivity patterns in other systems^9,10 may similarly be a product of inhibitory circuit plasticity during learning. Consistent with the possibility that inhibitory circuitry is shaped during learning to encode specific song features, we found that modulating inhibitory gain in RA of singing birds alters the precisely controlled values of FF and amplitude that are learned during song acquisition. This suggests that vocal features are encoded in premotor inhibitory networks during learning, and inhibitory activity in RA subsequently controls the production of these features.

The sculpting of inhibitory circuitry that we describe here likely interacts with other circuit modifications to encode the learned song. HVC inputs to RA PNs are also pruned during song learning²⁸ suggesting that vocal learning engages multiple processes to reduce shared synaptic inputs to RA PNs. Shared inputs including initially exuberant and powerful FSI→PN connectivity might prevent different groups of PNs from independently varying, thereby limiting the complexity and precision of vocal output. Hence, one function of diminished FSI→PN connections during learning might be to enable the formation of sparser and more independently varying PN ensembles required to control the acoustic features of learned song. Additionally, the experience-driven enrichment of reciprocal FSI ↔ PN connections that we observed might be particularly important for generating RA’s characteristically precise premotor activity patterns^2,3,4, which gradually emerge over song learning¹⁵ and are thought to be critical for the moment-by-moment control of syllable FF and amplitude.

More generally, longstanding models attribute control of acoustic features such as FF and syllable amplitude to RA activity on the indirect basis of anatomy³⁷, RA activity patterns^2,3,4, and the disruptive effects of electrically stimulating RA³⁸. Here we provide a causal demonstration that bidirectional manipulation of inhibition in RA produces corresponding bidirectional changes in FF and amplitude. These results further establish RA as a primary source of control signals for learned acoustic features of song, and additionally provide insight into the nature of those control signals: they support a model in which increased activity across the population of PNs (associated with a decrease in inhibitory tone) drives an increase in vocal and respiratory muscle tensions, and corresponding increase in FF and amplitude, while a decrease in PN firing (associated with an increase in inhibitory tone) results in a decrease in FF and amplitude. Together with our finding that song learning is associated with profound remodeling and increased specificity of RA inhibitory connections, these results from singing birds suggest that the specific pattern and strength of inhibitory connections within RA that are shaped during song acquisition determines the precise values of FF and amplitude produced during learned song.

Methods

Animals

Data from 39 male Bengalese Finches are included in this study. All birds were from our breeding colony at UCSF, and experiments were conducted in accordance with NIH and UCSF policies governing animal use and welfare.

Song tutoring

We adapted a computerized tutoring protocol²⁷ to provide finches with a common learning environment, equal exposure to tutor stimuli, and to explicitly constrain the period over which learning could occur. Clutches of Bengalese Finches from our breeding colony were raised from eggs by foster females (2–3 per nest) in sound proof chambers (Acoustic Systems) to prevent exposure to male songs or other tutor stimuli throughout early development. At 35 days post hatch (p35), we transferred each male bird to individual housing within individual sound proof chambers on a 14/10 h light/dark cycle. At p40–p45, we initiated tutoring by activating an operant perch in each cage that triggered tutor song playback through a speaker in the chamber (75 dB). Untutored birds were housed in identical cages, except that their perch triggers were inactive. All tutored birds were tutored with an identical stimulus that we constructed from seven acoustically distinct syllables and two intro notes chosen from a library of recorded Bengalese Finch vocalizations, separated by inter-syllable gaps drawn from the distribution of gaps produced by finches in our colony. Each perch-triggered playback consisted of three identical renditions of the tutor stimulus. We limited playbacks to 3 sets of 10 per day because we found that ad-lib playbacks prevented good learning, as previously reported²⁷. Vocalizations were detected and recorded with custom LabView software. Once birds learned to produce a copy of the tutor stimulus, they were taken from the tutoring apparatus for slice preparation. We usually prepared slices from tutored and untutored birds on consecutive days to achieve age-matching across conditions. Tutored birds that failed to copy the tutor song, retained unstructured juvenile vocalizations, or produced stereotyped song that was different from the tutor song were not included in electrophysiology experiments.

Electrophysiology

RA slices were prepared as previously described²⁶. The birds were deeply anesthetized with 4% isoflurane and decapitated in ice-cold oxygenated ACSF containing 125 mM Choline-Cl, 2.5 mM KCl, 2 mM MgCl₂, 1.25 mM NaHPO₄, 26 mM NaHCO₃, and 1 mM CaCl₂, and adjusted to 350 mOsm with dextrose. 250um coronal or sagittal RA slices were cut (Leica VT1000S) from each hemisphere under cold, oxygenated ACSF and transferred to an interface holding chamber with 38 °C recording ACSF containing 125 mM Choline-Cl, 2.5 mM KCl, 2 mM MgCl₂, 1.25 mM NaHPO₄, 25 mM NaHCO₃, and 2 mM CaCl₂, and adjusted to 350 mOsm with dextrose. After 30 min, slices in the holding chamber were relaxed to room temperature. During recording, bath temperature was maintained at 38 °C with a feedback-controlled inline heater (Warner Instruments). The slices containing RA were submerged in ACSF on the stage of an Olympus BX-51WI microscope and RA was identified with a ×4 or ×10 objective. Neurons in RA were visualized with DIC optics using a ×40 water-immersion objective. Patch pipettes were pulled on a Sutter P-97 puller to achieve tip impedances of 4–10 MΩ. To record spontaneous E/IPSCs, pipette solution contained 20 mM KCl, 100 mM Cs-MethylSulphonate, 10 mM K-HEPES, 0.1% biocytin, 4 mM Mg-ATP, 0.3 mM Na-GTP, 10 mM Na-Phosphocreatine, and 3 mM QX-314, with pH 7.35, and was adjusted to 315 mOsm with sucrose. For paired recordings that required intact action potential generation, pipette solution contained 20 mM KCl, 100mM K-gluconate, 10mM K-HEPES, 0.1% biocytin, 4 mM Mg-ATP, 0.3 mM Na-GTP, and 10 mM Na-phosphocreatine, with pH 7.35, and was adjusted to 315 mOsm with sucrose. Whole-cell recordings from PNs and FSIs were obtained under visual guidance with a ×40 water-immersion objective, current or voltage records were amplified by Multiclamp 700B (Molecular Devices) or Axopatch 1 C/1D (Axon Instruments) amplifiers, digitized at 10 kHz, and recorded with custom IGOR Pro software (Wavemetrics). Pipette capacitance and series resistance were compensated online and series resistance was monitored at 2 min intervals. Recordings with series resistance >20 MΩ or monotonic 25% change in input resistance were discarded. PNs were distinguished from FS interneurons in loose-patch mode on the basis of PN spontaneous pacemaking activity, by post hoc inspection of biocytin fills, by FSIs lower input resistance, and by differences in action potential shape AHP and width (Fig. 1c, d) when using K-Gluconate pipette solution. Action potential amplitides reported in Fig. 1 are relative to 0 mV. Spontaneous IPSCs were recorded in voltage clamp as outward currents at the measured mixed-cation reversal potential determined by reversing sEPSCs, and spontaneous EPSCs were recorded as inward currents at E _Cl determined by observing the reversal of sIPSCs. Reversal potentials were always within 5 mV of the calculated reversal potential when corrected for the liquid junction potential measured during each experiment. Paired recordings were made from 2–4 simultaneously recorded neurons. We tested for synaptic connections between neurons by driving 100–250 action potentials in each neuron with 1–2 ms 0.3–1 nA pulses at 50 Hz with 10 s duty cycle, while monitoring synaptic responses in other simultaneously recorded neurons in voltage clamp at −70 mV. Both inhibitory and excitatory connections were apparent in averaged sweeps, but we also calculated the spike-triggered average offline for all potential synaptic partners to increase sensitivity to very weak connections (Supplementary Fig. 4). We never detected a connection with the spike-triggered average that was not also detected in the averaged sweeps. To maximize our sample of FSIs, which are a minority of neurons in RA, we intentionally targeted small neurons for recording until we found an FSI. Our sample of FSIs, therefore, had smaller somata than our sample of PNs on average. However, we also encountered FSIs with somata as large as PNs, consistent with Spiro et al.¹⁸, which found overlapping distributions of PN and FSI soma size.

Connection motif analysis

We built separate models for untutored and tutored data sets by creating networks with random connectivity based on the unidirectional (FSI→PN, PN→FSI, and PN → PN) connection probabilities that we measured with paired recordings in each condition. To calculate the likelihood of observing reciprocal FSI–PN connections at the rate present in our data sets, we simulated tutored and untutored networks constructed with each data set’s unidirectional connection probabilities and sample size 100,000 times. Conceptually, this approach extends a binomial test to an arbitrary number of potential outcomes (in our case, the four possible connection motifs). To test whether motifs in tutored birds were significantly more or less common than in untutored birds, we created models with the frequencies of each connection pattern (FSI→PN, PN→FSI, FSI ↔ PN) present in untutored birds, and calculated the likelihood of observing the frequencies present in tutored birds. We validated our models with Matlab’s mnpdf() and mnrnd() functions.

In vivo reverse-microdialysis

We pharmacologically manipulated RA in vivo in freely behaving and singing birds as previously described^33,34. Adult (>100 days post hatch) male bengalese finches were implanted bilaterally with microdialysis probes (CMA) targeted to RA. Accurate placement in RA was confirmed during surgery by extracellular recording of RA’s characteristic spontaneous activity. After recovery from surgery, the birds were housed individually within sound attenuating chambers (Acoustic Systems) on a 14/10 h light/dark schedule with free access to food, grit, and water, and vocalizations were recorded with a microphone fixed to the cage ceiling. PBS was continuously delivered to RA at a rate of 0.1 μl/min via a fluid commutator connected to a syringe pump outside the bird’s isolation chamber. To manipulate inhibitory function within RA, we switched from PBS to either NASPM (Tocris) or midazolam (Sigma). Because the switch occurred outside the isolation chamber, the birds remained undisturbed and continued to behave and sing normally through the transition from PBS to drug. A total of >100 undirected song bouts were collected during both PBS and subsequent drug dialysis for each experiment.

Acoustic feature analysis

The songs were recorded at 32 kHz and 16-bit depth with custom LabView software^33,34,36. Offline, the syllables were extracted from audio files based on amplitude threshold crossings of the rectified audio waveform smoothed with a 2 ms moving window and analyzed using custom Matlab software. We focused on syllables with prominent harmonic stacks and minimal frequency modulation for FF and amplitude quantification. FF was calculated as the peak in the band-limited power spectrum of a 2–5 ms window within each syllable during which FF was stable. We measured syllable amplitude by detecting the peak of the smoothed (2 ms moving window) rectified audio waveform.

Song similarity analysis

To quantify the similarity between tutored or untutored songs and the tutor song (Supplementary Fig. 3), we applied a method for automatically classifying vocalizations in an unbiased and unsupervised fashion⁴⁰ based on their acoustic content. Briefly, this method extracts syllables from a test song (e.g., from a tutored or untutored bird) and assembles a statistical model based on the syllables’ acoustic content. Through comparison to similarly constructed statistical models of the reference song (e.g., tutor song), these models are then used to estimate both the amount of information present in the reference that is absent from the test song (unlearned content) and the amount of information present in the test song that is absent from the reference song (improvised content).

Data availability

Data sets generated and analyzed in this study are available from the corresponding author upon request. Code used for analysis is also available upon request.

References

Doupe, A. J. & Kuhl, P. K. Birdsong and human speech: common themes and mechanisms. Annu. Rev. Neurosci. 22, 567–631 (1999).
Article CAS PubMed Google Scholar
Yu, A. C. & Margoliash, D. Temporal hierarchical control of singing in birds. Science 80, 1871–1875 (1996).
Article ADS Google Scholar
Leonardo, A. & Fee, M. S. Ensemble coding of vocal control in birdsong. J. Neurosci. 25, 652–661 (2005).
Article CAS PubMed Google Scholar
Sober, S. J., Wohlgemuth, M. J. & Brainard, M. S. Central contributions to acoustic variation in birdsong. J. Neurosci. 28, 10370–10379 (2008).
Article CAS PubMed PubMed Central Google Scholar
Song, S., Sjöström, P. J., Reigl, M., Nelson, S. & Chklovskii, D. B. Highly nonrandom features of synaptic connectivity in local cortical circuits. PLoS Biol. 3, 0507–0519 (2005).
CAS Google Scholar
Yoshimura, Y., Dantzker, J. L. M. & Callaway, E. M. Excitatory cortical neurons form fine-scale functional networks. Nature 433, 868–873 (2005).
Article ADS CAS PubMed Google Scholar
Morishima, M. Recurrent connection patterns of corticostriatal pyramidal cells in frontal cortex. J. Neurosci. 26, 4394–4405 (2006).
Article CAS PubMed Google Scholar
Jiang, X. et al. Principles of connectivity among morphologically defined cell types in adult neocortex. Science 350, aac9462–aac9462 (2015).
Article PubMed PubMed Central Google Scholar
Yoshimura, Y. & Callaway, E. M. Fine-scale specificity of cortical networks depends on inhibitory cell type and connectivity. Nat. Neurosci. 8, 1552–1559 (2005).
Article CAS PubMed Google Scholar
Otsuka, T. & Kawaguchi, Y. Cortical inhibitory cell types differentially form intralaminar and interlaminar subnetworks with excitatory neurons. J. Neurosci. 29, 10533–10540 (2009).
Article CAS PubMed Google Scholar
Karnani, M. M., Agetsuma, M. & Yuste, R. A blanket of inhibition: functional inferences from dense inhibitory connectivity. Curr. Opin. Neurobiol. 26, 96–102 (2014).
Article CAS PubMed Google Scholar
Vallentin, D., Kosche, G., Lipkind, D. & Long, M. A. Inhibition protects acquired song segments during vocal learning in zebra finches. Science 351, 267–271 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Vogels, T. P., Sprekeler, H., Zenke, F., Clopath, C. & Gerstner, W. Inhibitory plasticity balances excitation and inhibition in sensory pathways and memory networks. Science 334, 1569–1573 (2011).
Article ADS CAS PubMed Google Scholar
Wild, J. M., Williams, M. N. & Suthers, R. A. Neural pathways for bilateral vocal control in songbirds. J. Comp. Neurol. 423, 413–426 (2000).
Article CAS PubMed Google Scholar
Ölveczky, B. P., Otchy, T. M., Goldberg, J. H., Aronov, D. & Fee, M. S. Changes in the neural control of a complex motor sequence during learning. J. Neurophysiol. 106, 386–97 (2011).
Article PubMed PubMed Central Google Scholar
Long, M. A. & Fee, M. S. Using temperature to analyze temporal dynamics in the songbird motor pathway. Nature 456, 189–194 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, Y. S., Wittenbach, J. D., Jin, D. Z. & Kozhevnikov, A. A. Temperature manipulation in songbird brain implicated the premotor nucleus HVC in birdsong syntax. J. Neurosci. 47, 2600–2611 (2017).
Article Google Scholar
Spiro, J. E., Dalva, M. B. & Mooney, R. Long-range inhibition within the zebra finch song nucleus RA can coordinate the firing of multiple projection neurons. J. Neurophysiol. 81, 3007–3020 (1999).
CAS PubMed Google Scholar
Pastoll, H., Solanka, L., van Rossum, M. C. W. & Nolan, M. F. Feedback inhibition enables Theta-Nested gamma oscillations and grid firing fields. Neuron 77, 141–154 (2013).
Article CAS PubMed Google Scholar
Couey, J. J. et al. Recurrent inhibitory circuitry as a mechanism for grid formation. Nat. Neurosci. 16, 318–24 (2013).
Article CAS PubMed Google Scholar
Cardin, J. A. et al. Driving fast-spiking cells induces gamma rhythm and controls sensory responses. Nature 459, 663–667 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Sohal, V. S., Zhang, F., Yizhar, O. & Deisseroth, K. Parvalbumin neurons and gamma rhythms enhance cortical circuit performance. Nature 459, 698–702 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Kosche, X. G., Vallentin, D. & Long, M. A. Interplay of inhibition and excitation shapes a premotor neural sequence. J. Neurosci. 35, 1217–1227 (2015).
Article PubMed PubMed Central Google Scholar
Mooney, R. Synaptic basis for developmental plasticity in a birdsong nucleus. J. Neurosci. 12, 2464–2477 (1992).
CAS PubMed Google Scholar
Sizemore, M. & Perkel, D. J. Premotor synaptic plasticity limited to the critical period for song learning. Proc. Natl Acad. Sci. USA 108, 17492–17497, (2011)
Mehaffey, W. H. & Doupe, A. J. Naturalistic burst stimulation drives opposing patterns of heterosynaptic plasticity at two inputs to a songbird motor cortex analogue. Nat. Neurosci. 18, 1–10 (2015).
Article Google Scholar
Tchernichovski, O., Mitra, P. P., Lints, T. & Nottebohm, F. Dynamics of the vocal imitation process: how a zebra finch learns its song. Science 291, 2564–2569 (2001).
Article ADS CAS PubMed Google Scholar
Garst-Orozco, J., Babadi, B. & Ölveczky, B. P. A neural circuit mechanism for regulating motor variability during skill learning. eLife 3, e03697 (2014).
Isaacson, J. S. & Scanziani, M. How inhibition shapes cortical activity. Neuron 72, 231–243 (2011).
Article CAS PubMed PubMed Central Google Scholar
Gittis, A. H. et al. Selective inhibition of striatal fast-spiking interneurons causes dyskinesias. J. Neurosci. 31, 15727–15731 (2011).
Article CAS PubMed PubMed Central Google Scholar
McBain, C. J. & Fisahn, A. Interneurons unbound. Nat. Rev. Neurosci. 2, 11–23 (2001).
Article CAS PubMed Google Scholar
Isaac, J. R., Ashby, M. C. & McBain, C. J. The role of the GluR2 subunit in AMPA receptor function and plasticity. Neuron 54, 859–871 (2007).
Article CAS PubMed Google Scholar
Warren, T. L., Tumer, E. C., Charlesworth, J. D. & Brainard, M. S. Mechanisms and time course of vocal learning and consolidation in the adult songbird. J. Neurophysiol. 106, 1806–1821 (2011).
Article CAS PubMed PubMed Central Google Scholar
Stepanek, L. & Doupe, A. J. Activity in a cortical-basal ganglia circuit for song is required for social context-dependant vocal variability. J. Neurophysiol. 104, 2474–2486 (2010).
Article PubMed PubMed Central Google Scholar
Sober, S. J. & Brainard, M. S. Adult birdsong is actively maintained by error correction. Nat. Neurosci. 12, 927–931 (2009).
Article CAS PubMed PubMed Central Google Scholar
Tumer, E. C. & Brainard, M. S. Performance variability enables adaptive plasticity of ‘crystallized’ adult birdsong. Nature 450, 1240–1245 (2007).
Article ADS CAS PubMed Google Scholar
Wild, J. M. Neural pathways for the control of birdsong production. J. Neurobiol. 33, 653–670 (1997).
Article CAS PubMed Google Scholar
Vu, E. T., Mazurek, M. E. & Kuo, Y. C. Identification of a forebrain motor programming network for the learned song of zebra finches. J. Neurosci. 14, 6924–6934 (1994).
CAS PubMed Google Scholar
Livingston, F. S., White, S. A. & Mooney, R. Slow NMDA-EPSCs at synapses critical for song development are not required for song learning in zebra finches. Nat. Neurosci. 3, 482–488 (2000).
Article CAS PubMed Google Scholar
Mets, D. G. & Brainard, M. S. An automated approach to quantitation of vocalizations and vocal learning. BioRxiv Preprint at https://doi.org/10.1101/166124 (2017).
Maffei, A., Nataraj, K., Nelson, S. B. & Turrigiano, G. G. Potentiation of cortical inhibition by visual deprivation. Nature 443, 81–84 (2006).
Article ADS CAS PubMed Google Scholar

Download references

Acknowledgements

We thank Andrea Hasenstaub and Michael Stryker for commenting on drafts of the manuscript and members of the Brainard lab for providing input at every stage of the project. This work was supported by the Howard Hughes Medical Institute and NIH grants R01MH055987 and R01DC006636 (msb), by an NIH F32 NRSA award (mnm), and by an NSF predoctoral award (cjc).

Author information

Authors and Affiliations

Howard Hughes Medical Institute and Departments of Physiology and Psychiatry, University of California-San Francisco, San Francisco, CA, 94158, USA
Mark N. Miller & Michael S. Brainard
Neuroscience Graduate, Program, University of California-San Francisco, San Francisco, CA, 94158, USA
Chung Yan J. Cheung

Authors

Mark N. Miller
View author publications
You can also search for this author in PubMed Google Scholar
Chung Yan J. Cheung
View author publications
You can also search for this author in PubMed Google Scholar
Michael S. Brainard
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.N.M. and M.S.B. conceived the project, M.N.M. designed and performed the experiments, C.Y.J.C. contributed to in vivo pharmacology experiments, M.N.M. analyzed the data, M.N.M. wrote the manuscript, and M.N.M. and M.S.B. edited the manuscript.

Corresponding author

Correspondence to Mark N. Miller.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Miller, M.N., Cheung, C.Y.J. & Brainard, M.S. Vocal learning promotes patterned inhibitory connectivity. Nat Commun 8, 2105 (2017). https://doi.org/10.1038/s41467-017-01914-5

Download citation

Received: 25 August 2017
Accepted: 25 October 2017
Published: 13 December 2017
DOI: https://doi.org/10.1038/s41467-017-01914-5

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.