Basolateral amygdala rapid glutamate release encodes an outcome-specific representation vital for reward-predictive cues to selectively invigorate reward-seeking actions

Malvaez, Melissa; Greenfield, Venuz Y.; Wang, Alice S.; Yorita, Allison M.; Feng, Lili; Linker, Kay E.; Monbouquette, Harold G.; Wassum, Kate M.

doi:10.1038/srep12511

Download PDF

Article
Open access
Published: 27 July 2015

Basolateral amygdala rapid glutamate release encodes an outcome-specific representation vital for reward-predictive cues to selectively invigorate reward-seeking actions

Melissa Malvaez¹,
Venuz Y. Greenfield¹,
Alice S. Wang¹,
Allison M. Yorita²,
Lili Feng²,
Kay E. Linker¹,
Harold G. Monbouquette² &
…
Kate M. Wassum^1,3

Scientific Reports volume 5, Article number: 12511 (2015) Cite this article

3553 Accesses
35 Citations
3 Altmetric
Metrics details

Subjects

A Corrigendum to this article was published on 15 February 2016

This article has been updated

Abstract

Environmental stimuli have the ability to generate specific representations of the rewards they predict and in so doing alter the selection and performance of reward-seeking actions. The basolateral amygdala participates in this process, but precisely how is unknown. To rectify this, we monitored, in near-real time, basolateral amygdala glutamate concentration changes during a test of the ability of reward-predictive cues to influence reward-seeking actions (Pavlovian-instrumental transfer). Glutamate concentration was found to be transiently elevated around instrumental reward seeking. During the Pavlovian-instrumental transfer test these glutamate transients were time-locked to and correlated with only those actions invigorated by outcome-specific motivational information provided by the reward-predictive stimulus (i.e., actions earning the same specific outcome as predicted by the presented CS). In addition, basolateral amygdala AMPA, but not NMDA glutamate receptor inactivation abolished the selective excitatory influence of reward-predictive cues over reward seeking. These data support the hypothesis that transient glutamate release in the BLA can encode the outcome-specific motivational information provided by reward-predictive stimuli

A quantitative reward prediction error signal in the ventral pallidum

Article 10 August 2020

David J. Ottenheimer, Bilal A. Bari, … Patricia H. Janak

Environmental context-dependent activation of dopamine neurons via putative amygdala-nigra pathway in macaques

Article Open access 21 April 2023

Kazutaka Maeda, Ken-ichi Inoue, … Okihide Hikosaka

Neural substrates of parallel devaluation-sensitive and devaluation-insensitive Pavlovian learning in humans

Article Open access 05 December 2023

Eva R. Pool, Wolfgang M. Pauli, … John P. O’Doherty

Introduction

Adaptive reward seeking is critical to survival and is disrupted in a variety of neuropsychiatric disorders, including substance abuse, overeating and depression. The basolateral amygdala (BLA) has been implicated in these disorders^1,2,3,4 and is involved in reward processing^5,6, but much is unknown about its precise contribution. The BLA receives dense cortical and thalamic glutamatergic input^7,8,9. Based on the results of BLA lesions^10,11,12,13, these excitatory chemical messages may be thought to convey a sustained emotional valence signal in response to reward-predictive stimuli. However, it is also possible that BLA signaling represents the motivational value of specific reward expectations generated by such cues. Here we investigate the latter.

Assessment of this hypothesis requires a method to selectively measure BLA glutamate signaling with fast temporal resolution in order to distinguish chemical messages related to individual reward-seeking behaviors. Microdialysis allows for selective measurement of extracellular neurochemical concentration changes, but the typical 10–20 min (or even rapid 14–20 s^14,15,16) sampling window does not provide the appropriate temporal resolution and the spatial resolution is inadequate to record from BLA microenvironments. Single-unit electrophysiological recordings provide the required temporal and spatial resolution, but are non-selective and biased to record mostly from output neurons, precluding evaluation of glutamatergic input or local processing of such chemical messages within the BLA. Biosensor technologies^17,18,19,20 provide a solution. Using an electroenzymatic approach, this technique allows online, near-real time, sensitive and selective measurement of extracellular glutamate concentration changes that result from neuronal release^{21,22,23,24,25}. Recent biosensor data support the possibility that BLA glutamate release may convey information important for reward seeking; transient fluctuations in extracellular BLA glutamate concentration were detected immediately preceding reward-seeking actions²¹, but the precise information encoded by these glutamate transients is unknown.

One major source of reward-seeking motivation is the cognitive expectation of specific available rewards, information that is often provided by environmental stimuli. Indeed, an environmental reward-predictive stimulus will selectively invigorate the performance of those actions that earn the same specific reward associated with the stimulus^26,27,28,29. This capacity requires the BLA^11,13,30 and is thought to rely upon retrieval of a cognitive representation of the specific shared reward (i.e., outcome) encoded in both the Pavlovian stimulus-outcome and instrumental action-outcome association^31,32. BLA neurons can fire in response to reward-predictive cues^{33,34,35,36,37} and in anticipation of reward³⁸, but the chemical message driving this neuronal activity and whether it encodes the motivational value of specific reward representations has yet to be clarified.

Therefore, we evaluated the role of BLA glutamate signaling in outcome-specific Pavlovian-instrumental transfer (PIT). In this task, rats are trained to associate two auditory Pavlovian stimuli (CS) with two distinct food rewards and then to respond on two independent levers to earn the same rewards. In the critical PIT test both levers are available and the CS will selectively enhance the response with which it shares a rewarding outcome^26,27,28,32. Because the CSs are never directly associated with the instrumental actions, this test assesses the rats’ ability to mentally represent each specific reward and to use this outcome-specific information to guide and motivate reward seeking. We reasoned that if BLA glutamate release is related to the motivational influence of cue-induced, outcome-specific representations, then blocking ionotropic glutamate receptors (iGluRs) should disrupt the selective excitatory influence of the cue on reward seeking and, under normal conditions, glutamate release should precede and correlate with only those actions that are selectively invigorated by the CS.

Results

Experiment 1

In Experiment 1 (see Fig. 1A) we pharmacologically blocked either BLA AMPA (0, 1 or 3 μg/side of NBQX) or NMDA (0, 1 or 3 μg/side of AP5) glutamate receptors prior to the outcome-specific PIT test in order to assess their respective contributions to the selective invigorating influence of reward-predictive cues. During the PIT test both levers were simultaneously present, but pressing was never rewarded. Each CS was presented 4 times in alternating order, with intervening control CS-free periods (pre-CS). In this test the CS presentation provides the cognitive information (e.g., specific reward representation) that guides action selection and performance in the novel choice scenario.

BLA AMPA and NMDA glutamate receptor involvement in the selective invigorating influence of reward-predictive stimuli over reward-seeking actions

As is clear from Fig. 1C,D, we detected a differential effect of AMPA and NMDA iGluR receptor blockade on the selective-invigorating influence of reward-predictive stimuli over reward seeking during the outcome-specific PIT test. For the AMPA group there was a significant main effect of both CS (F_2,14 = 12.06, p < 0.001) and NBQX dose (F_2,14 = 4.92, p =0.02) on lever pressing, as well as a significant interaction between these factors (F_4,28 = 5.26, p =0.003). Following a control vehicle infusion CS presentation selectively elevated press rate on the lever that, in training, earned the same reward as predicted by the CS (CS-Same) relative to both pre-CS press rate (p < 0.001) and pressing during the CS on the alternate available lever (CS-Different; p < 0.001). This selective elevation on the CS-Same action was blocked by BLA AMPA receptor inactivation (p >0.05, for both doses) and, indeed, CS-Same responding was lower following intra-BLA NBQX infusion than vehicle control (p < 0.001, for both doses). Intra-BLA NBQX did not significantly alter pre-CS baseline or CS-Different response rates (p >0.05), suggesting a specific effect of AMPA receptor blockade on the selective invigorating influence of cues over action performance. The low response rate during the pre-CS period may have, however, been close to the floor for detecting a significant decrease in responding. To ensure AMPA receptor blockade did not alter baseline responding we isolated the first PIT trial for which the pre-CS response rate was higher (~5 presses/min) and in this case found identical results to the trial-averaged data; AMPA receptor blockade selectively attenuated CS-Same responding and did not significantly impact pre-CS response rate (see Supplemental Fig. 1).

Blockade of BLA NMDA receptors was without effect on outcome-specific PIT (Fig. 1D). For the NMDA group there was a main effect of CS (F_2,16 = 18.68, p < 0.001), with neither an effect of AP5 dose (F_2,16 = 0.46, p =0.64), nor AP5 dose x CS interaction (F_4,32 = 0.04, p =0.99). Under each drug dose treatment the CS elevated responding on the CS-Same action relative to both the pre-CS period and to the CS-Different action (p < 0.05, in all cases). After both intra-BLA AMPA and NMDA receptor blockade rats were able to show a CS-induced elevation in Pavlovian conditioned food-port approach responding (see Supplemental Results and Supplemental Fig. 2).

Experiment 2

The results of Experiment 1 suggest that BLA AMPA iGluR activation is necessary for reward-paired cues to selectively invigorate the performance of a specific reward-seeking action. We next used electroenzymatic biosensors to make sub-second measurements of extracellular glutamate concentration changes to interrogate the profile of BLA glutamate release during instrumental conditioning and PIT (see Fig. 2B). We reasoned that if BLA glutamate signaling is related to the motivational value of reward-specific representations, then such signaling might correlate with the performance of reward-seeking actions during instrumental conditioning. More importantly, during the critical PIT test BLA glutamate signaling should correlate with the performance of only those actions that are selectively motivated by the CS-generated reward representation (i.e., CS-Same pressing).

BLA glutamate release during instrumental conditioning

As can be seen in the representative example presented in Fig. 3A, during the instrumental reward-seeking test there were rapid, short-duration increases in glutamate concentration (i.e., glutamate transients) that were increased in both frequency (Fig. 3B; t₇ = 2.34, p =0.05) and amplitude (Fig. 3C; t₇ = 2.85, p =0.02) during instrumental performance, relative to the pre-test baseline period. See Supplemental Figure 3 for further details on transient amplitude. Interestingly, the frequency of these glutamate release events positively correlated (r₁₆ = 0.58, p =0.02; Fig. 3E) with lever-press rate (see Fig. 3D), such that higher press rates were associated with more frequent BLA glutamate transients.

As can also be seen in Fig. 3A, the frequency of glutamate transients fluctuated throughout the instrumental conditioning test and appeared to share a tight temporal relationship to lever-press actions, especially those actions initiating bouts of reward seeking (see representative trial-averaged glutamate concentration v. time trace in Fig. 3F). To specifically evaluate the relationship between glutamate release events and instrumental reward seeking we calculated the likelihood of a glutamate transient in the time immediately surrounding lever presses. Because rats tended to organize their lever pressing into clusters we divided our analysis for those presses that initiated reward seeking (i.e., ‘initiating presses’) excluding presses that occurred within a pressing bout and compared this to all lever presses (including both initiating and intra-bout presses). Initiating presses were defined as the first press after collection of an earned reward or the first press after a >6 s pause in pressing. During the instrumental test rats showed on average 34.25 (sem = 5.53) total reward-seeking bouts per session, with 23.68% (1.77) of total presses being considered ‘initiating presses’. Reward-seeking bouts had an average duration of 8.20 s (1.33) and contained on average 5.84 (1.03) presses. The average reward receipt to next initiating press latency was 23.10 s (7.47). To isolate the initiation of instrumental reward seeking and avoid the presence of contaminating events (e.g., reward receipt, termination of previous bout, etc.) in the reward-seeking initiation analysis window we calculated the change in likelihood of a glutamate transient by counting the glutamate transients in 10, 1-s bins evenly distributed around presses. A longer analysis window are presented in the Supplemental Results and Supplemental Figures 4.

The raw glutamate transient counts around initiating presses for each subject are displayed in the raster plot shown in Fig. 3G. Statistical analysis of the data collapsed over 1-s intervals (to match biosensor response time) and averaged across subjects (Fig. 3G- bottom) found a marginally insignificant effect of Time surrounding the press (F_9,63 = 1.79, p =0.08), a significant effect of Type of press (Initiating press v. All presses, F_1,7 = 61.42, p =0.0001) and a Time x Type of press interaction (F_9,63 = 2.67, p =0.01; Fig. 3G). Glutamate transients were more likely (when controlling for number of presses) to occur time-locked to those initiating presses that followed either reward delivery or a pause in pressing than all presses combined. The likelihood of a glutamate transient was elevated (relative to the control 1-s time bin, 5 s prior to the press, which itself did not differ from the baseline likelihood of a glutamate transient in similar epochs without lever pressing during the pre-test period: t₇ = 2.12, p =0.07) between 3 and 1 s prior to initiating lever presses (p < 0.05). The likelihood of a glutamate transient became elevated again in the 3-s window after the initiating press, which corresponded to the average time at which the next press within a bout occurred (average 2.02 s, sem = 0.12). Given that the average latency between an initiating press and reward delivery was 44.4 s (sem = 10.0; max = 160.6, min = 7.76m), it is unlikely that the glutamate release events that occurred during the 5 s following initiation of reward seeking activity were related to reward receipt (see Supplemental Results and Supplemental Fig. 5 for glutamate transient likelihood around reward receipt). These results corroborate our previous report²¹ and suggest that BLA glutamate release events are increased in frequency and amplitude during instrumental reward seeking, are tightly time-locked to the initiation of instrumental action and positively correlate with instrumental performance.

BLA glutamate release during Pavlovian-instrumental transfer

We next measured BLA glutamate concentration changes during PIT to evaluate how BLA glutamate release relates to the cognitive, reward-specific representations generated by reward-predictive cues that allow them to selectively invigorate reward-seeking actions. As can be seen in the group-averaged glutamate concentration v. time trace presented in Fig. 4A, presentation of a Pavlovian CS did not induce any apparent robust or sustained increase in glutamate concentration, although there was a slight overall drift in the baseline current. The reward-predictive cues did, however, elevate the frequency of discrete glutamate release events (Fig. 4B; main effect of Period: F_2,14 = 4.25, p =0.04), but the amplitude of these transients was, on average, not significantly altered by CS presentation (Fig. 4C; main effect of Period: F_2,14 = 1.72, p =0.22). Glutamate transients were more frequent than the pre-test (no behavior or cues) baseline period during the CS (p < 0.05), but not pre-CS period (p >0.05) in a manner similar to that seen during Pavlovian conditioning (no effect of Extinction: Pavlovian conditioning v. PIT test F_1,5 = 0.74, p =0.43 or Extinction x CS interaction: F_2,10 = 0.11, p =0.89- see also Supplemental Results and Supplemental Figure 6 for data from the Pavlovian conditioning test). That there was not a significant difference in glutamate transient frequency during the CS relative to the pre-CS period is likely due to the reward-predictive nature of the operant box context because of its pairing with reward during instrumental conditioning. Indeed, rats were exploring the chamber, entering the food-delivery port and lever pressing during this period.

As during instrumental conditioning, BLA glutamate transient frequency correlated with lever pressing, but only during the CS and with only those actions for which performance was selectively invigorated by the CS, i.e., CS-Same actions (see behavioral results in Fig. 4D). During the pre-CS period the positive correlation between glutamate transient frequency and lever pressing was weakened, relative to instrumental conditioning as a result of extinction; there was a positive, but non-significant between-subjects relationship between pre-CS lever-press rate and pre-CS glutamate transient frequency (r₈ = 0.61, p =0.11; Fig. 4E). The significant positive correlation reemerged when the CS was present, but only for the CS-Same action (r₈ = 0.75, p =0.03), such that those rats for which the CSs caused a stronger selective invigoration of responding it also induced a higher frequency of BLA glutamate transients. Glutamate transient frequency did not significantly correlate with the performance of actions during the CS that, during training, earned an outcome different from that predicted by the CS (CS-Different actions; r₈ = 0.04, p =0.92).

More importantly, examination of the representative example in Fig. 4F and raster plot displaying raw glutamate transient peak times for all subjects in Fig. 4G suggests that BLA glutamate transients were time-locked to the initiation of reward-seeking activity (see Table 1) specifically on the CS-Same action. There was an overall effect of CS (Pre-CS v. CS-Same v. CS-Different initiating presses; F_2,14 = 6.00, p =0.01) on the likelihood of glutamate transients (normalized to number of initiating pressing) distributed around initiating lever presses, with no significant effect of Time (F_9,63 = 1.21, p =0.31) and a marginally insignificant Time x CS interaction (F_18,126 = 1.50, p =0.10; Fig. 4G- bottom). The likelihood of a glutamate transient was only significantly elevated during the CS prior to initiating presses on the CS-Same action (1-s bin, 2 s prior to CS-Same initiating presses, p < 0.001 relative to the control, 1-s bin 5 s prior to the initiating press). Initiation of CS-Same pressing was significantly more likely than initiation of CS-Different pressing to be preceded (within 5 s) by a glutamate transient (average percentage of CS-Same initiating presses preceded by glutamate transient: 28.87%, sem = 6.09; CS-Different initiating presses: 12.96%, sem = 5.39; t₇ = 2.78, p =0.04). See Supplemental Results and Supplemental Figure 7 for an expanded window of analysis for these data. These data suggest that extinction of the press-reward and context-reward associations during the PIT test disrupted the normal temporal relationship between glutamate release and reward seeking, but this was restored when action performance was motivated by outcome-specific information provided by reward-paired cues. See the Supplemental Results and Supplemental Figure 8 for evaluation of the relationship between BLA glutamate transients and Pavlovian conditioned approach during the PIT test.

Table 1 Pavlovian-instrumental transfer test lever pressing bouts.

Full size table

Interestingly, on the macro scale glutamate transient frequency positively correlated with the ratio of responding between actions during the CS (r₈ = 0.71, p =0.049; Fig. 5A), but did not significantly correlate with all non-discriminate CS responding (r₈ = 0.22, p =0.60). This correlation with the CS response ratio was significant even when controlling for overall response rate during instrumental conditioning (partial correlation: r₈ = 0.81, p =0.03) or during the pre-CS period (partial correlation: r₈ = 0.83, p =0.02) and when controlling for the CSs’ ability to non-discriminately elevate reward seeking (partial correlation: r₈ = 0.75, p =0.05). These data suggest that BLA glutamate transients may be related to the motivational influence of outcome-specific representations.

To further support this interpretation we exploited the utility of the two different specific PIT trial types (one for each predicted outcome). If BLA glutamate transients reflect outcome-specific motivational information then, because glutamate biosensors record from BLA microenvironments, recorded glutamate transients for a given subject/recording location should be specific to CS-Same responding for only one outcome type. If however, glutamate transients are simply related to all motivated lever pressing then they should occur prior to CS-Same actions regardless of expected outcome. The data provide evidence in support of the former. For 6/8 subjects glutamate transients were time-locked to initiating presses on the CS-Same action exclusively for only one outcome type (defined as outcome 1). Which outcome served as outcome 1 was not a function of outcome type (pellets v. sucrose), lever, action-outcome arrangement, CS type, outcome preference, or PIT effect magnitude. In the other 2 subjects glutamate transients showed an outcome-selectivity ratio of 0.57 and 0.50, respectively. Fig. 5B displays a representative trial-averaged glutamate concentration v. time trace around initiating presses on the CS-Same action divided by each outcome type. As is clear from this figure, glutamate concentration increased prior to initiating presses on the CS-Same action, but only for one outcome type. The data averaged across subjects support this observation. There was a main effect of Outcome type (F_1,7 = 6.54, p =0.04) and of Time (F_9,63 = 2.48, p =0.02), as well as a Time x Outcome type interaction (F_9,63 = 2.27, p =0.03) on the likelihood of a glutamate transient in the 10-s period around initiating presses on the CS-Same action. Together these results suggest that BLA glutamate transients encode outcome-specific motivational information provided by reward-predictive cues.

Discussion

The data collected here indicate that transient fluctuations in BLA glutamate release are time-locked to and correlate with instrumental reward seeking and that during PIT these glutamate transients are time-locked to and correlate with only those actions invigorated by outcome-specific motivational information provided by a reward-predictive stimulus. This correlational relationship was bolstered by evidence that blockade of AMPA, but not NMDA iGluRs attenuates the selective invigorating influence of reward-predictive stimuli over reward seeking.

That transient BLA glutamate release events were related to instrumental reward seeking replicates previous results demonstrating a similar relationship²¹ and extends this to show that BLA glutamate transients were associated with the actions that initiated reward seeking following reward delivery or a pause in activity, rather than actions occurring within a bout of reward seeking. This release may drive the previously-reported increases in BLA cell body activity that occur prior to instrumental action^38,39 and have been hypothesized to encode outcome expectations³⁸. The results here suggest that BLA glutamate input may encode information vital for motivating goal-directed action, because after a task is well-learned such information is only necessary for initial actions within a chunk^40,41,42. Indeed, these results corroborate evidence that amygdala neurons in primates show prospective activity that reflects internally-generated plans towards future goals⁴³. Of course BLA glutamate release is not exclusively related to instrumental action; glutamate transients were more likely to occur around instrumental actions, but release events were detected throughout the instrumental test, including especially large events during reward receipt.

The temporal relationship between BLA glutamate transients and reward-seeking was tight, but it was not one-to-one and glutamate release events that reached the detection threshold occurred at a rate lower than might be expected for the major excitatory neurotransmitter and primary input signal to the amygdala. The recording technique employed here measures changes in extrasynaptic glutamate concentration, which, because these signals are abolished by tetrodotoxin²¹, are a proxy measure for the tightly-regulated⁴⁴ synaptic overspill^44,45. Glutamate release within the synapse might, therefore, be expected to relate to a much larger percentage of, if not all, initiating lever presses.

This study provided a novel evaluation of the profile of BLA glutamate release during appetitive Pavlovian conditioning. Although the baseline drift in electrochemical measurements do not allow for a definitive answer, the data showed no indication that Pavlovian reward-predictive cues elicited an overall elevation in glutamatergic tone, contrary to what might be expected if BLA glutamate signaling conveys a sustained, cue-induced, emotional valence or motivational signal. This finding is interesting in light of the wealth of data from single-unit electrophysiological recordings showing that reward-predictive stimuli increase BLA cell body firing^{35,36,38,46,47}. Single-unit recordings are biased towards monitoring mostly from output neurons and the glutamate release recorded here reflects input and local activity, but it is this glutamate input from thalamic^46,48,49 or cortical afferents⁶ that is thought to drive the cell body excitation. There are three potential explanations for this discrepancy. First, for the reasons mentioned above it is possible that glutamate biosensors do not provide the adequate sensitivity to measure glutamate that is being released to drive BLA activity upon CS presentation. Secondly, the aforementioned CS-induced BLA neural activity may not be driven by glutamate. We find this unlikely given data demonstrating strengthened glutamatergic thalamic-BLA synapses during Pavlovian conditioning⁴⁶, but dopamine release has been shown to directly excite BLA projection neurons⁵⁰. Thirdly, key task differences may explain the discrepancy between the current glutamate input recordings and the previously reported cue-evoked cell body firing. In the previous reports BLA cell body firing was robustly elicited by short-duration (2–5 s) CSs that predicted immediate reward with strong certainty. The long-duration (2-min) CS probabilistically paired with reward in our task provides more a context for reward and may not induce a robust increase in BLA cell body firing. In support of this, preliminary evidence suggests that a longer duration (30-s) CS that predicts reward at a variable latency is more likely to induce an inhibition in BLA cell body firing⁵¹, which corroborates our current glutamate release results. Clearly, further interrogation of both glutamate release and cell body activity in similar Pavlovian tasks is necessary. Such investigation may lead to important information regarding potential differences between excitatory BLA input and output activity and of the role of such signaling in Pavlovian reward prediction.

Importantly, although there was no detected sustained CS-induced increase, glutamate release did show transient elevations during the PIT test. Following extinction of the response-reward (and context-reward) association the relationship between BLA glutamate transients and instrumental reward seeking was degraded. During the CS, however, this relationship was restored; glutamate transients were significantly more likely to occur time-locked to reward-seeking activity selectively on the action that shared the same rewarding outcome as the CS. That this relationship was restored despite the fact that the CS induced only a modest (relative to the pre-CS period) increase in the frequency of glutamate transients suggests it was not merely a coincidence of both elevated responding (which the analysis controlled for) and elevated glutamate transients. These data lend support to the hypothesis that BLA glutamate transients encode the outcome-specific motivational information provided by reward-predictive stimuli. In further support of this, cue-induced transient glutamate release only correlated with the ratio of responding during the CS, which is thought to reflect the CS’s cognitive, outcome-specific motivational influence. This is accords with the relationship between BLA activity and biasing influence of cues over instrumental action in humans⁵². Moreover, for each subject/recording location glutamate transients encoded only one specific outcome type. Because biosensors record glutamate in BLA microenvironments, the presumption is that for each subject glutamate input signals related to the other outcome were released in a microenvironment outside the sampling space. In the one subject for which glutamate release did not show outcome specificity the biosensor was likely receiving intermixed glutamate input for both outcome types. This specificity in the glutamate input signal suggests that rather than relating simply to motivated lever pressing, the BLA glutamate release detected here encoded outcome-specific information.

BLA glutamate release events were shown to relate to the outcome-specific motivational influence of Pavlovian stimuli, but it is unlikely that these signals participate in the decision-making process itself. If the BLA and glutamate release therein, was required for the decision process then blockade of BLA AMPA receptors should have not only attenuated actions on the lever that earned the same outcome as the CS, but also increased responding on the alternate lever, indicating an inability to select between actions on the basis of the CS-provided outcome expectation. Instead, BLA AMPA receptor inactivation (Experiment 1) and BLA lesions^11,13 only attenuate the selective invigorating influence of CSs. Lesions to either the orbitofrontal cortex or mediodorsal thalamus do, however, cause such non-discriminate CS-induced response invigoration^11,53. Both of these regions send excitatory projections to the BLA^7,8,9 and unilateral, ipsilateral orbitofrontal cortex inactivation abolishes reward seeking-related BLA glutamate transients²¹. Therefore, the glutamate signals detected here likely arise directly from the orbitofrontal cortex, or, given that our sensor placements are located primarily in the basal amygdala, indirectly from this region via projections from the lateral amygdala⁵⁴. BLA glutamate release may, therefore, be vital for invigorating the performance of actions planned in the orbitofrontal cortex by incorporating outcome-specific motivational value. Indeed, the BLA is vital for outcome-specific representations of motivationally significant, but not valueless events⁵⁵. Correlates of the latter have, however, been identified in the orbitofrontal cortex⁵⁶. This interpretation is also supported by the myriad data suggesting the BLA is required for other behaviors that rely on outcome-specific value information^{10,57,58,59,60} and evidence that BLA neural activity can be outcome specific^33,36,38,47 and may encode such value in the rodent^33,38,61, primate^36,62,63,64 and even human BLA⁶⁵.

In summary, the findings here support a role for rapid BLA glutamate signaling in the motivating influence of outcome-specific representations, in this case provided by a Pavlovian reward-predictive stimulus. These results lay the groundwork for further exploration of the role of BLA excitatory glutamatergic signaling and modulation of such signaling in the variety of reward-seeking behaviors that require retrieval of reward-specific information and are relevant to understanding the neuropsychological disorders marked by a disruption in such cognitive processing.