Introduction

Glutamatergic mechanisms of reward learning and processing contribute to the maladaptive behaviors implicated in neuropsychiatric disorders including addiction. While substances of abuse exert diverse effects throughout the nervous system, pathophysiological changes in nucleus accumbens (NAc) glutamate signaling are essential for many addiction-related behaviors [1,2,3]. Changes in N-methyl-d-aspartate receptor (NMDAR) and α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptor (AMPAR) function temporally map onto different stages of drug experience [4,5,6,7,8,9]. Within days following cocaine experience, GluN2B-containing NMDARs are transiently upregulated in NAc medium spiny neurons (MSNs) [4, 10, 11]. Subsequently, a protracted drug-free period (withdrawal/abstinence) from cocaine induces a potentiation of synaptic AMPAR function in the NAc shell [6, 10, 12,13,14] and core [5, 7, 14, 15] and is thought to be necessary for the incubation of drug craving and relapse-related behaviors [8, 16,17,18,19,20]. Transient cocaine-induced GluN2B upregulation in NAc MSNs is hypothesized to be necessary for subsequent potentiation of AMPAR and changes in conditioned drug-related behaviors [21,22,23].

MSNs provide the output of the NAc and can be readily (but not exclusively) dichotomized into two groups. D1-MSNs express D1 dopamine (DA) receptors (Drd1a, D1) and send projections to the ventral mesencephalon and the ventral pallidum [24], whereas D2-MSNs express D2 DA and A2A adenosine (Adora2a, A2A) receptors and project to the ventral pallidum. While many nuanced findings have been reported, a simplified functional summary is that activation of D1-MSNs promotes reward-related behaviors, whereas activation of D2-MSNs attenuates reward and promotes anhedonia, anergia, and aversion [25, 26] Furthermore, AMPAR and NMDAR adaptations in response to cocaine exposure can be differentially expressed in NAc MSNs [5, 7, 8, 14, 15, 27]. While pharmacological antagonism of NMDARs disrupts extinction of cocaine seeking [28], and NMDARs containing the GluN2B subunit are implicated in the formation of persistent cocaine-related memories [4, 16, 18], the cell type-specific function of GluN2B in cocaine-mediated synaptic and behavioral adaptations is still unknown.

To investigate cell type-specific function of GluN2B, we crossed floxed Grin2b mice (GluN2B−/−) [29, 30] with mice expressing Cre driven by the D1 and A2A promoters [31]. Following viral-mediated gene transfer of channelrhodopsin-2 (ChR2), we examined the effects of cocaine on reward learning and synaptic properties of these different cell populations and inputs in the NAc shell. We find that cocaine reward learning remained intact in both D1- and A2A-GluN2B−/− mice. Surprisingly, while the cocaine-related memory dissipated in control (ctrl) mice following withdrawal, we find enhanced retention of cocaine conditioned place preference (CPP) in D1-GluN2B−/− mice. We then examined cocaine-induced changes in AMPAR function in the NAc shell through input- and cell type-specific electrophysiology, focusing on inputs from the prefrontal cortex (PFC) or midline thalamus (mThal). At 2 weeks following non-contingent cocaine, AMPAR function was potentiated at mThal-D1(+) MSNs. The AMPAR potentiation dissipated at 4 weeks post cocaine, concurrent with the loss of cocaine reward memory. However, in D1-GluN2B−/− mice, mThal-D1(+) NAc shell AMPAR function remained potentiated, consistent with the retention of cocaine reward memory. These results reveal a cell type-specific role of GluN2B in cocaine-induced functional adaptations of AMPARs and provide insight into the role of GluN2B in associative reward learning.

Methods

Bacterial artificial chromosome (BAC) adult (6–12 weeks old) male mice on a C57BL/6J background were used in all experiments and were housed together in groups of two to five per cage on a 12/12 h light/dark cycle (lights on at 06:00), with food and water available ad libitum. Transgenic floxed mice (Grin2Blox/lox) were generated as previously described [30]. Grin2Blox/lox mice were crossed with BAC transgenic mice expressing Cre recombinase under the regulation of Drd1a [32] or Adora2a promoter, analogous to that strategy we took with Grin1fl/fl mice previously [33]. We elected to use Adora2a instead of the Drd2 promotor because D2 is expressed by striatal interneurons and midbrain dopaminergic neurons. Mice used for electrophysiology either expressed TdTomato under the control of the Drd1a promoter [31, 34] or expressed a Cre-inducible TdTomato (Ai9, The Jackson Laboratory, Sacramento, CA). Cre-negative littermates on the Grin2Blox/lox background were used as controls. All experiments were conducted in a protocol approved by the Vanderbilt University institutional animal care and use committee.

Cocaine conditioned place preference

Prior to the place conditioning pretest, mice were habituated to behavioral testing with a 1 h exposure to similar activity chambers. All place conditioning sessions were 20 min in duration. Conditioning sessions were conducted twice daily, separated by 4 h, and treatment order was counterbalanced across groups. All mice received injections of vehicle (0.9% saline, 10 µL/g, intraperitoneal) or cocaine (20 mg/kg) prior to confinement in one compartment. Overhead video recordings of mouse activity were analyzed with automated software (EthoVision XT, Noldus Information Technology, Leesburg, VA). The apparatus was divided into two contextually distinct environments based on floor and wall (rough with vertical bars vs smooth with checkerboard). No innate preference for either environment was observed and hence cocaine injections were paired with the less-preferred side for each mouse. We assessed submaximal cocaine CPP following one cocaine pairing and maximal cocaine CPP following three additional pairings. The acquisition of cocaine CPP was assessed the day after the last conditioning session and maintenance of the reward-related memory was assessed at extended time points.

Stereotaxic injections

At 4–5 weeks of age, mice underwent stereotaxic surgery for viral-mediated gene transfer of ChR2. Bilateral 250–400 nL injections of AAV5-CaMKII-ChR2-EYFP (UNC Viral Vector Core, Chapel Hill, NC) were delivered into (relative to bregma): mThal (medial-lateral (ML): ±0.3, anterior-posterior (AP): −1.2, dorsal-ventral (DV): −3.0) or PFC (ML: ±0.3, AP: −1.75, DV: −2.75). Bilateral 800 nL injections of AAV5-CaMKII-Cre-GFP or AAV5-CaMKII-ChR2-EYFP were injected into the NAc (ML: ±0.75, AP: 1.3, DV: −4.5). Mice were killed for electrophysiology 4–6 weeks following surgery to allow for ChR2-EYFP expression in axon terminals. Representative images of ChR2-EYFP injection sites can be found in a previous publication [35].

Electrophysiology

Detailed methodology for whole-cell voltage-clamp electrophysiology experiments can be found in a previous publication [5]. In brief, sagittal slices (250 µm) were prepared containing the medial NAc shell. Only the first two slices (<0.6 mm from midline) were used for recordings and the NAc shell was identified as being ventral and rostral to the anterior commissure. Recording pipettes were filled with a cesium-based internal solution (in mM): 120 CsMeSO4, 15 CsCl, 8 NaCl, 10 HEPES, 0.2 EGTA, 10 TEA-Cl, 4 MgATP, 0.3 NaGTP, 0.1 spermine, and 5 QX-314. All recording solutions contained picrotoxin (50 µM). NMDAR-excitatory postsynaptic potentials (EPSCs) were isolated in the presence of NBQX (10 µM) and AMPAR-EPSCs in the presence of AP-5 (50 µM). For current–voltage relationships, NMDAR-EPSCs and AMPAR-EPSCs were normalized to the peak amplitudes obtained at +30 and −60 mV, respectively. For asynchronous EPSC (asEPSC) experiments, equimolar Sr2+ (2.5 mM SrCl2) replaced CaCl2 in the extracellular solution and longer light durations were used (3–5 ms) to evoke glutamate release. Asynchronous events were identified using a predefined template search in the 200 ms window following EPSC onset [5, 36]. The AMPAR rectification index was calculated as (I+40I0)/(I40I0). For cocaine experience experiments, mice received 5 injections of cocaine (15 mg/kg) in a locomotor activity apparatus.

Data analysis

Data are expressed as mean ± SEM. For electrophysiology, each data point represents the average value from one cell. The number of cells and number of mice are represented as “n” and “N”, respectively. One- or two-way analysis of variance (ANOVA) or two-tailed Student’s t-test were used when indicated. Repeated measures or paired tests were performed when appropriate. All post-tests employed Bonferroni corrections for multiple comparisons and p < 0.05 was considered significant.

Results

Verification of cell type-specific GluN2B deletions

To verify the specificity of the genetic deletions, we pharmacologically isolated NMDAR-EPSCs from NAc shell MSNs in acute brain slices identified by the fluorophore tdTomato (Fig. 1a). NMDAR-EPSCs from D1-GluN2B−/− and A2A-GluN2B−/− MSNs exhibited faster decay times than control (D1-GluN2B−/−: 39.4 ± 4.7 ms vs D1-CON: 75.2 ± 9.8 ms, p < 0.01, Fig. 1b; A2A-GluN2B−/−: 31.2 ± 5.9 ms vs A2A-CON: 66.4 ± 12.0 ms, p < 0.05). D1-GluN2B−/− and A2A-GluN2B−/− MSNs also exhibited a blunted current–voltage relationship of NMDAR-EPSCs (Fig. 1b). These data are consistent with GluN2B-containing NMDARs exhibiting slower decay kinetics [37]. We also assessed the sensitivity of the NMDAR-EPSC to the GluN2B antagonist, ifenprodil. Ifenprodil reduced the NMDAR-EPSC in control D1(+) MSNs (77.1 ± 3.5% baseline). In contrast, ifenprodil did not affect the NMDAR-EPSC in MSNs with genetic deletion of GluN2B (95.9 ± 4.3% baseline, p < 0.01 vs control), thus confirming cell type-specific GluN2B knockout.

Fig. 1
figure 1

Verification of cell type-specific GluN2B genetic deletions in the NAc. a Left, Representative pharmacologically isolated NMDAR-EPSCs recorded from fluorescently targeted D1(+) NAc MSNs. Scale bars denote 50 ms and 100 pA. Dotted lines denote time to 1/2 peak (t1/2). Right, Summary of the t1/2 of the NMDAR-EPSC D1(+) MSNs (left; Ctrl, black circles, n/N = 8/3; D1-GluN2B−/−, red squares, n/N = 7/3) and D1(−) MSNs (right; Ctrl, black circles, n/N = 8/3; A2A-GluN2B−/−, blue squares, n/N = 7/3) (*p < 0.05, **p < 0.01, t-tests). b Summary of normalized, NMDAR-EPSC amplitudes as a function of membrane potential; D1(+) (n/N = 6/3, n/N = 7/3, left) and D1(−) (n/N = 8/3, n/N = 6/3, right) MSNs (*p < 0.05, ***p < 0.001, Bonferroni post-tests). c Left, Representative NMDAR-EPSCs inhibited by 10 µM ifenprodil. Middle, Timecourse displaying partial inhibition of NMDAR-EPSCs by ifenprodil in control D1(+) MSNs (n/N = 7/3) but not D1-GluN2B−/− MSNs (n/N = 6/3). Right, Summary of last 5 min of recording. Deletion of GluN2B eliminates ifenprodil sensitivity in D1(+) MSNs (** p < 0.01, t-test). Scale bars denote 100 ms and 100 pA

Enhanced retention of cocaine reward learning in D1-GluN2B−/− mice

To determine consequences of cell type-specific GluN2B function on cocaine reward learning, we performed cocaine CPP in D1- and A2A-GluN2B−/− mice. A single-pairing protocol was used to assess submaximal reward learning. We observed a modest degree of cocaine CPP in control mice (127 ± 50 s/10.5 ± 4.2% session time, p < 0.05, Fig. S1) with no difference across genotypes (F(2, 31) = 0.1282, nonsignificant (n.s.)). We followed up with three additional cocaine/vehicle pairings to generate maximal CPP (303 ± 34 s/25.2 ± 2.8% session time, p < 0.05, Fig. 2a). All GluN2B−/− genotypes exhibited CPP (F(2, 34) = 0.4594, n.s.), indicating that cell type-specific GluN2B function is not required for acquisition of cocaine CPP.

Fig. 2
figure 2

D1-GluN2B−/− display enhanced long-term retention of cocaine reward learning. a Cocaine place conditioning 24 h post cocaine pairing for Ctrl (black circles, N = 13), D1-GluN2B−/− (red triangles, N = 11), and A2A-GluN2B−/− (blue squares, N = 13) mice. b Cocaine place conditioning at 4 weeks following cocaine exposure (N = 8,8,10, **p < 0.01, Bonferroni post-tests). c Representative heat maps displaying time spent on saline-paired (S) and cocaine-paired (C) sides during initial post-test (top) and 4-week maintenance test (bottom) in Ctrl (left) and D1-GluN2B−/− (right) mouse. d Summary of initial post-test (dark) and maintenance test (light) CPP scores. Data were displayed in separate analyses in (a, b) (**p < 0.01, Bonferroni post-tests)

NAc GluN2B expression has been suggested to fulfill an intermediate state underlying the development of long-term behavioral effects of cocaine. Therefore, we sought to measure the maintenance of reward-related memory as assessed by retention of CPP 4 weeks later. We observed a main effect of genotype on the long-term maintenance of cocaine CPP (F(2, 23) = 8.186, p < 0.01, Fig. 2b) with D1-Glun2B−/− mice exhibiting enhanced CPP at 4r weeks relative to controls and A2A-Glun2B−/− (D1: 384 ± 59 s vs CON: 121 ± 40 s, p < 0.01; A2A: 112 ± 63 s, p < 0.01). No difference was observed at a 2-week time point (F(2, 17) = 0.3959, n.s., Fig. S2). To evaluate whether D1-Glun2B−/− mice exhibited persistent memory retention, we compared the 4-week time point to the original test session revealing a significant interaction between time and genotype (Fig. 2c, d; F(2, 25) = 9.778, p < 0.001). Specifically, at 4 weeks, control mice displayed a reduction in cocaine CPP (Maint: 121 ± 40 s vs Test: 303 ± 34 s, p < 0.01), as did A2A-GluN2B−/− (Maint: 112 ± 63 s vs Test: 237 ± 51 s, p < 0.01). In contrast, D1-GluN2B−/− mice retained cocaine CPP at the 4-week time point (Maint: 384 ± 59 s vs Test: 296 ± 40 s, n.s.). Together, these data suggest that loss of GluN2B specifically from D1-expressing cells enhances long-term retention of cocaine reward memory.

Intact locomotor sensitization in D1- and A2A-GluN2B−/− mice

In addition to measuring cocaine CPP, we also analyzed locomotor activity during conditioning sessions. During saline sessions, a two-way ANOVA revealed a main effect of genotype (F(2, 39) = 15.04, p < 0.001, Figure S2A) on locomotor activity. Post-tests indicated that D1-GluN2B−/− mice exhibited less locomotor activity and A2A-GluN2B−/− mice displayed enhanced activity relative to control mice. Cocaine conditioning exhibited a main effect of session (F(3, 39) = 39.64, p < 0.0001, Fig. S2B) and genotype (F(2, 39) = 9.950, p < 0.001) on locomotor activity. However, during the cocaine sessions, post-tests only revealed diminished hyperactivity in D1-GluN2B−/− mice. Importantly, despite displaying less locomotor activity than littermate controls, D1-GluN2B−/− mice exhibited sensitization to the locomotor response (Coc 3: 82.4 ± 18.0 m vs Coc 1: 28.0 ± 6.8 m, p < 0.01 and Coc 4: 88.1 ± 21.0 m, p < 0.01, Bonferroni post-tests). These data indicate that cell type-specific GluN2B function is not required for sensitization of locomotor activity, but does bi-directionally influence locomotion.

Hypoactivity in response to cocaine injections was recapitulated in a separate cohort of D1-GluN2B−/− mice that underwent repeated cocaine injections in the same environment (genotype: (F(1, 15) = 42.53, p < 0.0001; interaction: (F(5, 75) = 4.971, p < 0.001)) (Figure S3A). We then aimed to test whether NAc-specific deletion of GluN2B is sufficient to attenuate the hyperactive locomotor response to cocaine. We virally expressed Cre recombinase in the NAc of 5-week old GluN2Bf/f mice and treated them with repeated cocaine injections 6 weeks later. NAc-GluN2B−/− mice displayed an attenuation of cocaine hyperactivity (genotype: (F(1, 18) = 11.54, p < 0.01; interaction: (F(5, 90) = 4.818, p < 0.0001)) (Figure S3B). Together, these data suggest that altered responses to cocaine in D1-GluN2B−/− do not entirely stem from developmental changes, and that loss of GluN2B in the NAc is sufficient for these effects.

Cocaine-induced changes in NAc shell synaptic physiology

mThal-D1(+) NAc shell AMPAR function is enhanced 2 weeks after cocaine exposure

Glutamatergic synaptic plasticity in the NAc has been demonstrated to mediate long-term conditioned behavioral responses to cocaine [8, 16,17,18, 20]. Moreover, repeated cocaine exposure induced a transient increase in GluN2B expression and function in the NAc shell [4, 10, 11]. This transient increase is hypothesized to play a role in the development of cocaine locomotor sensitization and the incubation of cocaine craving—states that are both associated with upregulation of NAc shell AMPARs. Thus, we investigated the contribution of GluN2B to cocaine-induced adaptations in NAc shell AMPAR function in a cell type-specific manner at PFC and mThal inputs.

To model the cocaine exposure received during CPP, we conditioned mice with 5 days of cocaine in a novel environment and then 2 weeks of withdrawal (Fig. 3a). We recorded quantal-like asEPSCs from PFC and mThal inputs to NAc shell D1(+) MSNs using Sr2+-containing artificial cerebrospinal fluid (aCSF). The amplitude of asEPSCs reflects synaptic AMPAR function, and cocaine experience did not affect PFC-D1(+) asEPSC amplitude (Fig. 3b). No effect on AMPAR current–voltage relationship (Fig. 3c) or rectification index (RI; Fig. 3d) was observed following non-contingent cocaine exposure. At 2 weeks after cocaine administration, the amplitude of mThal-D1 asEPSCs in the NAc shell was increased (coc: 34.6 ± 2.0 pA vs sal: 28.4 ± 1.9 pA, p < 0.05, Fig. 3e) with no effect on current–voltage relationship (Fig. 3f) or RI (Fig. 3g). Consistent with these findings, we did not observe cocaine-induced AMPAR rectification when eliciting EPSCs with non-specific electrical stimulation (Fig. S4). Moreover, we did not observe any effects of cocaine experience on mThal-D1(−) NAc shell AMPAR function (Fig. S3). The enhanced AMPAR-EPSCs at mThal-D1(+) MSNs are consistent with findings in non-cell-type-specific MSN recordings in the shell [10] and mThal-D1(+) MSNs in the NAc core [5].

Fig. 3
figure 3

Cocaine experience enhances AMPAR function at mThal-D1 NAc shell synapses without altering CP-AMPARs. a Temporal schematic of saline/cocaine injection and electrophysiological recordings from mice with viral-mediated expression of ChR2 in either the prefrontal cortex (PFC) (bd) or midline thalamus (mThal) (eg). b Amplitude of asynchronous EPSCs (asEPSCs) elicited in Sr2+-replaced aCSF in saline- (filled triangles) or cocaine-treated mice (open triangles) (n/N = 11/4 sal, 8/3 coc). c Pharmacologically isolated AMPAR current–voltage relationship normalized to the −60 mV peak current (n/N = 11/4 sal, 8/3 coc). d Summary of AMPAR rectification index (RI) at PFC-D1(+). e mThal-D1(+) asEPSC amplitude of sal- (filled squares) or coc-treated mice (open squares (*p < 0.05, n/N = 8/4 sal, 7/3 coc). f AMPAR rectification at mThal-D1 synapses (n/N = 10/4 sal, 11/4 coc). g RI of mThal-D1(+) AMPAR-EPSCs

D1-specific GluN2B KO prolongs cocaine-induced AMPAR plasticity

We determined that loss of GluN2B prolongs mThal-D1(+) AMPAR potentiation induced by cocaine exposure correlating with enhanced retention of CPP 4 weeks after cocaine exposure (Fig. 4a). At this longer withdrawal time point, the asEPSC at mThal-D1(+) NAc shell synapses returned to saline control levels (coc: 27.3 ± 1.6 pA vs sal: 27.0 ± 2.5 pA, n.s., Fig. 4b). Consistent with our data at the 2-week withdrawal time point, AMPAR current–voltage relationship and RI were not different following cocaine withdrawal (Fig. 4c, d). However, in the D1-GluN2B−/− mice, the cocaine-induced potentiation of mThal-D1 AMPAR function persisted at 4 weeks of withdrawal (coc: 34.15 ± 2.3 pA vs sal: 26.38 ± 2.5 pA, p < 0.05, Fig. 4e). RI of AMPAR-EPSCs in GluN2B−/− mice was not different than control (Fig. 4f, g). There was a trend towards increased sEPSC amplitude in cocaine-treated D1-GluN2B−/− MSNs at four weeks (coc: 17.91 ± 1.9pA vs sal: 12.56 ± 1.4pA, p < 0.078, Fig. S4B); otherwise the changes observed in D1-GluN2B−/− mice do not reflect non-specific changes to D1-MSN AMPAR function (Fig. S5C-E). Taken together, these data indicate that increased mThal-D1(+) NAc shell AMPAR function is associated with the expression of cocaine CPP and that D1-specific loss of GluN2B enhances the persistence of cocaine-conditioned synaptic function.

Fig. 4
figure 4

D1-GluN2B−/− mice retain mThal-D1 NAc shell AMPAR potentiation following long-term withdrawal from cocaine. a Temporal schematic of saline/cocaine injection and electrophysiological recordings from mice with viral-mediated expression of ChR2 in the midline thalamus (mThal) in control D1-TdTomato (bd) or D1-GluN2B−/− (eg) mice. b Amplitude of asynchronous EPSCs (asEPSCs) elicited in Sr2+-replaced aCSF for saline- (filled squares) or cocaine-treated mice (open squares) (n/N = 8/3 sal, 12/4 coc). c Isolated AMPAR current–voltage relationship normalized to the peak current obtained at −60 mV (n/N = 9/3 sal, 10/4 coc). d RI of AMPAR current–voltage relationships. e Summary of asEPSC amplitude in saline and cocaine D1-GluN2B−/− mThal-D1 cells (*p < 0.05, n/N = 6/3 sal, 11/4 coc). f Current–voltage relationship of AMPARs at mThal-D1 NAc shell synapses of GluN2B−/− (n/N = 6/3 sal, 6/3 coc). g RI of AMPAR-EPSCs in GluN2B−/−

Discussion

We demonstrate that, while the expression of cocaine CPP diminished over 4 weeks in controls, CPP remains intact in D1-GluN2B−/−, mice suggesting that GluN2B serves as a break to limit or attenuate drug-induced adaptations. Additionally, GluN2B function in D1- and A2A-expressing cell types is not necessary for either cocaine reward learning or psychomotor sensitization. Upon assessment of the circuit and cell type-specific synaptic adaptations potentially underlying the cocaine-induced behaviors, no change to PFC-D1(+) NAc shell AMPAR function was observed following 2-week withdrawal from cocaine. However, mThal-D1(+) NAc shell AMPAR function was enhanced similar to findings in NAc shell and core [5, 10]. AMPAR potentiation subsided at 4 weeks of withdrawal in control mice but persisted in D1-GluN2B−/− mice, suggesting mThal-D1(+) GluN2B regulation of AMPAR function in the NAc modulates retention of cocaine reward memory. It has been hypothesized that prolonged abstinence from cocaine exposure may induce a homeostatic scaling of AMPARs in the NAc [38] [23]. Furthermore, RNA interference knockdown of Grin2B leads to protein synthesis-dependent increases in AMPAR homeostatic scaling in cortical neurons [38]. Thus, one possibility is that GluN2B negatively regulates homeostatic scaling of AMPARs induced by cocaine administration.

Consistent with reports from other brain regions [30, 40, 41], genetic loss of GluN2B function in NAc shell MSNs was associated with faster NMDAR decay kinetics. When assessing current–voltage relationship, we observed that loss of GluN2B attenuated the relative current passed at moderately hyperpolarized membrane potentials. Similar changes to the NMDAR current–voltage relationship in the NAc core have been associated with altered function of the GluN2C subunit following alcohol and cocaine exposure [5, 42, 43]. Together, these data suggest that triheteromeric GluN1/GluN2B/GluN2C NMDARs may be expressed in the NAc and underlie components of cocaine-induced plasticity. Future research investigating these NMDARs could elucidate the pathophysiology of addiction-related behavior and the identification of novel therapeutic targets.

At face value, the enhanced maintenance of cocaine reward learning exhibited by D1-GluN2B−/− mice was reminiscent of the incubation of cocaine craving [44], a phenomenon that occurs following long-term abstinence from extended-access cocaine self-administration. Because this behavior is associated with enhanced NAc calcium-permeable (CP)-AMPAR expression [12, 16,17,18, 20], we aimed to test the hypothesis that loss of GluN2B promoted incorporation of CP-AMPARs and related behaviors. We tested this by examining AMPAR rectification, as CP-AMPARs are known to display inward rectification at positive holding potentials. However, in agreement with previous work [6, 12, 21, 45] but see Mameli et al. 2009; Terrier et al. 2015), we found no evidence that non-contingent cocaine administration upregulated NAc shell CP-AMPAR function overall or at synapses from the PFC or mThal, even in the D1-GluN2B−/−. However, we did find further evidence that cocaine exposure potentiates mThal-D1(+) NAc function, expanding on previous work that examined mThal inputs to the NAc shell [10] and the NAc core [5].

We recently examined D1-GluN1−/− and A2A-GluN1−/− models and found that both mouse lines retained the expression of cocaine CPP [33]. The present data investigating GluN2B are consistent with those findings, which collectively indicate that development of cocaine CPP does not require intact NMDAR function in D1- or A2A-expressing cells. Utilizing a GluN2B antagonist, Ro-25,6981, infused into the NAc, Brown et al. [11] showed the development of cocaine-induced locomotor sensitization is dependent on GluN2B. However, it is less clear how these receptors contribute to synapse maturation in the stretch of time following withdrawal/abstinence in vivo. Beyond the drug intake period, GluN2B-containing NMDARs may actually serve to negatively regulate synaptic AMPAR expression [23, 38], potentially acting as a feedback mechanism to reduce drug-induced adaptations. Our results suggest that this may indeed be the case, as D1(+)-GluN2B−/− mice retain cocaine-induced potentiation of AMPAR currents at mThal-D1(+) synapses and place preference to cocaine. It is worth noting that our results rely on genetically ablating GluN2B at the onset of D1 DA and A2A receptor expression which occurs early in development. Developmental adaptations may play a role in our results; however, no differences in AMPAR function were observed in naive or saline-treated GluN2B-deficient mice. It is also possible that D1-expressing cells outside of the NAc shell are contributing to the behavioral phenotype.

In summary, our results show that specific loss of GluN2B from D1-expressing cells enhances the retention of cocaine-induced conditioned behavioral and synaptic responses. Additionally, the data presented here suggest a correlative effect between AMPAR potentiation at mThal synapses onto NAc shell D1(+) MSNs and retention of cocaine CPP, although recent work from Shukla et al. [13] suggests cocaine CPP is not dependent on incorporation of CP-AMPARs. Together, these data raise the possibility that NMDARs containing GluN2B are expressed after highly salient experiences and weaken or even reverse those memories over time. Alternatively, a more durable reward memory could have been formed due to enhanced excitatory connectivity between mThal and D1(+) MSNs, consistent with the finding that GluN2B deletion in the adult PFC increased the strength of the thalamic input onto those neurons [46]. Furthermore, GluN2B knockdown increased the number of synaptic contacts in developing CA1 pyramidal neurons [47]. While the transient expression of GluN2B-enriched silent synapses is thought to be essential in permitting long-term cocaine-induced adaptations [11, 16, 18, 22, 27], GluN2B may serve an additional feedback role simultaneously or at later time points, such as during extended abstinence. NAc NMDAR activity is considered to be a key player in the acquisition of reward-related memories and behaviors, but more experiments need to be performed to address whether NAc NMDARs may act to temporally restrict drug-associated reward and/or extinction learning. Input-specific investigation combined with cell type-specific deletions of NMDARs represent an attractive tool-set to carefully further elucidate the electrophysiological, biochemical, and morphological changes induced in NAc MSNs by rewarding experiences.