Introduction

Drug addiction is characterized by cycles of compulsive drug-taking and drug-seeking and high rates of relapse. Relapse rates for opioids are higher than any other drug class, with up to 59% of individuals relapsing in the first week of abstinence and 80% relapsing in the first month [1]. Moreover, deaths linked to opioid overdoses are currently the leading cause of accidental death among American adults.

The cortico-basal ganglia (C-BG) circuit has been identified as a critical neural substrate in regulating the development and persistence of addiction. The striatum, which can be anatomically divided into the dorsal striatum and the nucleus accumbens (NAc), serves as the interface of the C-BG circuit, integrating cortical and subcortical inputs to guide a diverse set of behaviors including associative learning, decision-making, and motivation [2, 3]. The NAc is necessary for the development and expression of opioid-seeking [4, 5], and has been shown to modulate opioid reward and opioid-taking [6,7,8,9]. However, the striatum is a heterogeneous structure containing two interspersed populations of GABAergic medium spiny neurons (MSNs): direct pathway MSNs (dMSNs) and indirect pathway MSNs (iMSNs). Both dMSNs and iMSNs receive extensive glutamatergic inputs from cortical, subcortical, and thalamic nuclei; GABAergic inputs from local striatal interneurons and other MSNs via lateral inhibition; and dopaminergic inputs from the ventral tegmental area (VTA) and substantia nigra [10]. However, NAc dMSNs project directly to the VTA, while NAc iMSNs project to the ventral pallidum (VP). As a result, these neuron populations can have opposing control over behavioral output, with stimulation of dMSNs facilitating behavior by serving as a “go” signal and stimulation of iMSNs suppressing behavior by serving as a “stop” signal [3, 11]. With respect to addiction, it has been hypothesized that disruptions in the balance of signaling between dMSNs and iMSNs (i.e., excessive “go” signal or insufficient “stop” signal) could underlie the development of the compulsive drug-taking and drug-seeking behaviors that can develop after drug use. Nonetheless, the role of these striatal cell populations has not been examined in behaviors related to opioid addiction.

As only a subset of drug users transition from casual use to addiction, it is essential to investigate whether addiction vulnerability is an intrinsic (e.g., genetically-defined) or acquired trait [12]. Thus, we have drawn from previous work investigating individual variability in addiction-like behavior [13,14,15] and developed a model for characterizing addiction severity using an intermittent-access (IntA) self-administration (SA) procedure that has been shown to produce distinct patterns of behavior in subsets of rats [14, 15]. Additionally, we incorporated key components of the DSM diagnostic criteria for Opioid Use Disorder, including difficulty stopping or limiting drug intake, high motivation to obtain and consume drug, sustained drug craving during periods of abstinence, and relapse following abstinence into our severity classification model [16]. We then combined this model with dual-viral-mediated gene transfer techniques for selective expression of Gi/o- and Gq-DREADDs in NAc dMSNs and iMSNs to determine how transient, bidirectional changes in the activity of these cell populations [17, 18] alters the motivation to take heroin and cue-induced reinstatement of heroin-seeking in rats expressing low and high levels of addiction-like behavior.

Methods

Subjects

Outbred male Sprague-Dawley rats (n= 133; Envigo) weighing 250–274 g upon arrival were pair-housed in a temperature- and humidity-controlled vivarium on a 12 h light/dark cycle. Rats were mildly food-restricted (~95–100% of initial body weight) 2 days prior to self-administration for the study duration. Behavioral procedures occurred at the beginning of the dark cycle, and all procedures were performed according to National Institutes of Health (NIH) guidelines and approved by Seattle Children’s Research Institute’s Institutional Animal Care and Use Committee (IACUC).

Drugs

Diamorphine HCl (heroin) was obtained through the Drug Supply Program of the National Institute on Drug Abuse (NIDA) and dissolved in sterile saline (0.9% NaCl) to a stock concentration of 5 mg/ml. Syringes were prepared based on body weight for administration of 0.075 mg/kg heroin per infusion (50 μl/infusion delivered over 2.8 s). D-amphetamine (Millipore-Sigma; A5880) was dissolved in sterile saline at a concentration of 2 mg/ml and administered at a dose of 1 ml/kg. Clozapine-N-oxide (CNO) was obtained through the Rapid Access to Investigate Drug Program of the National Institute of Neurological Disorders and Stroke (NINDS) of the NIH and was dissolved in dimethyl sulfoxide (DMSO; Sigma Aldrich, D650) in a hot water bath, then further diluted in sterile water to a final concentration of 6% DMSO. CNO was prepared fresh daily at a concentration of 5 mg/ml and administered ip at a dose of 1 ml/kg. Vehicle (VEH) injections consisted of DMSO dissolved in sterile water to a final concentration of 6% DMSO and were administered ip at a volume of 1 ml/kg.

Viral vectors

Adeno-associated viruses containing Cre-dependent Gi-coupled (pAAV-hSyn-DIO-hM4D(Gi)-mCherry) and Gq-coupled (pAAV-hSyn-DIO-hM3D(Gq)-mCherry) DREADDs were acquired from Addgene (viral prep #44362-AAV8 and #44361-AAV8, respectively) and had titers of ≥4 × 1012 viral genomes/ml. Adeno-associated virus containing Cre-dependent eGFP (rAAV5-hSyn-DIO-eGFP) was acquired from the UNC Vector Core (lot AV4497c) and had a titer of 4.1 × 1021 viral molecules/ml. Canine adenovirus containing retrogradely-transported Cre (CAV2-Cre) had a titer of ~2.5 × 109 viral genomes/μl and was prepared as previously described [19].

Surgical procedures

Rats were anesthetized with isoflurane (1–5% inhaled; Patterson Veterinary) and received meloxicam (1 mg/ml, 1 ml/kg subcutaneously; Patterson Veterinary) for analgesia. Rats were monitored for at least 3 days following each surgical procedure.

Catheter surgeries

Jugular catheterization surgeries were performed as previously described [20]. Catheters were flushed daily with gentamicin (5 mg/ml) and heparin (3 U/ml) in sterile saline (0.2 ml/day, iv) to maintain patency. Catheter patency was checked prior to the first day of SA and following the final PR test with brevital sodium (10 mg/ml in sterile saline, 0.05–0.2 ml/infusion, iv; methohexital sodium, Patterson Veterinary); rats that became ataxic within 5 s of infusion were considered to have maintained patency. Rats that lost patency (n= 19) or whose catheters were damaged (n= 6) prior to the beginning of extinction training were excluded from all analyses.

Stereotaxic surgeries

Stereotaxic surgeries were performed using standard procedures [20]. Viral vectors were delivered via 33-gauge needles and infused at a volume of 500 nl/virus and a rate of 100 nl/min, unless otherwise specified. Coordinates for target brain regions (in mm, relative to bregma) were as follows: NAc [A/P +1.6, M/L ±1.5, D/V −7.3], VTA [A/P −5.0, M/L ±0.9, D/V −8.5], VP [A/P −0.1, M/L ±2.0, D/V −8.0] [21]. For c-Fos experiments, CAV2-Cre was bilaterally infused into the VTA or VP, DIO-hM4Di or DIO-hM3Dq was unilaterally infused into the left NAc, and DIO-eGFP was unilaterally infused into the right NAc. For experiments targeting dMSNs, CAV2-Cre was bilaterally infused into the VTA and DIO-hM4Di or DIO-hM3Dq was bilaterally infused into the NAc. For experiments targeting iMSNs, CAV2-Cre was bilaterally infused into the VP and DIO-hM4Di or DIO-hM3Dq was bilaterally infused into the NAc.

Self-administration and severity classification

Self-administration apparatus

Training and testing were conducted in self-administration chambers equipped with two retractable levers, white cue lights located above each lever, a house light, and a grid floor (MedAssociates). A syringe pump located outside each chamber was connected to a swivel within each chamber to permit iv drug infusions.

Self-administration procedure

Rats were trained to self-administer heroin under an IntA schedule that produces spikes in brain drug concentration [14]. Each SA session (6 h/day, 10 days) consisted of 12 repeating drug-available (5 min) and drug-unavailable (25 min) blocks; the length of each drug-unavailable block ensured that brain concentrations for heroin and its two active metabolites (morphine and 6-monoacetylmorphine) were on the descending limb of their pharmacokinetic curves at the beginning of each drug-available block [22]. Daily IntA SA sessions were signaled by extinction of house lights and extension of two levers into the chamber: lever presses on the active lever resulted in drug delivery (FR1, 0.075 mg/kg/infusion over 2.8 s) and illumination of the cue light (3 s), while lever presses on the inactive lever had no consequences. At the end of each drug-available block, house lights re-illuminated to signal the beginning of the drug-unavailable block and levers remained extended to monitor drug-seeking during cued-unavailability, though lever presses resulted in neither drug delivery nor cue presentation. Following IntA SA, rats underwent a progressive ratio (PR) test in which the number of lever presses required for an infusion increased with each successive infusion (5, 10, 20, 45, 65, 85, 115, 145, 165, 185, 215, 245) until either 30 min passed without an infusion or 6 h total had elapsed [20]. Next, rats underwent daily extinction sessions (60 min/day, 9 days) during which lever presses no longer resulted in cue presentation or drug delivery; this was sufficient to reduce the responding of all rats to <20% of their final day of SA. Following extinction training, rats underwent a cue-induced reinstatement test (60 min), during which the drug-paired cue light was presented at the beginning of the session and active lever presses resulted in further cue presentations but no drug delivery.

Addiction severity classification

Six behavioral measures were collected and used to classify rats according to their overall addiction severity: (1) Intake, calculated as the total number of heroin infusions taken during IntA SA; (2) Consistency, calculated as the cumulative percentage of drug-available blocks across all IntA sessions in which at least one infusion was self-administered; (3) Seeking, calculated as the cumulative number of active presses during drug-unavailable blocks across all IntA sessions; (4) Motivation, calculated as the breakpoint during PR testing; (5) Extinction, calculated as the total number of active lever presses during extinction training; and (6) Relapse, calculated as the total number of active lever presses during cue-induced reinstatement. The raw values for each behavior were converted to z-scores by subtracting the group mean from the individual values and dividing by the standard deviation of the group [23]. Individual z-scores for each animal were combined to give a cumulative addiction severity score (SS), which was used to classify rats into low-risk (SS < −1), moderate-risk (−1 ≤ SS ≤ 1), and high-risk (SS > 1). The goal of this study was to examine neurobiological differences between addiction-vulnerable (high-risk) and addiction-resilient (low-risk) rats, so moderate-risk rats (n = 11) were excluded.

Anatomical tracing

To validate that DREADDs were selectively targeted to dMSNs, cholera toxin subunit B (CTB) conjugated to AlexaFluor (AF) 647 (CTB-647; C-34778, ThermoFisher) was mixed 1:1 with CAV2-Cre and infused into the VTA (700 nl total, 70 nl/min), CTB conjugated to AF488 (CTB-488; C-34775, ThermoFisher) was infused into the VP (350 nl, 70 nl/min), and DIO-hM4Di-mCherry was infused into the NAc. To validate that DREADDs were selectively targeted to iMSNs, CTB-647 was mixed 1:1 with CAV2-Cre and co-infused into the VP (700 nl total, 70 nl/min), CTB-488 was infused into the VTA (350 nl total, 70 nl/min), and DIO-hM4Di-mCherry was infused into the NAc. Twenty-one days later, rats were anesthetized with Beuthanasia-D (2 ml/kg ip; Patterson Veterinary) and transcardially perfused with phosphate-buffered saline (PBS; pH = 7.4) followed by paraformaldehyde (PFA; 4% in PBS). Brains were extracted, fixed overnight in PFA, post-fixed for >48 h in sucrose (30% in PBS), and sectioned (50 μm) with a vibrating microtome. Z-stacks along the rostral/caudal axis of the NAc (A/P +2.5 through A/P +0.7) were collected with confocal microscopy (×20; Zeiss LSM 710), and neuronal localization of CTB-488, CTB-647, and hM4Di-mCherry was quantified using ImageJ (V1.49; NIH) and Adobe Illustrator (CC 2019).

Chemogenetic modulation of amphetamine-stimulated c-Fos

To confirm that activation of Gi/o- and Gq-coupled DREADDs modulates activity of NAc dMSNs and iMSNs, c-Fos was measured following DREADD activation in an amphetamine + novelty task that has previously been shown to induce c-Fos in both dMSNs and iMSNs [24]. Rats were injected with CNO (5 mg/kg, ip) and returned to their home cages. Thirty min later, rats were injected with d-amphetamine (2 mg/kg, ip), and placed into novel operant boxes. Fifty-five min later, rats were anesthetized and perfused, and brain sections were collected as described above. To visualize c-Fos, floating sections were washed (PBS; 3 × 10 min), blocked (0.25% Triton-X, 5% normal goat serum, PBS; 120 min), and incubated with a primary antibody (1:800 rabbit anti-c-Fos, 0.25% Triton-X, 2.5% normal goat serum, PBS; 24 h; Cell Signaling #2250S, RRID: AB_2247211). Sections were then washed (PBS; 3 × 10 min) and incubated with a secondary antibody conjugated to AF-647 (1:500 goat-anti-rabbit, 0.25% Triton-X, 2.5% normal goat serum, PBS; 120 min; Life Technologies #A32733, RRID: AB_2633282). Finally, sections were washed (PBS; 3 × 10 min), mounted on slides, and coverslipped with Vectashield mounting medium with DAPI (Vector Labs). Tri-color z-stacks (20x magnification) along the rostral/caudal axis of the NAc (A/P +2.7 through A/P +0.5) were collected via confocal microscopy and c-Fos + cells were counted by three researchers who were blinded to experimental conditions.

Chemogenetic modulation of addiction-like behaviors

To assess the role of NAc dMSNs and iMSNs in driving motivation and relapse in low- and high-risk rats, excitatory (Gq-coupled; hM3Dq) or inhibitory (Gi/o-coupled; hM4Di) DREADDs were transiently activated with CNO prior to PR and cue-induced reinstatement tests. Gq-DREADDs were used because they directly increase neuronal firing via calcium channel activation, rather than the more broad effects that Gs-DREADDs have on neuronal activity through changes in gene transcription, protein trafficking, etc [17]. In addition, Gs-DREADDs can have some constitutive activity [25, 26]. CNO and VEH injections were administered 30 min prior to test sessions in a pseudo-randomized, within-subject design. Additional days of SA (3) and extinction (6) were interleaved between PR and cue-induced reinstatement tests, respectively, to re-baseline behavior. Unpaired t-tests (VEH first vs VEH second) revealed no significant effect of session order on progressive ratio (t(75) = 1.56, p= 0.12) or cue-induced reinstatement of drug-seeking (t(75) = 1.61, p= 0.11). Following the completion of behavioral testing, rats were anesthetized and perfused, and brain slices were collected as described above. To confirm DREADD expression, floating sections were washed (PBS; 3 × 10 min), blocked (0.25% Triton-X, 5% normal goat serum, PBS; 120 min), and incubated with a primary antibody (1:400 rabbit anti-DsRed, 0.25% Triton-X, 2.5% normal goat serum, PBS; 24 h; Clontech #632496, RRID: AB_2737298). Sections were then washed (PBS; 3 × 10 min) and incubated with a secondary antibody conjugated to AF-568 (1:500 goat-anti-rabbit, 0.25% Triton-X, 2.5% normal goat serum; PBS, 120 min; Life Technologies #A11036, RRID: AB_10563566). Finally, sections were washed (PBS; 3 × 10 min), mounted on slides, and coverslipped with Vectashield mounting medium with DAPI. Z-stacks along the rostral/caudal axis of the NAc (A/P +2.7 through A/P +0.5) were collected via confocal microscopy to confirm viral expression, quantified as the percent of rats with expression in a particular subregion of the NAc and pseudo-colored to correspond with each chemogenetic experiment. Rats with no detectable DREADD expression in NAc or other areas in examined brain slices (e.g., dorsal striatum, prefrontal cortex) were analyzed as a viral “miss” control group to assess the effects of CNO in rats not expressing DREADDs.

Statistical analysis

Behavioral data were collected using automated procedures (MedAssociates), processed using custom-written Python scripts, and analyzed using Python (V3.6) and GraphPad Prism (V8.0). Addiction severity measures were compared between low- and high-risk rats using unpaired two-tailed t-tests. The relative severity distribution was compared between groups using a frequency distribution (range: −10 to +17, bin size = 1) fit with a nonlinear regression (Gaussian), and the overlap coefficient was calculated between distributions using R (V3.5). The relative contribution of each severity metric to overall severity classification was analyzed using principal component analysis (PCA) in Python: raw behavioral data were normalized, the corresponding covariance matrix and eigens were calculated, and data was replotted on the resulting two-dimensional space. Principal components with an eigenvalue >1 were considered significant contributors to overall variance (Tables S1, S2). c-Fos+ cell counts along the rostral-caudal axis of the NAc were averaged into a single value per rat, and c-Fos+ cells were analyzed using paired two-tailed t-tests (eGFP vs DREADD). Lever presses during IntA SA and extinction training (severity × session) and lever presses during PR and cue-induced reinstatement tests (severity × treatment) were independently analyzed for each chemogenetic experiment using two-way repeated-measures analysis of variance (RM-ANOVA) followed by Sidak post-hoc tests. The relationship between severity metrics and the effect size of each chemogenetic manipulation on motivation and relapse were independently analyzed using multiple linear regressions. Statistical significance for all analyses was set at p< 0.05, and data are shown throughout as individual subjects, mean ± SEM, or best-fit line with 95% confidence intervals.

Results

High-risk rats exhibit an addictive phenotype following heroin self-administration

Six behavioral metrics were normalized to z-scores: (1) intake, (2) consistency, (3) seeking, (4) motivation, (5) extinction, and (6) relapse (Fig. 1a), and combined to give an overall score which revealed addiction-resilient (i.e., low-risk; n= 43) and addiction-vulnerable (i.e., high-risk; n= 36) rats (Fig. 1b). Compared to low-risk rats, high-risk rats self-administered more heroin during IntA training (unpaired t-test: t(77) = 10.70, p< 0.0001), engaged in more bouts of drug-taking during drug-available blocks (unpaired t-test: t(77)  =  9.00, p< 0.0001), had higher levels of drug-seeking during cued drug-unavailable blocks (unpaired t-test: t(77) = 7.28, p< 0.0001), showed higher motivation to self-administer heroin during PR testing (unpaired t-test: t(77) = 8.24, p< 0.0001), exhibited higher levels of responding throughout extinction training (unpaired t-test: t(77) = 7.03, p< 0.0001), and had greater rates of heroin-seeking during cue-induced reinstatement (unpaired t-test: t(77) = 5.90, p< 0.0001; Fig. 1c–h). These behavioral differences resulted in largely non-overlapping addiction severity distributions between low-risk and high-risk rats (8.9% overlap; 95% CI: low [−4.1 to −3.2], high [3.4 to 4.0]). Nonetheless, there was variability within each subgroup for which severity metrics contributed to their classification and the number of positive severity metrics (low: 0.6 ± 0.1; high 4.7 ± 0.2; t(77) = 18.92, p< 0.0001; Fig. 1i, j). While the six metrics used for severity classification were all significantly correlated (Fig. S1), the relationship between each metric was highly variable, indicating that the metrics were capturing distinct features of addiction-like behavior (Fig. 1k). Dimensional reduction was performed on the severity metrics to identify the vectors accounting for the majority of the variance within the dataset (Table S1). Distribution along the first principal component (PC1) corresponded with classification into low-risk (PC1 < 0) or high-risk (PC1 > 0) for all rats (Fig. 1l). PC1 accounted for 61% of the cumulative variance and was positively correlated with all severity metrics, while PC2 accounted for 19% of all variance and was negatively correlated with intake and consistency but positively correlated with all other metrics (Table S2).

Fig. 1
figure 1

Heroin self-administration procedure and addiction severity classification. a Timeline of self-administration procedure and behavioral metrics used to classify addiction severity. Dark gray: periods of drug availability; light gray: periods of drug unavailability. (1) Intake: Total heroin intake; (2) Consistency: Percent of drug-available blocks with ≥1 infusion; (3) Seeking: Total active presses during drug-unavailable blocks; (4) Motivation: Breakpoint during PR; (5) Extinction: Total active presses during extinction; (6) Relapse: Total active presses during cue-induced reinstatement. b Severity classification: (i) raw behavioral data were converted to z-scores, (ii) z-scores were combined to give a cumulative severity score (SS), and (iii) individual rats were classified as low-risk (gray) or high-risk (green). ch Compared to low-risk rats, high-risk rats c self-administered more heroin during IntA SA, d had a higher number of drug-available blocks where they engaged in drug-taking, e pressed the drug-paired lever more during drug-unavailable blocks, f had higher breakpoints during a PR test, g pressed the drug-paired lever more during extinction sessions, and h during the cue-induced reinstatement test. i Classification produces largely non-overlapping severity distributions between groups. j High-risk rats scored positive on more severity metrics than low-risk rats. k Individual severity metrics are all significantly correlated (all p< 0.05), though the relationship between pairs of metrics are highly variable. l Principal component analysis of severity metrics, with individual rats and the original severity metrics (length = eigenvalue, angle = eigenvector) replotted on the resulting two-dimensional space. DA, drug-available; DU, drug-unavailable; IntA, intermittent-access; PC, principal component; PR, progressive ratio; SA, self-administration; ****p< 0.0001 (low vs high).

Anatomical tracing of NAc direct and indirect pathways

Reports have indicated that the segregation of MSN output pathways may not be as complete as originally described, with one study in mice finding that >90% of dMSNs send collaterals to the VP prior to synapsing in the VTA [27]. Thus, we first sought to determine whether DREADDs could be targeted selectively to MSNs projecting exclusively to the VTA (i.e., dMSNs) or the VP (i.e., iMSNs) in our rats using CTB labeling (Fig. 2a, c, d, f). For the dMSN experiment, we found that 49.1 and 45.7% of neurons were labeled as putative dMSNs (CTB-647+) and iMSNs (CTB-488+), respectively, while 5.1% of neurons were co-labeled with both fluorophores. Importantly, 27.7% of dMSNs were positive for DREADDs, in contrast to 0% of iMSNs (Fig. 2b). In contrast to what has been reported with mice [27], we did not detect any DREADD+ terminals within the VP, suggesting that DREADD-transfected dMSNs were not sending collaterals to the VP in the rats. For the iMSN experiment, we found comparable levels of distinct MSN labeling with 40.3% and 50.6% of neurons labeled as putative iMSNs (CTB-647+) and dMSNs (CTB-488+), respectively, and 9.1% of neurons co-labeled with both fluorophores. DREADD transfection in iMSNs was similar to dMSNs (28.5%), though a small number of dMSNs (<1%) were found to be positive for DREADD expression (Fig. 2e). Given these relative rates, transfection in the small number of dMSNs is unlikely to contribute significantly to alterations in circuit function or behavior.

Fig. 2
figure 2

Anatomical tracing of NAc direct and indirect pathways. a Viral strategy for anatomical tracing of NAc dMSNs. b Representative photomicrographs of neuronal labeling and viral expression. c Quantification of neuronal labeling. ~30% of putative dMSNs (magenta) and 0% of putative iMSNs (green) co-expressed DREADDs (red). d Viral strategy for anatomical tracing of NAc iMSNs. e Representative photomicrographs of neuronal labeling and viral expression. f Quantification of neuronal labeling. ~30% of putative iMSNs (magenta) and <1% of putative dMSNs (green) co-expressed DREADDs (red). Red arrow: DREADD expressed in cell type of interest; gray arrow: cell co-labeled with CTB-488 and CTB-647; n= 4 rats, scale bar = 100 μm.

DREADDs bidirectionally regulate activity of NAc dMSNs and iMSNs

To confirm that Gi/o- and Gq-DREADDs can modulate activity in NAc dMSNs and iMSNs, we examined the effects of DREADD activation on amphetamine + novelty-induced c-Fos in rats expressing DREADDs in one hemisphere (left NAc) and eGFP in the other (right NAc) (Fig. 3, S2). Inactivation of dMSNs via hM4Di significantly reduced (paired t-test: t(3) = 5.00, p= 0.0154) while activation of dMSNs via hM3Dq significantly enhanced (paired t-test: t(3) = 5.11, p= 0.0145; Fig. 3b, c) the number of NAc c-Fos+ cells. Similarly, inactivation of iMSNs via hM4Di significantly reduced (paired t-test: t(3) = 3.26, p= 0.0473) while activation of iMSNs via hM3Dq significantly enhanced (paired t-test: t(3) = 7.58, p= 0.0048; Fig. 3d, e) the number of c-Fos+ cells. Together with our tracing data, these findings demonstrate the ability to selectively target and modify the activity of dMSNs or iMSNs with excitatory or inhibitory DREADDs.

Fig. 3
figure 3

DREADDs bidirectionally alter amphetamine-stimulated c-Fos in dMSNs and iMSNs. a Timeline for chemogenetic modulation of amphetamine-stimulated c-Fos activation. b Viral strategy and representative histology for DREADD expression in dMSNs. CAV2-Cre was bilaterally infused into the VTA, DIO-eGFP was unilaterally infused in the right NAc, and DIO-hM4Di or DIO-hM3Dq was unilaterally infused in the left NAc. c Inactivation of dMSNs (via hM4Di) decreases while activation of dMSNs (via hM3Dq) increases the number of c-Fos+ cells in the NAc, relative to eGFP. d Viral strategy and representative histology for DREADD expression in iMSNs. Infusions were as in (b), except CAV2-Cre was infused into the VP. e Inactivation of iMSNs (via hM4Di) decreases while activation of iMSNs (via hM3Dq) increases the number of c-Fos+ cells in the NAc, relative to eGFP. n= 4/group; scale bar = 50 μm; *p< 0.05 (left NAc vs right NAc), **p< 0.01 (left NAc vs right NAc).

dMSN inactivation suppresses reinstatement but not motivation in high-risk rats

To test the hypothesis that an imbalance of signaling between NAc dMSNs and iMSNs drives motivation for heroin self-administration under a PR schedule of reinforcement and reinstatement of cued heroin-seeking, dual viral-mediated gene transfer was used to selectively express Gi/o- or Gq-DREADDs in each pathway. Given prior work in our lab showing a role for dMSNs in cue-induced reinstatement of cocaine-seeking but not motivation to take cocaine [20], we first sought to determine if transient chemogenetic inhibition of dMSNs had similar effects following heroin self-administration (Fig. 4b). As with the combined group data (Fig. 1), high-risk rats (n= 9) exhibited a greater addiction severity profile compared to low-risk rats (n= 9), as they self-administered more heroin (severity × session interaction: F(12,180) = 2.43, p= 0.0059), showed greater motivation for heroin during PR testing (main effect of severity: active presses, F(1,16) = 4.97, p= 0.0405; breakpoint, F(1,16) = 4.18, p= 0.0576), and had higher levels of active lever pressing during extinction training (severity × session interaction: F(14,210) = 2.80, p= 0.0007) and during cue-induced reinstatement (main effect of severity: F(1,16) = 12.92, p= 0.0024; Fig. 4c–f). Both low- and high-risk rats discriminated between active and inactive levers during IntA SA and extinction training (Fig. S3). Notably, increasing Gi/o signaling in NAc dMSNs had no effect on the motivation to self-administer heroin regardless of addiction severity (main effect of treatment: active presses, F(1,16) = 0.28, p= 0.60; breakpoint, F(1,16) = 0.14, p= 0.71; linear regression: r2 = 0.00, p = 0.98; Fig. 4d) but selectively attenuated reinstatement in high-risk rats (severity × treatment interaction: F(1,15) = 9.86, p= 0.0067), with the magnitude of attenuation significantly correlated with an individual’s severity score (linear regression: r2 = 0.22, p = 0.0372; Fig. 4f).

Fig. 4
figure 4

Transient modulation of dMSNs bidirectionally alters reinstatement but not motivation in high-risk rats. a Timeline for chemogenetic modulation of motivation under a PR schedule of reinforcement and cue-induced reinstatement of drug-seeking. b Viral strategy, representative histology, and quantification of expression for dMSN inactivation. cf Self-administration, progressive ratio, extinction, and reinstatement data for dMSN-hM4Di rats (low: n= 9; high: n= 9). d dMSN inactivation has no effect on motivation. Left: High-risk rats have greater responding for heroin than low-risk rats, which is unaffected by dMSN inactivation; right: dMSN-hM4Di has no effect on motivation, regardless of addiction severity. f dMSN inactivation selectively reduces reinstatement in high-risk rats. Left: High-risk rats have greater responding for heroin cues than low-risk rats, which is selectively attenuated by CNO; right: the effect size of dMSN-hM4Di significantly correlates with addiction severity. g Viral strategy, representative histology, and quantification of expression for dMSN activation. hk Self-administration, progressive ratio, extinction, and reinstatement data for dMSN-hM3Dq rats (low: n= 11; high: n= 6). i dMSN activation has no effect on motivation. Left: High-risk rats have greater responding for heroin than low-risk rats, which is unaffected by dMSN activation; right: dMSN-hM3Dq has no effect on motivation, regardless of addiction severity. (k) dMSN activation selectively enhances reinstatement in high-risk rats. Left: High-risk rats have greater responding for heroin cues than low-risk rats, which is further enhanced by CNO; right: the effect size of dMSN-hM3Dq significantly correlates with addiction severity. Scale bar = 500 μm; ac , anterior commissure; LV, lateral ventricle; #p< 0.05 (VEH vs CNO), ##p< 0.01 (VEH vs CNO).

dMSN activation exacerbates reinstatement but not motivation in high-risk rats

Given that increasing Gi/o-signaling in dMSNs was sufficient to attenuate reinstatement in high-risk rats, we next tested whether transient disruption of the balance between dMSNs and iMSNs by chemogenetic activation of dMSNs could drive reinstatement in low-risk rats (Fig. 4g). As with the other experiments, compared to low-risk rats (n= 11), high-risk rats (n = 6) self-administered more heroin (severity x session interaction: F(12,180) = 5.23, p< 0.0001), had greater motivation for heroin during PR testing (main effect of severity: active presses, F(1,15) = 38.73, p< 0.0001; breakpoint, F(1,15) = 23.18, p= 0.0002), and had higher responding during extinction sessions (severity × session interaction: F(14,210) = 3.96, p< 0.0001) and during cue-induced reinstatement of drug-seeking (main effect of severity: F(1,15) = 9.20, p= 0.0084; Fig. 4h–k). Consistent with the inhibition experiment, chemogenetic activation of NAc dMSNs had no effect on the motivation to self-administer heroin regardless of addiction severity (main effect of treatment: active presses, F(1,15) = 1.50, p= 0.24; breakpoint, F(1,15) = 0.21, p= 0.65; linear regression: r2 = 0.0649, p = 0.28; Fig. 4i), but selectively enhanced reinstatement in high-risk rats (severity × treatment interaction: F(1,15) = 5.41, p= 0.0344), exacerbating reinstatement proportionally to individual severity scores (linear regression: r2 = 0.2122, p = 0.0410; Fig. 4k).

iMSN activation attenuates reinstatement but not motivation in high-risk rats

Given the well-established role for oppositional control of behavioral output by NAc dMSNs and iMSNs, we next sought to determine whether chemogenetic activation of iMSNs would have similar effects following IntA heroin SA to inactivation of dMSNs (Fig. 5b). Much like what we found with whole group analysis, high-risk rats (n= 8) developed a more addictive phenotype than low-risk rats (n= 6) (self-administration, severity x session interaction: F(12,144) = 1.97, p= 0.0313; PR active presses, main effect of severity: F(1,12) = 10.98, p= 0.0062; PR breakpoint, main effect of severity: F(1,12) = 11.67, p= 0.0051; extinction, severity × session interaction: F(14,168) = 3.02, p= 0.0004; cue-induced reinstatement, main effect of severity: F(1,12) = 23.81, p= 0.0004; Fig. 5c-f). Similar to our findings with inhibition of dMSNs, activation of NAc iMSNs had no effect on the motivation to self-administer heroin in high- or low-risk rats (main effect of treatment: active presses, F(1,12) = 0.43, p= 0.52; breakpoint, F(1,12) = 0.54, p= 0.48) (linear regression: r2 = 0.0027, p = 0.84; Fig. 5d); however, the manipulation selectively reduced reinstatement in high-risk rats (severity x treatment interaction: F(1,12) = 20.15, p= 0.0007), with the degree of reduction significantly correlating with addiction risk (linear regression: r2 = 0.2671, p = 0.0404; Fig. 5f). Together, these findings imply a critical role for NAc dMSNs and iMSNs in encoding drug/cue relationships that drive reinstatement of drug-seeking in high-risk individuals.

Fig. 5
figure 5

Transient modulation of iMSNs bidirectionally alters reinstatement but not motivation in high-risk rats. a Timeline for chemogenetic modulation of motivation under a PR schedule of reinforcement and cue-induced reinstatement of drug-seeking. b Viral strategy, representative histology, and quantification of expression for iMSN activation (low: n= 6; high: n= 8). cf Self-administration, progressive ratio, extinction, and reinstatement data for iMSN-hM3Dq rats. d iMSN activation has no effect on motivation. Left: High-risk rats have greater responding for heroin than low-risk rats, which is unaffected by iMSN activation; right: CNO has no effect on motivation, regardless of addiction severity. f iMSN activation selectively reduces reinstatement in high-risk rats. Left: High-risk rats have greater responding for heroin cues than low-risk rats, which is selectively attenuated by CNO; right: the effect size of iMSN-hM3Dq significantly correlates with addiction severity. g Viral strategy, representative histology, and quantification of expression for iMSN inactivation (low: n= 9; high: n= 8). hk Self-administration, progressive ratio, extinction, and reinstatement data for iMSN-hM4Di rats. i iMSN inactivation has no effect on motivation. Left: High-risk rats have greater responding for heroin than low-risk rats, which is unaffected by iMSN inactivation; right: CNO has no effect on motivation, regardless of addiction severity. k iMSN inactivation selectively enhances reinstatement in high-risk rats. Left: High-risk rats have greater responding for heroin cues than low-risk rats, which is selectively enhanced by CNO; right: the effect size of iMSN-hM4Di significantly correlates with addiction severity. Scale bar = 500 μm; ac,  anterior commissure; #p< 0.05 (VEH vs CNO), ###p< 0.001 (VEH vs CNO).

iMSN inactivation exacerbates reinstatement but not motivation in high-risk rats

To complement these findings, we examined the effect of transient chemogenetic inhibition of iMSNs following heroin self-administration (Fig. 5g). Consistent with our severity classification group data, high-risk rats (n= 8) developed a greater addiction-like behavioral profile compared to low-risk rats (n= 9) (self-administration, severity × session interaction: F(12,180) = 2.06, p= 0.0214; PR active presses, main effect of severity: F(1,15) = 26.57, p= 0.0001; PR breakpoint, main effect of severity: F(1,15) = 39.86, p< 0.0001; extinction, severity x session interaction: F(14,210) = 3.83, p< 0.0001; cue-induced reinstatement, main effect of severity: F(1,15) = 15.48, p= 0.0013; Fig. 5h–k). While increasing Gi/o-signaling in iMSNs had no effect on the motivation to self-administer heroin in either high- or low-risk rats (main effect of treatment: active presses, F(1,15) = 0.07, p= 0.80; breakpoint, F(1,15) = 0.12, p= 0.74; linear regression: r2 = 0.0979, p = 0.19; Fig. 5i), it did selectively enhance reinstatement in high-risk rats (severity × treatment interaction: F(1,15) = 7.62, p= 0.0146), with the effect size significantly correlating with individual severity scores (linear regression: r2 = 0.383, p = 0.0047; Fig. 5k). Together, these findings suggest that while an imbalance of signaling between dMSNs and iMSNs is likely to contribute to reinstatement of drug-seeking in vulnerable individuals, artificially driving an imbalance is incapable of pushing resilient individuals to reinstate drug-seeking.

CNO administration has no effects in rats lacking DREADD expression

Recent reports have found that CNO activation of DREADDs occurs indirectly, via retroconversion to clozapine [28]. To ensure that our results were not due to off-target effects, we examined the effects of CNO on motivation and cue-induced reinstatement in rats that had no detectable DREADD expression (viral “miss” rats). This was a heterogeneous group, with similar numbers of direct (n = 6) and indirect (n = 7) pathway misses (Fig. S5a). Consistent with other groups, high-risk rats (n= 5) developed a greater addiction-like behavioral profile compared to low-risk rats (n= 8) (self-administration, severity x session interaction: F(12,132) = 3.11, p= 0.0007; PR active presses, main effect of severity: F(1,11) = 20.67, p= 0.0008; PR breakpoint, main effect of severity: F(1,11) = 28.15, p= 0.0003; extinction, severity x session interaction: F(14,154) = 8.95, p< 0.0001; cue-induced reinstatement, main effect of severity: F(1,11) = 10.34, p= 0.0082; Fig. S5b–e). Importantly, CNO administration had no effect on motivation (main effect of treatment: active presses, F(1,11) = 0.31, p= 0.59; breakpoint, F(1,11) = 0.65, p= 0.44; linear regression: r2 = 0.001, p = 0.90) or cue-induced reinstatement of heroin-seeking (main effect of treatment: F(1,11) = 0.85, p= 0.38; linear regression: r2 = 0.01, p = 0.73) in high- or low-risk rats (Fig. S5c, e). These results indicate that the observed behavioral effects in the other groups were likely due to the direct actions of CNO on DREADD receptors.

Discussion

Using a viral DREADD approach, we show that NAc dMSNs and iMSNs have opposing roles in driving heroin-seeking in high-risk rats. These findings lend support to the hypothesis that an imbalance in signaling between these cells underlies addictive behaviors. Specifically, they suggest that heroin use in susceptible individuals likely leads to an increase in direct pathway activity and a concomitant decrease in indirect pathway activity. Nonetheless, given that heroin-seeking could be bidirectionally modulated by both cell types, it is unlikely that changes to direct pathway function were maximal or that there was a complete loss of indirect pathway function in high-risk rats. This is not particularly surprising as iMSNs are generally more excitable and have a greater event frequency under basal conditions than dMSNs [29, 30]. Additionally, we demonstrate that although transient alterations in signaling activity in either pathway are sufficient to both attenuate and exacerbate cue-induced reinstatement of heroin-seeking in high-risk rats, reinstatement behavior was surprisingly unmodifiable in low-risk rats. Of note, chemogenetic modulation of dMSNs and iMSNs had no effect on the motivation to self-administer heroin across phenotype, indicating a divergence in the neural circuits responsible for encoding drug-seeking and drug-taking behaviors.

NAc MSNs regulate cue-induced reinstatement of heroin-seeking

The NAc plays a critical role in regulating cue-induced reinstatement of seeking across many types of rewards, including heroin [31], cocaine [32,33,34,35], methamphetamine [36], alcohol [37], and food [35, 38]. Moreover, dMSNs and iMSNs have been shown to play oppositional roles in the rewarding and sensitizing effects of cocaine and amphetamine [23, 39, 40]. There has been no work, however, examining the role of these cell types in opioid addiction. Our results show that both cell populations modulate cue-induced reinstatement of heroin-seeking, complementing and expanding previous work showing a similar role for dorsal striatal dMSNs in cued-reinstatement of cocaine-seeking [20]. More surprisingly, though, these results were selective for high-risk rats, suggesting that their role in the encoding of drug cues is specific to conditions that promote the development of addiction-like behaviors. These findings are consistent with previous work demonstrating that synaptic strengthening of NAc iMSNs is enhanced in addiction-resilient mice following cocaine self-administration [23], and suggest a common role for iMSNs in encoding vulnerability to cue-induced reinstatement of drug-seeking. Notably, however, that same study found that chemogenetic inhibition of NAc iMSNs increased the motivation to self-administer cocaine [23], following previous work showing a role for iMSNs in cocaine-taking [41,42,43]. Additionally, using a polydrug procedure and behavioral economics, we have recently found that rats have greater motivation for cocaine than heroin [44]. This suggests that psychostimulants and opioids differentially engage motivational circuits, likely due to their unique pharmacological properties (e.g., cocaine increases while morphine decreases cFos activation in iMSNs [45, 46]). Thus, while dMSNs and iMSNs appear to play a common role in the encoding of drug-seeking, there may be a divergence in the encoding of drug-taking across classes of drugs.

The NAc receives strong glutamatergic input from the prelimbic (PrL) and infralimbic (IL) regions of the medial prefrontal cortex (mPFC), and previous work has found that mPFC, as well as its projections to the NAc, play a critical role in the encoding of cue value and cue-induced reinstatement of drug-seeking [36, 47,48,49]. Thus, given that modulating activity of subpopulations of NAc MSNs had no effect on reinstatement of drug-seeking in low-risk rats, it is possible that the cue associations formed in mPFC to NAc projections of these rats were insufficiently strengthened during heroin self-administration. The PrL and IL differ in their projection patterns to the NAc, with the PrL preferentially innervating the central NAc core and the IL preferentially innervating the surrounding NAc shell. These divergent signaling pathways are generally thought to encode different types of drug-seeking, with the core encoding cue-induced [5, 31, 50, 51] and the shell encoding context-induced [9, 31, 52] reinstatement. However, cue-induced reinstatement of cocaine [53, 54] or ethanol [55, 56] induces c-Fos activation in both the core and shell, and inactivation of either core or shell attenuates cue-induced reinstatement of sucrose-seeking [57]. Additionally, blockade of AMPA receptors in the IL [58] or inhibition of CaMKII activity in the shell [59] blocks cue-induced reinstatement of heroin- or morphine-seeking, respectively, indicating a role for the shell in reinstatement of opioid-seeking. Of note, there is extensive local connectivity between MSNs in the NAc [10], so manipulations of either core or shell are likely to alter overall signaling within the NAc. Consistent with these studies, we found no differences between rats with DREADD expression in both subregions compared to those with expression limited to the core or shell, though additional work specifically investigating the role of these subregions in cue-induced reinstatement of heroin-seeking will add clarity to our understanding of how the C-BG circuit encodes heroin cues.

Experimental caveats and limitations

The phenotyping approach was designed to capture baseline (i.e., genetic) differences in vulnerability, although with any procedure in outbred animals it is difficult to rule out interactions between genetic and environment factors. In addition, there were no significant differences between high- and low-risk rats in active lever presses during early IntA SA (Fig. 1c–e), and there were no significant differences in inactive presses between groups at any point (Fig. S3). Thus, these data suggests that the differences between low- and high-risk rats are likely due to differences in preference for the amount and/or pattern of heroin that is consumed, rather than differences in learning rates. Nonetheless, it is possible that the DREADD effects were selective for rats with high levels of habitual responding (due to more stimulus-response pairings as a result of higher heroin intake).

It is worth noting that CNO has been shown to activate DREADDs via retro-conversion to clozapine, an atypical antipsychotic with diverse pharmacology [28]. Importantly, while high doses of CNO are capable of having off-target behavioral effects [28, 60], the dose of CNO used in these studies was chosen based on previous work in our lab and others showing no behavioral effects in control rats [20, 39, 61,62,63,64,65,66,67,68,69,70]. Moreover, we found that the effects of CNO were both severity- (high-risk but not low-risk) and behavior- (relapse but not motivation) dependent. In addition, CNO had no effect on inactive lever pressing (Fig. S4), nor in rats that lacked DREADD expression. Although the sample size of these viral “miss” groups is relatively small, the difference between low- and high-risk rats in behaviors contributing to the severity score was comparable to that from the groups expressing DREADDs. Thus, it is unlikely that our observed effects on reinstatement in high-risk rats are due to non-DREADD mediated effects.

Conclusions

In summary, these data support a critical—and opposing—role for the direct and indirect pathways of the NAc in driving cue-induced reinstatement of heroin-seeking in high-risk rats. Interestingly, altering NAc output was unable to change reinstatement in low-risk rats, suggesting that NAc regulation of drug-seeking may be dependent on the strength of the associations that are formed between drugs and cues. Future work will investigate the role of discrete projections to the NAc (e.g., PrL, IL) in vulnerability to drug-seeking, since these projections are thought to regulate the consolidation of cue/drug relationships during initial drug use [48, 50, 71].

Funding and Disclosure

This work was supported by grants from the National Institute on Drug Abuse (F31DA047012 to T.J.O., R01DA036582 to S.M.F., and P30 DA048736 to Dr. Charles Chavkin) and the National Institute of Neurological Disorders (T32NS99578 to Drs. Jane Sullivan and Rachel Wong).