INTRODUCTION

Since the early 1990s, the drug addiction field has given much attention to uncovering the mechanisms of craving and relapse using the extinction/reinstatement animal model (Bossert et al, 2005). This procedure allows researchers to examine individual factors known to elicit drug-seeking behavior and can identify potential interventions aimed at attenuating drug craving. The face validity of the model, particularly with cue reinstatement, is demonstrated by the corresponding increase of craving reported by human addicts when presented with cocaine-related stimuli (Childress et al, 1988; Ehrman et al, 1992) and the return of drug-seeking behavior in animals when presented with cues previously paired with cocaine reinforcement (Markou et al, 1993; Shaham et al, 2003). Both of these measures are thought to reflect incentive motivational effects of the cues acquired through classical conditioning (Stewart, 1983). Further evidence supporting the construct validity of the model are that neuroimaging studies in human addicts and examination of immediate early gene (IEG) expression in animals that reveal the same brain regions are activated by cocaine-paired cues (Childress et al, 1999; Garavan et al, 2000; Grant et al, 1996; Kufahl et al, 2009; Wang et al, 1999; Zavala et al, 2008). In addition, compounds found to decrease self-reports of craving in humans have been shown to attenuate cue-elicited drug-seeking behavior in animals (Fuchs et al, 1998; Yahyavi-Firouz-Abadi and See, 2009), demonstrating predictive validity and overall utility of the cue reinstatement model.

Fos protein expression is an established method for identifying brain circuits associated with cocaine-induced conditioning effects (Brown et al, 1992; Crawford et al, 1995; Neisewander et al, 2000). Fos is a product of the IEG c-fos, an inducible transcription factor important for the initiation of many signal transduction pathways (Curran and Morgan, 1995). When an animal undergoes physiological and/or pharmacological manipulations, c-fos is transiently induced in relevant brain areas, with the resulting Fos protein expression yielding a histological marker of stimulus-induced brain activity (Chaudhuri, 1997; Harlan and Garcia, 1998). In response to cocaine or cocaine-paired cues, Fos is expressed in brain regions commonly associated with reward processing, memory, and drug abuse (Ciccocioppo et al, 2001; Hotsenpiller et al, 2002; Neisewander et al, 2000). More recently, we found that presentation of response-contingent cocaine-paired cues during reinstatement testing induced a widespread pattern of Fos expression throughout the brain, including several prefrontal cortical, limbic, and striatal subregions (Kufahl et al, 2009). Although the association between cue presentation and brain activation appears to rely on previous drug–stimulus pairings (Guo et al, 2008; Miller and Marshall, 2005), little attention has been given to the contribution of novelty on test day. While cue-elicited reinstatement of cocaine-seeking behavior depends to some extent on the sensory characteristics of the cues (See et al, 1999), the propensity for animals to be aroused or motivated by environmental or procedural novelty could also contribute strongly to both behavior and Fos expression (Badiani et al, 1998; Bardo et al, 1989, 1996).

The purpose of this study was to investigate the specificity of both the reinstatement of cocaine-seeking behavior and its associated Fos expression. In Experiment 1, rats were trained to self-administer cocaine that was paired with presentation of either an oscillating light or tone, followed by extinction training where lever presses had no consequences. Half of the rats were then tested for reinstatement of cocaine-seeking behavior with conditioned cues and half were tested with a novel cue (that is, a cue not present during training). We hypothesized that novel cues would not reinstate cocaine-seeking behavior, but would contribute to cue-induced Fos expression in some brain regions. To control for non-motivational contributions of cue exposure to the Fos response, control rats were given saline infusions paired with cues that were yoked to reinforcement delivery in the cocaine-trained rats. In Experiments 2 and 3, we examined the specificity of the findings for drug seeking by examining rats trained with sucrose or cue-only reinforcement.

MATERIALS AND METHODS

Animals

Male Sprague–Dawley rats were housed individually in a temperature-controlled colony with a 12-h (Experiment 1) or a 14-h (Experiments 2/3) reversed light/dark cycle and were briefly handled daily. Animal care and housing conditions were consistent with the 1996 Guide for Care and Use of Laboratory Animals (Clark et al, 1997). Surgical and experimental procedures were approved by the Institutional Animal Care and Use Committee at Arizona State University. Rats weighed 250–325 g before surgery or training.

Experiment 1: Reinstatement and Fos Expression Elicited by Cocaine-Conditioned vs Novel Cues

Surgery. Catheters were constructed from Silastic tubing (10 cm in length, inner diameter 0.012 inches, outer diameter 0.025 inches, Dow Corning, Midland, MI) connected to a 22-G nonferrous metal cannula encased within a plastic screw connector (Plastics One, Roanoke, VA) and implanted into the jugular vein and secured to the skull with dental acrylic cement under isoflurane anesthesia (2–3%) as described previously (Zavala et al, 2007). Incisions were sutured and treated with a topical antibiotic and the rats were given buprenorphine (0.05 mg/kg, s.c.) analgesia and an anti-inflammatory analgesic (meloxicam; 1 mg/kg, s.c.). Patency of the catheters was maintained throughout the experiment by daily flushing with 0.1 ml timentin (66.67 mg/ml; bioWORLD, Dublin, OH) in saline solution containing 70 units/ml heparin sodium. Catheter patency was tested periodically with 0.8 g methohexital sodium (Brevital, Sigma, St Louis, MO), a dose that produces rapid loss of muscle tone only when administered i.v. Rats were given 7 recovery days in their home cages and were handled and weighed daily.

Apparatus. Training and testing were conducted in Plexiglas operant conditioning chambers (20 × 28 × 20 cm) equipped with a food pellet dispenser and a food well located between two levers mounted on the front panel (Med Associates, St Albans, VT). A cue light was mounted above one lever, a tone generator (500 Hz, 10 dB above background noise) was mounted on the side wall near a lever and a house light was mounted on the rear wall opposite the levers. The lever below the cue light and nearest to the tone generator was designated as the active lever. Each conditioning chamber was housed within its own ventilated, sound-attenuating cabinet. An infusion pump containing a 10-ml syringe was located outside of the cabinet. Tygon tubing connected to the syringe was attached to a liquid swivel (Instech, Plymouth Meeting, PA) suspended above the operant conditioning chamber. The outlet of the swivel was fastened to the catheter via Tygon tubing that ran through a metal spring leash (Plastics One). The leash was fastened onto the plastic screw of the cannula that was anchored on the animal's head.

Self-administration (SA) training. SA training was conducted during daily 2-h sessions. Rats were trained to press the active lever to receive cocaine reinforcement (0.75 mg/kg/0.1 ml, i.v.). Upon completing a schedule of reinforcement, either the oscillating cue light or tone was activated, followed 1 s later by activation of the infusion pump for 6 s. Following infusion, the cue light or tone was inactivated. After a 20-s timeout period, responses on the active lever were accumulated toward the next reinforcement schedule. Note that rats receiving the tone vs light cues were trained in separate rooms so that the tone would be completely novel to animals that were trained with the light cue.

All animals began on a fixed ratio (FR) 1 schedule of reinforcement with the capability to progress to a variable ratio (VR) 2, VR3, and finally VR5 schedule. After ending the session on a VR5 schedule for two consecutive days, the animals then began the remaining sessions on a VR5 schedule. All animals were restricted to 16 g/day of food beginning 2 days before training to facilitate acquisition of SA (Carroll et al, 1981) with a graded increase (18, 20, and 22 g/day) as they progressed from an FR1 to a VR5 schedule of reinforcement. Animals were then given food ad libitum for the rest of the experiment. Note that all animals were on a VR5 schedule exclusively for at least the last 5 days of SA (that is, the acquisition criteria). Control rats received saline infusions, cue presentations, and underwent identical food restriction as their cocaine partner throughout SA.

Extinction training. The day after SA training was completed, rats were given daily 90-min extinction sessions across 14–15 days. During these sessions, lever presses had no scheduled consequences; both cocaine and saline-yoked rats were connected to the swivel but no infusions or light/tone cues were delivered. At the end of extinction training, all rats exhibited a decrease in response rates on the active lever to <20 responses during a session or to 20% of the peak response rate that occurred during extinction training. Responding during the terminal extinction session was used as a baseline for statistical comparison with responding on the test day.

Test for reinstatement of cocaine-seeking behavior. Cocaine-seeking behavior was operationally defined as responses on the active lever in the absence of cocaine reinforcement. On the day after the final extinction session, rats were placed in their conditioning chambers for 90 min. The test session length was chosen for optimal expression of stimulus-induced Fos protein expression (Moratalla et al, 1993).

The cocaine rats trained with either the light or the tone cue were assigned to either a novel or a CS test cue condition (4 groups, n=7–9/group). For rats in the CS test cue condition, the cocaine-paired stimulus complex (that is, oscillating cue light or tone and pump motor) was delivered on an FR1 schedule of reinforcement. For rats in the novel test cue condition, the stimulus was the oscillating tone (without the pump motor) for rats trained with the oscillating light, or vice versa, also delivered on an FR1 schedule. Responses on the active lever by saline-yoked controls had no consequences and they received yoked presentations of the same cue as their cocaine-trained counterpart (4 groups, n=3–5/group).

Tissue preparation and Fos protein immunohistochemistry. Immediately following reinstatement testing, rats were deeply anesthetized with sodium pentobarbital (100 mg/kg, i.p.), perfused, and their brains were harvested and post-fixed as described previously (Zavala et al, 2007). The brains were later sectioned using a sliding microtome (Microm International, Walldorf, Germany) connected to a freezing stage (Physitemp, Clifton, NJ). Serial coronal 40-μm sections were collected, separated by 120 μm, centered at anatomical locations corresponding to bregma +3.2, +1.6, −2.56, and −5.6 mm (Paxinos and Watson, 1998). Tissue sections were stored at 4 °C in a cryoprotectant solution.

Sections were labeled with rabbit polyclonal anti-Fos serum (1 : 5000 dilution; sc-52, Santa Cruz Biotechnology, Santa Cruz, CA) followed by biotinylated goat anti-rabbit IgG antibody (1 : 500 dilution; Vector Laboratories, Burlingame, CA), which was visualized using 0.02% 3,3′-diaminobenzidine (Sigma-Aldrich, St Louis, MO) as described previously (Thiel et al, 2010). Sections were then mounted onto gelatin chromium-coated slides, air-dried, dehydrated, and coverslipped for light-microscopic inspection.

Fos immunoreactivity analysis. Fos immunoreactive cells were analyzed using a Nikon Eclipse E600 (Nikon Instruments, Melville, NY) microscope set at 20 × magnification and counted blind to treatment conditions using ImageJ software. The anatomical boundaries of each region were determined using a rat brain atlas (Paxinos and Watson, 1998) as illustrated in Figure 2a. Sections taken at +3.2 mm from bregma contained the Cg1 region of the anterior cingulate cortex (Cg1). Sections taken at +1.6 mm from bregma contained the Cg2 region of the anterior cingulate cortex (Cg2), dorsomedial caudate/putamen (dmCPu), dorsolateral caudate/putamen (dlCPu), nucleus accumbens shell (NAcS), and nucleus accumbens core (NAcC). Sections taken at −2.56 mm from bregma included the basolateral amygdala (BlA). Sections taken at −5.6 mm from bregma included the ventral tegmental area (VTA). The sections were taken such that the rostral-caudal extent of each region of interest was sampled (340 μm). Fos immunoreactivity was counted and identified by black oval-shaped nuclei (Figure 2c/d). Each region was analyzed using both hemispheres from three tissue sections from each animal. The area of each sample measure was 0.26 mm2 and the counts from all the sample areas from a given region were averaged and scaled to provide a mean number of Fos-positive cells per mm2.

Experiments 2 and 3: Reinstatement after Training with Sucrose or Cue Reinforcement Only

Rats in Experiment 2 received response-contingent cue presentations (either the light or the tone) paired with delivery of a sucrose pellet (45 mg, Bio-Serv, Frenchtown, NJ), whereas rats in Experiment 3 received response-contingent cue presentations with no other reinforcer during training. The latter served to examine the reinforcing effects of the cues themselves. The procedures used in these experiments were identical to those used in Experiment 1 with the following exceptions: (1) rats in Experiment 2 received approximately 30 sucrose pellets in their home cage for three days before sucrose reinforcement training in order to familiarize them with the pellets; (2) no surgery was performed on the rats from either experiment; and (3) the training, extinction, and test sessions in both experiments were only 30 min in duration to avoid satiation found with longer training sessions (Bizo et al, 1998) and to attain a comparable number of cue presentations per session as Experiment 1. In addition, rats in Experiment 3 were matched to sucrose rats when they transitioned from food-restriction to ad libitum access in the home cage.

Statistical Analysis

Reinforcement rates, active and inactive lever responses during extinction, and Fos-labeled cells were analyzed by analysis of variances (ANOVAs) with training cue (light, tone), test cue condition (novel, CS), test cue modality (light, tone), and drug history (cocaine, saline) as a between-subjects factors where appropriate and session as a within-subjects factor where appropriate. Analysis of covariance (ANCOVA) was conducted for test-day data with the change in inactive lever presses from baseline to test day as the covariate to account for non-specific increases in lever responses on test day. Significant interactions and main effects were followed by smaller factorials, tests for simple effects, and post hoc tests (Bonferroni or Tukey) where appropriate. Additionally, the correlation between test-day responses on the active lever and Fos expression in the cocaine-trained rats was calculated using the Pearson product-moment correlation (r) for the novel and CS groups in Experiment 1.

RESULTS

All descriptive statistics are reported as mean±standard error of the mean.

Experiment 1

Cocaine SA. The 16 light-trained cocaine rats received a total of 504±46 cocaine infusions and the 18 tone-trained cocaine rats received a total of 413±32 cocaine infusions during SA training. This difference in infusion rates was due to the rapid acquisition of operant responding in the light-trained rats (Figure 1a). A significant interaction between session and training cue was found (F4, 117=2.56, p<0.05), where tests of simple effects revealed that on the first four days of SA, the light-trained groups achieved more infusions than the tone-trained groups (Figure 1a; ANOVAs, ps<0.05). However, analysis of total infusions across all sessions did not differ between the two training cue groups (Figure 1a inset; t-test, t32=1.66, p=0.11).

Figure 1
figure 1

Cocaine reinforcement and seeking behavior from Experiment 1. (a) Daily cocaine reinforcement rates (infusions±SEM) across sessions where infusions were paired with either a light (circles; n=16) or tone (squares; n=18). Inset graph shows the total number of cocaine infusions (±SEM) earned during self-administration. (b) Responses on the active lever (±SEM) across extinction sessions in animals that had previously received either a light or tone cue paired with cocaine reinforcement during training; responses on the active lever produced no scheduled consequences during extinction. (c) Estimated marginal means for responses on the active lever/90 min (±SEM), with the change in inactive lever presses from baseline to test day as the covariate, for groups tested with novel (open symbols, dashed lines) or conditioned (closed symbols, solid lines) light (circles) or tone (squares) cues (n=7–9/group). ^ Represents a significant difference from tone-trained group (tests of simple effects, p<0.05). *Represents a difference from baseline responding (ANOVA, p<0.05). +Represents a difference between test cue modalities (light vs tone; ANOVA, p<0.05).

PowerPoint slide

Extinction and reinstatement of cocaine-seeking behavior. Extinction training significantly reduced responding across sessions (Figure 1b), where a main effect of session was found on the active (F3, 82=33.26, p<0. 001) and inactive (F4, 136=3.38, p<0.05) levers and there were no group differences in response rates. Post hoc analyses revealed that last-day extinction responses on both levers were significantly reduced from first-day responses (Bonferroni correction, ps<0.025). No significant effects were found on the active or inactive lever responses in the saline-yoked groups (Table 1).

Table 1 Mean (±SEM) Active and Inactive Lever Response Rates of Rats During Training, Extinction, and Reinstatement Test Sessions in Experiments 1 (Cocaine and Saline-Yoked), 2 (Sucrose), and 3 (Cues Only)

To assess reinstatement of cocaine-seeking behavior, we compared responses on the active lever during testing with responses on the active lever during the last extinction session (that is, baseline), with day as a within-subjects factor and test cue condition and test cue modality as between-subjects factors. As the covariate (change in inactive lever presses from baseline to test day) was measured after the manipulations, we tested the homogeneity of regression and found the interaction between the covariate and test cue condition and/or modality to be non-significant (ranged from F1, 29=0.29 to 0.83, ps>0.05), thereby justifying the use of ANCOVA. In contrast to our predictions, no day × test cue condition interaction was found (F1, 29=0.87, p=0.36), indicating that rats responded similarly to conditioned and novel stimuli during testing. An interaction between day and test cue modality was found (F1, 29=7.52, p<0.05), where tests for simple effects of day revealed that rats tested with both cue modalities reinstated lever pressing compared with their extinction baseline (Figure 1c; ANOVAs, ps<0.001), but analysis of group effects on the test day revealed differences in the magnitude of reinstatement where rats tested with a light had higher response rates than tone-tested rats (F1, 31=7.65, p<0.01; Figure 1c). No group differences were found during baseline (F1, 31=0.01, p=0.99). No interactions or main effects of responding on the active lever were observed for saline-yoked groups (Table 1).

Fos immunoreactivity. Rats in the cocaine groups exhibited significantly more Fos-immunoreactive nuclei than rats in the saline-yoked groups in all of the brain regions analyzed, with the exception of the VTA (Table 2). This enhancement was evident as a significant main effect of drug history in the dmCPu, dlCPu, NAcC, NAcS, and BlA (Figure 2d; ranged from F1, 45=7.20 to 18.09, ps<0.05). No other significant main effects or interactions were found in these regions, implying that Fos expression was enhanced in the cocaine-trained animals, regardless of cue conditioning or modality. In the Cg1 region of the anterior cingulate cortex, a significant main effect of drug history was also found (F1, 40=9.45, p<0.01), as well as a significant drug history × test cue modality interaction (F1, 40=4.32, p<0.05). Subsequent ANOVAs for the two drug conditions revealed that saline-yoked control rats tested with a light cue expressed significantly more Fos-positive nuclei than the rats tested with a tone (Figure 2e; F1, 16=6.84, p<0.05). No such difference was found in the cocaine-trained groups (F1, 32=1.05, p=0.31). In the Cg2 region of the anterior cingulate cortex, a significant main effect of drug history was also found (Figure 2d; F1, 40=9.33, p<0.01), as well as a significant drug history × test cue condition × test cue modality interaction (F1, 40=4.32, p<0.05). Subsequent ANOVAs for the two drug conditions revealed a test cue condition × test cue modality interaction in the cocaine-exposed rats (F1, 28=7.14, p<0.05), where post hoc analyses revealed a trend toward a higher number of Fos-positive nuclei in the novel light compared with the CS light group (Table 2; Tukey, p=0.09), which likely contributed to the interaction. No interactions or main effects were found in the Cg2 of the saline-yoked animals.

Table 2 Mean (±SEM) Number of Fos-Positive Nuclei for Each Region in Experiment 1
Figure 2
figure 2

Fos protein expression data from Experiment 1. (a) Schematic representations illustrating the regions analyzed in brain sections taken at +3.2, +1.6, −2.56, and −5.8 mm from bregma (Paxinos and Watson, 1998). The numbered regions are as follows: (1) Cg1 region of the anterior cingulate cortex (Cg1); (2) Cg2 region of the anterior cingulate cortex (Cg2); (3) dorsomedial caudate/putamen (dmCPu); (4) dorsolateral caudate/putamen (dlCPu); (5) nucleus accumbens core (NAcC); (6) nucleus accumbens shell (NAcS); (7) basolateral amygdala (BlA); and (8) ventral tegmental area (VTA). (b) Representative photomicrographs at × 10 magnification of Fos protein expression in the NAcC with a dashed rectangle indicating the sample region. (c) NAcC samples taken at × 20 magnification from a representative subject/group with arrows indicating Fos-positive nuclei. All sample areas were 0.26 mm2 and all photomicrographs were taken at × 20 magnification with the scale bar equal to 100 μm. (d) Number of Fos-positive nuclei/mm2 (±SEM) in brain regions that exhibited enhanced Fos expression in rats with a history of cocaine self-administration (n=32) compared with saline controls (n=16) in Experiment 1. Data are grouped by drug history. (e) Fos expression (±SEM) in the Cg1 grouped by drug history and test cue modality (n=7–18/group). Scatter plot of dmCPu for the cocaine (f) novel (n=15) and (g) the CS (n=16) test cue groups with Fos expression vs test session cocaine-seeking behavior. Fos-positive nuclei/mm2 and test session responding on the active lever were centered to where ‘0’ represents the mean of each test cue modality in order to remove group differences. The line represents a best-fit linear relationship between the two variables. *Represents a difference from saline-yoked groups (p<0.05). †Represents a difference from the saline tone-tested group (ANOVA, p<0.05). ++Represents a positive linear correlation among responding on the active lever and Fos-labeled cells (p<0.01). ac, anterior commissure.

PowerPoint slide

Correlations were also conducted within the cocaine groups separately for the novel vs CS test cue conditions to determine whether Fos expression in certain regions was related to the magnitude of responses on the active lever during testing. Interestingly, a positive correlation was found in the dmCPu for the novel (Figure 2f; r=0.70, p<0.01), but not the CS (Figure 2g; r=0.12, p=0.67) test cue condition, indicating that the higher responding on the active lever when tested with a novel cue the higher the expression of Fos in the dmCPu. No other significant correlations were found in the remaining brain regions.

Experiment 2

Sucrose reinforcement training. The 15 light-trained sucrose rats received a total of 382±33 sucrose pellets and the 18 tone-trained sucrose rats received a total of 412±22 sucrose pellets during training. A significant interaction between session and training cue was found (F5, 154=2.38, p<0.05), where tests for simple effects of group revealed that the tone-trained rats exhibited higher reinforcement rates relative to light-trained rats during the transition from mild food restriction to ad libitum access, specifically on days 7 and 8 (Figure 3a; ANOVAs, p<0.05). Overall, there was no group difference in the total number of pellets received across all training sessions (Figure 3a inset; F1, 31=2.48, p=0.13).

Figure 3
figure 3

Sucrose reinforcement and seeking behavior from Experiment 2. (a) Daily reinforcement rates (±SEM) for rats trained to lever press for sucrose pellets that were paired with a light (circles) or a tone (squares). Inset graph shows the total number of sucrose pellets (±SEM) attained across all training sessions. Dashed vertical lines represent the transition period from food restriction (∼16–22 g chow/day) to food ab libitium. (b) Responses on the active lever (±SEM) across extinction sessions in animals that had previously received either a light or a tone cue paired with sucrose reinforcement during training; responses on the active lever produced no scheduled consequences during extinction. (c) Estimated marginal means for responses on the active lever/30 min (±SEM), with the change in inactive lever presses from baseline to test day as the covariate, for groups tested with novel (open symbols, dashed lines) or conditioned (closed symbols, solid lines) light (circles) or tone (squares) cues (n=7–9/group). ^Represents a difference from the light-trained group (tests of simple effects, p<0.05). *Represents a difference from baseline responding (p<0.05).

PowerPoint slide

Extinction and reinstatement of sucrose-seeking behavior. Responses on the active lever reduced across extinction training (Figure 3b), where a main effect of session was found (F2, 56=92.13, p<0. 001), and there were no group differences in response rates. Post hoc analyses revealed that last-day extinction lever presses were significantly reduced from first-day extinction lever presses (Bonferroni correction, p<0.05). No significant changes were found during extinction for responding on the inactive lever (Table 1).

To assess reinstatement of sucrose-seeking behavior, we compared responses on the active lever during testing with responses on the active lever during the last extinction session, with day as a within-subjects factor and test cue condition and test cue modality as between-subjects factors. As with Experiment 1, the use of the covariate (change in inactive lever presses from baseline to test day) was justified as there were no interactions with test-day condition and/or modality (ranged from F1, 28=0.03 to 1.97, ps>0.05). Only a main effect for day was found for active lever presses (F1, 28=123.14, p<0.001), indicating that test-day responding increased compared with extinction baseline regardless of other factors (Figure 3c).

Experiment 3

Cue reinforcement training. Across training, the 18 light cue-trained rats received a total of 65±7 light presentations and the 18 tone cue-trained sucrose rats received a total of 46±3 tone presentations. No interaction between session and training cue was found (Figure 4a; F14, 448=0.94, p=0.52), but the light cue group received more cue presentations than the tone cue group overall (Figure 4a inset; F1, 32=5.73, p<0.05).

Figure 4
figure 4

Cue reinforcement and seeking behavior from Experiment 3. (a) Daily reinforcement rates (±SEM) for rats trained to lever press for either a light (circles) or a tone (squares) cue presentation. Inset graph shows the total number of cue presentations (±SEM) earned across all training sessions. (b) Responses on the active lever (±SEM) across extinction sessions in animals that had previously received either a light or a tone cue without reinforcement during training; responses on the active lever produced no scheduled consequences during extinction. (c) Estimated marginal means for responses on the active lever/30 min (±SEM), with the change in inactive lever presses from baseline to test day as the covariate, for groups tested with novel (open symbols, dashed lines) or conditioned (closed symbols, solid lines) light (circles) or tone (squares) cues (n=8–9/group) #Represents a difference from the tone-trained group in total number of cue presentations (p<0.05). ^Represents a difference from the tone-trained group (tests for simple effects, p<0.05). *Represents a difference from baseline responding (tests of simple effects, p<0.05).

PowerPoint slide

Extinction and test day responding for the cue-only condition. A significant interaction between session and training cue was found (F6.0, 192.8=3.16, p<0. 01), where tests of simple effects of group revealed that the light-trained group had higher responding on the active lever than the tone-trained group on several days throughout extinction (Figure 4b; ANOVAs, ps<0.05). Post hoc analyses revealed that last-day extinction lever presses were significantly reduced from first-day extinction lever presses in the light-trained group (Bonferroni correction; p<0.025), but not in the tone-trained group. No significant changes were observed during extinction for responses on the inactive lever (Table 1).

We analyzed test-day responding for cues the same as Experiments 1 and 2, as well as the use of the covariate, which was justified as no interaction existed with test cue condition and/or modality (range from F1, 29=0.40 to 2.88, ps>0.05). Surprisingly, analysis of the cue-only condition reinstatement data revealed a significant interaction between day and test cue condition (F1, 29=5.22, p<0.05), where tests for simple effects of day revealed that the novel test cue condition exhibited increased responding on the active lever on test day compared with baseline (Figure 4c; F1, 14=6.32, p<0.05), whereas no such effect was found in the CS test cue condition (F1, 16=0.08, p=0.78). Analysis of group effects during test day revealed a trend where the novel cue groups responded higher than the CS groups (F1, 31=3.53, p=0.07), and no difference was found during baseline responding (F1, 31=0.31, p=0.58).

DISCUSSION

The aim of this study was to use a novel cue control group to parse out the contribution of experiencing novelty during cue reinstatement testing to Fos expression. On cue reinstatement test days, animals experience novelty in receiving a response-contingent stimulus (that is, CS+) following a prolonged period of non-reinforced responding. Given that Fos expression is sensitive to environmental and procedural novelty (Badiani et al, 1998; Handa et al, 1993), we compared expression in animals exposed with drug-paired vs novel stimuli predicting that there would be greater Fos and perhaps region-specific activation owing to drug-conditioning. Surprisingly, novel cues elicited similar reinstatement behavior and Fos expression patterns as conditioned stimuli. We then hypothesized that cocaine SA may have cross-sensitized reinforcement circuits to novelty, which acts on converging reward pathways (Bardo et al, 1996; Besheer et al, 1999). To address this question, we examined whether novel cues reinstated sucrose-seeking behavior using parameters that are not consistent with sensitizing effects from sucrose reinforcement (Avena et al, 2005). Nevertheless, a similar outcome occurred where all sucrose groups reinstated regardless of whether they received CS+ or novel cues. These findings suggested that drug-specific sensitization was not a likely explanation, and instead novel cues had similar effects on drug and non-drug reinforcement-seeking behavior. We considered three possible explanations for the novel cue reinstatement effect that are stated below.

Novelty Reinforcement

The rewarding properties of novelty have been documented in humans and animals (Bardo et al, 1989, 1996; Besheer et al, 1999), and motivation for novelty predicts individual sensitivity to drug reward and reinforcement (Belin et al, 2011; Davis et al, 2008; Nadal et al, 2002; Pelloux et al, 2006; Piazza et al, 1989, 2000; Suto et al, 2001). We therefore suggest that the novel stimuli may have activated similar reinforcement and motivation processes as those involved in conditioned reinforcement-seeking behavior. In support, the Fos expression resulting from novel cue reinforcement was highly similar to that found previously during CS+ re-exposure (Kufahl et al, 2009; Neisewander et al, 2000). The latter study also used a no-lever control group and found similar activation, suggesting that our findings are not due to the act of operant responding, but instead likely reflect circuits that have a role in drug motivational processes invoked for expression of cocaine-seeking behavior (Fuchs et al, 2002, 2004, 2006; Ito et al, 2004; Kalivas and McFarland, 2003; Kruzich and See, 2001b). Reinstatement observed under the cue-only condition further supports the reinforcing effects of the response-contingent novel stimuli (Baron and Kish, 1962).

We also considered the stress-provoking effects of novel stimuli and procedures that occur in rodents (Kabbaj et al, 2000; Piazza et al, 1991; Rozin and Kalat, 1971). Stress is known to facilitate both drug reinforcement and incentive motivation, where various stressors have the ability to reinstate drug and sucrose-seeking behavior (Bossert et al, 2005; Shaham et al, 2003; Simms et al, 2011). Novel stimuli have the tendency to induce both approach and avoidance in animals, and it is the balance between these two conflicting motivations that drive subsequent behavior (Montgomery and Monkman, 1955). Although exposure to discrete novel cues in our study may have induced an initial stress response, the animals appeared to approach, rather than avoid, making responses to receive cue presentations. This suggests that potential stress effects of the novel cues likely facilitated, rather than inhibited, the animal's motivation to seek reinforcement.

It is possible that novel and conditioned stimuli may produce similar levels of reinstatement, but by differentially engaging reward (that is, liking) vs incentive salience (that is, wanting) systems. Robinson and Berridge's (1993) theory of incentive sensitization posits that these systems are dissociable and that incentive salience of drug and drug-associated cues sensitizes, whereas the hedonic value of the drug either decreases or remains stable over time. In our experiment, conditioned cues may trigger sensitized motivational circuits with little influence from hedonic value, resulting in reinstatement of drug–seeking behavior. By contrast, the novel cue may first engage hedonic processes, as the increase in responding in the cue-only condition suggests. Activation of reward processes within the drug-associated context may then activate the sensitized motivational circuits, resulting in robust novel cue-seeking behavior. Thus, on the test day the novel and conditioned stimuli may be activating the same sensitized motivational circuit that developed during training, resulting in similar reinstatement effects.

Activation of Action-Outcome Habits

Beyond the stimulating effects of the novel cue, a history of reinforcement paired with response-contingent stimuli likely induced a strong action–outcome habit that the response-contingent novel cue re-engaged during testing. Prolonged appetitive training is known to initially show goal-directed behavior and then transition to habitual responding (Jog et al, 1999), and contingent cues likely facilitated this transition in our study, as well as initial acquisition (Deroche-Gamonet et al, 2002). Kruzich et al (2001a) found that a novel light/tone combination failed to produce reinstatement in animals that had no cocaine-paired cues during training, suggesting that reinforcement experience alone is insufficient for novel cue reinstatement. Thus, novel cues are able to reinforce responding reliably only when they are presented similarly to conditioned stimuli, thereby initiating habits that are resistant to stimulus modality changes.

The positive correlation between the degree of Fos expression in the dmCPu and responding for novel, but not CS+, cues is an interesting finding that is consistent with previous research demonstrating a similar region-specific effect in high novelty-seeking animals (that is, high responders; Kabbaj and Akil, 2001). The cause of this relationship is unclear, but projections from the visual/auditory cortices to the dmCPu (McGeorge and Faull, 1989) may be involved as the sensory properties of novel stimuli are salient upon their first presentation. Our reinstatement data support this claim, where cue modality had a greater effect on responding in the novel vs conditioned cue groups. The dmCPu has been implicated in behavioral flexibility and attention, where neurons in this region fire in response to environmental changes during a spatial working memory task and regional inactivation has been shown to impair performance of a visually signaled time-dependent response task (Mizumori et al, 2000; Ragozzino et al, 2002). The greater degree of change to the environment in the novel, compared with the CS, test cue condition may differentially engage the dmCPu reflective of the amount of attention given to the stimulus.

While novel and conditioned cues elicited similar seeking behavior and brain activation, response magnitude and Fos effects varied in some cases with stimulus modality. Others have reported an impact of cue modality on responding, where a light cue is capable of eliciting greater responding than a tone cue (Baron and Kish, 1962; Di Ciano and Everitt, 2003; Panlilio and Schindler, 1997; Reed et al, 1996; See et al, 1999). We found that a light was more effective than a tone during cocaine SA and reinstatement, as well as during training in the cue-only condition. The light cue also induced greater Fos expression in the Cg1 of the saline controls compared with the tone, which may be due to attention because this region is engaged in the presence of salient stimuli (Downar et al, 2002). By contrast, the tone cue appeared to elicit slightly greater responding during sucrose training. These findings are curious, but this difference could be due to the ability of psychostimulants to increase responding for visual stimuli (Caggiula et al, 2002; Deroche-Gamonet et al, 2002; Donny et al, 2003; Gancarz et al, 2011; Palmatier et al, 2006; Winterbauer and Balleine, 2007) and amplify sensitivity to acoustic stimuli (Davis, 1985; Davis et al, 1975), thereby enhancing the initial reinforcing and/or aversive properties of the light and tone cues, respectively (Baron and Kish, 1962). Regardless, our cue modality effects are consistent with the sensory reinforcement literature.

Contextual Effects

Our procedure of testing in the original training context may also contribute to responding for novel stimuli during the test session by reigniting the incentive motivational value of the reinforcement context. It is well established that contextual manipulations exert control over operant responding, where reinforcement seeking and other conditioned responses are reinstated by exposure to the training context following a period of prolonged abstinence or extinction in a different context (Bouton and Peck, 1989; Crombag and Shaham, 2002; Neisewander et al, 2000). By contrast, exposing rats to a novel context during testing (that is, an AAB procedure) does not reinstate drug-seeking (Bossert et al, 2004; Crombag and Shaham, 2002; Fuchs et al, 2005), or conditioned responses (Bouton and King, 1983), highlighting the unique impact of the original training context on responding. Furthermore, midbrain dopamine neurons fire in response to conditioned stimuli predictive of reward as well as unfamiliar, but similar stimuli when presented in the same context (Schultz, 1998). The incentive motivational value of the context may be necessary to achieve heightened novel cue responding, perhaps through the training environment acting as discriminative stimuli or an ‘occasion setter’ (Alleweireldt et al, 2001; Swartzentruber, 1991). The importance of contextual contributions to novel cue reinstatement of drug-seeking behavior will require further attention.

Conclusions

The present study has uncovered novel information that calls for additional thought about how reinstatement of reinforcement-seeking behavior is mediated. It appears that in a reinforcement-associated context, novelty can trigger incentive motivation similar to that of conditioned stimuli. Furthermore, the neural circuitry involved in reinstatement by both types of cues largely overlaps. We have suggested potential mechanisms, which are not mutually exclusive, that may mediate, or contribute at least in part, to novel cue reinstatement. These include: (1) novel stimulus-elicited activation of brain reinforcement mechanisms that in turn drive reinforcement-seeking behavior similar to CS+ stimuli; (2) response-contingent events re-engaging action–outcome processes that have become habitual; and (3) the ability of novelty or CS+ reinforcement to reinstate the incentive motivational effects of the reward-associated context. In terms of drug abuse and other addictive behaviors, the present findings suggest novel potential mechanisms by which environmental stimuli may disrupt abstinence as effectively as stimuli known to be associated with reinforcement. Further understanding of how novelty relates to craving for drugs of abuse or food reinforcers is important for understanding relapse and for future therapeutic interventions.