Main

Cocaine addiction is a chronic relapsing disorder (Leshner 1997; O'Brien et al. 1998; O'Brien and McLellan 1996). Relapse prevention has therefore emerged as a central issue in treatment and medications development efforts (Carroll et al. 1991; DeLong 1994; O'Brien and McLellan 1996). A wealth of clinical studies point towards drug-related stimuli or events as a critical factor in the long-lasting compulsion to abuse drugs. This literature suggests that environmental stimuli repeatedly paired with the subjective actions of cocaine throughout an individual's history of drug abuse can produce intense drug craving (Childress et al. 1988; Ehrman et al. 1992; Kilgus and Pumariega 1994; Robbins et al. 1997; Satel et al. 1995; Wallace 1989) or elicit automatic behavioral responses (Miller and Gold 1994; Tiffany and Carter 1998) that may lead to relapse.

Consistent with a role of conditioning factors in the resumption of drug abuse habits, several studies in animals have demonstrated that drug-associated stimuli can elicit a transient recovery of extinguished responding at a lever previously associated with cocaine self-administration (e.g., deWit and Stewart 1981; Fuchs et al. 1998). However, little information is available, to date, about the persistence of cue-induced drug-seeking behavior. In recent work that has begun to address this issue, appreciable cued recovery of cocaine-seeking behavior was observed when rats were presented with cocaine cues on two separate occasions spaced 20 days apart. Similarly, the efficacy of a cocaine cue to reinstate extinguished cocaine-seeking behavior was found to persist for up to eight days of intermittent testing (Weiss et al. 2000). The latter study revealed also that cocaine cues increase extracellular dopamine levels in the nucleus accumbens and amygdala. These findings indicate that the behavioral effects of cocaine-related environmental stimuli can show resistance to extinction, and that their motivating effects may involve activation of forebrain DA neurotransmission.

Considering the mounting evidence that learned responses to cocaine-related stimuli represents a significant factor in long-lasting relapse risk (see O'Brien et al. 1998 for review), better understanding of the environmental, behavioral, and neuropharmacological bases of this phenomenon will be of substantial clinical benefit. The purpose of the present study was to characterize the resistance to extinction of cocaine-seeking behavior induced by a cocaine-associated environmental stimulus in rats under conditions of frequent repeated exposure to the cocaine cue, as well as following an additional long-term abstinence period during which rats remained confined to their home cages. A second issue—in view of the recently reported activation of DA release in the nucleus accumbens and amygdala by cocaine cues (Weiss et al. 2000) and human brain imaging data implicating activation of DA-rich forebrain regions in cocaine craving (Breiter et al. 1997; Childress et al. 1999; Garavan et al. 1998; Grant et al. 1996)—was to confirm a dopaminergic involvement in the response-reinstating actions of cocaine-related environmental stimuli by determining whether these effects are sensitive to pharmacological manipulation of DA transmission.

MATERIAL AND METHODS

Animals

Male Wistar rats (Beckman Laboratories, The Scripps Research Institute, La Jolla, CA) weighing 250–300 g at the beginning of the experiment were housed in groups of two or three, in a humidity and temperature (22°C) controlled vivarium on a 12/12-hr light/dark cycle (lights off at 6:00 P.M.) with ad libitum access to food and water except during initial operant training for food-reinforced responding. Training and testing were conducted during the light phase of the light/dark cycle at the same time each day. All procedures were conducted in strict adherence to the National Institutes of Health Guide for the Care and Use of Laboratory Animals.

Apparatus

Rats were trained and tested in standard 29 × 24 × 19.5 cm operant conditioning chambers located inside ventilated sound attenuating cubicles (BRS/LVE Inc., Laurel, MD) and modified for intravenous self-administration as previously described (Weiss et al. 2000). All chambers were equipped with two retractable levers, a white cue light above each lever, and a house light located at the top of the chamber's front panel. Auditory stimuli consisted of a 70 dB white noise produced by a white-noise generator (The Salk Institute, San Diego, CA) and presented via an 80-ohm speaker located in the center of the chamber's front panel below the house light, or an intermittent “beeping” tone (7 kHz, 70 dB), generated by a tone source (Sonalert, Model SC628, 6–28 VDC, Mallory, CO) also located in the center of the front panel just above the speaker. Intravenous infusions were administered by a syringe pump (Razel Scientific Instruments, Stamford, CT) located outside the sound attenuating boxes. Testing equipment and data collection were controlled by an IBM-compatible microcomputer.

Drugs

Cocaine hydrochloride (National Institute on Drug Abuse, Bethesda, MD) was dissolved in sterile physiological saline at a concentration of 0.25 mg/0.1 ml. Cocaine or saline was infused intravenously at a volume of 0.1 ml over a 4-sec period. The dopamine (DA) D1 antagonist SCH 39166 (Schering-Plough, Kenilworth, NJ) was dissolved in ethanol (1μg/1μl), diluted with saline to a concentration of 30 μg/ml and injected subcutaneously (SC) 30 minutes before reinstatement session at a dose of 10 μg/kg. The D1 agonist R(+)SKF 81297 (Research Biochemical International, Natick, MA) was dissolved in physiological saline at a concentration of 1 mg/ml and injected at a dose of 1 mg/kg SC). Nafadotride, a D2/D3 receptor antagonist (provided by J.C. Schwartz, Unité de Neurobiologie et Pharmacologie, INSERM, Centre Paul Broca, Paris, France) was dissolved in physiological saline at a concentration of 1 mg/ml and injected at a dose of 1 mg/kg SC. The D2/D3 receptor agonist S(+)PD 128907 (Research Biochemicals International, Natick, MA) was dissolved in physiological saline at a concentration of 0.3 mg/ml and injected at a dose of 0.3 mg/kg SC. SKF 81297, nafadotride, and PD 128907 were administered 10 minutes before the beginning of reinstatement sessions.

Operant Training for Food-Maintained Responding

Rats were initially food restricted (15 grams/day; Purina rat chow) in order to establish lever-pressing behavior maintained by 45 mg food pellets (Noyes Inc., Lancaster, NH). Food pellets were available in 1-hr daily sessions on a fixed ratio 1 (FR 1) time out (TO) 1-sec schedule of reinforcement. Only the right lever was available and the TO period during which the lever remained inactive was signaled by illumination of the cue light above the lever. Once rats earned 100 pellets/session, the TO period was increased to 20 seconds in two additional sessions. Food then was made available ad libitum again.

Jugular Catheter Implantation

Rats were implanted with chronic silastic catheters in the right jugular vein under halothane (1.0 – 1.5%) anesthesia as previously described (Caine et al. 1993). Catheter patency was maintained by flushing with sterile heparin/saline (33.3 USP units/ml) solution before and after self-administration sessions. Rats with compromised catheters were implanted with a new catheter in the left jugular vein or excluded from the experiment.

Self-Administration Training and Conditioning Protocols

The purpose of these procedures was to train rats to intravenously self-administer cocaine while simultaneously establishing reliable discriminative stimuli (SD) associated with cocaine availability vs. non-availability. Self-administration training began seven days after surgery. During the first three days of this phase, sessions were initiated by extension of both levers and simultaneous presentation of an intermittent “beeping” tone (1-sec duration presented at 1-sec intervals) that served as a discriminative stimulus (S+) for cocaine availability and remained present throughout the session. Cocaine was then made available for two hours on a FR 1 schedule of reinforcement. Only the right lever was active and, when depressed, resulted in an infusion of cocaine and 20 seconds illumination of the house light during which time the lever remained inactive to avoid accidental overdosing. Responses at the left lever were recorded but had no programmed consequences. Beginning on day 4, daily access to cocaine was changed from a single 2-hr session to two consecutive 1-hr sessions, separated by 30 minutes. Starting on day 6, a third daily 1-hr session was added during which saline was substituted for the cocaine solution. This session was initiated by extension of both levers and illumination of the cue light above the right lever which served as a discriminative stimulus (S) signaling non-reward (i.e., the availability of saline only) and remained present throughout the session. Each response at the active lever resulted in infusion of 0.1 ml saline and 20 seconds presentation of a white noise TO stimulus during which time the lever was inactivated. Responses at the left, inactive lever were recorded but had no programmed consequences. The sequence of the three daily sessions (i.e., two cocaine and one saline sessions) was random. Rats were removed from the self-administration chambers for 30 minutes between two cocaine sessions, and for 40–60 minutes between cocaine followed by saline sessions. Training under these contingencies was continued until the rats developed stable responding for cocaine (±10 % of mean responses over three consecutive days) and ceased responding during saline sessions.

Extinction Phase

Extinction sessions consisted of single daily 1-hr sessions, initiated by extension of both levers but without presentation of the discriminative stimuli. Responses at the previously active lever had no scheduled consequences except for activation of the syringe pump motor. Extinction sessions continued until animals reached a criterion of < 4 lever presses per session at the previously active lever for three consecutive days.

Reinstatement Phase

Reinstatement tests began one day after each individual animal had reached the extinction criterion. These tests lasted one hour and were conducted once every third day for a total of 12 sessions under conditions identical to those during cocaine or saline self-administration, except that cocaine or saline were not available. Sessions were initiated by extension of both levers and presentation of either the cocaine S+ or S. The respective SD remained present until retraction of the levers at the end of each session. In both conditions, responses at the previously active lever were followed by activation of the syringe pump motor and a 20-sec signaled TO period (i.e., illumination of the house light in the S+ condition, and presentation of a white noise in the S condition) during which time the levers remained inactive.

To confirm the behavioral selectivity of the discriminative stimuli, the rats were tested in the presence of the S on the first day of the reinstatement phase. The animals then were repeatedly tested for response-reinstating effects of the cocaine S+ in nine consecutive sessions (i.e., reinstatement sessions 2–10; days 4–28 post-extinction), each separated by 2-day periods during which the rats remained in their home cages. To further verify the selectivity of the behavioral effects of the discriminative stimuli, rats were again tested in the presence of the S during session 11 of the reinstatement phase (day 31 post-extinction), followed by a final test with the cocaine S+ on day 34 post-extinction (reinstatement session 12). Rats of this group (Group I) also served as vehicle controls for drug tests conducted in a second group of rats (Group II, see below) and received injections of vehicle solutions before reinstatement sessions 3, 5, 7, and 9 (i.e., days 7, 13, 19 and 25 post-extinction), corresponding to the days when pharmacological tests were conducted in Group II.

Delayed Reinstatement Test

Following completion of the repeated cue exposure test phase, eight rats of Group I were randomly selected to study the long-term persistence of the behavioral effects of the cocaine S+. These animals were confined to their home cages in the vivarium for 12–14 weeks (i.e., until approximately 17–19 weeks after their last exposure to cocaine) and then retested individually for their behavioral response to the SDs. The rats were randomly assigned to two groups, and tested once with either the cocaine S+ (N = 4) or S (N = 4) using the same procedures as above.

Drug Tests

To study whether the cue-induced response reinstatement is sensitive to pharmacological manipulation of dopamine receptors, a second group of rats (Group II, N = 10) was prepared. The experimental protocols for self-administration training, extinction, and reinstatement sessions was identical to that in Group I, except that no delayed reinstatement tests were conducted. The rats were treated with pharmacological agents (using the procedures and doses described in “Drugs”) prior to reinstatement sessions 3, 5, 7, and 9 (days 7, 13, 19 and 25 post-extinction). The animals received neither drug nor vehicle treatments during the S+ reinstatement sessions preceding and following each drug test (i.e., reinstatement sessions 2, 4, 6, 8, 10; days 4, 10, 16, 22, 28 post-extinction). To control for order effects, the rats were tested with the different agents in random sequence across the four drug test days.

Data Analysis

Differences among responses during the Training, Extinction and Reinstatement Phases were analyzed by one-way within subjects ANOVA, followed by Newman-Keuls post-hoc tests to identify differences between experimental phases and responses in the presence of the S+ vs. S. Differences in behavioral performance between the Groups I and II during the three experimental phases were analyzed by mixed-factorial ANOVAs, followed by Simple Effects analyses and/or Newman Keuls post-hoc tests. The effects of the pharmacological agents on reinstatement responses in Group II were compared to the number of responses in the vehicle-treated animals of Group I tested at post-extinction days corresponding to the respective drug test days in Group II and analyzed by mixed-factorial ANOVA, followed by simple effects analysis.

RESULTS

Resistance to Extinction

Self-Administration Training and Conditioning Phase

All rats of Group I (N = 8) rapidly developed stable cocaine self-administration. The mean (±SEM) number of days required to reach the training criterion was 10.25 ± 1.0. No differences in the mean number of responses were observed during the final three days of the training phase (F2,14 = 1.83; NS) or between the first and second daily hour of cocaine availability (F1,7 = 1.70; NS). The data of the two daily cocaine sessions were therefore pooled for all subsequent analyses. While there were no differences in the number of cocaine-reinforced responses between the two daily cocaine sessions, responding during cocaine availability differed significantly (F1,7 = 129.09; p < .00001) from that during saline availability (Fig 1 , left panel). Only minimal responding was observed at the inactive lever during both cocaine and saline sessions.

Figure 1
figure 1

Lever-press responses during self-administration training, extinction and reinstatement sessions at an active (A) and inactive (B) lever. Training Phase: Cocaine-reinforced (/•) and saline/nonreinforced (/•) responses during the final three days of the self-administration phase in two groups of rats trained to associate discriminative stimuli with the availability of intravenous cocaine (S+) or saline (S). Rats of Group I (/) were designated for tests of the resistance to extinction of cocaine-seeking behavior induced by the cocaine S+ during the Reinstatement Phase. Rats of Group II (•/•) were used to examine the effects of pharmacological agents on S+-induced responding during the Reinstatement Phase (see Figure 3). No differences were observed between the number of responses during the first and second daily hour of cocaine availability (see Methods) in either of the groups. The data were, therefore, collapsed across daily cocaine sessions. Extinction Phase: Extinction responses at criterion (< 4 responses/session over three consecutive days). The number of days required to reach the criterion was 15.3 ± 3.9 (Group I) and 21.6 ± 2.0 (Group II). Reinstatement Phase: Responses in the presence of the S+ and S in Group I () and Group II (•). Exposure to the S+ elicited significant recovery of responding in the absence of further drug availability while responding in the presence of the S remained at extinction levels. Data for Group II represent reinstatement tests prior to the beginning of the pharmacological tests shown in Figure 3 (Drug Treatments). For statistical comparisons see “Results.”

Extinction Phase

The mean (±SEM) number of responses per session during the first three sessions of the extinction phase was 11.6 ± 1.44. The rats required on average 15.3 ± 3.9 sessions to reach the extinction criterions. Responding at the inactive lever was negligible throughout the extinction phase (Figure 1, center panel).

Reinstatement Phase

Responding at the active lever remained at extinction levels during testing in the presence of the non-reward S on the first day of the reinstatement phase. In contrast, exposure to the S+ beginning with the second reinstatement test produced strong recovery of responding. Responses at the active lever during presentation of the S+ were significantly increased over extinction levels on all 10 S+ testing occasions (p < .01) and statistically indistinguishable from cocaine-reinforced responses during the Training Phase (Newman Keuls post-hoc tests after one-way ANOVA: F17,119 = 9.80; p < .00001). The response-reinstating effects of the cocaine S+ remained unaltered throughout the nine consecutive sessions, and no differences were observed in the number of responses among tests preceded by vehicle injections (reinstatement sessions 3, 5, 7, and 9) or no injection (sessions 2, 4, 6, 8, 10 and 12) as determined by Newman-Keuls post-hoc analysis. As during the first S reinstatement test, responding did not increase over extinction levels (Newman-Keuls) during the second S test on post-extinction day 31 (reinstatement session 11) but recovered fully (Figure 1, reinstatement phase) when the rats were again presented with the S+ on test day 12 (post-extinction day 34). The number of responses at the inactive lever was negligible throughout all reinstatement tests (mean total responses/session: 2.34 ± 0.31) without differences between S+ and S sessions (F11,77 = 1.21; NS).

Inspection of the response patterns of individual rats revealed that responding in the presence of the S+ was sustained throughout all reinstatement sessions, and distinctly different from the pattern of non-reinforced responses during the Extinction Phase. In addition, there was no sign of extinction to the effects of the S+ with repeated testing, and in some animals a pattern of densely distributed but remarkably regular inter-response intervals emerged (Figure 2 ).

Figure 2
figure 2

Representative records of three rats illustrating the pattern of responding during the cocaine self-administration, extinction, and reinstatement phases. (All records represent 60 min sessions). Cocaine Self-Administration: Cocaine-reinforced responding during the last day of the training phase. Extinction: Responses during the first and last session of the extinction phase. Cue-Induced Reinstatement: (Panel A) Responding in the presence of the cocaine S+ during the first reinstatement session. (Panels B and C) Response records of the same animals selected arbitrarily at later time points during the 34-day reinstatement testing phase

Delayed Reinstatement Test

Presentation of the S+ in rats (N = 4) subjected to a total of 17 – 19 weeks of abstinence (including the extinction phase) significantly reinstated responding to levels indistinguishable from those during the initial reinstatement phase. Responses in rats (N = 4) presented with the S remained at extinction levels. ANOVA revealed overall differences in responding across the Self-Administration, Extinction, and Delayed Reinstatement Tests (F2,12 = 22.52; p < .0001). Differences between the behavioral effects of the S+ and S during the Delayed Reinstatement Test were confirmed by a significant interaction between Groups (S+ vs. S) and Experimental Phases (F2,12 = 8.40; p < .01) followed by simple effects analysis (F1,18 = 18.28; p < .0001). The subgroups of rats assigned to testing with the S+ vs. S did not differ in behavioral performance during the final three days of the Self-Administration (F1,18 = 0.84; NS) and Extinction (F1,18 = 0.00; NS) phases (Figure 3 ).

Figure 3
figure 3

Long-term persistence of the cue-induced cocaine-seeking behavior in rats confined to their home cages following completion of the reinstatement test series shown in Figure 1 (Group I) and retested after 12–14 weeks of abstinence. Even after this extended abstinence period, the cocaine S+ but not S reinstated cocaine-seeking behavior. For comparison, cocaine self-administration and extinction responses are shown separately for animals tested with the cocaine S+ (S+ Condition) vs. saline-associated S (S Condition) during the reinstatement tests: (SA) Cocaine-maintained responses averaged across the last three days of the Self-Administration Phase. (EXT) Responses averaged across the last three days of the Extinction Phase. **p < .01; different from EXT in S+ Condition, and both EXT and S performance in S Condition

Effects of Pharmacological Agents on Cue-Induced Responding

Rats of the drug treatment group (Group II, N = 10) reached the training criterion within 8.4 ± 0.6 days and did not differ in the number of training days required from Group I (F1,16 = 2.90; NS). Similarly, no differences between Groups I and II were apparent in the number of responses at the active lever during the final three days of the training phase (Figure 1, left panel). This was reflected by a lack of significant differences between “Groups” (F1,16 = 0.61; NS) and interactions between “Drug Condition (cocaine vs. saline)” (F1,16 = 3.20; NS) or “Groups” and “Daily Sessions” (F2,23 = 1.30; NS). Rats of Group II made on average 12.27 ±1.44 responses per session during the first three sessions of the extinction phase and reached the extinction criterion within 21.6 ± 2.0 sessions. There were no differences between this group and Group I in the number of days to criterion (F1,16 = 3.63; NS) or the number of responses during the last three days of the extinction phase (F1,16 = 0.67; NS). Responding at the inactive lever was negligible during the Training and Extinction Phases (Figure 1).

The behavioral performance of Group II during both the S and drug-free S+ reinstatement tests was identical to that obtained on the corresponding test days in Group I. Specifically, as in Group I, responding in the presence of the S did not increase over extinction levels during the first reinstatement test, whereas presentation of the S+ in the second reinstatement test produced recovery of responding to levels identical to those during the Self-Administration Training Phase (Figure 1). The S+-induced response reinstatement was confirmed by statistical analysis [p < .01; Newman Keuls after overall ANOVA of Extinction vs. S and S+ responses (F12,96 = 12.08; p < .00001); data from one animal in Group II that developed health complications and did not complete all tests were excluded from all statistical analyses, reducing the sample size of Group II to N = 9]. Responding during S+ tests remained stable across the five (pre/post) tests in which these animals received neither drug nor vehicle pretreatments (F4,32 = 2.05; NS). Behavioral performance during S+ tests was identical to that in Group I as indicated by a lack of differences in reinstatement responses between groups (F1,15 = 0.62; NS) or interactions (F4,60 = 1.73; NS) between groups and test days (data represent responses during each of the five drug-free S+ pre/post tests in Group II vs. the corresponding five S+ test sessions in the vehicle-treated rats of Group I. The respective mean ± SEM responses across the five test days in Groups I and II are shown in Figure 3: “Baseline” C vs. D). In addition, as in Group I, presentation of the S on test day 11 (post-extinction day 31) failed to elicit significant responding, whereas responding recovered fully (p < .01; Newman-Keuls tests after one-way ANOVA: F12,96 = 12.08; p < .00001) when these animals were again presented with the S+ on test day 12 (post-extinction day 34; data not shown). Responding at the inactive lever was negligible throughout the reinstatement phase without differences among S+ and S sessions (F7,56 = 1.40; NS).

The effects of the pharmacological agents on reinstatement responses in Group II were compared to the number of responses in the vehicle-treated animals of Group I, tested at post-extinction days corresponding to the respective drug test days in Group II. The effects of the pharmacological treatments are shown in Figure 4 . None of the agents produced significant changes in responding at the inactive lever (F3,45 = 2.69; NS). In contrast, the drug (F3,45 = 6.04; p < .002) but not vehicle (F3,45 = 0.23; NS) treatments exerted significant effects on responding [simple effects after main effect of drug treatments (F3,45 = 3.16; p < .05) in overall ANOVA]. Additional simple effects analyses confirmed a significant reduction in S+-induced responding at the active lever by the D1 antagonist SCH 39166 (F1,15 = 9.43; p < .01), the D2/D3 antagonist nafadotride (F1,15 = 6.62; p < .05), and the D2/D3 agonist PD 128907 (F1,15 = 9.14; p < .01). No mean change in responses at the active lever was observed after pretreatment with the D1 receptor agonist SKF 81297 (F1,15 = 0.33; NS).

Figure 4
figure 4

Effects of pharmacological manipulation of DA receptors on S+-induced responding at the previously active (upper panel) and inactive (lower panel) lever. Baseline: Mean ± SEM responses across the five drug-free S+ tests in Group II (D) and the corresponding five S+ test sessions (days 4, 10, 16, 22, 28 in Figure 1) in the vehicle-treated rats of Group I (C). Drug Treatments: S+-induced reinstatement responses after pretreatment with the D1 antagonist SCH 39166 (SCH, 10 μg/kg); the D1 agonist SKF 81297 (SKF, 1 mg/kg); the D2/3 antagonist nafadotride (NAF, 1 mg/kg); and the D2/3 agonist PD 128,907 (PD, 0.3 mg/kg). Responding was attenuated by SCH 39166, nafadotride, and PD 128,907, but not by SKF 81297. **p < .01, *p < .05; different from reinstatement performance in vehicle-treated controls (Baseline “C”)

Inspection of the data from individual animals revealed that both the D1 antagonist SCH 39166 and the D2/D3 agonist PD 128907 produced extinction-like response profiles characterized by an initial response burst followed by early termination of responding, similar to that observed during the extinction phase. Treatment with the D1 agonist SKF 81297 and the D2/D3 antagonist nafadotride was associated with a variable response profile. SKF 81297 produced substantial increases in responding in three animals, intermittent bursts of responding distributed throughout the session in three animals, and complete or near complete suppression of responding in four rats. Nafadotride exerted overall suppressant effects on reinstatement responses, but intermittent response bursts throughout the session were observed in four animals, extinction-like effects with early termination of responding in three rats, and near complete inhibition of behavior in two animals.

DISCUSSION

The purpose of this investigation was to characterize the persistence of the motivating actions of cocaine-related environmental stimuli with emphasis on the significance of discriminative stimuli for drug availability in the resumption of drug-seeking behavior following extinction and prolonged abstinence. The results confirm that discriminative stimuli associated with the availability and self-administration of cocaine can elicit strong cocaine-seeking behavior in rats as measured by the recovery of extinguished responding at a previously drug-paired lever without further availability of the primary reinforcer. Of particular significance is the finding that the response-reinstating actions of the cocaine S+ proved highly resistant to extinction as reflected by the stability of the cue effects throughout the 34-day phase of repeated testing, and by the lack of decreases in the efficacy of the S+ in rats that had been confined to their home cages for an additional 12–14 weeks. These behavioral effects of the S+ cannot be attributed to non-specific arousal or spontaneous recovery since responding at the inactive lever remained negligible and, more importantly, responding in the presence of the SD associated with non-reward (S) remained at (Figure 1, Day 1; Figure 3) or decreased to (Figure 1, Day 31) extinction levels. These observations indicate that the animals’ behavior was controlled selectively by the S+. Finally, the near identical behavioral results obtained during vehicle tests in the drug treatment group confirmed the generality of the resistance to extinction of the cocaine-cue effects.

The results confirm a previous report in which the efficacy of a cocaine S+ to elicit drug-seeking behavior remained unaltered over eight days of repeated testing (Weiss et al. 2000) and corroborate related findings that noncontingent presentation of drug-predictive discriminative stimuli can reinstate extinguished approach behavior in an operant runway model of heroin relapse (Ettenberg et al. 1996; McFarland and Ettenberg 1997) as well as responding at a lever previously associated with ethanol or heroin reinforcement (Gracy et al. 2000; Katner et al. 1999; Katner and Weiss 1999). However, the present data extend these earlier findings by demonstrating that the behavioral actions of previously cocaine-predictive discriminative stimuli are remarkably resistant to extinction and endure over prolonged periods of abstinence. Moreover, the undiminished efficacy and selectivity of the cocaine S+ to elicit responding at the previously drug-paired lever both with repeated testing and following long-term abstinence suggests that responses conditioned to environmental stimuli acting as discriminative stimuli for drug availability are an important factor in the long-lasting addictive potential of cocaine. Discriminative stimuli signal the availability of a reinforcer under a particular stimulus condition and reinforcement contingency and, thereby, facilitate or set the occasion for behavioral responses directed at obtaining the desired substance or object. The manner in which drug-associated contextual cues attain their motivating effects is likely to involve the occasion-setting nature of such stimuli rather than only classically conditioned responses. Indeed, it has been argued that human drug-seeking behavior may be most appropriately modeled by paradigms in which environmental stimuli that are predictive of drug reinforcement (i.e., discriminative stimuli) facilitate drug-related operant responding (McFarland and Ettenberg 1997). The endurance of the motivating actions of the cocaine S+ supports this hypothesis and suggests that contextual cues that act as discriminative stimuli for drug availability are particularly effective in eliciting and sustaining drug-seeking behavior and, thus, may be a critical environmental factor in compulsive drug-seeking behavior and long-lasting vulnerability to relapse.

In order to fully understand the stimulus conditions that controlled behavior during the reinstatement tests it is necessary, however, to consider that during the self-administration phase the rats were not only exposed noncontingently to the cocaine S+ but responses at the active lever also resulted in presentation of a response-contingent cue whose purpose was to signal the 20-sec post-response inactivation of the cocaine lever to prevent accidental overdosing. Because of the temporal contiguity between cocaine infusions and the “time out (TO)” signal, this stimulus may perhaps have acted as an infusion cue and attained conditioned reinforcing properties that were additive to the occasion-setting effects of the S+. More specifically, the cocaine S+ may have primarily facilitated the initiation of responding whereas the response-contingent TO stimulus, acting as a conditioned reinforcer, may have maintained subsequent drug-seeking behavior. The results do not provide direct insight into the relative contribution of the two stimulus categories to the results. However, there is evidence that “time-out” cues that are routinely employed during cocaine self-administration training attain, at best, only modest conditioned reinforcing efficacy. For example, in studies by both Meil and See (1996) and Weissenborn et al. (1995) noncontingent exposure to a TO cue light alone was ineffective in reinstating extinguished responding at a previously cocaine-paired lever. Similarly, response-contingent presentation of a TO cue light maintained conditioned responding only when preceded by a cocaine priming injection (Weissenborn et al. 1995). Consistent with these observations, omission of the TO stimulus in a preliminary examination of interactions between the S+ and TO cue—conducted after completion of the repeated cue exposure phase in rats not assigned to the prolonged abstinence condition—produced only a moderate 35% decrease in S+-induced responses (data not shown). Conversely, introduction of a cocaine S+ increased responding by 300% over (low) levels of responding maintained by a response-contingent TO stimulus alone (Weiss 1996). Thus, while conditioned reinforcing effects by the “time-out” stimulus may have enhanced the magnitude of the response reinstatement, available data suggest that drug-seeking behavior under the present contingencies was controlled primarily by the S+.

There are two issues that require consideration for the understanding of possible interactions between the S+ and TO cue. First, discriminative stimuli can acquire the ability to “set the occasion” for either reinforcement or non-reinforcement and to modulate responding generated by other conditioned stimuli in a manner that is independent of the association of the occasion setter with the unconditioned stimulus (Holland 1983; Rescorla 1985). For example, nonreinforced presentation of an occasion setter to extinguish associations between this cue and an associated unconditioned stimulus does not affect control of responding to a relevant conditioned stimulus by the occasion setter (Bouton and Swartzentruber 1986). It is possible that the training procedure in the present experiment conveyed occasion setting to the cocaine S+ that resulted in a “facilitation” of responses to the TO light cue. More importantly, according to this account, the remarkable persistence of non-reinforced responding in the presence of the S+ may have been related to difficulties in extinguishing the occasion-setting property of this discriminative stimulus. A second consideration relevant for possible interactions between the S+ and TO cue is that the concurrent presentation of the S+ and cocaine-contiguous TO cue during training produced some form of compound stimulus with greater salience and consequently greater efficacy in establishing and maintaining responding. Many classically conditioned responses are greater when elicited by a compound conditioned stimulus relative to responses produced by its component stimuli (for review, see e.g., Baker 1968). Recent studies have established that these effects of compound stimuli extend to cocaine-related conditioned responses. Compound cues involving both classically conditioned (See et al. 1999) and discriminative stimuli (Panlilio et al. 1996) have been shown to enhance cocaine-seeking behavior in animal models of cocaine reinforcement and relapse relative to the effects of the respective individual stimulus components. The preliminary findings that omission of the “time-out” stimulus produced a moderate decrease in S+-induced responses and that introduction of an S+ increased responding maintained by a response-contingent TO stimulus alone (Weiss 1996) suggest that compound stimulus effects may have been a factor in the magnitude of the response-reinstatement observed here as well.

While understanding of the precise nature of the stimulus control of behavior under the present contingencies will require further investigation, the results demonstrate that these stimulus conditions elicit strong and remarkably persistent cocaine-seeking behavior. Moreover, as illustrated in Figure 2, exposure to the S+ did not simply elicit a transient behavioral response that extinguished when lever-pressing was not reinforced by cocaine, but produced sustained responding throughout the test sessions. Typical extinction-like response profiles—characterized by repetitive response bursts after the onset of test sessions and eventual cessation of responding—were rarely observed. Importantly as well, throughout the course of the reinstatement test phase, all animals typically initiated responding immediately after the onset of each session in spite of absence of the primary reinforcer during the preceding session (i.e., access to the previously cocaine-paired lever under extinction conditions). The endurance of the behavioral effects of the cocaine S+ observed here appears analogous to the persistence of conditioned cue reactivity and cue-induced craving in humans (e.g., Childress et al. 1993) that has been implicated as a critical factor in long-lasting relapse risk and high rates of recidivism (e.g., O'Brien et al. 1998). Thus, the procedures employed here may provide a valuable tool for future investigations of behavioral and neurobiological factors underlying conditioned cue reactivity, cue-induced cocaine craving, and relapse.

Exposure to a cocaine-predictive discriminative stimulus was recently shown to significantly increase dopamine efflux in the nucleus accumbens and amygdala of rats as measured by microdialysis (Weiss et al. 2000). This finding suggested that the conditioned behavioral effects of cocaine-related stimuli may depend on activation of dopamine neurotransmission within limbic forebrain regions (Weiss et al. 2000). It was, therefore, important to examine whether the response-reinstating actions of a cocaine S+ can be altered by pharmacological manipulation of DA transmission. The approach employed for this purpose was to screen several ligands for the D1 and D2/3 families of receptors, each at a dose that was previously shown to modify cocaine self-administration or other cocaine-related behaviors (Caine et al. 1997; Parsons et al. 1996; Self et al. 1996; Weissenborn et al. 1996). While a single-dose pharmacological testing strategy is inevitably fraught with limitations regarding the interpretation of the data, the overall pattern of results is consistent with the hypothesis that the behavioral effects of cocaine-predictive stimuli are sensitive to pharmacological manipulation of dopamine receptors. In particular, the selective D1 antagonist SCH 23390 produced a complete reversal of the effects of the cocaine S+ with an extinction-like behavioral profile characterized by initial intermittent response bursts followed by eventual termination of responding. It is unlikely that the suppression of cue-induced responding by SCH 23390 was merely the result of general disruptive effects or impairments in motor performance. Low-dose D1 antagonist treatments alter the direct reinforcing actions of cocaine without interfering with motor performance (Amalric et al. 1993) or food-maintained behavior (Caine and Koob 1994). Consistent with these observations, a selective D1 antagonist (SCH 39166) was recently shown to attenuate the euphoric effects of cocaine in humans without inducing sedation or nonspecific anhedonic effects (Romach et al. 1999). More importantly, SCH 23390 selectively and dose-dependently reduced conditioned responding maintained by a cocaine-related conditioned stimulus, while leaving fully intact behavior maintained by a stimulus associated with a food reinforcer (Weissenborn et al. 1996). There is evidence that cocaine-predictive discriminative (Ciccocioppo et al. 1999) and cocaine-paired contextual stimuli (Neisewander et al. 2000) potently induce Fos expression within the basolateral amygdala and anterior cingulate cortex of rats, and that these effects are reversed by the selective D1 antagonist SCH 39166 (Ciccocioppo et al. 1999). In light of these findings, the reversal of the behavioral effects of a cocaine-predictive discriminative stimulus by a selective D1 antagonist in the present study identifies the D1 receptor as a particularly promising target for the further exploration of the neuropharmacological basis of cue-induced cocaine-seeking behavior.

The D2/3 antagonist nafadotride also produced a significant mean decrease in cue-induced responding and, as in the case of SCH 23390, this effect was associated with an extinction-like behavioral profile in the majority of the animals. Nafadotride has been shown to increase cocaine self-administration in rats, an effect that has been interpreted as reflective of an attenuation of the reinforcing actions of cocaine (Caine et al. 1997). Consistent with a functional dopamine antagonist action as well, nafadotride reversed autoregulatory decreases in accumbal dopamine release induced by the D2/3 agonist 7-OH-DPAT (Parsons et al. 1996). The inhibitory effects on cue-induced responding by both a D1 and D2/3 receptor antagonist, therefore, complement the finding that cocaine-predictive discriminative stimuli can increase dopamine release in the amygdala and nucleus accumbens (Weiss et al. 2000) and provide further support for the hypothesis that the behavioral effects of such stimuli may involve activation of neurotransmission in these brain regions.

The D1 agonist SKF 81297 produced a variable behavioral profile, attenuating or eliminating cue-induced responding in some animals, while augmenting this behavior in others. In instances where SKF 81297 attenuated the effects of the cocaine S+, the initiation of responding was either substantially delayed or rats failed to respond throughout the session. One explanation for this observation is that SKF 81297 substituted for the pharmacological actions of cocaine in these animals and, thereby, decreased or eliminated cocaine-seeking behavior. This interpretation is consistent with reports that D1 agonists can substitute for the reinforcing effects of cocaine in rhesus monkeys (Weed et al. 1993, 1997; Weed and Woolverton 1995) and effectively block drug-seeking behavior induced by cocaine priming injections in rats, presumably as a result of their cocaine-like pharmacological action (Self et al. 1996). In rats that showed an augmented behavioral response to the cocaine S+ after SKF 81297, responding was vigorous and sustained throughout the session. The most likely explanation for the divergent behavioral profile of SKF 81297 is that the 1.0 mg/kg dose of this D1 agonist represented a threshold dose that produced cocaine-like priming effects in some animals while substituting for the effects of cocaine in others. Another intriguing possible interpretation for the effects of SKF 81297 comes from recent electrophysiological data showing that medium spiny neurons in the nucleus accumbens receiving strong depolarizing glutamatergic input exhibit a strong excitatory response to local microiontophoretic application of SKF 81297 while neurons that receive weak glutamatergic input, not sufficient to produce burst firing, consistently show a D1-mediated decrease in excitability (Cooper and White 1999). These findings were interpreted to suggest that the level of excitatory input received by accumbal medium spiny neurons may be a determining factor in their response to D1 receptor stimulation (Cooper and White 1999). It is known that increasing the excitability of accumbal neurons by enhancing glutamatergic tone elicits cocaine-seeking behavior (Cornish et al. 1999). Thus, individual differences in the level of glutamatergic excitatory drive may have been responsible for the capability of SKF 81297 to augment reinstatement in some rats and to reduce this behavior in others.

The D2/3 agonist PD 128907 significantly suppressed the recovery of responding in the presence of the cocaine S+. The 0.3 mg/kg dose of PD 128907 corresponds to a dose previously found to increase the reinforcing actions of cocaine in self-administering rats (Caine et al. 1997). Thus, one interpretation to account for the effects of PD 128907 is that this compound substituted for the effects of cocaine, thereby decreasing the motivating actions of the cocaine cue.

Considering the results of the pharmacological tests, and in particular those obtained with the two receptor agonist compounds, it is clear that an understanding of the role of dopamine receptors as well as the possibility of a preferential mediation of this behavior by particular receptor types will require more systematic investigation in the future. Nonetheless, the attenuation of cue-induced cocaine-seeking behavior by SCH 23390 and nafadotride is consistent with the finding that cocaine-associated environmental stimuli increase extracellular dopamine levels in limbic brain regions (Weiss et al. 2000). Moreover, these data complement and extend prior evidence that has implicated D1 (Ciccocioppo et al. 1999; Spealman et al. 1999; Weissenborn et al. 1996) and D2/3 (Pilla et al. 1999) receptors in the mediation of the conditioned effects of cocaine-related environmental stimuli.

In conclusion, the results show that cocaine-seeking behavior elicited by drug-related discriminative stimuli in rats is highly resistant to extinction. The enduring responsiveness to these stimuli may involve learned or drug-induced changes in dopaminergic function, necessary for the expression of cocaine-seeking behavior induced by cocaine-predictive environmental stimuli. Further research will be essential to more precisely identify the stimulus and learning conditions, as well as dopamine receptor types mediating the motivating effects of cocaine-related environmental stimuli.