INTRODUCTION

A recent study reported cognitive deficits in human heroin addicts similar to those seen in patients with selective lesions of the temporal and frontal lobes (Ornstein et al, 2000). Brain imaging studies involving human opiate addicts have indicated that many brain areas, particularly the frontal and temporal lobes, are hypofunctional during both prolonged abstinence (Gerra et al, 1998) and acute withdrawal (Krystal et al, 1995). The suppressed activity in these cortical and limbic areas may relate to the cognitive deficits found, as well as to the symptoms of anxiety and depression that linger after the physical symptoms of withdrawal have dissipated (Jaffe, 1990).

It has been reported that generalized dysphoria and depressive symptoms can be present for weeks or months following abstinence from opiates (Dole et al, 1966; Martin and Jasinski, 1969). Many researchers have hypothesized that these symptoms of protracted opiate withdrawal involve a dysregulation of hedonic processing because of alterations in brain chemistry (Solomon, 1977; Koob and Le Moal, 1997). If opiate-withdrawn animals are dysfunctional in processing hedonic events, then it could be predicted that the withdrawn animals would have problems learning about appetitive reinforcers that predict pleasurable events, but not about other types of reinforcement (ie aversive events).

In this series of experiments, we used several paradigms to determine whether opiate-abstinent animals show altered learning. In Experiment 1, rats were trained to lever press for food on an escalating fixed-ratio schedule of reward during a 2-week period postmorphine withdrawal. In Experiment 2, these same animals were used in a conditioned suppression paradigm, in which a tone previously paired with foot shock was presented during the performance of the instrumental food task to assess the effect on responding (Experiment 2). In Experiment 3, we examined food place preference at 2 and 5 weeks postmorphine withdrawal and compared this to morphine place preference.

MATERIALS AND METHODS

Subjects

Male Sprague–Dawley rats (200–250 g) from Harlan (Indianapolis, IN) were used in all experiments. Rats were group housed in accordance with NIH guidelines on a 12-h light/dark cycle with food and water available ad libitum. All animal procedures were approved by the Institutional Animal Care and Use Committees of the Philadelphia Veterans Administration Medical Center and the University of Pennsylvania. A total of 45 animals were used with individual group numbers of seven to eight animals.

Chronic Drug Treatment

Two 75 mg morphine tablets (provided by the National Institute on Drug Abuse) were subcutaneously implanted under halothane anesthesia to induce morphine dependence. Nondependent rats were implanted at the same time with inert placebo pellets. All chronically morphine-pretreated animals are denoted by the abbreviation ‘M’, and all placebo-pretreated animals are denoted by the abbreviation ‘P’. Previous studies have shown that morphine pellets are a reliable way to induce physical dependence (Yoburn et al, 1985; Gold et al, 1994). The signs of physical dependence begin to wane around day 14 as the morphine pellets dissolve (Gold et al, 1994). Deprivation withdrawal (a model of abstinence) was induced by removing the pellets after 14 days.

Operant Food-Conditioning Procedure

Rats were subchronically pretreated with either morphine or placebo pellets, as described above (n=7 each). At 1 week after pellet removal, rats were placed on a food deprivation schedule, such that they received only 15 g of food daily, sufficient to maintain them at 85% of their free-feeding weight. The animals were fed each day after the training sessions. Training and testing took place in Med Associates operant chambers housed within sound- and light-resistant shells. Each chamber was equipped with a pellet dispenser that delivered one 45 mg Noyes pellet (formula A/I) when activated. The chamber contained one active lever on which responses were reinforced with one pellet of food and the illumination of a light above the active lever, and one inactive lever on which responses were never reinforced. A 3 W house light mounted on the top-center of the wall opposite the magazine provided illumination. Microcomputers equipped with the MED-PC program (Med Associates) controlled the equipment and recorded lever presses.

At 24 h after starting the food deprivation schedule, rats were placed in the operant chamber and allowed to bar press for food on a fixed-ratio schedule of one response per one reinforcer (FR1). During the first day of training, pellets were taped to the lever, and rats received a free pellet every 5 min to facilitate learning. Subsequent training sessions were 30 min long and rats were given two sessions per day until they reached a criterion of 40 reinforcers in a 30 min time period. Once this criterion was maintained for two consecutive sessions, rats were placed on progressively more difficult schedules of reinforcement (FR2, FR4, FR8) until they mastered an FR16 schedule. The amount of time (in minutes) it took each rat to reach the criterion of two consecutive sessions with 40 reinforcers was recorded.

Conditioned Suppression Paradigm

Once all the animals mastered the FR16 schedule, they were conditioned by receiving two tone–shock pairings in the operant chamber. The CS was a 30-s 1 kHz tone and the US was a 1 mA/0.5-s foot shock. The day after this conditioning, the animals were placed back in the operant chamber, the CS was presented alone, and the amount of time it took for the animals to resume responding on the FR16 ratio was recorded for 30 min. After 5 days, a second identical test took place.

Conditioned Place Preference Procedure

Rats were implanted with morphine (n=16) or placebo pellets (n=15), as described above. Pellets were removed after 14 days in all animals to initiate abstinence withdrawal. The place preference procedure was carried out in a Plexiglas apparatus consisting of two distinct compartments separated by a tunnel. One compartment had a grid floor with black walls, and the second compartment had a mesh floor with black and white stripes on the walls. Each compartment was equipped with photocells to record automatically the time that the animals spent in each compartment (MED Associates, East Fairfield, VT). On the first day after pellet removal, animals were allowed to explore freely all of the apparatus for 15 min, and the amount of time spent in each compartment was recorded. None of the animals had an initial bias for either compartment, and were randomly assigned to one compartment for food conditioning in a balanced design. Animals received conditioning either 1 week or 4 weeks after pellet removal. By this time all somatic signs of opiate withdrawal had dissipated. Conditioning consisted of pairing one side of the chamber with 6-oat flavored pieces of Lucky Charms cereal (General Mills, Minneapolis, MN). The other chamber had no food in it but contained an empty plastic dish similar to the one used to hold the cereal. Conditioning sessions occurred in the morning and afternoon and the presentation of the food was alternated between the morning and afternoon sessions on three separate days. Conditioning sessions were 30 min long and all rats consumed all the food presented during the sessions. Since the food was highly desirable, the rats were not food deprived for this experiment. Testing occurred 5 days following conditioning during which M and P animals were given free access to the apparatus for 15 min. The amount of time spent in each compartment was recorded.

Morphine Conditioned Place Preference Procedure

At 1 week after being placed back on a free feeding schedule, six M and six P rats from the operant food-training task were trained in a morphine place-conditioning task. Conditioning occurred at 4 weeks postmorphine withdrawal. The apparatus and procedure for this place conditioning was the same as described above for food conditioning. Conditioning took place over a 3-day period with alternating injections of morphine (10 mg/kg, i.p.) or saline given in 30 min morning and afternoon sessions. Testing took place 5 days after conditioning.

Data Analyses

For both the operant food training data and conditioned suppression data, group differences in time were compared using a t-test. Place conditioning data were analyzed by calculating the time spent in the food-paired chamber minus the time spent in the other chamber. The resulting difference score was compared between groups using one-way analysis of variance. In addition, a within-group measurement of conditioned place preference was assessed by comparing the difference in time spent in the food-paired and nonfood-paired sides preconditioning vs postconditioning. Where necessary, post hoc analysis was carried out with a Newman–Keuls test.

RESULTS

Experiment 1: Operant Food Training

As shown in Figure 1 (upper panel), M animals took significantly more time to learn to bar press for food (t(12)=2.4, p<0.015) and were significantly slower in mastering each FR requirement except for the transition between FR2 and FR4 (FR2: t=2.65, p<0.01; FR4: t=1.20, p<0.12; FR8: t=2.79, p<0.008; FR16: t=2.28, p<0.02) compared to P animals. To determine if the M rats were slower to learn because of a motor impairment, we compared response rates during the last 10 min of the session in which they reached the criterion to advance to a new FR schedule. Response rates for each group on the rewarded and nonrewarded levers are shown for each day of the experiment in Figure 1 (lower panel). As illustrated there, response rates are shifted to the right for the M animals because they were slower to advance to the higher FR schedules. The P animals reached the criterion for the FR16 by day 10, while the M animals did not reach this level until day 12. However, there were no significant differences in response rates between M and P animals once they mastered an FR schedule (ie during the last 10 min of the session in which they reached the criterion to advance to a new FR schedule). This finding indicates that motor impairment is not a likely reason for the slower learning rates found in the M animals.

Figure 1
figure 1

Acquisition of responding for food reward at each FR schedule. Upper panel: The mean amount of time for each group to achieve 40 reinforcers averaged over different 30-min sessions is depicted: *Significantly different (p<0.05). Lower panel: Mean number of lever presses made by each group over the different days of training. P animals complete the criterion for the FR16 by day 10.

Experiment 2: Conditioned Suppression

As can be seen from Figure 2, M animals took significantly longer to resume responding on an FR16 schedule for food after the presentation of the CS previously paired with shock on both test day 1 (t(12)=4.1 p<0.01) and test day 2 (t(12)=2.3 p<0.05). On test day 1 only two out of seven M animals resumed responding in the 30 min time period, while all seven of the P animals resumed responding. On test day 2, four of seven M animals resumed responding in the 30 min time period, compared to all seven P animals.

Figure 2
figure 2

Mean amount of time to resume responding for food after the presentation of a tone that had been paired with shock for M and P groups. The maximum amount of time was 30 min: *Significantly different (p<0.05), **significantly different (p<0.01).

Experiment 3: Food Place Conditioning

Figure 3 (upper panel) shows the results of the food place conditioning study. A two-way ANOVA disclosed a significant main effect for morphine treatment (F(1, 27)=16.21, p<0.001), and further analysis revealed that P rats expressed a significantly greater preference for the food-paired chamber than M rats. Comparison analysis of each group pre- and postconditioning indicated that all groups except the 2-week M group showed a significant place preference (p<0.001). A comparison of body weights in the 2-week groups revealed no significant difference between M and P animals, indicating that M rats were physically recovered from withdrawal and eating normally. In a previous study (Harris and Aston-Jones, 2002), we found that nonconditioned P and M groups given saline injections for three days paired with both compartments did not show a significant place preference (p=0.12; F(1, 24)=2.4, indicating that the place preference obtained in this study was not because of a side bias.

Figure 3
figure 3

Upper panel: Preference scores for the food-paired environment expressed as the mean time in seconds spent in the food-paired side minus the mean time in seconds spent in the nonfood paired side on the test day. Weeks 2 and 5 refer to the number of weeks of morphine abstinence at the time of testing: *Significantly different (p<0.01). Lower panel: Preference scores for the morphine-paired environment expressed as the mean time in seconds spent in the morphine-paired side minus the mean time in seconds spent in the saline-paired side on the test day: *Significantly different (p<0.01).

Figure 3 (lower panel) shows the results of morphine place conditioning in rats withdrawn at 5 weeks. Both M and P groups showed significant preferences for the morphine-paired environment (F(1, 10)=47.17 for M and 52.81 for P, p<0.001), but M animals showed significantly greater preference than P animals (F(1, 10)=14.65, p<0.01). An analysis of locomotor activity on the test day revealed no significant differences in activity in either the morphine paired side (p=0.61) or during the entire session (p=0.09).

DISCUSSION

These experiments indicate that during protracted abstinence withdrawal from morphine animals are slower to learn about environmental cues that predict food reinforcement, while at the same time they readily learn about cues predictive of morphine reward. The M animals were significantly slower to learn the operant response for food reward, and were significantly slower to increase their responding when the response requirements for food were elevated. When tested at 2 weeks postwithdrawal, M rats failed to show a place preference for a food reward, and at 5 weeks postwithdrawal M animals developed a food place preference that was significantly smaller than the preference expressed by P animals. In contrast, when tested at 5 weeks postwithdrawal M animals showed significantly greater morphine place preference than P animals, and also showed significantly greater retention of learning in the conditioned suppression paradigm.

The fact that M animals were slower to learn an operant food reward task and failed to prefer an environment associated with food reward probably does not reflect a lack of desire for food. All rats in the morphine groups consumed all of the food they were given in both experiments. In addition, once the morphine-abstinent M subjects learned an FR schedule in the instrumental-conditioning task, they obtained as many reinforcers in a session as the nonmorphine-exposed P rats. Furthermore, the body weights of the M rats were not significantly different from those of P animals, and M rats appeared to be completely recovered from the experience of opiate withdrawal by the time of testing.

M rats were slower to learn the association between bar pressing and receiving food. They were also slower to adapt when the response requirements for food were increased. It seems unlikely that their sluggishness in learning the elevated FR schedules were because of motor impairments because when they eventually learned the more demanding task requirements, they responded at rates similar to those seen in P animals. In addition, M rats did not have any apparent physical or behavioral abnormalities nor were they less active or less exploratory than P animals.

In the food place-conditioning experiment, 2-week-withdrawn M rats showed no preference for the food-paired environment. The 5-week-withdrawan M group developed a preference for the food-associated chamber that was much smaller than that found in P animals. All rats (including the morphine-abstinent rats) eagerly consumed all of the food reward immediately upon being placed in the food chamber, indicating that they found the food palatable. One possible explanation for the difference between M and P rats in food preference conditioning is that prior morphine exposure may alter the valence of natural reinforcers. It has been postulated that drugs of abuse alter the brain mechanisms involved in reward regulation and that these powerful neuropharmacological effects could ‘short circuit’ the neurochemistry of reward (Koob and Le Moal, 2001). Previous studies have also reported a decrease in the motivation for natural reinforcers, following amphetamine withdrawal (Barr and Phillips, 1999; Barr et al, 1999). In the opponent-process view (Solomon and Corbit, 1974), these reward systems are thought to become downregulated with chronic drug exposure to compensate for continuous stimulation of reward-related receptors during drug exposure. When the drug is removed during abstinence, this ‘antireward’ response is uncovered and leads to symptoms of anhedonia, depression, and anxiety. Continued drug use is thought to be rewarding not only because it stimulates the natural reward circuitry, but also because of the reinforcement associated with offsetting the antireward response (including stress hyper-responsivity and anxiety, as discussed above).

In support of this hypothesis, we found that M animals withdrawn at 5 weeks showed much larger preferences for the morphine-associated environment than P animals. This replicates our previous reports of increased morphine-associated preference at 2 or 5 weeks postwithdrawal (Harris and Aston-Jones, 2001, 2002). In the nondependent P subjects, the preference for the food-associated chamber was twice as high as that found for the morphine-associated chamber. The exact opposite was true for the M animals, in which the preference for morphine-associated cues was twice as that found for food-associated cues. At 5 weeks postwithdrawal, the lack of motivation for food-reinforced responding seems to be diminishing, possibly indicating recovery of normal reward function. However, the motivation to seek out morphine-associated environmental cues has not decreased. This indicates that the changes that occur in the reward mechanisms following opiate exposure may involve multiple systems that recover at different rates.

In a theory outlined by Di Chiara (1998), (1999), natural reinforcers release DA in the accumbens shell and this response is thought to rapidly habituate. It is further proposed that the release of DA produced by drugs such as morphine does not habituate, thereby leading to a relative strengthening of the connections between environmental stimuli and drug reward. Such a mechanism may be involved in the altered responsiveness to morphine- vs food-paired stimuli that we observed. For example, morphine pretreatment may have sensitized animals to the reinforcing effects of morphine, thereby leading to increased preference for the morphine-paired environment. Although we did not see evidence of locomotor sensitization in the M group on the morphine place-conditioning test day, others have shown that morphine pretreatment can lead to a sensitization to the reinforcing effects of morphine using a place-conditioning paradigm (Shippenberg et al, 1996; Shippenberg and Elmer, 1998).

Unlike the impaired performance in the instrumental tasks, M animals showed better performance than P rats in the conditioned suppression experiments. Thus, M rats displayed heightened retention of the tone–shock pairings, and a majority of M animals did not even resume responding during the entire 30 min session whereas this did not occur for any P rat. These findings indicate that M animals are not impaired in learning in general. We speculate that the enhanced learning shown in this paradigm by M animals may be because of heightened stress reactivity and anxiety. Previously, we found that morphine-abstinent animals show prolonged augmented anxiety-like behaviors and enhanced retention of this behavior using a conditioned burying paradigm (Harris and Aston-Jones, 1993, 2001).

Furthermore, it has been well documented in humans that former opiate addicts show prolonged hyper-responsiveness to stress and that the hypothalamic–pituitary–adrenal axis is altered causing augmented release of corticotropin-releasing factor (CRF) and adrenocorticotropin hormone (Kreek, 1987; Kreek and Koob, 1998). This altered responsiveness to stress has been hypothesized to facilitate relapse in human addicts (Kreek, 1973, 1987; Childress et al, 1987; Kreek and Koob, 1998). In animals, exposure to stress has been shown to reinstate extinguished opiate self-administration (Shaham et al, 2000; Shalev et al, 2002), further establishing the link between stress and relapse to drug-seeking behaviors. Increases in CRF transmission in both the amygdala and bed nucleus of the stria terminalis (BNST) have also been reported following opiate withdrawal (Kreek and Koob, 1998; Sarnyai et al, 2001). Increases in both norepinephrine and CRF in the BNST, and CRF in the amygdala, have been associated with the anxiety-like behaviors (Lee and Davis, 1997; Cecchi et al, 2002) and specifically with opiate withdrawal-induced aversions (Heinrichs et al, 1995; Delfs et al, 2000). These findings may explain the increased anxiety and greater retention of fear-associated contextual memories that accompany opiate abstinence.

Withdrawal from excessive opiate exposure appears to alter the motivational properties of natural reinforcers. It can lead to decrements in the motivation to acquire other appetitive reinforcers and increase anxiety levels. These results indicate some of the severe problems that addicts face during attempts to maintain drug abstinence. A better understanding of the brain mechanisms that underlie these changes in affective processing could lead to the development of more successful treatments for opiate addiction.