INTRODUCTION

Methamphetamine (METH), a highly addictive psychostimulant drug, is a major public health concern. Exposure to METH results in persistent, partial loss of dopamine (DA), particularly in dorsal striatum, and serotonin (5-HT) systems in striatum, hippocampus, amygdala, and prefrontal cortex (Morgan and Gibb, 1980; Ricaurte et al, 1980; Seiden et al, 1976; Wagner et al, 1980). In humans, METH-induced neurotoxicity is evident as decreases in DA transporter binding in caudate-putamen (McCann et al, 1998; Volkow et al, 2001a, 2001b; Wilson et al, 1996) and SERT binding across multiple brain regions, including caudate-putamen and frontal cortex (Kish et al, 2009; Sekine et al, 2006). Exposure to neurotoxic regimens of METH is also associated with loss of glutamatergic neurons in somatosensory cortex (Eisch et al, 1998b; Pu et al, 1996). Thus, exposure to METH induces a wide range of neurotoxic insults in the brains of both rodents and humans.

Given the breadth of changes induced by METH, it is not surprising that cognitive sequelae are apparent after such exposure. Indeed, studies of abstinent METH abusers reveal motor slowing, memory impairment, and persistent psychiatric symptoms (McCann et al, 1998; Sekine et al, 2001; Volkow et al, 2001a). Likewise, in rats, exposure to neurotoxic regimens of METH is associated with decreased locomotor responses (Wallace et al, 1999) and impaired performance on the Morris water maze task (Friedman et al, 1998), active avoidance and balance beam tasks (Walsh and Wagner, 1992), a novel object recognition task (Herring et al, 2008; Schroder et al, 2003), a path-integration task (Herring et al, 2008), and a reversal learning task (Izquierdo et al, 2010). Our previous work also shows that METH-induced DA loss in striatum is correlated with impaired sequential motor learning (Chapman et al, 2001; Daberkow et al, 2005), a task dependent on dorsal striatal function (DeCoteau and Kesner, 2000). It is therefore evident that METH-induced neurotoxicity negatively affects numerous behavioral functions, including those known to be dependent on function of dorsal striatum.

Recent studies of instrumental conditioning focusing on striatum-dependent learning have established that two dissociable associative processes—action–outcome (A–O; goal-directed) vs stimulus–response (S–R; habitual)—can control instrumental actions, and that these processes are mediated, at least to some extent, by dorsomedial (DM) vs dorsolateral (DL) aspects of striatum, respectively (Balleine and Dickinson, 1998; Balleine et al, 2009; Faure et al, 2005; Yin and Knowlton, 2006; Yin et al, 2004, 2005b). During instrumental learning, actions early in training are typically mediated by A–O, whereas actions late in training are mediated by S–R associations (Grahn et al, 2009; Wickens et al, 2007; Yin and Knowlton, 2006; Yin et al, 2004, 2005b, 2009). Additionally, manipulating the contingency between the instrumental response and outcome delivery during training influences the association underlying instrumental performance. Specifically, ratio schedules of reinforcement promote the formation of A–O associations, whereas interval schedules of reinforcement promote the formation of S–R associations (Balleine and Dickinson, 1998; Dickinson, 1985; Yin and Knowlton, 2006). Thus, the involvement of DM and DL striatal regions in instrumental responding will presumably vary as a consequence of degree of instrumental training and type of reinforcement schedule used during training.

Several lines of evidence suggest that corticostriatal circuitry, and particularly DA modulation of that circuitry, are critical for formation of the specific associations underlying instrumental learning and performance (Balleine and O’Doherty, 2010). For example, reinstatement of DA signaling in dorsal striatum by viral gene transfer restores instrumental conditioning in DA-depleted mice (Darvas and Palmiter, 2009; Robinson et al, 2007). Furthermore, rats with depletions of DA in posterior DM striatum (Lex and Hauber, 2010) or DL striatum (Faure et al, 2005) show intact instrumental conditioning; however, rats with the former lesions show apparent disruption of A–O associations, whereas rats with the latter lesions show disruption of S–R associations. Finally, sensitization of DA signaling in amphetamine-pretreated rats leads to rapid transition of instrumental behavior control from A–O to S–R associations (Nelson and Killcross, 2006). Taken all together, these data suggest that DA signaling in dorsal striatal subregions is an important determinant of the associative processes underlying instrumental behavior. Thus, given the impact of METH on the DA innervation of dorsal striatum, as well as other aspects of corticostriatal circuitry, we wished to determine the impact of METH-induced neurotoxicity on A–O vs S–R processes mediating instrumental behavior. Therefore, the present study was conducted to investigate the hypothesis that METH-induced neurotoxicity would alter A–O or S–R associations underlying striatum-dependent instrumental conditioning.

MATERIALS AND METHODS

Animals and Drug Treatment

Male Sprague–Dawley rats (300–325 g; Charles River, Wilmington, MA) housed in hanging wire cages in a temperature- and light-controlled (12 : 12 h) room were allowed free access to food and water. All animal use was approved by the Institutional Animal Care and Use Committee of the University of Utah, and was in accordance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals.

(±)-Methamphetamine hydrochloride was provided by the National Institute on Drug Abuse (Rockville, MD). Doses were calculated as free base. The night before drug treatment, rats were weighed and re-housed in plastic tub cages (33 × 28 × 17 cm, 6 rats per tub). The next day, rats received four injections at 2-h intervals of either 0.9% saline or METH (7.5 or 10 mg/kg, s.c.). Body temperatures were monitored via a rectal probe every hour during the injections. If core temperature exceeded 41 °C, the rat was removed and put in a cage over ice to decrease hyperthermia. Twelve hours after the last injection, rats were returned to their home cages and given free access to food and water. Two weeks later, all rats were re-housed singly and food restricted for 1 week (10–15 g chow per day) to achieve approximately 80% of free-feeding weight.

Experiment 1: Effect of METH Pretreatment on A–O Associated Learning

Lever press training

Behavioral training and testing took place in standard operant chambers within sound- and light-attenuating boxes (Coulbourn Instruments, Whitehall, PA). Each cage had two retractable levers on either side of a recessed magazine and pellet trough. Upon appropriate lever press, a liquid dripper delivered 0.08 ml of 20% sucrose solution into the magazine, or a pellet dispenser delivered a 45-mg sucrose pellet (Research Diets, New Brunswick, NJ) into the trough. A 3-W, 24-V house light, mounted on the top center of the wall opposite the magazine, provided illumination. A computer equipped with Graphic State 3.0 software (Coulbourn Instruments) controlled the equipment and recorded lever presses.

Rats (saline-pretreated, n=14; METH-pretreated, n=14) were first trained with two 30-min magazine training sessions (ie, one session for each reinforcer) per day for 2 days, during which reinforcer (sucrose solution or pellets) was delivered on a random time, 60-s schedule, with no levers present. The next day, lever-press training began on a continuous reinforcement schedule (CRF) for 2 days. Training then proceeded through progressively leaner schedules of reinforcement (random ratio-5 (RR-5), RR-10, RR-20). An RR schedule was used, because it promotes the formation of the A–O associations underlying instrumental behavior (Dickinson, 1985; Yin et al, 2004, 2005a). Rats trained on each schedule for 2 days before progressing to the next schedule. Each day, rats were given a 30-min session for each reinforcer with a 2-h break between sessions. Lever/reinforcer pairings were counterbalanced across rats in each group. The order of the sessions alternated each day for each rat. Each training session began with illumination of the house light and insertion of the lever, and ended with retraction of the lever and turning off of the light. Once training began, rats were fed each day after the training sessions (25–30 g), and given free access to water.

Outcome devaluation

After the last day of training, rats were given free exposure in the home cage to one outcome (sucrose pellet or solution) for 30 min, with half of the rats in each group getting access to each outcome. Immediately after these pre-feeding sessions, a 10-min extinction test was given in which rats were placed in operant chambers with both levers present. Lever presses were recorded, but had no scheduled consequences.

Contingency degradation

After the extinction test, rats were retrained for 2 days on RR-20 schedules. They then underwent 4 days of contingency degradation in which one outcome (pellet or sucrose solution; counterbalanced across animals) for each rat was delivered non-contingently, such that the probability of its delivery in each second of the training session was equally likely, whether the rat responded or not, as previously described (Yin et al, 2005b). Two 20-min sessions were given each day, one on each lever, with at least a 2-h break between sessions. The order of the sessions was alternated across days for each rat. After the fourth day of training with contingency degradation, rats received a 10-min extinction test during which lever pressing was recorded, but had no scheduled consequences.

Experiment 2: Effect of METH Pretreatment on S–R Associated Learning

Lever press training

Rats were trained in operant chambers as described above. However, the chamber was equipped with only one retractable lever on one side (left or right) of a recessed magazine. When activated, a liquid dripper delivered 0.08 ml of a 20% sucrose solution into the magazine.

Training of all rats (saline-pretreated, n=31, METH-pretreated, n=29) began with two 30-min magazine training sessions for 2 days. After 2 days of magazine training sessions, lever-press training began on a CRF schedule, followed by sessions in which the sucrose solution was delivered on an random interval (RI-15) second, RI-30, and then RI-60 schedule of reinforcement as previously described (Yin et al, 2004), because interval schedules of reinforcement promote the formation of S–R associations (Dickinson, 1985). Rats completed 2 days of training on each schedule, with two sessions per day. Each 30-min training session began with illumination of the house light and insertion of the lever into the chamber, and ended with retraction of the lever and extinction of the house light.

Outcome devaluation and extinction test

The day after the last training session, the outcome was devalued in half of the rats using lithium chloride (LiCl)-induced conditioned taste aversion (CTA). For each of three daily sessions, rats were allowed to drink the 20% sucrose solution from a glass bottle in the home cage for 30 min. At that point, half of the saline- (n=16) and METH-pretreated (n=15) rats received an injection of 0.15 M LiCl (20 ml/kg, s.c.), whereas the rest of the saline- (n=15) and METH-pretreated (n=14) rats received an injection of saline (0.9%, 20 ml/kg, s.c.). The amount of sucrose consumed each day was recorded. The day after the final CTA session, all rats were returned to operant chambers for a 10-min extinction test, during which lever pressing had no scheduled consequence.

Tissue Preparation

At the end of each experiment, rats were killed by exposure to CO2 (1 min). Brains were removed rapidly and frozen in 2-methylbutane (Mallinckrodt Baker, Phillipsburg, NJ), chilled on dry ice. Brains were stored at −80 °C until cryosectioned at 12-μm (Cryocut 1800, Cambridge Instruments, Bayreuth, Germany). Fresh-frozen sections (Bregma: +1.60 to −1.40 mm, (Paxinos and Watson, 1998)) were collected onto slides and stored at −20 °C. While sectioning, a blunt-tip, 18-gauge needle was used to collect 1-mm3 punches from both DM and DL striatum for determination of monoamine content.

Tyrosine Hydroxylase (TH) Immunohistochemistry

Immunohistochemistry for TH was performed on fresh-frozen sections as previously described (Chapman et al, 2001; Daberkow et al, 2005). Sections were washed in PBS, fixed, and rinsed in PBS. Endogenous peroxidase activity was quenched with H2O2. Sections were then blocked in normal horse serum, and incubated overnight in mouse anti-TH primary antibody (1 : 1000 dilution, Immunostar, Hudson, WI) at 4 °C. Slides were then washed in PBS, and incubated in horseradish peroxidase-coupled donkey anti-mouse IgG (1 : 400 dilution, Vector Laboratories, Burlingame, CA) for 2 h at 4 °C. Sections were then rinsed, incubated in diaminobenzidine (Santa Cruz Biotechnology, Santa Cruz, CA) with nickel intensification, dehydrated, and cover-slipped.

Determination of Monoamine Content in Tissue via HPLC–ECD

Dopamine and 5-HT content in tissue punches from DM and DL striatum was determined as previously described (Chapman et al, 2001; Daberkow et al, 2005). Punches were sonicated in tissue buffer (0.05 M sodium phosphate/0.03 M citric acid, 25% methanol (v/v), pH 2.5), and centrifuged (22 000 g) twice. Twenty microliters of supernatant were injected onto a high-pressure liquid chromatography system coupled to an electrochemical detector (HPLC–ECD; EOx=+0.6V; Decade, Antech-Leyden, The Netherlands). A Whatman PartiSphere C-18 column (250 × 4.6 mm, 5 μm) and mobile phase (pH 2.87; flow rate 0.5 min/ml; MeOH (23% v/v), sodium octyl sulfate (0.03% w/v), EDTA (0.1 mM), sodium phosphate dibasic (0.05 M), and citric acid (0.03 M)) were used to separate monoamines. All values were expressed per microgram of protein determined with the Bradford assay (Bio-Rad Life Science Research, Hercules, CA).

Image Analysis

Images of TH immunohistochemical staining were captured and analyzed using the image analysis program, ImageJ (National Institute of Health; http://rsbweb.nih.gov/ij), as previously described (Chapman et al, 2001). Images of sections from all groups within an experiment were digitized and saved under constant lighting and camera conditions. Images were coded so that all subsequent analyses were conducted by an experimenter blinded to the treatment groups. Basic densitometric analysis yielded average density values over the DM and DL striatum. The average gray value of the white matter overlying the dorsal striatum was subtracted from these values to correct for background labeling. Four sections (Bregma: +1.60 to −0.26 mm, (Paxinos and Watson, 1998)) of DM and DL striatum (averaged across left and right hemispheres) from each animal were used for analysis.

Data Analysis

All data were analyzed by analysis of variance (ANOVA), followed by Tukey–Kramer post-hoc analyses using JMP 7.0. (SAS, Cary, NC). Statistical significance was set at p0.05.

RESULTS

METH-Induced Monoamine Depletions

Pretreatment of rats with METH resulted in significant decreases in the DA innervation of the DM and DL striatum, as determined by both TH immunohistochemistry (Figure 1a; Experiment 1 and 2) and determination of DA tissue content (Figure 1b, Table 1; Experiment 2). TH immunohistochemical staining in the DM striatum and DL striatum was decreased by 20.1±3.8% (mean±SEM, n=43; F1, 86=10.93, p<0.01) and 15.6±3.6% (mean±SEM, n=43; F1, 86=7.36, p<0.01) in DM and DL striatum, respectively, of METH-pretreated animals used in Experiments 1 and 2 (Figure 1a). There was no significant difference between the degree of these decreases in TH immunohistochemical staining across the two experiments in DM (t=−0.44, p=0.65) and DL striatum (t=−0.23, p=0.81); therefore, the data were collapsed. Determination of DA tissue content by HPLC–ECD yielded larger estimates of the degree of the DA depletions in both DM and DL striatum (Figure 1b). The tissue content of DA was decreased 46.4±6.4% (mean±SEM, n=29, F1, 58=22.56, p<0.001) and 53.3±4.6% (mean±SEM, n=29, F1, 58=13.11, p<0.001) in the DM and DL striatum, respectively, of METH-pretreated rats used in Experiment 2. As shown in Figure 1c, there are significant correlations (r2=0.40 and 0.37, p<0.001 for DM and DL striatum, respectively) between the degree of DA loss estimated by TH immunohistochemistry and that estimated by DA tissue content in the animals used in Experiment 2, for which both measures were performed. Interestingly, however, the magnitude of the estimated DA depletions in DM and DL striatum determined by HPLC–ECD was always greater than the estimate determined by TH immunohistochemical staining.

Figure 1
figure 1

Methamphetamine (METH)-induced dopamine (DA) depletions in the dorsomedial (DM) and dorsolateral (DL) striatum. The magnitude of DA depletions assessed by (a) TH immunohistochemistry in DM and DL striatum (mean gray values (arbitrary units)±SEM expressed as % of saline (Sal)-pretreated rats, n=45; METH-pretreated rats, n=43) and (b) DA tissue content in tissue punches from the DM and DL striatum (b; pg/μg protein±SEM expressed as % of Sal-pretreated rats, n=31; METH-pretreated rats, n=29). See Table 1 for actual values of DA and 5-HT tissue content. (c) Correlation between the estimate of DA depletions by TH immunohistochemistry and by tissue DA content. **Significant main effect of METH-pretreatment, p<0.01.

PowerPoint slide

Table 1 Monoamine Content in Dorsal Striatum of Saline- and Methamphetamine (METH)-Pretreated Rats

In addition to the DA depletions and consistent with earlier research, METH pretreatments also induced partial depletions of 5-HT in DM (46.1±7.9%; mean±SEM, n=28; the level of 5-HT in one animal was excluded as a statistical outlier by Grubbs’ test, F1, 57=19.68, p<0.001) and DL (31.4±9.3%; mean±SEM, n=29, F1, 58=6.39, p<0.05) striatum, respectively (Table 1). However, only DA levels in the DM and DL striatum correlated with instrumental responding during the extinction test in Experiment 2 (Figure 2a, DM striatum, r2=0.11, p=0.01, n=58 (saline- and METH-pretreated groups combined); Figure 2b, DL striatum, r2=0.10, p=0.02). The degree of 5-HT innervation in the DM and DL striatum was not significantly correlated with the instrumental responding (Figure 2c, DM striatum, r2=0.03, p=0.16; Figure 2d, DL striatum, r2=0.03, p=0.20).

Figure 2
figure 2

Correlations between dopamine (DA) (a and b) and serotonin (c and d) content (pg/μg protein) in dorsomedial (DM; a and c) and dorsolateral (DL; b and d) striatum, and lever presses/min during extinction testing in rats trained on the random interval (RI) schedule of reinforcement (Experiment 2). DA and 5-HT content were quantified in 1-mm3 tissue punches from each region via high-performance liquid chromatography (HPLC) with electrochemical detection. *Significantly correlated, p<0.05.

PowerPoint slide

Experiment 1: Effect of METH Pretreatment on A–O Associated Learning

To evaluate the effects of METH-induced neurotoxicity on the development and expression of A-O learning, saline- and METH-pretreated rats were trained using a RR schedule of reinforcement on a differential lever-pressing task. Two-way ANOVAs on the number of lever presses during the sucrose pellet and sucrose solution training sessions revealed significant main effects of training during both sessions (sucrose pellet, F3, 24=90.35, p<0.001; sucrose solution, F3, 24=60.17, p<0.001), but no significant pretreatment effects (sucrose pellet, F1, 26=0.81, p=0.38; sucrose solution; F1, 26=0.29, p=0.6) and no significant interactions (training × pretreatment; sucrose pellet, F3, 24=1.68, p=0.2; sucrose solution, F3, 24=1.39, p=0.27; Figure 3). Thus, all rats, regardless of pretreatment, learned to press the levers showing increased responding with increased ratio requirements from CRF to RR-20. Furthermore, there was no effect of METH pretreatment on the acquisition of instrumental responding under these conditions for either the sucrose pellet (Figure 3a), or the 20% sucrose solution reinforcer (Figure 3b). Overall, for all rats, no preference toward sucrose pellet vs sucrose solution as a reinforcer was evident, as the numbers of lever presses by rats in each session did not differ (data not shown; t=−1.03, p=0.30).

Figure 3
figure 3

Experiment 1. Effect of METH-pretreatment on action–outcome (AO) associated instrumental learning. Response rates over the course of training during continuous reinforcement (CRF) and on random ratio (RR) schedules with (a) sucrose pellets and (b) sucrose solution as reinforcers. Data are expressed as mean lever presses/minute (±SEM, Sal-pretreated, n=14; METH-pretreated, n=14) from average lever presses for each animal across the 2 days on the CRF, RR5, RR10, and RR20 schedules of reinforcement during instrumental training. *Significantly different from all other schedules, p<0.05. #Significantly different from CRF and RR5 schedules, p<0.05.

PowerPoint slide

To assess whether the instrumental responding observed in rats trained on the RR schedule reflected an underlying A–O association, we determined the effects of outcome devaluation and contingency degradation on the established instrumental behavior, as sensitivity to these manipulations is indicative of A–O/goal-directed behavior (Dickinson, 1985; Yin et al, 2008). In this experiment, outcome devaluation was accomplished through reinforcer-specific satiety, followed by a choice extinction test, as previously reported (Lex and Hauber, 2010; Yin et al, 2005b). A two-way ANOVA (pretreatment × devaluation) on the choice extinction test performance (ie, lever-pressing) revealed a significant main effect of devaluation (F1, 52=15.03, p<0.001) and a main effect of METH pretreatment (F1, 52=3.97, p=0.05), but no significant interaction (F1, 52=0.17, p=0.68). Thus, all rats decreased responding on the lever paired with reinforcer that was devalued, and METH-pretreated rats pressed less than did the saline-pretreated rats on both the devalued and the non-devalued levers during the extinction test (ie in the absence of any outcome delivery) (Figure 4). These results thus show intact A–O (goal-directed) encoding underlying instrumental behavior in METH-pretreated rats, and further suggest that METH-pretreated rats, if anything, may be more sensitive to manipulations of reinforcer availability.

Figure 4
figure 4

Experiment 1. Effect of outcome devaluation on extinction responding in saline (Sal)- vs methamphetamine (METH)-pretreated rats trained on random ratio (RR) schedules. Response rates (lever presses/min, mean±SEM) during the 10-min extinction test by Sal- (n=14) and METH-pretreated rats (n=14) on the lever previously yielding delivery of the outcome now devalued (deval) by free feeding on that outcome in the home cage for 30 min before the extinction session, and on the lever previously yielding delivery of the other, non-devalued outcome (ie, still valued; val). Lever presses/min on the last day of RR20 training also are shown for comparison. *Significant main effect of devaluation, p<0.05. #Significant main effect of pretreatment, p<0.05.

PowerPoint slide

After the outcome devaluation and extinction test, instrumental responding in rats in Experiment 1 was re-established on the RR-20 schedule of reinforcement for 2 days (Figure 5a). Then, over the course of 4 days during which the contingency between lever pressing and reinforcer delivery was degraded for one reinforcer (see Materials and Methods), both saline- and METH-pretreated rats decreased responding on the lever paired with the degraded (Degr) reinforcer, but not on the other lever paired with non-degraded (Ndegr) reinforcer (Figure 5a). The repeated measures, mixed-factor ANOVA (pretreatment × degradation × day) revealed a main effect of degradation (F1, 52=21.75, p<0.001), a main effect of day (F3, 50=10.57, p<0.001), and a significant interaction between degradation and day (F3, 50=9.63, p<0.001), indicating that both saline- and METH-pretreated rats showed an increasing effect of contingency degradation on responding over the 4 days (Figure 5a). Importantly, there was no main effect of pretreatment (F1, 52=0.06, p=0.81) and no pretreatment × degradation (F1, 52=0.25, p=0.62), pretreatment × day (F3, 50=2.52, p=0.07) or pretreatment × degradation × day (F3, 50=1.21, p=0.32) interactions, indicating that rats with METH-induced neurotoxicity were similarly sensitive to contingency degradation relative to saline-pretreated controls.

Figure 5
figure 5

Experiment 1. Effect of contingency degradation on extinction responding in saline (Sal)- vs methamphetamine (METH)-pretreated rats trained on random ratio (RR) schedules. (a) Response rates during the last day of retraining on RR20 schedule (Re-RR20), and the 4 days of contingency degradation training. Data are expressed as mean lever presses/min (±SEM) in Sal- (n=14), and METH-pretreated rats (n=14) on the lever for which the contingency between lever press and outcome delivery was not being degraded (Ndegr), and on the lever for which the contingency was being degraded (Degr). **Significantly greater responding on the Ndegr lever vs the Degr on that day, p<0.01. (b) Mean lever presses/min (±SEM) on the non-degraded (Ndegr) and degraded (Degr) levers during the 10-min extinction test conducted the day after the last contingency degradation session. *Significant main effect of contingency degradation, p<0.05.

PowerPoint slide

After the degradation training, all rats again received a choice extinction test (Figure 5b). The two-way ANOVA (pretreatment × degradation) on lever pressing during the extinction test revealed a significant main effect of degradation (F1, 52=5.25, p<0.03), but no main effect of pretreatment (F1, 52=0.19, p=0.67), and no pretreatment × degradation interaction (F1, 52=0.30, p=0.59). The similar decrease in lever pressing for reinforcers when the contingency between the lever pressing (action, A) and delivery of the outcome (O) was degraded suggests again that the instrumental responding in both saline-pretreated rats and rats with METH-induced neurotoxicity similarly reflected an underlying A–O association.

Experiment 2: Effect of METH-Pretreatment on S–R Associated Learning

To evaluate the effects of METH-induced neurotoxicity on the development and expression of S–R learning, saline- and METH-pretreated rats were trained using a RI schedule of reinforcement, as such schedules are known to promote the formation of S–R associations (Dickinson, 1985). As was the case for training in Experiment 1 under the RR schedule of reinforcement, METH-pretreated rats showed intact instrumental learning under this RI reinforcement schedule (Figure 6a). The repeated-measure ANOVA (pretreatment × training) revealed a main effect of training (F3, 56=8.77, p<0.001), but no main effect of pretreatment (F1, 58=0.29, p=0.59), and no significant interaction (F3, 56=0.67, p=0.57). To then assess whether this instrumental behavior reflected an underlying S–R association, half of the rats in the saline- and METH-pretreated groups were subjected to LiCl-induced CTA. As shown in Figure 6b, both saline- and METH-pretreated rats subject to LiCl injections after consumption of the reinforcer in the home cage decreased consumption of the reinforcer over days. The repeated-measures ANOVA (pretreatment × devaluation × day) verified a significant main effect of devaluation (F1, 56=255.38, p<0.001), a main effect of day (F2, 55=74.74, p<0.001), and significant devaluation × day interaction (F2, 55=135.42, p<0.001). There was no main effect of pretreatment (F1, 56=0.48, p=0.49), and no significant pretreatment × devaluation (F1, 56=0.13, p=0.72), pretreatment × day (F2, 55=1.14, p=0.33), or pretreatment × devaluation × day (F2, 55=0.45, p=0.64) interactions. Thus, the development of the CTA and, therefore, the devaluation of the reinforcer (20% sucrose solution) were similar for METH-pretreated rats relative to saline-pretreated controls.

Figure 6
figure 6

Experiment 2. Effects of methamphetamine (METH)-pretreatment on stimulus–response (S–R) associated instrumental learning. (a) Response rates (lever presses/min) of saline (Sal)- (n=29), and METH-pretreated rats (n=31) over the course of training during continuous reinforcement (CRF), and on random interval (RI) schedules of reinforcement. Data are expressed as mean lever presses/minute (±SEM) from the average lever presses for each animal across the 2 days on the CRF, RI15, RI30, and RI60 schedules of reinforcement during instrumental training. (b) Development of lithium chloride (LiCl)-induced conditioned taste aversion (CTA) over days. Data are expressed as mean ml (±SEM) of sucrose solution consumed in the home cage over 1 h by animals, then injected with vehicle (Veh) or LiCl (Sal, Veh, n=15; METH, Veh, n=14; Sal, LiCl, n=16; METH, LiCl, n=15). **Significantly different from consumption in LiCl-treated animals, p<0.01.

PowerPoint slide

After the final day of CTA pairings, all rats were returned to the operant chambers for an extinction test to determine the effects of the outcome devaluation on the instrumental behavior. Consistent with earlier reports of animals trained on the random-interval schedule (Yin et al, 2004; Yu et al, 2009), instrumental responding of the saline-pretreated rats during the extinction test was insensitive to outcome devaluation (Figure 7a and b), indicating an S–R association underlying the instrumental responding. Instrumental responding in the METH-pretreated rats, on the other hand, was sensitive to the outcome devaluation (Figure 7a and b). Two-way ANOVA (pretreatment × devaluation) on the number of lever presses per minute during the extinction test (Figure 7a) revealed a main effect of pretreatment (F1, 56=8.26, p<0.01), but no main effect of devaluation (F1, 56=2.55, p=0.12), and no significant interaction (F1, 56=1.47, p=0.23). Planned post-hoc comparisons revealed that outcome devaluation significantly decreased responding in the METH-pretreated (p<0.05), but not the saline-pretreated rats (p=0.81). Similarly, a two-way ANOVA (pretreatment × devaluation) on the ratio between responding during the extinction test after the outcome devaluation and responding during the last RI-60 training session revealed main effects of pretreatment (F1, 56=12.11, p<0.01) and devaluation (F1, 56=9.05, p<0.01), but no significant interaction (F1, 56=2.226, p=0.14; Figure 7b). Planned post-hoc comparisons (Tukey–Kramer) again revealed that outcome devaluation significantly decreased responding in the METH-pretreated (p<0.01), but not the saline–pretreated rats (p=0.31). Thus, the METH-pretreated rats undergoing reinforcer devaluation pressed less than all other groups during the extinction test, suggesting impaired formation of S–R associated learning in METH-pretreated rats.

Figure 7
figure 7

Experiment 2. Effect of outcome devaluation via LiCl-induced conditioned taste aversion (CTA) on extinction responding in saline (Sal)- vs methamphetamine (METH)-pretreated rats trained on RI schedules of reinforcement. (a) Response rates during the 10-min extinction test (lever presses/min, mean±SEM) by Sal- and METH-pretreated rats for whom the outcome was devalued (Deval) by LiCl-induced CTA (Deval) or not (Valued; Val) (Sal, Val, n=15; METH Val, n=14; Sal, Deval, n=16; METH, Deval, n=15). Lever presses/min on the last day of RI60 training are shown for comparison. (b) Ratio of responding during the 10-min extinction test to responding on the last day of instrumental training on the RI60 schedule of reinforcement in Sal- and METH-pretreated rats subject to LiCl-induced CTA (Deval) or not (Val). Data are expressed as mean ratio (extinction/RI60; ±SEM; Sal, Val, n=15; METH, Val, n=14; Sal, Deval, n=16; METH, Deval, n=15). *Significantly different from all other groups, p<0.05.

PowerPoint slide

DISCUSSION

The present study demonstrates apparently intact appetitive instrumental learning under two different types of reinforcement schedules in rats with METH-induced neurotoxicity. However, despite the apparent gross similarity of overall learning, further interrogation revealed that the associations underlying the learning were different in rats with such toxicity relative to normal controls. That is, METH-pretreated rats did not form S–R associations (ie, did not undergo transition to a more habitual form of behavior), as saline-pretreated rats did, when trained on an RI schedule of reinforcement. Rather, METH-pretreated rats continued to show A–O/goal-directed behavior, which appears to be intact in these animals, as their instrumental responding was sensitive to manipulations of outcome value and contingency degradation. These data suggest that METH-induced neurotoxicity disrupts the formation of S–R associations, and presumably, therefore, the formation of habitual behavior. To the extent that such automatization of behavior is critical for reducing cognitive demands and facilitating behavioral efficiency under high workload conditions (Redgrave et al, 2010), the present findings suggest that individuals with METH-induced neurotoxicity, and possibly others with partial monoamine loss, such as those with preclinical/early Parkinson's disease, are likely to exhibit impaired cognitive function, particularly under conditions of high cognitive demand.

An important implication of the present results is that METH-induced neurotoxicity may be associated with behavioral abnormalities that are not readily apparent based on a gross examination of behavior. That is, in the present study, instrumental learning by METH-pretreated rats under two different reinforcement schedules appears to be intact, yet examination of the mechanisms underlying the learning revealed that METH-pretreated rats do not learn or execute the behavior the same way as normal controls. Likewise, we have previously reported that METH-pretreated rats appear to perform normally on a motor response reversal learning task on a T-maze (ie, they take the same average number of trials to reach criterion), yet they no longer show the correlation between the expression of Arc (activity regulated, cytoskeletal-associated) mRNA in DM striatum and trials to criterion as observed in normal animals (Daberkow et al, 2007, 2008). These findings suggest that rats with METH-induced damage may be shifting cognitive strategies, and thus the brain regions being used to solve the task at hand. Preliminary data from our laboratory support the idea that rats with METH-induced neurotoxicity are, in fact, not using their dorsal striatum to solve the motor response reversal task (Pastuzyn and Keefe, 2009). Similarly, a functional imaging study of patients with early Parkinson's disease has shown that the performance of these Parkinson's disease patients on a striatally-based, procedural learning task is similar to that of normal controls, but that the patients show activation of the medial temporal lobe, whereas controls show activation of the caudate-putamen (Moody et al, 2004). Therefore, apparently intact behavioral performance does not necessarily mean that the learning or behavior is ‘normal’.

The aspect of METH-induced neurotoxicity that underlies the deficit in formation of S–R associations is not completely clear, as exposure to METH is associated not only with central DA and 5-HT depletions, but also cortical neuron injury (Yamamoto et al, 2010). Although we can not definitively exclude the possibility at present, it seems unlikely that METH-induced DA or 5-HT loss in hippocampus underlies the deficit, as earlier work suggests that manipulations of hippocampus either do not alter A–O learning (Reichelt et al, 2011), or render instrumental behavior insensitive to outcome devaluation (ie, render it S–R driven; Corbit and Balleine, 2000). Likewise, earlier work has suggested that lesions of prelimbic cortex impair acquisition of instrumental responding, and also impair the ability of the rats to use A–O associations to guide choice behavior, as evidenced by lack of selective effects of outcome devaluation and contingency degradation on instrumental performance during extinction trials (Balleine and Dickinson, 1998; Corbit and Balleine, 2003). Such patterns of sensitivity to manipulations of both outcome value and A–O contingency were not apparent in the METH-pretreated rats, suggesting that toxic effects of METH on hippocampus or prelimbic cortex are not likely responsible for the impaired formation of S–R associations observed in the METH-pretreated rats in the present work.

Alternatively, the disruptive effects of METH on S–R formation may be secondary to altered functions of infralimbic cortex or impairment of DL striatal function, particularly—but perhaps not exclusively—the DA innervation of DL striatum. With respect to infralimbic involvement, earlier work (Coutureau and Killcross, 2003; Killcross and Coutureau, 2003) has shown that lesions of infralimbic cortex render an otherwise S–R driven instrumental behavior sensitive to outcome devaluation, a pattern similar to that observed in the present work in METH-pretreated rats. Furthermore, the regimen of METH exposure used in the present work induces loss of 5-HT in infralimbic cortex (Hotchkiss and Gibb, 1980; Ricaurte et al, 1980; Keefe lab, unpublished data). Thus, METH-induced alterations in infralimbic function may underlie the observed shift in control of instrumental behavior from S–R to A–O associations.

Alternatively or in addition, disruption of DL striatal function may underlie the observed changes in instrumental control observed in METH-pretreated rats, as nigral DA and cortical glutamatergic afferents to DL striatum are heavily implicated in S–R associations underlying instrumental behaviors (Balleine and O’Doherty, 2010). Clearly, METH has significant impact on both aspects of this circuitry, as it causes damage to glutamatergic neurons in somatosensory cortex (Eisch et al, 1998b; Pu et al, 1996), as well as monoamine loss in the DL striatum, as confirmed in the present experiments. On the one hand, we do not think that the 5-HT loss in the striatum contributes to the deficits observed, because there is a significant correlation between DA levels in striatum and instrumental responding during extinction in rats trained on the RI schedule, whereas there is no such correlation between 5-HT content in striatum and the behavior. Additionally, a recent study showed that intrastriatal administration of an SSRI did not alter compulsive (likely habitual) instrumental responding in intact rats (Schilman et al, 2010). Thus, depletion of 5-HT in DL striatum does not seem to be a likely basis for the behavioral changes observed in METH-pretreated rats.

On the other hand, numerous studies indicate that DA signaling in striatum has a crucial role underlying instrumental conditioning (Darvas and Palmiter, 2009; Lovinger, 2010; Robinson et al, 2007; Schultz, 2007; Surmeier et al, 2007). Specifically, viral restoration of DA signaling to DL striatum restores instrumental conditioning to DA-deficient mice (Darvas and Palmiter, 2009; Robinson et al, 2007). Also, lesions of the nigrostriatal DA system induced by 6-OHDA result in an impaired habit formation (Faure et al, 2005; Robbins et al, 1990). Finally, impaired formation of habitual behavior is apparent in humans with early Parkinson's disease (Knowlton et al, 1996). Importantly, it seems likely that the neurotoxicity associated with the METH exposure is the cause of the presently observed behavioral changes, rather than the sensitizing effects of such a neurotoxic METH regimen (Itzhak et al, 2002), as prior work has shown that a sensitizing regimen of amphetamine actually enhances the formation of S–R associations, presumably by supporting increased DA release (Nelson and Killcross, 2006), an outcome opposite to that observed herein. To what extent it is striatal DA loss alone that contributes to the deficits vs the DA loss in conjunction with altered/diminished excitatory drive from damaged somatosensory cortex remains to be determined. Fortunately, there are manipulations that can be performed in future studies in an attempt to dissociate the role of these two aspects of METH-induced neurotoxicity on the formation of S–R associations underlying instrumental behavior (Eisch and Marshall, 1998a; O’Dell and Marshall, 2002). Taken together, these findings suggest that disruption of DA and/or glutamatergic input to DL striatum induced by METH likely contributes to the impaired formation of habitual behavior observed in the context of METH-induced neurotoxicity, consistent with impaired formation of habitual behavior in Parkinson's disease patient (Knowlton et al, 1996; Moody et al, 2010). Whether similar abnormal automatization of behavior is apparent in human METH addicts remains to be determined.

The preservation of instrumental learning and formation of A–O associations along with impaired formation of S–R associations may be secondary to the nature of the toxicity induced by the METH regimen. Although DA in both DM and DL striatum is partially depleted by such neurotoxic regimens of METH, the DA innervation of the nucleus accumbens (NAc) is relatively preserved (Haughey et al, 1999; Wallace et al, 1999). Theoretically, it has been proposed that DA signaling in DM striatum and NAc is involved in the acquisition of learning, whereas the signaling in DL striatum is important for general performance, particularly S–R/habitual behavior (Atallah et al, 2007; Graybiel, 2008; Sellings and Clarke, 2003). However, other data suggest that the NAc may have a more critical role in response-initiation or motivation to perform an action, or in aspects of Pavlovian rather than instrumental conditioning (Yin et al, 2008). An alternative basis for the preserved A–O performance may also be that the partial DA loss in DM striatum and its consequent effect on DA signaling may simply not be sufficient to disrupt the acquisition and expression of goal-directed actions thought to be mediated by DM striatum in the absence of impaired cortical input, as this METH regimen has not been reported to damage glutamatergic neurons in cortical regions projecting to DM striatum. Clearly, further research will be necessary to fully delineate which aspects of reward-based learning are impaired, and which are preserved in the context of METH-induced neurotoxicity.

In conclusion, the present data confirm the partial loss of monoamines induced by exposure of rodents to neurotoxic doses of METH, and extends those findings by demonstrating that despite apparently normal instrumental learning, rats with METH-induced neurotoxicity fail to develop normal S–R associations underlying such behavior. Rather, the behavior remains goal-driven (A–O). Thus, METH-induced disruption of nigrostriatal/corticostriatal circuitry may selectively impair the development of more habitual/automatized forms of behavior, and thus impair cognitive function. Such behavioral impairments may functionally limit the extent to which individuals with such damage can engage in cognitive behavioral therapies for management of drug addiction, as well as the extent to which they can utilize such cognitive processes in their daily lives, such as for tasks related to their employment or personal lives. Additional studies are needed to fully understand the cellular and molecular substrates of learning and memory processes in cortical-basal ganglia circuitry that are compromised by such neurotoxicity, and approaches to mitigate such effects.