Orbitofrontal cortex control of striatum leads economic decision-making

Gore, Felicity; Hernandez, Melissa; Ramakrishnan, Charu; Crow, Ailey K.; Malenka, Robert C.; Deisseroth, Karl

doi:10.1038/s41593-023-01409-1

Download PDF

Article
Open access
Published: 17 August 2023

Orbitofrontal cortex control of striatum leads economic decision-making

Nature Neuroscience volume 26, pages 1566–1574 (2023)Cite this article

14k Accesses
3 Citations
49 Altmetric
Metrics details

Subjects

Abstract

Animals must continually evaluate stimuli in their environment to decide which opportunities to pursue, and in many cases these decisions can be understood in fundamentally economic terms. Although several brain regions have been individually implicated in these processes, the brain-wide mechanisms relating these regions in decision-making are unclear. Using an economic decision-making task adapted for rats, we find that neural activity in both of two connected brain regions, the ventrolateral orbitofrontal cortex (OFC) and the dorsomedial striatum (DMS), was required for economic decision-making. Relevant neural activity in both brain regions was strikingly similar, dominated by the spatial features of the decision-making process. However, the neural encoding of choice direction in OFC preceded that of DMS, and this temporal relationship was strongly correlated with choice accuracy. Furthermore, activity specifically in the OFC projection to the DMS was required for appropriate economic decision-making. These results demonstrate that choice information in the OFC is relayed to the DMS to lead accurate economic decision-making.

Memorability shapes perceived time (and vice versa)

Article 22 April 2024

Perceptography unveils the causal contribution of inferior temporal cortex to visual perception

Article Open access 18 April 2024

Conjunctive encoding of exploratory intentions and spatial information in the hippocampus

Article Open access 15 April 2024

Main

Economic decision-making, the process of evaluating options in the environment to inform the best course of action, is critical for a wide range of behaviors essential for survival and well-being. To make optimal decisions, the neural representation of each option must be integrated with information about the type and scale of outcome it predicts to provide a representation of the subjective value of each alternative. Representations of subjective value can then be compared before engaging neural circuits that generate flexible behavioral responses^1,2,3.

Neural representations of subjective value have been identified in the orbitofrontal cortex (OFC)⁴, and electrical microstimulation of the OFC can bias choice behavior⁵. These results have supported a widespread hypothesis that the OFC has a role in economic decision-making^{1,6,7,8,9,10,11}. However, lesions and inactivation of the OFC yielded conflicting results on choice behavior^{12,13,14,15,16}. Furthermore, representations of subjective value exist in other brain regions including the medial prefrontal cortex¹⁷, dorsomedial striatum (DMS)¹⁸ and mediodorsal thalamus¹⁹; similar manipulations of each of these brain regions influence decision-making behavior^{20,21,22,23,24}. Thus, multiple brain regions may have important roles in economic decision-making; however, surprisingly little is known about if and how these brain regions may interact to mediate economic choices. One reason for this limited understanding is that most studies examining the neural correlates of value-based decision-making have been conducted in nonhuman primate systems, wherein tools are more restricted for recording and manipulating activity of precisely defined populations of neurons. To address this limitation, we adapted an economic decision-making task for rats, which permits recording and manipulation of neural activity in multiple defined neural populations while rats make economic decisions.

Results

Integration of reward quantity and quality information during economic decision-making

In the initial experiments, we developed and validated an economic decision-making task in rats (Fig. 1a). On each trial, rats were presented with two visual cues side by side. The type of stimulus (vertical or horizontal drifting gratings) indicated the identity of the associated reward (blackcurrant-flavored or lemon-flavored water), and the size of the visual stimulus indicated the size of the associated reward. After 2 s, the animals could perform a nosepoke to the side of the chosen cue to indicate choice, whereupon the chosen reward was delivered. We found that rats reliably chose visual stimuli that predicted larger volume rewards (Fig. 1b,c). In addition, animals displayed slower choice latencies on trials in which the difference in available reward volume was small (difficult trials), compared with trials in which the difference in available reward volume was large (easy trials) (Fig. 1d). To confirm that animals were making decisions based on the value of the stimuli presented, as opposed to simply detecting larger visual cues more reliably, we included a subset of animals in which the size of the visual stimulus was not positively correlated with the size of the reward it predicted. These animals still reliably chose stimuli that predicted larger volume rewards, indicating that animals used information about the available reward volume to make appropriate decisions (Extended Data Fig. 1b–e).

**Fig. 1: Rats integrate information about reward quantity and reward identity to make economic decisions.**

We next asked whether animals used information about reward identity, in addition to information about reward volume, to guide their decision-making. For each animal, we generated a preference score by calculating the difference in available reward (the number of drops of blackcurrant-flavored water − number of drops of lemon-flavored water) at which the animal was equally likely to choose the blackcurrant-predictive or lemon-predictive cue. We found that individual animals displayed modest preferences for either blackcurrant-predictive or lemon-predictive cues (Fig. 1e). Moreover, we found that the preference scores of individual animals were strongly correlated across consecutive sessions (Extended Data Fig. 1f) as well as across sessions separated by approximately 4 months (Fig. 1f). Thus, juice preferences were stable across both short and long timescales in individual animals. Animals therefore integrated individual (subjective) internal preferences regarding the available reward quality with externally accessible (objective) information about available reward quantity to make economic decisions.

Activity in both OFC and DMS is necessary for economic decision-making

Electrophysiological recordings identified several brain regions that appear to encode important features of economic decision-making tasks^4,17,18,19. To determine which brain regions were critical for this task, we performed an optogenetic inactivation screen. Specifically, after rats achieved criterion performance (Methods), we injected an adeno-associated virus (AAV) encoding the inhibitory stabilized step function opsin SwiChR++²⁵ under the control of the human synapsin promoter (AAV8 hSyn:SwiChR++EYFP) bilaterally into the OFC, DMS, mediodorsal thalamus or prelimbic cortex, and positioned optical fibers above each of these structures (Fig. 2a and Extended Data Fig. 2). When animals had reestablished criterion performance, we asked whether optical inhibition of each of these brain structures altered decision-making performance (Fig. 2b).

**Fig. 2: Activity in the OFC and DMS is important for economic decision-making.**

In accordance with previous work in mice¹⁶, optogenetic inhibition of the OFC impaired economic decision-making. We found that psychometric curves were flatter and latencies for easy choices were slower (Fig. 2c,d). In addition, we found that preferences computed on trials in which the OFC was inhibited were not correlated with preferences computed on trials in which the OFC was not inhibited, indicating that optogenetic inhibition of the OFC also disrupted juice preferences (Fig. 2e). Optogenetic inhibition of the DMS also impaired economic decision-making; psychometric curves were flatter and latencies for easy choices were slower, but choice preferences were unchanged (Fig. 2i–k), suggesting that decision-making based on reward volume was disrupted but juice preferences remained intact. In contrast, optogenetic inhibition of either the prelimbic cortex (Fig. 2f–h) or mediodorsal thalamus (Fig. 2l–n) had no discernible effect on economic decision-making. Decision-making was also unchanged in animals injected with control virus encoding enhanced yellow fluorescent protein (EYFP) and subjected to the same procedures (Extended Data Fig. 3a–d).

To determine whether the effects we observed were attributable to a specific deficit in economic decision-making or due to an unanticipated nonspecific effect of intervention (such as impaired visual perception, action execution or value recall), we placed the same animals into a control task, in which the choice component of the economic decision-making task was removed (Extended Data Fig. 3e). On uninhibited trials, animals were faster to respond to cues that predicted larger volume rewards, suggesting that animals could perceive the cues, remember their values and act accordingly. Importantly this relationship was maintained when either the OFC (Extended Data Fig. 3f) or DMS (Extended Data Fig. 3g) was inhibited. Thus, inhibition of the OFC or DMS impairs economic decision-making without impairing visual perception, action execution or the representation (or recollection) of cue value.

Choice-related activity in the OFC precedes choice-related activity in the DMS

To explore in more detail what function these brain areas might have in economic decision-making, we performed wireless extracellular electrophysiological recordings in the OFC and DMS in freely moving rats. A large proportion of task-modulated single units were identified among all the units resolved in both brain areas (OFC: 1,157 of 1,329 units, n = 6 rats; DMS: 524 of 656 units, n = 6 rats). In both regions, trial-averaged single-unit activity spanned the trial, and single units that were modulated by a range of task features were identified (Fig. 3a,b). We observed striking similarity in neural encoding in the OFC and DMS, with single-unit responses dominated by the spatial features of the task (size of the reward offered on the left, size of the reward offered on the right, and side chosen) in both brain areas (Fig. 3c and Extended Data Fig. 4a). Interestingly, in agreement with our inactivation data, we observed that despite a similar proportion of neurons encoding both the objective value (size) and subjective value of rewards predicted by cues presented on either side of the animal, neurons in the OFC were more strongly modulated by the subjective value of a stimulus than by its objective value, an effect that was not observed in the DMS (Extended Data Fig. 4b).

**Fig. 3: Activity in the OFC and DMS encodes spatial features of economic decision-making.**

To characterize the temporal dynamics of encoding between the OFC and DMS, we trained a linear support vector machine (SVM) to decode the choice the animal made on each trial (left or right) from neural activity data recorded in either the OFC or DMS (Fig. 3d). We were able to decode choice direction with high accuracy on held-out neural activity data from both brain regions. Importantly, across all animals, choice prediction peaked in the OFC before it peaked in the DMS (Fig. 3e and Extended Data Fig. 4c). We next examined how this temporal relationship related to choice accuracy. Cross-correlations of the predicted choice parameter (the perpendicular distance of the decoded decision value from the support vector, a proxy for decision confidence) computed on single trials revealed that the OFC led the DMS more on trials in which animals chose the larger available reward (‘correct’ trials) than on trials in which animals chose the smaller available reward (‘incorrect’ trials) (Fig. 3f,g; correct trials lag = −23.93 ± 22.86 ms (OFC leads), incorrect trials lag 44.82 ± 26.69 ms (DMS leads), n = 30 sessions from five rats). These data demonstrate that the encoding of choice-related information in the OFC precedes the encoding of choice-related information in the DMS, and that this relationship is correlated with choice accuracy.

To examine how information transmission between the OFC and DMS might be disrupted on error trials, we first asked whether an SVM trained on trials where animals chose the larger available reward (correct trials) could predict choice behavior on trials when animals chose the smaller available reward (incorrect trials). Strikingly, a model trained on data recorded from either the OFC or DMS on correct trials predicted the side the animal would choose equally well on correct and incorrect trials (Extended Data Fig. 5a), suggesting that both brain areas encode the chosen side with equivalent accuracy regardless of the correctness of the choice. We next examined the SVM predicted choice parameters computed on held-out trials where the animal made either correct or incorrect choices. As before, on correct trials we observed that the predicted choice parameter increased in the OFC before the DMS. However, when animals made an erroneous choice, we observed that despite the predicted choice parameter reaching similar levels as seen on correct trials, the predicted choice parameter did not increase in the OFC before the DMS (Extended Data Fig. 5b–d). Thus, while the transmission of spatial choice information from the OFC to the DMS is necessary to initiate appropriate value-based choice behavior, without this information choice might be initiated by other brain regions reflecting internal biases relating to habitual behavior.

Activity of the OFC projection to the DMS is necessary for economic decision-making

The temporal relationship between choice-related information in the OFC and DMS suggests that choices represented in the OFC could be relayed to the DMS to guide appropriate choice behavior. To address this hypothesis, we first examined the axonal projections from the OFC and confirmed the presence of a robust projection to the DMS²⁶ (Fig. 4a,b). We next specifically inhibited this direct projection by bilaterally injecting an AAV encoding a variant of the inhibitory halorhodopsin, which we optimized for axonal trafficking²⁷, under the control of the human synapsin promoter (AA8 hSyn:eNpHR3.0-NRN-EYFP) into the OFC (Fig. 4c,d). We positioned optical fibers bilaterally in either the DMS or mediodorsal thalamus, another major target of the OFC projections (Fig. 4b). We found that optogenetic inhibition of OFC inputs into the DMS selectively impaired decision-making related to reward volume: psychometric curves were flatter and choice latencies were disrupted (Fig. 4e,f), while preference scores were unchanged, indicating that inhibition of the OFC projection to the DMS did not disrupt juice preferences (Fig. 4g). In contrast, optogenetic inhibition of the OFC inputs to the mediodorsal thalamus had no effect on economic decision-making (Fig. 4h–j). In addition, optogenetic inhibition of the OFC projection to the DMS or mediodorsal thalamus had no effect on response latencies in the control task in which the choice component of the economic decision-making task was selectively eliminated, confirming that this manipulation did not impair visual perception, action execution or the representation (or recollection) of cue value (Extended Data Fig. 6a–c). Taken together, the data shown in this study indicate that information relayed directly from the OFC to the DMS is important for guiding economic decision-making.

**Fig. 4: Activity of the projection from the OFC to the DMS is necessary for economic decision-making.**

Discussion

Animals must constantly evaluate stimuli in their environment to guide appropriate approach and avoidance behaviors^1,2,3. To study how neural activity patterns across the brain may mediate these complex behaviors, we adapted an economic decision-making task for rats. Our experiments demonstrate that activity in the OFC and DMS, but surprisingly not in the prelimbic cortex or mediodorsal thalamus, is important for economic decision-making. Moreover, neural activity in both brain areas is dominated by spatial features of the economic decision-making task. Interestingly, we found that choice-related activity emerges in the OFC before the DMS, a relationship that correlates with choice accuracy. Finally, we found that activity of the direct connection from OFC to DMS is important for appropriate decision-making behavior. Taken together, these data suggest that spatial choice information is relayed from the OFC to the DMS to guide economic decision-making appropriate to the individual.

Several lines of previous evidence have supported a role for the OFC in economic decision-making^{1,4,5,6,7,8,9,10,11}; however, inactivation and lesion studies have yielded contradictory results^{12,13,14,15,16}. In this study, we leveraged the temporal resolution and enhanced the sensitivity of a designed inhibitory stabilized step function opsin²⁵ to inhibit OFC selectively during the cue evaluation period, when rats are making decisions. This optogenetic strategy avoided prolonged tissue heating (which could modulate neural activity directly) and prevented OFC disruption during choice execution and reward consumption (which could have other influences on decision-making behavior^28,29). In addition, we used a new training paradigm in which exposure to pairs of cues was limited to the testing context, so that animals would be unlikely to develop unnatural habitual responses to specific cue combinations (a phenomenon that could underlie the negative results observed in some previous studies^12,13,14). This training paradigm resulted in precise psychometric curve functions that allowed us to detect subtle impairments in economic decision-making. Finally, we demonstrated that activity in the OFC was not necessary for performance of a control task in which the choice component was selectively removed. This experiment excluded the possibility that effects were driven by sensory, motor or motivational deficits induced by optical inhibition. Taken together, these data revealed that OFC inhibition—restricted to the cue evaluation period—specifically and potently impaired economic decision-making appropriate to individual preference.

The OFC has been proposed to function as a cognitive map of the world, that is, an internal model of the associative and predictive relationships present in the environment^{30,31,32,33,34}. This hypothesis could unify several contrasting observations regarding the role of the OFC in distinct tasks, in which the OFC appears to be specifically required when individuals must use multiple categories of established knowledge to guide behavior in new scenarios^35,36,37. Consistent with this hypothesis, we found that OFC activity is necessary when animals must choose between differently valued options, only previously experienced in isolation. Importantly, we observed that OFC inhibition does not appear to preclude the ability to access value information; for example, OFC-inhibited animals still respond more rapidly to cues that predict larger-magnitude rewards in a single-cue control condition. Notably, this is also a task the animals had never seen before.

These data therefore suggest that OFC activity (and associated cognitive maps) is specifically recruited when animals must resolve motivational conflict to guide new decision-making. It should be noted that the OFC is a large, heterogenous structure consisting of the medial, ventral, ventrolateral, lateral and dorsolateral orbital areas^33,38. In this study, we specifically targeted the ventrolateral orbital area due to its reported role in supporting flexible behavior^39,40,41,42. In the future, it will be important to determine how these results compare to inactivation of other orbitofrontal subregions and how future results relate to established differences in anatomical connectivity across mediolateral and anterior-posterior gradients^33,38.

In contrast to previous observations of nonhuman primates making economic decisions^1,6, which have consistently demonstrated that task variables are represented in the OFC in goods (that is, resource) space, our data suggest that the rodent OFC has a critical role in making decisions in action space^16,43. Consistent with this idea, we observed that decision-related variables are represented in the rat OFC in a spatially mapped manner. Moreover, although optogenetic inhibition of the OFC did not influence behavior in animals presented with a single sensory cue eliciting a single action in the control task, optogenetic inhibition profoundly impaired behavior when animals were presented with the same single cue to guide decision-making between two different actions in the choice task (for example, three drops of blackcurrant juice reward versus no reward). Taken together, these data suggest that OFC activity in rodents is specifically recruited when animals must make choices between differently valued actions. Moving forward, it will be important to determine whether this reflects a fundamental difference in processing across species or is due to the different demands of the specific tasks used⁴⁴ (for example, the freely moving task used in this study might necessitate a more detailed representation of the spatial environment than the head-restrained tasks that have typically been used in nonhuman primates).

In contrast to the role of the OFC itself, the role of OFC outputs to other brain regions in value-based decision-making has been less comprehensively characterized. Previous studies showed that OFC projections to the ventral tegmental area can mediate aspects of appropriate credit assignment⁴⁵, projections to basolateral amygdala from lateral or medial OFC can mediate encoding and retrieval of values respectively^46,47, and OFC projections to both the dorsal and ventral striatum are important for using outcomes to update the value of specific actions^{42,48,49,50,51,52,53}. In this study, we expanded on this work and showed for the first time that the direct transmission of choice information from the OFC to the DMS, a region implicated in the generation of goal-directed actions^{21,23,42,49,54,55,56,57,58,59,60}, is important for the evaluation of different reward options before any outcome is delivered. Moreover, by demonstrating that activity of the same projection is not required for performance of a control task in which we selectively removed motivational conflict, we confirm that this deficit in decision-making behavior is not due to a general failure to recall outcomes that specific cues predict⁵⁰.

Surprisingly, while inhibition of the OFC disrupts choices based on both reward size (objective value) and reward type (subjective value), inhibition of either the DMS or the projection from the OFC to DMS only disrupts choices based on reward size (objective value). In addition, we found that neurons in the OFC are more strongly modulated by subjective value than objective value, an effect that is not observed in the DMS. These data suggest that an additional pathway out of the OFC may also contribute to decision-making about different types of reward. In the future, it will be important to identify how distinct OFC projections function in concert to support different components of decision-making. Taken together, these data provide new insight into how choices encoded in the OFC engage downstream neural circuits to generate appropriate behavioral responses.

Economic decision-making requires animals to compare the subjective value of sensory stimuli to guide appropriate behavior. To achieve this goal, sensory representations must be imbued with subjective value information, compared and used to engage neural circuits that generate appropriate behavioral responses. In this study we report that the projection from the OFC to the DMS ultimately connects sensory representations to appropriate behavioral output, to implement accurate economic decisions. Thus, the OFC projection to the DMS provides a critical anatomical substrate through which cortical representations exert dynamic control over ongoing behavior.

Methods

Experimental procedures were approved by the Stanford University Institutional Animal Care and Use Committee and by the Administrative Panel on Laboratory Animal Care (protocol no. 32908), according to the National Institutes of Health (NIH) guidelines for the care and use of laboratory animals.

Experimental animals and stereotactic surgery

Adult (10–12 weeks) male and female Long–Evans rats (Charles River Laboratories) were group-housed until surgery. Rats were randomly assigned to different experimental groups. Animals were anesthetized with isoflurane (1–5%, Henry Schein) and placed into a stereotactic frame (Kopf Instruments). Bone screws (Stoelting Co.) were inserted. For the optogenetic experiments, microinjection needles (WPI) were then inserted (coordinates from bregma: OFC +4 anteroposterior, ±2 mediolateral, −3 dorsoventral; prelimbic cortex +2.5 anteroposterior, ±0.5 mediolateral, −3.5 dorsoventral; DMS +1 anteroposterior, ±2.5 mediolateral, −4 dorsoventral; mediodorsal thalamus −2.8 anteroposterior, ±0.8 mediolateral, −5 dorsoventral; note that the dorsoventral coordinates reflect the distance from the brain surface) and each structure was injected with virus at a speed of 0.1 μl min. A 200-μm diameter optical fiber (Thorlabs) was placed 250 μm above the target sites and fixed in place using dental cement (RelyX, 3M). For the electrophysiological recordings, 64-channel silicon probes (Cambridge NeuroTech) were mounted on a microdrive and lowered to 500 μm above the site of interest. Craniotomies were sealed with Dura-Gel and microdrives were fixed in placed using dental cement. Molex connectors were attached to a wireless headstage (White Matter LLC), which was affixed to the skull with dental cement. Probes were lowered to the recording site 2 days before recordings. Buprenorphine SR (1 mg kg⁻¹) was administered. As an exclusion criterion, we only included rats with viral expression confined to the site of interest and fiber placement above the target site. (This resulted in the exclusion of one animal.) All experiments were conducted according to approved protocols at Stanford University.

Rat behavior

Water scheduled rats (1 h of water per day) were placed into a custom operant chamber equipped with three nosepoke portals mounted on a screen. The center portal was equipped with a lick spout for reward delivery. Entries into each nosepoke portal were detected by the breakage of an infrared beam and licks were detected using a capacitive touch sensor. (This was omitted for the electrophysiological recordings.) All events were controlled and recorded using custom MATLAB code using the MATLAB Support Package for Arduino and the Psychophysics Toolbox v.3 (ref. ⁶¹). For training, animals were placed into the operant chamber. One second after entering the center portal they were presented with a visual cue on one side of the center portal. The type of cue (vertical or horizontal drifting gratings) indicated the type of reward associated with the cue (zero calorie blackcurrant-flavored or lemon-flavored water); the number of squares that included the cue indicated the size of reward associated with the cue. Lemon-predictive and blackcurrant-predictive cues could be presented on either the left or right side of the animal, randomized for each trial. After 2 s, animals had to perform a nosepoke to the side the cue was presented to obtain the corresponding reward. Reward was delivered in the center portal. Reward collection was followed by a variable intertrial interval (ITI) of 5–10 s. If animals responded to the wrong side, no reward was delivered and the screen turned white for a 10-s time-out period. This taught animals to move to the side of the cue to indicate the response and to reinforce contingency. Trials in which animals took more than 12 s to indicate a response, and trials in which the animal took more than 5 s to collect the reward, were excluded.

When animals had achieved criterion performance (> 90% accuracy and response latency inversely proportional to reward magnitude on three consecutive sessions; each stimulus was comparably learned as shown in Extended Data Fig. 1a), they were placed into a full choice session. Animals were placed into the operant chamber; 1 s after entering the center portal, animals were presented with two visual cues side by side. Lemon-predictive or blackcurrant-predictive cues could be presented on either the left or right side of the animal, randomized for each trial. After 2 s, animals had to move to the side of the chosen cue to indicate their choice, and the chosen reward was delivered in the center portal. Reward collection was followed by a variable ITI of 5–10 s. Trials in which animals took more than 12 s to indicate choice, and trials in which animals took more than 5 s to collect the reward, were excluded. If animals performed at more than 75% accuracy (as animals made choices primarily to maximize the total volume of liquid consumed, accuracy was defined as the proportion of trials wherein animals selected the larger available reward), the following day animals were placed into another full choice session (for a maximum of three consecutive full choice sessions). For the choice sessions, a total of 15 cue combinations were used; each session was terminated after 600 trials or after 2.5 h, whichever came first. Otherwise, animals were placed back into training sessions until reachieving criterion performance. Summary data are presented as a composite of three consecutive full choice sessions per rat. Behavioral data were fitted by probit regression using the glmfit function in MATLAB. Preference scores were computed by calculating the difference in available reward (number of drops of blackcurrant − number of drops of lemon) for which the animal was equally likely to choose a blackcurrant-predictive or lemon-predictive cue. Long-term preference comparisons were between the preference score from the final three consecutive full choice sessions before a 4-month university shutdown, and the preference score from the first three consecutive full choice sessions after the 4-month shutdown. Comparisons of short-term preferences were performed on preference scores from each of three consecutive sessions. Latency to choice was calculated by finding the mean latency from the end of the mandatory 2-s cue presentation period, to the time at which the animal made its nosepoke response for each trial type. For each animal, we then subtracted the trial type with the fastest mean response time from all other trial types to obtain a relative latency to choice.

We carried out control behavior to account for the nonspecific effects of optical inhibition. Animals were placed into an operant chamber equipped with two nosepoke portals mounted on a screen; the left portal was equipped with a lick spout for reward delivery. One second after entering the left portal, animals were presented with a single visual cue in the center of the portal. (The same visual cues were also used for training and the full choice task.) After 2 s, animals had to perform a nosepoke in the second portal to indicate response. Reward was delivered in the left portal. Reward collection was followed by a variable ITI of 5–10 s. Trials in which animals took more than 12 s to indicate response, and trials in which the animal took more than 5 s to collect the reward, were excluded.

Optogenetic inhibition

Rats were placed into the operant chamber and a top-branch with a 200-μm diameter fiber-optic patch cord (Doric) coupled to either a 473 nm (Omicron) and 635 nm (CNI), or a 594 nm (Cobalt), laser setup outside of the operant chamber connected to the implanted optical fibers. Immediately beforehand, power output from the patch cord was adjusted to 8 mW (473 nm), 5 mW (635 nm) or 10 mW (594 nm). Animals received randomly interleaved presentations of inhibited and uninhibited trials. On the SwiChR++ inhibition trials, 1 s of 473-nm light stimulation to initiate inhibition was delivered when the visual stimuli were presented; 1 s of 635 nm light stimulation to relieve inhibition was delivered when the animal exited the center portal to indicate its choice. On the halorhodopsin inhibition trials, 594-nm light stimulation was initiated when the visual stimuli were presented and terminated when the animal exited the center portal to indicate choice.

Chronic electrophysiology

Animals were implanted with 64-channel silicon probes over the right DMS and right OFC. On the day of implantation, electrodes were lowered to 500 μm above the site of interest. Animals were allowed to recover for 2–3 weeks before behavioral training was resumed. Microdrives were lowered by 250 μm 2 days before each recording session. Electrophysiological data were acquired at 20 kHz using a wireless acquisition system (White Matter LLC). Recordings were made in freely moving rats, which may impact the degree of lateralization of the neural responses observed. Behavioral time stamps were acquired at 30 kHz using an Open Ephys acquisition system. Clocks were synchronized by sending a signal on every Open Ephys sample to the White Matter LLC acquisition system.

Acute electrophysiology

Animals expressing SwiChR++ in the OFC were anesthetized with isoflurane and placed into a stereotactic frame. A craniotomy was placed over the OFC and a custom optrode (200-μm fiber cemented onto a silicone probe) was inserted into the region of the infected cells. Recordings were made using an Open Ephys acquisition system applying a bandpass filter from 300 to 6,000 Hz to the voltage signal. A 1-s pulse of blue light (473 nm, 8 mW) was delivered to initiate inhibition and a 1-s pulse of red light (635 nm, 5 mW) was delivered to alleviate inhibition 4 s later. Laser timing was controlled by a Master-8 pulse generator (AMPI).

Electrophysiology data analysis

Spikes were sorted using Kilosort2 and were manually curated using Phy2 (ref. ⁶²). Units with less than 1% inter-spike intervals shorter than 2 ms were considered single units for the analysis purposes. Spike counts were binned in 50-ms bins, stepped at 25-ms increments and converted into a z-scored firing rate across the whole session. Z-scored firing rates were aligned to task events (cue presentation, choice nosepoke and reward delivery) and the mean firing rate in the 500 ms before cue presentation was subtracted on a per trial basis. Task-modulated units were identified based on a Wilcoxon rank-sum test of the mean firing rate within the 500-ms baseline and ten 500-ms epochs spanning the trial starting at cue onset. A cell was deemed task-modulated if any of the task epochs differed significantly from baseline after false discovery rate correction, with a corrected significance threshold of P < 0.001. For each neural response, we performed a linear regression against each of a set of ten predefined variables (separately). For subjective value regression, preference scores were calculated for each session by finding the difference in the available reward at which the animal was equally likely to choose blackcurrant and lemonade. This score was then added to the volume of lemonade available on each trial to generate subjective value predictors. Units were deemed modulated by the variable if the regression slope differed significantly from zero (correct significance threshold of P < 0.001).

Decoding analysis was performed using a fourfold cross-validated linear SVM⁶³. Classification accuracy was calculated as the fraction of correct predictions made on held-out data averaged across four cross-validation splits, repeated five times. For the single-trial analysis, predicted choice parameters were computed as the perpendicular distance of decision value from the support vector at each time point, repeated across four cross-validation splits. Cross-correlations of the predicted choice parameters were calculated in the 3 s surrounding the choice nosepoke and averaged across 20 decoding repeats per session. Single-trial predicted choice parameters were smoothed with a 50-ms Gaussian filter for analysis and a 250-ms Gaussian filter for visualization. For the latency analysis, an arbitrary threshold of 0.33 was set. For all decoding analysis, the numbers of units across brain areas were matched to the size of the smallest recorded population.

Histological processing and analysis

Rats were euthanized by transcardiac perfusion with 150 ml PBS, followed by 100 ml 4% paraformaldehyde. Brains were extracted and 100-μm sections were cut on a vibratome. Slices were labeled with goat anti-GFP (1:1,000, Abcam) primary antibody and Alexa Fluor 488 donkey anti-goat (1:1,000, Invitrogen). For the axon tracing studies, a microinjection needle was inserted into the brain (coordinates from bregma: +4 anteroposterior, ±2 mediolateral, −3 dorsoventral) and 0.5 μl AAV8 hSyn:oScarlet was injected into the OFC at a speed of 0.1 μl min⁻¹. At least 8 weeks later, brains were prepared for histology and axonal projections were quantified as described previously⁶⁴. Briefly, 100-μm coronal slices were imaged on a confocal microscope (ZEISS, Zen software) using a ×20 objective and the resultant images were processed in ImageJ for quantification. Briefly, the injection site was first manually removed and background was subtracted. Threshold was set to ×4 the mean of the local background and pixels above this threshold were interpreted as positive signal from the OFC axons. Region of interest (ROI) boundaries were manually defined based on 4,6-diamidino-2-phenylindole staining and the Paxinos and Watson rat brain atlas⁶⁵. Axon density was calculated as the percentage of total ROIs containing pixels above the threshold. Three sections per ROI were analyzed and those values were averaged to calculate a single value per ROI per rat. This approach cannot distinguish between axon terminal and fibers of passage.

Whole-brain clearing

Adult Long–Evans rats were anesthetized with isoflurane and placed into a stereotactic frame. A microinjection needle was inserted into the brain (coordinates from bregma: +4 anteroposterior, ±2 mediolateral, −3 dorsoventral) and 0.5 μl AAV8 hSyn:oScarlet was injected into the OFC at a speed of 0.1 μl min⁻¹. Eight weeks later, brains prepared for imaging using SHIELD⁶⁶. Briefly, rats were euthanized by transcardial perfusion with 150 ml PBS, followed by 100 ml 4% paraformaldehyde, followed by 50 ml 12% epoxide SHIELD perfusion solution. Brains were extracted and incubated in SHIELD perfusion solution at 4 °C for 48 h. Brains were removed from SHIELD perfusion solution and transferred to SHIELD OFF solution and incubated at 4C for 48 h. Brains were then transferred to SHIELD ON solution and incubated at 37 °C for 24 h. After completion of the SHIELD reaction, brains were transferred to SDS clearing solution and cleared passively at 37 °C for 7 days, before being transferred to a SmartClear system for active clearing for 10–14 days. When brains were clear, they were washed in 0.1% PBS with Tween 20 at 37 °C for 3 days, before being equilibrated in exPROTOS at room temperature for 2 days and imaged using a COLM light sheet microscope^67,68.

Statistics and reproducibility

Data are presented as the mean ± s.e.m. unless otherwise indicated. Raw data were tested for normality of distribution; statistical analyses were performed using Student’s t-test, Wilcoxon signed-rank test or ANOVA with Bonferroni correction for multiple comparisons. Statistical analyses were performed in Prism (GraphPad Software) and MATLAB (MathWorks). No statistical method was used to predetermine sample size, but sample sizes were based on previous studies⁶⁹. For practical reasons, data collection and analysis could not be performed blind to the conditions of the experiments (for example, because of obviously different positions of the fibers), but data were collected and analyzed in an automated manner to prevent experimenter bias.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All primary data for the figures and extended data figures are available from the corresponding author (K.D.) upon request.

Code availability

The code used for data processing and analysis is available from the corresponding author (K.D.) upon request.

References

Padoa-Schioppa, C. Neurobiology of economic choice: a good-based model. Annu. Rev. Neurosci. 34, 333–359 (2011).
Article CAS PubMed PubMed Central Google Scholar
Pearson, J. M., Watson, K. K. & Platt, M. L. Decision-making: the neuroethological turn. Neuron 82, 950–965 (2014).
Article CAS PubMed PubMed Central Google Scholar
Sugrue, L. P., Corrado, G. S. & Newsome, W. T. Choosing the greater of two goods: neural currencies for valuation and decision-making. Nat. Rev. Neurosci. 6, 363–375 (2005).
Article CAS PubMed Google Scholar
Padoa-Schioppa, C. & Assad, J. A. Neurons in the orbitofrontal cortex encode economic value. Nature 441, 223–226 (2006).
Article CAS PubMed PubMed Central Google Scholar
Ballesta, S., Shi, W., Conen, K. E. & Padoa-Schioppa, C. Values encoded in orbitofrontal cortex are causally related to economic choices. Nature 588, 450–453 (2020).
Article CAS PubMed PubMed Central Google Scholar
Fellows, L. K. Orbitofrontal contributions to value-based decision-making: evidence from humans with frontal lobe damage. Ann. N. Y. Acad. Sci. 2139, 51–58 (2011).
Article Google Scholar
Kable, J. W. & Glimcher, P. W. The neurobiology of decision: consensus and controversy. Neuron 63, 733–745 (2009).
Article CAS PubMed PubMed Central Google Scholar
Levy, D. J. & Glimcher, P. W. The root of all value: a common currency for choice. Curr. Opin. Neurobiol. 22, 1027–1038 (2012).
Article CAS PubMed PubMed Central Google Scholar
Murray, E. A. & Rudebeck, P. H. Specializations for reward-guided decision-making in the primate ventral prefrontal cortex. Nat. Neurosci. 19, 404–417 (2018).
Article CAS Google Scholar
Shadlen, M. N. & Shohamy, D. Decision-making and sequential sampling from memory. Neuron 90, 927–939 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wallis, J. D. Cross-species studies of orbitofrontal cortex and value-based decision-making. Nat. Neurosci. 15, 13–19 (2011).
Article PubMed PubMed Central Google Scholar
Gardner, M. P. H., Conroy, J. S., Shaham, M. H., Styer, C. V. & Schoenbaum, G. Lateral orbitofrontal inactivation dissociates devaluation-sensitive behavior and economic choice. Neuron 96, 1192–1203 (2017).
Article CAS PubMed PubMed Central Google Scholar
Gardner, M. P. H., Conroy, J. C., Sanchez, D. C., Zhou, J. & Schoenbaum, G. Real-time value integration during economic choice is regulated by orbitofrontal cortex. Curr. Biol. 29, 4315–4322 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gardner, M. P. H. et al. Processing in lateral orbitofrontal cortex is required to estimate subjective preference during initial, but not established, economic choice. Neuron 108, 526–537 (2020).
Article CAS PubMed PubMed Central Google Scholar
Izquierdo, A., Suda, R. K. & Murray, E. A. Bilateral orbital prefrontal cortex lesions in rhesus monkeys disrupt choices by both reward value and reward contingency. J. Neurosci. 24, 7540–7548 (2004).
Article CAS PubMed PubMed Central Google Scholar
Kuwabara, M., Kang, N., Holy, T. E. & Padoa-Schioppa, C. Neural mechanisms of economic choices in mice. eLife 9, e49669 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kable, J. W. & Glimcher, P. W. The neural correlates of subjective value during intertemporal choice. Nat. Neurosci. 10, 1625–1633 (2007).
Article CAS PubMed PubMed Central Google Scholar
Cai, X., Kim, S. & Lee, D. Heterogeneous coding of temporally discounted values in the dorsal and ventral striatum during intertemporal choice. Neuron 69, 170–182 (2011).
Article CAS PubMed PubMed Central Google Scholar
Orona, E. & Gabriel, M. Multiple-unit activity of the prefrontal cortex and mediodorsal thalamic nucleus during reversal learning of discriminative avoidance behavior in rabbits. Brain Res. 263, 313–329 (1983).
Article CAS PubMed Google Scholar
Verharen, J. P. H., den Ouden, H. E. M., Adan, R. A. H. & Vandershchuren, L. J. M. J. Modulation of value-based decision-making behavior by subregions of the rat prefrontal cortex. Psychopharmacology 237, 1267–1280 (2020).
Article CAS PubMed PubMed Central Google Scholar
Friedman, A. et al. A corticostriatal path targeting striosomes controls decision-making under conflict. Cell 161, 1320–1333 (2015).
Article CAS PubMed PubMed Central Google Scholar
Schumacher, J. D., van Holstein, M., Bagrodia, V., Le Bouder, H. B. & Floresco, S. B. Dorsomedial striatal contributions to different forms of risk/reward decision making. Neurobiol. Learn. Mem. 178, 107369 (2021).
Article CAS PubMed Google Scholar
Yin, H. H., Ostlund, S. B., Knowlton, B. J. & Balleine, B. W. The role of the dorsomedial striatum in instrumental conditioning. Eur. J. Neurosci. 22, 513–523 (2005).
Article PubMed Google Scholar
Chakraborty, S., Kolling, N., Walton, M. E. & Mitchell, A. S. Critical role for the mediodorsal thalamus in permitting rapid reward-guided updating stochastic reward environments. eLife 5, e13588 (2016).
Article PubMed PubMed Central Google Scholar
Berndt, A. et al. Structural foundations of optogenetics: determinants of channelrhodopsin ion selectivity. Proc. Natl Acad. Sci. USA 26, 822–829 (2016).
Article Google Scholar
Hoover, W. B. & Vertes, R. P. Projections of the medial orbital and ventral orbital cortex in the rat. J. Comp. Neurol. 519, 3766–3801 (2011).
Article PubMed Google Scholar
Ye, L. et al. Wiring and molecular features of prefrontal ensembles representing distinct experiences. Cell 165, 1776–1788 (2016).
Article CAS PubMed PubMed Central Google Scholar
Constantinople, C. M. et al. Lateral orbitofrontal cortex promotes trial-by-trial learning of risky, but not spatial biases. eLife 8, e49744 (2019).
Article PubMed PubMed Central Google Scholar
Jennings, J. H. et al. Interacting neural ensembles in the orbitofrontal cortex for social and feeding behaviour. Nature 565, 645–649 (2019).
Article CAS PubMed PubMed Central Google Scholar
Stalnaker, T. A., Cooch, N. K. & Schoenbaum, G. What the orbitofrontal cortex does not do. Nat. Neurosci. 18, 620–627 (2015).
Article CAS PubMed PubMed Central Google Scholar
Schuck, N. W., Cai, M. B., Wilson, R. C. & Niv, Y. Human orbitofrontal cortex represents a cognitive map of state space. Neuron 91, 1402–1412 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wilson, R. C., Takahashi, Y. K., Schoenbaum, G. & Niv, Y. Orbitofrontal cortex as a cognitive map of task space. Neuron 81, 267–279 (2014).
Article CAS PubMed PubMed Central Google Scholar
Bradfield, L. A. & Hart, G. Rodent medial and lateral orbitofrontal cortices represent unique components of cognitive maps of task space. Neurosci. Biobehav. Rev. 108, 287–294 (2020).
Article PubMed Google Scholar
Behrens, T. E. J. et al. What is a cognitive map? Organizing knowledge for flexible behavior. Neuron 100, 490–509 (2018).
Article CAS PubMed Google Scholar
Jones, J. J. et al. Orbitofrontal cortex supports behavior and learning using inferred but not cached values. Science 338, 2743–2770 (2012).
Article Google Scholar
Stalnaker, T. A. et al. Orbitofrontal neurons infer the value and identity of predicted outcomes. Nat. Commun. 5, 3926 (2014).
Article CAS PubMed Google Scholar
Vertechi, P. et al. Inference-based decisions in a hidden state foraging task: differential contributions of prefrontal cortical areas. Neuron 106, 166–176 (2020).
Article CAS PubMed PubMed Central Google Scholar
Izquierdo, A. Functional heterogeneity within rat orbitofrontal cortex in reward learning and decision making. J. Neurosci. 37, 10529–10540 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chase, E. A., Tait, D. S. & Brown, V. J. Lesions of the orbital prefrontal cortex impair the formation of attentional set in rats. Eur. J. Neurosci. 36, 2368–2375 (2012).
Article PubMed Google Scholar
Meyer, H. C. & Bucci, D. J. Imbalanced activity in the orbitofrontal cortex and nucleus accumbens impairs behavioral inhibition. Curr. Biol. 26, 2834–2839 (2016).
Article CAS PubMed PubMed Central Google Scholar
Parkes, S. L. et al. Insular and ventrolateral orbitofrontal cortices differentially contribute to goal-directed behavior in rodents. Cereb. Cortex 28, 2313–2325 (2018).
Article PubMed Google Scholar
Gremel, C. M. & Costa, R. M. Orbitofrontal and striatal circuits dynamically encode the shift between goal-directed and habitual actions. Nat. Commun. 4, 2264 (2013).
Article PubMed Google Scholar
Feierstein, C. E., Quirk, M. C., Uchida, N., Sosulski, D. L. & Mainen, Z. F. Representation of spatial goals in rat orbitofrontal cortex. Neuron 51, 495–507 (2006).
Article CAS PubMed Google Scholar
Yoo, S. B. M., Sleezer, B. J. & Hayden, B. Y. Robust encoding of spatial information in the orbitofrontal cortex and striatum. J. Cogn. Neurosci. 30, 898–913 (2018).
Article PubMed Google Scholar
Namboodiri, V. M. K. et al. Single-cell activity tracking reveals that orbitofrontal neurons acquire and maintain a long-term memory to guide behavioral adaptation. Nat. Neurosci. 22, 1110–1121 (2019).
Article CAS PubMed PubMed Central Google Scholar
Malvaez, M., Shieh, C., Murphy, M. D., Greenfield, V. Y. & Wassum, K. M. Distinct cortical-amygdala projections drive reward value encoding and retrieval. Nat. Neurosci. 22, 762–769 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lichtenberg, N. et al. The medial orbitofrontal cortex-basolateral amygdala circuit regulates the influence of reward cues on adaptive behavior and choice. J. Neurosci. 41, 7267–7277 (2021).
Article CAS PubMed PubMed Central Google Scholar
Groman, S. M. et al. Orbitofrontal circuits control multiple reinforcement-learning processes. Neuron 103, 734–746 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gremel, C. M. et al. Endocannabinoid modulation of orbitostriatal circuits gates habit formation. Neuron 90, 1312–1324 (2016).
Article CAS PubMed PubMed Central Google Scholar
Li, D. C. et al. A molecularly integrated amygdalo-fronto-striatal network coordinates flexible learning and memory. Nat. Neurosci. 25, 1213–1224 (2022).
Article CAS PubMed Google Scholar
Jenni, N. L., Rutledge, G. & Floresco, S. B. Distinct medial orbitofrontal-striatal circuits support dissociable component processes of risk/reward decision-making. J. Neurosci. 42, 2743–2755 (2022).
Article CAS PubMed PubMed Central Google Scholar
Hirokawa, J., Vaughan, A., Masset, P., Ott, T. & Kepecs, A. Frontal cortex neuron types categorically encode single decision variables. Nature 576, 446–451 (2019).
Article CAS PubMed PubMed Central Google Scholar
Pascoli, V. et al. Stochastic synaptic plasticity underlying compulsion in a model of addiction. Nature 564, 366–371 (2018).
Article CAS PubMed Google Scholar
Bissonette, G. B. & Roesch, M. R. Rule encoding in dorsal striatum impacts action selection. Eur. J. Neurosci. 42, 2555–2567 (2015).
Article PubMed PubMed Central Google Scholar
Cromwell, H. C. & Schultz, W. Effects of expectations for different reward magnitudes on neuronal activity in the primate striatum. J. Neurophysiol. 22, 2823–2838 (2003).
Article Google Scholar
Cui, G. et al. Concurrent activation of striatal direct and indirect pathways during action initiation. Nature 494, 238–242 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hikosaka, O., Sakamoto, M. & Usui, S. Functional properties of monkey caudate neurons. III. Activities related to expectation of target and reward. J. Neurophysiol. 61, 814–832 (1989).
Article CAS PubMed Google Scholar
Hollerman, J., Tremblay, L. & Schultz, W. Influence of reward expectation of behavior-related neuronal activity in the primate striatum. J. Neurophysiol. 80, 947–963 (1998).
Article CAS PubMed Google Scholar
Santacruz, S. R., Rich, E. L., Wallis, J. D. & Carmena, J. M. Caudate microstimulation increases value of specific choices. Curr. Biol. 27, 3375–3383 (2017).
Article CAS PubMed PubMed Central Google Scholar
Tai, L.-H., Moses Lee, A., Benavidez, N., Bonci, A. & Wilbrecht, L. Transient stimulation of distinct subpopulations of striatal neurons mimics changes in action value. Nat. Neurosci. 15, 1281–1289 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kleiner, M. et al. What’s new in psychtoolbox-3. Perception 36, 1–16 (2007).
Google Scholar
Stringer, C., Pachitariu, M., Steinmetz, N., Carandini, M. & Harris, K. D. High-dimensional geometry of population responses in visual cortex. Nature 571, 361–365 (2019).
Article CAS PubMed PubMed Central Google Scholar
Meyers, E. M. The neural decoding toolbox. Front. Neuroinform. 7, 8 (2013).
Article PubMed PubMed Central Google Scholar
Beier, K. T. et al. Circuit architecture of VTA dopamine neurons revealed by systematic input-output mapping. Cell 162, 622–634 (2015).
Article CAS PubMed PubMed Central Google Scholar
Paxinos, G. & Watson, C. The Rat Brain in Stereotactic Coordinates (Academic Press, 2007).
Google Scholar
Park, Y. G. et al. Protection of tissue physicochemical properties using polyfunctional crosslinkers. Nat. Biotechnol. 37, 73–83 (2019).
Article CAS Google Scholar
Tomer, R., Ye, L., Hsueh, B. & Deisseroth, K. Advanced CLARITY for rapid and high-resolution imaging of intact tissues. Nat. Protoc. 9, 1682–1697 (2014).
Article CAS PubMed PubMed Central Google Scholar
Lerner, T. N. et al. Intact-brain analyses reveal distinct information carried by SNc dopamine subcircuits. Cell 162, 635–647 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zalocusky, K. A. et al. Nucleus accumbens D2R cells signal prior outcomes and control risky decision-making. Nature 531, 642–646 (2016).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank T. Machado, J. Baruni and members of the Deisseroth and Malenka laboratories for helpful discussions. This work was supported by grants from the UCSF Dolby Family Center for Mood Disorders (to R.C.M.), the NIH (no. P50DA042012 to K.D. and R.C.M.; no. K99DA050662 to F.G.), National Science Foundation, Gatsby, Fresenius, Wiegers, Grosfeld and NOMIS Foundations (to K.D.), a Walter V. and Idun Berry award and a Brain & Behavior Research Foundation (formerly National Alliance for Research on Schizophrenia & Depression) Young Investigator Award (to F.G.).

Author information

Authors and Affiliations

Department of Bioengineering, Stanford University, Stanford, CA, USA
Felicity Gore, Melissa Hernandez, Charu Ramakrishnan, Ailey K. Crow & Karl Deisseroth
Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, USA
Felicity Gore, Melissa Hernandez, Charu Ramakrishnan, Ailey K. Crow, Robert C. Malenka & Karl Deisseroth
Nancy Pritzker Laboratory, Stanford University, Stanford, CA, USA
Felicity Gore & Robert C. Malenka
Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
Karl Deisseroth

Authors

Felicity Gore
View author publications
You can also search for this author in PubMed Google Scholar
Melissa Hernandez
View author publications
You can also search for this author in PubMed Google Scholar
Charu Ramakrishnan
View author publications
You can also search for this author in PubMed Google Scholar
Ailey K. Crow
View author publications
You can also search for this author in PubMed Google Scholar
Robert C. Malenka
View author publications
You can also search for this author in PubMed Google Scholar
Karl Deisseroth
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

F.G., R.C.M. and K.D. conceived the project, designed the experiments and wrote the manuscript with input from all authors. F.G. performed the behavioral experiments, optogenetic manipulations, electrophysiological recordings and data analysis. M.H. assisted with the behavioral experiments. C.R. designed and produced the viral constructs. A.K.C. performed the light sheet microscopy. R.C.M. and K.D. supervised all aspects of the work.

Corresponding author

Correspondence to Karl Deisseroth.

Ethics declarations

Competing interests

R.C.M. is on the scientific advisory boards of MapLight Therapeutics, Bright Minds Biosciences, MindMed and Aelis Farma. K.D. is on the scientific advisory boards of MapLight Therapeutics, Stellaromics, and Bright Minds Biosciences. The other authors declare no competing interests.

Peer review

Peer review information

Nature Neuroscience thanks the anonymous reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Behavioral characterization.

a. Proportion of correct responses to each cue in final 3 training sessions before surgery; cues were comparably learned, n = 36 rats. b. Reinforcement contingencies where size of stimulus was not proportional to reward size for probing whether animals perform value-based or perceptual decision-making. c. Probability of choosing blackcurrant-predictive cue for all cue combinations for animals training using reversed reinforcement contingencies (n = 8 rats, one-way repeated-measures ANOVA). d. Probability of choosing blackcurrant-predictive cue as a function of the difference in the size of rewards available for animals trained using reversed reinforcement contingencies (n = 8 rats, one-way repeated-measures ANOVA). Inset, fraction of trials in which animal chose the larger available reward (n = 8 rats, 0.79±0.01). e. Latency to choice nosepoke response as a function of difference in the size of rewards available for animals trained using reversed reinforcement contingencies (n = 8 rats, one-way repeated-measures ANOVA). Center dot represents median, bars represent first and third quartile. f. Correlations of preference scores computed on 3 consecutive sessions. Preference scores are highly correlated, indicating juice preferences of individual animals are stable across time (Pearson’s correlation). * P < 0.05, ** P < 0.01, *** P < 0.001, Unless otherwise noted, data are presented as mean ± SEM.

Extended Data Fig. 2 Fiber placements.

a-d. Representative images of EYFP expression and fiber placements in animals injected with AAV8 hSyn:SwiChR++EYFP or AAV8 hSyn:EYFP in the OFC (a), the prelimbic cortex (b), the DMS (c), or the mediodorsal thalamus (d).

Extended Data Fig. 3 Optogenetic inhibition control experiments.

a-d. Optical stimulation does not alter economic decision-making in animals expressing EYFP in the OFC. a. Schematic of experimental preparation. b. Probability of choosing blackcurrant-predictive cue as a function of difference in the volume of available rewards for no illumination (green) and illumination (magenta) trials (n = 6 rats, two-way repeated-measures). Inset, fraction of trials in which animal chose the larger available reward on no illumination (green) and illumination (magenta) trials (n = 6 rats, two-sided paired t-test). c. Latency to choice nosepoke response as a function of the absolute difference in the size of rewards available on no illumination (green) and illumination (magenta) trials (n = 6 rats, two-way repeated-measures ANOVA). d. Juice preferences computed on trials with OFC illumination are correlated with juice preferences computed on trials without OFC illumination (Pearson’s correlation). e. Schematic of control task for probing whether effects of optical inhibition specifically impact choice behavior. f, g. Latency to nosepoke response for cues predicting different size rewards in control no-choice task on trials in which the OFC (f) or DMS (g) was not inhibited (green) or was inhibited (magenta). OFC or DMS inhibition did not alter latencies to respond in no-choice control task (OFC: n = 12 rats, DMS< n = 6 rats, two-way repeated-measures ANOVA). * P < 0.05, ** P < 0.01, *** P < 0.001, Data are presented as mean ± SEM.

Extended Data Fig. 4 Electrophysiology supporting data 1.

a. Proportion of units significantly modulated by distinct task features in the OFC (top) or DMS (bottom) for each individual animal. b. Left, proportions of neurons significantly modulated by objective (reward size) and subjective value did not differ across OFC (blue) or DMS (red). Right, coefficients of determination (R2) of each modulated unit in either OFC (blue) or DMS (red) when either the subjective or objective value of the stimuli presented on either left or right were used as predictors. Black lines denote means. OFC units were more strongly modulated by subjective value than objective value (n = 107 units per condition, three-way mixed ANOVA). c. Chosen-side classification accuracy of 4-fold cross validated support vector machine trained on single unit activity in the OFC (blue) or DMS (red) for each individual animal (R102 n = 86 units per brain area, R109 n = 141 units per brain area, R116 n = 54 units per brain area, R140 n = 103 units per brain area, R144, n = 145 units per brain area, R145 n = 133 OFC units, R123 n = 71 DMS units). * P < 0.05, ** P < 0.01, *** P < 0.001, Data are presented as mean ± SEM.

Extended Data Fig. 5 Electrophysiology supporting data 2.

a. Chosen-side classification accuracy of 4-fold cross validated support vector machine trained on single unit activity recorded in either the OFC (blue) or DMS (red) on correct trials, and tested on either held-out correct trials (left) or incorrect trials (center). Note decoding performance is reduced compared to Fig. 3 due to the relatively small number of incorrect trials performed. Right, classification accuracy reaches equivalent levels on both correct and incorrect trials in both the OFC and DMS (n = 20 random samples, two-way repeated-measures ANOVA). b. Average predicted choice parameters computed on single correct (left) or incorrect (right) trials aligned to choice response (n = 30 sessions). c. Peak predicted choice parameters are equivalent on correct and incorrect trials (n = 20 random samples, two-way repeated-measures ANOVA). d. Latency to predicted choice parameter threshold relative to choice on correct or incorrect choice trials for models trained using data recorded simultaneously from OFC (blues) or DMS (reds). OFC activity does not precede DMS activity when animals make incorrect choices (n = 20 random samples, two-way repeated-measures ANOVA). * P < 0.05, ** P < 0.01, *** P < 0.001, Data are presented as mean ± SEM.

Extended Data Fig. 6 Optogenetic axon terminal inhibition control experiments, related to Fig. 4.

a. Schematic of control task for probing whether effects of optically inhibiting OFC axon terminals specifically influence choice behavior. b, c. Latency to nosepoke response for cues predicting different size rewards in control no-choice task on trials in which the projection from OFC to DMS (b) or OFC to mediodorsal thalamus (c) was not inhibited (green) or was inhibited (magenta). Inhibition of OFC projections to DMS or mediodorsal thalamus does not alter latencies to respond in no-choice control task (n = 7 rats, two-way repeated-measures ANOVA). * P < 0.05, ** P < 0.01, *** P < 0.001, Data are presented as mean ± SEM.

Supplementary information

Reporting Summary

Supplementary Table 1

Statistics summary. Full statistical details for each figure.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gore, F., Hernandez, M., Ramakrishnan, C. et al. Orbitofrontal cortex control of striatum leads economic decision-making. Nat Neurosci 26, 1566–1574 (2023). https://doi.org/10.1038/s41593-023-01409-1

Download citation

Received: 12 February 2023
Accepted: 17 July 2023
Published: 17 August 2023
Issue Date: September 2023
DOI: https://doi.org/10.1038/s41593-023-01409-1

This article is cited by

Adolescents rats engage the orbitofrontal-striatal pathway differently than adults during impulsive actions
- Aqilah M. McCane
- Lo Kronheim
- Bita Moghaddam
Scientific Reports (2024)