Multivariate Neural Representations of Value during Reward Anticipation and Consummation in the Human Orbitofrontal Cortex

Yan, Chao; Su, Li; Wang, Yi; Xu, Ting; Yin, Da-zhi; Fan, Ming-xia; Deng, Ci-ping; Hu, Yang; Wang, Zhao-xin; Cheung, Eric F. C.; Lim, Kelvin O.; Chan, Raymond C. K.

doi:10.1038/srep29079

Download PDF

Article
Open access
Published: 05 July 2016

Multivariate Neural Representations of Value during Reward Anticipation and Consummation in the Human Orbitofrontal Cortex

Chao Yan^1,2,
Li Su³,
Yi Wang¹,
Ting Xu¹,
Da-zhi Yin⁴,
Ming-xia Fan⁵,
Ci-ping Deng²,
Yang Hu²,
Zhao-xin Wang²,
Eric F. C. Cheung⁶,
Kelvin O. Lim⁷ &
…
Raymond C. K. Chan¹

Scientific Reports volume 6, Article number: 29079 (2016) Cite this article

3082 Accesses
25 Citations
10 Altmetric
Metrics details

Subjects

Abstract

The role of the orbitofrontal cortex (OFC) in value processing is a focus of research. Conventional imaging analysis, where smoothing and averaging are employed, may not be sufficiently sensitive in studying the OFC, which has heterogeneous anatomical structures and functions. In this study, we employed representational similarity analysis (RSA) to reveal the multi-voxel fMRI patterns in the OFC associated with value processing during the anticipatory and the consummatory phases. We found that multi-voxel activation patterns in the OFC encoded magnitude and partial valence information (win vs. loss) but not outcome (favourable vs. unfavourable) during reward consummation. Furthermore, the lateral OFC rather than the medial OFC encoded loss information. Also, we found that OFC encoded values in a similar way to the ventral striatum (VS) or the anterior insula (AI) during reward anticipation regardless of motivated response and to the medial prefrontal cortex (MPFC) and the VS in reward consummation. In contrast, univariate analysis did not show changes of activation in the OFC. These findings suggest an important role of the OFC in value processing during reward anticipation and consummation.

Value and choice as separable and stable representations in orbitofrontal cortex

Article Open access 10 July 2020

A structural and functional subdivision in central orbitofrontal cortex

Article Open access 24 June 2022

Outcome contingency selectively affects the neural coding of outcomes but not of tasks

Article Open access 18 December 2019

Introduction

The orbitofrontal cortex (OFC) has received considerable attention for its role in value computation/representation and value/utilities comparison in decision-making tasks^1,2,3,4 as well as in the absence of an overt decision-making^5,6.

The OFC, occupying the ventral surface of the frontal part of the brain, is a relatively large and heterogeneous brain area in human (comprising Brodmann Areas (BA) 11, 12, 13, 14 and 47) and non-human primates (BA 10, 11, 12, 13 and 14)^7,8. It receives inputs from various sensory modalities and has reciprocal connections with limbic, striatal and frontal areas. It has been suggested that the OFC is a key and multifunctional brain area in the reward network^7,9. For example, complex or abstract reinforcers (i.e. money and social reward) are represented more anteriorly in the OFC than less complex reinforcers (i.e. food and erotic information)^7,10,11,12. Moreover, the medial (mOFC) and the lateral (lOFC) orbitofrontal cortex differentially respond to rewarding and punishing events^13,14. In an animal study, Rolls and colleagues¹⁵ have suggested that the different subpopulations of neurons in the OFC encode value across several modalities including taste and odour as well as visual cues of rewarding objects and faces. Furthermore, neurons of the rats’ OFC were found to encode reward value in a population of cells rather than by a single unit^16,17,18. These findings suggest a possible heterogeneous functional/anatomical organisation and distributed neural representations of values within the OFC.

In conventional neuroimaging analysis, which mainly focuses on mapping the extent of the regional averaged changes in blood-oxygen-dependent level (BOLD) signal¹⁹, considerable smoothing and averaging are employed during pre-processing and statistical testing. This may reduce the sensitivity for detecting subtle changes in anatomically/functionally heterogeneous areas (i.e. OFC) during reward processing^20,21,22. Multi-voxel pattern analysis (MVPA) may overcome this limitation by capturing fine-grained changes involved in the encoding of values^20,22. Few studies have investigated how the human OFC encodes value (valence and magnitude) during reward anticipation and consummation using MVPA^23,24,25 and findings had been mixed. Kahnt and his colleagues, using MVPA, had shown that distributed pattern in the mOFC represented reward value during both reward anticipation and consummation²³. Tusche et al.²⁵ reported that multivariate pattern in the ventral prefrontal cortex represented attractiveness of consumer products (cars), which could predict consumers’ future choices of purchasing. However, another studies suggested that valence rather than magnitude is represented in the central OFC (located between medial and lateral OFC, BA 11 and 13 ²⁶) during reward anticipation²⁴. In different studies, different phases (anticipatory vs. consummatory) of reward processing and sub-regions (mOFC, vmPFC and central OFC) were investigated, which may complicate the interpretation of OFC’s role in reward valuation. In the present study, we aimed to further examine whether the mOFC and the lOFC encode valence and magnitude information in the anticipatory and consummatory phases of reward processing.

Representational similarity analysis (RSA)^27,28,29,30, which is one type of MVPA, was employed in this study to detect multivariate fMRI activation pattern in the OFC. RSA was developed based on the assumption that information encoded by the brain can be represented by the similarity between fMRI patterns associated with different experimental conditions. In order to capture value processing during reward anticipation and consummation, the monetary incentive delay task (MID) was employed³¹. In this task, participants were presented with a cue (circle or square) with a value information (i.e. win¥5.00, exchange rate at the time of experiment was approximately 1 US dollar = ¥6.2) and were required to wait for a short period (anticipatory phase) before responding to a target. Following the target, there was another waiting period which was defined as the anticipatory phase after making a response. Finally, feedback containing reward or punishment information was informed to participants based on their performance (consummatory phase, Fig. 1). There were two types of anticipatory phase in the MID task: before and after making a response. Based on the framework of anticipatory affect model by Knutson et al.³², the anticipatory phase before making a response is regarded as a more important period, which determines human’s reward anticipation and promotes future motivated behaviour. Based on this theoretical framework, we focused on value processing during this anticipatory phase. In addition, we also examined the anticipatory phase after making a response, because this phase is a purer anticipatory phase and less affected by the response preparation.

In order to test whether or not the OFC represents valence and magnitude information during the anticipatory and consummatory phase, we constructed models RDMs based on the affective property of cue stimuli reflecting hypothesis on different value information (valence and magnitude) during each phase. For example, for a model of valence, regardless of how much money was presented, patterns of win conditions are similar to each other but different from the patterns of loss conditions and vice versa. There were three types of model RDMs for each kind of value information: a simple model for overall value encoding; a simple model for specific value encoding (i.e. win and loss for the valence model, respectively) and a complex model for continuous value encoding. The simple model reflects that the value was encoded as “all or none” (either the same or different pattern between conditions) while complex models represent a graded difference between values (See Fig. 2). Then we performed the Spearman’s correlation between the brain RDMs in the OFC and the models reflecting different value representations to see how the OFC represented value information.

Next, we explored whether or not the fMRI patterns in the OFC were similar to the activation patterns in those regions that were traditionally associated with reward anticipation and consummation. Previous studies have suggested that the anticipation of primary rewards (i.e. pleasant taste, smell)¹² and secondary rewards (i.e. monetary and social rewards)^11,33 increases the activity in the ventral striatum (VS) and the anterior insular (AI). A meta-analysis conducted by Liu and colleagues including 65 studies and 1553 foci and has implicated a role of the VS and the AI in reward anticipation⁴. On the other hand, in a recent imaging meta-analysis comprising 35 imaging studies and 461 foci, Diekhof and colleagues³⁴ found that the MPFC and the VS encode reward magnitude during the consummation of primary and secondary rewards. Another meta-analysis conducted by Knutson and Greer including 12 studies and 87 foci has also emphasized on the important role of the VS and MPFC in reward consummation³². Therefore, the VS and the AI during the reward anticipation as well as the medial prefrontal cortex (MPFC) and the VS during the reward consummation were chosen as traditional reference regions and compared with OFC in this study.

Results

Reaction time and subjective affective ratings

Participants responded to the target more quickly with an increase in monetary value, regardless of valence (win /loss), which was reflected by the significant main effect of magnitude (F (2, 44) = 18.87, p < 0.001) and non-significant valence × magnitude interaction (p = 0.24). For anticipatory experience, an interaction effect of valence × magnitude was observed in participants’ valence ratings (F (2, 21) = 20.657, p < 0.001). Participants reported more pleasantness and aversive feeling with increases in reward and punishment value, respectively. For consummatory experience, we observed a significant outcome × magnitude interaction in valence ratings (F (2, 44) = 56.968, p < 0.001), indicating that participants reported more pleasantness/aversive experience with increases in monetary value when participants received favourable/favourable outcomes. In terms of arousal, a main effect for magnitude was observed during both the anticipatory and the consummatory phase (anticipation: F (2, 21) = 20.272, p < 0.001; consummation: F (2, 21) = 20.526, p < 0.001), indicating that the participants were more excited with the increasing monetary value during both reward/punishment anticipation and consummation (see Supplementary Fig. 1 in Supplementary Materials).

Conventional univariate fMRI analysis

We did not observe any significant main effects of valence, magnitude, outcome or any interaction effects on OFC activation during either the anticipatory or the consummatory phase. Consistent with the previous findings by Knutson et al.³², our whole brain analysis showed a significant main effect for magnitude on activation in the VS in the anticipatory phase before making a response, a main effect of magnitude on activation in the AI in the anticipatory phase after making a response and a significant main effect of outcome and valence on activation in the VS and the MPFC in the consummatory phase (p_{FWE-corrected} < 0.05 at the cluster level, Fig. 3; and see Supplementary Table 1, 2 and 3 in Supplementary Materials).

We then carried out small volume correction analysis within the mOFC (BA 11: regions defined by BA atlas implemented in SPM) and lOFC (BA 12 and 47) during each phase. We found that, during either anticipatory phase before or after making a response, there was no significant main effect for valence, main effect for magnitude or valence × magnitude interaction effect on the mOFC and the lOFC activities. During consummatory phase, we did not observe any significant main effect or interaction in the mOFC or the lOFC either. Also, similar findings were observed when we carried out the analysis based on the percentage of BOLD signal change in the OFC (See Fig. 2 in the supplementary material).

Therefore, consistent with the previous studies, the conventional univariate analysis failed to provide robust evidence suggesting the involvement of the OFC in reward anticipation and consummation. In the next section, we will investigate whether fine-grained multivariate activation patterns in OFC were more sensitive in revealing representations associated with reward values.

Multivariate representational similarity fMRI analysis

In RSA, the primary data structure: representation dissimilarity matrix (RDM) is a correlation distance (1 – Pearson correlation) matrix of beta estimates associated with experimental conditions, reflecting brain activation patterns during tasks (e.g. anticipating a reward of ¥5.00) in concerned brain areas. Each element of the RDM is a correlation distance, called distance coefficient (DC), indicates similarity of activation patterns between the pairs of experimental conditions. A small DC indicates high similarity between multivariate brain activation patterns and vice versa.

OFC encodes valence and magnitude of reward

During the anticipatory and consummatory phases, to examine whether the OFC encodes valence and magnitude information, we compared brain activation RDMs in the OFC with the a-priori model RDMs based on stimulus types (valence and magnitude) using Spearman’s correlation. Non-parametric permutation testing (10,000 permutations) was used to test the significance of correlations.

In the anticipatory phase before making a response, we found that activation RDM in the lOFC matched the simple model RDM for magnitude (specific, none vs. small + large) at a trend level (lOFC: r = 0.40, p_permutation = 0.07). In addition, the RDM of the mOFC showed a trend to match the simple and complex model RDM for magnitude (overall) (simple: r = 0.46, p_permutation = 0.07; complex: r = 0.45, p_permutation = 0.07). However, we did not observe any significant correlation between brain activation RDM in the OFC and model RDMs for valence (all ps > 0.1). In the anticipatory phase after making the response, there was a trend level correlation between the activation RDM in the lOFC and the simple model RDM for valence (loss) (lOFC: r = 0.46, p_permutation = 0.10) (See Table 1).

Table 1 Relationships between Brain RDM in the mOFC, lOFC and Model RDMs for anticipatory and consummatory phase.

Full size table

In the consummatory phase, similar to RDMs of the VS and the MPFC, RDMs of the mOFC and the lOFC significantly matched the simple and complex model RDMs for magnitude (overall) (r s = 0.23 to 0.54, ps_permutation < 0.05), suggesting that the OFC encoded magnitude information during the consummatory phase of reward processing. On the other hand, it was found that activation RDMs in the mOFC and the lOFC matched either the simple or complex model RDMs for valence (win vs. loss) (rs = 0.28 to 0.43, ps_permutation < 0.05), rather than model RDMs for outcome (ps_permutation > 0.1) (see Table 1), suggesting the OFC encoded valence (win vs. loss) regardless of favourable or unfavourable outcomes. The lOFC rather than the mOFC matched the simple model for loss (lOFC: r = 0.33, p_permutation = 0.004; mOFC: r = 0.02, p_permutation = 0.42).

Comparing OFC with other regions within the reward network

In order to examine whether the mOFC and the lOFC encode value information similarly as the VS, the AI and the MPFC, we performed Spearman’s correlations between the RDMs at the mOFC and the lOFC and RDM at the traditional reference regions during the anticipatory and consummatory phases.

We observed that the RDM of the OFC was similar to that of the VS (lOFC: r = 0.50, p_permutation = 0.03; but mOFC: r = 0.09, p_permutation = 0.37) and AI (lOFC: r = 0.47, p_permutation = 0.03; mOFC: r = 0.64, p_permutation = 0.01) in the anticipatory phase before making a response (See Fig. 4A). Within the win and loss components of the RDM, we did not observe any significant difference between the DCs of the mOFC, the lOFC, the VS and the AI (ps_{Bonferroni corrected} > 0.1).

During the anticipatory phase after making the response, the RDMs of the lOFC were significantly similar to that of the VS (r = 0.66, p_permutation = 0.01) and show a trend level of similarity to that of the AI (r = 0.41, p_permutation = 0.099) (See Fig. 4A). Within the win or loss component RDM, there was no significant difference in DCs between the mOFC, the lOFC, the VS and the AI (ps_{Bonferroni corrected} > 0.1). Therefore, although no significant activation was detected on conventional univariate analysis, the RSA showed that the OFC was indeed encoding similar information as the VS/the AI during the anticipatory phase before and after making a response.

During the consummatory phase, RDMs of the mOFC and the lOFC were similar to those of the MPFC (mOFC: r = 0.79, p_permutation < 0.001; lOFC: r = 0.37, p_permutation = 0.001) and the VS (mOFC: r = 0.44, p_permutation = 0.001; lOFC: r = 0.68, p_permutation < 0.001), suggesting the involvement of the OFC in reward consummation (see Fig. 4B). For the win, loss avoidance, no win and loss components of the RDMs, there were no significant differences in DCs between the VS, the MPFC, the mOFC and the lOFC (ps_{Bonferroni corrected} > 0.1). Taken together, these findings suggest that the OFC was encoding similar information as the VS and the MPFC during reward consummation.

Discussion

Our multivariate analysis showed that the OFC encoded magnitude and partial valence information during the consummatory phase of reward processing. The lOFC may represent loss information regardless of favourable or unfavourable outcome. During the anticipatory phase, the lOFC may encode magnitude information before making a response and valence information after making a response. In addition, the OFC exhibited similar activation patterns as the VS/AI during the anticipatory phase before and after making a response and as the MPFC and the VS during the consummatory phase. By contrast, OFC activations were not observed during the anticipatory or the consummatory phase using conventional univariate approaches suggesting that the neural representations of reward value in OFC maybe distributed across populations of neurons in human brains.

Although we did not find any OFC activation during the anticipatory or the consummatory phases of the MID task using conventional univariate analysis, the multi-voxel activation patterns in the OFC were largely similar to that of the VS/AI/MPFC using RSA. Our result is consistent with one previous study investigating value representation in the OFC during reward anticipation and consummation using multivariate pattern classification²³. In this study, Kahnt found that the reward value of complicated sensory cues (i.e. rotation direction and colour) could be decoded from distributed fMRI patterns in the OFC during both reward anticipation and consummation²³. They did not observe any significant activations using conventional analysis, either²³. This suggests that multivariate pattern analysis may be a more sensitive and appropriate approach than conventional univariate analysis to detect neural responses in the OFC, which is a functionally and anatomically heterogeneous brain area^7,9. Using a slightly different design (containing both reward and punishment cues and outcome) and different types of MVPA (i.e. RSA), we confirmed that multi-voxel fMRI pattern in the OFC encoded value during both reward anticipation and consummation, which is also consistent with evidences from animal studies^16,17,18. In the present study, we investigated whether the OFC was involved in value processing during the anticipatory phase when no motivated response was required. It was found that the lOFC encoded values in a similar way as the AI and the VS during this phase. Although this result was less commonly reported in previous studies, it suggests that the OFC might be involved in value encoding during reward anticipation regardless of whether overt responses are involved or not.

During the consummatory phase, the OFC represented partial valence information (win vs. loss) regardless of outcome produced (favourable vs. unfavourable), which is consistent with previous findings suggesting that activities in the OFC may be associated with valence during anticipation^12,24, decision making^35,36 and consummation of outcome^37,38. In the recent study using MVPA, Kahnt et al.²⁴ found that multi-voxel patterns of the OFC represented valence information in the anticipatory phase. Specifically, the patterns coding for appetitive and aversive outcomes were similar, indicating a common neural circuit for encoding both appetitive and aversive values²⁴. Extending the finding by Kanht and his colleagues, our study showed that OFC might also encode valence information in the consummatory phase. Furthermore, the lOFC rather than mOFC particularly responded to monetary loss, which is partly consistent with previous evidence suggesting that the medial-lateral OFC may differentially respond to rewarding and punishing events^13,14. Unexpectedly, the way the OFC encoded valence was independent of outcome, possibly because that there are other connected brain regions that encode outcome information.

Interestingly, our findings suggest that the OFC also represents magnitude during the consummatory phase, which is different from previous studies suggesting that the OFC does not represent magnitude information^{24,35,36,37,38}. By contrast, this is consistent with results from a previous meta-analysis, which reported that the mOFC/MPFC processes magnitude during reward consummation³⁴. The inconsistency might be due to the paradigm selected in the studies. In the present study, the paradigm we employed yielded stronger multi-voxel patterns in the OFC during the consummatory phase. In most previous studies, participants were asked to either make a decision or to wait for the reward/punishment stimuli, which lacked the feeling of self-involvement and subjective effort^{12,23,24,35,36}. However, in our MID task, outcomes were determined by explicit motor responses, which increased the sense of agency and motivation and might result in an arousal-like pattern in the OFC during the consummatory phase. Taken together, both valence and magnitude information appear to be encoded by the OFC during reward consummation.

During the anticipatory phase before participants’ making a response, the multi-voxel pattern of the mOFC and the lOFC tended to represent magnitude rather than valence information. This finding is similar to the VS showing increased activation for both anticipated gain and loss when outcomes were uncertain and salience was high³⁹, supporting the anticipatory affect model proposed by Knutson and Greer³². In their model, the feeling of anticipation for future uncertain outcomes is closely related to human arousal regardless of valence³². A recent study has also suggested that the OFC may encode a general anticipatory value signal, regardless of reinforcer valence (appetitive/aversive)⁴⁰. Interestingly, after the participants had responded, the lOFC tended to represent valence (especially for loss) instead, which is partly consistent with previous findings suggesting the OFC’s involvement in encoding valence^12,24,37,38. The reason for the lOFC’s differential value representations during different anticipatory phase might be related to whether overt responses were required during the task. If motivated responses were not required, the OFC represents valence rather than magnitude of predicted outcomes²⁴. Or, it might produce stronger multi-voxel patterns in the OFC encoding magnitude over valence information because overt response is closely related with arousal dimension of anticipatory affect³². Thus, our data suggests a possibility that multi-voxel pattern in the lOFC may differentially encode magnitude and valence information during the anticipatory phase of reward processing before and after making a response. Future research is needed to clarify the specific role of lOFC during reward anticipation.

This study has several limitations. First, the lOFC is an area where there might be relatively higher fMRI signal drop-out. To avoid signal drop-out in the lOFC due to magnetic susceptibility in homogeneity, we angled slices away from the orbits. Secondly, subjective ratings were obtained off-line, which might be noisy estimates of valence and arousal during the task due to the delay. Future studies should evaluate valence and arousal experience during the task.

Conclusion

In conclusion, our findings suggest that similar to the VS, the AI and the MPFC, the OFC may also play an important role in value processing during both reward anticipation and consummation. The fine-grained multi-voxel activation pattern of the OFC might encode both valence and magnitude information in reward consummation.

Methods

Participants

Twenty-three participants (12 males; mean age: 19.78 (sd = 0.8)) were recruited from the East China Normal University and the Shanghai Normal University. All of them had no personal or family history of neurological, psychiatric or personality disorders. The study was approved by the Ethics Committee of the Institute of Psychology, the Chinese Academy of Sciences and was carried out in accordance with the approved guidelines. Written informed consent was obtained from all participants.

Monetary Valence Delay Task

We employed a modified version of the “Monetary Valence Delay Task” (MID) developed by Knutson et al.³¹. Each trial started with the presentation of a cue. The shapes of the cue (circle/square) indicated win or loss. The number of lines inside the cue reflected the amount of money (no line = ¥0, one line = ¥0.50 and three lines = ¥5.00). Following a pseudo-random delay (2000–2500 ms) in the anticipatory phase (before making a response), participants responded to the target (a white solid square) that appeared for a variable length of time (110 ms–560 ms) by pressing a button as quickly as possible with the right index finger. After another delay (anticipatory phase after making the response, 1500–2500 ms), a feedback was given to the participants indicating the amount of money they had won or lost and their current balance. Participants could receive or avoid loss of money by successfully pressing the button while the targets were still presented on the screen. Task difficulty, which was evaluated based on reaction times collected during the practice session before scanning, was matched across participants to have a success rate of approximately 66%. Each trial lasted for about 10 seconds (Fig. 1). Participants underwent three 9-minute-12-second sessions, each of which comprised 54 trials and were informed prior to the experiment that they would be paid with real money at the end of the experiment based on their winnings during the experiment. Participant earned between 23.7 (3.8 US$) to 56.7 (9.14 US$) RMB on average. The US Dollar to Chinese Yuan exchange rate was approximately 1 $ = ¥6.2 at the time of the experiment. In the MID task, three within-group variables were designed: valence (win vs. loss); magnitude (none (¥0) vs. small (¥0.50) vs. large (¥5.00)); and outcome (favourable vs. unfavourable. A favourable outcome refers to hitting the targets resulting in a non-zero reward or avoiding a non-zero loss).

After the scans, participants were immediately asked to rate their emotional state across the valence, magnitudes and outcome dimensions during the anticipatory and the consummatory phase of the MID task. A nine-point liker scale was used to measure valence (1 = extremely negative, 5 = neutral, 9 = extremely positive) and arousal (1 = extremely calm, 9 = extremely excited).

Imaging acquisition

All the participants were scanned in a 3-Tesla Siemens Trio magnetic resonance imaging scanner (Siemens Medical Solutions, Erlangen, Germany). The functional images were acquired with the following sequence: TR = 2000 ms, TE = 30 ms, field of view (FOV) = 210 × 210 mm, flip angle = 90 degree, resulting in voxel size of 3.3 × 3.3 × 4 mm, Slice number = 32. To avoid signal drop-out in the lOFC due to magnetic susceptibility in homogeneity, we angled slices away from the orbits. Participants viewed visual stimuli on a projector screen via a mirror fixed on the head coil and responded with the right index finger by pressing the button on a response glove fixed on their right hand. High-resolution T1-weighted MPRAGE anatomy images were also acquired (TR = 2530 ms, TE = 30 ms, FOV = 256 × 256 mm, flip angle = 7 degree, 192 continuous axial slice of 1-mm thickness, voxel size = 1 × 1 × 1 mm).

Data analysis

Behavioural data analysis

For motivated behaviour and subjective rating of anticipatory pleasantness, two-way (Valence × Magnitude) repeated measure ANOVAs were separately performed on reaction time, valence and arousal ratings. For the consummatory phase, a three-way (Outcome × Valence × Magnitude) repeated measure ANOVA was performed on valence and arousal. Valence, magnitude and outcome (favourable and unfavourable) were set as within-subject factors. Multiple comparisons were controlled using Bonferroni correction.

fMRI data analysis

Pre-processing

Functional images were analysed using the SPM8 (Statistic Parametric Mapping, Welcome Department of Neurology, London, UK, 2009). Data pre-processing included realignment, slice timing, co-registration, anatomical segmentation and normalisation. Brain images were smoothed with 8 mm Gaussian kernel for univariate analysis but were unsmoothed for RSA in order to preserve the fine spatial details in the fMRI signal. For each participant, a general linear model (GLM) (Friston et al.¹⁹) was used to estimate a BOLD response for each condition containing 12 regressors for conditions during the anticipatory phase (-¥5.00, -¥0.50, -¥0.00, ¥0.00, ¥0.50, ¥5.00) before (6 regressors) and after (6 regressors) making a response, 12 regressors for different outcomes during the consummatory phase: win (expected reward), avoid loss (expected relief), no win (expected failure), loss (expected punishment) for three magnitudes and six regressors for head movement parameters, a regressor for frame-wise displacement (FD)⁴¹ and separate regressors for each extreme spike (FD > 0.2 mm) across each run to control for head motion; one for whiter matter signal and one for cerebro-spinal fluid signal at the GLM analysis.

Conventional univariate analysis

The conventional univariate whole-brain analysis was performed using two-way repeated measure ANOVAs, which aimed to determine brain areas showing significant main effects of valence condition (win vs. loss), magnitude (large vs. small vs. none) and interactions of valence condition × magnitude in the anticipatory phase before and after making a response, respectively. For the consummatory phase, a three-way ANOVA analysis was performed with outcome (favourable vs. unfavourable), valence condition and magnitude as within-group variables. These contrasts were thresholded at p < 0.001 (uncorrected) at the peak level and p < 0.05 (Family Wise Error (FWE) for multiple comparison) at the cluster level, followed by small-volume correction.

Representational Similarity Analysis

Independent from the univariate analysis, Representational Similarity Analysis (RSA)^27,28,29,30 was carried out in anatomically defined ROIs. First, the brain RDMs were computed based on the beta estimates of each condition from the GLM using correlation distance (1 – Spearman’s rho). This resulted in a 6 × 6 (anticipatory) and a 12 × 12 (consummatory) brain RDM for each participant at the concerned ROIs in the anticipatory and the consummatory phase, respectively (see Fig. 4A,B). The value of the correlation distance, named distance coefficient (DC), indicates similarity of activation patterns between two conditions. For the ease of discussion, we will use the term “similarity” instead of “dissimilarity” in the rest of the paper.

To detect whether the mOFC and the lOFC represented valence and magnitude information in the anticipatory and the consummatory phases, we defined a number of model RDMs which reflected valence and magnitude coding and used Spearman correlations to test whether brain RDMs of the mOFC and the lOFC matched these a-priori model RDMs. There were three types of model RDMs for each type of value information (valence, magnitude, outcome): a simple model for overall value encoding (i.e. valence; DC = 0 or 1; where 0 indicates that the two conditions are the same while 1 indicates the difference); simple models for specific value encoding (i.e. win and loss for the valence model, respectively) and complex models for overall value encoding in a transitional way (i.e. valence, DC ranged from 0 to 1). The simple model reflects that the value was encoded as “all or none” (either the same or different between conditions) while complex models represent a gradual difference between values. For example, within the model RDM for valence, winning large reward was assumed to be completely the same as winning no reward in the simple model (DC = 0), but partly similar to winning no reward (DC = 0.5) in the complex model (please see supplementary material for more details). Then we performed the Spearman’s correlation between the brain activation patterns in the OFC and the models reflecting different value representations. (see Fig. 2 and the Supplementary Materials for more details).

Non-parametric permutation testing (10,000 permutations) was used to see whether multi-voxel patterns in the OFC were similar to the patterns in the traditional reference regions such as VS, the AI and the MPFC as reported by previous researches^4,32,34. Spearman correlations were used to test the correlations between RDMs of the mOFC, lOFC, VS and AI in the anticipatory phase and correlations between the RDMs of the mOFC, lOFC, VS and MPFC in the consummatory phase, respectively.

Within the anticipatory/consummatory phase, we further examined whether the patterns of the sub-components of the RDM (including “Win” and “Loss” component within the anticipatory RDM and “Win”, “Avoid Loss”, “No Win” and “Loss” component within the consummatory RDM, Supplementary Fig. 2) were different or not between ROIs. Standardized Fisher-transformed DCs within each sub-component of RDM were extracted and averaged across three runs. Independent t-tests were performed on averaged DCs of subcomponents between different ROIs. P value was adjusted using Bonferroni correction.

ROIs selection

All regions of interest (ROI) were anatomically defined based on brain atlases⁴² implemented in Wake Forest University PickAtlas toolbox (version 3.0 4) in the SPM 8⁴³. Because of previous meta-analysis implicating the VS and the AI in the value processing during anticipatory phase^4,32 and the VS and the MPFC during consummatory phase^32,34, we considered these regions as traditional regions associated with value processing. The mOFC and the lOFC, on the other hand, was the ROIs for which we hypothesized based on previous studies^7,8,10,23,40. We defined anatomical ROI masks including a MPFC mask (BA 10, BA 32 and BA 25)⁴⁴, a AI mask (BA 13 bounded caudally at y = 0 to include only the anterior region)^45,46, a mOFC mask (BA 11)⁴⁴ and a lOFC mask (BA 12 and BA 47)⁷ based on Brodmann Areas Map as well as a VS mask (nucleus accumbens) based on Individual Brain Atlases using Statistical Parametrical Mapping (IBASPM)³². All of these ROIs were defined in MNI space.

Additional Information

How to cite this article: Yan, C. et al. Multivariate Neural Representations of Value during Reward Anticipation and Consummation in the Human Orbitofrontal Cortex. Sci. Rep. 6, 29079; doi: 10.1038/srep29079 (2016).

References

Kennerley, S. W. & Walton, M. E. Decision making and reward in frontal cortex: complementary evidence from neurophysiological and neuropsychological studies. Behav. Neurosci. 125, 297–317 (2011).
PubMed PubMed Central Google Scholar
Rushworth, M. F., Noonan, M. P., Boorman, E. D., Walton, M. E. & Behrens, T. E. Frontal Cortex and Reward-Guided Learning and Decision-Making. Neuron 70, 1054–1069 (2011).
CAS PubMed Google Scholar
Wallis, J. D. Orbitofrontal cortex and its contribution to decision-making. Annu. Rev. Neurosci. 30, 31–56 (2007).
CAS PubMed Google Scholar
Liu, X., Hairston, J., Schrier, M. & Fan, J. Common and distinct networks underlying reward valence and processing stages: a meta-analysis of functional neuroimaging studies. Neurosci. Biobehav. Rev. 35, 1219–1236 (2011).
PubMed Google Scholar
Lebreton, M., Jorge, S., Michel, V., Thirion, B. & Pessiglione, M. An automatic valuation system in the human brain: evidence from functional neuroimaging. Neuron 64, 431–439 (2009).
CAS PubMed Google Scholar
Smith, D. V. et al. Distinct value signals in anterior and posterior ventromedial prefrontal cortex. J. Neurosci. 30, 2490–2495 (2010).
CAS PubMed PubMed Central Google Scholar
Kringelbach, M. L. The human orbitofrontal cortex: linking reward to hedonic experience. Nat. Rev. Neurosci. 6, 691–702 (2005).
CAS PubMed Google Scholar
Rolls, E. T. The functions of the orbitofrontal cortex. Brain Cogn. 55, 11–29 (2004).
PubMed Google Scholar
Kahnt, T., Chang, L. J., Park, S. Q., Heinzle, J. & Haynes, J.-D. Connectivity-Based Parcellation of the Human Orbitofrontal Cortex. The Journal of Neuroscience 32, 6240–6250 (2012).
CAS PubMed PubMed Central Google Scholar
Sescousse, G., Redouté, J. & Dreher, J.-C. The Architecture of Reward Value Coding in the Human Orbitofrontal Cortex. The Journal of Neuroscience 30, 13095–13104 (2010).
CAS PubMed PubMed Central Google Scholar
Spreckelmeyer, K. N. et al. Anticipation of monetary and social reward differently activates mesolimbic brain structures in men and women. Soc. Cogn. Affect. Neurosci. 4, 158–165 (2009).
PubMed PubMed Central Google Scholar
O’Doherty, J. P., Deichmann, R., Critchley, H. D. & Dolan, R. J. Neural Responses during Anticipation of a Primary Taste Reward. Neuron 33, 815–826 (2002).
PubMed Google Scholar
Elliott, R., Agnew, Z. & Deakin, J. F. Hedonic and informational functions of the human orbitofrontal cortex. Cereb. Cortex 20, 198–204 (2010).
CAS PubMed Google Scholar
Rolls, E. T., Kringelbach, M. L. & de Araujo, I. E. Different representations of pleasant and unpleasant odours in the human brain. Eur. J. Neurosci. 18, 695–703 (2003).
PubMed Google Scholar
Rolls, E. T. The Orbitofrontal Cortex and Reward. Cereb. Cortex 10, 284–294 (2000).
CAS PubMed Google Scholar
van Duuren, E. et al. Neural coding of reward magnitude in the orbitofrontal cortex of the rat during a five-odor olfactory discrimination task. Learn. Memory 14, 446–456 (2007).
Google Scholar
van Duuren, E. et al. Single-cell and population coding of expected reward probability in the orbitofrontal cortex of the rat. J. Neurosci. 29, 8965–8976 (2009).
CAS PubMed PubMed Central Google Scholar
van Duuren, E., Lankelma, J. & Pennartz, C. M. Population coding of reward magnitude in the orbitofrontal cortex of the rat. J. Neurosci. 28, 8590–8603 (2008).
CAS PubMed PubMed Central Google Scholar
Friston, K. J. et al. Statistical parametric maps in functional imaging: A general linear approach. Hum. Brain Mapp. 2, 189–210 (1994).
Google Scholar
Haxby, J. V. et al. Distributed and overlapping representations of faces and objects in ventral temporal cortex. Science 293, 2425–2430 (2001).
ADS CAS PubMed Google Scholar
Kamitani, Y. & Tong, F. Decoding the visual and subjective contents of the human brain. Nat. Neurosci. 8, 679–685 (2005).
CAS PubMed PubMed Central Google Scholar
Kriegeskorte, N., Goebel, R. & Bandettini, P. Information-based functional brain mapping. Proc. Natl. Acad. Sci. USA 103, 3863–3868 (2006).
ADS CAS Google Scholar
Kahnt, T., Heinzle, J., Park, S. Q. & Haynes, J. D. The neural code of reward anticipation in human orbitofrontal cortex. Proc. Natl. Acad. Sci. USA 107, 6010–6015 (2010).
ADS CAS PubMed Google Scholar
Kahnt, T., Park, S. Q., Haynes, J.-D. & Tobler, P. N. Disentangling neural representations of value and salience in the human brain. Proc. Natl. Acad. Sci. USA 111, 5000–5005 (2014).
ADS CAS PubMed Google Scholar
Tusche, A., Bode, S. & Haynes, J.-D. Neural Responses to Unattended Products Predict Later Consumer Choices. The Journal of Neuroscience 30, 8024–8031 (2010).
CAS PubMed PubMed Central Google Scholar
Valentin, V. V., Dickinson, A. & O’Doherty, J. P. Determining the neural substrates of goal-directed learning in the human brain. J. Neurosci. 27, 4019–4026 (2007).
CAS PubMed PubMed Central Google Scholar
Carlin, Johan D., Calder, Andrew J., Kriegeskorte, N., Nili, H. & Rowe, James B. A Head View-Invariant Representation of Gaze Direction in Anterior Superior Temporal Sulcus. Curr. Biol. 21, 1817–1821 (2011).
CAS PubMed PubMed Central Google Scholar
Cichy, R. M., Pantazis, D. & Oliva, A. Resolving human object recognition in space and time. Nat Neurosci 17, 455–462 (2014).
CAS PubMed PubMed Central Google Scholar
Kriegeskorte, N. et al. Matching Categorical Object Representations in Inferior Temporal Cortex of Man and Monkey. Neuron 60, 1126–1141 (2008).
CAS PubMed PubMed Central Google Scholar
Nili, H. et al. A toolbox for representational similarity analysis. PLoS Comput. Biol. 10, e1003553 (2014).
PubMed PubMed Central Google Scholar
Knutson, B., Adams, C. M., Fong, G. W. & Hommer, D. Anticipation of increasing monetary reward selectively recruits nucleus accumbens. J. Neurosci. 21, RC159 (2001).
CAS PubMed PubMed Central Google Scholar
Knutson, B. & Greer, S. M. Anticipatory affect: neural correlates and consequences for choice. Philos. Trans. R. Soc. Lond. B Biol. Sci. 363, 3771–3786 (2008).
PubMed PubMed Central Google Scholar
Cho, Y. T. et al. Nucleus accumbens, thalamus and insula connectivity during incentive anticipation in typical adults and adolescents. NeuroImage 66, 508–521 (2013).
PubMed Google Scholar
Diekhof, E. K., Kaps, L., Falkai, P. & Gruber, O. The role of the human ventral striatum and the medial orbitofrontal cortex in the representation of reward magnitude – An activation likelihood estimation meta-analysis of neuroimaging studies of passive reward expectancy and outcome processing. Neuropsychologia 50, 1252–1266 (2012).
PubMed Google Scholar
Hardin, M. G., Pine, D. S. & Ernst, M. The influence of context valence in the neural coding of monetary outcomes. NeuroImage 48, 249–257 (2009).
PubMed PubMed Central Google Scholar
Litt, A., Plassmann, H., Shiv, B. & Rangel, A. Dissociating Valuation and Saliency Signals during Decision-Making. Cereb. Cortex 21, 95–102 (2011).
PubMed Google Scholar
Anderson, A. K. et al. Dissociated neural representations of intensity and valence in human olfaction. Nat. Neurosci. 6, 196–202 (2003).
CAS PubMed Google Scholar
Lewis, P. A., Critchley, H. D., Rotshtein, P. & Dolan, R. J. Neural correlates of processing valence and arousal in affective words. Cereb. Cortex 17, 742–748 (2007).
CAS PubMed Google Scholar
Cooper, J. C. & Knutson, B. Valence and salience contribute to nucleus accumbens activation. NeuroImage 39, 538–547 (2008).
PubMed Google Scholar
Metereau, E. & Dreher, J.-C. The medial orbitofrontal cortex encodes a general unsigned value signal during anticipation of both appetitive and aversive events. Cortex (in press).
Power, J. D., Barnes, K. A., Snyder, A. Z., Schlaggar, B. L. & Petersen, S. E. Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. NeuroImage 59, 2142–2154 (2012).
Google Scholar
Kriegeskorte, N., Simmons, W. K., Bellgowan, P. S. & Baker, C. I. Circular analysis in systems neuroscience: the dangers of double dipping. Nat Neurosci 12, 535–540 (2009).
CAS PubMed PubMed Central Google Scholar
Maldjian, J. A., Laurienti, P. J., Kraft, R. A. & Burdette, J. H. An automated method for neuroanatomic and cytoarchitectonic atlas-based interrogation of fMRI data sets. NeuroImage 19, 1233–1239 (2003).
PubMed Google Scholar
Haber, S. N. & Knutson, B. The Reward Circuit: Linking Primate Anatomy and Human Imaging. Neuropsychopharmacology 35, 4–26 (2009).
PubMed Central Google Scholar
Dupont, S., Bouilleret, V., Hasboun, D., Semah, F. & Baulac, M. Functional anatomy of the insula: new insights from imaging. Surg. Radiol. Anat. 25, 113–119 (2003).
CAS PubMed Google Scholar
Craig, A. D. How do you feel–now? The anterior insula and human awareness. Nat. Rev. Neurosci. 10, 59–70 (2009).
CAS PubMed Google Scholar

Download references

Acknowledgements

We would like to thank Dr Nikolaus Kriegeskorte for useful discussions and helps on RSA. This study was supported by grants from the National Science Fund China (81088001, 31500894 and 91132701), the Strategic Priority Research Programme (B) of the Chinese Academy of Science (XDB02030002), the Beijing Training Project for the Leading Talents in S & T (Z151100000315020), the Beijing Municipal Science & Technology Commission Grant and a grant from the initiation fund of the CAS/SAFEA International Partnership Programme for Creative Research Team (Y2CX131003). LS is funded by the NIHR Biomedical Research Centre and Biomedical Research Unit in Dementia based at Cambridge University Hospitals NHS Foundation Trust and the University of Cambridge. These funding agents had no role in the study design; collection, analysis and interpretation of the data; writing of the manuscript; or decision to submit the paper for publication.

Author information

Authors and Affiliations

Neuropsychology and Applied Cognitive Neuroscience Laboratory, Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences, Room 606, South Building, 16 Lincui Road, Beijing, Beijing, 100101, China
Chao Yan, Yi Wang, Ting Xu & Raymond C. K. Chan
Key Laboratory of Brain Functional Genomics, Ministry of Education, Shanghai Key Laboratory of Brain Functional Genomics (MOE & STCSM), East China Normal University, Room 213, Junxiu Building, 3663 North Zhongshan Road, Shanghai, 200062, China
Chao Yan, Ci-ping Deng, Yang Hu & Zhao-xin Wang
Department of Psychiatry, Cambridge Biomedical Campus, University of Cambridge, Cambridge, CB2 0SP, UK
Li Su
Institute of Neuroscience, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China
Da-zhi Yin
Shanghai Key Laboratory of MRI, East China Normal University, 3663 North Zhongshan Road, Shanghai, 200062, China
Ming-xia Fan
Department of General Adult Psychiatry, Castle Peak Hospital, 15 Tsing Chung Koon Road, Tuen Mun, N.T. Hong Kong Special Administrative Region, China
Eric F. C. Cheung
Department of Psychiatry, University of Minnesota, F282/2A West 2450 Riverside Avenue, Minneapolis, MN 55454, USA
Kelvin O. Lim

Authors

Chao Yan
View author publications
You can also search for this author in PubMed Google Scholar
Li Su
View author publications
You can also search for this author in PubMed Google Scholar
Yi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ting Xu
View author publications
You can also search for this author in PubMed Google Scholar
Da-zhi Yin
View author publications
You can also search for this author in PubMed Google Scholar
Ming-xia Fan
View author publications
You can also search for this author in PubMed Google Scholar
Ci-ping Deng
View author publications
You can also search for this author in PubMed Google Scholar
Yang Hu
View author publications
You can also search for this author in PubMed Google Scholar
Zhao-xin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Eric F. C. Cheung
View author publications
You can also search for this author in PubMed Google Scholar
Kelvin O. Lim
View author publications
You can also search for this author in PubMed Google Scholar
Raymond C. K. Chan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.C. designed the study, analysed the data and wrote the first draft of the paper. S.L. and X.T. analysed the data and interpreted the data. W.Y., Y.D., F.M., D.C.P., H.Y. and W.Z.X. collected the data. C.E.F.C. and L.K.O. interpreted the data and commented the paper significantly. C.R.C.K. generated the idea, interpreted the data, wrote the first draft of the paper. All authors read and commented on the final version of the paper.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Yan, C., Su, L., Wang, Y. et al. Multivariate Neural Representations of Value during Reward Anticipation and Consummation in the Human Orbitofrontal Cortex. Sci Rep 6, 29079 (2016). https://doi.org/10.1038/srep29079

Download citation

Received: 02 October 2015
Accepted: 10 June 2016
Published: 05 July 2016
DOI: https://doi.org/10.1038/srep29079

This article is cited by

Representation of rewards differing in their hedonic valence in the caudate nucleus correlates with the performance in a problem-solving task in dogs (Canis familiaris)
- Laura V. Cuaya
- Raúl Hernández-Pérez
- Dorottya Júlia Ujfalussy
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.