Flexible combination of reward information across primates

Farashahi, Shiva; Donahue, Christopher H.; Hayden, Benjamin Y.; Lee, Daeyeol; Soltani, Alireza

doi:10.1038/s41562-019-0714-3

Article
Published: 09 September 2019

Flexible combination of reward information across primates

Nature Human Behaviour volume 3, pages 1215–1224 (2019)Cite this article

2839 Accesses
44 Citations
92 Altmetric
Metrics details

Subjects

Abstract

A fundamental but rarely contested assumption in economics and neuroeconomics is that decision-makers compute subjective values of risky options by multiplying functions of reward probability and magnitude. By contrast, an additive strategy for valuation allows flexible combination of reward information required in uncertain or changing environments. We hypothesized that the level of uncertainty in the reward environment should determine the strategy used for valuation and choice. To test this hypothesis, we examined choice between risky options in humans and rhesus macaques across three tasks with different levels of uncertainty. We found that whereas humans and monkeys adopted a multiplicative strategy under risk when probabilities are known, both species spontaneously adopted an additive strategy under uncertainty when probabilities must be learned. Additionally, the level of volatility influenced relative weighting of certain and uncertain reward information, and this was reflected in the encoding of reward magnitude by neurons in the dorsolateral prefrontal cortex.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 2: Different strategies for combination of reward information under risk and uncertainty.**

**Fig. 3: Additive models explain choice under uncertainty.**

**Fig. 4: Adjustment of choice behaviour to volatility of the environment.**

**Fig. 5: Behavioural adjustments in response to changes in volatility of the environment in the ML task.**

**Fig. 6: Neural signature of behavioural adjustments to volatility in the dlPFC.**

Temporally organized representations of reward and risk in the human brain

Article Open access 09 March 2024

A neuronal prospect theory model in the brain reward circuitry

Article Open access 04 October 2022

Choice-relevant information transformation along a ventrodorsal axis in the medial prefrontal cortex

Article Open access 10 August 2021

Data availability

The data that support the findings of this study are available from the corresponding author upon request.

Code availability

Custom computer codes that support the findings of this study are available from the corresponding author upon request.

References

Bernoulli, D. Expositions of a new theory of the measurement of risk. Econometrica 22, 23–36 (1954).
Article Google Scholar
Edwards, W. The theory of decision making. Psychol. Bull. 51, 380 (1954).
Article CAS Google Scholar
Kahneman, D. & Tversky, A. On the psychology of prediction. Psych. Rev. 80, 237–251 (1973).
Article Google Scholar
Stewart, N. Information integration in risky choice: identification and stability. Front. Psychol. 2, 301 (2011).
Article Google Scholar
Ernst, M. O. & Banks, M. S. Humans integrate visual and haptic information in a statistically optimal fashion. Nature 415, 429 (2002).
Article CAS Google Scholar
Hunt, L. T., Dolan, R. J. & Behrens, T. E. Hierarchical competitions subserving multi-attribute choice. Nat. Neurosci. 17, 1613–1622 (2014).
Article CAS Google Scholar
Farashahi, S., Rowe, K., Aslami, Z., Lee, D. & Soltani, A. Feature-based learning improves adaptability without compromising precision. Nat. Commun. 8, 1768 (2017).
Article Google Scholar
Farashahi, S. et al. Metaplasticity as a neural substrate for adaptive learning and choice under uncertainty. Neuron 94, 401–414 (2017).
Article CAS Google Scholar
Spitmaan, M., Chu, E. & Soltani, A. Salience-driven value construction for adaptive choice under risk. J. Neurosci. 39, 5195–5209 (2019).
Article CAS Google Scholar
Strait, C. E., Blanchard, T. C. & Hayden, B. Y. Reward value comparison via mutual inhibition in ventromedial prefrontal cortex. Neuron 82, 1357–1366 (2014).
Article CAS Google Scholar
Farashahi, S., Azab, H., Hayden, B. & Soltani, A. On the flexibility of basic risk attitudes in monkeys. J. Neurosci. 38, 4383–4398 (2018).
Article CAS Google Scholar
Hayden, B., Heilbronner, S. & Platt, M. Ambiguity aversion in rhesus macaques. Front. Neurosci. 4, 166 (2010).
Article Google Scholar
Donahue, C. H. & Lee, D. Dynamic routing of task-relevant signals for decision making in dorsolateral prefrontal cortex. Nat. Neurosci. 18, 295–301 (2015).
Article CAS Google Scholar
Massi, B., Donahue, C. H. & Lee, D. Volatility facilitates value updating in the prefrontal cortex. Neuron 99, 598–608 (2018).
Article CAS Google Scholar
Stephan, K. E., Penny, W. D., Daunizeau, J., Moran, R. J. & Friston, K. J. Bayesian model selection for group studies. NeuroImage 46, 1005–1017 (2009); erratum 48, 311–311 (2009).
Behrens, T. E. J., Woolrich, M. W., Walton, M. E. & Rushworth, M. F. S. Learning the value of information in an uncertain world. Nat. Neurosci. 10, 1214–1221 (2007).
Article CAS Google Scholar
Tversky, A. Intransitivity of preferences. Psychol. Rev. 76, 31 (1969).
Article Google Scholar
Lichtenstein, S. & Slovic, P. The Construction of Preference (Cambridge Univ. Press, 2006).
Ariely, D., Loewenstein, G. & Prelec, D. “Coherent arbitrariness”: stable demand curves without stable preferences. Q. J. Econ. 118, 73–106 (2003).
Article Google Scholar
Frederick, S., Loewenstein, G. & O’Donoghue, T. Time discounting and time preference: a critical review. J. Econ. Lit. 40, 351–401 (2002).
Article Google Scholar
Kolling, N., Wittmann, M. & Rushworth, M. F. Multiple neural mechanisms of decision making and their competition under changing risk pressure. Neuron 81, 1190–1202 (2014).
Article CAS Google Scholar
Ferrari-Toniolo, S., Bujold, P. M. & Schultz, W. Probability distortion depends on choice sequence in rhesus monkeys. J. Neurosci. 39, 2915–2929 (2019).
Article CAS Google Scholar
Hayden, B. Y. Time discounting and time preference in animals: a critical review. Psychon. Bull. Rev. 23, 39–53 (2016).
Article Google Scholar
Kennerley, S. W., Walton, M. E., Behrens, T. E. J., Buckley, M. J. & Rushworth, M. F. S. Optimal decision making and the anterior cingulate cortex. Nat. Neurosci. 9, 940–947 (2006).
Article CAS Google Scholar
Soltani, A. & Izquierdo, A. Adaptive learning under expected and unexpected uncertainty. Nat. Rev. Neurosci. https://doi.org/10.1038/341583-019-0180-y (2019).
Brainard, D. H. The psychophysics toolbox. Spat. Vis. 10, 433–436 (1997).
Article CAS Google Scholar
Cornelissen, F. W., Peters, E. M. & Palmer, J. The Eyelink Toolbox: eye tracking with MATLAB and the Psychophysics Toolbox. Behav. Res. Methods Instrum. Comput. 34, 613–617 (2002).
Article Google Scholar

Download references

Acknowledgements

We thank E. Chu, S. Nichols-Worley and L. Tran for collecting human data, and C. Strait and M. Mancarella for collecting monkey data in the gambling task. This work is supported by the National Science Foundation (CAREER Award no. BCS1253576 to B.Y.H. and EPSCoR Award no. 1632738 to A.S.), and the National Institutes of Health (grant no. R01 DA038615 to B.Y.H., grant nos. R01 DA029330 and R01 MH108629 to D.L., and grant no. R01 DA047870 to A.S.). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.

Author information

Authors and Affiliations

Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
Shiva Farashahi & Alireza Soltani
The Gladstone Institutes, San Francisco, CA, USA
Christopher H. Donahue
Department of Neuroscience, Yale School of Medicine, New Haven, CT, USA
Christopher H. Donahue & Daeyeol Lee
Department of Neuroscience and Center for Magnetic Resonance Imaging, University of Minnesota, Minneapolis, MN, USA
Benjamin Y. Hayden
The Zanvyl Krieger Mind/Brain Institute, Department of Neuroscience, Department of Psychological and Brain Sciences, Johns Hopkins University, Baltimore, MD, USA
Daeyeol Lee

Authors

Shiva Farashahi
View author publications
You can also search for this author in PubMed Google Scholar
Christopher H. Donahue
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Y. Hayden
View author publications
You can also search for this author in PubMed Google Scholar
Daeyeol Lee
View author publications
You can also search for this author in PubMed Google Scholar
Alireza Soltani
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.S. conceived the project. C.H.D., B.Y.H. and D.L. designed the experiments in monkeys. S.F. and A.S. designed the human experiments. S.F. and A.S. performed model simulations and analysed the data. C.H.D. and S.F. conducted the experiments. C.H.D., S.F., D.L., B.Y.H. and A.S. analysed and interpreted the experimental data. D.L., B.Y.H. and A.S wrote the manuscript and all authors contributed to revising the manuscript.

Corresponding author

Correspondence to Alireza Soltani.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information: Primary Handling Editor: Marike Schiffer.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Notes 1 and 2 and Figs. 1–7.

Reporting Summary

Rights and permissions

Reprints and permissions

About this article

Cite this article

Farashahi, S., Donahue, C.H., Hayden, B.Y. et al. Flexible combination of reward information across primates. Nat Hum Behav 3, 1215–1224 (2019). https://doi.org/10.1038/s41562-019-0714-3

Download citation

Received: 13 October 2018
Accepted: 29 July 2019
Published: 09 September 2019
Issue Date: November 2019
DOI: https://doi.org/10.1038/s41562-019-0714-3

This article is cited by

Neural mechanisms underlying the hierarchical construction of perceived aesthetic value
- Kiyohito Iigaya
- Sanghyun Yi
- John P. O’Doherty
Nature Communications (2023)
The rat frontal orienting field dynamically encodes value for economic decisions under risk
- Chaofei Bao
- Xiaoyue Zhu
- Jeffrey C. Erlich
Nature Neuroscience (2023)
Computational models of adaptive behavior and prefrontal cortex
- Alireza Soltani
- Etienne Koechlin
Neuropsychopharmacology (2022)
A structural and functional subdivision in central orbitofrontal cortex
- Maya Zhe Wang
- Benjamin Y. Hayden
- Sarah R. Heilbronner
Nature Communications (2022)
Dissociable roles of cortical excitation-inhibition balance during patch-leaving versus value-guided decisions
- Luca F. Kaiser
- Theo O. J. Gruendler
- Gerhard Jocham
Nature Communications (2021)