Behavioral results suggest that learning by trial-and-error (i.e., reinforcement learning) relies on a teaching signal, the prediction error, which quantifies the difference between the obtained and the expected reward. Evidence suggests that distinct cortico-striatal circuits are recruited to encode better-than-expected (positive prediction error) and worst-than-expected (negative prediction error) outcomes. A recent study by Villano et al.1 provides evidence for differential networks that underlie learning from positive and negative prediction errors in humans using real-life behavioral data. More specifically, they found that university students are more likely to update beliefs concerning grade expectations following positive rather than negative prediction errors.
Virtually all animals use past rewards and punishments to modify their future course of actions (a process referred to as reinforcement learning). Behavioral and computational theories suggest that reinforcement learning relies on a teaching signal, the prediction error (PE), which quantifies the difference between the obtained and the expected reward. PE minimization across trials and repetitions enables (animals and humans alike) to efficiently learn and adapt their behavior accordingly.
Neurophysiological studies across rodents, monkeys and humans collectively suggest that reinforcement learning processes are largely underpinned by dopamine-mediated neural plasticity of the cortical-subcortical connections and that positive and negative PEs are encoded across dissociable circuits that recruit different cortical and subcortical structures of the brain2. A direct implication of these neurobiological observations is that learning from positive and negative PEs could be independently modulated at the behavioral level3.
Until recently, however, prior studies were carried out in relatively constrained and “aseptic” experimental settings. Most of the tasks employed implemented quite artificial learning scenarios, which potentially casts doubt on the ecological and real life validity of these findings.
A recent study from Villano et al.1 has overcome this limitation by investigating how humans integrate positive and negative PEs when updating their belief concerning consequential real life outcomes. More specifically, in a large scale study, they asked 625 university students to predict their grades during four consecutive semesters.
At the macroscopic level, participants learned to modify their expectations concerning future grades in a manner compatible with prediction error minimization. However, a more fine-grained analysis showed signs that could be considered as an “optimistic” bias: on average participants learned more following positive as opposed to negative PEs. The results support the idea of distinguishable neural networks for positive and negative prediction errors by showing an optimistic bias manifesting at the behavioral level.
The authors also report that participants who presented symptoms consistent with anxiety and depression, also showed a reduction in optimistic learning bias. This finding is consistent with prior evidence showing similar learning rate modifications in depression and anxiety4.
Altogether, this study provides evidence for prediction error-based learning outside of the laboratory, in real life scenarios. It also provides evidence for the fact that learning from positive and negative prediction errors are functionally dissociable. Both findings are consistent with decades-long neurobiological investigations of the computational mechanisms underlying reinforcement learning.
References
Villano, W. J. et al. Individual differences in naturalistic learning link negative emotionality to the development of anxiety. Sci. Adv. 9, eadd2976 (2023).
Niv, Y. Reinforcement learning in the brain. J. Math. Psychol. 53, 139–154 (2009).
Frank, M. J. Hold your horses: a dynamic computational role for the subthalamic nucleus in decision making. Neural Netw. 19, 1120–36 (2006).
Pike, A. C. & Robinson, O. J. Reinforcement learning in patients with mood and anxiety disorders vs control individuals: a systematic review and meta-analysis. JAMA Psychiatry. 79, 313–322 (2022).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Competing interests
SP is a member of the Editorial Board of Communications Biology.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Vandendriessche, H., Palminteri, S. Neurocognitive biases from the lab to real life. Commun Biol 6, 158 (2023). https://doi.org/10.1038/s42003-023-04544-4
Published:
DOI: https://doi.org/10.1038/s42003-023-04544-4