Reward conditioning may not have an effect on category-specific memory

Sukumaran, Priyanka; Kazanina, Nina; Houghton, Conor

doi:10.1038/s41598-023-48874-z

Download PDF

Article
Open access
Published: 15 December 2023

Reward conditioning may not have an effect on category-specific memory

Priyanka Sukumaran^1,2,
Nina Kazanina^2,3^na1 &
Conor Houghton¹^na1

Scientific Reports volume 13, Article number: 22297 (2023) Cite this article

505 Accesses
Metrics details

Subjects

Abstract

Behavioural tagging facilitates the temporary storage of seemingly insignificant episodic events, which may later become salient and enhanced in memory. Human behavioural studies have demonstrated selective memory enhancement for neutral stimuli from one category when this category is subsequently paired with reward. Although this phenomenon has implications for the role of reward conditioning on emotional and adaptive memory, its generalisability is underexplored. We conducted four experiments to investigate whether pairing items from a semantic category, animals or objects, with high or low rewards resulted in preferential memory for the high-reward category. Three of these experiments also aimed to replicate the category-specific retrospective enhancement effect reported by Patil et al. and two explored the corresponding prospective memory effect. None of our experiments showed consistent evidence for an effect of reward on category-specific memory enhancement, despite employing the same reward paradigm and incidental encoding protocol as in the original study. Consequently, we found no evidence for category-specific retrospective or prospective enhancement effects. Our experiments were conducted online which is an equally relevant method for assessing behavioural phenomenon as the in-person studies conducted by Patil et al. Overall, our results question the generalisability of previously reported category-specific memory enhancement effects due to reward.

Memory for rewards guides retrieval

Article Open access 16 April 2024

The effect of prediction error on episodic memory encoding is modulated by the outcome of the predictions

Article Open access 29 May 2023

Individual differences in experienced and observational decision-making illuminate interactions between reinforcement learning and declarative memory

Article Open access 15 March 2021

Introduction

It is important to remember particularly emotional, rewarding or punishing events, as this information could be useful for predicting future decisions¹. This function is facilitated by an adaptive memory system which temporarily stores memories that initially seem insignificant but later acquire salience through emotional experiences. Additionally, it might be advantageous that emotional experiences not only enhance the memory of a particularly salient event, but also other seemingly unimportant events that are conceptually, temporally or spatially related to that event. The synaptic tag-and-capture hypothesis² provides an underlying neural mechanism for such adaptive memory effects, which has also gained human behavioural evidence with studies reporting ‘behavioural tagging’ effects³. For example, studies have shown that extrinsic reward (monetary incentives) retrospectively enhances memory for events with greater temporal⁴ and spatial⁵ closeness to rewarding events.

Interestingly, it has also been shown that conceptually related stimuli can be retrospectively enhanced in memory through reward⁶ and fear conditioning^7,8. In the study by Patil et al.⁶ which we will refer to as the RREM (Reward Retroactively Enhances Memory) study, participants incidentally encoded neutral images from two categories, animals and tools, in a pre-conditioning phase. In a following conditioning phase, images from one category were associated with high reward and the other with low reward. A surprise 24-hour delayed recognition memory test revealed preferential memory for items from the high-reward category encoded in the conditioning as well as the pre-conditioning phase. Importantly, items from the pre-conditioning phase were never directly paired with reward. In other words, items from pre-conditioning phase were retrospectively enhanced in memory when items from the same semantic category were conditioned with high reward during the conditioning phase. While prior studies have shown general memory enhancement for neutral stimuli paired with salient events, regardless of their semantic category^9,10,11, Patil et al.⁶ demonstrated that such memory effects can be highly specific and applied to conceptually related categories. However, the reliability of this effect is contested. A recent meta-analysis¹² of 14 studies on selective retrospective memory enhancement induced by reward, fear and other salient conditioning, suggests these effects were inflated by small-study biases, resulting in Bayesian meta-analyses supporting the null hypothesis. Among the 14 studies, only the RREM study⁶ and Oyarzún et al.¹³ employed reward-conditioning. Despite using a similar monetary reward paradigm as the RREM study, with the only difference being indication of reward expectation on each trial, Oyarzún et al.¹³ found no evidence for a retrospective memory effect due to reward. They do report a strong effect of reward on category-specific memory for items directly paired with reward in the conditioning phase, and show evidence for prospective memory enhancement. The generalisability of reward-induced selective memory effects remains an open question.

We conducted four online experiments to investigate category-specific memory effects due to reward-conditioning. Experiment 1 aimed to investigate if the phenomenon found in the RREM study⁶ generalised to word and image stimuli in a more complex word-image associative learning paradigm. Experiment 1 did not show evidence for retrospective or prospective enhancement of category-specific memory due to reward. Moreover, the effects of conditioning on items that were explicitly rewarded during the conditioning phase were inconsistent across memory measures. This questions the generalisability of effects reported in the RREM study⁶. Next, Experiment 2a aimed to closely replicate the RREM study to investigate category-specific retrospective memory enhancements of images, while Experiment 2b tested the previously unexplored possibility of prospective memory enhancement using the same reward-conditioning protocol. Finally, Experiment 3 was a high-powered replication of Experiment 2a. Our experiments were conducted online, in contrast to the in-person RREM study⁶. We recognize the differences and trade-offs between online and in-person experiments: online testing allows for larger samples but may have reduced data quality, while in-person data collection is influenced by experimenter interaction, and limits the diversity and size of participant groups. However, numerous well-established psychological phenomena^14,15,16,17, including those related to memory^18,19,20 and reward processing^21,22,23, yield consistent results across both settings; affirming the validity of online testing for behavioural research. It is thus informative that the two experimental conditions lead to different results, suggesting that category-specific memory effects due to reward⁶ do not reliably generalise with experimental variations.

Experiment 1

Experiment 1 was designed to test the generalisation of category-specific memory enhancement effects due to reward, as reported in the RREM study⁶, by employing an associative word-image encoding protocol. The word-learning protocol was adopted from a study by DeLoof et al.²⁴ where participants successfully learn foreign word-to-image pairs through reward-prediction error conditioning. We use this learning paradigm along with the reward paradigm and incidental encoding design from the RREM study⁶ to investigate reward conditioning effects on category-specific memory of image and word stimuli. We test 24-hour delayed memory retrieval as this was the condition where the critical reward-conditioning effects emerged in the RREM study. In addition, we also carried out a version of Experiment 1 with immediate memory retrieval which did not show any evidence for category-specific memory effects due to reward conditioning, which is line with the RREM study, see Supplementary M1, Section F.

Methods

Participants

A sample size of 120 participants, between the ages of 18–35, was targeted. As there were no previous studies with exact protocols testing selective enhancement of associative and item memory, the sample sizes were based on an a priori simulation of logistic regression models. We used pilot data from eight participants to estimate the necessary sample size for detecting a main effect of reward category on recognition memory. See pre-registration protocol for more details: https://osf.io/vghn4. Participants were required to have English as their first language and no literacy difficulty. 127 participants completed Experiment 1 online on prolific.co. The following were excluded: (1) participants with below-chance performance on the memory test; this was evaluated using d-prime scores, as described below in the “Data analysis” section below, and participants were excluded if their d-prime score was less than or equal to zero, (2) outliers in response times in the memory test (outside 1.5 times the interquartile range calculated across participants). The final group of 120 participants consisted of 86 females and 34 males, aged $M=26.17$, $SEM=0.46$. All participants in Experiments 1, 2 and 3 provided informed consent prior to the experiment. All experiments were approved by the School of Psychological Sciences Research Ethics Committee of University of Bristol and were performed in accordance with relevant guidelines and regulations.

Materials

The stimulus consisted of 192 images, 96 animals and 96 objects on white backgrounds, and 384 words which were Japanese nouns of two or three syllables. For each participant, image-word pairs from one semantic category, either animal or object, were associated with high reward and the other category with low reward. This association was learnt in the conditioning phase during encoding (see “Procedure” section below). Half the images and words were used during encoding and the other half as foils for a recognition memory test after encoding phases.

Procedure

The experiment consisted of three encoding phases: pre-conditioning, conditioning, and post-conditioning. Each phase had 32 trials, including 16 animal trials and 16 object trials. Allocation of stimuli to the three phases were randomised and additionally, stimulus order was pseudo-randomised such that no more than three trials from the same semantic category occurred consequently. On each trial of the pre-conditioning phase, participants were presented with an image of either an animal or an object and two Japanese words, one of which was the correct Japanese word for the image. The set of two words presented with a particular image remained the same for all participants, but were randomly positioned on the left or right of the image. Participants were given two seconds to guess the correct word, and feedback was provided to ensure they learned the correct word-image pairing: the correct word turned green, while the wrong word turned red.

During the conditioning phase, participants were told that guessing the correct Japanese word would result in earning £0.01 on grey star trials and $\pounds$0.15 on green star trials, and trial type would be indicated along with the image cue, see Fig. 1. Grey star, low-reward and green star, high-reward trials were each associated with one of the two image categories: animals or objects. However, participants were not informed of this association and had to learn it through the conditioning trials. Allocation of animal or object image to high- or low-reward category was randomised and counterbalanced across participants. After any exclusions, further participants were allocated categories while maintaining perfect counterbalancing. During each trial, as in the pre-conditioning phase, participants were presented with an image and two Japanese words, accompanied by a grey or green star, as well as the potential reward ($\pounds$0.01 or $\pounds$0.15). After selecting a word, any earned reward was displayed, together with feedback on whether the choice was correct or not. The post-conditioning phase was identical to the pre-conditioning phase, with no rewards or information about trial type provided.

Participants then completed a surprise recognition memory test 24 hours after the post-conditioning phase. To avoid any biases due to test expectancy, no prior indication had been given to participants that they would be asked to do a memory test beyond the learning task itself²⁵. In the memory test, participants had to decide whether they had previously seen a given item (word or image) in the encoding phases. 192 images and 384 words, half of which were foils, were intermixed and randomly presented one after the other. For each trial, participants chose the most applicable response from: ‘definitely old’, ‘likely old’, ‘maybe old’, ‘maybe new’, ‘likely new’, ‘definitely new’. Participants also completed a mental health questionnaire and another association memory task, the results of which are not explored in this paper.

Data analysis

We follow a similar analysis approach as in the RREM study⁶, which is also detailed in our pre-registration protocol (https://osf.io/vghn4/). Recognition memory was quantified using corrected recognition scores: $R = H - F$, where H is the hit rate and F is the false alarm rate. In addition, we report a version of the analysis using the signal-detection theoretic measure of sensitivity d-prime ($d'$) as a measure of memory^26,27: $d' = z(H) - z(F)$, where z(H) is z-scored hit rate and z(F) is z-scored false alarm rate. In order to compute z-scores, H and F were corrected to the range of 1% : 99%, as per standard practice followed in previous studies using d-primes²⁸. The use of d-primes has been suggested to have advantages over other measures including corrected recognition, which are based on threshold models of recognition memory^28,29,30,31.

For the main analysis, similar to RREM⁶ and Oyarzún et al.¹³, we conducted a 3 × 2 repeated measures analysis of variance (ANOVA) on memory measures (d-prime and corrected recognition) with encoding phase (pre-conditioning, conditioning, post-conditioning) and reward category (high, low) as within-subject factors. The ANOVA will indicate memory differences across phases and reward categories. To dissect specific effects within each phase, the effect of reward category on memory of items was further quantified using two-tailed paired t-tests with alpha=0.05. Cohen’s d(average) was used to estimate effect sizes for paired t-tests, referred to as $d_{av}$ in the text and tables³². We additionally used Bayesian counterparts for t-tests to quantify evidence in support of the null hypothesis. A Bayes factor:

$$\begin{aligned} BF_{10}=\frac{P(D|H_1)}{P(D|H_0)} \end{aligned}$$

(1)

was calculated, where D represents the data, $P(D|H_a)$ the probability of the data conditional on a hypothesis $H_a$. $H_0$ and $H_1$ are the two competing hypotheses, in this case $H_0$ is the null hypothesis that there is no effect of reward category on memory and $H_1$ is the alternative hypothesis that there is an effect of reward category on memory (i.e. items specifically drawn from high-reward category are enhanced in memory). We implemented this using the ttestBF function in R, with a Cauchy prior distribution and a default scale parameter of $r=0.707$. Bayes factors less than 0.33 signifies substantial evidence for $H_0$, whereas Bayes factors greater than three signifies substantial evidence for $H_1$. Anecdotal evidence for $H_0$ and $H_1$ corresponds to $0.33< BF_{10} < 1$ and $1< BF_{10} < 3$ respectively^33,34.

In addition to the ANOVA and t-tests on memory measures, we also estimated generalised linear mixed-effects models (GLMM) on categorical responses from the memory test as described in the pre-registration protocol. We ran GLMM models with a logit-link function using the lme4 package in R³⁵. The dependent variable was the binarised response in the memory test collapsed across certainty levels: responding ‘old’ or responding ‘new’. The GLMM included main effects of reward category and encoding phase, and the interaction between them. Random intercepts were included for each participant and stimuli items. All analysis was conducted in R 4.2.3.

Furthermore, as in the RREM study, we repeated all the analysis steps using a subset of trials with only higher certainty responses from the memory test. This was done by including trials with responses ‘definitely old’, ‘likely old’, ‘likely new’, ‘definitely new’ and excluding trials in which participants chose ‘maybe old’ or ‘maybe new’. Experiment code, analysis scripts and raw data are available at: https://github.com/prisukumaran23/adaptiveMemoryReplication.

Table 1 Summary of t-tests in Experiment 1.

Full size table

Results

Overall performance

Guessing accuracy did not significantly differ between image category (animal vs. object) in any of the three phases, $p >.15$, or between high- and low-reward trials in the conditioning phase, $t_{(119)}=-1.53$, $p=0.13$, $d_{av}=-.19$. The average hit rate was $M=0.51$, $SEM=0.15$, and the average false alarm rate was $M=0.19$, $SEM=0.12$. See Supplementary M1, Table S2 for breakdown of memory test responses by certainty.

Recognition memory by phase and reward category

A repeated measures ANOVA revealed an effect of phase, $F(1, 119)=7.29$, $p=0.001$, $\eta ^{2}=0.005$, on corrected recognition. There was a significant interaction effect between encoding phase and reward category, $F(1, 119)=3.46$, $p=0.03$, $\eta ^{2}=0.004$, suggesting that the effect of reward category varied with phase. The d-prime analysis revealed an effect of phase, $F(1, 119)=4.48$, $p=0.02$, $\eta ^{2}=0.008$, but no interaction effect between encoding phase and reward category, $F(1, 119)=2.45$, $p=0.09$, $\eta ^{2}=0.003$, unlike the corrected recognition analysis.

For items encoded in the conditioning phase, t-tests revealed significant evidence for an effect of reward category on corrected recognition, $t(119)=2.33$, $p=0.02$, $d_{av}=0.25$, but this was diminished with d-primes, $t(119)=1.97$, $p=0.05$, $d_{av} =0.22$, see Fig. 2. This suggests that reward conditioning was successful, albeit weak, and emerged after a 24-hour post-consolidation period as found in the RREM study⁶. However, there was no significant evidence for an effect of reward-category on corrected recognition nor on d-primes for items encoded in the pre- and post-conditioning phases, see Table 1 for t-tests.

Bayesian analysis

The Bayesian hypothesis test on corrected recognition of items from the conditioning phase revealed anecdotal evidence for the one-sided alternative hypothesis $H_1$ that the reward category effect is greater than zero, $BF_{10}$ $=2.70$. The equivalent analysis with d-primes revealed weaker evidence in favor of $H_1$, $BF_{10}$ $=1.28$, which is consistent with the t-test analysis above. For items in the pre- and post-conditioning phases, there was substantial evidence for the null hypothesis, with all Bayes factors less than 0.33. Although there was some anecdotal evidence for $H_1$ when analysing all memory trials, analysis focusing on higher certainty responses did not support this and showed evidence in favor of the null hypothesis, see Supplementary M1, Table S6. Additionally, linear mixed-effects modelling on categorical response data, presented in Supplementary M1, Table S11–S12, also showed that there were no significant interaction effects between reward category and the three phases.

Discussion

In Experiment 1, a significant effect of reward category on items in the conditioning phase was observed when measured by corrected recognition, but not by d-primes or when analyzing only higher certainty memory responses. This questions the generalisability of memory enhancement effects found using this reward conditioning paradigm, and suggests that the distinction between high- and low-reward category was not salient enough in our experiment. Given that the overall design was close to the original RREM study⁶, it is unclear why we did not find consistent evidence for category-specific memory enhancement effects and why any effects found were not significant when evaluated using d-prime measures and higher certainty responses.

However, our experimental design deviates from the RREM study in a few ways. Firstly, our study only had 32 images per phase whereas the RREM study had 60. Secondly, the RREM study used only image cues and a delayed match-to-sample task, whereas our study had a more complicated learning paradigm with word-image associations and a guessing task. In our study, recognition memory could have been affected by salient effects related to the guessing task, for example wrong guesses could lead to diminished memory. In order to characterise this effect, we estimated GLMMs on categorical responses from the memory test with phase and guess outcome as a predictor, see Supplementary M2, section 1.B.2.1, pg. 97–98, which revealed that guess-outcome (correct/wrong feedback) did not significantly affect memory for items in the pre-conditioning or conditioning phases, however, there was a significant but weak effect for items in the post-conditioning phase. Since guess-outcome did not differ significantly across reward category (high/low) and semantic category (animal/object), any effects would have been equal across categories, leading to no differences in memory.

Thirdly, in the RREM study, the rewards were motivated by an overall bonus of $20 or $1 for high matching performance for items in each category. In our study, the potential monetary reward was shown on each trial as $\pounds$0.15 or $\pounds$0.01, which could have induced an interfering reward-anticipatory response. Interestingly, in the study by Oyarzún et al.¹³ which also failed to find evidence for reward-induced category-specific retrospective memory effects, participants answered a (yes/no) reward anticipation question on each trial. This indicates that reward conditioning may be influenced by minor changes to factors such as anticipation, overall motivation and reward outcome in the conditioning paradigms. Relatedly, reward prediction error (RPE) conditioning, as opposed to explicit reward conditioning as used in our experiments, was successful in increasing recognition memory of images in the same foreign-word and image learning protocol²⁴ that we adopted for our experiment. Since RPE accounts for the effects of reward anticipation and outcome, it may be a better measure of reward salience than explicit reward²³. Further work is needed to explore the role of explicit reward versus RPE conditioning in inducing adaptive memory effects.

The neurobiological literature on memory suggests that associative memory retrieval is more reliant on the hippocampal activity than item recognition memory^36,37. Consequently, associative memory tasks may, in theory, offer an alternative memory assay to probe the dopaminergic modulation of the hippocampus by rewards, which is the proposed neural mechanism underlying behavioural tagging with rewards^1,3,3. Our finding that selective memory effects due to reward, reported by Patil et al.⁶, do not generalise to a previously unexplored word-image associative memory task, calls for further research to characterise associative memory in behavioural tagging experiments, as well as in the broader context of reward-memory studies.

Experiment 2

In Experiment 2, we follow the same protocols for incidental learning and reward conditioning as in the RREM study⁶, and reduce differences that could have resulted in weaker reward-conditioning and category-specific memory enhancement. Experiment 2a replicates the design of Experiment 1 from the RREM study⁶ to test retrospective memory enhancement 24-hours after encoding. In Experiment 2b, we incorporated a post-conditioning phase to examine prospective memory enhancement effects, building on prior reports of this effect induced by reward-conditioning¹³ and fear-conditioning⁷.