The possibility to make choices modulates feature-based effects of reward

Heuer, Anna; Wolf, Christian; Schütz, Alexander C.; Schubö, Anna

doi:10.1038/s41598-019-42255-1

Download PDF

Article
Open access
Published: 08 April 2019

The possibility to make choices modulates feature-based effects of reward

Scientific Reports volume 9, Article number: 5749 (2019) Cite this article

1617 Accesses
2 Citations
2 Altmetric
Metrics details

Subjects

Abstract

When making decisions, humans can maximize the positive outcome of their actions by choosing the option associated with the highest reward. We have recently shown that choices modulate effects of reward via a bias in spatial attention: Locations associated with a lower reward are anticipatorily suppressed, as indicated by delayed responses to low-reward targets and increased parieto-occipital alpha power. Here, we investigated whether this inhibition also occurs when reward is not coupled to location but to a nonspatial feature (color). We analyzed reaction times to single targets associated with a low or high reward as a function of whether a second trial type, choice-trials, were interleaved. In choice-trials, participants could choose either one of two targets to obtain the associated reward. Indeed, responses to low-reward targets were slower when choice-trials were present, magnifying the influence of reward, and this delay was more pronounced in trials immediately following a choice. No corresponding changes in parieto-occipital alpha power were observed, but the behavioral findings suggest that choices modulate a reward-related bias in feature-based attention in a similar manner as for spatial attention, and support the idea that reward primarily affects behaviour when it is of immediate relevance.

Neurophysiological mechanisms underlying the differential effect of reward prospect on response selection and inhibition

Article Open access 05 July 2023

Unraveling the influence of trial-based motivational changes on performance monitoring stages in a flanker task

Article Open access 06 November 2023

Outcome saliency modulates behavioral decision switching

Article Open access 31 August 2020

Introduction

Humans value specific events or objects differently, and strive to optimize their behavior by choosing actions that are associated with a rewarding outcome. Reward is an important incentive that shapes not only big decisions in life, but also our experience of the world around us at a much more fundamental level by modulating the processing of visual information. Reward influences which parts of our visual surroundings we attend to and thus select for further processing^1,2, where and when we look^3,4,5, what we remember^6,7,8 and how we perform even simple, visually-guided movements such as reaching^9,10.

Biasing processing in a reward-related manner is only truly advantageous when the outcome of a given situation or event depends on a person’s behavior. This is for instance the case when choices can be made between multiple options of varying associated value. Indeed, we have recently shown that the necessity or possibility to make choices modulates effects of reward on response preparation of saccadic eye movements¹¹ and manual responses¹². Both studies employed a task that consisted of two trial types, which were presented in an interleaved fashion. In single-trials, participants had to respond to a single target presented to the left or right from fixation. Each target was associated with either a low or a high reward and participants received these rewards for correct responses. In choice-trials, both targets were presented and participants were free to choose to obtain the associated reward. Reward was coupled to the spatial location(s) of the target(s): One visual hemifield was assigned a low reward, and the other one a high reward. We manipulated the proportion of choice-trials within a block of trials, and analyzed single-trial performance as a function of choice-trial proportion in the same block. Across several experiments, response latencies to low-reward single-targets were delayed when choice-trials were present as compared to blocks with only single-trials. The magnitude of this delay increased with choice-trial proportion and was particularly pronounced in single-trials directly following a choice-trial. Cueing single-trials on a trial-to-trial basis reduced but not eliminated latency differences between low- and high-reward targets, suggesting that the expectation of an upcoming choice-trial could not fully account for the delayed responses to low-reward targets. Importantly, this slowing of responses to low-reward targets was not affected by the frequency of left and right responses or by a change in response from one trial to the next¹¹. Taken together, these findings indicate that a stronger reward-related bias was implemented when choice-trials were present, preparing for optimal, reward-maximizing decisions.

Analysis of oscillatory brain activity during a preparatory period preceding target presentation revealed that the underlying reward-related bias was not in motor preparation, but in visuospatial attention. In blocks with a high choice-trial proportion, an increase in parieto-occipital alpha power contralateral to the visual hemifield associated with a low reward was observed prior to target presentation¹². Preparatory modulations of posterior oscillatory power in the alpha band (8–14 Hz) have been identified as a reliable index of the voluntary deployment of covert visuospatial attention, reflecting the facilitation of relevant and the suppression of irrelevant information via a modulation of cortical excitability in a retinotopic fashion^{13,14,15,16,17,18,19}. Specifically, increases in posterior alpha power have been suggested to reflect the functional inhibition of the sensory brain areas processing information at the to-be-ignored regions of space²⁰. Thus, our previous findings¹² indicate that when the likelihood was high that a choice could be made to maximize reward, the spatial region associated with a low reward was actively suppressed in preparation for optimal target selection, presumably by reducing baseline excitability²¹. Modelling saccade latency distributions provided further support for this conclusion, likewise indicating that the delayed responses to low-reward single targets in the presence of choice-trials were due to a reduced baseline level¹¹.

In our previous studies^11,12, items were associated with different magnitudes of reward based on their spatial location. However, the value of visual objects cannot only be determined by their location, but also by non-spatial characteristics, which can be simple visual features such as color or size, or more complex stimulus attributes such as object category. This might, in fact, be the more common case in everyday life. In the current study, we examined whether the possibility to make choices in order to maximize positive outcome modulates the effects of reward when reward magnitude is coupled to a non-spatial feature in a similar manner as it does when reward is determined by spatial location^11,12.

In analogy to our findings with respect to effects of spatially defined reward¹², we assumed that choices would modulate the effects of featurally defined reward via a reward-related bias in feature-based attention. Feature-based attention selectively increases sensitivity to specific features across the visual field and thereby prioritizes the visual processing of behaviorally relevant stimuli^22,23. Feature-based and spatial attention thus serve the same function (i.e., to tune visual perception to what is important) and they have similar effects on neuronal responses in visual cortex^24,25, activating a largely overlapping frontoparietal network^26,27,28. In spite of these commonalities, however, feature-based and spatial attention appear to be distinct attentional mechanisms that independently enhance relevant visual signals with dissociable behavioral signatures^29,30,31,32, and that are supported by specialized regions within the common network^{26,27,28,33,34}. Thus, what has been established for spatial attention does not necessarily apply to feature-based attention.

To determine whether the possibility to make choices between options of different value modulates feature-based effects of reward as it does for spatially indicated reward, we modified the task used in our previous studies^11,12 and coupled reward to color. Delayed responses to low-reward single targets in blocks with choice-trials would indicate that choices do not only induce a reward-related bias in visuospatial attention^11,12 but also in feature-based attention.

Our prior work identified a spatially specific increase in alpha power, anticipatorily inhibiting regions associated with a low reward, as the neural mechanism that mediated the choice-induced modulation of reward effects¹². Therefore, we additionally analyzed parieto-occipital alpha power during a preparatory period preceding target presentation to examine whether a reward-related bias in feature-based attention is supported by the same oscillatory mechanism. However, whereas the involvement of alpha oscillations in spatial attention has been well established^14,20, only few studies have investigated whether preparatory feature-based attention is similarly reflected in alpha power changes, and these studies produced somewhat mixed evidence^35,36,37. Inducing the expectation of an upcoming target feature (left- or rightward motion) has been found to increase overall alpha power over occipital cortex as compared to when participants had no expectation³⁵. A more specific pattern of alpha-band increases has been observed for different feature dimensions, consistent with the hypothesized role of alpha as a suppression mechanism in spatial attention: When participants were cued to attend to either the color or the motion of an upcoming dot array, alpha power in the cortical areas processing the irrelevant feature dimension increased, indicating their functional inhibition³⁶. A recent study directly comparing spatial, feature-based and combined spatial and feature-based cues, by contrast, failed to observe any alpha power modulations specifically related to feature-based attention, even though the feature-based cues were behaviorally effective³⁷. Thus, we did not expect that a choice-induced modulation of feature-based reward effects, analogous to our previous findings for spatially indicated reward, would necessarily be reflected in a corresponding modulation of parieto-occipital alpha power. But we tentatively hypothesized that an anticipatory suppression of low-value target features might result in an alpha power increase in neural subpopulations coding for the low-value target feature, reflected in overall higher parieto-occipital alpha power in blocks with choice-trials compared to block without choice trials (see also de Lange et al.³⁵).

Results

In this experiment, participants had to indicate the location of a target presented along with a non-rewarding distractor item (single-trials). The task is illustrated in Fig. 1. Targets were defined by their color: Each participant was assigned two target colors, and a third color was used for distractors. One target color was assigned a low reward and the other one a high reward. Participants received these rewards for correct responses. Across blocks of trials, we manipulated the proportion of choice-trials (0 vs. 0.33) that were randomly interleaved with these single-trials. In choice-trials, two targets were presented and participants could choose between the two to obtain the reward associated with the color of the chosen target.

Behavioral measures

Our primary measure of interest were single-trial reaction times as a function of choice-trial proportion. In Fig. 2a, their means are shown as a function of the proportion of interleaved choice-trials and separately for low- and high-reward single targets. Responses were slower in the 33% choice-trial condition (373 ms ± 8 ms) than in the 0% choice-trial condition (366 ms ± 7 ms; F_(1,22) = 7.40, p = 0.013, partial ƞ² = 0.25), and slower for targets associated with a low reward (384 ms ± 8 ms) than for targets associated with a high reward (356 ms ± 7 ms; F_(1,22) = 35.91, p < 0.001, partial ƞ² = 0.62). Importantly, an interaction between choice-trial proportion and reward (F_(1,22) = 66.14, p < 0.001, partial ƞ² = 0.75) revealed that the difference in reaction times to low- and high-reward targets was larger in the 33% choice-trial condition (41 ms ± 6 ms) than in the 0% choice-trial condition (15 ms ± 4 ms). Whereas responses to low-reward targets were slower with 33% choice-trials (394 ms ± 9 ms) than without choice-trials (373 ms ± 7 ms; t₍₂₂₎ = 5.43, p < 0.001), responses to high-reward targets were similar in the 0% choice-trial condition (357 ms ± 6 ms) and in the 33% choice-trial condition (353 ms ± 7 ms; t₍₂₂₎ = 1.53, p = 0.141). Thus, the presence of choice-trials slowed responses to low-reward targets, but did not affect responses to high-reward targets.

Single-trial accuracy was close to optimal (94.35% ± 0.73%) and reflected the pattern found for reaction times, ruling out the possible influence of a speed-accuracy trade-off. For accuracies, there was an effect of choice-trial proportion (F_(1,22) = 19.25, p < 0.001, partial ƞ² = 0.47) and an interaction between choice-trial proportion and reward (F_(1,22) = 16.80, p < 0.001, partial ƞ² = 0.43), but no main effect of reward (F_(1,22) = 1.93, p = 0.018, partial ƞ² = 0.08). Whereas accuracy for high-reward targets was at the same level in the 0% (96.06% ± 0.68%) and in the 33% choice-trial condition (96.31% ± 1.13%; t₍₂₂₎ = 0.20, p = 0.84), accuracy for low-reward targets decreased when choices were present (89.32% ± 1.96%) compared to when there were no choices (93.30% ± 1.15%; t₍₂₂₎ = 2.53, p = 0.019). In choice-trials, participants chose the target with high reward in 87.1% of trials, revealing that they indeed aimed to maximize their financial outcome.

We further examined intertrial effects in the 33% choice-trial condition to check whether the delay in responses to low-reward targets was particularly pronounced immediately following a choice-trial¹¹. Single-trials in the 33% choice-trial condition were split into trials following either a single-trial with a high-reward target, a single-trial with a low-reward target, or a choice-trial. Figure 2b shows reaction times in single-trials separately for these different preceding trial types and for low- and high-reward targets. Overall, responses were faster for single targets associated with a high reward than for single targets associated with a low reward (F_(1,22) = 80.57, p < 0.001, partial ƞ² = 0.79). Reaction times were also found to be influenced by the preceding trial type (F_(2,44) = 9.42, p < 0.001, partial ƞ² = 0.30): Responses were fastest after a single-trial with a low-reward target (368 ms ± 7 ms), slightly slower after a single-trial with a high-reward target (373 ms ± 7 ms), and slowest after a choice-trial (379 ms ± 7 ms). Importantly, there was also an interaction (F_(2,44) = 4.23, p = 0.021, partial ƞ² = 0.16), indicating that the difference in reaction times to low- and high-reward targets differed depending on the preceding trial. To further elucidate this interaction, we conducted separate one-way ANOVAs for low- and high reward targets. For low-reward targets, there was an effect of preceding trial type (F_(2,44) = 10.323, p < 0.001, partial ƞ² = 0.32) and subsequent pairwise comparisons revealed that responses following a choice-trial were slower than responses following a single-trial with a low-reward target (19 ms ± 5 ms, p = 0.002) and responses following a single-trial with a high-reward target (13 ms ± 3 ms; p = 0.003). Responses following the two types of single-trials did not differ significantly (7 ms ± 5 ms, p = 0.456). Reaction times to high-reward targets were not affected by the preceding trial type (F_(2,44) = 0.57, p = 0.571, partial ƞ² = 0.03). Thus, the effect of reward on single-trial performance was larger when the previous trial was a choice-trial, and this was due to a slowing of responses to low-reward targets.

Parieto-occipital alpha power

Figure 3a shows time-frequency representations of the preparatory period in the two different choice-trial conditions. Power estimates were computed relative to a 500 ms baseline period preceding the preparatory period, so that a value of 1 indicates no change, values greater than 1 indicate an increase, and values smaller than 1 indicate a decrease in alpha power. Changes in parieto-occipital alpha power in preparation for the upcoming target presentation were analyzed for an early (100–300 ms) and a late (300–500 ms) time window during the 600 ms preparatory period. Figure 3b shows the relative power change estimates across the preparatory period averaged across the alpha-band (8–14 Hz). As can be seen, alpha power decreased during the preparatory period. This was confirmed by a significant effect of time window (early time window: 0.96 ± 0.03; late time window: 0.91 ± 0.02; F_(1,22) = 12.63, p = 0.002, partial ƞ² = 0.37). However, there was neither an effect of choice-trial proportion (F_(1,22) = 2.41, p = 0.135, partial ƞ² = 0.10) nor an interaction of time window and choice-trial proportion (F_(1,22) = 0.66, p = 424, partial ƞ² = 0.03). Thus, there was an overall decrease in alpha power in preparation for target processing, but this decrease was equivalent with and without choice-trials present.

We further examined intertrial effects in the 33% choice-trial condition, so as not to miss a reward-related preparation that was only initiated directly after a choice for a high-reward target had been made. Figure 3c shows the relative alpha power change across the preparatory period separately for single-trials following either a single-trial with a high-reward target, a single-trial with a low-reward target or a choice-trial. The general decrease in alpha power during the preparatory period was again confirmed by an effect of time window (F_(1,22) = 11.05, p = 0.003, partial ƞ² = 0.33). But, as is quite apparent in Fig. 3b, the preceding trial type did not modulate overall alpha power (F_(2,44) = 0.11, p = 0.90, partial ƞ² = 0.01) or the alpha power decrease (F_(2,44) = 0.60, p = 0.556, partial ƞ² = 0.026).

Discussion

In the present study, we investigated whether responses to single targets associated with different levels of reward are modulated by interleaved choices when reward is coupled to a nonspatial feature (i.e., color) as is the case when reward is coupled to spatial location^11,12. Indeed, responses to single targets associated with a low reward were found to be delayed when choice-trials were present, magnifying the effect of reward on reaction times, while responses to high-reward single targets were not influenced by the manipulation of choice-trial proportion. The analysis of intertrial effects revealed that this selective slowing of responses to low-reward targets was particularly pronounced for single targets directly following a choice-trial, whereas responses to low-reward single-trials following a high-reward single-trial were not similarly delayed. This confirms that the increased reaction times to low-reward targets after a choice-trial were due to the previously made choice rather than due to a change in the color participants responded to from one trial to the next. Notably, the intertrial effects could not fully account for the response delay induced by the presence of choices (see also Wolf et al.¹¹): Irrespective of the preceding trial type, responses to low-reward targets were slower with 33% choice-trials in the same block than without interleaved choice-trials. Overall, these findings mirror the pattern of results we obtained in our previous studies^11,12, in which the level of reward was coupled to spatial location, and thus strongly suggest that choices modulate a reward-related bias in feature-based attention in a similar way as they do for spatial attention.

There is one aspect, in which the pattern of behavioral results differs from our previous findings. We failed to observe any effect of reward in blocks without choice-trials when reward was indicated spatially^11,12, but in the present study, there was an effect of reward on reaction times to single targets even in the 0% choice-trial condition. However, it seems likely that this difference between studies with respect to reward effects without choice-trials is a quantitative but not a qualitative difference. In our previous experiments, there was always a consistent trend for slower performance for low-reward than for high-reward targets, and the effect observed in the present study, while larger and significant, was still rather small. That the effect of reward in the absence of choices was somewhat larger in the present study might be related to task-difficulty. The possibility to make choices and thereby maximize positive outcome is only one way by which reward can be assigned particular behavioral relevance. Another way is increased task-difficulty: With increasing task difficulty, the risk of making a mistake and losing reward altogether is higher. Therefore, biasing visual processing in a reward-related manner is a sensible strategy, which ensures that a mistake infers only a small cost (i.e., the loss of a low reward). The task in the present study was presumably slightly more difficult than the task we previously used^11,12, because the target in single-trials was presented along with a distractor and had to be selected based on its color. That task difficulty was indeed higher with this paradigm is evident from overall longer reaction times and lower accuracy compared to our previous results. Correspondingly, the effect of reward on reaction times was slightly larger here than in our previous studies (16 ms as compared to 9 ms in Heuer et al.¹²). This is consistent with our finding that increased difficulty to make optimal choices, manipulated by varying the contrast of the targets, increases the delay in responses to low-reward targets¹¹. The notion that a reward-related bias is only implemented when behavior can be optimized to maximize reward also reconciles our findings with the large body of research showing that reward affects visual processing in tasks that do not provide analogous choice opportunities (e.g., typical visual search tasks). These tasks are usually more difficult, for instance due to the presence of more distractors, so a reward-related bias ensures that mistakes will not be overly detrimental for the overall outcome^4,38,39,40.

To examine whether the same oscillatory mechanism that we have previously identified for spatially indicated reward¹² also supported the reward-related bias in feature-based attention observed in the present study, we additionally analyzed alpha-oscillations over parieto-occipital cortex during the preparatory period preceding target presentation. There was an overall decrease in alpha power during this period in all conditions, indicating that neural excitability in visual cortex was increased in order to facilitate processing of the upcoming target. However, even though the behavioral results clearly indicated that the feature associated with a low reward was effectively and more strongly suppressed when choices were present and especially so immediately following a choice-trial, this pattern was not reflected in posterior alpha power.

It might be tempting to assume that this lack of any reward-related modulation of posterior alpha oscillations shows that the suppression of low-value features was supported by a different neural mechanism than the suppression of low-value regions of space. This could be regarded as in line with previous findings. Of particular interest in this context is the study by Wildegger et al.³⁷, who cued either the location, orientation, both or neither of an upcoming target stimulus and examined preparatory alpha modulations over visual cortex. While the anticipation of target location with spatial and combined spatial and feature cues was reflected in robust alpha lateralizations, preparing for a target feature (i.e., orientation) modulated neither lateralized nor global alpha power, even though the feature cues yielded clear performance benefits. The authors proposed that preparatory alpha modulations reflect a spatial gating mechanism that is involved in the gating of information processed by nonoverlapping sensory areas. This is for instance the case for different retinotopical locations^17,18 but also for different feature dimensions³⁶, which are processed in dedicated, separate areas. By contrast, alpha modulations would not operate at the level of specificity that is required when overlapping and interdigitating populations process the attended information³⁷. This account accordingly predicts that no alpha modulations would be observed when feature values within the same dimension are attended, for example different orientations as in Wildegger et al.³⁷ or different colors as in the present study. Along these lines, our results could be seen as extending empirical support for this idea put forward by Wildegger et al.³⁷ to another feature dimension (i.e., color).

It is important to note, however, that there are plausible alternative explanations that could account for this null effect. For one, our measure might not have been sufficiently sensitive. We reasoned that the suppression of the low-reward feature would be mediated by increased alpha power (i.e., reduced excitability) in neural networks coding for that feature and that this would be reflected in overall higher alpha power. Possibly, our aggregate measure of electrophysiological activity of large populations of neurons in visual cortex recorded at the scalp surface was not sensitive enough to capture the changes in the oscillatory power of much smaller subpopulations. Moreover, the changes induced by the presence of choice-trials in the present study might even have been particularly subtle, seeing as only 33% of trials were choice-trials. The advantage of this design is that it controls for frequency effects: With a choice-trial-proportion of 33%, all conditions (choice-trials, low-reward single-trials and high-reward single-trials) have the same number of trials, which means that also all item types (low-reward target, high-reward target and distractor item) are presented equally often. But our previous work has shown that a higher proportion of choice-trials results in a larger response delay and presumably in a stronger underlying bias, so it is conceivable that a higher proportion of choice-trials might be required for oscillatory changes to be detected (see also Heuer et al.¹¹ and Wolf et al.¹² for discussions of the implications of different choice-trial proportions).

In summary, we have shown that the possibility to make choices modulates the effects of reward coupled to a non-spatial feature. Similar to what we have found for effects of reward coupled to spatial location^11,12, responses to low-reward single targets were more delayed when choices between targets of different value were interleaved. Presumably, this is the result of an anticipatory bias in feature-based attention: Suppressing the feature value associated with a low reward in preparation of target presentation ensures that the more valuable target will be more readily selected when given the opportunity to make a choice. At a broader level, our findings support the notion that reward primarily affects performance when it is of immediate behavioral relevance, for instance due to the possibility to maximize positive outcome by making choices.

Methods

Participants

Twenty-six students of Philipps-Universität Marburg participated in the experiment. The data from three participants had to be excluded: Two because of technical problems that unsystematically distorted the EEG markers, and one because of excessive alpha activity. Analyses were performed on the data of the remaining twenty-three participants (18 female, five male; mean age 21 years, range 19–31 years). The experiment was conducted in accordance with the ethical standards laid down in the Declaration of Helsinki and approved by the Ethics Committee of the Faculty of Psychology. All participants provided informed written consent, were naive to the purpose of the experiment, and had normal or corrected-to-normal visual acuity and color vision. Visual acuity and color vision were tested with the OCULUS Binoptometer 3 (OCULUS Optikgeräte GmbH, Wetzlar, Germany).

Apparatus and stimuli

The experiment was conducted in a dimly-lit and electrically shielded room. Participants were seated in a comfortable chair and were facing a monitor (22″, 1680 × 1050 px) at a viewing distance of 104 cm. Stimulus presentation and response collection were controlled by a Windows PC using E-Prime 2.0 software (Psychology Software Tools, Inc.). Participants responded by pressing buttons on the back of a gamepad (Microsoft SideWinder USB) with their left or right index finger. Three isoluminant colors were used as target and distractor colors: blue, green and yellow. All other stimuli were black and all stimuli were presented on a grey background. Target and distractor items as well as the small fixation cross shown during the preparatory period and during target presentation all had a size of 0.55° of visual angle. The large fixation crosses presented at the beginning of each trial and between trials, and the reward feedback presented at the end of each trial subtended 1.10°. Target and distractor items appeared 9.84° left or right from fixation.

Procedure and Design

Figure 1 depicts the trial procedure. For the first 500–1000 ms of every trial, a fixation cross was shown. This presentation duration varied in randomly chosen steps of 100 ms. Until target and distractor onset, two placeholders (small fixation crosses) were presented at the upcoming target positions. The central fixation cross changed its size indicating the onset of the 600 ms preparatory period. Then, two circle-shaped items of different colors appeared, replacing the placeholders. For each participant, two out of three colors were defined as target colors. In the example illustrated in Fig. 1, green and blue are target colors, and the third color (yellow) served as distractor color. The color assignment was balanced across participants. In single-trials, one of the items was a target, as defined by its color, and participants had to indicate whether the target was presented to the left or right from fixation by pressing the spatially corresponding button (left or right) on a gamepad. In choice-trials, both items were targets, and participants could freely choose by pressing the left or the right button. The target was displayed until response or for a maximum of 700 ms. Participants received a reward for correct responses within this reaction time window. In each block of trials, one of the two target colors was assigned a low and the other one a high reward. In single-trials, correct responses were rewarded with either a low reward (+1 point) or a high reward (+9 points), depending on the target color. In choice-trials, the reward depended on the color of the chosen target. At the end of the experiment, reward points were converted into a monetary reward (35 Cents for 1000 points). Reward feedback (“+1”, “+9” or “+0”) at the end of each trial was presented for 700 ms. Inter-trial intervals varied randomly between 500 and 1000 ms in steps of 100 ms.

The experiment comprised 1728 trials in total. The two choice-trial proportions (0 vs. 0.33) were crossed with the assignment of reward to the two target colors (color 1 low, color 2 high vs. color 1 high, color 2 low) and varied blockwise (four blocks of 432 trials each). In 0% choice-trial blocks, half of all trials were low-reward single trials and the other half were high-reward single-trials. In 33% choice-trial blocks, one third of trials were low-reward single-trials, one third were high-reward single-trials, and the remaining third were choice-trials. Choice-trial proportion changed after each block and the reward assignment changed after two blocks (i.e., after the first half of the experiment). Within blocks, trial types (low-reward single-trial, high-reward single-trial, choice-trial) were chosen randomly. This design was disclosed to the participants. We balanced the order of the blocks across participants. Within each block, participants could take a short rest every 36 trials.

Behavioral analyses

We excluded trials if the reaction time was more than 2.5 SD above the individual mean reaction time. This applied on average to 2.1% of all trials. The dependent variable of interest were reaction times in single-trials. Mean reaction times, including only trials with correct responses, were calculated separately for each proportion of choice-trials (0 vs. 0.33) and for low- and high-reward targets, and compared using a two-way repeated measures ANOVA. The same analysis was also computed for accuracy in percent to ensure that reaction times were not affected by a systematic trade-off between speed and accuracy. Moreover, we examined the percentage of high-reward choices in choice-trials as a manipulation check.

To examine intertrial effects, single-trials in the 33% choice-trial condition were sorted according to the preceding trial (choice-trial vs. low-reward single-trial vs. high-reward single-trial) and the reward associated with the target (low vs. high), and analyzed with a two-way repeated measures ANOVA.

EEG recording and analyses

The EEG was recorded with 64 Ag/AgCl active electrodes (actiCAP, Brain Products, Munich, Germany) positioned according to the International 10–20 system. We recorded the horizontal (hEOG) and vertical electrooculogram (hEOG) as the voltage difference between electrodes positioned to the left and right of the eyes, and above and below. All electrodes were referenced to FCz and re-referenced offline to the average of all electrodes. Impedances were kept below 5 kΩ. The signal was recorded at a sampling rate of 1000 Hz with a high cutoff filter of 250 Hz and a low cutoff filter of 0.016 Hz.

Oscillatory activity in the preparatory period was analyzed in essentially the same way as in our previous study¹² to facilitate comparison of the findings. However, in contrast to our previous study, we did not compute a lateralization index to investigate hemispheric differences in alpha power, because reward was no longer coupled to the visual hemifields. Instead, we examined parieto-occipital alpha power averaged across the hemispheres.

EEG preprocessing and analyses were performed in MATLAB (MathWorks) using the Fieldtrip toolbox⁴¹ and custom scripts. The continuous EEG was segmented into epochs of 2200 ms, starting 1000 ms before the onset of the preparatory period. We chose this comparatively long epoch to allow calculating wavelet coefficients for all frequencies and time points of interest, including the preparatory period and a preceding baseline from −500 to 0 ms. Trials that were incorrect, identified as reaction time outliers (>2.5 SD from individual mean reaction time), or that contained blinks (vEOG > 100 µV) or eye movements (hEOG > 70 µV) in the critical time window (−500 ms to 600 ms with respect to the onset of the preparatory period) were removed from the data. Segments were also excluded, when the absolute voltage in the channels of interest (O1/2, PO3/4 and PO7/8) exceeded 80 µV.

Time-frequency representations of the preparatory period in each trial were computed by convolving 5-cycle Morlet wavelets with the EEG segments for frequencies from 5 to 30 Hz with a resolution of 1 Hz. We applied this procedure in steps of 10 ms throughout the preparatory period and the preceding baseline of 500 ms for three electrode pairs over parieto-occipital cortex (O1/2, PO3/4 and PO7/8). Power estimates were baseline-corrected by dividing by the average power in the 500 ms preceding the onset of the preparatory period at each frequency. The resulting values thus reflect the change in power relative to the baseline period: a value of 1 indicates no change, values greater than 1 indicate a power increase, and values smaller than 1 indicate a power decrease. The relative power change estimates were averaged across electrodes and across frequency bins in the alpha range (8–14 Hz), separately for the 0% and 33% choice-trial condition. We excluded the first and last 100 ms of the preparatory period from the analysis so that it would not be affected by perceptual processing of the fixation cross change (i.e., the onset of the preparatory period) and the target. The remaining 400 ms were divided into an early (100–300 ms) and a late (300–500 ms) time window of analysis to ensure that more transient changes in alpha power would not be missed. These relative alpha power change estimates were submitted to a two-way repeated measures ANOVA with the factors proportion of choice-trials (0 vs. 0.33) and time window of analysis (early vs. late).

To examine intertrial effects, trials in the 33% choice-trial condition were further split according to the preceding trial type (choice-trial vs. low-reward single-trial vs. high-reward single-trial), and the resulting relative alpha power change estimates submitted to a two-way repeated measures ANOVA with the additional factor time window of analysis (early vs. late).

Data Availability

The data are available at the following https://doi.org/10.5281/zenodo.1453309.

References

Anderson, B. A. The attention habit: How reward learning shapes attentional selection. Ann. NY Acad. Sci. 1369, 24–39 (2016).
Article ADS Google Scholar
Failing, M. & Theeuwes, J. Selection history: How reward modulates selectivity of visual attention. Psychon. Bull. Rev. 25, 514–538 (2017).
Article Google Scholar
Dunne, S., Ellison, A. & Smith, D. T. Rewards modulate saccade latency but not exogenous spatial attention. Front. Psychol. 6, 1–9 (2015).
Article Google Scholar
Le Pelley, M. E., Pearson, D., Griffiths, O. & Beesley, T. When goals conflict with values: Counterproductive attentional and oculomotor capture by reward-related stimuli predictiveness-driven attentional capture. J. Exp. Psychol. Gen. 144, 158–171 (2015).
Article Google Scholar
Schütz, A. C., Tommershäuser, J. & Gegenfurtner, K. R. Dynamic integration of information about salience and value for smooth pursuit eye movements. Proc. Natl. Acad. Sci. USA 109, 7547–7552 (2012).
Article ADS Google Scholar
Gong, M. & Li, S. Learned reward association improves visual working memory. J Exp Psychol Hum Percept. Perform. 40, 841–856 (2014).
Article Google Scholar
Heuer, A. & Schubö, A. Separate and combined effects of action relevance and motivational value on visual working memor. J. Vis. 18, 14 (2018).
Article Google Scholar
Klink, P. C., Jeurissen, D., Theeuwes, J., Denys, D. & Roelfsema, P. R. Working memory accuracy for multiple targets is driven by reward expectation and stimulus contrast with different time-courses. Sci. Rep. 7, 9082 (2017).
Article ADS Google Scholar
Chapman, C. S., Gallivan, J. P. & Enns, J. T. Separating value from selection frequency in rapid reaching biases to visual targets. Vis Cognit. 23, 249–271 (2015).
Article Google Scholar
Moher, J., Anderson, B. A. & Song, J.-H. Dissociable effects of salience on attention and goal-directed action. Curr. Biol. 25, 2040–2046 (2015).
Article CAS Google Scholar
Wolf, C., Heuer, A., Schubö, A. & Schütz, A. C. The necessity to choose causes effects of reward on saccade preparation. Sci. Rep. 7, 16966 (2017).
Article ADS Google Scholar
Heuer, A., Wolf, C., Schütz, A. C. & Schubö, A. The necessity to choose causes reward-related anticipatory biasing: Parieto-occipital alpha-band oscillations reveal suppression of low-value targets. Sci. Rep. 7, 14318 (2017).
Article ADS Google Scholar
Kelly, S. P., Lalor, E. C., Reilly, R. B. & Foxe, J. J. Increases in alpha oscillatory power reflect an active retinotopic mechanism for distracter suppression during sustained visuospatial attention. J. Neurophysiol. 95, 3844–3851 (2006).
Article Google Scholar
Klimesch, W. Alpha-band oscillations, attention, and controlled access to stored information. Trends Cogn. Sci. 16, 606–617 (2012).
Article Google Scholar
Rihs, T. A., Michel, C. M. & Thut, G. A bias for posterior alpha-band power suppression versus enhancement during shifting versus maintenance of spatial attention. Neuroimage 44, 190–199 (2009).
Article Google Scholar
Sauseng, P. et al. A shift of visual spatial attention is selectively associated with human EEG alpha activity. Eur. J. Neurosci. 22, 2917–2926 (2005).
Article CAS Google Scholar
Thut, G., Nietzel, A., Brandt, S. A. & Pascual-Leone, A. Alpha-band electroencephalographic activity over occipital cortex indexes visuospatial attention bias and predicts visual target detection. J. Neurosci. 26, 9494–9502 (2006).
Article CAS Google Scholar
Worden, M. S., Foxe, J. J., Wang, N. & Simpson, G. V. Anticipatory biasing of visuospatial attention indexed by retinotopically specific alpha-band electroencephalography increases over occipital cortex. J. Neurosci. 20, RC63 (2000).
Article CAS Google Scholar
Yamagishi, N., Goda, N., Callan, D. E., Anderson, S. J. & Kawato, M. Attentional shifts towards an expected visual target alter the level of alpha-band oscillatory activity in the human calcarine cortex. Brain Res. Cogn. Brain Res. 25, 799–809 (2005).
Article Google Scholar
Foxe, J. J. & Snyder, A. C. The role of alpha-band brain oscillations as a sensory suppression mechanism during selective attention. Front. Psychol. 2, 154 (2011).
Article Google Scholar
Iemi, L., Chaumon, M., Crouzet, S. M. & Busch, N. A. Spontaneous neural oscillations bias perception by modulating baseline excitability. J. Neurosci. 37, 807–819 (2017).
Article CAS Google Scholar
Carrasco, M. Visual attention: The past 25 years. Vision Res. 51, 1484–1525 (2011).
Article Google Scholar
Maunsell, J. H. R. & Treue, S. Feature-based attention in visual cortex. Trends Neurosci. 29, 317–322 (2006).
Article CAS Google Scholar
Cohen, M. R. & Maunsell, J. H. R. Using neuronal populations to study the mechanisms underlying spatial and feature attention. Neuron 70, 1192–1204 (2011).
Article CAS Google Scholar
Patzwahl, D. R. & Treue, S. Combining spatial and feature-based attention within the receptive field of MT neurons. Vision Res. 49, 1188–1193 (2009).
Article Google Scholar
Giesbrecht, B., Woldorff, M. G., Song, A. W. & Mangun, G. R. Neural mechanisms of top-down control during spatial and feature attention. Neuroimage 19, 496–512 (2003).
Article CAS Google Scholar
Greenberg, A. S., Esterman, M., Wilson, D., Serences, J. T. & Yantis, S. Control of spatial and feature-based attention in frontoparietal cortex. J. Neurosci. 30, 14330–14339 (2010).
Article CAS Google Scholar
Slagter, H. A. et al. fMRI evidence for both generalized and specialized components of attentional control. Brain Res. 1177, 90–102 (2007).
Article CAS Google Scholar
Andersen, S. K., Fuchs, S. & Müller, M. M. Effects of feature-selective and spatial attention at different stages of visual processing. J. Cogn. Neurosci. 23, 238–246 (2011).
Article Google Scholar
Heuer, A. & Schubö, A. Feature-based and spatial attentional selection in visual working memory. Mem. Cogn. 44, 621–632 (2016).
Article Google Scholar
Liu, T., Stevens, S. T. & Carrasco, M. Comparing the time course and efficacy of spatial and feature-based attention. Vision Res. 47, 108–113 (2007).
Article Google Scholar
White, A. L., Rolfs, M. & Carrasco, M. Stimulus competition mediates the joint effects of spatial and feature-based attention. J. Vis. 15, 7 (2015).
Article Google Scholar
Heuer, A., Schubö, A. & Crawford, J. D. Different cortical mechanisms for spatial vs. feature-based attentional selection in visual working memory. Front. Hum. Neurosci. 10, 415 (2016).
Article Google Scholar
Schenkluhn, B., Ruff, C. C., Heinen, K. & Chambers, C. D. Parietal stimulation decouples spatial and feature-based attention. J. Neurosci. 28, 11106–11110 (2008).
Article CAS Google Scholar
de Lange, F. P., Rahnev, D. A., Donner, T. H. & Lau, H. Prestimulus oscillatory activity over motor cortex reflects perceptual expectations. J. Neurosci. 33, 1400–1410 (2013).
Article Google Scholar
Snyder, A. C. & Foxe, J. J. Anticipatory attentional suppression of visual features indexed by oscillatory alpha-band power increases: A high-density electrical mapping study. J. Neurosci. 30, 4024–4032 (2010).
Article CAS Google Scholar
Wildegger, T., van Ede, F., Woolrich, M., Gillebert, C. R. & Nobre, A. C. Preparatory alpha-band oscillations reflect spatial gating independently of predictions regarding target identity. J. Neurophysiol. 117, 1385–1394 (2017).
Article CAS Google Scholar
Anderson, B. A., Laurent, P. A. & Yantis, S. Reward predictions bias attentional selection. Front. Hum. Neurosci. 7, 262 (2013).
PubMed PubMed Central Google Scholar
Hickey, C., Chelazzi, L. & Theeuwes, J. Reward changes salience in human vision via the anterior cingulate. J. Neurosci. 30, 11096–11103 (2010).
Article CAS Google Scholar
Kiss, M., Driver, J. & Eimer, M. Reward priority of visual target singletons modulates ERP signatures of attentional selection. Psychol. Sci. 20, 245–251 (2009).
Article Google Scholar
Oostenveld, R., Fries, P., Maris, E. & Schoffelen, J. M. FieldTrip: Open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput. Intell. Neurosci. 2011, 156869 (2011).
Article Google Scholar

Download references

Acknowledgements

Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – project number 222641018– SFB/TRR 135 TP B2 & B3. We would like to thank Pius Kern for assistance with data collection.

Author information

Anna Heuer
Present address: Department of Psychology, Humboldt-Universität zu Berlin, Rudower Chaussee 18, 12489, Berlin, Germany

Authors and Affiliations

Experimental and Biological Psychology, Philipps-Universität Marburg, Marburg, Germany
Anna Heuer, Christian Wolf, Alexander C. Schütz & Anna Schubö

Authors

Anna Heuer
View author publications
You can also search for this author in PubMed Google Scholar
Christian Wolf
View author publications
You can also search for this author in PubMed Google Scholar
Alexander C. Schütz
View author publications
You can also search for this author in PubMed Google Scholar
Anna Schubö
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.H., C.W., A.C.S. and A.S. conceived and designed the research. A.H. analysed the data and wrote the first draft of the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Anna Heuer.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Heuer, A., Wolf, C., Schütz, A.C. et al. The possibility to make choices modulates feature-based effects of reward. Sci Rep 9, 5749 (2019). https://doi.org/10.1038/s41598-019-42255-1

Download citation

Received: 04 November 2018
Accepted: 26 March 2019
Published: 08 April 2019
DOI: https://doi.org/10.1038/s41598-019-42255-1

This article is cited by

Vision as oculomotor reward: cognitive contributions to the dynamic control of saccadic eye movements
- Christian Wolf
- Markus Lappe
Cognitive Neurodynamics (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.