The reward system is important in assessing outcomes to guide behavior. To achieve these purposes, its core components interact with several brain areas involved in cognitive and emotional processing. A key mechanism suggested to subserve these interactions is oscillatory activity, with a prominent role of theta and high-beta oscillations. The present study used single-trial coupling of simultaneously recorded electroencephalography and functional magnetic resonance imaging data to investigate networks associated with oscillatory responses to feedback during a two-choice gambling task in healthy male participants (n=19). Differential associations of theta and high-beta oscillations with non-overlapping brain networks were observed: Increase of high-beta power in response to positive feedback was associated with activations in a largely subcortical network encompassing core areas of the reward network. In contrast, theta-band power increase upon loss was associated with activations in a frontoparietal network that included the anterior cingulate cortex. Trait impulsivity correlated significantly with activations in areas of the theta-associated network. Our results suggest that positive and negative feedback is processed by separate brain networks associated with different cognitive functions. Communication within these networks is mediated by oscillations of different frequency, possibly reflecting different modes of dopaminergic signaling.
'Once bitten, twice shy': adaptive behavior depends on the ability to recognize contingencies and to use them to make predictions about future events. These functions are carried out by the reward system, core components of which include the ventral tegmental area and substantia nigra, ventral and dorsal striatum, and dorso-/ventromedial prefrontal areas.1, 2 Research into the reward system is relevant for our understanding of psychiatric disorders such as psychotic, mood and substance disorders.
The reward system does not act in isolation; its output needs to be evaluated and also registered in memory. To achieve these purposes, the aforementioned core reward regions interact with several other areas—most notably regions involved in cognitive and emotional processing such as the medial and lateral prefrontal cortex, hippocampus and amygdala.1 These various components need to be flexibly and differentially recruited depending on the specific context (for example, significance for survival, conflicts between short- and long-term rewards and so on). One key mechanism through which this is achieved is oscillatory activity: neuronal oscillations enable communication between distant brain areas, with oscillations of different frequency corresponding to different network configurations.3, 4, 5 Therefore, neuronal oscillations of varying frequency are a plausible candidate as the mechanism of flexible communication within the reward system.
Several electroencephalography (EEG) studies have provided evidence for frequency-specific responses to different reward-related stimuli. In gambling paradigms, processing of positive outcomes is mainly associated with oscillations in the high-beta/low-gamma frequency range. On the other hand, losses are accompanied by an increase in the power and synchronization of oscillations in the theta frequency range6, 7, 8 and, partly associated with these, by a negative event-related potential with a midfrontal scalp distribution, the feedback-related negativity.9 Beta- and theta-band oscillations respond to different features of the feedback stimulus: For example, the theta-band response has been reported to be mainly driven by feedback valence,8, 9 whereas high-beta oscillations are affected by additional aspects of reward-related stimuli such as their probability and magnitude.6, 8, 10 Moreover, previous studies by our group and others indicate that the two types of oscillatory response are differentially associated with trait impulsivity: Both in healthy subjects11, 12, 13 and in patients with borderline personality disorder and alcohol dependence,14, 15 impulsivity is associated with dampened theta-band oscillatory responses to negative feedback, an effect that involves the dorsal anterior cingulate cortex (dACC) and possibly also lateral prefrontal areas;14 in contrast, beta oscillatory responses to reward are not correlated with trait impulsivity.13, 14
The above dissociation supports the notion of a frequency-specific, context-dependent modulation of the reward system. However, it is still unclear whether the latter involves separate sub-networks within the reward system, or rather represents the same components interacting with each other by means of different oscillatory processes. On the basis of theoretical considerations, it was proposed that the former is the case, with high-beta activity originating in ventromedial, and theta activity in dorsomedial, prefrontal areas.16, 17 However, EEG studies have failed to conclusively confirm or disconfirm this hypothesis; the two types of oscillatory response have a largely overlapping midfrontal topography, and source localization studies have provided partly inconsistent results. Theta-band oscillations and the closely associated feedback-related negativity in response to negative feedback have often been reported to originate in the anterior cingulate cortex (ACC),16, 18 but other generators such as the posterior cingulate cortex19 or basal ganglia20, 21 have also been proposed. High-beta responses to reward, on the other hand, have been localized in dorsolateral prefrontal areas14, 22 but, in one case, also in the ACC.14
The above discordant findings exemplify limitations of EEG-based approaches, resulting from the lack of a unique solution to the inverse problem of cortical source localizations based on scalp-recorded activity. Another limitation of EEG is that it is restricted in its capacity to detect activity in deep-located structures of the brain; this constrains its usefulness when investigating the reward system, which comprises several subcortical components. These limitations can be overcome with use of multimodal imaging techniques that combine EEG and functional magnetic resonance imaging (fMRI) analyses, profiting both from the superiority of EEG in assessing the temporal characteristics of neural oscillations and from the excellent spatial resolution of fMRI.23, 24, 25
So far, only two studies have used multimodal techniques to depict networks associated with the feedback-related negativity26 or high-beta oscillatory responses to reward.27 Their findings suggest that the two EEG measures correspond to different networks: the feedback-related negativity in response to negative feedback was associated with activations in a purely cortical network,26 whereas the high-beta oscillatory responses to positive feedback corresponded to a network comprising not only lateral frontal, but also striatal and hippocampal areas.27 However, it should be kept in mind that comparability of the above two studies is limited due to the different paradigms used. According to recent evidence,28 the same brain areas may communicate in different frequencies depending on the exact cognitive operations involved, even within the same cognitive domain (that is, learning from feedback).
Prompted by the above, the aim of the present study was to investigate the networks associated with oscillatory responses to feedback within the context of the same gambling paradigm, using EEG-informed fMRI. Based on existing literature, we hypothesized that theta-band oscillations upon loss would be associated with activations in a frontoparietal network including the ACC, whereas high-beta oscillations in response to positive feedback would involve activations of frontal, striatal and hippocampal areas. Furthermore, we expected that processing of different feedback dimensions (valence vs magnitude) would implicate different brain areas in both frequency ranges; the latter assumption was not specified further owing to the scarcity of related previous findings. A secondary aim of the study was to explore the association between trait impulsivity and theta-associated activations in response to negative feedback. According to our previous results, we expected a negative correlation between impulsivity scores and theta-band-associated activations in the dACC and/or lateral prefrontal areas.
Materials and methods
The study was conducted in accordance with the Declaration of Helsinki and was approved by the ethics committee of the Medical Council of Hamburg. All the participants provided written informed consent.
Twenty-two healthy male individuals (age 23.67±3.2 years) were recruited among students of the University of Hamburg. The sample size was defined based on previous studies by our group on feedback processing.13, 14 All the participants were nonsmokers and had normal or corrected-to-normal vision. Exclusion criteria were lifetime psychotic, bipolar or substance-use disorders, depressive or anxiety disorders in the past year, neurological or major somatic illnesses, and psychotropic or any other medication known to affect cognitive functions. Three subjects were excluded from the analyses (one because he later admitted to daily cannabis consumption in the week preceding the testing session, one because of major head movement and one because of very poor EEG data quality). Thus, 19 participants were included in the final analysis.
In all the participants, trait impulsivity was assessed with the Barratt Impulsiveness Scale (BIS), a 30-item Likert-type self-report questionnaire yielding scores for attentional, motor and non-planning impulsivity.29 The BIS has been widely used in similar studies and has good reliability and validity.30 Moreover, a general screening of personality attributes was carried out with the German version of the NEO Five-Factor Inventory,31 a self-rating instrument containing 60 items that are rated on a five-point Likert-type scale across five personality dimensions: neuroticism, extraversion, openness to experience, conscientiousness and agreeableness.
Presentation version 17, installed on a computer set in a monitoring room shielded from the MR scanner, was used for stimulus presentation. The experiment consisted of four blocks of 100 trials each. At the beginning of each trial, two numbers (25 and 5) were presented on the screen in randomized position order (  or  ). The participants were instructed to choose one of the two numbers by button press within 1 s of the stimulus onset. Two seconds after trial onset, the selected number was set to bold; if the participant had failed to press a button in the required time, the trial was dismissed. After a further delay of 2 s, one of the numbers randomly turned green and the other red, indicating whether the selected amount (25 or 5) was added (green—win feedback) or subtracted (red—loss feedback) from the participant’s account. The trial ended with a 2 s display presenting the current account balance. A 2 s fixation square preceded the next trial.
The participants were instructed in a standardized manner and practiced the paradigm in advance. They were informed that their aim was to gain as many points as possible, that loss and gain events occurred at equal probability and that in each trial they were free to choose the high- or low-risk option without any constraints. The participants were reimbursed with 30€ for study participation.
EEG was recorded during fMRI acquisition using BrainVision Recorder (Version 1.10, Brain Products, Munich, Germany) and MR-compatible AC-amplifiers (BrainAmp MRplus; Brain Products). The electrode cap (BrainCapMR 64, Brain Products) contained 62 active sintered silver/silver chloride EEG electrodes positioned according to a modified 10/10 system; FCz served as reference and AFz as ground; an EOG electrode under the left eye and an ECG electrode recorded eye movement and data for cardioballistogram correction, respectively. The ribbon cable connecting the electrode wires and amplifiers was fixated with sand bags on foam cushions to avoid artifacts generated by the scanner’s vibrations. Electrode skin impedance was kept below 10 kΩ. The data were collected with a sampling rate of 5000 Hz and an amplitude resolution of 0.5 μV.
Imaging was performed on a 3-Tesla MR scanner (Magnetom Trio, Siemens, Munich, Germany) equipped with a 12-channel head coil. Twenty-five slices were recorded using a standard gradient echo-planar imaging (EPI) T2*-sensitive sequence for functional blood-oxygen-level-dependent (BOLD) imaging. For each block, there were 530 volumes (TR=2 s; TE=25 ms; FOV=216 mm; matrix=108 × 108; continuous slice acquisition; slice thickness=3 mm; interslice gap=1 mm). The vacuum pump of the MRI scanner was switched off during acquisition, to avoid EEG artifacts in the high-frequency ranges. A high-resolution (voxel size 1 × 1 × 1 mm) T1-weighted anatomical image (MPRAGE) was acquired for each subject in the same position as the EPI images.
EEG preprocessing and time-frequency analysis
Brain Vision Analyzer Version 2.0 (Brain Products) was used for offline EEG data preprocessing and analysis. The continuous MR-Artifact was corrected by generating a sliding average template using baseline correction. The data set was resampled to a sampling rate of 500 Hz and filtered with a 50 Hz low-pass (slope 12 dB/oct) and 0.1 Hz (slope 48 dB/oct) high-pass Butterworth zero-phase filter. For cardioballistic artifact correction, a pulse template was semi-automatically detected and marked in the electrocardiogram channel, then used to subtract the cardioballistic artifact from recordings. Prominent non-stereotyped artifacts such as movement artifacts and channel drifts were removed by visual inspection. Independent component analysis (restricted biased Infomax algorithm) was applied to eliminate further artifacts; components indicating blinks and eye-movements, residual gradient and head movement artifacts were detected and removed on the basis of their power spectrum and topography. Subsequently, the EEG signal was re-referenced to a common average reference and segmented into periods of 3 s, starting 1800 ms before the feedback stimulus. Baseline correction for the 200 ms pre-stimulus interval was applied. An automatic artifact correction procedure rejected segments that contained voltage steps higher than 50 μV, amplitudes exceeding ±95 μV or a difference higher than 200 μV between the highest and lowest value, or activity below 0.5 μV.
Time-frequency information was extracted at the single-trial level for EEG activity at electrode Fz (similar to previous studies by our group13, 14) using wavelet convolution for the frequencies from 2 to 50 Hz (complex Morlet wavelet, 25 frequency steps distributed on a logarithmic scale, Morlet parameter c=6, Gabor Normalization). Oscillatory power at each time point and frequency layer was divided by a baseline norm value n representing the sum of values across the 200 ms pre-stimulus baseline, weighted by the relative length of the baseline interval with respect to total segment length. In this way, power changes with respect to the pre-stimulus baseline were assessed, rather than absolute power. Layers with central frequencies of 5.1Hz (range: 4.4–5.8 Hz) and 25.5 Hz (range: 22–29 Hz) were extracted to investigate theta and high-beta activity, respectively. Markers indicating the maximum peak power 100–600 ms post-stimulus for theta and 100–500 ms post-stimulus for high-beta were set.
The effects of feedback valence (gain vs loss) and magnitude (25 vs 5 points) on theta and high-beta power were assessed with separate linear-mixed models (LMMs), which were preferred over repeated-measures ANOVAs because they are better suited to address inter-subject variability. Dependent variables for the two linear-mixed models were averaged peak theta and high-beta values over trials; valence and magnitude were repeated-measures fixed-effect factors. The valence × magnitude interaction was initially included in the models but subsequently removed, as it was not significant in either of the two analyses (both P>0.180). Both linear-mixed models used the maximum-likelihood estimation algorithm and a diagonal covariance structure; subject ID was included as a random factor.
fMRI preprocessing and analysis
The fMRI data were processed using standard procedures implemented in the Statistical Parameter Mapping software (SPM12, Wellcome Department of Imaging Neuroscience, London, UK, www.fil.ion.ucl.ac.uk/spm/). The first five volumes of each block were discarded to allow for MRI saturation effects. The preprocessing included slice timing, realignment, registration to standard space (Montreal Neurological Institute) and spatial smoothing with an 8 mm Gaussian kernel.
BOLD responses to feedback stimuli were examined using the general linear model approach. For first-level analyses, the following conditions were modeled as regressors through convolution with a canonical hemodynamic response function: (a) the four conditions of feedback (large gain, large loss, small gain, small loss); (b) initial stimulus presentation; (c) motor response; (d) anticipation phase; and (e) presentation of the account balance. Motion parameters (n=6) were included in the model as regressors of no interest.
EEG-informed fMRI analysis
Coupling effects of theta and high-beta power with BOLD activity were investigated in two separate general linear models. For each condition (large gain, large loss, small gain, small loss), a parametric modulator corresponding to single-trial (theta or high-beta) oscillatory power measured at Fz was added to the respective regressor representing onsets of the events of interest in the design matrix. To remove shared variance between regressors and parametric modulators, the latter were orthogonalized with respect to the former by subtracting the mean (theta or high-beta) oscillatory power within each block and condition from the single-trial power for the corresponding condition.
First-level contrasts were calculated for each feedback condition compared with baseline, and entered into a second-level flexible factorial model with three factors (valence, magnitude and subject ID as a random effect). Analyses were carried out for valence (gain > loss, loss > gain) and magnitude contrasts (large > small, small > large feedback). Effects observed at P<0.001 and surviving a false discovery rate (FDR) correction at the cluster level at P(FDR)<0.05 are reported as significant for fMRI analyses. EEG–BOLD coupling analyses typically produce weak effect sizes because of the low signal-to-noise ratio in single EEG trials. Therefore, we used a more lenient threshold of P<0.005 (uncorrected) with a cluster extent of 100 voxels.
Correlations with trait impulsivity
We used a functional region-of-interest approach to investigate correlations between BIS subscales and theta-associated activations upon loss feedback: Using MarsBar (marsbar.sourceforge.net), spherical regions of interest with a 5 mm radius were built around the peak voxel of each cluster that achieved significance for the loss vs gain contrast in the theta-band EEG–BOLD coupling analysis. The mean of the linear fit coefficient of all voxels within the sphere was used as the region-of-interest summary measure for correlations. As there were no significant deviations from normality (Wilcoxon's test), Pearson's r was used for correlational analyses. The Benjamini and Hochberg FDR method34 was used to correct for multiple testing. Although our hypothesis was specific to the theta band and the loss vs gain contrast, for comparison we also conducted similar correlational analyses for the high-beta band and the opposite (that is, gain vs loss) contrast.
The following personality dimension scores (means±s.d.) were derived from the NEO Five-Factor Inventory: Neuroticism 12.58±6.3; extraversion 28.89±6.1; openness to experience 36.89±6.39; agreeableness 34.44±8.7; conscientiousness 36.53±5.5.
In six participants, only three blocks were available for analysis because of technical problems during acquisition (n=2), significant artifacts in single blocks that could not be removed with the procedures described above (n=2), and selection of the same number (25 or 5) throughout the block, resulting in null regressors in the general linear model analysis (n=2). For these participants, first-level general linear models were constructed on the basis of the three remaining blocks, and first-level contrasts were adjusted for the number of blocks in all the participants. All the results reported below are based on the same number of blocks in all participants. Only significant results are reported.
The main effect of valence was significant both in the theta (F(1,38.00)=6.913, P=0.012) and high-beta band (F(1,54.87)=7.729, P=0.007). The direction of effects was as expected: power was increased upon negative feedback in the theta band, and upon positive feedback in the high-beta frequency band (Figure 1). There were no significant magnitude effects in either frequency band (theta F(1,42.51)=1.348, P=0.252; beta F(1,53.96)=0.25, P=0.619).
For the gain > loss contrast, significant activations were observed in two large clusters that included the ventral striatum, putamen, caudate nucleus, amygdala and hippocampi bilaterally, but also in anterior and posterior medial areas, and bilateral lateral temporal areas (Figure 2a and Table 1).
The magnitude contrast (large > small) revealed significant activations in the dACC, posterior medial, and occipital lateral areas (Table 1). Finally, the small > large contrast revealed activations in the right temporoparietal junction and the left lateral prefrontal cortex (Table 1).
Oscillatory power coupling with BOLD activity
High-beta frequency band
Regions that showed increased BOLD activity for the contrast gain > loss included the right ventral striatum (including the nucleus accumbens) and amygdala, medial posterior and parahippocampal areas bilaterally, and lateral temporal areas bilaterally (Figure 2b and Table 2); activations in posterior areas and the right lateral temporal cortex achieved significance at a cluster-corrected threshold of P(FDR)<0.05.
The magnitude contrast (large > small) revealed high-beta-associated activity in the ACC (Table 2).
Theta frequency band
For the loss > gain contrast, significant associations with theta power were observed in the dACC, right dorsolateral prefrontal cortex (DLPFC), left and right temporoparietal junction, and left superior parietal cortex (Figure 2b and Table 3).
The magnitude contrast (large > small) yielded significant results in right inferomedial temporal areas (Table 3).
Correlations of theta- and high-beta-associated BOLD activations with impulsivity
Significant negative correlations with BIS subscales were observed for theta-associated activity upon loss in the following areas: (a) right DLPFC with BIS non-planning score (r=0.641, P=0.003) and BIS attention (r=0.551, P=0.014); (b) left superior parietal cortex with BIS non-planning score (r=0.531, P=0.019). Trend-wise correlations were also observed for the dACC region of interest with BIS attention (r=0.428, P=0.068) and BIS motor impulsivity (r=0.439, P=0.060). After correction for multiple testing, the correlation of right DLPFC theta-associated activity with BIS non-planning score remained significant (P=0.046), while the correlations of right DLPFC with BIS attention and left superior parietal cortex with BIS non-planning only achieved a trend level (both P=0.097).
In the high-beta frequency band, there was a significant negative correlation between activity in the right middle temporal cortex and BIS attention (r=−541, P=0.015), which disappeared after correction for multiple testing (P=0.369).
The present study used a gambling paradigm and single-trial coupling of simultaneously recorded EEG and fMRI data to investigate brain areas associated with theta and high-beta oscillatory responses to feedback. High-beta oscillations in response to gain and theta oscillations in response to loss stimuli were associated with activations in non-overlapping brain areas. Moreover, there was evidence that trait impulsivity correlated negatively with theta-associated activity upon negative feedback in some components of the respective network.
The network associated with high-beta oscillatory responses to gain involved regions typically associated with reward (ventral striatum) and memory processing (hippocampus, anterior lateral temporal cortex). It also included the posterior cingulate cortex; although this region is typically associated with the default mode network,35 it has been consistently implicated in reward processing, and especially positive outcome processing, by meta-analyses of fMRI data.36, 37 Its exact role is unclear, but studies in non-human primates suggested that it may mediate the integration of stimulus characteristics to motivate a shift in behavior.38 Thus, our findings are consistent with the proposition17 that oscillations in the high-beta/low-gamma frequency range may mediate the synchronization of brain regions involved in learning from positive feedback. A similar high-beta-/low-gamma-associated network was reported by a previous study by Mas-Herrero et al.,27 in which a different multimodal imaging technique was used to assess a gambling paradigm. A notable difference is the absence of beta-associated prefrontal cortex activations in our study.27 This might be attributed to subtle differences in the gambling paradigms used: The study by Mas-Herrero et al. included a trial-by-trial manipulation of the probability of winning (25, 50 or 75%), a dimension known to affect high-beta/low-gamma oscillations in response to reward.10 Moreover, their paradigm included different winning probabilities for different stimuli and thus conceivably promoted the use of explicit strategies to optimize gains more than the paradigm we implemented, in which the probability of winning was determined entirely by chance. Interestingly, using the same paradigm in a previous EEG study,14 we observed frontal cortex activations when contrasting only the two maximum feedback conditions (gain 25 vs loss 25), which are also those with the greatest influence on behavior.32 Thus, it may be that frontal activations are dependent on the usefulness of feedback for behavioral adjustments; this is consistent with the results of a previous EEG source localization study,22 in which high-beta-associated DLPFC activity was only observed when stimulus-reward contingencies could be used to optimize performance (see also below, section on dACC and feedback magnitude).
The theta-band response to loss events was associated with a different network comprising mainly frontoparietal areas. All these areas have been associated with negative feedback in two previous studies that used different EEG-derived components to inform fMRI analyses.26, 39 Moreover, an EEG study implicated theta-band synchronization of midfrontal with dorsolateral prefrontal and parietal areas in feedback processing.40 The parietal cortex has been associated with attentional processes, and the DLPFC with cognitive monitoring and control; both of these areas have been associated with strategy switching in learning tasks.41, 42 Therefore, their theta-mediated synchronization with the dACC conceivably reflects the mobilization of attentional and cognitive resources in the face of a negative outcome necessitating a new strategy. In line with this view, it has been suggested that theta oscillations have a role in processing feedback stimuli that indicate a need for a behavioral change.28
The above dissociation between activations associated with high-beta and theta oscillations is consistent with our hypotheses and existing theoretical accounts of feedback-based learning.16 Moreover, it entails the possibility that the two networks are differentially linked to the dopamine system. Midbrain dopamine neurons exhibit two modes of signaling patterns in vivo: low-frequency (<10 Hz) discharges in a pacemaker-like fashion, and transient high-frequency activity (15–30 Hz).43 It is assumed that low-frequency activity regulates tonic levels of dopamine,44, 45 while high-frequency activity gives rise to phasic dopamine responses.45, 46 Converging evidence suggests that these two types of dopaminergic activity have quite different functions. Phasic dopamine signals encode prediction error signals and are essential for processing of positive feedback in the ventral striatum, possibly reinforcing behaviors that lead to reward by regulating synaptic plasticity.47 On the other hand, tonic dopamine levels in the prefrontal cortex have been suggested to be relevant for motivation,48, 49 as well as for producing a sustained activation state that promotes attentional and monitoring functions while the individual is pursuing a goal.44, 48, 50 These two different modes of dopamine signaling might be reflected in the two different activation patterns we observed—a high-frequency network in reward areas and a low-frequency network in areas associated with cognitive monitoring and control. However, it remains to be determined how the slow dynamics of the mesocortical, low-frequency dopamine system might trigger the fast theta response, occurring within milliseconds from negative feedback (see for example, Jocham and Ullsperger51).
The dACC was prominently involved in the processing of negative feedback, in line with the proposed role of this region in using action outcomes to guide future behavior.52, 53 However, there was also an association with feedback magnitude. This finding might reflect a conflict monitoring function of the dACC (see for example, Knutson et al.54), as larger gains in our paradigm also entailed the possibility of higher risk. Alternatively, it may relate to the significance of feedback magnitude for behavioral adaptation:32 In a monkey study, reward-associated beta oscillations in the ACC were dependent on the usefulness of feedback in terms of learning.55 Notably, the two aspects of dACC involvement—processing of feedback valence vs magnitude—were mediated by oscillations of different frequency, and were localized in adjacent, but distinct areas of the dACC. These results are in line with the view of the dACC as a region with significant heterogeneity, suggested to consist of sub-networks that deal with different feedback dimensions.56 They are also consistent, to revisit the points made above, with the assumption that the dACC subserves different aspects of decision-making by responding differently to changes in tonic vs phasic dopamine firing rate.48 It is argued that tonic dopamine levels enable the 'online' maintenance of task-related information through D1 receptor modulation, whereas phasic dopamine changes mediate information updating and outcome appraisal by acting on D2 receptors44, 48, 57—although there are alternative accounts.58 In this framework, increased theta-band activity in the dACC following negative outcomes could, as detailed above, reflect reallocation of cognitive resources in the face of negative feedback, whereas high-beta activity in response to large wins or losses might serve to flag events that are important for the ongoing cost-benefit analysis of selection behavior.
Various aspects of trait impulsivity correlated negatively with theta-associated BOLD activity in the right DLPFC and, to a less-pronounced degree, with the left superior parietal cortex. This finding expands upon previous studies associating trait impulsivity with a deficit in theta oscillatory responses to negative feedback.12, 13, 14, 15 It has been suggested that this deficit represents a 'reward deficiency syndrome', whereby a reward system dysfunction leads to stimulation-seeking behaviors such as drug abuse and impulsivity.59 Our results only partially confirm this hypothesis: according to the foregoing considerations, reduced theta responses to negative feedback are more likely to represent a deficit in cognitive processes that use reward system output to guide behavior, rather than dysfunctional reward mechanisms per se. However, this conclusion needs to be confirmed with studies in clinical populations, as the present study only investigated normal impulsivity variations in healthy individuals.
The present study included only male participants in an effort to increase sample homogeneity, given reports of gender differences in negative feedback processing.12, 60 However, this might entail limitations for the generalizability of results, which remain to be confirmed. A further limitation is that the simple gambling paradigm we used did not allow monitoring of participants’ expectations, thus making it impossible to disentangle the effects of reward and loss from those of prediction error. This should be kept in mind when considering the above interpretation of results and the postulated role of the dACC, given that midfrontal theta-band oscillations have been implicated in the processing not only of negative feedback, but also of unsigned (that is, valence-independent) prediction error.26, 61 Finally, although theta and high-beta are the most intensively studied frequencies in the context of reward and loss, oscillations in other frequency ranges such as alpha and delta have been also reported to be relevant for feedback processing.13, 62, 63 A detailed assessment of these frequencies would well exceed the scope of the present study, but might be an interesting goal for further studies.
In summary, we were able to show that positive and negative feedback is processed by separate brain networks associated with different cognitive functions, and possibly with different aspects of dopaminergic signaling. Communication within each of these two networks, but also processing of different feedback dimensions within the same region (dACC), were mediated by oscillations of different frequency, speaking for a prominent role of frequency-specific neuronal oscillations in the flexible, context-dependent adaptation of reward-related areas. Trait impulsivity was associated with decreased theta-associated activation in frontoparietal areas, suggesting a deficit in attentional and monitoring processes associated with reward processing in impulsive subjects.
Haber SN, Knutson B . The reward circuit: linking primate anatomy and human imaging. Neuropsychopharmacology 2010; 35: 4–26.
Koob GF, Volkow ND . Neurocircuitry of addiction. Neuropsychopharmacology 2010; 35: 217–238.
Hillebrand A, Barnes GR, Bosboom JL, Berendse HW, Stam CJ . Frequency-dependent functional connectivity within resting-state networks: an atlas-based MEG beamformer solution. Neuroimage 2012; 59: 3909–3921.
Hipp JF, Hawellek DJ, Corbetta M, Siegel M, Engel AK . Large-scale cortical correlation structure of spontaneous oscillatory activity. Nat Neurosci 2012; 15: 884–890.
Marzetti L, Della Penna S, Snyder AZ, Pizzella V, Nolte G, de Pasquale F et al. Frequency specific interactions of MEG resting state activity within and across brain networks as revealed by the multivariate interaction measure. Neuroimage 2013; 79: 172–183.
Cohen MX, Elger CE, Ranganath C . Reward expectation modulates feedback-related negativity and EEG spectra. Neuroimage 2007; 35: 968–978.
Gehring WJ, Willoughby AR . Are all medial frontal negativities created equal? Toward a richer empirical basis for theories of action monitoring. In: Ullsperger M, Falkenstein M (eds). Errors, Conflicts and the Brain. Current Opinions on Performance Monitoring. Max Planck Institute of Cognitive Neuroscience: Leipzig, Germany, 2004, pp 14–20.
Marco-Pallares J, Cucurell D, Cunillera T, Garcia R, Andres-Pueyo A, Munte TF et al. Human oscillatory activity associated to reward processing in a gambling task. Neuropsychologia 2008; 46: 241–248.
Holroyd CB, Hajcak G, Larsen JT . The good, the bad and the neutral: electrophysiological responses to feedback stimuli. Brain Res 2006; 1105: 93–101.
HajiHosseini A, Rodriguez-Fornells A, Marco-Pallares J . The role of beta-gamma oscillations in unexpected rewards processing. Neuroimage 2012; 60: 1678–1685.
De Pascalis V, Varriale V, Rotonda M . EEG oscillatory activity associated to monetary gain and loss signals in a learning task: effects of attentional impulsivity and learning ability. Int J Psychophysiol 2012; 85: 68–78.
Kamarajan C, Rangaswamy M, Chorlian DB, Manz N, Tang Y, Pandey AK et al. Theta oscillations during the processing of monetary loss and gain: a perspective on gender and impulsivity. Brain Res 2008; 1235: 45–62.
Leicht G, Troschutz S, Andreou C, Karamatskos E, Ertl M, Naber D et al. Relationship between oscillatory neuronal activity during reward processing and trait impulsivity and sensation seeking. PLoS One 2013; 8: e83414.
Andreou C, Kleinert J, Steinmann S, Fuger U, Leicht G, Mulert C . Oscillatory responses to reward processing in borderline personality disorder. World J Biol Psychiatry 2015; 16: 575–586.
Schuermann B, Kathmann N, Stiglmayr C, Renneberg B, Endrass T . Impaired decision making and feedback evaluation in borderline personality disorder. Psychol Med 2011; 41: 1917–1927.
Cohen MX, Wilmes K, Vijver I . Cortical electrophysiological network dynamics of feedback learning. Trends Cogn Sci 2011; 15: 558–566.
Marco-Pallares J, Munte TF, Rodriguez-Fornells A . The role of high-frequency oscillatory activity in reward processing and learning. Neurosci Biobehav Rev 2015; 49: 1–7.
Walsh MM, Anderson JR . Learning from experience: event-related potential correlates of reward processing, neural adaptation, and behavioral choice. Neurosci Biobehav Rev 2012; 36: 1870–1884.
Cohen MX, Ranganath C . Reinforcement learning signals predict future decisions. J Neurosci 2007; 27: 371–378.
Carlson JM, Foti D, Mujica-Parodi LR, Harmon-Jones E, Hajcak G . Ventral striatal and medial prefrontal BOLD activation is correlated with reward-related electrocortical activity: a combined ERP and fMRI study. Neuroimage 2011; 57: 1608–1616.
Foti D, Weinberg A, Dien J, Hajcak G . Event-related potential activity in the basal ganglia differentiates rewards from nonrewards: temporospatial principal components analysis and source localization of the feedback negativity. Hum Brain Mapp 2011; 32: 2207–2216.
HajiHosseini A, Holroyd CB . Reward feedback stimuli elicit high-beta EEG oscillations in human dorsolateral prefrontal cortex. Sci Rep 2015; 5: 13021.
Mulert C, Jager L, Schmitt R, Bussfeld P, Pogarell O, Moller HJ et al. Integration of fMRI and simultaneous EEG: towards a comprehensive understanding of localization and time-course of brain activity in target detection. Neuroimage 2004; 22: 83–94.
Mulert C, Leicht G, Hepp P, Kirsch V, Karch S, Pogarell O et al. Single-trial coupling of the gamma-band response and the corresponding BOLD signal. Neuroimage 2010; 49: 2238–2247.
Mulert C . What can fMRI add to the ERP story? In: Mulert C, Lemieux L (eds). EEG-fMRI. Physiological Basis, Technique, and Applications. Springer: Heidelberg, Germany, 2010, pp 83–96.
Hauser TU, Iannaccone R, Stampfli P, Drechsler R, Brandeis D, Walitza S et al. The feedback-related negativity (FRN) revisited: new insights into the localization, meaning and network organization. Neuroimage 2014; 84: 159–168.
Mas-Herrero E, Ripolles P, HajiHosseini A, Rodriguez-Fornells A, Marco-Pallares J . Beta oscillations and reward processing: coupling oscillatory activity and hemodynamic responses. Neuroimage 2015; 119: 13–19.
Luft CD . Learning from feedback: the neural mechanisms of feedback processing facilitating better performance. Behav Brain Res 2014; 261: 356–368.
Patton JH, Stanford MS, Barratt ES . Factor structure of the Barratt Impulsiveness Scale. J Clin Psychol 1995; 51: 768–774.
Stanford MS, Mathias CW, Dougherty DM, Lake SL, Anderson NE, Patton JH . Fifty years of the Barratt Impulsiveness Scale: an update and review. Pers Individ Dif 2009; 47: 385–395.
Körner A, Drapeau M, Albani C, Geyer M, Schmutzer G, Brähler E . [German norms for the NEO-Five Factor Inventory]. Z Med Psychol 2008; 17: 133–144.
Gehring WJ, Willoughby AR . The medial frontal cortex and the rapid processing of monetary gains and losses. Science 2002; 295: 2279–2282.
Vega D, Soto A, Amengual JL, Ribas J, Torrubia R, Rodriguez-Fornells A et al. Negative reward expectations in borderline personality disorder patients: neurophysiological evidence. Biol Psychol 2013; 94: 388–396.
Benjamini Y, Hochberg Y . Controlling the false discovery rate—a practival and powerful approach to multiple testing. J R Statist Soc B 1995; 57: 289–300.
Raichle ME, MacLeod AM, Snyder AZ, Powers WJ, Gusnard DA, Shulman GL . A default mode of brain function. Proc Natl Acad Sci USA 2001; 98: 676–682.
Liu X, Hairston J, Schrier M, Fan J . Common and distinct networks underlying reward valence and processing stages: a meta-analysis of functional neuroimaging studies. Neurosci Biobehav Rev 2011; 35: 1219–1236.
Silverman MH, Jedd K, Luciana M . Neural networks involved in adolescent reward processing: an activation likelihood estimation meta-analysis of functional neuroimaging studies. Neuroimage 2015; 122: 427–439.
Pearson JM, Heilbronner SR, Barack DL, Hayden BY, Platt ML . Posterior cingulate cortex: adapting behavior to a changing world. Trends Cogn Sci 2011; 15: 143–151.
Fouragnan E, Retzler C, Mullinger K, Philiastides MG . Two spatiotemporally distinct value systems shape reward-based learning in the human brain. Nat Commun 2015; 6: 8107.
Luft CD, Nolte G, Bhattacharya J . High-learners present larger mid-frontal theta power and connectivity in response to incorrect performance feedback. J Neurosci 2013; 33: 2029–2038.
Crone EA, Wendelken C, Donohue SE, Bunge SA . Neural evidence for dissociable components of task-switching. Cereb Cortex 2006; 16: 475–486.
Sohn MH, Ursu S, Anderson JR, Stenger VA, Carter CS . The role of prefrontal cortex and posterior parietal cortex in task switching. Proc Natl Acad Sci USA 2000; 97: 13448–13453.
Grace AA, Bunney BS . Intracellular and extracellular electrophysiology of nigral dopaminergic neurons—3. Evidence for electrotonic coupling. Neuroscience 1983; 10: 333–348.
Bilder RM, Volavka J, Lachman HM, Grace AA . The catechol-O-methyltransferase polymorphism: relations to the tonic-phasic dopamine hypothesis and neuropsychiatric phenotypes. Neuropsychopharmacology 2004; 29: 1943–1961.
Volkow ND, Wang GJ, Fowler JS, Tomasi D, Telang F . Addiction: beyond dopamine reward circuitry. Proc Natl Acad Sci USA 2011; 108: 15037–15042.
Wanat MJ, Willuhn I, Clark JJ, Phillips PE . Phasic dopamine release in appetitive behaviors and drug addiction. Curr Drug Abuse Rev 2009; 2: 195–213.
Keiflin R, Janak PH . Dopamine prediction errors in reward learning and addiction: from theory to neural circuitry. Neuron 2015; 88: 247–263.
Assadi SM, Yucel M, Pantelis C . Dopamine modulates neural networks involved in effort-based decision-making. Neurosci Biobehav Rev 2009; 33: 383–393.
Costa RM . Plastic corticostriatal circuits for action learning: what's dopamine got to do with it? Ann N Y Acad Sci 2007; 1104: 172–191.
Moustafa AA, Gluck MA . A neurocomputational model of dopamine and prefrontal-striatal interactions during multicue category learning by Parkinson patients. J Cogn Neurosci 2011; 23: 151–167.
Jocham G, Ullsperger M . Neuropharmacology of performance monitoring. Neurosci Biobehav Rev 2009; 33: 48–60.
Hayden BY, Platt ML . Neurons in anterior cingulate cortex multiplex information about reward and action. J Neurosci 2010; 30: 3339–3346.
Williams ZM, Bush G, Rauch SL, Cosgrove GR, Eskandar EN . Human anterior cingulate neurons and the integration of monetary reward with motor responses. Nat Neurosci 2004; 7: 1370–1375.
Knutson B, Rick S, Wimmer GE, Prelec D, Loewenstein G . Neural predictors of purchases. Neuron 2007; 53: 147–156.
Matsumoto M, Matsumoto K, Abe H, Tanaka K . Medial prefrontal cell activity signaling prediction errors of action values. Nat Neurosci 2007; 10: 647–656.
Bush G, Vogt BA, Holmes J, Dale AM, Greve D, Jenike MA et al. Dorsal anterior cingulate cortex: a role in reward-based decision making. Proc Natl Acad Sci USA 2002; 99: 523–528.
Cohen JD, Braver TS, Brown JW . Computational perspectives on dopamine function in prefrontal cortex. Curr Opin Neurobiol 2002; 12: 223–229.
Schultz W . Multiple dopamine functions at different time courses. Annu Rev Neurosci 2007; 30: 259–288.
Comings DE, Blum K . Reward deficiency syndrome: genetic aspects of behavioral disorders. Prog Brain Res 2000; 126: 325–341.
Grose-Fifer J, Migliaccio R, Zottoli TM . Feedback processing in adolescence: an event-related potential study of age and gender differences. Dev Neurosci 2014; 36: 228–238.
Mas-Herrero E, Marco-Pallares J . Frontal theta oscillatory activity is a common mechanism for the computation of unexpected outcomes and learning rate. J Cogn Neurosci 2014; 26: 447–458.
Cohen MX, Axmacher N, Lenartz D, Elger CE, Sturm V, Schlaepfer TE . Good vibrations: cross-frequency coupling in the human nucleus accumbens during reward processing. J Cogn Neurosci 2009; 21: 875–889.
Hauser TU, Hunt LT, Iannaccone R, Walitza S, Brandeis D, Brem S et al. Temporally dissociable contributions of human medial prefrontal subregions to reward-guided learning. J Neurosci 2015; 35: 11209–11220.
Parts of this work were prepared in the context of H Frielinghaus’ doctoral dissertation at the Faculty of Medicine, University of Hamburg, Germany.
This work was funded by the German Research Foundation (SFB 936 / C6 to C.M.).
About this article
Cite this article
Andreou, C., Frielinghaus, H., Rauh, J. et al. Theta and high-beta networks for feedback processing: a simultaneous EEG–fMRI study in healthy male subjects. Transl Psychiatry 7, e1016 (2017). https://doi.org/10.1038/tp.2016.287
Trait sensation seeking is associated with heightened beta-band oscillatory dynamics over left ventrolateral prefrontal cortex during reward expectancy
Journal of Affective Disorders (2021)
Individual Alpha Peak Frequency, an Important Biomarker for Live Z-Score Training Neurofeedback in Adolescents with Learning Disabilities
Brain Sciences (2021)
Biological Psychology (2021)
Capture or suppression? Attentional allocation upon reward and loss-associated nonsalient distractors are supported by distinct neural mechanisms: An EEG study
Diminished Feedback Evaluation and Knowledge Updating Underlying Age-Related Differences in Choice Behavior During Feedback Learning
Frontiers in Human Neuroscience (2021)