Introduction

Decision-making involves several cognitive processes underlying the selection of the optimal choice among the existing alternatives1,2, including the assessment of risk when their potential outcomes are probabilistic rather than certain3. Considerable neurobiological evidence shows a close relationship between the computational and emotional facets of decision-making under risk, because the evaluation of prospective outcomes entails the anticipation of their rewarding or punishing affective consequences4,5.

Emotions are indeed considered to potentiate the appetitive or aversive drives generated by such anticipatory processes, thus modulating choice behaviour and adaptive behavioural learning6,7,8. In particular, Mellers’9 Decision affect theory emphasized the role of positive or negative affective states such as satisfaction or disappointment for outcomes better or worse than expected, respectively10,11. These basic feelings do not entail a sense of responsibility for probabilistic outcomes occurring regardless of one’s own decisions. Outcome evaluation and subsequent choices, however, are also shaped by the awareness that things might have been better or worse under a different choice, i.e. by the experience and anticipation of relief or regret, respectively9. These complex, cognitively mediated, emotions result from spontaneous counterfactual comparisons between the outcomes of selected vs. rejected options12, promoting the avoidance of the aversive experience of regret in subsequent choices7,8,13,14,15. The salience of such comparisons is enhanced both by the feeling of responsibility for one’s own outcomes16 and by the ease with which an alternative outcome can be mentally represented. The closeness between actual and counterfactual outcomes, resulting in so-called near-miss outcomes17, potentiates the affective and behavioural impact of regret-based learning18. The affective salience of near-miss outcomes is also considered to potentiate motivations towards maladaptive behaviours19 and might thus contribute to abnormal choices in pathological conditions. Indeed, the development of computational models of choice-related affects boosted the investigation of impaired decision-making in different neuro-psychiatric diseases, such as Parkinson’s disease20, obsessive–compulsive disorder21 or depression22. In particular, altered decision-making under risk represents a core phenotype in addictions, including alcohol use disorder (AUD)23,24.

Neurobiological models of addiction suggest that decision-making impairments in AUD may reflect defective behavioural adaptations to changes in reward contingencies, i.e. to “reward prediction errors” coding the difference between expected and actual outcomes25. The formation of alcohol habits is considered a result of maladaptive reinforced learning, strengthening the association between consumption and either rewarding, or the lack of aversive experiences26. The driving role of negative reinforcement, i.e., the need to escape the aversive state associated with the craving for alcohol27, might thus be increased both by the experience of regret, computationally coded as a “fictive prediction error”25, and by near-miss outcomes19. However, only few studies have investigated regret processing and/or anticipation in pathological populations28,29, including problem gambling30 but not AUD.

On this basis, we investigated possible alterations in adjusting choices to experienced disappointment, regret and near-miss outcomes in AUD patients compared with healthy controls. We used a Wheel of Fortune (WoF) task31, that allows to assess the extent to which choice behaviour is influenced by these variables, in addition to expected value8. In this task, subjects are repeatedly asked to choose between two gambles, depicted as wheels of fortune associated with specific paired combinations of monetary outcomes and levels of probability. The task was divided in two conditions associated with separate blocks. In the “partial feedback” (PF) condition, the spinning arrow and the related outcome were presented for the selected wheel only. In the “complete feedback” (CF) condition the spinning arrows and the associated outcomes appeared both in the selected and rejected wheels (see Sect. 4.2 for a detailed description of the task). Based on previous evidence of altered WoF performance in pathological populations6,29, we predicted that AUD patients’ choice behaviour would reflect a) decreased integration of anticipated regret, and b) increased influence of near-miss outcomes, compared with healthy controls. We also explored possible group differences concerning other aspects of decision-making performance, including response time (RT), overall financial performance (W) and number of time-outs (TO; i.e. trials in which subjects did not respond within the available time-window). Information processing speed is indeed considered a possible marker of cognitive decline in several neurological diseases32,33, with a prominent role in age-related motor slowing as well34. Moreover, previous studies investigating alcohol-related decision-making impairments have shown increased choice latency as a possible marker of psychomotor slowing35,36, highlighting the “output” stage of decision-making as the most vulnerable to chronic alcohol consumption. Based on our previous findings, we thus predicted slower choice latencies in AUD patients compared with controls.

Results

WoF task performance and learning curve

Mann–Whitney U tests on PF and CF mean values highlighted, for both conditions, slower RTs in AUD patients compared with healthy controls (PF: p = 0.031r = − 0.328; CF: p = 0.012, r = − 0.397) (Table 1c). Instead, neither W (p = 0.339) or TO (p = 0.495) variables were significantly different across groups (Table1c). We found a positive correlation only between CF RTs and age (r = 0.262, p = 0.041). ANCOVA confirmed a significantly slower performance, in AUD patients, in CF trials after removing age effect (p = 0.045, η2 = 0.092; Table 1c). This finding was confirmed by additional analyses comparing performance across runs. AUD patients were slower than controls in all CF runs, but only in PF runs 2 (p < 0.015) and 4 (p < 0.38) (Table 2a, Fig. 1a). When testing the overall RTs regardless of condition, we confirmed AUD patients’ slower performance in all runs (Table 2a, Fig. 1a).

Table 1 Demographic and clinical variables.
Table 2 Performance analysis.
Figure 1
figure 1

Task performance and learning curve. Panels (a) and (b) depict group differences in run-specific RTs and in learning curves, respectively, for partial- and complete-feedback conditions, or both.

Although we observed significant RT differences across runs in both groups (Table 2b), the analyses on learning curves showed fastest task performance at run 2 in the whole sample (AUD: p = 0.001, r = 0.715; HC: p = 0.005, r = 0.726), suggesting no group difference in the amount of time needed to stabilize performance (Table 2c, Fig. 1b).

Choice behaviour

We tested two models of choice, incorporating the effect of different choice variables on decision-making behaviour. In the former, we modelled the effects of anticipating disappointment (d) and regret (r), alongside the maximization of expected value (e), under the assumption that individuals should aim to maximize EV while also learning to avoid the negative feelings associated with disappointment and/or regret (see details in8). The results of this model showed that, for both groups, choices in the PF condition were significantly modulated only by the maximization of expected value. In the CF condition, instead, healthy controls displayed a significant modulation by anticipated disappointment (β = 0.0000283, p < 0.036) but not regret (β = 0.0019097, p < 0.087), while neither variables were associated with significant effects in AUD patients (Table 3).

Table 3 Choice behaviour at the WoF task: Model 1.

The second model aimed to investigate the modulation of choice behaviour by the affective experience associated with a near-miss outcome (NM) in the previous trial, in addition to the maximization of expected value (e). The results revealed group-specific modulations of choice behaviour depending on the experimental condition (Table 4). Choices in the PF condition were driven only by the maximization of expected value in healthy controls (β = 0.0008, p < 0.0001), but also by previous experiences of near-miss outcomes in AUD patients (β = 0.28, p < 0.006). The contrary was true for the CF condition, in which choices were guided only by expected value in AUD patients (β = 0.000191, p < 0.0001), and also by past experiences of near-miss outcomes on the unchosen gamble in healthy controls (β = − 0.28, p < 0.023).

Table 4 Choice behaviour at the WoF task: Model 2.

Discussion

Alcohol use disorder is one of the most prevalent psychiatric conditions worldwide. The Diagnostic and Statistical Manual of Mental Disorders—fifth edition—defined AUD as a chronic relapsing condition characterized by excessive alcohol consumption despite its devastating consequences on individuals’ physical, social and cognitive functioning47. In particular, the revised diagnostic criteria for AUD highlighted the impairment in behavioural control, an important high-order cognitive function implicated also in decision-making process.

Alcohol-related decision-making impairments are increasingly investigated by focusing on possible alterations of the computational facets of choice behaviour25,26. While previous studies have highlighted altered reinforcement learning as a core component of the typical vicious circle linking craving, immediate rewarding effects and long-term negative consequences, it is still unknown whether such impairments also involve choice-related, cognitively mediated, emotions such as regret and relief, which have been shown to support adaptive behavioural learning in healthy individuals8. We addressed this issue with a cognitively demanding decision-making task incorporating several choice variables, such as expected value, satisfaction for a gain or disappointment for a loss, complex emotions, such as regret or relief, and the influence of previously experienced near-miss outcomes.

AUD patients and healthy controls did not differ in terms of overall payoff, number of time-outs or learning curve throughout the task, with RTs stabilizing at the second run in both groups. AUD patients, however, were significantly slower than healthy controls in making their choices both in PF and CF conditions. In line with the role played by information processing speed in higher-order cognitive tasks37, these results confirm previous reports of alcohol-related increased deliberation time35,36,38, likely reflecting a generalized executive impairment extending to the output stages of decision-making39,40,41,42. However, when controlling for possible age effect in CF trials we observed significantly longer RTs in AUD patients compared with healthy controls. The results of behavioural modelling allowed assessing whether AUD patients’ sensitivity to this condition, over and beyond possible age effect, reflects a defective incorporation of anticipated negative emotions in choice behaviour.

In line with our hypothesis, AUD patients chose by maximizing expected value, but failed to minimize both disappointment and regret. Unlike healthy controls, who displayed a significant anticipation of disappointment, they thus neglected the affective consequences of their choices when evaluating gambles. This impairment might contribute to AUD patients’ behavioural alterations in everyday life. The ability to anticipate negative emotions is indeed considered a powerful motivator to change behavioural strategies in order to reach better outcomes43 associated with healthy behaviours. The defective estimation, and/or incorporation, of “reward” and “fictive” prediction errors might thus represent the computational basis of AUD patients’ neglect for the affective consequences of their choices. Since the associated emotions of disappointment and regret are considered to enhance adaptive behavioural learning from past experiences, driving motivated behaviour away from risk, the observed impairment is thus likely to promote patients’ inability to learn from the negative consequences of chronic alcohol consumption, and thus the maintenance of AUD. Unlike previous studies, we did not observe a significant minimization of anticipated regret in healthy controls6,8. This negative finding might be explained by the higher age mean and standard deviation in our sample (Table 1) compared with previous studies on regret processing7,29.

The hypothesis of a defective incorporation of affective information in AUD patients’ evaluative processes was supported by a second model of choice, testing the effect of near-miss outcomes alongside expected value. While both groups chose by maximizing expected value, they displayed different modulations by the emotional experience associated with near-miss outcomes, biasing only the PF condition in AUD patients, and only the CF condition in healthy controls. To date, near-miss outcomes have been interpreted either as frustrating events reinforcing maladaptive behaviours to diminish the associated negative emotional state45, or as positive reinforcers mentally represented as actual appetitive outcomes46. The former interpretation fits with the role played by negative reinforcement in AUD, i.e., by the need to escape the aversive state associated with the craving for alcohol27. In either case, however, near-miss outcomes are known to exert their effect by potentiating the affective load attached to what might have been under a different fate in the PF condition, or a different choice in the CF one. Regardless of a specific interpretation, this finding confirms that the defective integration of information concerning the unselected option represents a consistent trait of AUD patients’ decision-making processes, decreasing the chances of learning from previous negative experiences.

There are limitations to this study. First, the accurate case–control matching for demographic variables and our stringent inclusion criteria resulted in a small-to-moderate sample size. Moreover, the lack of specific measures of information processing speed did not allow establishing a causal connection between the executive and computational impairments displayed by AUD patients. The present results should thus be considered as preliminary evidence in need of further support from studies including larger samples and additional neuro-cognitive variables. Finally, here we focused our attention only on the computational aspects of cognitive-based emotions of regret and relief, disappointment and satisfaction, neglecting the role of individual differences in perception and interpretation of such complex affective states. Further investigations are needed in order to assess the relationships between the impaired ability of AUD patients in implementing cognitive-based emotions during decision-making and their relative subjective feelings. To the best of our knowledge, however, this is the first study investigating AUD patients’ decision-making performance by modelling the anticipation of complex emotions resulting from counterfactual thinking31. Although preliminary and in need of further supporting evidence, our results highlight the defective implementation, and thus avoidance, of disappointment and regret as a component of AUD patients’ alterations in learning from negative experiences. Future research might build on these results by addressing gender differences, the neural bases of these alterations, as well as the development and assessment of treatment protocols specifically focused on the implementation of emotional experiences in choice-related behaviours.

Materials and methods

Participants

Twenty-six adult AUD patients (10 females; mean age: 46.50 years ± 8.25; range: 29–64; mean education: 10.88 years ± 3.51) and 19 age- and education-matched healthy control subjects (8 females; mean age: 45.11 years ± 8.69; range: 27–57; mean education: 10.63 years ± 3.05) participated in the study. A chi-square test confirmed that the distribution of males and females was not significantly different across AUD patients and healthy controls (p = 0.805). AUD patients were enrolled from the Functional Rehabilitation Unit of ICS Maugeri-Pavia (Italy), and healthy controls were recruited via local advertisement. There was no significant demographic difference between AUD patients and healthy controls (Table 1). Average disease duration in AUD patients ranged from 1 to 26 years (mean: 10.77 years ± 6.78). Inclusion criteria for AUD patients were: 1) age between 20 and 60 years; 2) a diagnosis of alcohol dependence according to DSM-V diagnostic criteria. Exclusion criteria for both AUD patients and control subjects were: (1) presence or history of neurological or psychiatric disorders other than AUD, or any comorbid disorder except for nicotine dependence; (2) family history of neurological or psychiatric disorders; (3) current use of any psychotropic substance or medication; (4) past brain injury or loss of consciousness; (5) major medical disorders (e.g. kidney or liver diseases, severe diabetes and/or malnutrition); (6) inability to undergo the neuropsychological assessment. Healthy controls were also excluded in case of presence or history of alcohol abuse. AUD patients joined the experimental protocol after being detoxified for at least 10 days, via medically supported standard treatments. However, they had ceased benzodiazepine treatment at least 8 days before scanning. Healthy participants were at least abstinent 10 days before scanning. All participants provided written informed consent to the experimental procedure, which was approved by the local Ethical Committee of ICS Maugeri-Pavia. The investigation was conducted in accordance with the latest version of the Declaration of Helsinki (see also35,39,40,41,42).

Wheel of Fortune (WoF) Task

The WoF task is an experimental paradigm adapted from Mellers and colleagues31, previously used to investigate the contribution of choice-related emotions such as satisfaction/disappointment and relief/regret to decision-making under risk6,8,29. Subjects are repeatedly asked to choose between two gambles, depicted as wheels of fortune, on the left and right halves of the screen. Each wheel is divided in two sectors, i.e. green (left) and red (right), always associated with the best and worse outcomes, respectively (Fig. 2). In each gamble, the possible outcomes involve paired combinations of 200, 50, − 50 and − 200 (arbitrary units), associated with 3 different levels of probability (20–80, 50–50 and 80–20) represented by the size of the green/red sectors. Therefore, the possible counterfactual combinations of wins and losses result in four potential levels of regret and relief (± 100, ± 150, ± 250 and ± 400) when subjects are shown the outcomes of both the selected and unselected gambles (CF). Thus, if the computational process results in a positive outcome they will experience relief, otherwise regret. In this condition, indeed, they can evaluate not only the financial consequences of their decision, but also the outcome they might have obtained, if they had selected the alternative gamble. In the PF condition, instead, only the outcome of the selected gamble is shown, thus resulting in satisfaction or disappointment for the best or worse outcome, respectively, without a sense of personal responsibility.

Figure 2
figure 2

Wheel of fortune (WoF) task. Subjects are repeatedly asked to choose between two gambles, depicted as wheels of fortune associated with specific paired combinations of monetary outcomes (200, 50, − 50 and − 200) and levels of probability represented by the size of the green/red sectors (20–80, 50–50 and 80–20). The possible counterfactual combinations of wins and losses result in four potential levels of regret and relief (± 100, ± 150, ± 250 and ± 400) when subjects are shown the outcomes of both the selected and unselected gambles (CF). In the PF condition, instead, only the outcome of the selected gamble is shown, thus resulting in satisfaction or disappointment for the best or worse outcome. The gambles were shown for 4.5 s, during which subjects could evaluate them and make a decision. Next, the appearance of an asterisk in the centre of the screen prompted them to choose a gamble, which was highlighted by a yellow contour. Two seconds after the appearance of the asterisk a white arrow appeared in a random position in both wheels (CF) or only in the selected wheel (PF). One second later, the arrow(s) started spinning clock-wise, with different and random initial speed(s), and stopped after exactly 4 s, indicating the final outcome(s), that remained on the screen for 3 s.

In the present study, the gambles were shown for 4.5 s, during which subjects could evaluate them and make a decision. Next, the appearance of an asterisk in the centre of the screen prompted them to choose, by pressing one of two buttons on a keyboard with their right index or middle fingers. Subjects had 2 s to choose the gamble, otherwise they received an “out of time” message, and a new trial started. Once selected, the chosen gamble was highlighted by a yellow contour, that would remain on the screen up to the end of the trial, and 2 s after the appearance of the asterisk a white arrow appeared in a random position in both wheels (CF) or only in the selected one (PF). One second later, the arrow(s) started spinning clock-wise, with different and random initial speed(s), and stopped after exactly 4 s, indicating the final outcome(s), that remained on the screen for 3 s. The spatial distance between the resting position of the spinning arrow and the boundary between the green and red sectors was used to code three possible values of “closeness” of near-miss outcomes (see below).

Each participant performed 192 trials overall, blocked in 4 runs of 24 trials per condition. The order of CF and PF blocks was counterbalanced across subjects. Because of the complexity of the task and in order to make participants confident with answer’s procedure, each subject performed a training session before to start.

Definition of choice variables

We used the following variables to investigate possible group differences in task performance and choice behaviour: response time to select the gamble (RT); number of time-outs (TO); expected value (EV, i.e. the sum of the value of the two possible gamble outcomes, each weighted by the corresponding probability); gain (W, the value associated with the outcome); disappointment (d, the negative emotion associated with the difference between the actual and unobtained outcomes of the selected gamble); regret (r, negative emotion associated with the difference between the outcomes of the selected and rejected gambles); near-miss outcomes (NM; the degree of “closeness” of non-win outcomes).

For each variable we computed: a) the overall mean value in PF and CF conditions separately; b) the mean value in each of the 4 PF and CF runs; c) the mean value for each of the 4 runs regardless of the experimental condition.

Analysis of choice performance

To evaluate task performance, we considered RT, TO and W variables as representative indexes of subjects’ ability to perform the WoF task. Thus, we first checked for possible group differences by means of Mann–Whitney non-parametric U tests on the overall PF and CF mean values, separately. We then checked for age and education effects by means of correlation analyses. For those variables showing both between-group differences and a significant effect of age and/or education, we ran an Analysis of Covariance (ANCOVA) to assess the stability of results after removing their effects. We applied a primary statistical threshold of p < 0.05, one-tailed due to a priori hypotheses of alcohol-related impairment23.

To investigate possible effects of chronic alcohol consumption on learning curves, we then explored group differences among runs. Namely, for the variables showing significant group differences in the previous analysis we performed additional Mann–Whitney non-parametric U tests on run-specific mean values regardless of condition, as well as PF and CF mean values separately. Based on the considerable executive load of the WoF task, we also assessed possible group differences in learning abilities in terms of time needed for task execution. To this purpose, we first applied the Friedman test within each group, to highlight significant RT differences along the four runs. We then performed post-hoc analyses (Wilcoxon signed-rank test) to identify the specific run at which each group reached the maximum (i.e. fastest) performance. For each statistical analysis we performed also effect size calculation, i.e. Glass rank biserial correlation for non-parametric tests and partial eta squared for ANCOVA. Statistical analyses were performed with SPSS (IBM Corp. Released 2015. IBM SPSS Statistics for Macintosh Version 23.0. Armonk, NY: IBM Corp.) and JASP (https://jasp-stats.org/).

Analysis of choice behaviour

We applied regression analyses, using a panel logit procedure with an individual random effect, to unveil the contribution of different choice-related emotions to decision-making performance. The panel data analysis modelled each subject as unit, and each trial as time variable. The random-effects model was used as the default model, and the parameters were estimated by maximum likelihood. We tested two models of choice, incorporating the effect of different choice variables on decision-making behaviour.

In the former, we modelled the effects of anticipating disappointment (d) and regret (r), alongside the maximization of expected value (e) (see details in8). The probability of choosing gamble 1 is:

$$Pr\left({g}_{1st}\right)=1-Pr\left({g}_{2st}\right)=F\left[{d}_{st};{r}_{st}{;e}_{st}\right]$$
(1)

where s = subject, t = time and F[θ] denotes the function eθ/(1 + eθ). The variables d and r, as described in Eqs. 2 and 3, indicate the process of minimizing future disappointment and future regret, respectively; e indicates the result of maximizing expected values. x1 and y1 represent the better and worst outcome of gamble 1 (g1), and x2 and y2 represent the better and worst outcome of gamble 2 (g2). The probability of x1 is p and the probability of y1 is 1 – p; the probability of x2 is q, and the probability of y2 is 1 – q.

$$d=\left({y}_{2}-{x}_{2}\vee \left(1-q\right)\right)-\left({y}_{1}-{x}_{1}\vee \left(1-p\right)\right)$$
(2)
$$r=\left|{y}_{2}-{x}_{1}\right|-\left|{y}_{1}-{x}_{2}\right|$$
(3)
$$e=EV\left({g}_{1}\right)-EV\left({g}_{2}\right)=\left({px}_{1}+\left(1-p\right){y}_{1}\right)-\left({qx}_{2}+\left(1-q\right){y}_{2}\right)$$
(4)

In the CF condition, the optimal behaviour depends on the subject’s ability to minimize disappointment (d; Eq. 2) and regret (r; Eq. 3), while maximizing expected value (e; Eq. 4). In the PF condition, instead, subjects can only integrate in their evaluative process the effect of anticipated disappointment and maximization of expected values (see details in8).

The second model aimed to investigate the modulation of choice behaviour by the affective experience associated with a near-miss outcome (NM) in the previous trial, in addition to the maximization of expected value (e). Therefore, choice behaviour depends a) both on the maximization of expected value (Eq. 4) and near-miss outcome of both gambles in the CF condition; b) only on expected value and near-miss of the chosen gamble in the PF condition. The probability of choosing g1 is:

$$Pr\left({g}_{1st}\right)=1-Pr\left({g}_{2st}\right)=F\left[{NM}_{st}{;e}_{st}\right]$$

where the NM regressor included three possible values, depending on the proportional distance between the resting position of the spinning arrow and the boundaries between the best and worse gamble outcomes. Therefore, the participants could experience a "clear" outcome, when the arrow stopped in the central area of the best or worse gamble segments; a full near-miss outcome, when it stopped very close to the boundary between them; and a partial near-miss outcome, when it stopped in an approximately intermediate area between the former positions.

The analysis of choice behaviour was carried out with STATA (StataCorp. 2019. Stata Statistical Software: Release 13. College Station, TX: StataCorp LLC.).