Introduction

Tobacco smoking is a major risk factor for several chronic diseases including cancer, lung diseases, and cardiovascular diseases1, imposing immense costs to individuals and society. A crucial question is why smokers so often fail in smoking cessation despite wanting to quit and knowing the potentially deadly long-term consequences. The high rate of relapse in chronic nicotine dependence is thought to at least partially stem from deficits to impulse control and exaggerations in the steepness of delay discounting2,3,4, alongside various cognitive and social factors5,6. Delay discounting refers to the decline in the subjective value of rewards with time until their receipt, as seen for instance when individuals prefer an immediate reward (e.g., smoking a cigarette) over a long-term outcome (e.g., avoidance of negative long-term consequences of smoking)7. A large body of evidence suggests that smokers devalue outcomes in the future more strongly than non-smokers (for a meta-analysis, see8), with steeper discounting of future outcomes predicting both a higher risk of smoking initiation and lower rates of successful quitting9.

Despite evidence for impulse control deficits and steeper delay discounting in nicotine dependence, smokers often deny having problems with delay discounting and overestimate their capacity to quit smoking10,11,12. A discrepancy between objective smoking-related decisions and self-reported beliefs about those decisions might indicate deficits in metacognitive accuracy. The term “metacognitive accuracy” (or “metacognitive sensitivity”) refers to the ability to monitor, evaluate, and reliably report the accuracy of decision processes13,14. Recent theoretical accounts highlight the moderating role of metacognitive beliefs for substance use behavior15,16,17. For example, refusal self-efficacy—confidence in the ability to refuse smoking—moderates smoking behavior in adolescents18,19. However, these accounts mainly focus on the contents of metacognitive beliefs (e.g., beliefs about the controllability of one’s desires), whereas the role of the accuracy of metacognitive beliefs for substance use behavior remains unknown. Thus, despite the evidence that smokers might be too optimistic regarding their impulse-control capacities or preferences for delayed over immediate rewards, it has never been experimentally tested whether the accuracy of metacognitive processes is lower in smokers than in non-smokers.

Understanding whether smokers show not only differences in impulse control or delay discounting but also in metacognition is important given that not only objective time preferences but also subjective beliefs about one’s preference for future over immediate rewards guide human behavior7,20,21,22. For example, only if smokers anticipate that they will give in to smoking a cigarette when going to a party can they avoid such situations where their capacity to resist immediate temptations would not be sufficient. Conceptually, prospective decisions where decision makers voluntarily restrict their access to temptations presuppose metacognitive awareness of one’s preferences for delayed versus immediate rewards7,21,23. Recent findings suggest that better metacognitive accuracy indeed predicts a higher likelihood of restricting one's access to immediate rewards when anticipating potential preference reversals24,25. If smokers overestimate their preferences for future over immediate rewards due to lack of metacognitive insight into their preferences, they may thus also show a lower willingness to precommit to long-term rewards in order to avoid preference reversals (i.e., switching from one’s current preference for the delayed reward to preferring the smaller-sooner reward when re-deciding in the future). From this perspective, metacognitive awareness of one’s preferences for delayed versus immediate rewards represents the precondition for the ability to accurately anticipate potential preference reversals from delayed to immediate rewards. As the anticipation of potential preference reversals belongs to the driving forces of precommitment decisions7,23, deficits in metacognition may thus reduce the sensitivity to preference reversals during precommitment decisions. Metacognitive deficits in nicotine addiction might also challenge the theory of rational addiction according to which substance abuse can be understood as rational and farsighted maximization of an individual’s utility26,27. Conceptualizing addiction as rational forward-looking behavior presupposes that decision makers possess accurate representations of their economic preferences. Evidence for metacognitive deficits in addiction would therefore challenge the plausibility of this account. However, because research on delay discounting in smokers (and other forms of substance abuse/dependence in general) has mainly focused on the objective tendency to choose delayed over immediate rewards, it remains unknown whether smokers indeed have a reduced willingness to precommit to avoid preference reversals compared to non-smokers, and how any such changes might be related to possible metacognitive deficits.

Here, we addressed these questions by investigating whether smokers, relative to non-smokers, show metacognitive deficits and a lower demand for precommitment (i.e., binding, irreversible choices of delayed rewards). First, we hypothesize that smokers, compared with non-smokers, have worse metacognitive accuracy regarding their preferences for delayed over immediate outcomes and more strongly overestimate the value of future rewards relative to their revealed preferences (i.e., preferences for delayed versus immediate rewards as determined via observable choice behavior rather than self-report). Second, we expect that smokers, relative to non-smokers, also show a lower sensitivity to the risk of potential preference reversals when making decisions about whether or not to precommit to long-term rewards, with the metacognitive deficits (see first hypothesis) statistically explaining the lower sensitivity to preference reversals. To test these hypotheses, we administered decision tasks that allow quantifying the degree of metacognitive awareness of one’s preferences for delayed over immediate rewards. In the confidence accuracy task, 37 smokers and 38 non-smokers made choices between smaller-sooner (SS; e.g., 3 euro today) and larger-later (LL; e.g., 5 euro in 90 days) monetary rewards (Fig. 1A). After each choice, they rated their subjective confidence in having made the best possible choice. In line with previous procedures24,25,28, this allows determining the degree to which individuals can reliably report uncertainty in the choice process while deciding between SS and LL rewards (“metacognitive accuracy”). Moreover, we asked participants to estimate the subjective value of monetary rewards in the future, and compared these self-reported estimates with the subjective values derived from choice behavior28. If smokers overestimate the values of future rewards relative to their revealed preferences, self-reported values of future rewards should be higher than the observed reward values determined by participants’ choices in the decision task. Lastly, to assess how metacognitive insight influences the willingness to restrict access to short-term rewards, participants also performed a precommitment task where they decided between making binding choices for LL rewards and postponing the final decision, which bears the risk that they might reverse their preference for the LL to the SS reward when having to make the final decision24,25. If smokers have less insight into how their preferences change over time and overestimate the value of future rewards, we expect them to be less sensitive to the risk of potential preference reversals in the precommitment task than non-smokers.

Figure 1
figure 1

(A) In the confidence accuracy task, participants made choices between smaller-sooner (SS; e.g., 3 euro today) and larger-later (LL; e.g., 5 euro in 90 days) rewards. After each choice, they had to rate their subjective confidence to have made the best possible choice. (B) Consistent with previous findings, smokers discounted future rewards steeper than non-smokers. (C) Smokers also showed lower metacognitive accuracy than non-smokers, as indicated by a smaller difference between the slopes (which capture decision uncertainty) for low versus high confidence. For illustration purpose, we split confidence ratings into low versus high confidence trials. Shaded areas indicate 95% confidence intervals.

Results

First, we aimed to replicate previous findings of steeper delay discounting in nicotine dependence. We compared the posterior distributions of the group-level hyperbolic discount parameters between smokers and non-smokers by subtracting the posterior distribution of the discount factor in the smoker group from the posterior distribution in the nonsmoker group (if the 95% HDI of the differences between the group-specific posterior distributions does not contain zero, the group difference can be considered as being significant). Consistent with the literature8, this comparison revealed that the discount factor k was significantly larger (indicating steeper delay discounting) in the smoker than in the non-smoker group, HDImean = -0.023, HDI95% = [− 0.042; − 0.005] (Fig. 1B). Smokers and non-smokers were comparable with respect to age, sex, educational status, working memory capacity, and verbal intelligence, all ps > 0.27 (Table 1), suggesting that the difference in delay discounting cannot be explained by any pre-existing differences in these variables. However, there were significant group differences in the BIS and BAS scales, both p < 0.05, consistent with previous evidence29. Note that all statistical analyses were robust to adding the BIS/BAS scores as well as age and educational status as covariates of no interest.

Table 1 Demographic characteristics of the non-smoker and smoker groups.

We next asked whether smokers and non-smokers also differ in their metacognitive accuracy. Following previous procedures24,25, we assessed metacognitive accuracy with a mixed generalized linear model (MGLM) where binary choices in the confidence accuracy task were regressed on predictors for Group (non-smoker versus smoker), difference in value (DV; value LL reward minus value SS reward), confidence, and the interaction terms. The strength of the interaction between DV (as measure of objective decision uncertainty) and confidence (subjective uncertainty) is an indicator of metacognitive accuracy. This is because participants should be more confident that they made the best choice the larger the difference in subjective value between the options, and the DV × Confidence interaction measures participants’ ability to track and report the consistency of their decisions. Smokers made significantly fewer LL choices than non-smokers, HDImean = − 3.33, HDI95% = [− 5.53; − 1.35]. Moreover, the main effects of both DV, HDImean = 5.76, HDI95% = [4.87; 6.76], and confidence, HDImean = 1.08, HDI95% = [0.64; 1.57], were significant, the latter suggesting that non-smokers reported higher confidence after LL relative to SS choices. This association between higher confidence and LL choices was significantly reduced in the smoker group, Group × Confidence: HDImean = − 1.00, HDI95% = [− 1.64; − 0.45], and smokers also showed lower choice consistency (as the predictor DV indicates how consistently participants’ choices are guided by the strength of their preference for one option over the other), Group × DV: HDImean = − 1.68, HDI95% = [− 2.93; − 0.55]. The DV × Confidence interaction, HDImean = 1.85, HDI95% = [1.38; 2.34], suggested that non-smokers could reliably track and report uncertainty in decision making, a measure of metacognitive accuracy. Importantly, metacognitive accuracy was significantly reduced in smokers compared to non-smokers, HDImean = − 0.64, HDI95% = [− 1.19; -0.08] (Fig. 1C and Table 2). The significant Group × DV × Confidence interaction was robust to adding BIS/BAS scores, age, and educational status as covariates of no interest, HDImean = − 0.59, HDI95% = [− 1.15; − 0.04]. This suggests that, in addition to their steeper delay discounting, smokers possess worse metacognitive access to the accuracy of their intertemporal choice process.

Table 2 Results of Bayesian MGLM in the confidence accuracy task, regressing binary choices (0 = immediate reward, 1 = delayed reward) on predictors for Group (0 = nonsmoker, 1 = smoker), difference in value (DV), confidence, and the interaction terms.

The observed deficits in metacognitive accuracy raise the question as to whether smokers are less able to recognise and correct for any biases in their intertemporal decision-making. Given that smokers show steeper delay discounting than non-smokers, it seems plausible to assume that they more strongly overestimate the value of future rewards relative to their revealed preferences. To test this, we compared the differences between self-reported subjective values of delayed rewards (bidding task) and the choice-based, revealed subjective values (confidence accuracy task) between study groups (value bias; Fig. 2A). In the control group, participants significantly overestimated the value of future rewards for longer delays, HDImean = 0.31, HDI95% = [0.13; 0.49]. Crucially, smokers more strongly overestimated the value of future rewards than non-smokers, HDImean = 0.78, HDI95% = [0.23; 1.34], and this group difference was stronger for longer delays, Group × Delay: HDImean = 0.39, HDI95% = [0.13; 0.65] (Fig. 2B and Table 3). This suggests that smokers, compared with non-smokers, overestimate their preferences for delayed over immediate rewards. Again, the results of this analysis were robust to controlling for BIS/BAS scores, age, and educational status, HDImean = 0.40, HDI95% = [0.13; 0.65]. The strength of this bias was negatively correlated with individual differences in metacognitive accuracy (individual coefficients for the DV × Confidence interaction), r = − 0.64, p < 0.001. This suggests that smokers’ overestimation of their preference for future over immediate rewards correlates with their deficits in metacognitive accuracy.

Figure 2
figure 2

(A) In the bidding task, participants indicated which immediate reward magnitude they consider as equivalent to a given reward delivered in the future on a rating scale from 0 to 5 euro. The difference between subjective reward values estimated from observed choices in the confidence accuracy task (via individual hyperbolic discount parameters) and the self-reported values from the bidding task reflects the degree to which individuals over- or underestimate the value of future rewards relative to their revealed preferences. (B) Smokers, relative to non-smokers, overestimate their preferences for delayed over future rewards, particularly for longer delays, as indicated by higher self-reported than observed subjective values. Black dots indicate individual data points.

Table 3 Results of Bayesian MGLM for the bidding task, regressing differences between revealed and self-reported subjective values on predictors for Group (0 = nonsmoker, 1 = smoker), Delay, and the interaction term.

Distorted representations of one’s time preferences may have negative consequences for prospective decision making, considering that individuals may prefer to restrict their access to immediate temptations particularly when anticipating that their impulse control will not be sufficient to resist the temptation24. We therefore assessed whether smokers, due to their metacognitive deficits, also show a lower willingness to precommit to long-term rewards. The precommitment task allowed us to assess the willingness to precommit as a function of the individual risk of preference reversals (Fig. 3A). Reversal risk was defined as the value difference between the LL and SS reward from participants’ current perspective minus the value difference between these options when participants had to make a final decision 28 days later. Higher values of this variable indicate that an individual is likely to switch from preferring the LL over the SS reward to preferring the SS over the LL reward when re-deciding between these options 28 days later. Smokers did not generally (i.e., irrespective of the risk of preference reversals) make fewer precommitment choices than non-smokers, HDImean = 0.69, HDI95% = [− 0.62; 2.05]. However, while non-smokers increasingly preferred to precommit with a higher risk of preference reversals, HDImean = 1.85, HDI95% = [0.03; 3.67], smokers were significantly less sensitive to the risk of preference reversals than non-smokers, Group × Reversal risk: HDImean = − 2.25, HDI95% = [− 4.45; − 0.06] (Fig. 3B and Table 4). The Group × Reversal risk interaction remained significant when controlling for BIS/BAS scores, age, and educational level, HDImean = − 2.21, HDI95% = [− 4.45; − 0.04]. A separate MGLM for the smoker group revealed no significant effect of Reversal risk on precommitment choices in smokers, HDImean = − 0.17, HDI95% = [− 1.59; 1.26]. Thus, smokers are less willing to restrict their access to immediate rewards if the risk of preference reversals is high, perhaps because they underestimate their preferences for immediate over delayed rewards. In line with this interpretation, individual differences in the influence of the risk of preference reversals on precommitment choices (individual coefficients for Reversal risk) were significantly correlated with value bias (individual intercepts extracted from the MGLM in the bidding task), r = − 0.32, p = 0.006 (Fig. 3C), as well as with metacognitive accuracy (individual coefficients for DV × Confidence from the MGLM in the confidence accuracy task), r = 0.36, p = 0.002. Note that the correlation between Reversal risk and Value bias remained significant when using absolute rather than signed scores for Value bias, r = − 0.35, p = 0.003. This suggests that individuals with better metacognitive skills or who less overestimate their preference for LL over SS rewards (relative to their revealed preferences) also show a stronger sensitivity to potential preference reversals when making precommitment choices.

Figure 3
figure 3

(A) In the precommitment task, participants opted between making a binding choice for a larger-later reward (e.g., 5 euro in 68 days) or postponing the decision. In the latter case, participants were re-contacted after 28 days and had to make a final choice between the adjusted reward options (in this example, “3 euro today versus 5 euro in 40 days”). (B) While non-smokers more strongly preferred to make a binding choice with increasing risk of preference reversals (i.e., the preference switches from the larger-later to the smaller-sooner option when being re-contacted after 28 days), smokers’ precommitment choices were unaffected by potential preference reversals. (C) The sensitivity to preference reversals correlated with individual differences in value bias, i.e. the degree to which individuals overestimate their preference for delayed over immediate rewards. Shaded areas in (B) indicate 95% confidence intervals.

Table 4 Results of Bayesian MGLM in the precommitment task, regressing precommitment choices (0 = no precommitment, 1 = precommitment) on predictors for Group (0 = nonsmoker, 1 = smoker), difference in value (DV), reversal risk (DV in 28 days minus DV now), and the interaction terms.

Next, we tested whether smokers’ value bias statistically explains the weaker influence of future preference reversals on precommitment choices with a mediation analysis. For this purpose, we re-computed the MGLM for the precommitment task and added the individual coefficients for value bias as predictor. This allowed us to assess whether the significant effect of smoking status on the sensitivity to reversal risk can be explained by the group differences in value bias (mediator variable). The result of a Sobel test assessing the significance of the indirect path (i.e., whether group differences in sensitivity to reversal risk are mediated by value bias), t = 2.02, p = 0.04, suggests that the degree to which individuals overestimate the value of LL rewards relative to their revealed preferences indeed mediates group differences in the sensitivity to Reversal risk. When we assessed whether smokers’ lower sensitivity to preference reversals is statistically mediated by the group difference in metacognitive accuracy, the Sobel test showed only a marginally significant result, t = 1.94, p = 0.05. Taken together, this suggests that discrepancies between self-reported and revealed subjective values (at least partially) explain why smokers are worse in anticipating and acting upon potential preference reversals.

Discussion

Delay discounting is elevated in nicotine dependence8,9 and predicts (alongside other factors) both smoking initiation and the likelihood of relapsing2,3,30. However, less is known about the role of smokers’ metacognitive awareness of their time preferences. Here, we provide evidence for metacognitive deficits in nicotine dependence and their influence on prospective decision making. Smokers, relative to non-smokers, could less reliably report decision uncertainty in intertemporal decision making (metacognitive accuracy). Smokers also overestimated the self-reported value of future rewards relative to their revealed preferences. The finding that smokers overestimate their preference for future over immediate rewards informs previous questionnaire data according to which smokers are rather optimistic regarding their capacity to quit smoking10,11,12. While previous research has not directly compared self-reported with revealed time preferences, we provide evidence that smokers, compared with non-smokers, overestimate their preference for future over immediate rewards. Thus, nicotine dependence is characterized not only by steeper delay discounting but also by weaker metacognitive insight into time preferences and a stronger discrepancy between self-reported and revealed preferences for delayed rewards. We note that this discrepancy should not be necessarily interpreted as reflecting a metacognitive deficit, as choosing between LL and SS rewards (confidence accuracy task) might require different processes than estimating the value of LL rewards (bidding task). For example, choosing LL over SS rewards is thought to require higher-order control processes31. Deficits in such control processes might therefore contribute to the larger discrepancy between self-reported and revealed subjective values in smokers compared to non-smokers.

Metacognitive deficits may contribute to deficient prospective decision making in nicotine dependence by impairing the anticipation of potential preference reversals. This interpretation is corroborated by the findings in the precommitment task where the risk of changing the preference from LL to SS rewards in future choices showed a weaker impact on smokers’ than on non-smokers’ precommitment decisions. Replicating previous findings25, metacognitive abilities significantly correlated with the individual sensitivity to preference reversals, which supports the assumption that the efficient use of precommitment requires metacognitive insight into one’s time preferences. The significant mediation analysis suggests that because smokers overestimate their preference for delayed over immediate rewards, they may have difficulties in anticipating future situations where they might give in to the temptation to smoke despite currently preferring not to smoke (though one should hesitate to infer causal links between metacognition and precommitment from the mediation analysis). A lower sensitivity to preference reversals might potentially increase the risk of relapses32, albeit this remains speculative and will need to be tested with more ecologically valid measures of preference reversals in nicotine addiction.

The current findings have important implications for our conceptual understanding of delay discounting and impulse control deficits in nicotine dependence. Substance dependence can be conceptualized as arising at least in part from a dysfunctional dominance of the desire for immediate rewards over the pursuit of long-term goals33,34,35. Correspondingly, previous research focused on delay discounting in intertemporal choice as a behavioral marker of the preference for immediate rewards in addiction8,9. Additionally pharmacological or brain stimulation-based treatments in substance dependence/abuse aim at re-calibrating the balance between brain mechanisms involved in impulse control and wanting36,37,38,39.

Besides impulse control, however, voluntary self-restrictions are also an effective strategy which improve resistance to immediate temptations7,40. Precommitment devices where individuals voluntarily self-restrict their access to the desired substance have been designed to improve pharmacological treatments of substance dependence41,42. However, the use of such precommitment devices presupposes that patients possess metacognitive access to their time preferences. The current findings suggest that this metacognitive insight might be impaired in individuals with substance use disorders, which may hamper the efficient use of such precommitment devices. An important implication of our findings is that therapeutic neural interventions should target not only brain regions involved in impulse but also in metacognitive processes, for example the frontopolar cortex25,43,44, as enhancing metacognition may optimize the use of precommitment strategies. This is supported by evidence for abnormal activation in anterior prefrontal cortex during performance monitoring processes in addiction17, which may form the neural substrate of the observed metacognitive deficits. Alternatively, behavioral interventions like metacognitive training45 may allow for a reduction of dysfunctional metacognitive biases in substance dependence.

The current findings may also shed a new light on economic theories of addiction. According to the influential theory of rational addiction, substance abuse can be conceptualized as rational behavior that maximizes an individual’s utility function26,27. However, if smokers possess reduced metacognitive insight into their economic time preferences (as evidenced by the current findings), it seems questionable whether they can indeed integrate all future benefits and costs of substance abuse, which challenges the assumptions of the theory of rational addiction.

It is worth mentioning some limitations of the current study: first, the administered experimental tasks used monetary rewards, whereas impulsive decisions in nicotine dependence require trading-off the immediate consumption of cigarettes versus the negative long-term consequences of smoking. Note that the use of a dependence-unrelated currency was necessary in order to compare behavior between smokers and non-smokers, and there is evidence that the proposed link between metacognition and precommitment holds also for primary rewards24,43,46. Moreover, smokers discount monetary rewards and cigarettes to a similar extent47. While we are therefore optimistic that the reported findings will hold also for intertemporal decisions involving substance-related rewards, this will need to tested by future studies employing more ecologically valid measures of impulsiveness and craving in addiction. We note that decisions may not only be motivated by the desire for immediate rewards but also by the desire for avoiding the aversive state of craving48, whereas the current task measured only the preference for immediate rewards. As further limitation, we note that smokers made choices in a “cold” state as we placed no restrictions on their smoking before the experiment. We are thus cautious in drawing any conclusions regarding smokers’ behavior in a “hot” craving state, in which smokers have been found to overestimate their desire for future cigarette consumption compared to when being in a “cold” state49. Moreover, we did not control for participants’ income or socio-economic status, although these factors might potentially contribute to the observed group differences in delay discounting. Finally, one should keep in mind that the delay used in the confidence accuracy task (with a maximum delay of one year) are relatively short-term compared with the long-term consequences of smoking, which further constrains the clinical relevance of our current findings.

Taken together, the current findings inform theoretical accounts of substance dependence by providing evidence that—besides elevated delay discounting and deficits in impulse control—nicotine dependence is characterized by metacognitive deficits. These metacognitive deficits, in turn, may hamper the identification of situations where smokers’ impulse control capacities are not sufficient to resist temptations.

Materials and methods

Participants

Thirty-seven regular smokers (23 females, meanage = 29.3 years, range = 19–54) and 38 non-smokers participated in the study (24 females, meanage = 27.1 years, range = 19–55). Participants were recruited via advertisement and the internal participant pools of the chair of experimental psychology as well as of the Munich Experimental Laboratory for Economic and Social Sciences (MELESSA) at the Ludwig Maximilian University Munich. An a-priori power analysis based on a meta-analysis of delay discounting in addiction reporting an effect size of Cohen’s d = 0.638 suggested that 32 participants per group are sufficient to detect an effect of smoking status on delay discounting with a power of 80% (alpha = 5%). Note that such a sample size is also sufficient to reliably measure metacognitive accuracy in intertemporal decision making24,25,28. For study inclusion, smokers had to smoke ≥ 10 cigarettes per day and to have a score of ≥ 3 in the Fagerström Test for Cigarette Dependence50,51, whereas volunteers without history of regular tobacco smoking were assigned to the non-smoker group. Both smokers and non-smokers were screened for excessive alcohol consumption (inclusion criterion: AUDIT-C score ≤ 552; meansmoker = 3.1; meannon-smoker = 2.7), other substance dependence, and neurological or psychiatric disorders. For one participant in the smoker group, data of all decision tasks (except for the risk preference task, see below) were lost due to technical issues. All participants gave voluntary informed written consent prior to participation. The study was approved by the local ethics committee of the department for psychology at Ludwig Maximilian University Munich (protocol number: 44_Soutschek_b) and performed in accordance with the Declaration of Helsinki. Study participation was re-imbursed with 10 euro/hour in addition to a performance-dependent bonus (see below).

Stimuli and task design

Confidence accuracy task To assess participants’ metacognitive awareness of their time preferences, participants performed a monetary intertemporal choice task24. Participants chose between an immediately available smaller-sooner (SS) reward (0–5 euro today, in steps of 0.5 euro) and a larger-later (LL) reward (5 euro delivered after a delay of 1–360 days). The SS and LL reward options were randomly presented on the left or right side of the screen, and participants chose the left or right option by pressing the left or right arrow key, respectively, on a standard keyboard within 6 s. Following each choice, participants indicated their confidence that they made the best possible choice on a rating scale from 0 (low confidence) to 20 (high confidence) within 3 s (Fig. 1A). Participants made a total of 66 choices in this task, including all combinations of SS reward magnitudes (11 levels) and delays (6 levels). We note that higher LL reward magnitudes than employed in this study might increase patience due to the so-called magnitude effect53, which might lead to ceiling effects and reduce the sensitivity of our task to measuring individual differences in time preferences.

Bidding task In the bidding task28,54, participants indicated on a rating scale which amount of money delivered today (in steps of 0.2 euro) they considered as equivalent to 5 euro at different points of time in the future (e.g., “Which amount of money available today is equivalent to 5 euro in 90 days?”). We used the same delays as in the confidence accuracy task. This allowed us to compute the difference between the self-reported (bidding task) and the observed subjective values (based on the choices in the decision task) for each delay (Fig. 2A).

Precommitment task. As for the confidence accuracy task, participants made choices between SS and LL rewards. One option consisted of a fixed monetary reward of 5 euro delivered after delays of 29–388 days (precommitment option; e.g., “5 euro in 68 days”; Fig. 3A). When choosing this option, participants received 5 euro after the given delay without having the possibility to reverse their choice. The other option (postpone option) entailed an SS reward of 0–5 euro delivered after 28 days and an LL reward that was identical to the precommitment option (e.g., “3 euro in 28 days” or “5 euro in 68 days”). If for the bonus payment a trial of the precommitment task was selected where a participant had decided to postpone the final decision, the participant was re-contacted by the experimenter via email after 28 days and was asked to make a choice between the SS and LL rewards of the chosen trial, with the delays adjusted for the 28 days that had passed (in the current example, “3 euro today” or “5 euro in 40 days”). If a participant had selected the precommitment option in the chosen trial, the participant received information about the chosen option via email after 28 days without the possibility to reverse the choice. Thus, only the postpone option allowed participants to reverse their preference after 28 days. Participants performed a total of 36 choices in the precommitment task. We used 6 levels of SS reward magnitudes (2, 2.5, 3, 3.5, 4, and 4.5 euro) that were delivered after 28 days, whereas the LL reward magnitude was fixed to 5 euro and was delivered after one of 6 possible delays (29, 33, 38, 48, 68, and 118 days). These reward magnitudes and delays were adopted from our previous studies on the link between metacognitive accuracy and precommitment24,25.

Procedure

After having checked participants’ eligibility, we administered a task battery including baseline measures for reward sensitivity (Behavioral Inhibition System/Behavioral Approach System (BIS/BAS) questionnaire55), the multiple choice vocabulary test MWT-B as proxy for verbal intelligence56, and working memory capacity (digit span backward task57). In the computerized version of the digit span backward task, participants had to report a stream of digits presented on the screen (with 3 to 9 digits) in reversed order, with each difficulty level being presented only once. For the BIS/BAS questionnaire, we computed separate scores for the BIS and BAS subscales via the subscale-specific mean rating scores (range: 1–4)58. These measures served to control for potential baseline differences between the smokers and non-smokers. Participants then performed the confidence accuracy task, the bidding task, and the precommitment task. Additionally, participants also performed (in counterbalanced order) decision tasks eliciting risk preferences59,60 and the willingness to engage in rewarded mental effort61,62. Because neither effort nor risk preferences have been linked to precommitment so far, the results of the latter two tasks will be reported in a separate article focusing on how smokers trade off benefits against the costs of actions. We informed participants that at the end of the experiment one of the choices made in the decision tasks would randomly be selected and the corresponding payoffs added to their payment. This established procedure ensures that participants had to take all choices seriously (as all choices were equally likely to be selected after the experiment) and avoids satisficing effects that might occur if all choices are paid out.

Data analysis

We analyzed data with Bayesian mixed generalized linear models (MGLMs) as implemented in the brms package in R63. Trials with reaction times faster than 250 ms were excluded from the analyses. We also removed trials where participants failed to respond within the time limits (1% of all trials in the confidence accuracy task and 3% of all trials in the precommitment task). In the confidence accuracy task, we measured participants’ metacognitive awareness of their decision uncertainty following a previously described approach24,25,64. For that purpose, we first estimated each individual’s time preferences by fitting a hyperbolic discount function65 to the choices in the confidence accuracy task (Eq. 1):

$$SV_{LL} = \frac{{{\text{LL}}\;{\text{reward}}\;{\text{magnitude}}}}{{1 + k \times {\text{delay}}}}$$
(1)

where SVLL is the discounted subjective value of the LL reward and k is a participant-specific constant that indicates the steepness of the discount function (“discount factor”). To translate subjective value into binary choices, we fitted a standard softmax function to each participant’s choices:

$$P\left( {{\text{choice}}\;{\text{of}}\;{\text{LL}}\;{\text{option}}} \right) = \frac{1}{{1 + e^{{ - \beta_{{{\text{temp}}}} \times \left( {SV_{LL} - SV_{SS} } \right)}} }}$$
(2)

This function captures the likelihood of choosing the LL reward option as a function of the difference between the subjective value of the LL reward option (SVLL) and the SS reward option (SVSS). The inverse temperature parameter βtemp captures the slope of the function, i.e., how strongly participants relied on this value difference for their choices. We compared the group-level estimates of the discount factor k and the inverse temperature βtemp by estimating group-level parameters for the smoker and non-smoker groups with a hierarchical Bayesian approach (2 chains with 4,000 samples, the first 2000 samples were used as burn-in) using the function “dd_hyperbolic” in the hBayesDM package66. To measure metacognitive access to individual time preferences, we computed the difference between the value of the SS reward and the subjective value of the LL reward (SVLL–SVSS) based on the individual discount factors. We then performed a Bayesian MGLM regressing binary choices (1 = LL reward, 0 = SS reward) on fixed-effects predictors for Group (0 = non-smoker, 1 = smoker), subjective value difference (DV = SVLL − SVSS), confidence ratings, as well as all interaction terms. As random effects, we included participant-specific intercepts as well as random slopes for DV, confidence, and the interaction term. The interaction between value difference and confidence indicates the degree to which participants are aware of objective decision uncertainty in the choice process and thus constitutes a measure of metacognitive accuracy: the stronger the relationship between the slope of DV and confidence ratings, the more reliably an individual is able to track decision uncertainty in intertemporal decisions. Assessing the impact of smoking status on this interaction effect thus allowed us to test whether smokers and non-smokers differ in the metacognitive awareness of their time preferences. As in all other MGLMs, we estimated posterior distributions of parameters with two chains, each entailing 4000 samples (burn-in = 2000 samples).

In addition, we analyzed the differences between participants’ self-reported subjective values of delayed rewards in the bidding task and the subjective values as given by the individual hyperbolic discount functions derived from the decision task (see above). A positive difference between self-reported and revealed subjective reward values indicates that an individual overestimates the value of future over immediate rewards relative to their revealed preferences, whereas a negative difference indicates an underestimation of the value of delayed rewards. We regressed differences between self-reported and revealed subjective values on fixed-effect predictors for Group, Delay, and the interaction term. As random effects, we modelled participant-specific intercepts and slopes for Delay.

In the precommitment task, we asked whether smokers and non-smokers differ regarding their sensitivity to the expected benefit from precommitment. We conducted an MGLM that regressed choices in the precommitment task (0 = postpone option, 1 = precommitment option) on fixed-effect predictors for Group, Reversal risk, and value difference between SS and LL reward (DV). As random effects, we added participant-specific intercepts and random slopes for DV, reversal risk, and the interaction term. The predictor DV controls for the possibility that participants decided for the postpone option because at time of choice they preferred the SS over the LL option. We computed the reversal risk on each trial by subtracting the value difference between SS and LL reward in the perspective of 28 days later (i.e., when participants had to make a final choice between the options) from the value difference between these options in participants’ current perspective (DV) based on the individual discount factors estimated in the confidence accuracy task. A higher score indicates a higher risk of preference reversals (i.e., that a participant prefers the LL reward in the experimental session and the SS reward in 28 days) and thus a higher expected benefit from precommitting to the LL reward.