Introduction

Overweight and obesity are growing problems with epidemic dimensions1,2. In Europe, 30.5% of women and 44.7% of men are considered obese, and 53.1% meet the criteria for being overweight3. Prevalence is especially high in industrialized countries, but the problem is also on the rise in developing countries1,4. Obesity is a primary risk factor for several disorders (e.g., cardiovascular disease, coronary heart disease, sleep apnea, atrial fibrillation, diabetes) and aggravates known risk factors (e.g., high blood pressure) for other life-shortening disorders (e.g., stroke). High BMI is regarded as responsible for 4 million deaths worldwide per year, representing 7.1% of deaths due to any cause and contributing to 120 million disability-adjusted life years4. Moreover, obesity is associated with psychological sequelae such as (self-)stigma5.

Standard measures to decrease weight are dieting (i.e., reducing the intake of calories) and/or increased physical activity to burn additional calories. In severe cases, surgery is recommended6 as a last resort despite possible side effects7. Some pharmacological agents have also shown promise8.

The present article is concerned with another route of intervention—the reduction of craving for foodFootnote 1 —by means of approach bias modification (ABM). ABM procedures are rooted in dual-process theories9,10, which try to explain substance and behavioral addictions, including obesity, as an excess of (implicit) impulsive approach tendencies that override both the (explicit) will to disengage from the substance or behavior and the insight into the negative consequences of the substance or behavior11. In line with the tenets of dual-process theories, experimental studies using the approach-avoidance task (AAT) show that individuals with addictive behavior are faster to pull a picture of their preferred/craved substance/object (e.g., drug, alcoholic drink, high-calorie food) toward themselves than to push it away via a joystick in a computerized set-up compared to non-addicted individuals12. This effect is well-established across a range of disorders, particularly substance disorders but also obesity (see ref. 13 for a study with an approach-avoidance variant of the implicit-association task), and speaks to an embodiment of addiction (embodiment or embodied cognition is the theory that many features of an agent’s cognition are shaped by aspects of his or her body14). Building upon diagnostic findings from the AAT and other assessment procedures, ABM aims to revert and decrease embodiment. The feasibility and effectiveness of the approach is well established in problem drinkers15 but has been successfully applied to other addictions as well16. In ABM, the addictive substance or behavior is coupled with pushing a joystick, which causes the picture to shrink in size, thus creating the illusion of distance, whereas a positive or neutral cue is coupled with pulling a joystick, which causes the picture to increase in size, thus creating the illusion of approach. Thus, ABM adopts the rationale of the AAT to decrease the approach bias towards addiction-related stimuli. The reduction of relapse in alcohol use disorder is usually small through ABM. Several studies have shown that ABM reduces the exaggerated approach bias in people with addictive behavior, particularly relating to alcohol16,17,18. The approach was then transposed to other addictions, particularly behavioral ones. For eating disorders results are mixed. In one meta-analysis, cognitive bias modification interventions that included ABM resulted in a medium change in the approach bias and small effects on food consumption and cravings for high-calorie food19. The latest meta-analysis found no effect of ABM on eating behavior. Likewise, a recent study, not included in the meta-analysis, found positive effects of ABM on the approach-avoidance bias but not on eating pathology questionnaires or the body mass index (BMI)20.

Recent meta-analyses, however, have questioned the general effectiveness of the ABM approach in ameliorating clinical outcomes (addiction and craving)21,22. In the meta-analysis by Boffo et al. 22, cognitive bias modification exerted a small effect on cognitive biases and relapse but not on the reduction of substance use.

The ABM procedure is simple to use, but even its developers acknowledge that the task can be boring and tedious for some individuals23. Another obstacle for routine implementation is the need for access to a computer device and a joystick. Moreover, when using computerized devices the available images are limited and difficult to personalize, particularly relating to the typical environment of intake24. Further, the many possible combinations of the substance that is craved (e.g., alcohol: wine, beer), brand/type of packaging, and favorite method of consumption (e.g., can, glass, bottle) are hard to capture with pictures.

Imaginal retraining aims to address these limitations. The technique consists of two phases that are easy for the participants to execute. Following a simple negative mood induction to enhance embodiment (for details, see the “Methods” section), the alleged primary mechanism of ABM25, in the averse sequence participants throw (the behavior/movement is actually performed) the imagined high-calorie food away from them. This sequence corresponds to the push movement in conventional ABM. For the opposite sequence, the user engages in a positive mood induction (see the “Methods” section for details) before imagining eating or drinking a delicious but healthy, low-calorie beverage, or food, while coupling this with other positive sensations to enhance the effects of embodiment25. This sequence corresponds to the pull movement in conventional ABM. Two randomized-controlled trials with problem drinkers24 and smokers26 showed that imaginal retraining reduced both craving for alcohol/cigarettes (proximal outcome), as well as actual drinking and smoking behavior (distal/clinical outcome).

For the first time, we examined the efficacy of imaginal retraining to reduce craving for high-calorie food (primary outcome) in individuals with overweight (BMI > 25). Secondary outcomes were the Three-Factor Eating Questionnaire (part of the German Fragebogen zum Essverhalten, FEV; German version27); the Food Cravings Questionnaire-Trait-reduced (FCQ-T-R)28, well-being (WHO Quality of Life scale)29, and weight. We adapted the original manual, which we had developed for problem drinkers and smokers, to people with overweight and obesity. The new manual encouraged neither dieting nor any additional physical activity. The (wait-list) control group received access to treatment upon completion of the post assessment.

Methods

The web-based intervention was conducted via WisoPanel30,31, a participant pool of German-speaking individuals who have registered for participation in web-based studies (https://www.wisopanel.net). Members of WisoPanel are drawn from diverse sources comprised of people from all social classes and is demographically similar to the general population.

Because the BMI of women registered with WisoPanel was not known and this was the primary inclusion criterion of the study, all registered women (N = 8.786 at the time of inclusion) received an invitation and a link to the study. The invited sample thus was comprised of women who were normal weight, overweight, or obese. After a short description of the purpose of the study, interested people were asked to give their explicit informed consent if they met the inclusion criteria (see below). Inclusion criteria were age between 18 and 75 years and a BMI > 25 (we calculated this based on the weight and height of the participants). A history of anorexia, bulimia, or psychosis led to exclusion. Acute suicidality (as manifested by a score of 2 or 3 on the BDI-II rating for suicidal ideation) led to automatic exclusion with a referral to specific websites offering help, including emergency numbers (see Fig. 1).

Fig. 1
figure 1

CONSORT flow chart for the current study. ITT = intention-to-treat analysis.

Design

The intervention was unguided; the study was set up as a randomized-controlled trial (randomization plan); controls were allowed access to standard care. Study participation was anonymous; participants were advised to create email addresses that did not disclose their names. Participants in the intervention condition received the imaginal retraining manual immediately after randomization via an email attachment, whereas participants in the control condition received the manual upon completion of the post assessment. One week after the initial sending of the manual, half of the intervention sample received an email with specific instructions on how to set the timer on their smartphone in order to receive daily reminders to perform the exercises (no participant canceled the study nor was excluded from it for not having a smartphone or not being willing to use it).

Data from 384 participants were included in the final analyses (see Table 1), which according to g*power allowed for the detection of a small effect size. The main reasons for exclusion of data were cancellation of the survey before completion, BMI ≤ 25, and refusal to give informed consent (see CONSORT flow chart in Fig. 1). The trial was registered with the German Clinical Trials Register (DRKS0001722). We carefully checked that required statistical assumptions were met (e.g., normal distribution; equal variances—for certain comparisons adjustments for unequal variances were made).

Invitation and baseline survey

Assessments were conducted online using Questback/UniPark®. In keeping with guidelines of the European General Data Protection Regulation (GDPR), no IP addresses were stored. The local ethics committee for psychologists at the University Hospital of Hamburg-Eppendorf (Germany) approved the study (LPEK-0030). As an incentive for participation, a self-help manual on imagery rescripting was sent to all individuals who completed the study32, as well as a link to the MCT & More app (www.uke.de/mct_app), which is aimed to improve self-esteem.

At the start of the survey, electronic informed consent was required. Questions on the participant’s sociodemographic background as well as medical history (e.g., prior experience with psychotherapy; current treatment status; prior psychiatric diagnoses, if any) followed. Subsequently, scales on eating behavior and well-being were presented (see section “Questionnaires”). We also asked for the participant’s present weight. At the end, we asked participants whether they had truthfully answered the questions (inclusion criterion). We requested a pseudonymized email address. Email addresses were not stored online. Within 24 h, participants were randomized to one of the three conditions (ratio: 1 (wait list); 0.5 standard retraining; 0.5 retraining with electronic reminders). The allocation was implemented based on the time the participant entered the study. Owing to the design of the online study, which involved no personal contact, concealed allocation as in clinical trials with face-to-face assessments could not be implemented. Our procedure is best characterized as centralized assignment with no risk of bias because the person allocating individuals to conditions had no information about participants other than the date they had signed up for the study and their anonymized email.

Participants in the control group were informed via email that they would receive the retraining manual after the post assessment. Six weeks after initial participation, all participants were invited to participate in the post assessment. Up to three reminders were sent. For the post assessment, participants were first asked to re-enter their email address to allow matching of pre and post data. The same set of questionnaires was administered as in the baseline survey. We asked participants for their present weight and whether this was measured with a scale (this question was entered during the follow-up period and the results are therefore available for a subgroup; see Tables 2 and 3). Further, we asked whether they had eaten less during the study period vs. the same or more as previously (the latter two categories were later pooled). Those in the retraining conditions who had at least started to read the manual were asked further questions related to their subjective assessment of the quality, comprehensibility, satisfaction, and efficacy of the manual (see Tables 4 and 5).

Table 1 Demographics, well-being, and eating-related characteristics of the sample at baseline.
Table 2 Intention-to-treat analyses for the pooled retraining group and subgroup with electronic reminders (RER) vs. controls.
Table 3 Per-protocol comparisons (technique executed at least once during intervention period) for pooled retraining group and subgroup with electronic reminders (RER) vs. controls.
Table 4 Subjective appraisal by users of imaginal retraining (adapted from the Patient Satisfaction Questionnaire, German acronym ZUF-8; Schmidt et al. 1989), with percentage of positive vs. negative appraisals, means and standard deviations (results of the imaginal retraining trial in problem drinkers are inside square brackets).

Imaginal retraining

Imaginal retraining is a manualized self-help intervention (10 pages with 5 figures; 3918 words). The manual starts with a short psychoeducational section that highlights well-established negative consequences of overweight and obesity in order to increase motivation to change. Next, participants are familiarized with findings using the classical approach-avoidance procedure (diagnostic and intervention [ABM] procedure). To enhance participants’ understanding of the rationale for our procedure, the psychological mechanisms that seem to underlie conventional retraining are explained. Some disadvantages of the original technique (need for access to a computer, lack of personalization, boredom) are identified as issues that our procedure aims to address.

Before conveying the technique of imaginal retraining, we describe exposure in vivo and in sensu, as the latter intervention is an ingrained part of our procedure. Participants are then instructed to imagine their preferred high-calorie foods that they want to eat less of along with their usual environment when eating these foods. Following this, they are asked to imagine a healthy, low-calorie food they also like to eat (e.g., apples). Participants are informed about the close connection between body posture, thoughts, and emotions. We illustrate that when we are in a bad mood, we tend to walk slumped over and with the corners of our mouths turned down, whereas when we feel happy, we walk with a more upright body posture and have a confident facial expression. Posture and emotion influence each other reciprocally. Thus, straightening the body leads to a slight improvement in mood, whereas adopting a bent-over posture tends to reduce well-being.

Then, imaginal retraining is taught, which consists of two sequences with two steps each. In the first part, the participant is asked to first exhale, slump forward, and round his or her shoulders. This posture should be reinforced as vividly as possible with negative thoughts (i.e., mood induction). Subsequently, participants should imagine their favorite high-calorie food and the place where they usually consume it. Participants should then push or throw the food away from themselves in their imagination (e.g., throwing a cake against the wall) while vigorously executing the movement in reality. This sequence is depicted in the manual by a series of drawings (see Fig. 2). We also advise participants to throw the imagined food onto the ground because pushing away and downward movements are both typically associated with disgust and rejection.

Fig. 2: The two steps of imaginal retraining.
figure 2

Aversion sequence (upper panel): The individual should imagine grabbing a favorite high-calorie food, bend forward and contemplate negative thoughts, and then throwing the imagined food away from themselves (actual movement with imagined object). Approach sequence (lower panel): The individual should imagine grabbing a healthy food such as an apple, stand upright, lift the imagined food high and, if possible, couple this with other positive feelings (e.g., stroking a pet; actual movement with imagined object).

For the approach sequence, participants are instructed to imagine eating healthy food (e.g., an apple). They are asked to take a deep breath and stand up straight and tall as if someone were pulling them up by an imaginary string attached to the top of their head. They are then told to move the imagined healthy food or drink toward their mouth in a somewhat exaggerated, happy manner, as actors often do in advertisements, so that they are looking slightly upwards (to improve mood). At the same time, they should contemplate pleasant thoughts and images (e.g., eating an apple while stroking a pet lying against their chest). Again, drawings are used to depict this procedure (see Fig. 2). Participants are asked to perform the exercises regularly, at least twice a day. No further tips (e.g., regarding a diet or physical exercises) are provided.

After one week, half of the participants in the retraining group (the RER group) were sent specific instructions that included screenshots on programming their android or iOS smartphone to receive messages once or twice daily.

Questionnaires

Primary outcome

Visual Analog Scale (VAS) for food craving. The mean total score of a three-item VAS scale served as the primary outcome. This scale was used in analogous trials in problem drinkers24 and smokers26. The VAS referred to the previous week. Participants had to move a bar between 0 and 100 to represent the strength of their high-calorie food craving in non-eating phases (not at all [=0] to very strong [=100]); strongest craving for high-calorie food (not at all [=0] to very strong [=100]); and frequency of craving for high-calorie food (never [=0] to always [=100]). Thus, higher scores indicated stronger craving.

Secondary outcomes

Food Cravings Questionnaire-Trait-reduced (FCQ-T-R28). The FCQ-T-R is a short version of the Food Cravings Questionnaire-Trait (FCQ-T33. The questionnaire is sensitive to change34,35 and is correlated with the BMI, as well as with impulsiveness and is negatively correlated with success in diets28. The short form consists of five subscales: lack of control over eating (five items; Lack of Control), thoughts or preoccupation with food (five items; Thoughts/Preoccupation with Food), intentions and plans to consume food (two items; Intentions to Consume), emotions before or during food craving (two items; Emotions), and cues that may trigger food craving (one item; Triggers). Following Holmes and Meule36, items have to be endorsed on a 5-point Likert scale. Internal consistency (α = 0.94) as well as test–retest reliability (r = 0.74) are good28,37. Higher scores indicate greater problems with eating behavior.

Three-Factor Eating Questionnaire (Fragebogen zum Essverhalten, FEV27). The German FEV is a multidimensional questionnaire and incorporates the English Three-Factor Eating Questionnaire (TEFQ) by Stunkard and Messick38, a commonly used instrument39. Three subscales exist. The first is cognitive restraint from eating (21 items). A high score represents spontaneous and virtually uncontrolled binge eating; a low score hints at controlled eating by means of, for example, counting calories (e.g., “I have a pretty good idea of the number of calories in common foods”; ”I consciously hold back at meals in order not to gain weight”). The subscale disinhibition is composed of 16 items. A high score shows cue-driven excessive eating behavior due to, for example, emotional turmoil and external triggers (e.g., “I usually eat too much at social occasions, like parties and picnics”; “When I feel anxious, I find myself eating”). The third scale taps hunger. A high score indicates bothersome sensations of hunger (e.g., “I am usually so hungry that I eat more than three times a day”; “Dieting is so hard for me because I just get too hungry”). For the German version27, high validity and internal consistency has been reported (Cronbach’s α = 0.74–0.87).

Beck Depression Inventory II (BDI-II40). The BDI-II contains 21 items that capture common somatic and psychological symptoms of depression. The internal consistency and test–retest reliability of the German version are good41.

WHO Quality of Life (WHOQOL-BREF29). The global item of the WHOQOL-BREF served as an index of quality of life.

Subjective appraisal and benefit

Participants who had at least started to read the manual were asked to fill out the Patient Satisfaction Questionnaire (German acronym ZUF-842, adapted for online interventions). The scale assesses subjective appraisal of the technique (e.g., quality, satisfaction, effectiveness, and intention to use the application in the future). Tables 4 and 5 show the results of these and additional questions pertaining to treatment.

Results

The final sample consisted of 384 women with a BMI > 25. A total of 200 participants were allocated to the control condition, while 91 were allocated to standard retraining (RS) and 93 to retraining with electronic reminders (RER). Baseline sociodemographic and eating-relevant characteristics of the three conditions are displayed in Table 1. The participants were in their late 40s. The mean BMI corresponds to obesity, and the mean BDI-II suggests the presence of minimal to mild depressive symptoms. Approximately one-third of the sample (33.1%) had attempted one to three diets in the past. Almost one-fifth (20.5%) had attempted more than nine diets or were on a permanent diet. Approximately the same percentage had never tried a diet (20.6%).

Table 5 Subjective appraisal of the retraining manual, with means and standard deviations (results from problem drinkers are inside square brackets).

Adherence was comparable across groups (controls: 74.5%; standard imaginal retraining without electronic reminders [RS]: 69%; retraining with electronic reminders [RER]: 73%), χ2 (2) = 0.88, p = 0.643. Being on a diet during the study period did not influence either craving or weight and did not interact with intervention in the two-way ANOVA with Group and Diet (yes, no) as factors (p > 0.2)

Of those in the two retraining conditions who agreed to rate their degree of engagement with the treatment, 5% acknowledged that they had not read the manual at all (prior study on smoking: 8.5%), whereas 26.7% reported they had read the manual at least partially but had not (yet) performed any exercises (prior study: 43.9%); 18.3% had performed the exercises once in the intervention period (prior study: 24.4%); and 17.5% had performed the exercises once a week (prior study: 24.4%). Almost one-fourth (24.2%) had performed the exercises several times a week (prior study 17.1%). Six participants (5%) had performed the exercises on a daily basis (prior study: 1.2%). Few participants (3.3%) had performed the exercises several times daily (prior study: 4.9%). Thus, approximately two-thirds of the sample (68.3%) had performed the exercises at least once during the study period.

Of the controls, 24.2% (prior study on smoking: 26.4%) self-reported eating less during the intervention period compared to 36.1% in the RS and 39.4% in the RER conditions (prior study on smoking: 45.5%), χ2(2) = 6.19, p = 0.045. The latter rates rose to 45.9% and 46.5%, respectively, when considering only those who had performed the exercises at least once in the treatment period (prior study: 59%), χ2(1) = 11.70, p = 0.003.

Utilization and dose effects

Participants in the standard RS condition used the technique somewhat less often than those in the RER condition who were encouraged to use electronic reminders (criterion: usage multiple two times a week or more often: 24.1% vs. 40.3%), χ2(1) = 3.58, p = 0.059.

The frequency of performance of the technique (i.e., only read manual; performed exercises once during the study period; performed once weekly; performed several times a week; performed daily; performed several times daily) correlated with changes on certain eating-related parameters suggestive of a beneficial effect of the intervention on outcome: VAS (r = 0.22, p = 0.015), FCQ-T-R Total (r = 0.35, p < 0.001), FCQ-T-R Lack of Control (r = 0.37, p < 0.001), FCQ-T-R Thoughts/Preoccupation with Food (r = 0.28, p = 0.002), FCQ-T-R Intentions to Consume (r = 0.33, p < 0.009), FEV Cognitive Restraint (r = −0.24, p < 0.009), and FEV disinhibition (r = 0.23, p < 0.012).

Test–retest reliability

Test-retest reliability was satisfactory to good, with the following scores (in ascending order of strength): weight (r = 0.98), BDI-II (r = 0.79), FEV Cognitive Restraint (r = 0.72), and FCQ-T-R Total (r = 0.70; FCQ-T-R subscales: r= 0.56–0.73). Reliability of FEV Disinhibition (r = 0.69), FEV Hunger (r = 0.66), the QoL global item (r = 0.60), and the VAS composite score (r = 0.49) were acceptable (all p < 0.001).

Intention-to-treat and per-protocol analyses

Group differences across time were calculated using mixed ANOVAs with Group as the between-subjects and Time as the within-subject factor. The intention-to-treat (ITT; missing values were estimated using the expectation-maximization method) and the within-subject analyses results are given in Table 2. Except for quality of life, participants improved on all scores, including depression, irrespective of group allocation. The pooled imaginal retraining group displayed greater improvements than controls on four outcomes: FEV Cognitive Restraint, FEV Hunger, FCQ-T-R Total, and FCQ-T-R Lack of Control. Yet, no differences emerged on the VAS, the primary outcome. When analyses were confined to the retraining group with electronic reminders (RER), which showed better adherence than the standard (RS) group, seven outcomes, including the primary outcome, yielded significance: VAS, FEV Cognitive Restraint, FEV Disinhibition, FCQ-T-R Total, FCQ-T-R Lack of Control, FCQ-T-R Thoughts/Preoccupation with Food, and BDI-II (results pertaining to the BDI-II should be interpreted cautiously in view of baseline differences). Effect sizes for significant results were in the small to medium range. In the subgroup in which weight was measured with a scale, the pooled retraining group showed a greater weight loss at statistical trend level, which achieved significance when considering the RER group only (see Table 2).

Results were corroborated by the per-protocol (PP) analyses, considering only those participants who had used the technique at least once in the treatment period. Again, participants improved overall at a medium to large effect size except for quality of life. The pooled retraining group was superior to the control group on seven out of 12 outcomes: VAS, FEV Cognitive Restraint, FEV Hunger, FCQ-T-R Total, FCQ-T-R Lack of Control, FCQ-T-R Intentions to Consume, and FCQ-T-R Triggers. At statistical trend level, weight loss was more marked in the pooled retraining group when considering those who endorsed that they had measured their weight using a scale at both baseline and reassessment. When constricting analyses to the RER group, this outcome achieved significance at a medium effect size. Again, for seven outcomes the RER group had better results than controls; two results bordered on significance and three were nonsignificant.

The two retraining subgroups performed almost equally except on the QoL (p = 0.016, ηp2 = 0.070) and FCQ-T-R Emotions (p = 0.042, ηp2 = 0.051), where the RER displayed grater improvement. The paired comparisons indicate that group differences were due to improvement in the experimental conditions and not worsening in the control condition. In fact, controls improved on most outcomes across time.

Completers vs. non-completers

Compared to non-completers, completers were older (p = 0.017), more often endorsed that they had attempted more than 15 diets (p < 0.001), had somewhat higher treatment expectancy (p = 0.044), and scored higher on FEV Cognitive Restraint (p = 0.039).

Subjective appraisal

Tables 4 and 5 display the subjective appraisals of the participants in the two retraining groups who had performed the exercises at least once. Appraisals were better for the group with electronic reminders on the following parameters: whether they had received the expected treatment and whether participants would use the manual again. At trend level, the electronic reminder group more often endorsed that their consumption of high-calorie food had decreased because of their use of the program (79.5% vs. 63.2%). More than 80% of the participants endorsed the following statements: good or excellent quality of the manual, would recommend the manual to a friend (RER only), good for self-help, comprehensible, the manual would have made more sense if it were used in combination with psychotherapy, I had to force myself to use the manual (RER only), expected type of treatment received (RER only), and helpful and would use manual again (RER only). Between 70% and 79.9% endorsed the following items: I had to force myself to use the manual (RS only), helped me cope with problems more successfully (RER only), satisfied with the manual in general, happy about the extent of help received from the manual, consumption of high-calorie food decreased because of the program (RER only), able to use the manual on a regular basis (RER only), and would recommend the manual to a friend (RS only). Between 60% and 69.9% endorsed the following statements: would use manual again (RS), consumption of high-calorie food decreased because of the program (RS only), helped me cope with problems (RER only), and expected type of treatment received (RS only).

For use as a benchmark, the figures from our forerunner trial on imaginal retraining in problem drinkers are set in square brackets in Tables 4 and 5. We comment on the differences to the benchmark if they exceed 10%. More participants in the current trial (RER: 52.3%, RS: 55.3%) endorsed that the manual was not applicable to their problem behavior than in the alcohol study (40.9%). In particular, the participants in the RS condition endorsed less often that they were able to use the manual on a regular basis during the previous 6 weeks (55.3% vs. 86.4% in the prior study). Again, less participants in the current trial (RER: 71.8%, RS: 58.3%) endorsed that the manual helped them cope with their problems more successfully relative to the alcohol study (86.4%). Yet, their happiness about the help they received was markedly higher.

Discussion

Individuals intending to lose weight usually either start a diet aimed at the reduction of calorie intake or initiate/enhance recreational activities (e.g., sports) aimed at burning additional calories. Such measures necessitate effort and discipline and are often short-lived. Once they end, the weight often increases43,44, sometimes above baseline due to hormonal changes (i.e., the yo–yo effect45).

We borrowed the present technique from the computerized ABM paradigm, which has frequently been administered in studies related to eating20,46, but we pursued another route. We expected the intake of and craving for high-calorie food to be reduced by a simple, quick technique that dampens the craving/addiction-like properties of conditioned high-calorie food stimuli. Participants learn to couple the image of their favorite unhealthy/high-calorie food with a mildly aversive emotion (mood induction) and a push movement, similar to classical ABM but with no need for technical equipment24. This imaginal variant of retraining has been successfully tested in problem drinkers24 and smokers26, and the technique reduced craving and also actual consumption. The present study transposed the approach to food.

Unique to this study, two retraining conditions were set up to test whether adherence could be improved by reminders. Similar to the forerunner studies, one group, the standard retraining (RS) group, received the manual with instructions to read through the manual and to perform the exercises on a daily basis. The other group (RER) received the same manual but was also sent an email after one week with specific instructions on how to use the timer on their smartphone in order to give them daily reminders to perform the exercises. This condition was implemented following evidence that daily smartphone notifications to perform simple exercises foster reduction of target symptoms47. Indeed, the latter group performed the exercises more regularly, and this frequency in turn was associated with improvement on several outcomes at a small to medium effect size. Accordingly, overall effects were larger for the RER group than for the other retraining group. Both ITT and PP analyses indicate that this subgroup benefited from the intervention in terms of craving and also psychopathological symptoms (depression). For the small subgroup who had measured their weight before and after the study period with a scale, weight loss was more than 3 kg, which represented a medium effect size relative to controls. The pooled retraining group showed a significant effect on craving in the PP but not the ITT analyses. Effects on five outcomes, however, were significant for both kinds of analyses: FCQ-T-R Lack of Control, FCQ-T-R Triggers, FCQ-T-R Total, FEV Cognitive Restraint, and FEV Hunger.

Individuals in both retraining groups reported eating less than controls during the intervention period (RER: 39.4%; RS: 36.1% vs. Control: 24.2%), and the majority would use the training again and recommend it to others. Adherence was larger than in smokers. Satisfaction was less for a number of the outcomes relative to the study on alcohol (e.g., able to use the manual on a regular basis, helped me cope with problems). Overall, RER received more favorable assessments than RS, which reached significance for two outcomes. The appraisal of the RER technique in individuals with overweight was numerically better on 14 of the items relative to RS and on 11 of the outcomes compared to the treatment manual for problem drinkers.

Effects for participants in the current study were significant on core outcomes but somewhat weaker than for smokers and drinkers. At this point, we can only speculate about possible reasons. First, smaller effects were likely due to unexpected improvements in the wait-list control group. Controls also reduced weight as well as craving and improved on other outcomes. Second, the range of possible stimuli is narrower in smokers and drinkers, who often have a typical “drug of choice” (e.g., bottle of red wine; certain brand of cigarette). For eating, stimulus control is poor because even the same type of food (e.g., cake, cookies) can have many different shapes or colors. This assumption can be tested, however. For example, we would expect a greater reduction of weight and craving in overweight persons who imagine a wide range of unhealthy foods they like to eat during the retraining compared to those who imagined only a few unhealthy foods they like to eat (e.g., chips, pizza, and soda). Lastly, the imaginal retraining manual for drinkers gave additional recommendations that might have augmented the effect. However, no such tips were provided to smokers, so this cannot explain the efficacy gap in the study of smokers vs. the current study of people with overweight or obesity.

Motivation to engage the manual was higher than in smokers, but still 31.7% of the participants did not perform the exercises. Even though the technique is simple, other ways of presentation should be tested, such as video clips via social media demonstrating how to perform the exercises. The technique contains many elements. Future research should elucidate the differential effectiveness of its various elements and whether certain elements are dispensable, such as negative mood induction and the positive counter-sequence. Also, it would be helpful to know whether adding further behavioral tips to the manual or encouraging a special diet would enhance the effect.

The study is subject to a number of limitations. First, we do not know whether the effects are stable over time, and we have not yet tested long-term adherence to the technique. Relatedly, at this point we can only speculate which aspects of the eating behavior had changed (e.g., overall reduction or eating less unhealthy vs. eating more healthy food). Second, we relied on self-reports, which are prone to the effects of social desirability; future trials should monitor participants under more controlled conditions with objective assessments, including weighing with a scaleFootnote 2. As in our forerunner trials, we refrained from implementing a sham condition for ethical reasons. Third, the efficacy of the technique needs to be explored in males as well, who have a higher prevalence of obesity, which seems to be more strongly related to alcohol consumption than in females48. Fourth, for exploratory purposes we did not control for multiple comparisons, which would have rendered many secondary outcomes non-significant. Fifth, future studies should explore motifs for non-adherence more rigorously (e.g., forgetting, inefficacy despite attempt to use it, no confidence in the rationale). Sixth, subsequent trials should test the present approach against an active control which may attenuate placebo and expectancy effects. Finally, whether the technique reduced craving and approach behavior as assessed by implicit measures has yet to be studied.

Conclusions

In individuals with overweight (BMI > 25), many of whom met criteria for obesity, imaginal retraining significantly reduced craving for high-calorie food over a period of 6 weeks. A small to medium correlation emerged between utilization frequency and outcome. Treatment effects were larger in the subgroup encouraged to use electronic reminders than in the pooled retraining group. The long-term effects of the technique in terms of acceptance, feasibility, and effectiveness need to be tested as well as alternative ways of presenting the technique (e.g., video clips) and the long-term efficacy.