Introduction

Besides the psychological and somatic symptoms that represent core diagnostic criteria of affective disorders, depression is characterized by maladaptive cognitive beliefs and biases, which lie at the heart of cognitive behavioral theories of depression1. Although dysfunctional beliefs have been shown to be relatively stable2, they do appear to be malleable and, in consequence, represent a major target of cognitive behavioral therapy (CBT). As a result, research until now has converged on the notion that depressive symptoms can be reduced via the modification of cognitive biases3, 4. However, the results are not fully conclusive3. For instance, it is unclear which mechanisms of change are evoked by different (cognitive) treatments for depression. Identification of the specific process variables that mediate patient outcomes could help to improve our understanding of how psychotherapy works and, as a result, initiate specific refinements.

Over the past decades, several treatment approaches with a metacognitive focus have been developed in parallel, including Metacognitive Therapy in Great Britain5 and Metacognitive Training in Germany (the latter was originally developed for psychosis6 and has been shown effective in a recent meta-analsis7). Although both approaches use the word “metacognition” in their name, they are quite distinct therapeutic interventions. Metacognitive Therapy developed by Wells represents a generic approach for various psychological disorders that considers working on specific cognitive content irrelevant. In contrast, Metacognitive Training developed by Moritz targets disorder-specific cognitive biases (e.g., jumping to conclusions in psychosis) by addressing general mechanisms of thinking (the metacognitive perspective) as well as the content of dysfunctional thoughts (the cognitive perspective). In other words, Moritz’s approach to Metacognitive Training differs from Wells’ approach to Metacognitive Therapy in that it targets both the content of cognition as well as metacognitive beliefs, whereas Wells’ approach is concerned with meta-level cognition and does not address the specific content of intrusive thoughts. Moritz’s Metacognitive Training is thus fully compatible with a general CBT treatment approach. It adopts both the cognitive and metacognitive perspectives, which are not deemed exclusive but rather complementary. As mentioned, Moritz’s approach was originally developed for patients with psychosis. Over the past decade, he and his colleagues have developed specific metacognitive training programs for depression, obsessive compulsive disorder, and borderline personality disorder, which in their general structure and use of exercises were inspired by Metacognitive Training for psychosis6. As a result, all these trainings attempt to challenge disorder-specific cognitive biases through the use of creative and engaging exercises supported by a multimedia presentation.

The present study used Metacognitive Training for Depression (D-MCT)8, which uses content tailored to the specific problems of individuals with depression. It seeks to enable group members to recognize and correct the often automatic and unconscious thought patterns that accompany depression. In addition to the depressive thought patterns that are also targeted in CBT (e.g., overgeneralization), general cognitive biases that have been identified by basic cognitive research (e.g., mood-congruent memory) are also addressed. Finally, dysfunctional coping strategies (i.e., thought suppression, rumination) are targeted, which are also addressed in the Metacognitive Therapy developed by Wells. D-MCT thus uses established techniques of CBT but complements them with new, engaging exercises (for an example, see Fig. 1) as well as a broader metacognitive perspective. D-MCT has been developed as a low-threshold (i.e., minimal demands) group concept in that patients are not required to talk about their individual problems but may if they wish; either way, they still experience how cognitive biases work and influence one’s mood. Moreover, the threshold is low with regard to administration; the preparation time is short because the therapy sessions are highly standardized (through the use of multimedia slide based presentations) and are thus easy to administer. Moreover, new patients may join the group in any session as the training is delivered in an open group format.

Figure 1
figure 1

Exercise in D-MCT module 7 on “Jumping to Conclusions: Mind reading”. (To modify their dysfunctional thought pattern and to demonstrate that mind reading is not possible, patients are asked to guess the title of the painting and “read the mind” of the artist). The correct answer is “b” (“The Waterseller from Sevilla” by Diego Velázquez, 1623).

We recently reported the efficacy of D-MCT compared with an active control intervention (health training) in a randomized controlled trial (RCT) regarding depression, dysfunctional beliefs, and quality of life with medium (to large) effect sizes9, 10, but little is known about the mechanisms of change. We expect that, in line with general principles of CBT, patients benefit through the modification of their cognitive beliefs (i.e., the content of their cognition) as well as by reducing their dysfunctional metacognitive beliefs. It has been shown that depression is associated with dysfunctional metacognitive beliefs11, 12. For D-MCT, we were able to show in an initial study that over the course of the training, dysfunctional cognitive and metacognitive beliefs were reduced at medium effect9. However, due to the study’s design (uncontrolled pilot study, only two assessments), mechanisms of change could not be investigated, and it remains unclear whether change in beliefs represents a cause or a consequence of the improvement in depression. In our recent trial10, we also reported a larger decrease in dysfunctional cognitive beliefs in the D-MCT compared to a control group over time; change in metacognitive beliefs, however, were not reported, and mechanisms of change were not investigated.

The aim of the present study is thus to extend our current understanding by providing a more comprehensive picture. To that end, we analyzed data on change in metacognitive beliefs and reanalyzed previous data from our recent RCT on the effectiveness of D-MCT10 with regard to mechanisms of change. In our effort to develop a more complete understanding, we hypothesized a larger reduction of metacognitive beliefs in the D-MCT compared to the control group. Moreover, in line with cognitive behavioral models of depression1, which assume that maladaptive cognitive beliefs are involved in the development and maintenance of depressive symptoms3, 4, we expected that a better outcome 6 months after D-MCT treatment would be mediated by an improvement in dysfunctional cognitions. To this end, and to conclusively address the specific cognitions that the D-MCT training affects, we explored treatment effects over the course of treatment on two types of dysfunctional cognitions: cognitive beliefs (i.e., the content of cognitions as measured by the Dysfunctional Attitude Scale) and metacognitive beliefs (as assessed by the Metacognition Questionnaire).

Methods

Design

Design of the study has been described elsewhere10. We performed a parallel RCT comparing two group interventions, which were administered in addition to an intensive psychosomatic outpatient treatment program: D-MCT (experimental group) and health training (HT, active control group). Participants were allocated to D-MCT or HT after baseline assessment according to a fixed computer-generated randomization plan with a 1:1 allocation ratio. Patients met with research staff blind to treatment allocation at baseline (t0) as well as 4 weeks (post-treatment, t1) and 6 months later (follow-up, t2) for a face-to-face assessment. To keep raters blind with respect to group allocation (to prevent the Rosenthal effect), patients were not assessed at the same time at which groups met and were instructed (and repeatedly reminded) not to disclose group assignment to the raters. All participants gave written informed consent prior to participation. The research was carried out in accordance with approved guidelines and in accordance with the Declaration of Helsinki and its amendments. The study protocol was approved by the Ethics Committee of the German Psychological Association and was registered at the German Clinical Trials Register (#DRKS00007907, registration date: 2015/04/30).

Participants

Eighty-four patients (62 women and 22 men; age: M = 45.5, SD = 9.89; years of formal school education: M = 10.61, SD = 1.69; number of major depressive episodes (MDE): M = 3.03, SD = 5.69) with a depressive disorder treated at a psychosomatic outpatient day clinic were included. Diagnosis of a current MDE, recurrent depression, or dysthymia was verified using the Mini International Neuropsychiatric Interview (MINI)13. The MINI was slightly modified in order to differentiate recurrent vs. single-episode major depressive disorder as well as double depression. At baseline, 36 patients (43%) were diagnosed with a single MDE, 47 (56%) with recurrent depression, and one with dysthymia (1%). At intake, more than half of the patients (n = 43) met criteria for at least one comorbid disorder, according to the MINI. Demographic data is presented in Table 1. Patients were excluded if they were younger than 18 or older than 65 years of age, did not fulfill criteria for a current or recurrent MDE or dysthymia, suffered from psychotic symptoms (i.e., hallucinations, delusions, or mania), were currently suicidal, or showed signs of intellectual disability (estimated IQ < 70). Use of psychotropic medications was tolerated. Completion rate at t1 was 94.0% and at t2 was 71.4%. Participants who completed the 6-month follow-up did not significantly differ from patients who were lost to follow-up on baseline demographic, psychopathological, or cognitive characteristics.

Table 1 Demographics: Frequencies, Means, and Standard Deviations (in Brackets).

Treatment Conditions

All patients received a standard intensive psychosomatic outpatient treatment for 5 days a week, 8 hours a day. The program consisted of individual therapy sessions, group therapies (e.g., social skills training), occupational therapies, physical training, weekly ward rounds, etc. An extensive D-MCT treatment manual is available in German8; material in other languages, including English, can be downloaded at www.uke.de/depression. The HT group was equivalent to D-MCT in terms of the frequency (eight sessions, two sessions per week), duration of sessions (60 min), and group format. For HT, the weekly format consisted of one group walking session (60 min) and one psychoeducation session on health topics such as stress reduction or nutrition (60 min).

Outcome measures

Two outcomes were investigated10. The primary outcome measure was the total score of the Hamilton Depression Rating Scale (HDRS, 17-item version)14. The secondary outcome measure was self-assessed depression as measured by the well-established Beck Depression Inventory (BDI-I)15.

Potential cognitive mediators

  • The Dysfunctional Attitudes Scale (DAS)16 was used to assess dysfunctional cognitive beliefs (i.e., the content of cognition). The DAS includes 40 items that are rated by participants on a 7-point Likert scale. Reliability (internal consistency ranges between α = 0.88 and 0.94) and validity (correlations with other measures of depression range between r = 0.71 and 0.85) have been documented. In the current study, internal consistency was excellent, with α = 0.93.

  • To assess metacognitive beliefs, the following three subscales of the Metacognitions Questionnaire 30 (MCQ-30)17 were used: (1) Positive Beliefs (PB) about worry (e.g., “Worrying helps me to get things sorted in my mind”, 6 items); (2) Negative Beliefs (NB) about thoughts concerning uncontrollability and danger (e.g., “When I start worrying I cannot stop”, 6 items), and (3) beliefs about the need to control thoughts (Need for Control [NFC], e.g., “Not being able to control my thoughts is a sign of weakness”, 6 items). Only these three subscales were administered as they are most related to depression12 and the two remaining MCQ subscales, “cognitive confidence” and “cognitive self-consciousness”, were less affected by D-MCT in the pilot study9. The MCQ-30 showed good psychometric qualities, with a test-retest reliability of r = 0.75 and internal consistencies ranging from α = 0.72–0.93 for the subscales and 0.93 for the total score in previous studies. In the current study, internal consistencies were acceptable, with α = 0.75 for PB, α = 0.78 for NB, and α = 0.71 for NFC.

Strategy of data analysis

IBM SPSS 22.0 software was used for all analyses. All randomized patients were included in the analyses (intention to treat), and missing outcome data were imputed by the expectation-maximization (EM) algorithm trimmed to fall between the minimum and maximum of possible values. For mediation analysis, D-MCT treatment was coded as 1 and HT treatment as 0. A treatment effect in the analyses thus refers to effects of D-MCT above and beyond HT. To capture the change in depression (HDRS and BDI), standardized residualized change scores using a simple linear regression model in which baseline scores predicted follow-up scores (t0 to t2) were calculated. To capture change in the mediators (DAS and MCQ subscales), standardized residualized change scores were calculated in which baseline scores predicted post-assessment scores (t0 to t1). The subsequent mediation analysis thus determines to what extent the change in depression from t0 to t2, i.e., from baseline to 6-months follow-up, that was brought about by D-MCT above and beyond HT can be explained by change in the mediator variables in the treatment period, i.e., from baseline to end of treatment 4 weeks later (t0–t1). The mediational analyses met all of the criteria for the definition and evaluation of mediators within RCTs as established by Kraemer and colleagues18.

The analysis was conducted using an SPSS macro PROCESS developed by Hayes (version v2.13 downloaded 3 March 2015)19. The analysis allows delineating the effects of each of the proposed mediators separately while controlling for the others20. It thus presents a rather conservative test for estimating individual mediation effects. To correct for potential biases of non-normality in the sample, we bootstrapped results 1,000 times. When the effect range (LL = lower limit to UL = upper limit) of the 95% confidence interval (CI) does not include zero, the null hypothesis is considered rejected. When the 95% CI does include zero, the mediation hypothesis is considered rejected. Following Preacher and Kelley21, we additionally computed κ2 for significant mediators as effect size. Statistically, this is currently only possible without the consideration of the simultaneous effects of other covariates (i.e., mediators). Respective effect sizes can be interpreted in analogy to Cohen’s r 2, that is, 0.01, 0.09, and 0.25 constitute small, medium, and large effects, respectively. Effect sizes for ANOVAs are reported according to Kinnear and Gray22 with ηp 2 ≈ 0.01, ηp 2 ≈ 0.06, and ηp 2 ≈ 0.14, corresponding to small, medium, and large effects, respectively.

Results

Groups were similar on psychopathological and sociodemographic data as well as medication (see Table 1). Mean scores of the three MCQ subscales are presented in Table 2 (for DAS results, see Jelinek10), showing greater reduction on all three subscales in the D-MCT group in comparison to the HT group at a medium to large effect size (ηp 2 = 0.048 to 0.101). Zero-order correlations between all involved variables over the different time points (t0, t1, t2) are displayed in Table 3. Correlations were highest within the same constructs over time (e.g., DAS t0 with DAS t1) but, as expected, the cognitive (DAS) and metacognitive (MCQ) scales were also significantly correlated cf.,12.

Table 2 Group Comparisons (Intention to Treat Sample) and Results of the ANCOVAs for Metacognitive Beliefs for Pre- and Post-Assessment.
Table 3 Zero-Order Correlations Between Cognitive (DAS) Metacognitive Beliefs (MCQ) for Pre- and Post-Assessment.

Mediation was tested with regard to clinician-assessed (HDRS) and self-assessed (BDI) depression (Table 4). The results related to both outcomes correspond: Whereas treatment with D-MCT elicited improvement on all four (sub)scales beyond HT, only the change in NFC evoked long-term changes in depression (i.e., it is the only significant mediator). For the HDRS, the indirect effect of treatment on change in depression via NFC is 0.15 (SE = 0.10; BootLLCI = 0.02, BootULCI = 0.44; κ2 = 0.09), with a remaining significant direct effect of treatment on change in depression of 0.64 (SE = 0.21; LLCI = 0.21, ULCI = 1.06). For the BDI, the indirect effect of treatment on change in depression via NFC is 0.22 (SE = 0.11; LLCI = 0.05, ULCI = 0.52; κ2 = 0.13), with a remaining direct effect of treatment on change in depression of 0.59 (SE = 0.19; LLCI = 0.21, ULCI = 0.98). For the remaining but insignificant indirect effects of the other potential mediators that were explored, please see Table 5.

Table 4 Mediation Analysis Part 1: Effects of Treatment on Mediators, and Effects of Mediators (Including Treatment) on Both Outcomes.
Table 5 Mediation Analysis Part 2: The Indirect Effects of the Potential Four Mediators on Both Outcomes Brought About by the Treatment.

Discussion

Our study provides initial insight into how D-MCT exerts its effects on depression. As predicted, D-MCT elicited improvement in cognitive (i.e., content of cognition) as well as metacognitive beliefs across treatment beyond the control treatment (HT). However, only one of the potential (meta)cognitive processes, NFC, acted as a mediator between treatment and depression decline. Moreover, only partial mediation with medium effect occurred, and thus other processes seem to be involved. The lack of a mediating role of the other two meta-cognitive subscales (NB, PB) is at first surprising. Although they were significantly reduced, with at least medium effect size over treatment in the D-MCT group, they did not seem to work as mediators, which invites further consideration. Both subscales (NB, PB) are primarily concerned with “worry” in the form of perseverative negative thinking, which has been traditionally implicated in anxiety but more recently also in depression23. In fact, all MCQ items were first developed for aspects of anxiety. However, only change in NFC worked as a mediator, and it worked even better than change in dysfunctional cognitive beliefs as measured by the DAS, an instrument that has quite reliably indicated the association between decrease of dysfunctional beliefs and improvement of depression24. Although D-MCT largely overlaps with classic CBT (which mainly aims at the modification of the ‘content’ of dysfunctional thoughts), it directly addresses the modification of dysfunctional metacognitive beliefs. A core theme of all the Metacognitive Trainings developed by Moritz is that everybody’s cognitions are prone to biases (“to err is human”). This is potentially (best) picked up by the MCQ subscale NFC, which, in consequence, is the most important mediator of change.

Because we did not use all scales of the MCQ in the present study, it remains to be shown whether the MCQ scales “cognitive confidence” (CC) and “cognitive self-consciousness” (CSC) may also act as mediators. Whereas CC has been found to predict outcome in other research25,26,27 and is generally relevant to depression, items of the CC subscale of the MCQ do not capture the content of D-MCT in this regard and instead mainly relate to subjective memory problems. To be more specific, the CC scale includes items such as “I have little confidence in my memory for actions/places/words and names” as well as “My memory can mislead me at times”. In D-MCT, module #2 is primarily concerned with memory and attention in depression. However, in this module patients learn (through exercises) about the origin of memory deficits in depression (poor selective attention, for example, due to rumination) and mood-congruent recall as well as false memories. Accordingly, they are taught that memory in general does not work passively like a video camera but is prone to biases. Although we assume that cognitive confidence in general would increase as a result of D-MCT, it is unlikely to be captured by the items of the CC subscale of the MCQ and there may even be a decline in this scale (despite successful therapy). We therefore consider the CC scale inadequate for measuring the success of the D-MCT. However, this remains to be shown in future studies.

The metacognitive model by Wells conceptually argues that the relationship between metacognitive beliefs and depression is mediated by rumination. Extending this idea to our present empirical results, this could mean that NFC may underlie the use of maladaptive coping strategies (e.g., rumination). However, use of coping strategies was not measured, so it remains to be tested whether the change in metacognitive beliefs directly impacts depression at the same time as coping strategies or whether they work in sequence, i.e., a change in metacognitive beliefs impacts depression through a change in coping (including increased activation/activities or social relations). Still, the present study is the first to empirically address the cognitive mechanisms of change in D-MCT. It specifically suggests that D-MCT’s ‘active essence’ is the encouraging of patients to disengage from dysfunctional metacognitive beliefs. This may ultimately enable patients to refrain from engaging in unhelpful cognitive strategies as well as behaviors arising from their former metacognitive beliefs. However, to fully understand whether a mediator is specific to D-MCT, it is necessary to compare D-MCT mediators to those of another active treatment, for example, to a CBT group approach.

The current study has several strengths (RCT, blinded assessors, etc.), but naturally it is not without limitations. First, the HDRS mean score was 15.48 points, indicating that the patients had mild depression, and thus it is unclear whether the results also apply to more severely depressed patients. On the other hand, patients had experienced on average three depressive episodes and were hospitalized for the second time10, which may indicate that their current depressive symptoms might have underplayed the general severity of their disorder. Moreover, scores of the assessed MCQ subscales were largely comparable to those of other, more severely depressed patients28,29,30.

Second, we did not assess the inter-rater reliability of clinical interviews in the present study (MINI, HDRS). Because all patients were also pre-diagnosed by a clinician, this may have mainly affected HDRS scores. However, because each patient was assessed by the same rater over all three assessment points, the presented results should not be affected. Third, to verify the proposed causal relationship, more intermediate assessments (e.g., session by session) would be desirable in future studies, as well as the manipulation of potential mediators. Fourth, changes in dysfunctional cognitive and metacognitive beliefs were only assessed by the use of self-report instruments, and these instruments, as discussed, did not cover the entire range of the D-MCT. As a result, and as suggested by the partial mediation, important mediators may still exist, so it would be desirable to cross-validate the results, for example, with other measures such as implicit tests24.

In summary, our findings suggest that the key mechanisms of change in D-MCT are metacognitive beliefs, particularly NFC. As such, the current study provides further support for the importance of addressing metacognitive beliefs in the treatment of depression.