Introduction

Anxiety as well as trauma- and stressor-related disorders such as posttraumatic stress disorder (PTSD) represent the most common mental disorders (Kessler et al, 2005) leading to tremendous individual and socioeconomic challenges. Exposure therapy incorporated in cognitive behavioral therapy (CBT) constitutes a highly effective treatment approach in these disorders, in particular in phobias, but without being capable of preventing relapses completely (Boschen et al, 2009; Bouton, 2002). One main mechanism of exposure therapy is supposed to rely on the extinction of conditioned fear (Craske et al, 2008; Graham and Milad, 2011). Extinction creates a new inhibitory memory trace competing with the original fear memory trace in a context-dependent way (Bouton, 2004; Vervliet et al, 2013). Thus, the extinction memory trace might be well recalled in the extinction/exposure context but not necessarily in the original acquisition context or completely new contexts. Relapses could therefore develop because of difficulties in extinction memory recall and/or its (long-lasting) transfer beyond the therapy context (Boschen et al, 2009; Vervliet et al, 2013).

Therapy success of exposure therapy might be improved using the stress hormone cortisol that has been extensively shown to modulate emotional learning and memory processes: cortisol inhibits retrieval but enhances consolidation of declarative memories (Schwabe et al, 2012). Indeed, preexposure cortisol application enhanced therapy success in patients with spider, social, or height phobia, also in the long run (de Quervain et al, 2011; Soravia et al, 2006). Corresponding to the literature on declarative memory, it has been suggested that cortisol application reduces fear recall during the encounter of the feared object or situation in exposure therapy and simultaneously facilitates the consolidation of the newly learned information (less phobic fear) enabling an enhanced extinction memory recall at a later point in time (de Quervain and Margraf, 2008; de Quervain et al, 2017). Translating these effects to fear extinction, only one human study so far reported preextinction stress to reduce fear recall (evident in expectancy of the dangerous unconditioned stimulus) at the beginning of extinction and a test session one day later (Bentz et al, 2013).

The crucial brain region involved in these beneficial effects of stress hormones on phobic fear processing seems to be the amygdala (Nakataki et al, 2016) that acts as the central player for the acquisition of fear and extinction memories (Quirk and Mueller, 2008). In addition, excitatory input from the (dorsal) anterior cingulate cortex (ACC) and inhibitory input from the ventromedial prefrontal cortex (vmPFC) modulate the expression of fear and extinction memories via the amygdala (Graham and Milad, 2011; Dejean et al, 2015). Importantly, this circuit receives contextual information from the hippocampal complex as shown by converging evidence from rodent and human studies (Maren et al, 2013; Milad et al, 2007; Hermann et al, 2016).

In the present functional magnetic resonance imaging (fMRI) study, we investigated the neural underpinnings of preextinction cortisol administration on the extinction circuit (taking place one day after fear acquisition) and its context dependency. Crucially, the transfer of the assumed cortisol-driven enhanced extinction memory established during extinction learning was tested one week later during recall in the extinction context as well as in a new context. In addition, we explored the modulation of context-dependent fear responding toward a nonextinguished fear stimulus by cortisol. Taken together, this approach provides a valuable model for relapses including context- and stimulus-dependent generalization processes.

Materials and methods

Participants

In total, 48 healthy male participants were recruited at the local university. Persons reporting MRI exclusion criteria, chronic or acute illnesses, color blindness, regular intake of medicine, current medical or psychological treatment, drug use, age <18 or >35 years, and body mass index (BMI) <18 or >27 kg/m2 were not eligible for participation. Right-handedness was assured by the Edinburgh Inventory of Handedness (Oldfield, 1971). All participants had normal or corrected-to-normal vision. They were instructed to refrain from physical exercise and consumption of food and drinks (except water) 2 h before all testing sessions.

After exclusion of all persons not completing all three experimental days, not acquiring contingency awareness during fear acquisition or showing excessive head motions during scanning, 40 participants remained in the final sample. All participants were reimbursed with 10€/h for their participation. All procedures were conducted in agreement with the Declaration of Helsinki and approved by the ethics committee of the Faculty of Medicine of the Ruhr-University Bochum (registration number: 4571-13).

Stimulus Material

Stimuli and procedure were adopted from Milad and colleagues (2007) as implemented before (Hermann et al, 2016). Pictures of an office room, a room with a shelf, and a conference room served as contexts that were assigned as contexts A, B, or C by means of permutation. The same desk lamp was included in each of the contexts lighting up in either red, blue, or yellow serving as three conditioned stimuli (CS). The experiment was realized with the Presentation software package (Neurobehavioral Systems, Albany, CA) and presented via fMRI-ready goggles (Visua-Stim Digital; Resonance Technology, Northridge, CA).

A constant voltage stimulator (STM200; BIOPAC Systems, Goleta, CA) provided transcutaneous electrical stimulation serving as the unconditioned stimulus (UCS). Application of the UCS was realized using two electrodes (surface size: 1 cm2) attached to the fingertips of the second and third fingers of the right hand. The intensity of the electrical stimulation was set individually to be ‘unpleasant but not painful’ by use of a gradually increasing rating procedure.

Procedure

Individual sessions were conducted in the afternoon (between 1230 and 1900 h) on 2 consecutive days for acquisition and extinction and one week later for recall. The duration of experimental sessions was 60 min on days 1 and 2, respectively, and 90 min on day 3. After arrival on day 1, participants confirmed understanding of the procedure and gave written informed consent, filled out questionnaires on demographic variables, and were tested for red–green color blindness using five Ishihara plates (selected from Ishihara, 1990). Before the start of each experimental phase, they were instructed to watch the stimulus presentation with the goal of observing any possible regularity in the occurrence of lamplight colors and electrical stimulation. They were informed that should they discover such a relationship, it would remain stable in all experimental phases (cf, Hermann et al, 2016). This instruction was used to facilitate learning of contingencies (a prerequisite for studying extinction memory recall) and to avoid participants expecting a complete reversal of contingencies during extinction training (ie, expecting stimulation to occur after CS− presentations). However, note that participants were not informed about the actual CS–UCS contingencies.

During all experimental phases, the trial structure was identical for all CS types (except for trials with UCS presentation). After an initial presentation of a black screen with a white fixation cross (duration jittered between 0 and 1.875 s), the context without a CS (turned-off lamp) was presented for a duration of 3 s. The CS (lamp within the context picture lighting up in either red, blue, or yellow for the three CS types respectively) was presented for 6 s. During reinforced CS+ trials, the UCS (100 ms) was delivered immediately after the offset of the CS. A white fixation cross on a black background was shown from CS offset until the start of the next context presentation for 9.125–11 s (total trial duration: 20 s).

Acquisition training took place in context A on day 1, extinction training in context B on day 2, and recall was tested in contexts B and C on day 3. During acquisition training (total duration: 9 min), two separate CS+ (CS+E and CS+U (see below); eg, red and yellow lights) were shown eight times each, and both CS+ were paired with the UCS in five out of eight trials (62.5% partial reinforcement rate). A third CS (CS−; eg, blue light) was never paired with the UCS and shown eight times.

During extinction training (total duration: 12 min), the CS+E (extinguished) was shown 16 times without subsequent UCS presentation to extinguish the conditioned fear response. Sixteen CS− trials were presented intermixed with the 16 CS+E trials, the CS+U (unextinguished) was not shown. During recall (total duration: 17 min), all three CS were presented eight times each in context B as well as in a new context C (within-subjects design) in two blocks with four trials of each CS–context combination, respectively. The first six CS trials always consisted of all three CS in both contexts with a permuted allocation of the first CS. After recall, a reinstatement test was included that is not central to the present research question (see Supplementary Information for further details and Supplementary Figure S1 for a scheme of the paradigm).

Pseudo-randomized stimulus orders were used for all phases, in which no more than two consecutive presentations of the same CS and no more than three consecutive presentations of a CS+ were allowed. During acquisition training, the first and the last three trials consisted of one CS−, one reinforced CS+E and one reinforced CS+U trial, respectively, whereas the other trials were arranged in three blocks with two trials of each CS (one of the CS+ was not reinforced in each block). During extinction training, the first and last two trials consisted of one CS+E and one CS− trial, respectively. In addition, up to three consecutive presentations of the same context were allowed during recall. Individual stimulus presentation orders were used for each participant of the cortisol group and the identical (matched) orders for the placebo group.

Cortisol Administration, Saliva Sampling, and Analysis

In a double-blind, randomized design, 20 men were administered three 10 mg tablets of cortisol (hydrocortisone; Jenapharm, Germany) 50 min before the start of the functional scans for extinction on day 2. Visually identical placebos (tablettose and magnesium) were given to the remaining 20 men. In order to assess cortisol concentrations, saliva samples (Salivette sampling devices; Sarstedt, Nümbrecht, Germany) were collected directly before tablet intake (baseline), as well as 40 and 70 min after tablet intake (5 min before and immediately after extinction training). In addition, saliva samples were taken at baseline and after the scanning sessions on day 1 (25 min before and immediately after acquisition training) and day 3 (20 min before recall and immediately after the reinstatement test; cf, Supplementary Figure S1 in the Supplementary Information). Saliva samples were stored at −20 °C until assayed using a Dissociation-Enhanced Lanthanide Fluorescent Immunoassay as described previously (Dressendörfer et al, 1992) determining the fraction of free unbound salivary cortisol. The detection limit was 0.5 nmol/l, and inter- and intra-assay coefficients of variance were below 9.0% and 6.7%, respectively.

Analyses of Skin Conductance Responses

Details concerning sampling of skin conductance responses (SCRs) can be found in the Supplementary Information. Statistical comparisons of mean SCRs were conducted separately for acquisition training, extinction training, and recall via analysis of variance (ANOVA). We focused on conditioned responding (CS+E vs CS−) to emphasize learning-related contextual responding to extinguished stimuli and added the corresponding analyses regarding the CS+U to analyze stimulus generalization. For acquisition and extinction training, the within-subjects factors CS (CS+E/CS+U vs CS−) and block (each comprising four trials of each CS) as well as the between-subjects factor treatment (placebo vs cortisol) were entered. For recall, the first block was analyzed separately for contexts B and C with the factors CS and treatment. We focused on this early recall phase to avoid reextinction processes most likely occurring afterward.

Statistical analyses were performed in IBM SPSS Statistics for Windows 20.0 (Armonk, NY) with Greenhouse–Geisser correction if needed. The statistical significance level was set to p<0.05 (trends are reported up to p<0.077), and effect sizes are reported as η2p.

fMRI Analyses

Details concerning data acquisition of brain images, preprocessing, first-level model, and regions of interests (ROIs) can be found in the Supplementary Information. On the second level, random effects group analyses were conducted in Statistical Parametric Mapping (SPM8; Wellcome Department of Cognitive Neurology, London, UK) implemented in MatLab R2012a (Mathworks, Sherborn, MA) using two-sample t-tests. Furthermore, the flexible factorial design was used for analyses of time-dependent changes during extinction training.

Functional coupling of the extinction network was analyzed by conducting psychophysiological interaction (PPI) analyses for each subject for early (first block) extinction and early extinction recall. These analyses explore the functional connectivity between a seed region (volume of interest; 5 mm sphere around the corresponding peak voxel derived from results of the between-group comparisons) and other brain areas in interaction with the experimental task (CS+E/CS+U vs CS−).

Regarding exploratory whole-brain analyses, the significance threshold was set to p<0.05 on voxel-level corrected for multiple testing (family-wise error (FWE) correction); the minimal cluster size (k) was 20 voxels. For the ROI analyses, the significance threshold was set to p<0.05 on voxel level, corrected for multiple testing within each ROI (FWE-corrected; using the small volume correction option of SPM8). Results for neural activation (but not functional connectivity) are reported up to pcorr.<0.10.

Results

Participants’ Descriptive Data

No significant differences regarding age or BMI were found between participants in the placebo (mean age±SD: 24.8±4.2 years; BMI: 24.1±2.1 kg/m2) compared with the cortisol group (age: 24.1±2.9 years; BMI: 23.9±2.3 kg/m2; both p>0.51).

Cortisol Concentrations

Cortisol concentrations changed because of administration of 30 mg hydrocortisone after the baseline sample on day 2 (time × treatment interaction: F(1.1, 43.6)=16.24, p<0.001, ηp2=0.299; main effect time: F(1.1, 43.6)=15.92, p<0.001, ηp2=0.295; main effect treatment: F(1, 38)=49.16, p<0.001, ηp2=0.564; Table 1). Although no differences were evident in the baseline sample (T(38)=0.38, p=0.71), cortisol concentrations were significantly elevated before (T(19.0)=4.79, p<0.001, ηp2=0.376) and after extinction (T(19.0)=8.91, p<0.001, ηp2=0.676) in the cortisol compared with the placebo group. Neither on day 1 (fear acquisition) nor on day 3 (recall) any differences between groups or between sampling times were observed (all main and interaction effects: F<3.96; p>0.05).

Table 1 Mean (SD) Cortisol Concentrations (in nmol/l) Are Depicted for All 3 Experimental Days

Fear Acquisition in Context A

Successful fear acquisition was indicated by a significant differentiation between both CS+ (CS+E/CS+U) and the CS−, developing over time (CS+E vs CS−: interaction CS × block: F(1, 36)=24.40, p<0.001, η2p=0.404; main effect CS: F(1, 36)=26.69, p<0.001, η2p=0.426; CS+U vs CS−: interaction CS × block: F(1, 36)=9.56, p=0.004, η2p=0.210; main effect CS: F(1, 36)=29.17, p<0.001, η2p=0.448; main effect block: F(1, 36)=5.12, p=0.030, η2p=0.124; Figure 1). On the neural level, enhanced differential conditioned activation was observed in ACC, right anterior parahippocampal gyrus, left insula, left precentral gyrus, and at a trend level in the left amygdala for the combined CS+ compared with the CS− (Supplementary Table S1 in the Supplementary Information). For both measures, no significant main or interaction effects with treatment were observed.

Figure 1
figure 1

Mean (±SEM) skin conductance responses (SCRs) are depicted for CS+E (CS+ extinguished), CS+U (CS+ unextinguished), and CS− during acquisition in context A on day 1, extinction in context B on day 2, and recall in contexts B and C that were presented in an intermixed manner on day 3 after one week. Separate charts are outlined for the placebo (upper row) and cortisol groups (lower row); cortisol administration took place before extinction training. Corresponding to the statistical analyses, all blocks comprise four CS trials. For recall, the first (early) and second (late) blocks are depicted separately for contexts B and C. After successful fear acquisition, conditioned responding was reduced at the beginning of extinction training (first and second blocks) in the cortisol group (p<0.05). For recall in context B, the cortisol group exerted generally reduced SCRs over the first and second blocks compared with the placebo group. When SCRs toward the CS+E and CS− were analyzed between early recall in context B and late extinction training (last block of four trials), the cortisol group also exerted overall reduced SCRs compared with the placebo group. SCRs during recall in context C were not subject to a modulation by cortisol.

PowerPoint slide

Extinction Training in Context B

Differential conditioned responding declined during extinction training (interaction CS × block: F(2.6, 97.9)=11.76, p<0.001, η2p=0.236; main effect CS: F(1, 38)=15.75, p<0.001, η2p=0.293; main effect block: F(1.6, 60.8)=27.03, p<0.001, η2p=0.416). Importantly, cortisol time-dependently diminished conditioned responding (interaction CS × block × treatment: F(2.6, 97.9)=3.10, p=0.037, η2p=0.075) that was evident in less differential SCRs in the first and second blocks of extinction training in the cortisol compared with the placebo group (both p<0.05; Figure 1), but not in the third and fourth blocks (both p>0.14).

Accordingly, this cortisol effect was also found on the neural level: cortisol significantly altered the time course of extinction learning in the bilateral amygdala, the right anterior parahippocampal gyrus, right hippocampus, and at trend level in the left insula. The post hoc analyses revealed reduced activations in the bilateral amygdala, right anterior parahippocampal gyrus, and right hippocampus during the first block of extinction training in the cortisol compared with the placebo group (Table 2 and Figure 2a). In addition, cortisol heightened the functional connectivity of the right anterior parahippocampal gyrus with the vmPFC during the first block of extinction training compared with placebo (x=−2, y=44, z=−2, Zmax=3.40, pcorr.=0.009; Figure 2b). The post hoc analyses revealed that the cortisol group displayed increased functional connectivity between the right anterior parahippocampal gyrus and the vmPFC (x=2, y=38, z=2, Zmax=3.32, pcorr.=0.012). An additional trend was found in the placebo group showing decreased functional connectivity between the right anterior parahippocampal gyrus and the vmPFC (x=4, y=42, z=−4, Zmax=2.69, pcorr.=0.068).

Table 2 Activation Differences for the Contrast CS+E (CS+ Extinguished) Minus CS− Are Displayed for the Time-Dependent Cortisol Effects on Extinction (A) and Group Differences for the First Block of Extinction Training (B)
Figure 2
figure 2

Neural activation and functional connectivity at the beginning of extinction for the contrast CS+E (CS+ extinguished) minus CS−. Data are illustrated with (a) F4.0 or (b) T2.0 (see color bar for exact F- and T-values). The depicted slices were selected according to the reported activations in the left amygdala (y=0), the right amygdala/hippocampus/anterior parahippocampal gyrus (y=−4), and the ventromedial prefrontal cortex (vmPFC; x=−2). Mean (±SEM) contrast estimates are displayed in the respective peak voxels in the bar graphs separately for the placebo and the cortisol groups. L=left, R=right. (a) Cortisol administration reduced differential neural activation at the beginning of extinction in the amygdala, hippocampus, and anterior parahippocampal gyrus. (b) Furthermore, cortisol enhanced the functional connectivity between the anterior parahippocampal gyrus and the vmPFC.

PowerPoint slide

Recall in Context B

Conditioned SCRs significantly recovered for the CS+E (main effect CS: F(1, 36)=4.94, p=0.033, η2p=0.121) as well as for the CS+U (main effect CS: F(1, 36)=4.70, p=0.037, η2p=0.115) in comparison with the CS−. However, no group differences occurred for SCRs in context B (all p>0.12; Figure 1). But extending these analyses regarding early recall to the whole recall phase, the cortisol group exhibited reduced overall SCRs compared with the placebo group (CS+E vs CS−: main effect treatment: F(1, 36)=6.07, p=0.019, η2p=0.144; CS+U vs CS−: main effect treatment: F(1, 36)=3.33, p=0.076, η2p=0.085; main effect CS: F(1, 36)=5.05, p=0.037, η2p=0.123). An additional comparison between conditioned SCRs during early recall in context B and during late extinction training (fourth block) revealed higher SCRs toward the CS+E compared with the CS− (main effect CS: F(1, 36)=5.80, p=0.021, η2p=0.139). Besides, a trend toward overall lowered SCRs in the cortisol group compared with the placebo group was found (main effect treatment: F(1, 36)=3.48, p=0.070, η2p=0.088). No further main or interaction effects emerged, in particular no CS × block interaction. The missing interaction indicates successful recall of the extinction memory, as there was no increase in conditioned SCRs from late extinction training to early recall in context B.

In addition, the cortisol group showed increased differentiation between CS+E and CS− in the right hippocampus (x=22, y=−26, z=−10, Zmax=3.55, pcorr.=0.042; Figure 3a) and enhanced functional connectivity of this right hippocampal activation peak with the vmPFC (x=−4, y=38, z=0, Zmax=2.95, pcorr.=0.028; Figure 3b) during early recall. The post hoc analyses revealed that this result was driven by the cortisol group that exerted increased functional connectivity between the right hippocampus and the vmPFC (x=−4, y=38, z=0, Zmax=3.02, pcorr.=0.026). For the contrast CS+U minus CS−, cortisol reduced neural activation in the right insula (x=36, y=20, z=−8, Zmax=3.88, pcorr.=0.026; Figure 3c) during early recall. Extending these analyses to the whole recall phase, the reported structures were no longer modulated by cortisol administration.

Figure 3
figure 3

Neural activation and functional connectivity at the beginning of recall in context B for the contrast CS+E (CS+ extinguished) minus CS− as well as neural activation for the contrast CS+U (CS+ unextinguished) minus CS−. Data are illustrated with T2.0 (see color bar for exact T-values). The depicted slices were selected according to the reported activations in the right hippocampus (y=−26), the ventromedial prefrontal cortex (vmPFC; x=−4), and the right insula (y=20). Mean (±SEM) contrast estimates are displayed in the respective peak voxels in the bar graphs separately for the placebo and the cortisol groups. L=left, R=right. (a) Cortisol administration enhanced CS+E/CS− differentiation activation at the beginning of recall in context B in the hippocampus and (b) its functional connectivity to the vmPFC. (c) In the contrast CS+U minus CS−, cortisol reduced neural activation in the right insula.

PowerPoint slide

Recall in Context C

Conditioned SCRs significantly recovered for the CS+E (main effect CS: F(1, 36)=15.66, p<0.001, η2p=0.303) as well as for the CS+U (main effect CS: F(1, 36)=15.80, p<0.001, η2p=0.305) compared with the CS− during early recall. An additional comparison between conditioned SCRs (CS+E vs CS−) during early recall in context C and during late extinction training (fourth block) revealed that conditioned SCRs (CS+E vs CS−) tended to be higher during early recall in context C compared with late extinction training in context B (interaction CS × block: F(1, 36)=3.48, p=0.070, η2p=0.088; main effect CS: F(1, 36)=10.14, p=0.003, η2p=0.220; main effect block: F(1, 36)=4.14, p=0.049, η2p=0.103), indicating fear renewal. No further main or interaction effects emerged; importantly, no group differences occurred for SCRs and on the neural level in context C (all p>0.24; Figure 1).

Whole-brain analyses did not result in any significant effect for all contrasts.

Discussion

Cortisol administration before extinction training diminished conditioned fear responses during extinction learning (evident in SCRs and on the neural level). In addition, cortisol altered neural correlates of conditioned responding during recall one week later, also extending to a nonextinguished fear stimulus. However, this cortisol effect did not generalize to a new, unknown context. Overall, these results for the first time provide new insights into how stress hormones influence extinction processes in the long term and demonstrate corresponding boundary conditions.

In line with literature on stress hormone effects on declarative memory (Schwabe et al, 2012) and on exposure therapy (de Quervain et al, 2011; Soravia et al, 2006), we could show that cortisol reduced conditioned SCRs and neural activation in the amygdala–hippocampal complex and enhanced the related functional connectivity to the vmPFC during early extinction learning pointing to inhibited fear recall after cortisol administration. These interactions between amygdala, hippocampus, anterior parahippocampal gyrus, and vmPFC critically reflect the balance between processes conveying fear and extinction memories. All these structures express receptors at which cortisol binds (Joëls and Baram, 2009), hence a modulation of (emotional) learning and memory processes is clearly conceivable (Schwabe et al, 2012). For example, in the rodent model, systemic or intraamygdala stimulation or blockade of cortisol receptors led to facilitated or impaired fear extinction (Ninomiya et al, 2010; Yang et al, 2006, 2007). In men, cortisol administration or stress induction also reduced neural activation in the amygdala–hippocampal complex during fear acquisition and reward anticipation (Kinner et al, 2016; Merz et al, 2012, 2013). Seemingly, cortisol exerts comparable effects on negative as well as on positive emotional processing opening the adjunct treatment application of cortisol to new avenues: not only patients with phobias (de Quervain et al, 2011; Soravia et al, 2006) or PTSD (Aerni et al, 2004; Schelling et al, 2004; Yehuda et al, 2015) might profit from cortisol administration or high cortisol concentrations due to the circadian rhythm (Lass-Hennemann and Michael, 2014), but also patients with disorders related to aberrant reward processing such as addiction as recently shown in heroin addicts (Walter et al, 2015).

At one week after extinction, we found extinction recall in the extinction context to be changed after cortisol administration as seen in attenuated overall SCRs as well as increases in differential conditioned hippocampal activation and its functional connectivity to the vmPFC. These findings might result from altered consolidation processes taken place after corrected information was acquired during extinction training (cf, Bentz et al, 2013). A concerted action of the hippocampus and vmPFC during extinction recall might be associated with activation of the extinction memory trace and the suppression of the original fear memory trace. This is in line with the proposed role of the hippocampus in conveying contextual information and that of the vmPFC in inhibiting conditioned fear responses via projections to inhibitory intercalated cells in the amygdala (Dejean et al, 2015; Maren et al, 2013; Milad and Quirk, 2012; Quirk and Mueller, 2008). Furthermore, patients with anxiety disorders or PTSD show decreased activation and volumes of the hippocampus and vmPFC (Etkin and Wager, 2007; Graham and Milad, 2011; Greco and Liberzon, 2016) pointing to their essential roles in adequately modulating fear responses.

Furthermore, cortisol reduced insula activation to the CS+U during recall in context B. The insula is connected to the amygdala and the ACC (Augustine, 1996), involved in the identification of emotionally important information (Phillips et al, 2003) and the conscious anticipation of threat-related stimuli (Mechias et al, 2010; Straube et al, 2007). Importantly, the CS+U was not presented during extinction training, and thus cortisol could not exert a direct effect on CS+U processing. It might be assumed that cortisol effects on the recall of extinguished vs unextinguished conditioned responses rely partly on distinct mechanisms.

Cortisol administration before extinction training solely influenced neural activation during early recall, whereas SCRs were generally (irrespective of CS type) reduced only when the whole recall phase was taken into account or when SCRs during early recall were compared with late extinction training (for CS+E vs CS−). These additional analyses at least give a first hint for reduced overall arousal in the cortisol group during recall. Regarding the whole recall phase, it should be noted that reextinction processes might come into play; hence, the observed SCR results cannot be solely attributed to a pure recall effect. Regarding the potentially beneficial effect of cortisol on reextinction, future studies need to test the enduring effects by adding an additional recall phase after reextinction has taken place. Altogether, the interpretation of the recall results should be treated with caution and clearly needs further exploration.

Preextinction cortisol application did not release the extinction memory from its context dependency, as cortisol did not modulate conditioned responding in the new context. Indeed, cortisol diminished SCRs and activation of the amygdala–hippocampal complex during extinction learning and changed extinction recall probably by altering consolidation of the extinction memory. However, these cortisol effects were restricted to the extinction context. For fear extinction as well as extinction of predictions in a neutral learning task, postextinction stress (thus exclusively influencing consolidation) enhanced the context dependency of extinction memories (Hamacher-Dang et al, 2013, 2015) corresponding to our present results. Hence, the exact timing (pre- vs post-extinction stress), memory domain (declarative memory vs fear conditioning), and experimental manipulation (psychosocial stress vs cortisol administration) are crucial factors to consider in future studies. Altogether, our findings complement and extend the proposed model of how exposure therapy benefits from prior cortisol administration (de Quervain and Margraf, 2008; de Quervain et al, 2017) with the corresponding neural underpinnings as well as with stimulus-specific and context-dependent processes.

A necessary prerequisite for cortisol effects on extinction learning to occur seems to include a consolidation phase between acquisition and extinction, as previous work reported cortisol administration directly after acquisition and before extinction to impair extinction learning (Merz et al, 2014). In light of the consolidation hypothesis (McGaugh, 2000), the critical time window of when and how exactly cortisol modulates new (emotional) memories still needs to be determined. Accordingly, time-dependent cortisol effects targeting genomic vs nongenomic pathways in the modulation of the amygdala–hippocampus–cortical network should be considered as future research avenues (cf, Henckens et al, 2010, 2011, 2012). Furthermore, the timing between extinction and recall test was set to one week in this experiment to more adequately model long-term processes occurring during exposure therapy. This timing might also have led to a stronger renewal in context C resulting in enhanced conditioned SCRs toward the CS+E, comparable to SCRs towards the CS+U (cf, Figure 1). Future studies should elucidate whether the same results occur with a different timing of the recall test relative to extinction training also in order to capture long-lasting effects of cortisol on emotional memory. In addition, extinction training took place one day after acquisition training, thus representing delayed extinction that has been associated with attenuated conditioned responding during subsequent recall compared with immediate extinction, in particular in designs with a contextual component (eg, Merz et al, 2016; for a review: Maren, 2014). Hence, we cannot exclude potential floor effects interfering with the possibility to detect between-group differences in SCRs in the current design. Besides, SCR analyses were conducted using the same strategy as fMRI analyses, combining the first four trials during recall in one block. Although this strategy is necessary for adequate fMRI analyses, it might not capture more transient effects occurring in SCRs, partly being identifiable only during the first recall trial (cf, Lonsdorf et al, 2017). Altogether, neural and behavioral findings during recall in this study did not correspond completely, complicating a clear interpretation of the neural findings on day 3.

In conclusion, cortisol administration before extinction training influences subsequent extinction learning and context-dependent fear and extinction recall: on one side, cortisol decreased conditioned fear at the beginning of extinction training; on the other side, cortisol seems to influence extinction memory consolidation resulting in a neural activation pattern one week later, most likely reflecting stronger inhibitory processes, but without generalizing to the behavioral level. Neural underpinnings of these effects comprise key players of the fear and extinction network and their interplay: amygdala, hippocampal complex, and vmPFC. We provide first evidence that cortisol appears to also influence a nonextinguished stimulus but only within the extinction context. Thus, contextualization of extinction memories still comprises a challenge to be faced in order to further enhance treatment success of CBT.

Funding and disclosure

This work was supported by a grant from the German Research Foundation (DFG; ME 3831/4-1 to CJM and HE 7013/1-1 to AH). The DFG had no further role in study design, collection, analysis and interpretation of data, in the writing of the manuscript, and in the decision to submit the paper for publication. The authors declare no conflict of interest.