Abstract
Anhedonia, a pronounced reduction in interest or pleasure in any of life’s daily activities, is a cardinal symptom of major depression. In this Perspective article, we synthesise the recent evidence from rodent, monkey and human neuroimaging literature to highlight how the habenula, a small evolutionarily conserved subcortical structure located in the midbrain, may orchestrate the behavioural expression of anhedonia across fronto-mesolimbic networks. We then review how this circuitry can be modulated by ketamine, an NMDA receptor antagonist with rapid antidepressant properties. We propose that experimental paradigms founded in reinforcement learning and value-based decision-making can usefully probe this network and thereby help elucidate the mechanisms underlying ketamine’s rapid antidepressant action.
Introduction
The discovery that a sub-anaesthetic dose of ketamine can have rapid antidepressant effects has been hailed as the most important discovery in 50 years of depression research [1, 2]. Ketamine is a high trapping, non-competitive NMDA receptor antagonist [3] and has been associated with rapid and reliable antidepressant effects in patients with treatment-resistant major depressive disorder (MDD), which emerge within hours and last for a number of days [4, 5]. Although these important clinical effects were discovered around two decades ago [6], an accepted mechanistic account of the antidepressant action of ketamine in depressed patients remains elusive. Recently, it was reported that a direct injection of ketamine into the lateral habenula (lHb) suppressed the burst activity of neurons in this region and produced a behavioural antidepressant effect in a rodent model of depression. This suggests that NMDA receptors play a role in the burst firing of the lHb neurons and raises the interesting possibility that ketamine might alleviate symptoms of depression via direct action on the lHb [7]. Ketamine is particularly shown to be effective in reducing anhedonia [5, 8].
Key work on understanding ketamine’s antidepressant drug actions have focused more on cellular mechanisms (for a review see [3]). In this Perspective article, we take a translational cognitive neuroscience approach and consider the potential mechanisms of ketamine across neural circuits important for reward and punishment. We present an overview of emerging evidence from rodent, monkey and human neuroimaging studies which cumulatively demonstrate how ketamine modulates the expression of anhedonic behaviours across habenula and fronto-mesolimbic networks.
Fractionating anhedonia
Anhedonia is typically defined as diminished interest or pleasure in activities that people have previously enjoyed. Anhedonia can be operationalised into a set of subcomponents which recognise that anhedonic behaviour could be the result of dysfunction in several distinct processes, including the anticipation of, consumption of and learning about reward [9, 10]. These subcomponents serve as quantifiable outcome measures in preclinical and experimental medicine studies. Some of these outcome measures can be expressed by mathematical models of choice behaviour. Previous studies using computational modelling of human behaviour in signal detection tasks have dissociated differences between reward learning versus reward sensitivity (i.e., learning about rewards compared to the subjective value of the reward). Dysfunction in each one of these processes could contribute to anhedonia and related behaviours. In signal detection tasks, anhedonia in patients with depression has been shown to be selectively related to a reduction in sensitivity to reward (i.e. a reduction in the consummatory pleasure of reward) rather than differences in learning about reward from feedback [11, 12].
Distinct contributions of reward learning, anticipation, and consumption to the behavioural expression of anhedonia are also demonstrated in neuroimaging studies of reward processing. Auerbach et al. argued that different neuroimaging modalities may be optimised to study different subcomponents of anhedonia, recommending event-related functional magnetic resonance imaging (fMRI) of the basal ganglia for studying reward anticipation, and resting-state connectivity for reward consummation [9]. Although a previous large-scale study comparing 421 MDD patients with 488 healthy volunteers identified decreased resting-state functional connectivity with increasing depressive symptom severity in the medial orbitofrontal cortex [13], an area the authors highlighted as being commonly associated with reward processing, the degree to which this measure correlates with external measures of reward consummation remains an open question.
One important element of reward learning is the signalling of reward-prediction error (RPE), defined as the difference between an individual’s expectations about the outcome of an event and the actual outcome of said event (also see Box 1). Ground-breaking neurophysiology work in monkeys showed that activity in the ventral tegmental area (vTA) and the substantia nigra (i.e., structures within the basal ganglia) convey RPEs [14]. This role for basal ganglia structures has since been demonstrated in humans [15, 16] and has been identified as a neural signature of depression. For example, early fMRI studies in patients with MDD identified blunted temporal difference reward-learning signals in the ventral striatum (vSTR) [17], and diminished expected-reward value encoding in the amygdala-hippocampal complex [18]. A recent study [19] using the monetary incentive delay task further demonstrated the role of basal ganglia structures by showing that reward anticipation signals encoded in the nucleus accumbens were blunted over time. This signal was shown to be modulated by an interaction between anhedonia and response to positive mood induction, in a manner consistent with theoretical work suggesting that positive mood should reduce the magnitude of RPE by heightening subjective reward expectations [20]. Taken together, these studies demonstrate that basal ganglia structures encode RPEs during reinforcement learning (RL) and suggest a role for the basal ganglia in expression of anhedonia (for a more detailed consideration of clinical neuroimaging studies of reward processing see [21]. We think fractionating anhedonia into subcomponents will be important for understanding the specificity of treatment effects and ultimately improving the precision of treatment recommendations.
Interactions between habenula and dopamine neurons in reward and punishment processing
The habenula works in tandem with midbrain dopaminergic neurons during the processing of reward and punishment, particularly signalling prediction errors that arise when reward expectations are violated (i.e. the negative PEs) [22]. In monkeys, electrophysiological recordings have highlighted that the phasic activity of lHb neurons encodes information PEs (i.e., those arising from systematic manipulation of expected uncertainty (see Box 1), which determines the information content of outcomes during RL [23]) to a considerably greater extent than negative RPEs encoded in this region [24]. These studies suggest a role for the lHb as an “effective critic” of value-based decision-making through its reciprocal connections with the dopaminergic midbrain [24].
The habenula is an evolutionarily conserved brain structure which receives a broad range of inhibitory and excitatory inputs while providing a relatively small range of outputs [25, 26]. Areas projecting to the lateral aspect of the habenula (i.e., the lHb) contribute to a wide range of functions related to depression including circadian rhythm [27]; sleep; expression of negative emotions; stress and threat signalling; and motivated/appetitive behaviours [28,29,30]. Habenula and dopamine neurons undergo similar changes in firing rate pattern during the ramping up of tonic activity, and in their phasic responses at the initiation of reward-seeking behaviours [31]. While the tonic signal is thought to preferentially encode rewards, the phasic response occurs at the onset of both rewarding and punishing trials, reflecting the sensitivity of habenula and midbrain dopaminergic neurons to outcome salience irrespective of outcome valence [31]. This suggests that habenula and midbrain dopamine neurons and their connectivity are important for value and salience signalling, both of which may be relevant to anhedonia.
An important excitatory input to the lHb is from the parvalbumin-positive neuronal populations [32] in the dopaminergic ventral pallidum (vP), which modulates analogues of depressive behaviours in rodent models [33]. Glutamatergic projections from the lHb to the rostromedial tegmental nucleus (rmTN) within the dopaminergic vTA have been shown to influence appetitive motivation and willingness to exert effort for reward in rodents [34]. Here, it is worthwhile to highlight that some of these studies [33, 34] rely on behavioural despair models (behavioural despair is defined as “a depression-like phenotype that reflects the feeling that nothing will improve” [34]) and assess anhedonic response in the forced swim test (FST), which is a commonly used outcome measure in preclinical research for antidepressant compounds [35]. Stimulation of the lHb to rmTN pathway is also shown to promote both active and passive types of behavioural avoidance in mice [36]. Neuroanatomic segmentation of the vTA identifies the rmTN at the tail end of this region, hosting populations of inhibitory GABAergic neurons which act as the “master brake” for the dopaminergic midbrain [37]. Optogenetic studies in rats have enabled the precise manipulation of the lHB to rmTN pathway that controls behaviours associated with anhedonia and a depressive phenotype. Stimulation of this pathway results in reduced mobility in the FST, and a reduction in the effort rats are willing to exert for reward [34]. The connections highlighted in this section are shown on a schematic diagram in Fig. 1.
In this schematic we collate evidence from key human [46, 50, 59, 83, 88, 89], monkey [14, 22, 24, 25, 31, 91] and rodent [7, 33, 34, 36, 37, 40] studies (depicted by angled lines on the surface of the coloured nodes with colours referring to different task components). The literature highlights the habenula as a key node in negative information processing, primarily through its direct and indirect (via MRN/DRN) connections to dopaminergic release sites as well as through input from mPFC. The majority of subcortical connections are based on findings from rodent studies (magenta coloured pathways) with some contributions from monkey neurophysiology (cyan coloured pathways). The direction of the connection between areas is represented by the location of the marker (e.g. on the line connecting the vP and the Hb, the circle is joined to the Hb to reflect that the vP provides input to the Hb). For clarity, a number of inputs to the habenula have been omitted (described in rodents and reviewed elsewhere [29]). Based on existing evidence in the literature, reversal learning tasks administered to human participants should probe the dACC and LC (key regions encoding environmental volatility); vTA, vSTR and Hb (key regions encoding positive and negative prediction errors) and regions of the mPFC encoding decision values and can be instrumental in understanding the effect of ketamine on subcomponents of anhedonia such as learning and reward processing.
The habenula as a relay station regulating behavioural analogues of affective symptoms
Animal studies have highlighted a possible function for the habenula as a relay station orchestrating signals from both dopaminergic and serotonergic regions that regulate affective response during reward-guided interactions with the physical environment. Two connections that are important for this function are the lHb inputs to the dorsal raphe nucleus (dRN) and the medial raphe nucleus (mRN) [25, 29]. The dRN receives both direct (glutamatergic) and indirect (via rmTN) projections from the lHb, this indirect pathway provides feedforward inhibition to serotonergic (5-HT) neurons. A previous rodent study showed that glutamate signalling contributes to the learning of reward associations [38], this is compatible with recent learning theories which posit that receipt of unexpected rewards during learning should uplift mood and attenuate the inhibitory influence of the lHb on the dRN via the direct pathway [29]. The dRN pathway would therefore be a candidate route via which the lHb neurons can integrate affective information (e.g., emotional response to reward outcomes) into reward processing [39], with causal implications for depressed and elated mood through aberrant learning from negative RPEs [20]. It is possible that impairments in learning about the reward contingencies in the environment which would lead to heightened negative RPE signalling in the habenula may be depressogenic. Intriguingly, both the lHb and dRN receive projections from the mPFC and optogenetic manipulation of these projections reduced mobility in the FST, highlighting a potential mechanism for depressive behaviours embedded in this circuitry [40]. These studies also suggest a translational hypothesis for future neuroimaging studies in which the activity of the human habenula might be parametrically modulated by momentary mood responses to reward outcomes received from the environment [41]. The connections highlighted in this section are also shown in Fig. 1.
The role of the habenula in anhedonia and depression
Thus far, we have reviewed evidence that describes the habenula’s role in the processing of reward and punishment as well as its contribution to analogues of depressive behaviours, mostly based on findings from rodent studies [7, 34, 42,43,44]. One intriguing finding from the rodent literature demonstrated that acute stress can reverse the activity of reward selective neurons in the lHb [42], such that they respond to rewards as though they were punishing. Considering the evolutionarily preserved role of the habenula across species, this reversal in activity which is accompanied by an increase in the magnitude of responses to reward omission, provides a candidate mechanism through which exposure to stress can trigger the onset of anhedonic behaviours. Whilst the human habenula’s functional connectivity with midbrain dopaminergic areas such as the vTA, as well as serotonergic areas such as the dRN [45] and general anatomical parcellation are well-characterised [42,43,44], relatively few studies have assessed whether the habenula makes a similar contribution to depressive symptoms in humans.
A few studies investigating the habenula’s function in humans have provided complementary evidence to findings shown in rodent models. One of these human studies used a Pavlovian conditioning paradigm in which abstract fractals were probabilistically associated with different types of outcomes: electroshocks or monetary [win/loss] outcomes [46]. In healthy controls, the habenula responded positively to the expected values of cues predicting a shock but did not encode the expected value of monetary outcomes. These findings are in line with an earlier monkey neurophysiology study which suggested that during Pavlovian conditioning the habenula is most responsive to the expected value of the worst outcome in the environment (e.g. least rewarding or frequently punishing outcomes) [47].
In a follow-up study, the same group of authors reported that this habenula response is sensitive to severity of self-reported depressive symptoms in nonclinical volunteers, such that a subset of participants who reported higher depressive symptoms did not show a positive habenula response to shocks. Furthermore, a group of patients with MDD showed a deactivation in the same region, leading to significant differences between patients and non-depressed healthy volunteers [48]. Interestingly, this deactivation to shock-predicting cues in people with depression is similar to response of the habenula to rewards in healthy volunteers who participated in the same group of author’s previous work [46], as well as in monkeys [14]. Notably, the habenula response seen in healthy controls was positively correlated with a measure of conditioned suppression (in this case, slower response times in a flicker detection task completed on shock trials, which were also correlated with pupil dilation) [46]. The authors suggest the positive habenula response to shocks seen in control participants may potentiate avoidance behaviour; the negative response observed in MDD patients therefore may result in a loss of the capacity for active avoidance [48]. Overall, these studies outline a dual function for the habenula in the anticipation and experience of rewards and punishments relevant for understanding cognitive processes that are impaired by symptoms of MDD.
Consistent with the findings from monkey neurophysiology studies [24, 49], functional neuroimaging suggests that the human habenula also encodes the expected values of negative outcomes during value-based decision-making. For example, the habenula has been shown to respond to monetary loss outcomes to a larger extent than win outcomes, and this activity is parametrically modulated by the probability of losses [50]. Interestingly this pattern of activity was not seen in patients with MDD, suggesting that depression may impair a core function of the habenula that is, encoding the expected value of negative outcomes. Taken together, these studies demonstrate that the habenula functions similarly in humans and laboratory animals during reward and punishment processing and how this function is influenced by symptoms of depression.
Loci of ketamine’s antidepressant action
In this section, we will highlight the role of brain regions that play a key role in ketamine’s anti-anhedonic effects in preclinical models. As we highlighted in the introduction, the lHb has recently been identified as an important site for ketamine’s rapid antidepressant action [5]. Optogenetic stimulation of lHb neurons results in a burst firing pattern that underlies an expression of anhedonia and behavioural despair in rodents [7]. This study also demonstrated that ketamine attenuates the burst firing of habenula neurons, in turn relieving rodent analogues of depressive symptoms.
The other key region involved in ketamine’s antidepressant action in rodent models of depression is the medial prefrontal cortex (mPFC), which notably projects to the habenula [40], and involves an increase in the extracellular excitatory glutamate levels in this area [51]. Chronic unpredictable stress, which is an experimental model of depression in rodents, is linked with reduction in the number of synapses in mPFC [52], and subsequent studies have shown that ketamine infusion can reverse synaptic deficits and an anhedonic behavioural response caused by chronic exposure to stress [53]. Recent advances in positron emission tomography may be useful in translating these studies, to aid the identification of similar changes in mPFC synaptic architecture in MDD patients who are treated with ketamine infusion relative to their baseline (i.e. before treatment onset in a within-subject design) [54].
Another rodent study demonstrating a role for the mPFC used an affective bias model of depression and compared ketamine with a traditional antidepressant (venlafaxine) [55]. Negative bias was induced by administering a benzodiazepine receptor inverse agonist (acting as an acute stressor in this rodent model) prior to one-shot learning via which rodents encoded the subjective value of primary rewards (i.e., a food substrate). Although one might argue that negative bias models of depression relate to depressive symptoms which are distinct from anhedonic behaviours, the experimental approach used by this study relied on probing components of cognition such as reward processing and learning which also relate to anhedonic behaviours. Stuart et al. showed that rapid and delayed action antidepressants act via distinct neuropsychological mechanisms [55], and demonstrated that ketamine engages the mPFC in rapidly mitigating the effect of negative bias associated with a reward substrate conditioned under an acute stressor.
The evidence reviewed in this section highlights that ketamine acts on both cortical and subcortical structures, specifically the mPFC and lHb. Action at both of these sites rapidly mitigates analogues of depressive symptoms in rodent models.
NMDA modulation of habenula neurons might be orchestrating the expression of anhedonia across fronto-mesolimbic and fronto-striatal networks
So far, we have synthesised findings from key animal experimental and neuroimaging studies of the habenula which have focused on its functional relationship with striatal dopaminergic regions during RL, and with other key neurotransmitter hubs within the mesolimbic network [56] involved in the expression of anhedonic and other depressive behaviours [40]. We have highlighted that optogenetic manipulation of axons projecting from the mPFC to the lHb regulate analogues of depressive behaviours in rodents [40]. Furthermore, we have explained how direct infusion of ketamine to the lHb [7] or the mPFC [52] can attenuate analogues of depressive symptoms. These findings suggest a potential mechanism through which mPFC may regulate subcortical contributors to the depression phenotype. In this section, we will expand on the role of the mPFC and highlight findings from monkey neurophysiology and human neuroimaging studies. Here, it is worthwhile to highlight the importance of drawing inferences from monkey neurophysiology as a bridge between the preclinical studies that we reviewed in the preceding sections and human neuroimaging [57]. Although the functional neuroanatomy of habenula is assumed to be preserved across species, more translational work is needed to understand the degree to which prefrontal cortex maintains a similar functional homology across rodents, monkeys and humans.
The use of multivariate approaches in human fMRI has enabled measurement of the heterogeneity in responses within the mPFC, the ventral sections of the mPFC in particular are consistently implicated in the relative coding of value in humans [58, 59] and monkeys [49]. Several human neuroimaging studies suggest that using prosocial incentives can effectively probe regions of the vmPFC during social learning [60] and decision-making [61]. For example, a recent study suggested that depressive symptoms correlate with a heightened neural response to negative delayed social incentives in the subgenual [cingulate] cortex [62] (sgACC, Brodmann area (BA) 25 [63], as well as the posterior-most section of the mPFC overlapping with BA32 and parts of BA24 and BA33 residing below the genu i.e. “the knee” of the corpus callosum). It is noted that these cortical areas may not be structurally [64] or functionally homologous in the context of reward processing [65].
The sgACC, broadly defined based on the Brodmann areas highlighted above, has been consistently implicated in the aetiology of depression [66,67,68,69], prediction of its recurrence [70] and treatment [71]. Ketamine’s antidepressant effect has been shown to depend on the expression of the neuropeptide precursor VGF in the sgACC, which is significantly depleted in patients with MDD, as demonstrated by post-mortem examinations, while its overexpression is shown to promote resilience to chronic stressors in a mouse model [72]. In lower primates such as marmoset monkeys, experimentally induced overactivity of the sgACC is associated with an anhedonic anticipatory response to primary rewards and reduced willingness to exert effort to obtain reward, both of which can be ameliorated by acute peripheral administration of ketamine but not by traditional antidepressants such as citalopram [73]. Remarkably, across species, the role of sgACC appears to be selectively related to anticipatory responses, rather than consummatory responses. Using decision-making probes of the sgACC in human neuroimaging studies of participants who have received a ketamine infusion may be a useful avenue in testing a translational prediction that ketamine should modulate reward processing impaired by anhedonia, such as value encoding.
Human neuroimaging studies also demonstrated ketamine related changes in functional connectivity across the fronto-striatal circuitry. For example, a recent study showed increased functional connectivity between the caudate and PFC regions (dlPFC and vlPFC), and between the putamen and PFC regions (more specifically perigenual ACC and the orbitofrontal cortex) in treatment-resistant depression (TRD) patients [74]. However, in healthy volunteers, ketamine reversed these functional connectivity effects. In addition, there is preliminary evidence to suggest that improvements in depressive symptoms following ketamine infusion in patients with MDD correlates with an increased functional connectivity between the right habenula and right frontal pole [75].
Taken together, the studies that we reviewed in this section suggest that reward circuitry distributed across fronto-mesolimbic and fronto-striatal networks are important in understanding ketamine’s antidepressant effect. A comprehensive understanding of ketamine’s action would require studying its direct effects on reward processing (e.g., in healthy volunteers) as well as its indirect effects on reward processing through an action on anhedonia (e.g., in patients with depression). However, there are not any studies to our knowledge, which have investigated the direct effects of ketamine on reward processing in humans or monkeys. On the other hand, only a limited number of rodent studies have investigated how ketamine modulates reward processing within a temporal discounting framework. This is a relevant domain as previous literature suggests that temporal discounting decisions are also impaired in patients with depression [76, 77] who display a higher discounting tendency associated with a preference for immediately available rewards. However in rodents, a clinically relevant sub-anaesthetic dose of ketamine (5 mg/kg) did not reveal consistent results, one study suggesting an increase in discounting [78], whereas a more recent study did not show any significant changes in discounting behaviour at this dosage [79]. This limited literature suggests that more studies are needed across all species to be able to decompose ketamine’s direct and indirect effects on reward processing.
Experimental probes and future directions
In this Perspective article, we have reviewed evidence showing that habenula and midbrain dopaminergic neurons work closely in signalling reward and punishment PEs during tasks performed over a single testing session (i.e., technically described as “online learning” [80]), and that these neural processes are likely to be involved in the neural basis of anhedonia. We have also highlighted evidence that ketamine produces its antidepressant effects through action on the habenular complex and fronto-mesolimbic networks. As we are entering the third decade of human research into the antidepressant effects of NMDA receptor antagonists, future studies could harness existing computational modelling, functional connectivity, novel neuroimaging analysis methods and imaging modalities with higher spatial and temporal resolution (e.g., scan human participants at higher magnetic field strengths [81, 82] or with MEG and pupillometry). So far only a small number of studies have focused on the human habenula in healthy and clinical populations. The current state of the human neuroimaging literature does not reflect the richness of animal models describing the mechanisms of antidepressant action across the habenular complex and fronto-mesolimbic circuitry (Fig. 1). In this section we will elaborate on experimental methods that can probe these systems in human participants and help bridge this translational gap.
Although both Pavlovian and instrumental conditioning frameworks have been adopted by previous studies, RL tasks involving reversals in which the better option in the environment changes at time points unknown to the participant, might be the best experimental approach to probe these systems. Continuous reversals in the task environment mean that the participant will continue experiencing PEs over the course of the experiment, resulting in continuous engagement of the subcortical structures implicated in RL. In RL terminology, unexpected uncertainty arising from dynamically changing outcome contingencies is known as volatility [83] (Box 1). It is important to highlight that awareness of environmental volatility relates to a second-order statistic which builds up as participants experience more trials during the task (i.e., learning about the structure of the environment). Seminal work in this area conducted by Behrens and colleagues demonstrated that dorsal anterior cingulate cortex (dACC) encodes volatility signals (i.e., the rate with which the environment changes) during reversal learning of non-social and social reward contingencies [83, 84]. In similar reward-learning tasks, regions of the dACC are also shown to encode trialwise PE estimates in both humans [85] and monkeys [49]. Recent electrophysiology work conducted in patients with pharmacologically intractable epilepsy has extended these findings, suggesting that neuronal populations within the dACC respond more selectively to punishments than to rewards [86]. This is in line with previous findings from monkey electrophysiology revealing similar results that, demonstrate the within-trial temporal frame of the neural activity across dACC and habenula neurons, where the latter are shown to be quicker to respond to omission of reward [87]. Across trials, the dACC is shown to be engaged with the running history of outcomes, consistent with its role in encoding volatility signals, whereas habenula neurons were involved more strongly in guiding behavioural adjustments in response to the outcome of the current trial [87], potentially indicating a role in the implementation of learning rates. Therefore, it is highly likely that adjustment of learning rates to environmental volatility/uncertainty, a key cognitive process with survival value across the animal kingdom, might be implemented across dACC and habenula neurons. Considering that the habenula has influence over the neurotransmitter hubs of the mesolimbic pathway and these projections extend to prefrontal cortex and the dACC, it is possible that a complete feedback loop exists between the fronto-mesolimbic pathway and the habenula, though this proposed link remains to be explored by future human and monkey neurophysiology studies. This feedback loop may rely on key cognitive processes implemented across dACC and habenula neurons and is likely to be critical for understanding the neurobiology of depression (Box 2).
In human physiology studies, the activity of the pupil-linked arousal systems is shown to encode the volatility of punishment contingencies [88] more strongly than the volatility of reward contingencies [89]. A number of previous studies have demonstrated that changes in pupil dilation relate to the activity of central norepinephrine neurons [90, 91]. Both the dACC and the central norepinephrine system are also important target sites for understanding ketamine’s antidepressant action. For example, previous work suggests that the degree of ketamine’s anti-anhedonic effect correlates with increased glucose metabolism in the dACC [8] and the habenula, a key subcortical target of injection for ketamine’s antidepressant effects in rodents, is also known to exert control over the locus coeruleus and the central norepinephrine system [26, 92, 93]. These areas may also be linked in terms of the cognitive processes they encode during reward learning. For example, the central norepinephrine system (as indexed by pupil dilation) responds to informative/volatile outcomes in the environment and the habenula encodes information PEs in monkeys. An additional strength of probing these systems with reversal learning tasks is that the computational neuroscience literature holds a number of well-established [83] and emerging [23, 85, 94] mathematical models (i.e., Bayesian and RL) which can be instrumental in revealing precisely how ketamine modulates computational processes such as learning rates and reward/punishment sensitivity across the habenular complex and the fronto-mesolimbic networks (Fig. 1). Indications from the rodent pharmacology studies are that ketamine may affect sensitivity to reward magnitude, plausibly reflecting one of the aforementioned components of anhedonia.
We believe that converging evidence presented in this manuscript from existing rodent and monkey models warrants an interdisciplinary approach for the next decade of human research, designed to translate these insights into tractable models of rapid antidepressant action in depressed patients and thereby produce a radical improvement in the management of clinical mood disorders. Moreover, given anhedonia is a transdiagnostic feature of many psychiatric and neurological conditions, the translational approach we outlined here may have utility for clinical improvement in all neuropsychiatric conditions in which anhedonia manifests.
References
Hyman SE. Revitalizing psychiatric therapeutics. Neuropsychopharmacology. 2014;39:220–9.
Duman RS, Aghajanian GK. Synaptic dysfunction in depression: potential therapeutic targets. Science. 2012;338:68–72.
Duman RS, Shinohara R, Fogaça MV, Hare, B. Neurobiology of rapid-acting antidepressants: convergent effects on GluA1-synaptic function. Mol Psychiatry. 2019;24:1816–32.
Coyle CM, Laws KR. The use of ketamine as an antidepressant: a systematic review and meta‐analysis. Hum Psychopharmacol Clin Exp. 2015;30:152–63.
Lally N, et al. Neural correlates of change in major depressive disorder anhedonia following open-label ketamine. J Psychopharmacol. 2015;29:596–607.
Berman RM, et al. Antidepressant effects of ketamine in depressed patients. Biol Psychiatry. 2000;47:351–4.
Yang Y, et al. Ketamine blocks bursting in the lateral habenula to rapidly relieve depression. Nature. 2018;554:317–22.
Lally N, et al. Anti-anhedonic effect of ketamine and its neural correlates in treatment-resistant bipolar depression. Transl Psychiatry. 2014;4:e469–e469.
Auerbach RP, Pagliaccio D, Pizzagalli DA. Toward an improved understanding of anhedonia. JAMA Psychiatry. 2019;76:571–3.
Husain M, Roiser JP. Neuroscience of apathy and anhedonia: a transdiagnostic approach. Nat Rev Neurosci. 2018;19:470–84.
Huys QJ, Pizzagalli DA, Bogdan R, Dayan P. Mapping anhedonia onto reinforcement learning: a behavioural meta-analysis. Biol Mood Anxiety Disord. 2013;3:1.
Huys QJ, Daw ND, Dayan P. Depression: a decision-theoretic analysis. Ann Rev Neurosci. 2015;38:1–23.
Cheng W, et al. Medial reward and lateral non-reward orbitofrontal cortex circuits change in opposite directions in depression. Brain. 2016;139:3296–309.
Schultz W, Dayan P, Montague PR. A neural substrate of prediction and reward. Science. 1997;275:1593–9.
Pagnoni G, Zink CF, Montague PR, Berns GS. Activity in human ventral striatum locked to errors of reward prediction. Nat Neurosci. 2002;5:97–98.
Pessiglione M, Seymour B, Flandin G, Dolan RJ, Frith CD. Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature. 2006;442:1042–5.
Kumar P, et al. Abnormal temporal difference reward-learning signals in major depression. Brain. 2008;131:2084–93.
Gradin VB, et al. Expected value and prediction error abnormalities in depression and schizophrenia. Brain. 2011;134:1751–64.
Green IW, Pizzagalli DA, Admon R, Kumar P. Anhedonia modulates the effects of positive mood induction on reward-related brain activation. NeuroImage. 2019;193:115–25.
Eldar E, Rutledge RB, Dolan RJ, Niv Y. Mood as representation of momentum. Trends Cogn Sci. 2016;20:15–24.
Halahakoon DC, Kieslich K, O’Driscoll C, Nair A, Lewis G, Roiser JP. Reward-processing behavior in depressed participants relative to healthy volunteers: A Systematic Review and Meta-analysis. JAMA Psychiatry, 2020:77;1286–95.
Matsumoto M, Hikosaka O. Lateral habenula as a source of negative reward signals in dopamine neurons. Nature. 2007;447:1111–5.
Pulcu E, Browning M. The misestimation of uncertainty in affective disorders. Trends Cognitive Sci. 2019;23:865–75.
Bromberg-Martin ES, Hikosaka O. Lateral habenula neurons signal errors in the prediction of reward information. Nat Neurosci. 2011;14:1209–16.
Hikosaka O, Sesack SR, Lecourtier L, Shepard PD. Habenula: crossroad between the basal ganglia and the limbic system. J Neurosci. 2008;28:11825–9.
Lecourtier L, Kelly PH. A conductor hidden in the orchestra? Role of the habenular complex in monoamine transmission and cognition. Neurosci Biobehav Rev. 2007;31:658–72.
Zahm DS, Root DH. Review of the cytology and connections of the lateral habenula, an avatar of adaptive behaving. Pharmacol Biochem Behav. 2017;162:3–21.
Stamatakis AM, et al. Lateral hypothalamic area glutamatergic neurons and their projections to the lateral habenula regulate feeding and reward. J Neurosci. 2016;36:302–11.
Yang Y, Wang H, Hu J, Hu H. Lateral habenula in the pathophysiology of depression. Curr Opin Neurobiol. 2018;48:90–96.
Kaye A, Ross DA. The habenula: darkness, disappointment, and depression. Biol psychiatry. 2017;81:e27–8.
Bromberg-Martin ES, Matsumoto M, Hikosaka O. Distinct tonic and phasic anticipatory activity in lateral habenula and dopamine neurons. Neuron. 2010;67:144–55.
Watson C, Paxinos G, Puelles, L. The mouse nervous system. Academic Press; London, UK; 2012.
Knowland D, et al. Distinct ventral pallidal neural populations mediate separate symptoms of depression. Cell. 2017;170:284–97. e218.
Proulx CD, et al. A neural pathway controlling motivation to exert effort. Proc Natl Acad Sci USA. 2018;115:5792–7.
Unal G, Canbeyli R. Psychomotor retardation in depression: a critical measure of the forced swim test. Behav Brain Res. 2019;372:112047.
Stamatakis AM, Stuber GD. Activation of lateral habenula inputs to the ventral midbrain promotes behavioral avoidance. Nat Neurosci. 2012;15:1105–7.
Bourdy R, Barrot M. A new control center for dopaminergic systems: pulling the VTA by the tail. Trends Neurosci. 2012;35:681–90.
Liu Z, et al. Dorsal raphe neurons signal reward through 5-HT and glutamate. Neuron. 2014;81:1360–74.
Proulx CD, Hikosaka O, Malinow R. Reward processing by the lateral habenula in normal and depressive behaviors. Nat Neurosci. 2014;17:1146–52.
Warden MR, et al. A prefrontal cortex–brainstem neuronal projection that controls response to behavioural challenge. Nature. 2012;492:428–32.
Rutledge RB, et al. Association of neural and emotional impacts of reward prediction errors with major depression. JAMA Psychiatry. 2017;74:790–7.
Shabel SJ, Wang C, Monk B, Aronson S, Malinow R. Stress transforms lateral habenula reward responses into punishment signals. Proc Natl Acad Sci USA. 2019;116:12488–93.
Hu H, Cui Y, Yang Y. Circuits and functions of the lateral habenula in health and in disease. Nat Rev Neurosci. 2020;21:277–95.
Aleksandrova LR, Wang YT, Phillips AG. Ketamine and its metabolite,(2R, 6R)-HNK, restore hippocampal LTP and long-term spatial memory in the Wistar-Kyoto rat model of depression. Mol Brain. 2020;13:1–16.
Torrisi S, et al. Resting state connectivity of the human habenula at ultra-high field. Neuroimage. 2017;147:872–9.
Lawson RP, et al. The habenula encodes negative motivational value associated with primary punishment in humans. Proc Natl Acad Sci USA. 2014;111:11858–63.
Matsumoto M, Hikosaka O. Representation of negative motivational value in the primate lateral habenula. Nat Neurosci. 2009;12:77.
Lawson R, et al. Disrupted habenula function in major depression. Mol Psychiatry. 2017;22:202–8.
Matsumoto M, Matsumoto K, Abe H, Tanaka K. Medial prefrontal cell activity signaling prediction errors of action values. Nat Neurosci. 2007;10:647–56.
Furman DJ, Gotlib IH. Habenula responses to potential and actual loss in major depression: preliminary evidence for lateralized dysfunction. Soc Cogn Affect Neurosci. 2016;11:843–51.
Moghaddam B, Adams B, Verma A, Daly D. Activation of glutamatergic neurotransmission by ketamine: a novel step in the pathway from NMDA receptor blockade to dopaminergic and cognitive disruptions associated with the prefrontal cortex. J Neurosci. 1997;17:2921–7.
Liu R-J, Aghajanian GK. Stress blunts serotonin-and hypocretin-evoked EPSCs in prefrontal cortex: role of corticosterone-mediated apical dendritic atrophy. Proc Natl Acad Sci USA. 2008;105:359–64.
Li N, et al. Glutamate N-methyl-D-aspartate receptor antagonists rapidly reverse behavioral and synaptic deficits caused by chronic stress exposure. Biol Psychiatry. 2011;69:754–61.
Finnema SJ, et al. Imaging synaptic density in the living human brain. Sci Transl Med. 2016;8:348ra396–348ra396.
Stuart SA, Butler P, Munafò MR, Nutt DJ, Robinson ES. Distinct neuropsychological mechanisms may explain delayed-versus rapid-onset antidepressant efficacy. Neuropsychopharmacology. 2015;40:2165–74.
Nakamura K. The role of the dorsal raphé nucleus in reward-seeking behavior. Front Integr Neurosci. 2013;7:60.
Roberts AC, Clarke HF. Why we need nonhuman primates to study the role of ventromedial prefrontal cortex in the regulation of threat-and reward-elicited responses. Proc Natl Acad Sci USA. 2019;116:26297–304.
Nieuwenhuis S, Slagter HA, Von Geusau NJA, Heslenfeld DJ, Holroyd CB. Knowing good from bad: differential activation of human cortical areas by positive and negative outcomes. Eur J Neurosci. 2005;21:3161–8.
Knutson B, Taylor J, Kaufman M, Peterson R, Glover G. Distributed neural representation of expected value. J Neurosci. 2005;25:4806–12.
Lockwood PL, Apps MA, Valton V, Viding E, Roiser JP. Neurocomputational mechanisms of prosocial learning and links to empathy. Proc Natl Acad Sci USA. 2016;113:9763–8.
Pulcu E, et al. Enhanced subgenual cingulate response to altruistic decisions in remitted major depressive disorder. NeuroImage Clin. 2014;4:701–10.
He Z, Zhang D, Muhlert N, Elliott R. Neural substrates for anticipation and consumption of social and monetary incentives in depression. Soc Cogn Affect Neurosci. 2019;14:815–26.
Judaš M, Cepanec M, Sedmak G. Brodmann’s map of the human cerebral cortex—or Brodmann’s maps? Transl Neurosci. 2012;3:67–74.
Öngür D, Ferry AT, Price JL. Architectonic subdivision of the human orbital and medial prefrontal cortex. J Comp Neurol. 2003;460:425–49.
Nicolle A, et al. An agent independent axis for executed and modeled choice in medial prefrontal cortex. Neuron. 2012;75:1114–21.
Mayberg HS, et al. Cingulate function in depression: a potential predictor of treatment response. Neuroreport. 1997;8:1057–61.
Drevets WC, Ongur D, Price JL. Neuroimaging abnormalities in the subgenual prefrontal cortex: implications for the pathophysiology of familial mood disorders. Mol Psychiatry. 1998;3:220–6.
Drevets WC, et al. Subgenual prefrontal cortex abnormalities in mood disorders. Nature. 1997;386:824–7.
Drevets WC, Savitz J. The subgenual anterior cingulate cortex in mood disorders. CNS Spectr. 2008;13:663–81.
Lythe KE, et al. Self-blame–selective hyperconnectivity between anterior temporal and subgenual cortices and prediction of recurrent depressive episodes. JAMA Psychiatry. 2015;72:1119–26.
Mayberg HS, et al. Deep brain stimulation for treatment-resistant depression. Neuron. 2005;45:651–60.
Jiang C, et al. VGF and its C-terminal peptide TLQP-62 in ventromedial prefrontal cortex regulate depression-related behaviors and the response to ketamine. Neuropsychopharmacology. 2019;44:971–81.
Alexander L, et al. Fractionating blunted reward processing characteristic of anhedonia by over-activating primate subgenual anterior cingulate cortex. Neuron. 2019;101:307–20.e306.
Mkrtchian A, et al. Ketamine modulates fronto-striatal circuitry in depressed and healthy individuals. Mol Psychiatry. 2020:1–10.
Rivas-Grajales AM, et al. Habenula connectivity and intravenous ketamine in treatment-resistant depression. Int J Neuropsychopharmacol. 2021;24:383–91.
Dombrovski AY, et al. Lethal forethought: delayed reward discounting differentiates high-and low-lethality suicide attempts in old age. Biol Psychiatry. 2011;70:138–44.
Pulcu E, et al. Temporal discounting in major depressive disorder. Psychol Med. 2014;44:1825–34.
Floresco SB, Maric T, Ghods-Sharifi S. Dopaminergic and glutamatergic regulation of effort-and delay-based decision making. Neuropsychopharmacology. 2008;33:1966–79.
Yates JR, Gunkel BT, Rogers KK, Hughes MN, Prior NA. Effects of N-methyl-D-aspartate receptor ligands on sensitivity to reinforcer magnitude and delayed reinforcement in a delay-discounting procedure. Psychopharmacology. 2017;234:461–73.
Si J, Wang Y-T. Online learning control by association and reinforcement. IEEE Trans Neural Netw. 2001;12:264–76.
Sladky R, et al. High-resolution functional MRI of the human amygdala at 7 T. Eur J Radiol. 2013;82:728–33.
Strotmann B, et al. High‐resolution MRI and diffusion‐weighted imaging of the human habenula at 7 tesla. J Magn Reson Imaging. 2014;39:1018–26.
Behrens TE, Woolrich MW, Walton ME, Rushworth MF. Learning the value of information in an uncertain world. Nat Neurosci. 2007;10:1214–21.
Behrens TE, Hunt LT, Woolrich MW, Rushworth MF. Associative learning of social value. Nature. 2008;456:245–9.
Findling C, Skvortsova V, Dromnelle R, Palminteri S, Wyart V. Computational noise in reward-guided learning drives behavioral variability in volatile environments. Nat Neurosci. 2019;22:2066–77.
Gazit T, et al. The role of mPFC and MTL neurons in human choice under goal-conflict. Nat Commun. 2020;11:1–12.
Kawai T, Yamada H, Sato N, Takada M, Matsumoto M. Roles of the lateral habenula and anterior cingulate cortex in negative outcome monitoring and behavioral adjustment in nonhuman primates. Neuron. 2015;88:792–804.
Browning M, Behrens TE, Jocham G, O’Reilly JX, Bishop SJ. Anxious individuals have difficulty learning the causal statistics of aversive environments. Nat Neurosci. 2015;18:590–96.
Pulcu E, Browning M. Affective bias as a rational response to the statistics of rewards and punishments. Elife. 2017;6:e27879.
Aston-Jones G, Cohen JD. An integrative theory of locus coeruleus-norepinephrine function: adaptive gain and optimal performance. Annu Rev Neurosci 2005;28:403–50.
Joshi S, Li Y, Kalwani RM, Gold JI. Relationships between pupil diameter and neuronal activity in the locus coeruleus, colliculi, and cingulate cortex. Neuron. 2016;89:221–34.
Pasquier DA, Kemper TL, Forbes WB, Morgane PJ. Dorsal raphe, substantia nigra and locus coeruleus: interconnections with each other and the neostriatum. Brain Res Bull. 1977;2:323–39.
Kobayashi RM, Palkovits M, Kopin IJ, Jacobowitz DM. Biochemical mapping of noradrenergic nerves arising from the rat locus coeruleus. Brain Res. 1974;77:269–79.
Pulcu E. A nonlinear relationship between prediction errors and learning rates in human reinforcement learning. 2019. https://www.biorxiv.org/content/10.1101/751222v1.article-metrics.
Acknowledgements
This work is supported by a joint funding from the UK Medical Research Council and Janssen Pharmaceuticals (MR/S035591/1) awarded to SM, PJC and CJH, and by the NIHR Oxford Health Biomedical Research Centre. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health.
Funding
This work is supported by a joint funding from the UK Medical Research Council and Janssen Pharmaceuticals (MR/S035591/1) awarded to SM, PJC and CJH.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
CJH has received consultancy fees from P1vital Ltd, Janssen Pharmaceuticals, Sage Therapeutics, Pfizer, Zogenix and Lundbeck. SM has received consultancy fees from P1vital, Zogenix, Sumitomo and Janssen Pharmaceuticals. EP has received consultancy fees from Janssen Pharmaceuticals. CJH and SEM hold grant income from UCB Pharma, Zogenix and Janssen Pharmaceuticals. CJH, SM and PJC hold grant income from a collaborative research project with Pfizer.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Pulcu, E., Guinea, C., Cowen, P.J. et al. A translational perspective on the anti-anhedonic effect of ketamine and its neural underpinnings. Mol Psychiatry 27, 81–87 (2022). https://doi.org/10.1038/s41380-021-01183-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41380-021-01183-1