INTRODUCTION

Pharmacologic challenge paradigms have been used extensively to investigate the responsiveness and integrity of the central serotonin system. They have primarily been used to study depression and personality disorder, including traits associated with impulsive/aggressive behaviors and suicidal behavior. Blunted prolactin response to these challenge agents—such as fenfluramine (FEN) or citalopram—has been found in most studies of major depression (Newman et al, 1998), as well as in personality disorders characterized by impulsive and aggressive traits (New et al, 2004). Blunting of the prolactin response correlates with impulsiveness and aggressiveness in both patient (Coccaro et al, 1989; Soloff et al, 2003) and non-patient populations (Manuck et al, 1998, 2002). Given the association with both depression and aggression, it is not surprising that a blunted prolactin response to FEN is also associated with past suicidal behavior (Cleare et al, 1996; Malone et al, 1996; Correa et al, 2000; Duval et al, 2001; Soloff et al, 2003), consistent with a variety of other measures of central serotonin function (Mann, 2003).

Although the majority of these studies have focused on the prolactin response to serotonergic challenge, other hormones released by serotonergic stimulation of the hypothalamus/pituitary have been measured as well, including adrenocorticotrophic hormone (ACTH) and growth hormone, as well as ACTH-stimulated cortisol secretion. In studies using FEN, other hormone responses typically do not discriminate clinical groups as well as prolactin (Newman et al, 1998). In the case of cortisol, this may be due in part to methodological factors, since these challenge tests tend to be run early in the day when circulating levels of cortisol are at their highest and declining. As a result, administration of challenge agents results in an attenuation of this circadian decline rather than a clear increase from baseline levels (Newman et al, 1998). Nonetheless, blunting of cortisol response has been found in suicide attempters (Cleare et al, 1996; Duval et al, 2001). Cortisol response to serotonergic challenge may have particular relevance to the assessment of acute psychopathology, since prolactin responses to these challenge agents tend to be stable before and after treatment (Flory et al, 1998; Dulchin et al, 2001), while cortisol response is not (Bhagwagar et al, 2002b).

These differences in the nature of prolactin and cortisol responses may be important for the assessment of later risk for suicidal behavior. Prospective studies suggest that risk for both future attempt and completion may be evident in baseline measures of serotonin activity or hypothalamic–pituitary–adrenal (HPA) axis dysregulation (Mann and Currier, 2007). FEN and other serotonin challenge paradigms provide indices of both serotonin system responsiveness and HPA axis function and may be able to predict risk of suicide attempts in major depression.

Beyond these biological effects, mood and behavioral effects of serotonergic challenge agents may also have prognostic significance. In a small sample of subjects with familial depression who received fluorodeoxyglucose positron emission tomography (PET) scans after the administration of FEN, we found an acute reduction in self-reported depressed mood that correlated with reduced glucose metabolism in the ventromedial prefrontal cortex (Kegeles et al, 2003), within a network of regions typically associated with treatment responsiveness. Positive initial mood responses to serotonergic agents, as early as the first week of treatment, appear related to treatment efficacy (Taylor et al, 2006). Conversely, a poor initial mood response to treatment, as well as increasing activation, may be associated with the persistence or even emergence of suicidal ideation (Perlis et al, 2007).

In this study, we analyzed data from FEN challenge tests that had been administered to drug-free healthy volunteers, drug-free depressed subjects with no history of suicide attempt, and depressed subjects who had made one or more suicide attempts in the past. These challenge procedures included measures of both mood and motor speed. We subsequently followed subjects for 2 years, and identified a subsample that made a suicide attempt after the initial assessment. We compared these future attempters' neuroendocrine, mood, and behavioral responses during the FEN challenge test with healthy volunteers, depressed past suicide attempters, and depressed patients who had never made a suicide attempt. We hypothesized that future suicide attempters would show exaggerated blunting of both prolactin and cortisol responses after FEN administration. We conducted exploratory studies of effects of serotonin release on mood and motor speed, and relationship of those responses to outcome on follow-up.

METHODS

Subjects

Groups were (1) 15 healthy volunteers (non-patient comparison group); (2) 19 depressed subjects without any history of suicidal behavior (non-attempters); (3) 19 depressed subjects with a past history of suicidal attempt (past attempters) and no future attempt; and (4) 7 depressed subjects who would go on to make a suicide attempt within the 2 years following the assessment (future attempters). Six of the seven subjects who went on to make an attempt had made an attempt in the past. One of these future attempts was a completed suicide.

Clinical Assessments

Psychiatric illnesses were ruled out in non-patients using the non-patient version of the Structured Clinical Interview for DSM-IV (SCID-NP; First et al, 1997). Diagnosis was established in patients using the SCID, Axes I (SCID-I; Spitzer et al, 1990) and II (SCID-II; First et al, 1996). History of past suicidal behavior was assessed via semistructured interview using the Columbia Suicidal Behavior History Form (Oquendo et al, 2003c), which was repeated at 3 months, 1 year, and 2 years after discharge.

Baseline clinical assessments included the Hamilton Depression Scale (Hamilton, 1960), Beck Depression Inventory (Beck et al, 1961), Global Assessment of Functioning, Beck Hopelessness Scale (Beck et al, 1974), Scale for Suicidal Ideation (Beck et al, 1979), Barratt Impulsiveness Scale (Barratt, 1985, 1994), Buss–Durkee Hostility Inventory (Buss and Durkee, 1961), and Brown–Goodwin Aggression History (Brown et al, 1979). For subjects with a past history of suicide attempt, the Beck Suicide Intent Scale (Beck et al, 1975) and a rating of Medical Lethality of past attempts (op. cit.) was also completed.

A set of repeatable assessments was also completed at baseline and during the course of the FEN challenge procedure. Repeated assessments included the Profile of Mood States (POMS; McNair et al, 1981), a 65-item questionnaire that produces a Total mood disturbance score and six subscale scores for Depression, Tension/Anxiety, Anger/Hostility, Confusion, Fatigue, and Vigor. Subjects also received a brief assessment of motor speed via the finger tapping test, a standard neuropsychological measure using a telegraph-like key and counter to assess speed of intentional finger oscillations (Reitan and Wolfson, 1985). Drug side effect ratings for activation, dizziness, nausea, and confusion (0–3 rating: none, mild, moderate, and severe) were completed by examiners on each challenge day.

FEN Challenge

The FEN challenge involved a placebo and active drug day, done in a fixed order, as described previously (Kegeles et al, 2003; Oquendo et al, 2003d). Placebo or 0.8 mg/kg D, L FEN (∼60 mg for average subject) was administered at 0800 hours, blood samples were drawn 15 min before drug administration, immediately after drug administration (hour 0) and at hourly intervals for the next 5 h. Bloods were assayed for prolactin, cortisol, FEN and norfenfluramine levels. Prolactin and cortisol levels were determined by immunoradiometric assay, drug and metabolite levels by gas liquid chromatography, according to methods published previously (Kegeles et al, 2003; Oquendo et al, 2003d).

Mood and behavioral assessments were completed on the day before the initial FEN procedure (run-in baseline to reduce practice effects on challenge days), and at the end of the procedure on each of the two challenge days (5.5 h after FEN administration). Medication side effect ratings were completed at the start of these assessments on each of the two challenge days only. Because all subjects were participants in a study examining the effects of FEN challenge on cerebral glucose metabolism and received a PET scan between hours 3 and 5 of the standard FEN procedure (Kegeles et al, 2003; Oquendo et al, 2003d), they were unavailable for behavioral testing at this time.

Statistical Analyses

Demographic and baseline clinical variables were compared using one-way ANOVA and post hoc Tukey LSD comparisons for continuous variables, χ2 for categorical variables. Baseline, peak change from baseline, and total area under the curve (AUC; Pruessner et al, 2003) in prolactin and cortisol (FEN day only) were compared among groups using analysis of covariance (ANCOVA) within general linear models, with age, sex, and combined FEN/norfenfluramine level at peak (hours 3 and 4, averaged) as covariates. These three parameters effectively characterize the overall response curves for each group. Peak response was of primary interest, but AUC was analyzed to capture responses that might be modest but prolonged over the course of several hours. Peak change was computed as the difference between the nadir up to hour 1 of the procedure, and the highest point thereafter. Mood ratings derived from the POMS were analyzed with repeated-measures ANCOVA across study days, with age and sex as covariates. Finger tapping scores were converted to normatively corrected z-scores—which adjust for age and sex effects—based on published external normative data (Spreen and Strauss, 1998), and averaged across both hands. Finger tapping scores and side effect ratings were compared using repeated-measures ANOVA under general linear models.

The fixed order of presentation of placebo and drug days limits our ability to attribute changes in mood or performance to drug effects vs accumulated order effects. However, repeated assessment effects were managed, in part, by the use of a run-in baseline. We attributed changes in mood ratings or behavioral performance to drug effects if the overall time or group by time effect in repeated-measures analysis was significant, and if there was a significant difference in ratings or performance between drug and placebo days (significant differences between drug day and baseline alone were not attributed to drug effects).

Correlations were computed among scores that differentiated groups, and logistic regression was used to determine if neurohormonal response and mood change could be used to predict later suicide attempt.

RESULTS

Demographic and Clinical Characteristics of Groups

Both past and future attempters were younger than non-patients, but all groups were comparable in education, WAIS-R Vocabulary score, and gender distribution (Table 1). The depressed groups were comparably depressed on the Hamilton Depression Rating Scale, although past and future attempters had higher Beck Depression Scale scores as we have previously reported in larger samples (Mann et al, 1999; Grunebaum et al, 2005). There was no greater frequency of bipolar depression, borderline personality, substance use disorder, or post-traumatic stress disorder in the future attempters group relative to other patient groups. The only distinguishing clinical characteristic of the future attempters was higher current suicidal ideation and Beck Depression scores.

Table 1 Demographic and Clinical Rating Data

Future suicide attempts were made, on average, 18.7±7.4 months after FEN assessments, with the earliest at 5 months. One subject made multiple attempts (three) and made the earliest attempt. Maximum lethality rating of future attempts averaged 3.4±2.8, and ranged from 0 (no injury) to 8 (death). Suicidal ideation remained high in future attempters (10.6±12.0 at 3 months post-discharge, 14.7±9.2 after 1 year) and consistently higher than other depressed subjects (F(2,33)=4.59, p=0.02). Beck Depression score also remained high (26.0±17.6 and 31.8±8.6) and higher than other groups (F(2,37)=6.24, p=0.005). Hamilton score remained high, but was comparable to other groups (20.6±14.1 and 27.6±8.9; F(2,39)=2.29, p=0.11).

FEN Challenge

Raw prolactin and cortisol values in each group on the FEN day are presented in Figure 1a and b.

Figure 1
figure 1

(a, b) Raw plasma prolactin and cortisol values (mean±SEM) for each group at initial baseline (−15 min) and at hourly intervals after administration of fenfluramine (active drug day).

Adjusted baseline (F(3,53)=0.93, p=0.43) and change to peak prolactin levels (F(3,51)=2.24, p=0.10) were not different among the groups (Table 2). AUC for prolactin, however, was significantly lower in both past and future attempters relative to non-patients (F(3,51)=3.88, p=0.01). Sex was, as expected, a significant covariate for all prolactin measures (for baseline: F(1,51)=9.97, p=0.003; for change to peak: F(1,51)=36.45, p<0.001; for AUC: F(1,51)=43.86, p<0.001), age affected both baseline prolactin (F(1,51)=7.26, p=0.009) and AUC (F(1,51)=13.28, p=0.001), both of which were lower in older subjects.

Table 2 Hormonal and Drug Level Measures during FEN Challenge

Adjusted baseline cortisol levels (F(3,54)=1.45, p=0.24) and AUC (F(3,52)=0.71, p=0.55) were comparable among the groups; however, change to peak in cortisol (F(3,52)=3.88, p=0.01) was significantly lower in future attempters relative to all other groups (Table 2). These measures were not affected by sex of subject, and subject's age only affected AUC, which was higher in older subjects (F(1,52)=6.29, p=0.02).

As an additional check on these findings, baseline, change to peak, and AUC were computed for prolactin and cortisol on the placebo day. Each value was then used as a covariate in the analysis of the FEN day response (in addition to age, sex, and drug levels), as a control for non-pharmacologic effects on these outcome measures. Group differences were similar to those in the previous analyses. AUC for prolactin differed among groups (F(3,50)=4.57, p=0.007) with past and future attempters lower than non-patients (post hoc p's<0.01), and peak change in cortisol differed among groups (F(3,51)=4.58, p=0.007) with future attempters lower than all other groups (post hoc p's<0.004).

Supplemental analyses to examine the effect of unipolar vs bipolar depression revealed no differences between these depression subtypes. Additional covariance analyses examining the effect of depression severity, using either the Hamilton scale score or the Beck Depression Inventory score, did not alter the nature of group differences.

Mood and Behavior Measures

On the POMS (Table 3), there were significant group effects for the Total score (F(3,53)=10.86, p<0.001) and all subscales throughout the study (for all scales, F(3,53)>4.40, p<0.01). All patients had significantly poorer scores than non-patients on all POMS measures throughout the study (all post hoc p<0.02). Future attempters were significantly worse than all other groups on the Depression subscale (F(3,53)=1.02, p<0.001, post hoc p's<0.05).

Table 3 Mood and Behavioral Measures during Challenge Testing

Significant change across evaluation day independent of group membership was found for the Fatigue subscale (F[2,106]=4.05, p=0.02), which worsened significantly on the placebo day, and then improved significantly on the FEN day.

Differential changes across evaluation days, across groups were found on the POMS Total (F(6,106)=2.42, p=0.03), Depression (F(6,106)=2.76, p=0.02), and Tension/Anxiety (F(6,106)=2.40, p=0.03) scales, with a near significant effect on the Anger/Hostility scale (F(6,106)=2.01, p=0.07). In all cases, there was improvement in mood ratings in depressed non-attempters and past suicide attempters and a slight worsening of mood in the future attempters. Change scores computed as FEN day mood ratings minus placebo day ratings differed significantly across the groups (for POMS Total: F(3,53)=4.10, p=0.01; for Depression: F(3,53)=3.68, p=0.02; for Tension/Anxiety: F(3,53)=4.81, p=0.005; all post hoc p<0.02 for comparisons of change scores between future attempters and either past attempters or depressed non-attempters). The nature of these differences on the POMS Total score (adjusted for sex and age) is illustrated in Figure 2.

Figure 2
figure 2

POMS Total score (±SEM) on baseline, placebo, and active drug (fenfluramine) days in each subject group, adjusted for age and gender. Decline in Total score (improvement) is significant between placebo and fenfluramine days in non-attempter and past attempter groups, and significantly different than change (worsening) in future attempter group.

On the finger tapping test (Table 3), performance did not differ among the groups at baseline and throughout the study, but changed significantly in all groups over the course of the FEN challenge procedure (F(2,96)=12.05, p<0.001). Tapping rate was significantly higher on the FEN day than on either placebo or baseline days.

Overall, subjects showed a decrease in POMS Fatigue and increase in finger tapping rate on the FEN day relative to placebo. Patient subjects, with the exception of future attempters, showed an improvement in POMS Depression, Anxiety, and Total scores on the FEN day. Depression score, in fact, worsened in most future attempters (5/7, 71.4%), a significantly higher percentage than in the non-patient (3/15, 20.0%) non-attempter (5/19, 26.3%) or past attempter (2/18, 11.1%) groups (χ2(3)=10.0, p=0.02).

Side Effect Ratings

Side effect ratings are presented in Table 3. Side effects of activation and nausea increased on the FEN day, but with no differences among subject groups. Change in activation ratings was due to mild increase in 17.2% (n=10) of all subjects (6 patients/4 non-patients); change in nausea was due to mild symptoms in 19.0% (n=11) of all subjects (7 patients/4 non-patients). Apparent activation was related to improvement in POMS Depression score in patients (t(41)=2.28, p=0.03), but not finger tapping change (t(39)=0.23, p=0.82), suggesting it was a proxy for a brightening of mood.

Correlations

AUC for prolactin (corrected for age, sex, and drug levels) was not associated with other variables, including the change to peak cortisol (r=−0.01, p=0.95).

The peak change in cortisol response in patients was marginally associated with baseline suicidal ideation (r=−0.28, p=0.06). The peak change in cortisol was also associated with suicidal ideation in the entire depressed sample at both 3 months (r=−0.32, p=0.05) and at 1 year (r=−0.40, p=0.007). These associations approached significance in the future attempter group (r's=−0.68, p=0.09 at both 3 months and 1 year). It correlated as well with lifetime number of suicide attempts (log-transformed) among all attempters (r=−0.39, p=0.05).

A supplemental, stepwise logistic regression was run in all depressed subjects to determine if a combination of variables derived from the challenge procedure might be useful in identifying future attempters. Change to peak cortisol, change in POMS Total score from placebo to FEN day, age, sex, and current level of suicidal ideation were used in the pool of predictors, with a p-to-enter of 0.05. The final equation included three variables: lower change to peak cortisol (coefficient=0.61, Wald statistic=4.28, p=0.04, odds ratio=1.84, 95% CI=1.03–3.28), less improvement in POMS Total score (coefficient=0.10, Wald statistic=4.26, p=0.04, odds ratio=1.11, 95% CI=1.01–1.22), and younger age (coefficient=0.21, Wald statistic=3.69, p=0.05, odds ratio=1.24, 95% CI=1.00–1.54). This equation identified 57.1% of the future attempters (4/7), with a 2.3% (1/37) misclassification rate of other depressed subjects.

DISCUSSION

Lower AUC for prolactin response distinguished both past and future attempter groups from non-patients. This difference was not reflected in the peak change measure due to persistence of group differences over multiple hours, and small sample sizes, which limited statistical power to define a unique peak difference. These groups did not differ from other depressed subjects who had never made a suicide attempt, although if past and future attempters are combined in a single ‘any suicide attempt’ group, they differ from non-attempters on the AUC measure. The reduced prolactin response in subjects who have made or will make a suicide attempt is consistent with prior research regarding the blunting of response in association with past suicidal behavior (Cleare et al, 1996; Malone et al, 1996; Correa et al, 2000; Duval et al, 2001; Soloff et al, 2003), and also consistent with studies suggesting that prolactin response is a trait measure of serotonergic abnormality not sensitive to changes in clinical state that might indicate an acute risk for suicidal behavior (Flory et al, 1998; Dulchin et al, 2001; Bhagwagar et al, 2002b). As such, it may be a predictor of long-term or lifetime risk for suicidal behavior rather than immediate or short-term risk.

Cortisol responses to FEN were clearly blunted in the future attempters group, and differentiated them from all other groups. This difference was reflected in the peak measure because, unlike prolactin, there was a more defined increase in cortisol response—in all groups except future attempters—that occurred around hour 2 of the procedure, followed by normal morning decline in cortisol levels. Blunting of cortisol response, defined by lower peak change, was related to persistently higher suicidal ideation, even up to a year after the FEN challenge, and greater number of past attempts.

The relationship between cortisol regulation and future suicidal behavior is consistent with a role for HPA dysfunction in suicidal behavior. A recent meta-analysis found that non-suppression of cortisol on the dexamethasone suppression test predicts completed suicide (Mann and Currier, 2007), though a recent follow-up study of the largest such cohort found that this relationship only holds for those with a history of attempt (Jokinen et al, 2007). Higher cortisol levels and dexamethasone resistance characterize melancholia, which carries a heightened risk for suicidal behavior in mood disorders (Grunebaum et al, 2004). On the other hand, lower cortisol responses to administration of combined dexamethasone–corticotropin-releasing factor have been found in suicide attempters (Pfennig et al, 2005), and cortisol responses to pharmacologic challenge tend to be lower in subjects with histories of abuse (Heim and Nemeroff, 2001) or post-traumatic stress disorder (Yehuda, 2001; Oquendo et al, 2003a), both significant risk factors for suicidal behavior (Brodsky et al, 2001; Oquendo et al, 2003b). Cortisol response to pharmacologic challenge appears to be sensitive to clinical state (Bhagwagar et al, 2002b), which may account for some of these discrepancies. Some of these differences may also related to sampling, since the likelihood of future suicide completion even in very high-risk past attempter groups remains small. Alternatively, dysregulation in cortisol does not necessarily imply that levels will always be high or low. One possibility is that ambient levels of cortisol are low and the threshold for response high in subjects at risk for suicidal behavior or with trauma histories, due to hypersensitivity of glucocorticoid receptors and enhanced negative feedback inhibition (Yehuda and LeDoux, 2007). Once that threshold is crossed, however, cortisol responses may be exaggerated.

The post-FEN mood and behavioral response of depressed subjects in this study are consistent with short-term effects of serotonergic medications. Antidepressant effects of serotonergic medications, in terms of improved mood and increased energy, may be evident in the earliest stages of treatment and distinguish those who are likely respond (Taylor et al, 2006). Our data suggest that there may be transient mood effects that are evident within hours after a single dose of a serotonin-enhancing drug. Depressed patients in this study reported a decrease in fatigue and increase in motor speed, and a modest, transient improvement in feelings of depression (30.9% reduction in score) after receiving FEN. Future attempters, however, also showed evidence of increased energy without an improvement in mood. The combination of being energized without improvement in mood during antidepressant medication treatment may create risk for later suicidality in the context of treatment (Perlis et al, 2007). In our study, logistic regression suggested that absence of mood improvement alone is related to later risk of attempt, but this is at a single, baseline assessment that occurred well before subsequent suicide attempt. Given concerns about the safety of antidepressants, future studies need to evaluate relative effects on mood and activation in the early phases of antidepressant treatment trials and to carefully monitor suicidal thinking to determine if this predicts later risk of suicidal behavior.

The study was limited by its small sample sizes, fixed order of administration of the placebo and FEN to avoid carry-over effects of FEN, and intercurrent PET scanning. There may have been mood and behavioral effects that were more pronounced during the intermediate phases of the study, although analysis of FEN and norfenfluramine levels revealed that levels of both were at or near peak levels even at the final assessment point of the procedure, and prolactin levels remained high at this point as well. Central activity of the drug was likely to have persisted to this point. The PET scans themselves may have had an effect on both prolactin and cortisol response. However, the prolactin and cortisol responses to FEN in all groups on the placebo day showed no increases related to the timing of these scans (prolactin responses were flat and identical in all groups, cortisol responses showed a continuous decline over the 5 h of the study and were also identical across groups; data available upon request). In addition, prolactin and cortisol responses on the FEN day in non-patients, non-attempters, and past attempters are consistent with prior studies that did not involve scanning. This is the first study to evaluate mood and behavior in the context of a pharmacologic challenge in a prospective sample of suicide attempters, and its results are of interest. However, given the small sample sizes, future studies with larger samples, more frequent mood and behavioral assessments, and counterbalanced order of placebo/drug administration are needed.

The results of logistic regression, while exploratory, suggest that responses to acute pharmacologic challenge may be useful for generating predictors of later suicidal behavior. Technological and methodological improvements make it feasible to include these measures in larger studies. Cortisol can now be assessed reliably and non-invasively from saliva (Aardal-Eriksson et al, 1998), and alternatives to FEN (ie citalopram) have been used successfully in challenge paradigms (Bhagwagar et al, 2002a; Flory et al, 2004). The ease with which such data can be collected also raise the possibility of developing procedures that might be translated to the clinic, given adequate evaluation of their prognostic value and better understanding of underlying mechanisms relating them to suicidal behavior. Additional research is needed to fully develop these techniques for clinical use.