INTRODUCTION

Raloxifene is a selective estrogen receptor modulator (SERM) that is used to treat osteoporosis in postmenopausal women (Heringa, 2003). Raloxifene has positive effects on cardiovascular risk factors, and is not associated with an increased risk of stroke and cardiovascular disease that is observed for regular estrogen treatment (Barrett-Connor et al, 2002). For such reasons, raloxifene treatment is considered an attractive alternative for prevention and treatment of osteoporosis and heart disease in male subjects as well (Doran et al, 2001; Blum et al, 2000). Recent results from a randomized controlled clinical trial show that raloxifene treatment at 120 mg daily for 3 years may successfully delay the onset of mild cognitive impairment (MCI) and possibly Alzheimer's Disease (AD) in elderly women (Yaffe et al, 2005). Thus, the need for an assessment of the effects of raloxifene treatment on male brain function and mental performance is growing.

Estrogens and SERMs such as raloxifene may differentially affect mental performance, including memory performance, in both sexes (Cahill, 2003). It is therefore unclear whether the preventive effects of raloxifene treatment in women can be extrapolated to males. To our knowledge, no studies have been published that examined effects of raloxifene treatment on memory performance in male subjects. Such studies usually involve large-scale clinical trials in which subjects or patients are examined for several years in order to detect significant effects of pharmacological treatment on neuropsychological measures of behavior. While anticipating the results of such studies, the current study examined the effects of 3 months of raloxifene treatment on brain function during memory task performance using functional magnetic resonance imaging (fMRI). Task-related fMRI yields both functional and behavioral measures of treatment effects, which may serve as preliminary findings prior to large-scale investigations of similar effects in controlled clinical trials. fMRI may be more sensitive and less ambiguous to the effects of sex steroids on mental performance than conventional neuropsychological assessment scales (Maki and Resnick, 2001). Additionally, fMRI may provide an insight into the treatment mechanism of raloxifene. Treatment effects on brain function can be mapped to individual brain structures at high spatial resolution (Honey and Bullmore, 2004). The behavioral significance of such effects may be examined either by direct correlation of behavioral measures with such effects (Goekoop et al, 2005), or by examining brain function associated with distinct response types during memory performance (eg correct hits, correct rejections) (Buckner et al, 1998). This allows a detailed analysis of the effects of SERMs on memory function, which may eventually be relevant to the design and development of new SERMs that target specific aspects of memory performance, such as encoding and retrieval processes (Bernardi et al, 2003).

In a previous fMRI study, we examined effects of raloxifene treatment vs placebo on brain function in healthy elderly males during encoding of new information (human faces) into memory (Goekoop et al, 2005). Raloxifene globally enhanced brain activation during face encoding, with the exception of subcortical, medial temporal, and visual structures. This was interpreted as a global effect of raloxifene treatment on cortical arousal, rather than a specific influence on each of the cognitive domains represented by this pattern (Goekoop et al, 2005). Such effects are in line with findings from a large number of studies, suggesting that estrogens have a context-dependent effect on cortical arousal (Morgan et al, 2004; Cahill and Alkire, 2003). Based on the locations of treatment effects, we predicted small effects of raloxifene treatment on working memory performance, executive functions, and verbal skills (Goekoop et al, 2005).

Since no treatment effects were found in subcortical or medial temporal areas during encoding (despite activation of these structures during task performance), we considered an effect of raloxifene treatment on episodic memory to be less likely. However, the effects of raloxifene treatment on episodic memory could not be fully assessed, since only the encoding phase of memory performance was examined. Since a growing number of imaging studies has shown that pharmacological substances affect brain function in process-specific manner (Honey and Bullmore, 2004; Thiel, 2003), raloxifene treatment may have unique effects on brain function during encoding, consolidation, or retrieval stages of memory performance. We therefore used fMRI to examine the effects of 3 months of raloxifene treatment on brain function during face recognition. Given the reported efficacy of raloxifene treatment in preventing the onset of memory impairment in elderly women, we hypothesized that raloxifene treatment would eventually affect brain function during retrieval of episodic information.

METHODS

Study Design

Subjects were screened for participation in a randomized, double-blind, placebo-controlled repeated measures design. In each subject, fMRI was performed at baseline (BL; no medication; session 1) and after 3 months of a once daily oral intake of raloxifene 120 mg or placebo (TR; session 2). This dose was chosen because a previous study showed no significant effects of raloxifene 60 mg on markers of bone turnover in males (Doran et al, 2001). BL and TR sessions were performed on the same time of day in each subject (margin 30 min). If data acquisition failed, the subject's consent was asked for an additional scanning visit, up until which time the relevant medication regime (raloxifene or placebo) was continued, to obtain a maximum number of complete data sets. Study period extension was not to exceed 10 days.

Subject Recruitment

The medical ethical review board of the VU University Medical Center of Amsterdam approved the study. A total of 30 healthy, right-handed elderly males, aged 60–70 years (mean 63.6 years, SD 2.4; range 60–69 years) were recruited by advertisement in local newspapers. This was the same study population that was examined previously for effects of raloxifene treatment on brain function during face encoding. All subjects provided informed consent during a screening visit in which the procedure was explained and contraindications were checked. The mental status of subjects was assessed by means of a structured clinical interview. This interview involved questions regarding the subjects’ general health, mental health, social status, history, and intoxications. Exclusion criteria involved (a history of) psychiatric disease (ie any diagnosable disorders according to DSM-4-TR criteria (American Psychiatric Association, 2000), any neurological illness (memory complaints in particular), and the use of any medication or other substances that are known to influence cerebral functioning (including >0.5 pack of cigarettes, >4 glasses of alcohol). Elaborate physical examination and laboratory tests were performed to back the clinical interview. Exclusion criteria to MRI involved the presence of metallic implants in high-risk areas and a history of claustrophobia. Formal education was determined using a dutch system (low, middle, high). On the day of the second scanning session (after 3 months), a similar structured clinical interview was performed with a particular focus on side effects of treatment. Compliance was assessed by pill counts and the subjects’ comments.

fMRI

Imaging was carried out on a 1.5T Sonata MR scanner (Siemens, Erlangen, Germany), using a standard circularly polarized head coil with foam padding to restrict head motion. For fMRI, an echo planar imaging sequence was used (echo time 60 ms, flip angle 90°, matrix 64 × 64, field of view 192 × 192 mm) to obtain 21 transverse slices (thickness 5 mm, interslice gap 1 mm). Task stimuli were projected on a screen located at the head end of the scanner table via an LCD projector located outside the scanner room. Subjects viewed the screen through a mirror located on the head coil. In each hand, subjects held an fMRI compatible response-box through which they were able to react to task stimuli by pressing a single button using one of their index fingers. A T1-weighted structural MRI scan was obtained of each subject (MPRAGE; inversion time: 300 ms, TR=15 ms; TE=7 ms; flip angle=8°; 160 coronal slices, 1 × 1 × 1.5 mm voxels). Total scanning time including structural imaging on average was 21 min for each visit.

Face Recognition Task

Immediately after face encoding, which involved the presentation of four blocks of unfamiliar faces (24 in total), alternating with blocks of fixation (Goekoop et al, 2005), a recognition task was administered. Total scanning time was 6 min 12 s for the encoding task and 3 min 40 s for the recognition task. Hence, minimum delay between items presented during encoding and recognition tasks was 1 min, and maximum delay 10 min. During the first 10.5 s of each memory-task, subjects saw a circle indicating time left before the onset of the first condition. During face recognition, 24 faces, of which 12 had been shown during encoding and 12 were new, were presented sequentially in random order on a black background. Presentation time was 5 s for each face, followed by a white fixation-cross presented for 3 s on a black background. We chose to contrast faces with a ‘low level’ fixation cross (X) in order to maximize contrast between processing demands for both conditions, which would result in a large number of brain areas being activated for faces>fixation contrasts, including visual and sensorimotor areas. This would increase chances of finding significant effects of treatment. Subjects were instructed verbally to indicate whether the presented faces were familiar or unfamiliar by pressing one of two buttons (written instructions ‘Left: familiar’ and ‘Right: unfamiliar’ also appeared alongside the pictures). Mean reaction times and the number of hits and misses were recorded. To avoid investigator's bias, task performance data were viewed only after both scanning visits had been completed.

Two comparable versions of each paradigm were constructed and randomized across subjects and scanning sessions (baseline, treatment) to reach equal numbers of subjects receiving test versions 1 and 2 in both treatment groups. A third ‘spare’ version was available in case data acquisition might fail during one of the scanning sessions. Each paradigm contained unique items (faces) to avoid any overlap or interference with previously observed items at the time of the second scanning session. Paradigms were practised extensively to minimize effects of skill learning in the course of the scanning sessions. A practicing task was used that was similar to tasks used for neuroimaging, but contained less items (practicing items did not overlap with those of tasks used for scanning purposes). A day before the onset of the first scanning session, subjects visited the hospital and practiced both encoding and retrieval tasks. On the day of the first scanning session, minutes before the onset of the first measurements, the tasks were again practiced with subjects in the scanner. On the day of the the second scanning session, subjects practiced the procedure within the scanner only. Total time for one scanning visit including instructions of memory tasks was approximately 1 h.

Statistical Analysis of Task Performance Data

Post hoc sorting of response types yielded a number of true positive (TP, correct hits), true negative (TN, correct rejections), false positive (FP, false hits), false negative (FN, false rejections), and forgotten items (Forgot). Based on these responses, an overall performance accuracy score (‘Pr’) was calculated by using the following formula: Pr=FP−TP (Corwin, 1994). This measure was divided by the total number of new items (12) to correct for 50% chance levels, yielding scores between −1 and 1. A ‘false-alarm rate’ (‘FAR’) was calculated using the following formula: FAR=(FP+0.5)/13. A measure of response bias ‘Br’ was then calculated using the following formula: Br=FAR/(1−Pr). This measure indicates the chance that subjects will guess that tested items are targets when they are in an uncertain state. High values of Br indicate a tendency to produce high numbers of FP responses, whereas low values of Br indicate high uncertainty (Corwin, 1994).

Three independent variables were examined for effects of raloxifene vs placebo treatment: that is, accuracy (Pr), latency (RTav), and response bias (Br). For each variable, a single mixed effects (ME) linear model was specified using SPSS 12.0, with ‘group’ (two levels: RAL, PLAC), ‘scanorder’ (two levels: session 1, session 2), and ‘test version’ (two levels: 1 and 2) as fixed factors, ‘scanorder’ being the repeated factor, and ‘subjectnr’ defining linked measurements from the same subject. This model allowed calculation of significant effects of raloxifene vs placebo treatment (interaction group*scanorder), as well as effects of raloxifene or placebo treatment separately (custom contrasts examining ‘placebo: session 2—session 1’, and ‘raloxifene: session 2—session 1’) and differences in performance between raloxifene and placebo groups at baseline and after treatment (custom contrasts examining ‘session1: RAL—PLAC’, and ‘session 2: RAL—PLAC’). Effects were examined using a threshold for significance defined by p<0.05, Bonferroni corrected for multiple comparisons. The advantage of this model is that it is more accurate than conventional models, allows calculation of all effects of interest within the confines of a single statistical model, and is closely related to the model used for analysis of the functional data. Since our previous analysis of the same performance data used the conventional but less accurate (multimodel) approach, the significance of the observed effects as reported in the current study may vary to some degree with those reported earlier (Goekoop et al, 2005). However, the basic trends remain the same (see Results section).

Analysis of Functional Neuroimaging Data

Functional data sets were analyzed using FSL 3.2 (Smith et al, 2004). The first five volumes of each data set were discarded to account for T1-saturation effects. At first level (individuals), the following preprocessing was applied: non-brain removal, slice-timing correction using Fourier-space time-series phase-shifting, motion correction and spatial smoothing using a Gaussian kernel of FWHM 8 mm, mean-based intensity normalization of all volumes by the same factor, and high and low pass temporal filtering (Jenkinson et al, 2002; Smith, 2002). Functional neuroimages of each subject were coregistered to corresponding structural images in native space, and structural images were registered to structural Talairach standard images (Talairach and Tournoux, 1988), defined by the Montreal Neurological Institute standard brain supplied with FSL. The same transformation matrices used for structural-to-standard transformations were then used for functional-to-standard space transformations of coregistered functional images. All registrations were carried out using an intermodal registration tool based on the correlation ratio (Jenkinson and Smith, 2001). After preprocessing, the following statistics was applied on a voxel-wise basis on each time series, using local autocorrelation correction (Woolrich et al, 2001): signal change during face recognition was modeled in an event-related fashion, using separate regressors for TP, TN, FP, FN, and forgotten response types (see above). Signal variance during fixation (X condition) was not modeled, to prevent overspecification of the model. Type and onset time of the events were determined by post hoc sorting, based on the responses given by the individual subjects. Thus, all individuals obtained a unique model of signal response containing a regressor for each response type, which was convolved with a γ function to model the hemodynamic response. Model fitting generated whole brain images in native space of parameter estimates and corresponding variance, representing average signal change during a particular condition (eg TP, TN, FP, FN, forgot) vs fixation (X, an implicit baseline condition). Owing to a limited number of false responses, effects of raloxifene treatment vs placebo were examined on ‘TP’, ‘TN’, ‘TP>TN’, and ‘TN>TP’ contrasts only, that is, false responses were ignored in further analyses.

Since our face recognition task contained both novel and familiar items, both encoding and retrieval processes occurred during task performance (Buckner et al, 2001). Such processes can be studied separately by means of event-related analyses of brain function during TP and TN items. TP and TN vs low-level fixation (X) contrasts (‘loose comparisons’) examine more general aspects of recognition memory performance, which are biased with respect to successful retrieval processes (TP>X) and encoding processes (TN>X), respectively (Buckner et al, 1998). In contrast TP 〈〉 TN contrasts (‘tight comparisons’) provide additional information concerning specific subcomponent processes during retrieval: TP>TN contrasts specifically examine brain areas where signal intensity during successful retrieval of familiar information was significantly higher than signal intensity during successful rejection/encoding of new information. Areas of significant signal differences are, therefore, thought to represent ‘successful retrieval’ processes. Conversely, the reverse contrast TN>TP examines brain function related to ‘encoding during attempted retrieval’ (Daselaar et al, 2003). Contrast images were resampled to 2 × 2 × 2 mm in standard space and fed into in a second-level statistical analysis to examine effects of raloxifene treatment on (distinct aspects of) brain function during retrieval.

Activation images of main effects during recognition were produced by calculating an average activation map based on all individual activation maps for each session, medication regime, and event-type separately (ie BL (placebo), BL (raloxifene), TR (placebo), and TR (raloxifene)) in a mixed effects higher level analysis (Woolrich et al, 2004) using clusters determined by Z<2.3 and a corrected cluster significance threshold of p=0.05 (Forman et al, 1995; Friston et al, 1994; Worsley et al, 1992). Main effects at baseline were first tested for significant differences between raloxifene and placebo groups using the following contrasts: BL (RAL) 〈〉 BL (PLAC), at Z=3.1 (p<0.001). Effects of treatment were then calculated using a repeated measures model, in which two explanatory variables (EVs) contrasted BL and TR sessions of placebo and raloxifene groups, allowing separate variances for both groups. Two additional EVs (one for each treatment group) coded for effects of test version on signal response. The main contrasts of interest were (TR (raloxifene)−BL (raloxifene)) 〈〉 (TR (placebo)−BL (placebo)), testing for an interaction between raloxifene and placebo groups before and after treatment. Analyses were performed using a threshold of Z=3.1 (p<0.001). Images were rendered on a mean anatomical brain volume of all subjects in standard space (Talairach and Tournoux, 1988) for display purposes.

RESULTS

Demographics

Mean age of subjects was 64.1 (SD 2.4) years in the placebo group and 63.1 (SD 2.5) years in the raloxifene group. The placebo group contained two smokers vs one in the raloxifene group. Education level was equal in both groups (χ2=0.26; p=0.88).

Subject Compliance and Discontinuation

One subject (placebo group) was claustrophobic and data quality of both sessions was poor in another subject (raloxifene group). These data were discarded, yielding 28 complete data sets containing BL and TR data (n=14 raloxifene group, n=14 placebo group). Rescanning was performed once in a single individual (placebo group) because of poor data quality during the second scanning session. Subject compliance was good as assessed by tablet counts and there were no dropouts. No significant side effects were reported.

Task Performance

Numbers and percentages of all response types (ie TP, TN, FP, and FN responses) are shown in Table 1A. Mean overall recognition accuracy was 0.60 (SD 0.20) above 50% chance level, with mean overall reaction time 1.63 s (SD 0.33 s). Table 1B displays the significance of the difference between mean performance values (accuracy, latency and response bias) across both groups and scanning sessions. The placebo group showed significantly better mean recognition accuracy scores at baseline than the raloxifene group (p=0.022). This trend was reversed after treatment (p=0.052). After treatment, recognition accuracy increased significantly in the raloxifene group (p=0.046), but decreased in the placebo group (p=0.003), with the interaction being significant at p=0.003. Response latency was not significantly affected in both groups. Response bias showed a trend toward a decrease in the placebo group (p=0.08), but remained stabile in the raloxifene group (p=0.86), with the interaction being nonsignificant (p=0.17). No significant effects were found for the covariate test version, or its interactions with other factors, on measures of task performance.

Table 1 Performance Measures in Raloxifene and Placebo Groups Before and After Treatment

Brain Function

Main effects of face recognition (correct responses) involved activation of ventral and dorsal occipital (visual) areas, bilateral inferior parietal, parahippocampal, superior temporal, prefrontal areas, and the lateral sulci (Figure 1, Table 2). These effects were very similar to previous patterns of brain function observed during delayed recognition memory performance (Rugg et al, 2002). Activation patterns for TP and TN decisions differed significantly: TP>TN contrasts showed activation bilaterally in primary visual cortex, anterior and posterior cingulate cortex, inferior parietal lobes, and insula, whereas unilateral activation was observed in right motor cortex, right basal ganglia, left cerebellum, and left inferior, middle, and superior frontal cortex (Z>2.3 cluster corrected (cluster threshold Z=3.1); Figure 1). The lateralization in precentral areas likely represents increased motor activity of the right hand and fingers during TP decisions when compared to TN decisions. In contrast, only left motor cortex showed increased activation in TN>TP contrasts (Figure 1).

Figure 1
figure 1

Axial slices showing main effects during face recognition rendered on a mean anatomical brain volume of all subjects. Left in the image is left in the brain. Effects after cluster correction at Z=2.3 and p<0.05. Color scale extends from Z=2.3 (red) to Z=8.3 (yellow). Effects during correct hits (TP), correct rejections (TN), and TP 〈〉 TN activation differences. Baseline: brain activation at baseline. Treatment: brain activation after 3 months of treatment. Ral: raloxifene group. Plac: placebo group. Visual inspection suggests an enhancement of activation after raloxifene treatment for TP items. See also text and Table 2.

Table 2 Volume, Z-Scores, and Coordinates of Peak Voxels of Local Maxima of Main Effects during Correct Recognition of Familiar Items (TP Responses; Averaged Across Baseline and Treatment Sessions of Raloxifene and Placebo Groups)

The main contrast examining the interaction between raloxifene and placebo groups at baseline and after treatment showed significant increases in activation during TP decisions only. Effects occurred in the left parahippocampal area and right inferior prefrontal cortex (Figure 2c; Table 3; Z>3.1). Nearly identical effects were found on brain function during TN decisions, but these were not significant at Z>3.1 (Figure 2d; Z>2.3). When studying the separate contributions of each group to treatment effects reported in Figure 2, the raloxifene group contributed mostly to the observed effects of treatment (Figure 2a and b). No significant effects were found of placebo intake on brain function. No significant treatment effects were found on contrasts describing TP 〈〉 TN activation differences. No decreases in activation were observed after treatment with either raloxifene or placebo. No significant differences were found at baseline between activation levels of raloxifene and placebo groups within areas of significant treatment effects (Z>3.1). Plots of percent signal change (from global mean values) of peak voxels of local maxima illustrate intensity changes in both groups before and after treatment (Figure 2).

Figure 2
figure 2

Axial slices showing effects of raloxifene treatment on brain function during TP and TN responses (correct hits and correct rejections). Functional maps are rendered on a mean anatomical brain volume of all subjects (Mean_anat_RALMEN). Left in the image is left in the brain. Color scale extends from Z=2.3 (orange) to Z=4.0 (yellow) for display purposes. No significant effects were observed after placebo intake (data not shown). (a) Raloxifene group, TP items, contrast TR>BL. When compared to baseline, raloxifene treatment significantly increases brain activation in parahippocampal and right inferior prefrontal cortex. Effects are significant at Z>3.1. (b) Raloxifene group, TN items, contrast TR>BL (RAL). Effects are not significant at Z>3.1, but are highly similar to treatment effects during TP responses at Z>2.3. (c) Raloxifene group vs placebo group, TP items, contrast (TR>BL (RAL))>(TR>BL (PLAC)) (interaction). Increased activation is observed in parahippocampal cortex and right inferior prefrontal cortex (areas listed in Table 3). Effects are significant at Z>3.1. (d) Raloxifene group vs placebo group, TN items, contrast (TR>BL (RAL))>(TR>BL (PLAC)) (interaction). Effects are not significant at Z>3.1, but are highly similar to treatment effects during TP responses at Z>2.3. The strong resemblance of treatment effects on TP and TN contrasts suggests a general effect of raloxifene treatment on brain function during recognition, rather than a selective effect on encoding or retrieval processes (see text). Plots: graphs depicting the interaction between mean percent signal change of raloxifene and placebo groups at baseline and after treatment, as observed in peak voxels of local maxima of significant effects of treatment. Two reference voxels have been sampled in peak voxels of local maxima during face recognition in similar slices (10 −76 16) and (2 −82 4), showing no significant interaction after treatment. Means and standard deviations (error bars) are shown. Arrows indicate the corresponding clusters of activation. See also text and Table 3.

Table 3 Volume, Z-Scores, and Coordinates of Peak Voxels of Local Maxima of Significant Effects of Medication Intake (Z>3.1), As Determined by the Interaction Between Activation Levels of Raloxifene and Placebo Groups at Baseline and After Treatment (see Methods)

DISCUSSION

In a previous study, we reported effects of raloxifene treatment on brain activation during encoding of unfamiliar faces (Goekoop et al, 2005). Results of the current study complement our previous findings by examining brain activation during recognition of the encoded items.

Effects of Raloxifene Treatment: Behavioral Data

An unexpected drop in recognition accuracy was observed in the placebo group after 3 months of treatment, whereas performance in the raloxifene group increased slightly (Table 1). The nature of this drop in accuracy scores is unclear, but a number of explanations are possible. First, although group size of the current study is large to fMRI standards, it is rather small in terms of behavioral studies. Such small groups may show random differences in mean performance measures (Wilkinson and Halligan, 2004). Second, a drop in recognition accuracy at time 2 (retest phase) with respect to time 1 (test phase) is a regularly observed phenomenon in test–retest studies. Such effects are attributed to changes in context between test- and retest phases, and may involve ‘true’ decreases in sensitivity to old or new items as well as a change in response bias (ie subjects perform worse when they are insecure, or in an unfamiliar environment) (Feenan and Snodgrass, 1990). We therefore considered the possibility that a change in response bias was responsible for the observed decline in performance accuracy in the placebo group. A trend was observed for a difference in response bias (Br) in the placebo group (p=0.08), but no significant effects of raloxifene vs placebo treatment were found. The possibility, therefore, exists that a change in response bias explains the observed decline in performance in the placebo group, but group size may not have been large enough to produce a significant difference. A change in response bias may reflect the subject's sensitivity to interference by changes in context between encoding and retrieval phases, practicing and scanning stages, or between first and second scanning sessions (eg no practicing round was held the day before the second scanning session). Since test items did not overlap between different test versions and the factor ‘testversion’ was no significant confounder of task performance scores, we consider it unlikely that specific test items or the use of different test versions acted as significant distracters. Since both groups were matched with respect to age, gender, and education level, we considered problems of group matching to be an unlikely source of performance differences between raloxifene and placebo groups. Additionally, a pathological decline of memory performance in the placebo group was considered unlikely, since these changes in performance occurred over a relatively short period of time (3 months) in otherwise healthy subjects (Goekoop et al, 2005). Future studies may want to include more subjects and perform more detailed assessment of cognitive status (including measures of context-dependency) in order to evaluate the effects of pharmacological substances on brain function and mental performance.

Effects of Raloxifene Treatment: Functional–Behavioral Relationships

Previous studies have shown that (small) behavioral changes do not necessarily translate into functional effects observed using fMRI at current group sizes, and vice versa (Wilkinson and Halligan, 2004). To examine whether performance changes in each group separately were in any way related to the observed functional effects of treatment, we studied the separate contributions of raloxifene and placebo groups to the overall pattern of treatment effects on brain function during recognition. Treatment with placebo had no significant effect on brain activation. Indeed, treatment effects were mainly due to effects of raloxifene intake (Figure 2a and b). This indicated that the drop in performance observed in the placebo group was not a significant confounder of functional effects of raloxifene treatment vs placebo as reported in this study (Figure 2c and d). Similarly, we examined the separate contributions of both groups to effects of raloxifene intake vs placebo on brain activation during encoding in our previous study (Goekoop et al, 2005). Only the raloxifene group showed significant increases in brain activation in treatment-related areas, again indicating that placebo intake and subsequent performance changes were not a significant confounder of treatment effects during encoding.

Although further studies are necessary to corroborate our current findings, our data are consistent with the hypothesis that a ‘normal’ decrease in performance accuracy as a result of the subjects’ sensitivity to a difference in context between test- and retest phases was countered by an active process related to raloxifene treatment (Goekoop et al, 2005). A reduction in the variability of recognition accuracy scores has been observed previously in elderly women receiving estrogen treatment (Wegesin and Stern, 2004). Additionally, a canceling of learning effects has been observed in male subjects receiving testosterone injections (Wolf et al, 2000), which may involve estrogen-mediated mechanisms (Wolf, 2003; Longcope et al, 1969). Such effects may involve effects of estrogens on arousal and attentional levels rather than direct effects on episodic memory. Estrogens are thought to influence brain function through a context-dependent effect on cortical arousal (Morgan et al, 2004). Such effects involve well-documented effects of estrogens on the four primary neuromodulatory neurotransmitters that regulate cortical arousal states (ie serotonin, dopamine, nor-adrenaline, and acetylcholine) (Bernardi et al, 2003; Korol, 2004). An increase in cholinergic (or noradrenergic) arousal is known to reflect an increase in signal-to-noise levels in neural networks, which translates into enhanced attention and working memory performance in animals and humans (Sarter et al, 2005). This mechanism of action is thought to be an important factor underlying improvement of memory performance in patients with AD that are treated with cholinesterase inhibitors (Sarter et al, 2005). Similarly, raloxifene treatment may have improved memory performance by enhancing cortical arousal (signal-to-noise-levels) during initial encoding, thereby reducing sensitivity to interference as a result of context-changes. Future studies may want to examine effects of raloxifene treatment on physiological measures of arousal and attention in order to examine this hypothesis in more detail.

Effects of Raloxifene Treatment: Encoding Vs Recognition

During face encoding, signal intensity during task performance was enhanced symmetrically across a wide range of neocortical areas (Goekoop et al, 2005). This rather generalized enhancement of cortical brain activation during face encoding was interpreted as a global effect of raloxifene intake on cortical arousal, rather than a specific effect of raloxifene treatment on all cognitive domains represented by this pattern (Goekoop et al, 2005). Treatment effects observed in the current study showed both similarities and differences with treatment effects during initial encoding. During both phases of memory performance, raloxifene intake produced increases in brain activation. Although decreases may have been possible in theory, these were not observed. The reason for the absence of signal decreases is unclear, but may reflect effects of increased arousal, since enhanced arousal levels increase rather than decrease the reactivity of neural networks (Coull, 1998; Morgan et al, 2004). In contrast to face encoding, however, treatment effects during face recognition did not involve a widespread increase of main effects of task performance in cortical structures. Direct effects of cortical arousal may, therefore, not have been relevant to treatment effects during face recognition. Indeed, a recent study shows that noradrenaline and neural steroids such as cortisol may modulate memory consolidation by interacting with arousal levels at initial encoding, rather than recognition (Cahill and Alkire, 2003). Although raloxifene intake may have influenced brain function during recognition independently of encoding, it is possible that the observed effects of raloxifene treatment during recognition represent indirect, or ‘downstream’ effects of treatment effects during initial encoding, or consolidation processes occurring between the two tasks. Clearly, more research is needed to examine the neurodynamic changes underlying the observed effects of treatment, along with the possible interactions between treatment effects occurring at different stages of memory performance.

The current study shows that raloxifene treatment eventually affects brain function during memory retrieval. Since identical procedures of functional data analysis were followed, the asymmetric distribution of treatment effects across encoding and retrieval phases of memory performance suggests that the effects of raloxifene, like the effects of many other pharmacological substances, are process-specific (Honey and Bullmore, 2004). A direct statistical comparison of treatment effects between encoding and recognition phases was not performed, however, since these tasks were too dissimilar (ie block vs event-related designs) to allow meaningful comparisons. We therefore report our findings of the effects of cholinergic challenge separately for both memory tasks. Although encoding processes during attempted retrieval may differ from encoding during attempted encoding (Reber et al, 2002; Rombouts et al, 2001), studies of brain function during retrieval allow analyses of encoding and retrieval processes within the same scanning session (see Methods), which avoids some of the potential confounds that may be introduced by across-session comparisons (eg differences in task design, subject positioning, motion artefacts).

Significant effects of raloxifene treatment were found on brain activation during TP decisions (familiar items), but not on TN decisions (unfamiliar items) (Figure 2c). Although this may indicate a preference for raloxifene treatment to influence successful recognition rather than encoding during attempted retrieval, the small number of items analyzed per subject somewhat limit these findings (Table 1). Indeed, when the threshold for significant brain function was lowered for contrast images representing treatment effects during TN decisions, treatment effects were found that were nearly identical to those observed for TP decisions (Figure 2d). This suggested that raloxifene treatment affected more general aspects of recognition memory function that are examined equally by ‘loose’ comparisons of brain function during TP and TN decisions vs fixation (see Methods). In order to explore this possibility in more detail, we studied the effects of raloxifene on ‘tight’ TP 〈〉 TN comparisons of brain function, which examine specific encoding and retrieval processes during task performance (see Methods). No significant effects of raloxifene treatment were found, supporting the view that raloxifene treatment affected general aspects of recognition memory performance, rather than specific subcomponent processes during retrieval. Brain areas showing treatment effects for TP items included the right inferior frontal gyrus and left parahippocampal cortex (Figure 2c, Table 3). These structures are important to retrieval of episodic (visuospatial) information after some delay (Schacter and Wagner, 1999; Fletcher and Henson, 2001; Rugg et al, 2002). Enhanced activation of these structures may reflect direct effects of estrogen receptor stimulation on neural signaling and brain function (Bisagno et al, 2003), or sustained effects of estrogen receptor stimulation, which alters protein synthesis and enhances the outgrowth of neural spines in hippocampal areas, which is known to be associated with increased memory performance (Li et al, 2004). Future studies may want to examine the relationships between functional effects of treatment and changes in (parahippocampal) spine density in vivo, by combining phMRI and molecular imaging techniques (eg PET). Based on the current results, we predict an effect of raloxifene treatment on delayed (visuospatial) memory performance in males.

Effects of Raloxifene Treatment: Blood Vs Brain

In the absence of direct measurements of vascular changes (eg perfusion studies), only indirect arguments may serve to locate treatment effects on blood oxygenation level-dependent (fMRI) signal reactivity more precisely to either the neural or vascular compartment (Honey and Bullmore, 2004). In the current study, vascular effects seemed less likely than neurogenic effects for several reasons. First, effects on blood vessels are likely to be similar across different mental processes, yet treatment effects differed between encoding and retrieval stages of memory performance. Second, effects on vascular tissue are likely to be generalized instead of localized, yet treatment effects were localized in our studies. Even during encoding, which showed a widespread pattern of signal enhancement, there were some highly vascularized and active brain structures (eg primary visual cortex) that did not show enhancement of signal intensity after raloxifene treatment. Finally, treatment effects were found in functionally meaningful areas only (eg not in white matter). Together, these observations strongly suggest that the effects of raloxifene treatment were of neurogenic rather than of vascular origin.

Summary and Conclusions

Raloxifene treatment enhanced brain activation in male subjects during recognition of familiar items. A possible treatment mechanism involves enhanced memory retrieval as a result of increased cortical arousal during initial encoding, which reduces effects of context-dependency. Similar effects may underlie the ability of long-term raloxifene treatment to delay the onset of MCI in elderly women. Our combined studies on the effects of raloxifene treatment on episodic memory performance in male subjects predict the occurrence of small behavioral effects on working memory and delayed (visuospatial) memory performance, executive functions and verbal skills, which are secondary to enhanced arousal and attention during initial encoding. Further neuropsychological studies involving larger groups of subjects are necessary to test the validity of these predictions.