Article | Open

# CA1 and CA3 differentially support spontaneous retrieval of episodic contexts within human hippocampal subfields

• Nature Communicationsvolume 9, Article number: 294 (2018)
• doi:10.1038/s41467-017-02752-1
Accepted:
Published online:

## Abstract

The hippocampus plays a critical role in spatial and episodic memory. Mechanistic models predict that hippocampal subfields have computational specializations that differentially support memory. However, there is little empirical evidence suggesting differences between the subfields, particularly in humans. To clarify how hippocampal subfields support human spatial and episodic memory, we developed a virtual reality paradigm where participants passively navigated through houses (spatial contexts) across a series of videos (episodic contexts). We then used multivariate analyses of high-resolution fMRI data to identify neural representations of contextual information during recollection. Multi-voxel pattern similarity analyses revealed that CA1 represented objects that shared an episodic context as more similar than those from different episodic contexts. CA23DG showed the opposite pattern, differentiating between objects encountered in the same episodic context. The complementary characteristics of these subfields explain how we can parse our experiences into cohesive episodes while retaining the specific details that support vivid recollection.

## Introduction

Considerable evidence suggests that the hippocampus is essential for episodic memory and that it plays a particular role in binding information about items and the context in which they were encountered1,2. Most mechanistic models suggest that the hippocampal subfields play complementary roles in spatial and/or episodic memory3,4,5,6,7. Although these models generally predict large differences between neural coding in CA1 and CA3, between-subfield differences at the level of single units in rodents and in overall activity in human fMRI studies have been relatively modest. Indeed, both CA1 and CA3 have been implicated in representations of temporal8,9 and spatial10,11 contextual information. One suggestion12 is that the different computations supported by CA3 and CA1 should be most apparent when one analyzes the population-level activity patterns elicited by different contexts—whereas CA3 should differentiate between specific experiences in the same context and CA1 should globally differentiate between different contexts.

One way to do this is with pattern similarity (PS) analyses. We can think of PS analyses as capturing the activity across voxels that act as spatiotemporal filters sampling from blood vessels, which, in turn, respond to fluctuations in activity from large populations of neurons13. Voxel PS analysis is analogous to population vector analysis approaches used in single unit recordings (e.g., ref. 14). If we assume that activity in each voxel reflects the outcome of vascular sampling of neural responses from large, distributed populations, then the voxel pattern is a filtered, macro-scale analog of the neural population vector. Just as different population vectors reflect changes in the relative firing rates across different neurons across two conditions, different voxel patterns may reflect changes in the underlying population-level response across two conditions.

Here, we used high-resolution fMRI and multivariate analysis methods to test how different hippocampal subfields contribute to representations of spatial and episodic context. We designed a virtual reality environment consisting of two houses (spatial contexts; Fig. 1). After becoming familiarized with the spatial layouts of each house, participants viewed a series of 20 videos (episodic contexts) depicting first-person navigation through each house while they encountered a series of objects. Each object was studied only once and was uniquely placed in a single house within a single video. Following this study phase, we scanned participants while they performed an item recognition test that required them to differentiate between studied and novel objects. Although the items were displayed without any contextual information, based on cognitive models of recognition memory and models of human hippocampal function, we predicted that recollection-based item recognition should trigger reactivation of information about the context in which that item was encountered1,2. Accordingly, we tested whether multi-voxel patterns elicited during item recognition carried information about the associated spatial (house) or episodic (same house and video) context.

## Results

### Behavioral results

Behavioral data are presented in Supplementary Table 1. Recognition memory performance was indexed by evaluating responses to new and old items in the object recognition test completed during MRI scanning. Accuracy was high for both hits (“remember” hit rate = 0.68 [SD = 0.17]; “familiar” hit rate = 0.25 [SD = 0.15]) and correct rejections (correct rejection rate = 0.90 [SD = 0.04]). False alarms were not present for the majority of participants (N = 17 with no “remember” false alarms; N = 3 with no “familiar” false alarms), therefore we did not compute discriminability indices. Remember rates varied by spatial context, t(22) = 3.33, p = 0.003, d = 0.21, such that recognition was slightly lower for items that had been studied in the gray house.

Because participants’ responses in the scanner were based on subjective measures of recollection, we directly assessed memory for the spatial context associated with each item in a test administered immediately after MRI scanning. Overall, spatial context memory was high (mean spatial context memory hit rate = 0.71 [SD = 0.11]; Supplementary Table 1), and did not reliably vary between the two spatial contexts, t(22) = 1.06, p = 0.30, d = 0.19. Consistent with the idea that item recollection involves activation of associated information, memory for spatial context was higher for items endorsed as remembered than for those endorsed as familiar, F(1, 22) = 74.72, p < 0.001, ηp2 = 0.51.

### fMRI results

Many standard-resolution and high-resolution fMRI studies have reported enhanced hippocampal activity during successful recollection and/or retrieval of spatial source information (for a review, see ref. 15). Consistent with these prior studies, whole-brain voxel-wise comparisons revealed increased activity in the left hippocampus during recollection hit trials as compared to activity during familiarity hit trials and misses (FWE corrected p < 0.05; Supplementary Fig. 2a, Supplementary Table 2). Estimated hemodynamic response functions for each subfield for this contrast as well as univariate estimates of subfield activity for each spatial context can be found in Supplementary Figs. 3 and 4, respectively. Outside of the hippocampus, activation in medial prefrontal cortex and other regions in the “core recollection network”16 was increased during recollection trials as compared to familiar hits and misses (Supplementary Fig. 2B). These results suggest a general role for the hippocampus in item recollection in addition to the differential roles the subfields play to support memory for contextual information.

Having established hippocampal recruitment for item recollection, we proceeded to investigate whether hippocampal activity patterns carried information related to spontaneous retrieval of spatial and episodic contexts for these recollected trials.

As shown in Fig. 1, we estimated single-trial multi-voxel patterns within regions of interest (ROIs) corresponding to CA1 and a combined CA2/CA3/dentate gyrus (CA23DG) subregion within the body of hippocampus. Distributions of single-trial activity estimates are included in Supplementary Figure 6 and representative PS values are shown in Supplementary Figure 7. Specifically, we computed voxel PS between trial pairs for recollected items that shared the same episodic context (i.e., same video/same house), shared the same spatial context but with different episodic contexts (different video/same house), or were associated with different episodic and spatial contexts (different video/different house). To maximize the likelihood of identifying trials that were associated with successful context retrieval, we restricted analyses to trials that were associated with correct recollection-based item recognition and correct identification of spatial context (house) in the post-scan context memory test.

To test whether regions carried information about an item’s encoding context (spatial/house, episodic/house and video), we fitted a mixed model with a random effect of subject17 testing for effects of ROI (CA1, CA23DG), context similarity (same episodic, same spatial, different context), and hemisphere (left, right), as well as their interactions on PS values. There was a significant ROI × context similarity  × hemisphere interaction (χ2(2) = 13.30, p = 0.001). Follow up analyses revealed that this was driven by a reliable interaction between ROI and context similarity in left (χ2(2) = 15.64, p < .001) but not right (χ2(2) = 1.65, p = 0.44; see Supplementary Figure 5). To further break down this interaction, we conducted separate analyses restricted to left hemisphere in our ROIs to assess representation of context similarity.

To investigate whether regions carried information about an item’s spatial encoding context (house), we compared PS values for items that had been studied in the same house or different house. In the same house condition, we eliminated trial pairs that had been studied within the same video to ensure that any observed effects could uniquely be attributed to spatial context similarity and not episodic context similarity. Based on traditional models3,18, we expected to see greater PS for same-house as compared to different-house pairs in CA23DG but not in CA1. As can be seen in Fig. 2, neither CA23DG (χ2(1) = 0.08, p = 0.78) nor CA1 (χ2(1) = 0.03, p = 0.86) systematically differed in their representation based on an item’s spatial context. These results indicate that neither CA1 nor CA23DG were differentially sensitive to spatial context similarity alone.

To investigate whether activity patterns in hippocampal subfields carried information about episodic context, we compared PS values between pairs of trials that were studied in the same video (which necessarily meant that that the items had also been studied in the same spatial context) against PS values for pairs of trials that were studied in different videos that depicted the same spatial context. Some models3,12 predict that CA1 should treat items from the same episodic context as more similar to one another than CA23DG. Indeed, in CA1, activity patterns were more similar across pairs of items from the same episodic context than across pairs from different videos (χ2(1) = 6.50, p = 0.01). Intriguingly, CA23DG showed the reverse pattern; that is, PS was significantly lower for items in the same video than for items in different videos (χ2(1) = 10, p = 0.002). We observed a significant interaction, (χ2(1) = 15.45, p < 0.001; Fig. 2), indicating that the PS profiles of CA1 and CA23DG were qualitatively different for episodic context similarity.

A control analysis was performed to ensure that our results could not merely be explained by the number of trials in each condition since the same video/same house condition necessarily had the smallest number of trial pairs contributing to the PS analysis. To do this, we took the condition with the fewest number of trial pairs on a subject-by-subject basis and randomly selected a matching number of trial pairs from all other conditions. For example, if a participant had 152 trial pairs in the same-video/same-house condition, we randomly selected 152 trial pairs for all other conditions by generating a random sequence of numbers of length 152 and then using this random sequence to index trial pairs for selection in the mixed modeling of PS. We repeated this simulation 1000 times and looked to see whether our observed χ2 values for the critical interaction of ROI × context similarity × hemisphere interaction were significant at a threshold of p < 0.05 (Supplementary Figure 9). Across nearly all of these random samples of trials, the effect was statistically significant.

We next investigated whether the effects of episodic context on PS were driven by just a few influential voxels. To better characterize the nature of the observed patterns, we repeated our analyses dropping five influential voxels in three different ways. First, we identified voxels that consistently had the largest absolute magnitude of response across all trials. These voxels could drive the PS result by increasing the magnitude of the observed correlations. Second, we identified voxels that had the largest standard deviation in their response across all trials. These voxels could contribute to PS differences due to their variability, thereby driving the observed between-condition differences. Third, we identified voxels that had the largest mean squared differences between trial pairs in the same video/same house and different video/same house conditions. We defined voxel variability relative to these two conditions since we observed a significant difference in the ROIs only in the episodic context condition. These voxels could lead to inflated PS and the observed between-condition differences due to systematic variability between the contexts. Across all three approaches, dropping influential voxels did not change the observed pattern of results.

An additional control analysis was performed to ensure that differences in PS on the basis of context similarity did not simply reflect differences in reaction times between conditions19. As suggested by Todd et al.19, we re-modeled our data including a random effect of the reaction time difference in trial pairs that went into the PS analyses. Results remained unchanged after removing the effect of reaction time differences for all comparisons of interest (Supplementary Figure 10).

To show the specificity of these effects to CA1 and CA23DG, we repeated our PS analyses using entorhinal cortex and subiculum instead (Supplementary Figure 8). These regions are in the hippocampal formation, but not considered to be subfields of the hippocampus proper. For episodic context similarity, neither entorhinal cortex (left: ($χ$2(1) = 0.09, p = 0.77; right: ($χ$2(1) = 0.51, p = 0.47) nor subiculum (left: ($χ$2(1) = 0.15, p = 0.70; right: ($χ$2(1) = 3.75, p = 0.053) differed in their representation based on an object’s episodic context. For spatial context similarity, neither entorhinal cortex (left: ($χ$2(1) = 0.64, p = 0.42; right: ($χ$2(1) = 1.79, p = 0.18) nor subiculum (left: ($χ$2(1) = 0.15, p = 0.70; right: ($χ$2(1) = 0.08, p = 0.77) were differentially sensitive to spatial context alone.

## Discussion

Our results reveal striking differences in retrieval of contextual information across the hippocampal subfields and provide a rare statistical dissociation between CA1 and CA23DG. In CA1, activity patterns were more similar across trials that involved recollection of the same episodic context (same video) than across trials that involved retrieval of different episodic contexts (different videos); in CA23DG, PS was lower between trials that were associated with the same episodic context than between trials that were associated with different episodic contexts. These results are consistent with the idea that CA1 represents global contextual regularities across items (“pattern completion”), whereas CA23DG exaggerates differences between items that have competing associations within the same episodic context (“pattern separation”). Together, CA1 and CA23DG can play complementary roles in supporting episodic memory by allowing one to remember specific items, as well as their relationships, within a shared context.

Differences between the subfields are a prominent component of models of subfield function3,12, which, in turn, are based on anatomical differences in the inputs, connections, and firing properties of the subfields20. Studies in rodents have reported differences between spatial coding in CA1 and CA3 following changes in the environment21,22, when learning spatial tasks such as the watermaze23, in distinguishing non-spatial sequences24, and in contextual fear conditioning paradigms25. However, we are not aware of any prior findings in humans that have reported dissociations between the roles of CA1 and CA23DG during spontaneous retrieval of contextual information.

One previous human fMRI study reported a statistical dissociation between CA1 and CA23DG as participants monitored the spatial layouts of cities that varied in similarity26. Stokes et al. found that CA23DG represented cities with the same layout as more similar than cities with different layouts whereas CA1 was not sensitive to the change in layouts26. One critical difference between their study and ours is that Stokes et al.26 directly assessed memory for spatial layouts, whereas we assessed incidental retrieval of contextual information during item recognition. Our results challenge traditional notions that CA3 is particularly sensitive to changes in spatial contexts12,26 in that PS values were not sensitive to spatial context information alone. The findings indicate that, although space may be important in defining an episodic context, spatial context alone does not account for differences between representations in CA1 and CA23DG during memory retrieval.

The present results converge with other findings10 showing that, when spatial information alone is insufficient to resolve context, spatial information does not drive representations within CA3. Our findings also accord with Leutgeb and Leutgeb’s12 proposal that CA3 can differentiate between different sensory cues that are encountered in the same place.

Our findings dovetail with extant evidence that CA1 plays a critical role in representing time8,27,28,29, in representing sequences of ordered information24,30,31,32,33,34,35,36, and in using this temporal information to define episodes37,38,39 under demands that mimic a realistic, real-world episodic context. Additionally, our findings support the idea that CA1 is critical for distinguishing similar episodic contexts3, given that we saw greater PS in CA1 for items within the same video as compared with those that occurred between videos.

The fact that CA23DG showed lower neural similarity for items encountered during the same video, as compared to items encountered during different videos, seems at odds with theories proposing a critical role for CA23DG in episodic memory retrieval3. Examined more closely, however, the results align with recent findings indicating that the hippocampus differentiates between related information in an episode40,41,42,43. For instance, building on the theory of Marr18, it has been argued that dentate gyrus and CA3 work together to distinguish related information (i.e., “pattern separation”; for a review see ref. 44). Our finding that CA23DG is more likely to individuate objects within a video—resulting in lower neural similarity—is consistent with the idea that CA3 generally pushes apart representations of similar items (see also refs. 45,46,47 for related findings).

Another possibility is that the coding scheme of CA23DG may be flexible based on the elements that have priority in the task4. In some tasks, such as those involving spatial navigation, distinguishing between competing spatial representations may be essential to correct navigation and subsequent memory performance46, and, thus, CA23DG may be pushed toward orthogonalization. Successful performance on our task requires one to differentiate between representations of items encountered in the same video that share a spatial environment. However, if we could construct a task in which there were lower demands to orthogonalize item-specific features, we might expect CA23DG to show a representational scheme more consistent with pattern completion. That is, we would expect CA23DG to show increased similarity for items in the same context relative to items encountered in different contexts.

In one recent demonstration of the flexibility of coding schemes in CA23DG, Aly et al.48 asked participants to deploy attention either to objects within a context (pieces of art) or to the context itself (the layout of the room). Critically, stimuli were identical between the conditions but participants’ attentional state varied based on the task demands. They found that CA23DG, but not CA1, showed a greater match in its profile of activity to the task-relevant attention state. This task-driven modulation of activity was related to increased subsequent memory for incidentally encoded information48. In a related study, they replicated the finding that attention state-specific patterns of activity in CA23DG were uniquely related to subsequent memory for the studied items49. Such findings indicate that the coding scheme in CA23DG can change according to task demands, and this can have consequences for later memory performance.

Both temporal and spatial details are key defining features of episodic memories50, but, until now, it was not clear how these spatiotemporal contexts are represented by the hippocampus during retrieval. CA1 and CA23DG exhibited activity patterns that were sensitive to retrieved episodic context information in the absence of memory differences for the objects themselves. The complementary characteristics of these subfields explain how we can parse our experiences into cohesive episodes while retaining the specific details that support vivid recollection7,51,52,53.

## Methods

### Participants

Twenty-eight participants took part in the study. Of these, one failed to complete their MRI session due to technical malfunctions with the scanner. An additional four subjects were excluded due to not having at least two usable runs of fMRI data (either due to excessive motion, N = 2, exiting the MRI scanner between runs, N = 1, or low behavioral performance resulting in correctly recollected trials only in a single run, N = 1). Analyses presented are from the remaining twenty-three participants (Nfemale = 11, mean age = 19.5 years). The study was approved by the Institutional Review Board of the University of California, Davis, and all participants provided written informed consent at the time of the study.

### Stimuli and materials

Study materials included two virtual homes created in Google SketchUp (https://www.sketchup.com/, version 15.3.329). Homes were matched for total virtual square area and subdivided into two rooms. Each house contained ten pieces of landmark furniture that shared semantic labels (e.g., “couch”) but differed in appearance (e.g., angular gray couch vs. plush green couch) between the two homes. Exterior color, wall color, room orientation, and decoration style also differed between houses (Supplementary Fig. 1).

Three-hundred neutral objects (e.g., football helmet, suitcase, teddy bear, vase, and phone booth) were selected from the Google SketchUp image library. Two-hundred and forty of these objects were randomly selected and positioned within the homes in sets of ten objects. Object assignment to homes was random and was house- and video-unique so that 12 lists of objects were assigned to each home. To determine object placement, rooms were divided into eighths, all possible combinations of five positions were generated, and objects were randomly assigned to one of these combinations. Thus, object configurations within each room (and thus within each house) were video-unique.

Videos depicting trajectories through the houses were generated using Google SketchUp’s animation feature. Videos were exported for each house both with landmark furniture only and for each of the 12 videos with objects within each home. Trajectories did not change between videos within a home. Each video was ~1 min 40 s in duration.

### Procedure

After providing informed consent, participants practiced the four phases of the task using an example video recorded in the experimenter’s (HDZ’s) house. Phase one consisted of context familiarization, phase two object encoding, phase three object recognition (scanned), and phase four location recognition.

In the context familiarization phase, participants passively viewed videos of the two homes with landmark furniture but devoid of any other objects. Androgynous names (“Alex”, “Jamie”) were randomly assigned to each house and a label with the owner’s name appeared above the video for all presentations (e.g., “Jamie’s House”). After watching the video tours of both Alex and Jamie’s houses twice, participants were presented with a blank map that included rectangles where the houses were located and the street between the houses but no other contextual details. Maps were generated for the purposes of strengthening participants’ representations of the houses (spatial contexts). Participants were given up to 10 min to draw maps of each house including the location of doors, walls, furniture, and the owner’s name associated with each house to ensure thorough knowledge of the spatial layouts. Before progressing to the object encoding phase, the experimenter reviewed the accuracy of their maps using a transparency overlay and corrected any mistakes.

Following context familiarization, participants progressed to the object encoding phase. During this phase, participants viewed a series of 20 videos. Each video depicted passive navigation through one of the two homes, with ten different objects placed along the trajectory. At the end of each video, a still frame of each of the ten objects in situ in the house appeared one at a time in random order for 4 s while the participant judged whether the object was worth more than \$50 (yes/no). Responses were recorded via keypress but were not analyzed. After making this value judgment, the still frame of the object was replaced by a birds-eye-view perspective map of the house where each room was divided into quadrants with numeric labels. Participants indicated the quadrant number where the object had been located during the video via keypress (1–8). Again, responses were recorded but not analyzed as the purpose of these encoding judgments was to solidify participants’ memory both for object identities and their locations. In total, participants saw ten videos in each house presented such that the order of the houses alternated across even and odd videos (e.g., house1/video1, house2/video2, house1/video3, house2/video4, etc.). The ten videos were randomly selected from the pool of 12 videos in each house and the order of video presentation was uniquely randomized for each participant. This phase took roughly 1 h to complete.

The object recognition test phase took place across four runs in the MRI scanner following acquisition of structural scans. Sixty-three still images were presented in each run for 3 s with a jittered inter-trial interval ranging from 2 to 8 s. All 200 studied objects were presented with an additional 52 new, unstudied objects. New objects were randomly selected from a pool of objects that were not presented in either house. While the image was on the screen, participants made recognition judgments (“remember”, “feels familiar”, “new”) via an MRI-compatible button box. Participants were instructed to make a “remember” response when they could bring to mind a specific detail from when they had studied the object, “feels familiar” if they thought they had studied the object but were unable to retrieve a specific detail, and “new” for objects that they did not think they had studied during object encoding. Critically, no house or video information was re-presented to participants in these object images. Following four runs of the object recognition task, participants additionally completed a spatial localizer task in which they saw short videos (15 s) and still frames from both houses with landmark furniture but no objects. The results of this scan will not be discussed further here.

During the final phase of the experiment, participants were brought back to the lab to complete a spatial context memory test. In this phase, participants were re-presented with the 200 studied objects, and they were asked to recall where (house and room) each object had been studied (Alex room 1, Alex room 2, Jamie room 1, Jamie room 2; names were presented in the same orientation as the houses). Images remained on the screen for 3 s while participants made their response. Data from this phase were used to ensure that item memory did not differ as a function of spatial context.

### fMRI acquisition and pre-processing

Scans were acquired on a Siemens Skyra 3T scanner with a 32 channel head coil. Two sets of structural images were acquired to enable subfield segmentation: A T1-weighted magnetization prepared rapid acquisition gradient echo (MP-RAGE) pulse sequence image (1 mm3 isotropic voxels) and a high-resolution T2-weighted image (TR = 4200 ms; TE = 93 ms; field of view = 200 mm2; flip angle = 139°; bandwidth = 199 Hz/pixel; voxel size = 0.4 × 0.4 × 1.9 mm; 58 coronal slices acquired perpendicular to the long-axis of the hippocampus). High-resolution functional (T2*) images were acquired using a multiband gradient echo planar (EPI) imaging sequence (TR = 2010 ms; TE = 25 ms; field of view = 216 mm; image matrix = 144 × 152; flip angle = 79°; bandwidth = 1240 Hx/pixel; partial phase Fourier = 6/8; parallel imaging = GRAPPA acceleration factor 2 with 36 reference lines; multiband factor = 2; 52 oblique axial slices acquired parallel to the long-axis of the hippocampus slices; voxel size = 1.5 mm3 isotropic).

SPM8 (http://www.fil.ion.ucl.ac.uk/spm/) was used for image pre-processing. Functional EPI images were realigned to the first image and resliced. No slice timing correction was performed due to the acquisition of multiple simultaneous slices with the multiband sequence. Co-registration between the native-space ROIs defined in T2 space and the functional images was done with SPM’s Coregister: Estimate and Reslice procedure. This procedure uses the normalized mutual information cost function between a reference (mean functional) image and source (T2) image to compute and apply a voxel-by-voxel affine transformation matrix. This transformation matrix was then applied to the subfield ROIs that had been defined in T2 space (see 'ROI segmentation') to bring them into register with the functional images (Supplementary Figure 11). The T1 image was co-registered to the mean EPI. Then, nonlinear spatial normalization parameters were derived by segmenting the coregistered T1 image. For the voxel-wise univariate analyses, these normalization parameters were applied to the contrast images to bring all images into MNI space before group analysis. Contrast images were additionally smoothed with a 3 mm FWHM Gaussian kernel. Quality assurance (QA) included identifying suspect timepoints via custom code (https://github.com/memobc/memolab-fmri-qa) defined as time-points in excess of 0.5 mm frame displacement (based on54 or 1.5% global mean signal change (based on ARTRepair recommendations55). Runs were excluded if the frame displacement exceeded the voxel size. Two participants were excluded for motion in excess of these thresholds and one run was excluded from another participant based on these criteria.

### ROI segmentation

Hippocampal subfield ROIs were segmented using the ASHS toolbox56 with the UPenn atlas. Segmented ROIs were co-registered to the mean functional image and split into masks for each ROI of interest. We took a conservative segmentation approach in which we combined subfields DG, CA3, and CA2 into a combined region based on prior high-resolution fMRI work26,57,58 despite ASHS segmentations for each subfield. Head, body, and tail were manually defined but subfield comparisons were limited to the body where boundaries between CA1 and CA23DG can be most clearly and reliably delineated59. Disappearance of the gyrus intralimbicus was used as the head/body boundary70 and presence of the wing of the ambient cistern demarcated body/tail61.

### Data analysis

ROI summary statistics, including PS analyses, were computed using custom code implemented in MATLAB r2014b (www.mathworks.com) and R version 3.3.2 (http://www.R-project.org). Statistical comparisons were conducted in R including linear mixed models implemented with lme462. Our mixed models included fixed effects of condition, ROI, and hemisphere as well as a random subject intercept. P-values were obtained by comparing a full model with the effect of interest against a reduced model without this effect and quantified with Chi-squared likelihood ratio tests17,60,63,64,65. Cohen’s d effect sizes for within-subject paired samples t-tests were computed correcting for correlation between means as suggested by Dunlap et al.66.

### Univariate fMRI analyses

Six conditions of interest (remember and familiar hits split by house, correct rejections, false alarms) were modeled with event-related stick function regressors. Six motion parameters and additional spike regressors as identified by our QA scripts were also included in the model. Trials of no interest (e.g., where no response was made) were included as a nuisance regressor. “Recollection” trials were defined as trials on which a subject correctly endorsed remembering an item that had been presented during encoding. “Familiarity” trials were defined as trials on which a subject correctly endorsed having seen an item but did not endorse being able to remember any specific details about its initial presentation67. Because there were not enough familiarity trials to analyze this condition separately, comparisons of interest combined familiar hits with miss trials.

Contrast maps for recollection-related activity were computed by comparing the activation difference for recollection hit trials as compared to the combined activity for familiarity hit trials and misses. T-maps were cluster corrected at FWE < 0.05 using a combined voxel-wise threshold (p < 0.001) and cluster extent (k = 88).

### PS analyses

PS analyses13 were conducted on beta maps generated from unsmoothed data in native subject space. Following the procedure described by Mumford68,69, single trial models were generated to estimate the unique beta map for every trial in a run (N = 63). Within each single trial model, the first regressor modeled the trial of interest with a stick function, the second regressor modeled all other trials in that run, six regressors were used to capture motion, and any additional spike regressors as identified by our QA scripts were used to capture additional residual variance. Following beta estimation, outlier values were identified by setting a z-scored beta threshold of 0.7–0.85 based on visual inspection of the distribution of z-scored beta values for all subjects. This resulted in an average of 9.87% (mean = 6.22 trials, SD = 7.10 trials) excluded beta values per run for each participant.

Voxel-wise patterns of hemodynamic activity were separately extracted for each ROI from the single trial beta images. Within each ROI, correlations (Pearson’s r) were computed between these trial-wise betas to yield a trial-by-trial correlation matrix that related each voxel’s signal on a trial to all other trials across all runs. We restricted comparisons to those trials for which participants made a correct “remember” response (during MRI scanning) and correctly identified an item’s spatial (house) context (post-MRI spatial source task). Statistical analyses tested for differences in correlations between trial pairs on the basis of encoding context (same vs. different house, same vs. different video within a house). Only between-run correlations were used to maximize the number of possible trial pairs without mixing within- and between-run correlations. Trial pairs of interest were extracted from these trial-by-trial correlation matrices. For all conditions, we restricted our trial pairs to those where participants made both a correct remember response as well as a correct house source memory judgment.

### Data availability

Minimally processed data needed to regenerate analyses and figures can be found online (https://osf.io/5th8r/) as well as relevant analysis code (https://github.com/hallez/abcdcon_pub). All other data that support the findings of this study are available from the corresponding author upon reasonable request.

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## References

1. 1.

Davachi, L. Item, context and relational episodic encoding in humans. Curr. Opin. Neurobiol. 16, 693–700 (2006).

2. 2.

Eichenbaum, H., Yonelinas, A. P. & Ranganath, C. The medial temporal lobe and recognition memory. Annu. Rev. Neurosci. 30, 123–152 (2007).

3. 3.

Kesner, R. P. & Rolls, E. T. A computational theory of hippocampal function, and tests of the theory: new developments. Neurosci. Biobehav. Rev. 48, 92–147 (2015).

4. 4.

Guzowski, J. F., Knierim, J. J. & Moser, E. I. Ensemble dynamics of hippocampal regions CA3 and CA1. Neuron 44, 581–584 (2004).

5. 5.

Hasselmo, M. E. & Eichenbaum, H. Hippocampal mechanisms for the context-dependent retrieval of episodes. Neural Netw. 18, 1172–1190 (2005).

6. 6.

Levy, W. B. A sequence predicting CA3 is a flexible associator that learns and uses context to solve hippocampal-like tasks. Hippocampus 6, 579–590 (1996).

7. 7.

O’Reilly, R. C. & Norman, K. A. Hippocampal and neocortical contributions to memory: advances in the complementary learning systems framework. Trends Cogn. Sci. 6, 505–510 (2002).

8. 8.

Kraus, B. J., Robinson, R. J., White, J. A., Eichenbaum, H. & Hasselmo, M. E. Hippocampal ‘time cells’: time versus path integration. Neuron 78, 1090–1101 (2013).

9. 9.

Salz, D. M. et al. Time cells in hippocampal area CA3. J. Neurosci. 36, 7476–7484 (2016).

10. 10.

Kyle, C. T., Smuda, D. N., Hassan, A. S. & Ekstrom, A. D. Roles of human hippocampal subfields in retrieval of spatial and temporal context. Behav. Brain Res. 278C, 549–558 (2015).

11. 11.

Lee, I., Jerman, T. S. & Kesner, R. P. Disruption of delayed memory for a sequence of spatial locations following CA1- or CA3-lesions of the dorsal hippocampus. Neurobiol. Learn. Mem. 84, 138–147 (2005).

12. 12.

Leutgeb, S. & Leutgeb, J. K. Pattern separation, pattern completion, and new neuronal codes within a continuous CA3 map. Learn. Mem. 14, 745–757 (2007).

13. 13.

Kriegeskorte, N., Mur, M. & Bandettini, P. Representational similarity analysis - connecting the branches of systems neuroscience. Front. Syst. Neurosci. 2, 4 (2008).

14. 14.

McKenzie, S. et al. Hippocampal representation of related and opposing memories develop within distinct, hierarchically organized neural schemas. Neuron 83, 202–215 (2014).

15. 15.

Diana, R. A., Yonelinas, A. P. & Ranganath, C. Imaging recollection and familiarity in the medial temporal lobe: a three-component model. Trends Cogn. Sci. 11, 379–386 (2007).

16. 16.

Johnson, J. D. & Rugg, M. D. Recollection and the reinstatement of encoding-related cortical activity. Cereb. Cortex 17, 2507–2515 (2007).

17. 17.

Gordon, A. M., Rissman, J., Kiani, R. & Wagner, A. D. Cortical reinstatement mediates the relationship between content-specific encoding activity and subsequent recollection decisions. Cereb. Cortex 24, 3350–3364 (2014).

18. 18.

Marr, D. Simple memory: a theory for archicortex. Philos. Trans. R. Soc. Lond. B Biol. Sci. 262, 23–81 (1971).

19. 19.

Todd, M. T., Nystrom, L. E. & Cohen, J. D. Confounds in multivariate pattern analysis: theory and rule representation case study. Neuroimage 77, 157–165 (2013).

20. 20.

Mizuseki, K., Royer, S., Diba, K. & Buzsáki, G. Activity dynamics and behavioral correlates of CA3 and CA1 hippocampal pyramidal neurons. Hippocampus 22, 1659–1680 (2012).

21. 21.

Lee, I., Rao, G. & Knierim, J. J. A double dissociation between hippocampal subfields: differential time course of CA3 and CA1 place cells for processing changed environments. Neuron 42, 803–815 (2004).

22. 22.

Roth, E. D., Yu, X., Rao, G. & Knierim, J. J. Functional differences in the backward shifts of CA1 and CA3 place fields in novel and familiar environments. PLoS ONE 7, e36035 (2012).

23. 23.

Gusev, P. A., Cui, C., Alkon, D. L. & Gubin, A. N. Topography of Arc/Arg3.1 mRNA expression in the dorsal and ventral hippocampus induced by recent and remote spatial memory recall: dissociation of CA3 and CA1 activation. J. Neurosci. 25, 9384–9397 (2005).

24. 24.

Farovik, A., Dupont, L. M. & Eichenbaum, H. Distinct roles for dorsal CA3 and CA1 in memory for sequential nonspatial events. Learn. Mem. 17, 12–17 (2010).

25. 25.

Ji, J. & Maren, S. Differential roles for hippocampal areas CA1 and CA3 in the contextual encoding and retrieval of extinguished fear. Learn. Mem. 15, 244–251 (2008).

26. 26.

Stokes, J., Kyle, C. & Ekstrom, A. D. Complementary roles of human hippocampal subfields in differentiation and integration of spatial context. J. Cogn. Neurosci. 27, 546–559 (2015).

27. 27.

MacDonald, C. J., Lepage, K. Q., Eden, U. T. & Eichenbaum, H. Hippocampal ‘time cells’ bridge the gap in memory for discontiguous events. Neuron 71, 737–749 (2011).

28. 28.

Mankin, E. A. et al. Neuronal code for extended time in the hippocampus. Proc. Natl Acad. Sci. USA 109, 19462–19467 (2012).

29. 29.

Pastalkova, E., Itskov, V., Amarasingham, A. & Buzsáki, G. Internally generated cell assembly sequences in the rat hippocampus. Science 321, 1322–1327 (2008).

30. 30.

Allen, T. A., Salz, D. M., McKenzie, S. & Fortin, N. J. Nonspatial sequence coding in CA1 neurons. J. Neurosci. 36, 1547–1563 (2016).

31. 31.

Chen, J., Cook, P. A. & Wagner, A. D. Prediction strength modulates responses in human area CA1 to sequence violations. J. Neurophysiol. 114, 1227–1238 (2015).

32. 32.

Hoge, J. & Kesner, R. P. Role of CA3 and CA1 subregions of the dorsal hippocampus on temporal processing of objects. Neurobiol. Learn. Mem. 88, 225–231 (2007).

33. 33.

Hunsaker, M. R., Fieldsted, P. M., Rosenberg, J. S. & Kesner, R. P. Dissociating the roles of dorsal and ventral CA1 for the temporal processing of spatial locations, visual objects, and odors. Behav. Neurosci. 122, 643–650 (2008).

34. 34.

Kesner, R. P., Hunsaker, M. R. & Ziegler, W. The role of the dorsal CA1 and ventral CA1 in memory for the temporal order of a sequence of odors. Neurobiol. Learn. Mem. 93, 111–116 (2010).

35. 35.

Manns, J. R., Howard, M. W. & Eichenbaum, H. Gradual changes in hippocampal activity support remembering the order of events. Neuron 56, 530–540 (2007).

36. 36.

Stark, E., Roux, L., Eichler, R. & Buzsáki, G. Local generation of multineuronal spike sequences in the hippocampal CA1 region. Proc. Natl Acad. Sci. USA 112, 10521–10526 (2015).

37. 37.

Brown, T. I., Hasselmo, M. E. & Stern, C. E. A high-resolution study of hippocampal and medial temporal lobe correlates of spatial context and prospective overlapping route memory. Hippocampus 24, 819–839 (2014).

38. 38.

Hunsaker, M. R., Lee, B. & Kesner, R. P. Evaluating the temporal context of episodic memory: the role of CA3 and CA1. Behav. Brain Res. 188, 310–315 (2008).

39. 39.

Wang, F. & Diana, R. A. Temporal context processing within hippocampal subfields. Neuroimage 134, 261–269 (2016).

40. 40.

Chanales, A. J., Oza, A., Favila, S. E. & Kuhl, B. A. Overlap among spatial memories triggers repulsion of hippocampal representations. Curr. Biol. 27, 2307–2317.e5 (2017).

41. 41.

Favila, S. E., Chanales, A. J. H. & Kuhl, B. A. Experience-dependent hippocampal pattern differentiation prevents interference during subsequent learning. Nat. Commun. 7, 11066 (2016).

42. 42.

LaRocque, K. F. et al. Global similarity and pattern separation in the human medial temporal lobe predict subsequent memory. J. Neurosci. 33, 5466–5474 (2013).

43. 43.

Schlichting, M. L., Mumford, J. A. & Preston, A. R. Learning-related representational changes reveal dissociable integration and separation signatures in the hippocampus and prefrontal cortex. Nat. Commun. 6, 8151 (2015).

44. 44.

Yassa, M. A. & Stark, C. E. L. Pattern separation in the hippocampus. Trends Neurosci. 34, 515–525 (2011).

45. 45.

Kim, G., Norman, K. A. & Turk-Browne, N. B. Neural differentiation of incorrectly predicted memories. J. Neurosci. https://doi.org/10.1523/jneurosci.3272-16.2017 (2017).

46. 46.

Kyle, C. T., Stokes, J. D., Lieberman, J. S., Hassan, A. S. & Ekstrom, A. D. Successful retrieval of competing spatial environments in humans involves hippocampal pattern separation mechanisms. eLife 4, e10499 (2015).

47. 47.

Schapiro, A. C., Kustner, L. V. & Turk-Browne, N. B. Shaping of object representations in the human medial temporal lobe based on temporal regularities. Curr. Biol. 22, 1622–1627 (2012).

48. 48.

Aly, M. & Turk-Browne, N. B. Attention promotes episodic encoding by stabilizing hippocampal representations. Proc. Natl Acad. Sci. USA 113, E420–E429 (2016).

49. 49.

Aly, M. & Turk-Browne, N. B. Attention stabilizes representations in the human hippocampus. Cereb. Cortex 26, 783–796 (2016).

50. 50.

Tulving, E. Précis of elements of episodic memory. Behav. Brain Sci. 7, 223–238 (1984).

51. 51.

Burgess, N., Maguire, E. A. & O’Keefe, J. The human hippocampus and spatial and episodic memory. Neuron 35, 625–641 (2002).

52. 52.

Eichenbaum, H. & Cohen, N. J. Can we reconcile the declarative memory and spatial navigation views on hippocampal function? Neuron 83, 764–770 (2014).

53. 53.

Ranganath, C. A unified framework for the functional organization of the medial temporal lobes and the phenomenology of episodic memory. Hippocampus 20, 1263–1290 (2010).

54. 54.

Power, J. D., Barnes, K. A., Snyder, A. Z., Schlaggar, B. L. & Petersen, S. E. Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. Neuroimage 59, 2142–2154 (2012).

55. 55.

Mazaika, P., Whitfield-Gabrieli, S. & Cooper, J. C. (2005) Detection and repair of transient artifacts in fMRI data. Poster presented at Human Brain Mapping.

56. 56.

Yushkevich, P. A. et al. Nearly automatic segmentation of hippocampal subfields in in vivo focal T2-weighted MRI. Neuroimage 53, 1208–1224 (2010).

57. 57.

Ekstrom, A. D. et al. Advances in high-resolution imaging and computational unfolding of the human hippocampus. Neuroimage 47, 42–49 (2009).

58. 58.

Zeineh, M. M., Engel, S. A. & Bookheimer, S. Y. Application of cortical unfolding techniques to functional MRI of the human hippocampal region. NeuroImage 11, 668–683 (2000).

59. 59.

Yushkevich, P. A. et al. Quantitative comparison of 21 protocols for labeling hippocampal subfields and parahippocampal subregions in in vivo MRI: towards a harmonized segmentation protocol. Neuroimage 111, 526–541 (2015).

60. 60.

Bates, D. lme4: Mixed-effects Modeling with R (Springer, Madison, WI, 2010).

61. 61.

Yushkevich, P. A. et al. Automated volumetry and regional thickness analysis of hippocampal subfields and medial temporal cortical structures in mild cognitive impairment. Hum. Brain Mapp. 36, 258–287 (2015).

62. 62.

Bates, D., Mächler, M., Bolker, B. & Walker, S. Fitting linear mixed-effects models using lme4. 67, 1–48 (2015).

63. 63.

Barr, D. J., Levy, R., Scheepers, C. & Tily, H. J. Random effects structure for confirmatory hypothesis testing: Keep it maximal. J. Mem. Lang. 68, 255–278 (2013).

64. 64.

Luke, S. G. Evaluating significance in linear mixed-effects models in R. Behav. Res. Methods https://doi.org/10.3758/s13428-016-0809-y (2016).

65. 65.

Winter, B. Linear models and linear mixed effects models in R with linguistic applications. arXiv (2013).

66. 66.

Dunlap, W. P., Cortina, J. M., Vaslow, J. B. & Burke, M. J. Meta-analysis of experiments with matched groups or repeated measures designs. Psychol. Methods 1, 170–177 (1996).

67. 67.

Yonelinas, A. P. The nature of recollection and familiarity: a review of 30 years of research. J. Mem. Lang. 46, 441–517 (2002).

68. 68.

Mumford, J. A., Turner, B. O., Ashby, F. G. & Poldrack, R. A. Deconvolving BOLD activation in event-related designs for multivoxel pattern classification analyses. Neuroimage 59, 2636–2643 (2012).

69. 69.

Ritchey, M., Montchal, M. E., Yonelinas, A. P. & Ranganath, C. Delay-dependent contributions of medial temporal lobe regions to episodic memory retrieval. eLife 4, e05025 (2015).

70. 70.

Frankó, E., Insausti, A. M., Artacho-Pérula, E., Insausti, R. & Chavoix, C. Identification of the human medial temporal lobe regions on magnetic resonance images: human medial temporal lobe landmarks. Hum. Brain Mapp. 35, 248–256 (2014).

## Acknowledgements

We are grateful to the members of the UC Davis Memory Group for their valuable feedback and support throughout the design, analysis, and writing phases. This work was supported by NSF-GFRP 1650042 (H.R.D.-Z.).

## Author information

### Affiliations

1. #### Center for Neuroscience, University of California, Davis, CA, 95618, USA

• Halle R. Dimsdale-Zucker
• , Arne D. Ekstrom
• , Andrew P. Yonelinas
•  & Charan Ranganath
2. #### Department of Psychology, University of California, Davis, CA, 95618, USA

• Halle R. Dimsdale-Zucker
• , Arne D. Ekstrom
• , Andrew P. Yonelinas
•  & Charan Ranganath
3. #### Department of Psychology, Boston College, Chestnut Hill, MA, 02467, USA

• Maureen Ritchey
4. #### Center for Mind and Brain, University of California, Davis, CA, 95618, USA

• Andrew P. Yonelinas

### Contributions

H.R.D.-Z., M.R., A.D.E., A.P.Y., and C.R. designed the study and wrote the paper. H.R.D.-Z. performed the experiment and analyzed the data.

### Competing interests

The authors declare no competing financial interests.

### Corresponding author

Correspondence to Halle R. Dimsdale-Zucker.