Emergence of hierarchical organization in memory for random material

Naim, Michelangelo; Katkov, Mikhail; Recanatesi, Stefano; Tsodyks, Misha

doi:10.1038/s41598-019-46908-z

Download PDF

Article
Open access
Published: 18 July 2019

Emergence of hierarchical organization in memory for random material

Scientific Reports volume 9, Article number: 10448 (2019) Cite this article

5310 Accesses
7 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Structured information is easier to remember and recall than random one. In real life, information exhibits multi-level hierarchical organization, such as clauses, sentences, episodes and narratives in language. Here we show that multi-level grouping emerges even when participants perform memory recall experiments with random sets of words. To quantitatively probe brain mechanisms involved in memory structuring, we consider an experimental protocol where participants perform ‘final free recall’ (FFR) of several random lists of words each of which was first presented and recalled individually. We observe a hierarchy of grouping organizations of FFR, most notably many participants sequentially recalled relatively long chunks of words from each list before recalling words from another list. Moreover, participants who exhibited strongest organization during FFR achieved highest levels of performance. Based on these results, we develop a hierarchical model of memory recall that is broadly compatible with our findings. Our study shows how highly controlled memory experiments with random and meaningless material, when combined with simple models, can be used to quantitatively probe the way meaningful information can efficiently be organized and processed in the brain.

Predicting memory from the network structure of naturalistic events

Article Open access 22 July 2022

Hongmi Lee & Janice Chen

Time-dependent memory transformation in hippocampus and neocortex is semantic in nature

Article Open access 27 September 2023

Valentina Krenz, Arjen Alink, … Lars Schwabe

The double-edged sword of memory retrieval

Article 17 October 2022

Henry L. Roediger III & Magdalena Abel

Introduction

Free recall of randomly assembled lists of words is a long-standing paradigm for studying human memory that produced a great amount of experimental observations^{1,2,3,4,5,6,7,8,9} and theoretical models^{10,11,12,13,14,15}. One of the critical observations made over the decades of research concerns the issue of performance, i.e. the number of words that people can recall from lists of various lengths. It was observed that recall performance grows sublinearly with the list length, which means that even for lists of moderate lengths, people cannot recall most of the words presented to them^{1,2,3,4,5,6,7,8,9}. These observations stand in a stark contrast to much better recall of meaningful texts, such as stories or poems. One plausible explanation for this difference could be in the fact that meaningful texts exhibit various degrees of organization that makes them easier to remember and recall. For example, a story may contain distinct episodes that relate to each other in multiple logical chains that give rise to its ‘meaning’ and makes it more memorable^16,17,18. Often speakers introduce organization to the material to be communicated in order to improve its retention; one of the most prominent organizational strategies is grouping (the parceling of information into smaller parts). Pausing at appropriate places while speaking, allows listeners to divide the speech into meaningful parts¹⁹. Several studies of chunking during free recall were pursued. In some of them, chunking was imposed by the presentation protocol, such as e.g. increasing the time interval between the chunks^20,21,22,23, while in others spontaneous chunking was observed^{21,24,25,26,27}. The typical size of a chunk was related to the working memory capacity (see e.g.²⁸), which was also found to be positively correlated to free recall performance²⁹.

Several studies reported that repeated recall of random lists resulted in increased chunking^16,30,31. In our recent analysis of the data of M. Kahana where each list was presented once, a small fraction of participants developed strong chunking, accompanied by a substantial improvement of performance, up to perfect recall of full lists of 16 words³². However, the overall extent of chunking is very moderate which makes the analysis of its effects on performance quite difficult, in particular a vast majority of participants did not exhibit chunking even after extensive practice.

A highly elaborate model of temporal clustering in both serial and free recall was developed in²⁴, based on earlier models^12,20,33,34. The main idea of this model is a hierarchical representation of temporal context that incorporates episodic clustering into distinct groups of individual items and also serial position of items within each group. The recall of each item is preceded by the retrieval of a group context, which in turn is triggered by group-specific cues and control elements. The main focus of the model is induced or spontaneous chunking in single list recall, characterized by the appearance of short clusters of 3–4 subsequently presented words. Another influential model of clustering, called Context Maintenance and Retrieval (CTM) is proposed in¹⁴. This model generalizes the earlier Temporal Context Model¹² to include the possibility that different memory items are grouped into distinct sources (e.g. words presented aurally vs visually). The model accounts for experimentally observed interplay between two different types of clustering: a temporal one based on presentation position of different items, and source clustering.

In this contribution, we chose to focus our attention on the paradigm of final free recall (FFR), which could be considered as a strong version of induced chunking. In this paradigm, participants recall several lists in a single daily session, in the end of which they are asked (with no prior warning) to recall the words of all the lists in an arbitrary order. FFR was studied in several previous publications (see e.g.^{35,36,37,38,39}). Most of these studies concerned the differences between temporal organization of recall within lists as assessed by a classical serial position curve², for both single list recall and FFR. In some studies, similar organization was reported for both cases, characterized by primacy and recency effects³⁵, while other studies, involving bigger number of longer lists, reported within-list ‘anti-recency’ in FFR^38,39. It was also reported that words from the lists presented towards the end of the session were more likely to be recalled (list recency) and between-the-list transitions tend to be between lists that were presented one after another (list contiguity)^36,37.

We conjectured that FFR should exhibit strong grouping over the lists, because each list was presented and recalled individually in the same session. We also wanted to elucidate the possible effects of grouping on the FFR performance. Indeed over-the-list grouping was observed in³⁶, but only the average length of within-list clusters was reported (2.6 words our of 10 words in each list) and no analysis of its effect on FFR performance was presented. Our analysis not only uncovered a highly significant overall degree of grouping in FFR, but also demonstrated the great diversity of it across participants. Moreover, we found a very strong positive correlation between grouping and FFR performance, thus reinforcing the crucial role of temporal organization of information in episodic memory, in a precisely quantifiable way.

To elucidate the possible mechanisms of grouping and its effect on FFR performance, we developed a highly reduced version of hierarchical recall models of^14,24. It generalizes our previous model that successfully accounted for power-law scaling in free recall of single lists^32,40,41 and includes some of the features that are similar to⁴². To reduce the complexity of the model, we did not include any mechanisms for generating within-list temporal organization which resulted in significantly fewer free parameters than in the previous theoretical studies. We found that the model accounts well for both overall degree of grouping and its diversity, as well as the correlation between grouping and FFR performance in terms of the number of words recalled.

Results

Grouping over lists, induced by presentation protocol

The protocol of the experimental dataset we analyze, obtained in the lab of Prof. Kahana at the University of Pennsylvania⁴³, adheres to the following structure. Each participant performed 16 Immediate Free Recall (IFR) trials a day with randomly assembled non-overlapping lists of 16 words. On selected days they were subsequently asked to recall all the words presented on that day (FFR; Fig. 1a). Averaged over roughly 900 FFR sessions, participants recalled 57 words per session. This level of performance is much higher than typical recall performance of lists of 16×16 = 256 words^2,41, indicating that participants take advantage of the structural organization of presented words imposed by prior IFR trials. To prove that this is indeed the case, we quantify the level of grouping in FFR over the presented lists with a value p₁₆ that reflects the tendency to recall subsequent words from the same list before switching to another list (see Materials and Methods)⁴⁴. The distribution of p₁₆ over the data is very wide (Fig. 1b), covering the range from 0 (random recall) to 0.9 (strong degree of grouping; see Fig. 1c–e for three prototypical examples). Displaying the FFR performance versus the grouping measure p₁₆ revealed a striking correlation between the two (r = 0.62, p = 4×10⁻⁹⁷), with the bulk of data well characterized by linear dependence of performance on p₁₆. Interestingly, in the limit p₁₆ → 0, i.e. when no grouping is employed, performance approached a value of 30 words, supporting the theoretical prediction³². We also observe that in FFR sessions with highest values of p₁₆ participants occasionally recalled single words from a list in between longer sequences from other lists (Fig. 1c; see e.g. a single word from the 15^th list recalled between two groups of words from the 4^th list). We speculate that these short ‘intrusions’ are analogous to famous ‘slips of the tongue’ in natural speech⁴⁵.

A possible interpretation of the above results is that participants perform FFR by applying a mixture of two recall strategies, one that treats all the words as one long random list, and another one that operates on two levels, namely individually presented lists and words within a list. As the second strategy gains prominence, recall becomes progressively more grouped and the value of p₁₆ increases, accompanied by the increase in performance. In particular, the participants could develop stable representations of each list as a separate entity and ‘recall’ a list before recalling words from that list.

Spontaneous grouping within presented lists

The grouping over lists exhibited in Fig. 1 is induced by the experimental protocol as lists are first presented and recalled individually in the IFR protocol. Another level of grouping, that was not induced by the protocol, was identified in FFR through the analysis of IFR data: a small proportion of participants develops chunking strategies in IFR^46,47. These participants divide lists of 16 words into chunks of 3 or 4 consecutively presented words (e.g. words 1–4, 5–8, 9–12 and 13–16 in case of chunks of size 4) and recall these chunks as single entities⁴⁴. This kind of chunking is not imposed by the protocol; hence, it must emerge from active manipulation of the presented list, for example representing chunks of words as separate items in memory. Here we wondered whether the chunks observed during IFR remained in memory till FFR trials. It is hard to infer whether chunking occurred in every single trial, hence we assumed that a chunk is recalled as a unit when all words from that chunk are recalled consecutively in IFR (not necessarily in the correct order). We therefore isolated all chunks of size 4 that were recalled during IFR trials (as described above), and considered the recall of the constituting words during FFR. We computed the probability for the different number of words from this chunk to be recalled. The results are shown in Fig. 2a. We found that for the first three chunks in the list, probability has two peaks, at 0 and 4 words, indicating the tendency for all 4 words in these chunks to be recalled or omitted as a single unit. Interestingly, the probability curve for the last chunk in a list decays monotonically, indicating that words from that chunk are recalled independently. A plausible explanation of this effect is that the last several words in a list are typically recalled immediately during IFR since they are maintained in working memory after the list is presented, and hence their recall is effortless and does not lead to the formation of a chunk representation in memory. A similar explanation also accounts for a recently reported ‘anti-recency’ effect in FFR, where the last words in a list have lower probability to be recalled, as opposed to the well-documented positive recency effect during IFR³⁹. For comparison, if the same analysis is performed for IFR trials where the same four words were recalled but with at least one intervening word, the corresponding probabilities do not exhibit a peak at four words recalled (Fig. 2b).

Spontaneous grouping of lists

Some of the best participants who employ a strong over the list grouping imposed by the presentation protocol, also exhibit a higher-level grouping of lists. In particular, they tend to recall lists in chunks of four consecutive lists, as illustrated in figure Fig. 3.

Taken together, the results presented above illustrate that our memory is trained to create a structure on different levels of organization, including those that are not directly imposed by the presentation protocol.

Hierarchical model of memory recall

The model developed for this study generalizes our previous model of single list free recall that is based on two principles^32,40:

1.
The encoding principle states that each memory item is encoded (“represented”) in the brain by a specific group of neurons in a dedicated memory network. When an item is retrieved (“recalled”), either spontaneously or when triggered by an external cue, this specific group of neurons is activated.
2.
The associativity principle for which, in the absence of specific retrieval cues, the currently retrieved item plays the role of an internal cue that triggers the retrieval of the next item.

From these two principles we were able to theoretically predict that, out of L remembered words, on average, $\sqrt{3\pi L/2}\approx 2.17\sqrt{L}$ words would be recalled⁴¹. This matches well with the average performance and its distribution in single list recall of approximately 8 words out of 16³². However, the average FFR performance of 57 words recalled out of 256 words presented over the entire session (16 lists of 16 words each) is much higher than predicted by this model, which motivated the current extension. To this end, we build an hierarchical model of memory based on these two principles and show that it’s behavior is in agreement with experimental results presented above. Our model could be viewed as a radically simplified version of ²⁴ and¹⁴.

Modeling the encoding

We extend the encoding principle formulated above for the recall of single lists in the following way. Following^14,24 we postulate that different distinct levels of information (words, chunks, lists, context…) are encoded in the form of sparse random neuronal populations in the corresponding distinct subnetworks (see Fig. 4a). In the experimental paradigm words are presented in lists of 16 items and each session consists of 16 lists. Accordingly, each word W is labeled by the triple of indexes: W = (w, l, s), corresponding to the presentation position of the word in the session (from 1 to 256), the presentation position of the list (from 1 to 16), and the session number, respectively. Similar to CMR¹⁴, we represent each word W by the concatenation of three binary patterns, each representing a session, list with a session an a word within a list, respectively:

$${{\xi }}^{W}=\mathop{\underbrace{010010001\ldots 10}}\limits_{{\rm{w}}{\rm{o}}{\rm{r}}{\rm{d}}\,{\rm{e}}{\rm{n}}{\rm{c}}{\rm{o}}{\rm{d}}{\rm{i}}{\rm{n}}{\rm{g}}\,{{\xi }}^{{\rm{w}}}}\mathop{\underbrace{010010100\ldots 10}}\limits_{{\rm{l}}{\rm{i}}{\rm{s}}{\rm{t}}\,{\rm{e}}{\rm{n}}{\rm{c}}{\rm{o}}{\rm{d}}{\rm{i}}{\rm{n}}{\rm{g}}\,{{\xi }}^{{\rm{l}}}}\mathop{\underbrace{101011000\ldots 1}}\limits_{{\rm{s}}{\rm{e}}{\rm{s}}{\rm{s}}{\rm{i}}{\rm{o}}{\rm{n}}\,{\rm{e}}{\rm{n}}{\rm{c}}{\rm{o}}{\rm{d}}{\rm{i}}{\rm{n}}{\rm{g}}\,{{\xi }}^{{\rm{s}}}}$$

(1)

The length of the three vectors equals the number of neurons in each subnetwork N_w, N_l, N_s. Each neuron contributes to the encoding of a word W with probability f so that the total number of neurons which encode a word W is on average f⋅(N_w + N_l + N_s) = f⋅N.

In our previous studies^32,48, transitions between words were driven by similarity defined as a dot product between the corresponding representations (see Methods). Due to the decomposition of representations into three parts (see Eq. (1)), the similarity between any two words W₁ and W₂ can be presented as a sum of three corresponding terms:

$${S}_{tot}^{{W}_{1},{W}_{2}}={S}_{word}^{{w}_{1},{w}_{2}}+\alpha {S}_{list}^{{l}_{1},{l}_{2}}+\beta {S}_{session}^{{s}_{1},{s}_{2}},$$

(2)

where ${S}_{word}^{{w}_{1},{w}_{2}}$ is the similarity matrix between words w₁ and w₂ in words subnetwork; ${S}_{list}^{{l}_{1},{l}_{2}}$ is the similarity between lists l₁ and l₂, to which the words W₁ and W₂ belong; ${S}_{session}^{{s}_{1},{s}_{2}}$ is the similarity of sessions s₁ and s₂ in the session subnetwork; parameters α and β weight the relative strength of the list and session context populations respectively in driving the retrieval process.

The data shows that IFR had a strong effect on FFR, since, while average IFR performance was 50%, 87% of the words recalled in FFR are the words that were previously recalled in IFR. We therefore assumed that only the words that are recalled during IFR are bound to list and session representations, i.e. the last two terms in the total similarity matrix of Eq. (2) are only added for pairs of words that were both recalled during IFR (see Methods).

Associative transitions

The model of the encoding principle provides a simple mathematical characterization of words representation, but it does not describe how these representations are exploited in the retrieval dynamics. This is described within the scope of the associative principle which determines transitions between words.

According to the associativity principle the currently retrieved item functions as an internal cue that triggers the retrieval of the next one. Transitions between words are brought about by similarities between the active word – the last retrieved one – and other encoded words. Simplifying the previous models^14,24, we use the deterministic transition rule, namely, the word which is most similar to the currently retrieved one is then activated and the process continues leading to the retrieval of more and more words. Importantly, the last retrieved word cannot be activated so that a transition which just occured cannot immediately happen in the reverse direction. The IFR of a single list was obtained by the non-hierarchical recall model of⁴⁰ with word-to-word contribution S_word to the similarity matrix of Eq. (2) (see Fig. 4b). The recalled words in IFR were then used to build the total similarity matrix S_tot of Eq. (2) for different strengths of binding between words and lists, α, and FFR was modeled as follows. The dynamical recall process is driven by S_tot (see Fig. 4c, black arrows), unless the same within-list transition is attempted for the second time (Fig. 4b,c, red arrows). At this point, the process, if continued, would enter a loop by recapitulating the same words of a given list that were already retrieved and hence no new words would be recalled (Fig. 4b, bold arrows). Note that the repeated retrieval of the same word not always initiates the loop because sometimes the retrieval could then proceed in the opposite direction (see⁴⁰ and Fig. 4b). Similar to⁴², we assume than when the process approaches a loop, i.e. the same within-list transition is attempted for the second time, the list representation is suppressed and the next transition is determined by the other two contributions in the similarity matrix corresponding to session context and word-to-word similarity:

$${S}_{tot-l}^{{W}_{1},{W}_{2}}={S}_{word}^{{w}_{1},{w}_{2}}+\beta {S}_{session}^{{s}_{1},{s}_{2}}.$$

(3)

We call these transitions ‘random’ in contrast to the ‘structured’ ones induced by S_tot, and show them with green arrows in Fig. 4c. Upon triggering the retrieval of a new word through random transition the process reverts to using the full similarity matrix S_tot with the list representation corresponding to the retrieved word activated, until it eventually enters a big loop that includes several lists.

Comparison between data and model simulation

We now turn to deploying this model in simulating the experimental paradigm analyzed previously. To qualitatively compare the model to experimental findings, we examine how the sequences generated by our model present grouping of items as measured by p₁₆. In the model, the parameter α controls the strength of binding between the words recalled during IFR and the list representation. When α is high, the similarity between the words from the same list is high and hence most of the transitions happen between such words. We let α vary across sessions, and set the binding between words and sessions according to $\beta =\frac{\alpha }{2}+\gamma $. Here γ is a constant that controls the binding of words recalled during IFR to a session, irrespective of how strong the list binding is. The reason for this contribution is that the words recalled in IFR have a higher chance to be recalled in FFR even for sessions with no list grouping (i.e. sessions with p₁₆ = 0), see Fig. 5d. The exact relation between β and α is not important, besides setting the value of α for which grouping saturates (see Fig. 5a).

Using the described model, 6500 sessions of FFR were simulated. We compute p₁₆ for all sessions of FFR so generated and find that p₁₆ on average monotonically increases with the value of α, Fig. 5a. This is an expected behavior since large values of α force structured recall. Similarly to experimental data the model shows a linear dependence of the number of recalled words as a function of p₁₆, Fig. 5b (cfr. Fig. 1b). Intriguingly, the number of sequences of words recalled from the same list as a function of p₁₆ shows a non-monotonic dependence, Fig. 5c (red dots), which we also observed in the experimental data (blue dots). For small values of α, and thus p₁₆, the recall is unstructured and the number of sequences is roughly equal to the number of recalled words (see Fig. 1d). When α and, therefore, p₁₆ increases the number of sequences increases since there is a mixture of two recall processes - random and structured (see Fig. 1e). For intermediate values of p₁₆ the contribution of S_word and S_list to driving structured transitions are comparable and across lists transitions may still be triggered by structured transitions. As we further increase α the recall becomes very structured and the words from a single list are predominantly recalled before words from other lists are recalled (see Fig. 1c). Consequently the number of sequences becomes comparable, or even smaller than the number of presented lists. To further assess the validity of our model we compute the percentage of newly recalled words in FFR (the words that were not recalled in IFR). Figure 5d shows that this steadily decreases with p₁₆ for both the model (red dots) and the experimental data (blue dots).

Discussion

We studied the final free recall of sets of 256 unrelated words that were previously presented and recalled on the same day as 16 lists of 16 words each. We found that FFR trials exhibit various degrees of hierarchical organization: within-list chunking that spontaneously emerged in IFR, over-the-list organization induced by the presentation protocol, and finally list chunking for the very best participants (see Figs 1–3 above). The dominant recall organization, exhibited in the bulk of the data, was the tendency to recall subsequent words from the same list. This type of grouping strongly correlated with performance, Fig. 1b. When extrapolated to the limit of random recall, the performance dipped below the level of 30 words that closely matched our theoretical prediction for structure-less recall. The average performance was almost twice higher than this level, indicating a strong effect of information structure on memory retrieval. We also found that within-list chunks that emerged spontaneously in a limited number of trials in IFR⁴⁴ have a high probability to be recalled or omitted as single units during FFR trials as well. Taken together, our results strongly indicate that people tend to organize information to be remembered in a way that facilitate subsequent recall, even when information itself lacks any meaning, as in the case of free recall of random words.

From a theoretical point of view, we extended the model of associative memory recall⁴⁰ to take into account the hierarchical representation of information in FFR that we found in experimental data. More specifically, we added list and session context subnetworks. The resulting model is compatible with proposed principles of sparse encoding and associative transitions. Our model is much simpler than the previous models of hierarchical contextual recall^14,24, which helps to better understand the relation between the binding of memorized words to context and FFR properties. It should be noted however that the simplicity of the model comes with the price, since it does not account for temporal organization of single list recall, such as primacy, recency and contiguity. Our experimental and theoretical results indicate that the recall of the words in IFR, rather than passive acquisition, is a dominating factor in the emergence of the grouping. The model can be easily generalized to any number of hierarchical levels by adding additional layers of representations, similar to list and session representations.

Methods

Experimental methods

Similarly to⁴⁹, the data reported in this manuscript were collected in the lab of M. Kahana as part of the Penn Electrophysiology of Encoding and Retrieval Study (see⁴³ for details of the experiments). Here we analyzed the results from the 217 participants (age 17–30) who completed the first phase of the experiment, consisting of 7 experimental sessions. All experiments were performed in accordance with relevant guidelines and regulations. Participants were consented according the University of Pennsylvania’s IRB protocol and were compensated for their participation. Informed consent was obtained from all participants or, if participants are under 18, from a parent or legal guardian. Each session consisted of 16 lists of 16 words presented one at a time on a computer screen and lasted approximately 1.5 hours. Each study list was followed by an immediate free recall test. Words were drawn from a pool of 1638 words. For each list, there was a 1500 ms delay before the first word appeared on the screen. Each item was on the screen for 3000 ms, followed by jittered 800–1200 ms inter-stimulus interval (uniform distribution). After the last item in the list, there was a 1200–1400 ms jittered delay, after which participants were given 75 seconds to attempt to recall any of the just-presented items. In 4 out of 7 experimental sessions, following the immediate free recall test from the last list, participants were shown an instruction screen for final-free recall, informing them to recall all the items from the preceding lists in any order. After a 5 s delay, a tone sounded and a row of asterisks appeared. Participants had 5 minutes to orally recall any item from the preceding lists.

Grouping measures

For each final-free recall trial we consider the ordered set of recalled words (W) defined as w₁ → w₂ → … → w_n where n is the number of words recalled in a given trial and w₁ (w₂, …, w_n) denotes the input serial position during the day of the first (second, …, last) word recalled, which is the number between 1 and 256 (see Fig. 1a). We introduce the grouping measure (p), and assign the probability to each transition by assuming that the next word recalled is chosen from the same list as the currently recalled word with probability p and a random word is chosen with probability 1 − p. The probability for the whole sequence is computed as a product of individual transition probabilities. Formally, if l_i is the number of the list (from 1 to 16) from which word w_i was presented, the probability P_i of transition (w_i → w_i+1) and the total logarithm probability of the whole sequence (log-likelihood) are

$$\begin{array}{rcl}{P}_{i} & = & \{\begin{array}{ll}\frac{p\delta ({l}_{i+1},{l}_{i})}{{m}_{i}}+\frac{1-p}{L-i} & {m}_{i} > 0\\ \frac{1}{L-i} & {m}_{i}=0\end{array}\\ l(W|p) & = & \sum _{i=1}^{n-1}\,\mathrm{log}\,({P}_{i})\end{array}$$

(4)

where m_i ∈ [0, …, 15] is the number of not yet recalled words from the list l_i ∈ [1, …, 16] and L = 256 is the total number of words presented during the day. The grouping measure p₁₆ for the FFR trial is then obtained as the value of p that maximizes the likelihood of the sequence l(W|p).

Theoretical model details

The model builds on the idea that words are represented as binary population vectors, Eq. (1). The full similarity matrix between two words is then given by Eq. (2). Given two words W₁ and W₂ the contributions of the different terms is given by

$$\begin{array}{rcl}{S}_{word}^{{w}_{1},{w}_{2}} & = & \sum _{i=1}^{{N}_{w}}\,{\xi }_{i}^{{w}_{1}}{\xi }_{i}^{{w}_{2}}\simeq (\begin{array}{cc} {\mathcal B} ({N}_{w},f), & {w}_{1}={w}_{2}\\ {\mathcal B} ({N}_{w},{f}^{2}), & {w}_{1}\ne {w}_{2}\end{array}\\ {S}_{list}^{{l}_{1},{l}_{2}} & = & \frac{1}{{N}_{l}\,f}{\rm{IFR}}({W}_{1}){\rm{IFR}}({W}_{2})\cdot \sum _{i\mathrm{=1}}^{{N}_{l}}\,{\xi }_{i}^{{l}_{1}}{\xi }_{i}^{{l}_{2}}\\ & \simeq & \frac{1}{{N}_{l}\,f} {\mathcal B} ({N}_{l},f)\cdot {\rm{IFR}}({W}_{1}){\rm{IFR}}({W}_{2})\cdot {\delta }_{{l}_{1},{l}_{2}}\\ {S}_{session}^{{s}_{1},{s}_{2}} & = & \frac{1}{{N}_{s}\,f}{\rm{IFR}}({W}_{1}){\rm{IFR}}({W}_{2})\cdot \sum _{i=1}^{{N}_{s}}\,{\xi }_{i}^{{s}_{1}}{\xi }_{i}^{{s}_{2}}\\ & \simeq & {\rm{IFR}}({W}_{1}){\rm{IFR}}({W}_{2})\cdot {\delta }_{{s}_{1},{s}_{2}}\end{array}$$

(5)

where w₁, w₂ index the word coding part of the population vector ξ, l₁, l₂ the list coding part, s₁, s₂ the session context coding part, while IFR(W) is an indicator function that word W was recalled in the IFR trial following the list presentation, i.e. IFR(W) = 1 was retrieved and 0 otherwise. To speedup simulations we neglected the correlations between elements in similarity matrices and approximate them by independent binomial random variables $ {\mathcal B} ({N}_{w},{f}^{2})$ for word similarities and $ {\mathcal B} ({N}_{l},f)$ for list similarities. We neglected similarities between different lists and different sessions and also assumed session similarities to be equal to each other. The list and session similarity matrices were normalized to have entries of the order of 1.

According to the associative principle, given an active word W_k the formal equation that defines the next word retrieved during structured recall is

$${W}_{k+1}=\mathop{{\rm{argmax}}}\limits_{W\notin {M}_{k}}\,{S}_{tot}^{{W}_{k},W},$$

(6)

where M₁ = {W₁}, and M_k = {W_k−1, W_k}. Similarly for random transitions

$${W}_{k+1}=\mathop{{\rm{a}}{\rm{r}}{\rm{g}}{\rm{m}}{\rm{a}}{\rm{x}}}\limits_{W\notin {M}_{k}}\,{S}_{tot-l}^{{W}_{k},W}$$

(7)

where S_tot−l is given by Eq. (3). In the simulations we consider N = 300000 and f = 0.1, ${N}_{w}=\frac{N}{3}$, ${N}_{l}=\frac{N}{3}$, ${N}_{s}=\frac{N}{3}$, γ = 15. This value for γ was chosen to match the proportion of new words recalled during FFR on sessions with little over the list grouping (see Fig. 5d).

References

Murdock, B. B. Jr. The immediate retention of unrelated words. J. Exp. Psychol. 60, 222 (1960).
Article Google Scholar
Murdock, B. B. Jr. The serial position effect of free recall. J. Exp. Psychol. 64, 482 (1962).
Article Google Scholar
Roberts, W. A. Free recall of word lists varying in length and rate of presentation: A test of total-time hypotheses. J. Exp. Psychol. 92, 365 (1972).
Article Google Scholar
Howard, M. W. & Kahana, M. J. Contextual variability and serial position effects in free recall. J. Exp. Psychol. Learn. Mem. Cogn. 25, 923 (1999).
Article CAS Google Scholar
Kahana, M. J., Howard, M. W., Zaromb, F. & Wingfield, A. Age dissociates recency and lag recency effects in free recall. J. Exp. Psychol. Learn. Mem. Cogn. 28, 530 (2002).
Article Google Scholar
Zaromb, F. M. et al. Temporal associations and prior-list intrusions in free recall. J. Exp. Psychol. Learn. Mem. Cogn. 32, 792 (2006).
Article Google Scholar
Ward, G., Tan, L. & Grenfell-Essam, R. Examining the relationship between free recall and immediate serial recall: the effects of list length and output order. J. Exp. Psychol. Learn. Mem. Cogn. 36, 1207 (2010).
Article Google Scholar
Miller, J. F., Weidemann, C. T. & Kahana, M. J. Recall termination in free recall. Mem. & Cogn. 40, 540–550 (2012).
Article Google Scholar
Grenfell-Essam, R., Ward, G. & Tan, L. Common modality effects in immediate free recall and immediate serial recall. J. Exp. Psychol. Learn. Mem. Cogn. 43, 1909 (2017).
Article Google Scholar
Raaijmakers, J. G. & Shiffrin, R. M. Sam: A theory of probabilistic search of associative memory. In Psychology of learning and motivation, vol. 14, 207–262 (Elsevier, 1980).
Gillund, G. & Shiffrin, R. M. A retrieval model for both recognition and recall. Psychol. Rev. 91, 1 (1984).
Article CAS Google Scholar
Howard, M. W. & Kahana, M. J. A distributed representation of temporal context. J. Math. Psychol. 46, 269–299 (2002).
Article MathSciNet Google Scholar
Laming, D. Failure to recall. Psychol. Rev. 116, 157 (2009).
Article Google Scholar
Polyn, S. M., Norman, K. A. & Kahana, M. J. A context maintenance and retrieval model of organizational processes in free recall. Psychol. Rev. 116, 129 (2009).
Article Google Scholar
Lehman, M. & Malmberg, K. J. A buffer model of memory encoding and temporal correlations in retrieval. Psychol. Rev. 120, 155 (2013).
Article Google Scholar
Bower, G. H., Lesgold, A. M. & Tieman, D. Grouping operations in free recall. J. Verbal Learn. Verbal Behav. 8, 481–493 (1969).
Article Google Scholar
Pollio, H. R., Richards, S. & Lucas, R. Temporal properties of category recall. J. Verbal Learn. Verbal Behav. 8, 529–536 (1969).
Article Google Scholar
Kurby, C. A. & Zacks, J. M. Segmentation in the perception and memory of events. Trends Cogn. Sci. 12, 72–79 (2008).
Article Google Scholar
Tulving, E. Subjective organization and effects of repetition in multi-trial free-recall learning. J. Verbal Learn. Verbal Behav. 5, 193–197 (1966).
Article Google Scholar
Farrell, S. & Lewandowsky, S. Modelling transposition latencies: Constraints for theories of serial order memory. J. Mem. Lang. 51, 115–135 (2004).
Article Google Scholar
Henson, R. N. A. Short-term memory for serial order. Ph.D. thesis, University of Cambridge UK (1996).
Maybery, M. T., Parmentier, F. B. & Jones, D. M. Grouping of list items reflected in the timing of recall: Implications for models of serial verbal memory. J. Mem. Lang. 47, 360–385 (2002).
Article Google Scholar
Ryan, J. Grouping and short-term memory: Different means and patterns of grouping. The Q. J. Exp. Psychol. 21, 137–147 (1969).
Article CAS Google Scholar
Farrell, S. Temporal clustering and sequencing in short-term memory and episodic memory. Psychol. Rev. 119, 223 (2012).
Article Google Scholar
Madigan, S. The serial position curve in immediate serial recall. Bull. Psychon. Soc. 15, 335–338 (1980).
Article Google Scholar
Bower, G. H. Organizational factors in memory. Cogn. Psychol. 1, 18–46 (1970).
Article Google Scholar
Kahana, M. J. & Jacobs, J. Interresponse times in serial recall: Effects of intraserial repetition. J. Exp. Psychol. Learn. Mem. Cogn. 26, 1188 (2000).
Article CAS Google Scholar
Miller, G. A. The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychol. Rev. 63, 81 (1956).
Article CAS Google Scholar
Spillers, G. J. & Unsworth, N. Variation in working memory capacity and temporal–contextual retrieval from episodic memory. J. Exp. Psychol. Learn. Mem. Cogn. 37, 1532 (2011).
Article Google Scholar
Nelson, M. J. et al. Neurophysiological dynamics of phrase-structure building during sentence processing. Proc. Natl. Acad. Sci. 201701590 (2017).
Bower, G. H., Clark, M. C., Lesgold, A. M. & Winzenz, D. Hierarchical retrieval schemes in recall of categorized word lists. J. Verbal Learn. Verbal Behav. 8, 323–343 (1969).
Article Google Scholar
Katkov, M., Romani, S. & Tsodyks, M. Memory retrieval from first principles. Neuron 94, 1027–1032 (2017).
Article CAS Google Scholar
Brown, G. D., Preece, T. & Hulme, C. Oscillator-based memory for serial order. Psychol. Rev. 107, 127 (2000).
Article CAS Google Scholar
Lewandowsky, S. & Farrell, S. Short-term memory: New data and a model. Psychol. learning Motiv. 49, 1–48 (2008).
Article Google Scholar
Laming, D. Serial position curves in free recall. Psychol. Rev. 117, 93 (2010).
Article Google Scholar
Unsworth, N. Exploring the retrieval dynamics of delayed and final free recall: Further evidence for temporal-contextual search. J. Mem. Lang. 59, 223–236 (2008).
Article Google Scholar
Howard, M. W., Youker, T. E. & Venkatadass, V. S. The persistence of memory: Contiguity effects across hundreds of seconds. Psychon. Bull. & Rev. 15, 58–63 (2008).
Article Google Scholar
Tzeng, O. J. Positive recency effect in a delayed free recall. J. Verbal Learn. Verbal Behav. 12, 436–439 (1973).
Article Google Scholar
Kuhn, J. R., Lohnas, L. J. & Kahana, M. J. A spacing account of negative recency in final free recall. J. Exp. Psychol. Learn. memory, cognition 8, 1180–1185 (2018).
Article Google Scholar
Romani, S., Pinkoviezky, I., Rubin, A. & Tsodyks, M. Scaling laws of associative memory retrieval. Neural computation 25, 2523–2544 (2013).
Article MathSciNet Google Scholar
Naim, M., Katkov, M., Romani, S. & Tsodyks, M. Fundamental law of memory recall. arXiv preprint arXiv:1905.02403 (2019).
Rundus, D. Negative effects of using list items as recall cues. J. Verbal Learn. Verbal Behav. 12, 43–50 (1973).
Article Google Scholar
Healey, M. K., Crutchley, P. & Kahana, M. J. Individual differences in memory search and their relation to intelligence. J. Exp. Psychol. Gen. 143, 1553 (2014).
Article Google Scholar
Romani, S., Katkov, M. & Tsodyks, M. Practice makes perfect in memory recall. Learn. & Mem. 23, 169–173 (2016).
Article CAS Google Scholar
Carroll, D. Psychology of language (Nelson Education, 2007).
Buschke, H. Learning is organized by chunking. J. Verbal Learn. Verbal Behav. 15, 313–324 (1976).
Article Google Scholar
Gobet, F. et al. Chunking mechanisms in human learning. Trends Cogn. Sci. 5, 236–243 (2001).
Article CAS Google Scholar
Katkov, M., Romani, S. & Tsodyks, M. Effects of long-term representations on free recall of unrelated words. Learn. & Mem. 22, 101–108 (2015).
Article Google Scholar
Recanatesi, S., Katkov, M., Romani, S. & Tsodyks, M. Neural network model of memory retrieval. Front. Comput. Neurosci. 9, 149 (2015).
Article Google Scholar

Download references

Acknowledgements

This work is supported by European Union Horizon 2020 Framework Program under Grant Agreements Nos 720270 and 785907 (Human Brain Project SGA1 and SGA2), Foundation Adelis and EU - M-GATE 765549. We are grateful to M. Kahana for generously sharing the data obtained in his laboratory. The lab of Kahana is supported by NIH grant MH55687.

Author information

Michelangelo Naim, Mikhail Katkov and Stefano Recanatesi contributed equally.

Authors and Affiliations

Department of Neurobiology, Weizmann Institute of Science, Rehovot, 76000, Israel
Michelangelo Naim, Mikhail Katkov & Misha Tsodyks
Department of Physiology and Biophysics, University of Washington, Seattle, WA, 98195, USA
Stefano Recanatesi

Authors

Michelangelo Naim
View author publications
You can also search for this author in PubMed Google Scholar
Mikhail Katkov
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Recanatesi
View author publications
You can also search for this author in PubMed Google Scholar
Misha Tsodyks
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.N., M.K. and S.R. analyzed the data, developed the mathematical model, wrote the manuscript text and edited the figures. M.T. has mentored and guided the project working on the data analysis, mathematical model and writing the manuscript text.

Corresponding author

Correspondence to Misha Tsodyks.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Naim, M., Katkov, M., Recanatesi, S. et al. Emergence of hierarchical organization in memory for random material. Sci Rep 9, 10448 (2019). https://doi.org/10.1038/s41598-019-46908-z

Download citation

Received: 19 March 2019
Accepted: 08 July 2019
Published: 18 July 2019
DOI: https://doi.org/10.1038/s41598-019-46908-z

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.