Introduction

A central goal in cognitive neuroscience is to elucidate the neural code, i.e., the mapping between (a) mental states or cognitive representations and (b) neural activity patterns. One means of testing a model of the neural code is to ask how accurately it can “translate” neural activity patterns into known (or hypothesized) mental states or cognitive representations1,2,3,4,5,6,7,8,9. Training decoding models on different types of neural features (Fig. 1a) can also help to elucidate which specific aspects of neural activity patterns are informative about cognition and, by extension, which types of neural activity patterns might compose the neural code. For example, prior work has used region of interest analyses to estimate the anatomical locations of specific neural representations10, or to compare the relative contributions to the neural code of multivariate activity patterns versus dynamic correlations between neural activity patterns11,12. An emerging theme in this literature is that cognition is mediated by dynamic interactions between brain structures13,14,15,16,17,18,19,20,21,22,23,24,25.

Fig. 1: Neural patterns.
figure 1

a A space of neural features. Within-brain analyses are carried out within a single brain, whereas across-brain analyses compare neural patterns across two or more individuals' brains. Univariate analyses characterize the activities of individual units (e.g., nodes, small networks, hierarchies of networks, etc.), whereas multivariate analyses characterize the patterns of activity across units. Order 0 patterns involve individual nodes; order 1 patterns involve node-node interactions; order 2 (and higher) patterns relate to interactions between homologous networks. Each of these patterns may be static (e.g., averaging over time) or dynamic. b Summarizing neural patterns. To efficiently compute with complex neural patterns, it can be useful to characterize the patterns using summary measures. Dimensionality reduction algorithms project the patterns onto lower-dimensional spaces whose dimensions reflect weighted combinations or nonlinear transformations of the dimensions in the original space. Graph measures characterize each unit’s participation in its associated network.

Studies of the neural code to date have primarily focused on univariate or multivariate neural patterns2, or (more recently) on patterns of dynamic first-order correlations (i.e., interactions between pairs of brain structures11,12,18,20,21,22). What might the future of this line of work hold? For example, is the neural code implemented through higher-order interactions between brain structures26? Second-order correlations reflect homologous patterns of correlation. In other words, if the dynamic patterns of correlations between two regions, A and B, are similar to those between two other regions, C and D, this would be reflected in the second-order correlations between (AB) and (CD). In this way, second-order correlations identify similarities and differences between subgraphs of the brain’s connectome. Analogously, third-order correlations reflect homologies between second-order correlations, i.e., homologous patterns of homologous interactions between brain regions. More generally, higher-order correlations reflect homologies between patterns of lower-order correlations. We can then ask: which “orders” of interaction are most reflective of high-level cognitive processes?
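To make this notion concrete, the toy sketch below (ours, for illustration only; it uses sliding windows rather than the kernel-based estimator introduced later) computes a second-order correlation as the correlation between two first-order dynamic correlation timeseries:

```python
# Toy illustration (ours) of a second-order correlation: the correlation
# between two first-order dynamic correlation timeseries. Sliding windows are
# used here only for simplicity; the framework below uses kernels instead.
import numpy as np

rng = np.random.default_rng(0)
T, w = 300, 20                              # timepoints, sliding-window width
A, B, C, D = rng.standard_normal((4, T))    # four synthetic regional timeseries

def sliding_corr(x, y, w):
    """First-order dynamic correlation between x and y via a sliding window."""
    return np.array([np.corrcoef(x[t:t + w], y[t:t + w])[0, 1]
                     for t in range(len(x) - w)])

r_AB = sliding_corr(A, B, w)    # dynamic correlation of the (A, B) pair
r_CD = sliding_corr(C, D, w)    # dynamic correlation of the (C, D) pair

# Second-order correlation: do the (A, B) and (C, D) interactions covary?
print(np.corrcoef(r_AB, r_CD)[0, 1])
```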

One reason one might expect to see homologous networks in a dataset is related to the notion that network dynamics reflect ongoing neural computations or cognitive processing27. If the nodes in two brain networks are interacting (within each network) in similar ways then, according to our characterization of network dynamics, we refer to the similarities between those patterns of interaction as higher-order correlations. When higher-order correlations are themselves changing over time, we can also attempt to capture and characterize those high-order dynamics.

Another central question pertains to the extent to which the neural code is carried by activity patterns that directly reflect ongoing cognition1,2, versus the dynamic properties of the network structure itself, independent of specific activity patterns in any given set of regions16. For example, graph measures such as centrality and degree28 may be used to estimate how a given brain structure is “communicating” with other structures, independently of the specific neural representations carried by those structures. If one considers a brain region’s position in the network (e.g., its eigenvector centrality) as a dynamic property, one can compare how the positions of different regions are correlated, and/or how those patterns of correlations change over time. We can also compute higher-order patterns in these correlations to characterize homologous subgraphs in the connectome that display similar changes in their constituent brain structures’ interactions with the rest of the brain.

To gain insights into the above aspects of the neural code, we developed a computational framework for estimating dynamic high-order correlations in timeseries data. This framework provides an important advance, in that it enables us to examine patterns of higher-order correlations that are computationally intractable to estimate via conventional methods. Given a multivariate timeseries, our framework provides timepoint-by-timepoint estimates of the first-order correlations, second-order correlations, and so on. Our approach combines a kernel-based method for computing dynamic correlations in timeseries data with a dimensionality reduction step (Fig. 1b) that projects the resulting dynamic correlations into a low-dimensional space. We explored two dimensionality reduction approaches: principal components analysis29 (PCA), which preserves an approximately invertible transformation back to the original data30,31,32, and a second non-invertible algorithm for computing dynamic patterns in eigenvector centrality33. This latter approach characterizes correlations between each feature dimension’s relative position in the network (at each moment in time), rather than the specific activity histories of different features26,34,35.

We validated our approach using synthetic data where the underlying correlations were known. We then applied our framework to a neuroimaging dataset collected as participants listened to an audio recording of a ten-minute story, listened to a temporally scrambled version of the story, or underwent a resting state scan36. Temporal scrambling has been used in a growing number of studies, largely by Uri Hasson’s group, to identify brain regions that are sensitive to higher-order and longer-timescale information (e.g., cross-sensory integration, rich narrative meaning, complex situations, etc.) versus regions that are primarily sensitive to low-order (e.g., sensory) information. For example, ref. 37 argues that when brain areas are sensitive to fine versus coarse temporal scrambling, this indicates that they are “higher order” in the sense that they process contextual information pertaining to further-away timepoints. By contrast, low-level regions, such as primary sensory cortices, do not meaningfully change their responses (after correcting for presentation order) even when the stimulus is scrambled at fine timescales.

We used a subset of the story listening and rest data to train across-participant classifiers to decode listening times (of groups of participants) using a blend of neural features (comprising neural activity patterns, as well as different orders of dynamic correlations between those patterns that were inferred using our computational framework). We found that both the PCA-based and eigenvector centrality-based approaches yielded neural patterns that could be used to decode accurately (i.e., well above chance). Both approaches also yielded the best decoding accuracy for data collected during (intact) story listening when high-order (PCA: second-order; eigenvector centrality: fourth-order) dynamic correlation patterns were included as features. When we trained classifiers on the scrambled stories or resting state data, only (relatively) lower-order dynamic patterns were informative to the decoders. Taken together, our results indicate that high-level cognition is supported by high-order dynamic patterns of communication between brain structures.

Results

We sought to understand whether high-level cognition is reflected in dynamic patterns of high-order correlations. To that end, we developed a computational framework for estimating the dynamics of stimulus-driven high-order correlations in multivariate timeseries data (see Dynamic inter-subject functional connectivity (DISFC) and Dynamic higher-order correlations). We evaluated the efficacy of this framework at recovering known patterns in several synthetic datasets (see Synthetic data: simulating dynamic first-order correlations and Synthetic data: simulating dynamic higher-order correlations). We then applied the framework to a public fMRI dataset collected as participants listened to an auditorily presented story, listened to a temporally scrambled version of the story, or underwent a resting state scan (see Functional neuroimaging data collected during story listening). We used the relative decoding accuracies of classifiers trained on different sets of neural features to estimate which types of features reflected ongoing cognitive processing.

Recovering known dynamic first-order correlations

We generated synthetic datasets that differed in how the underlying first-order correlations changed over time. For each dataset, we applied Eq. (4) with a variety of kernel shapes and widths. We assessed how well the true underlying correlations at each timepoint matched the recovered correlations (Fig. 2). For every kernel and dataset we tested, our approach recovered the correlation dynamics we embedded into the data. However, the quality of these recoveries varied across different synthetic datasets in a kernel-dependent way.

Fig. 2: Recovering known dynamic first-order correlations from synthetic data.
figure 2

Each panel displays the average correlations between the vectorized upper triangles of the recovered correlation matrix at each timepoint and either the true underlying correlation at each timepoint or a reference correlation matrix (the averages are taken across 100 different randomly generated synthetic datasets of each given category, each with K = 50 features and T = 300 timepoints). Error ribbons denote 95% confidence intervals of the mean (taken across datasets). Different colors denote different kernel shapes, and the shading within each color family denotes the kernel width parameter. For a complete description of each synthetic dataset, see Synthetic data: simulating dynamic first-order correlations. a Constant correlations. These datasets have a stable (unchanging) underlying correlation matrix. b Random correlations. These datasets are generated using a new independently drawn correlation matrix at each new timepoint. c Ramping correlations. These datasets are generated by smoothly varying the underlying correlations between the randomly drawn correlation matrices at the first and last timepoints. The left panel displays the correlations between the recovered dynamic correlations and the underlying ground truth correlations. The middle panel compares the recovered correlations with the first timepoint’s correlation matrix. The right panel compares the recovered correlations with the last timepoint’s correlation matrix. d Event-based correlations. These datasets are each generated using five randomly drawn correlation matrices that each remain stable for a fifth of the total timecourse. The left panel displays the correlations between the recovered dynamic correlations and the underlying ground truth correlations. The right panels compare the recovered correlations with the correlation matrices unique to each event. The vertical lines denote event boundaries. Source data are provided as a Source Data file.

In general, wide monotonic kernel shapes (Laplace, Gaussian), and wider kernels (within a shape), performed best when the correlations varied gradually from moment-to-moment (Fig. 2a, c, d). In the extreme, as the rate of change in correlations approaches 0 (Fig. 2a), an infinitely wide kernel would exactly recover the Pearson’s correlation (e.g., compare Eqs. (1) and (4)).

When the correlation dynamics were unstructured in time (Fig. 2b), a Dirac δ kernel (infinitely narrow) performed best. This is because, when every timepoint’s correlations are independent of the correlations at every other timepoint, averaging data over time dilutes the available signal. Following a similar pattern, holding kernel shape fixed, narrower kernel parameters better recovered randomly varying correlations.

Recovering known dynamic higher-order correlations

Following our approach to evaluating our ability to recover known dynamic first-order correlations from synthetic data, we generated an analogous second set of synthetic datasets that we designed to exhibit known dynamic first-order and second-order correlations (see Synthetic data: simulating dynamic higher-order correlations). We generated a total of 400 datasets (100 datasets for each category) that varied in how the first-order and second-order correlations changed over time. We then repeatedly applied Eq. (4) using the overall best-performing kernel from our first-order tests (a Laplace kernel with a width of 20; Fig. 2) to assess how closely the recovered dynamic correlations matched the dynamic correlations we had embedded into the datasets.

Overall, we found that we could reliably recover both first-order and second-order correlations from the synthetic data (Fig. 3). When the correlations were stable for longer intervals, or changed gradually (constant, ramping, and event datasets), recovery performance was relatively high, and we were better able to recover dynamic first-order correlations than second-order correlations. This is because errors in our estimation procedure at lower orders necessarily propagate to higher orders (since lower-order correlations are used to estimate higher-order correlations). Conversely, when the correlations were particularly unstable (random datasets), we better recovered second-order correlations. This is because noise in our data generation procedure propagates from higher orders to lower orders (see Synthetic data: simulating dynamic high-order correlations).

Fig. 3: Recovery of simulated first-order and second-order dynamic correlations.
figure 3

Each panel displays the average correlations between the vectorized upper triangles of the recovered first-order and second-order correlation matrices and the true (simulated) first-order and second-order correlation matrices at each timepoint and for each synthetic dataset. (The averages are taken across 100 different randomly generated synthetic datasets of each given category, each with K = 10 features and T = 300 timepoints.) Error ribbons denote 95% confidence intervals of the mean (taken across datasets). For a complete description of each synthetic dataset, see synthetic data: simulating dynamic higher-order correlations. All estimates represented in this figure were computed using a Laplace kernel (width = 20). Constant. These datasets have stable (unchanging) underlying second-order correlation matrices. Random. These datasets are generated using a new independently drawn second-order correlation matrix at each timepoint. Ramping. These datasets are generated by smoothly varying the underlying second-order correlations between the randomly drawn correlation matrices at the first and last timepoints. Event. These datasets are each generated using five randomly drawn second-order correlation matrices that each remain stable for a fifth of the total timecourse. The vertical lines denote event boundaries. Note that the “dips” and “ramps” at the boundaries of sharp transitions (e.g., the beginning and ends of the “constant” and “ramping” datasets, and at the event boundaries of the “event” datasets) are finite-sample effects that reflect the reduced numbers of samples that may be used to accurately estimate correlations at sharp boundaries. Source data are provided as a Source Data file.

We also examined the impact of the data duration (Fig. S3) and complexity (number of zero-order features; Fig. S4) on our ability to accurately recover ground truth first-order and second-order dynamic correlations. In general, we found that our approach better recovers ground truth dynamic correlations from longer-duration timeseries data. We also found that our approach tends to best recover data generated using fewer zero-order features (i.e., lower complexity), although this tendency was not strictly monotonic. Further, because our data generation procedure requires \(\mathcal{O}(K^4)\) memory to generate a second-order timeseries with K zero-order features, we were not able to fully explore how the number of zero-order features affects recovery accuracy as the number of features gets larger (e.g., as it approaches the number of features present in the fMRI data we examine below). Although we were not able to formally test this to our satisfaction, we expect that accurately estimating dynamic high-order correlations would require data with many more zero-order features than we were able to simulate. Our reasoning is that high-order correlations necessarily involve larger numbers of lower-order features, so achieving adequate “resolution” in high-order timeseries might require many low-order features.

Taken together, our explorations using synthetic data indicated that we are able to partially, but not perfectly, recover ground truth dynamic first-order and second-order correlations. This suggests that our modeling approach provides a meaningful (if noisy) estimate of high-order correlations. We next turned to analyses of human fMRI data to examine whether the recovered dynamics might reflect the dynamics of human cognition during a naturalistic story-listening task.

Cognitively relevant dynamic high-order correlations in fMRI data

We used across-participant temporal decoders to identify cognitively relevant neural patterns in fMRI data (see Forward inference and decoding accuracy). The dataset we examined36 comprised four experimental conditions that exposed participants to stimuli that varied systematically in how cognitively engaging they were. The intact experimental condition (intact) had participants listen to an audio recording of a 10-min story. The paragraph-scrambled experimental condition (paragraph) had participants listen to a temporally scrambled version of the story, where the paragraphs occurred out of order (but where the same total set of paragraphs were presented over the full listening interval). All participants in this condition experienced the scrambled paragraphs in the same order. The word-scrambled experimental condition (word) had participants listen to a temporally scrambled version of the story where the words in the story occurred in a random order. All participants in the word condition experienced the scrambled words in the same order. Finally, in a rest experimental condition (rest), participants lay in the scanner with no overt stimulus, with their eyes open (blinking as needed). This public dataset provided a convenient means of testing our hypothesis that different levels of cognitive processing and engagement are reflected in different orders of brain activity dynamics.

In brief, we computed timeseries of dynamic high-order correlations that were similar across participants in each of two randomly assigned groups: a training group and a test group. We then trained classifiers on the training group’s data to match each sample from the test group with a stimulus timepoint. Each classifier comprised a weighted blend of neural patterns that reflected up to nth-order dynamic correlations (see Feature weighting and testing). We repeated this process for \(n\in \left\{0,1,2,...,10\right\}\). Our examinations of synthetic data suggested that none of the kernels we examined were “universal” in the sense of optimally recovering underlying correlations regardless of the temporal structure of those correlations. We found a similar pattern in the (real) fMRI data, whereby different kernels yielded different decoding accuracies, but no single kernel emerged as the clear “best.” In our analyses of neural data, we therefore averaged our decoding results over a variety of kernel shapes and widths in order to identify results that were robust to specific kernel parameters (see Identifying robust decoding results).

Our approach to estimating dynamic high-order correlations entails mapping the high-dimensional feature space of correlations (represented by a T by \(\mathcal{O}(K^2)\) matrix) onto a lower-dimensional feature space (represented by a T by K matrix). We carried out two sets of analyses that differed in how this mapping was computed. The first set of analyses used PCA to find a low-dimensional embedding of the original dynamic correlation matrices (Fig. 4a, b). The second set of analyses characterized correlations in dynamics of each feature’s eigenvector centrality, but did not preserve the underlying activity dynamics (Fig. 4c, d).

Fig. 4: Across-participant timepoint decoding accuracy varies with correlation order and cognitive engagement.
figure 4

a Decoding accuracy as a function of order: PCA. “Order'' (x-axis) refers to the maximum order of dynamic correlations that were available to the classifiers (see Feature weighting and testing). The reported across-participant decoding accuracies are averaged over all kernel shapes and widths (see Identifying robust decoding results). The y-values are displayed relative to chance accuracy (intact: \(\frac{1}{300}\); paragraph: \(\frac{1}{272}\); word: \(\frac{1}{300}\); rest: \(\frac{1}{400}\); these chance accuracies were subtracted from the observed accuracies to obtain the relative accuracies reported on the y-axis). The error ribbons denote 95% confidence intervals of the means across cross-validation folds (i.e., random assignments of participants to the training and test sets). The colors denote the experimental condition. Arrows denote sets of features that yielded reliably higher (upward facing) or lower (downward facing) decoding accuracy than the mean of all other features (via a two-tailed t-test, thresholded at p < 0.05). Figure 5 displays additional comparisons between the decoding accuracies achieved using different sets of neural features. The circled values represent the maximum decoding accuracy within each experimental condition. b Normalized timepoint decoding accuracy as a function of order: PCA. This panel displays the same results as Panel a, but here each curve has been normalized to have a maximum value of 1 and a minimum value of 0 (including the upper and lower bounds of the respective 95% confidence intervals of the mean). Panels a and b used PCA to project each high-dimensional pattern of dynamic correlations onto a lower-dimensional space. c Timepoint decoding accuracy as a function of order: eigenvector centrality. This panel is in the same format as Panel a, but here eigenvector centrality has been used to project the high-dimensional patterns of dynamic correlations onto a lower-dimensional space. d Normalized timepoint decoding accuracy as a function of order: eigenvector centrality. This panel is in the same format as Panel b, but here eigenvector centrality has been used to project the high-dimensional patterns of dynamic correlations onto a lower-dimensional space. See Figs. S1 and S2 for decoding results broken down by kernel shape and width, respectively. Source data are provided as a Source Data file.

Both sets of temporal decoding analyses yielded qualitatively similar results for the auditory (non-rest) conditions of the experiment (Fig. 4: pink, green, and teal lines; Fig. 5: three leftmost columns). The highest decoding accuracy for participants who listened to the intact (unscrambled) story was achieved using high-order dynamic correlations (PCA: second-order; eigenvector centrality: fourth-order). Scrambled versions of the story were best decoded by lower-order correlations (PCA/paragraph: first-order; PCA/word: order zero; eigenvector centrality/paragraph: order zero; and eigenvector centrality/word: order zero). The two sets of analyses yielded different decoding results on resting state data (Fig. 4: purple lines; Fig. 5: rightmost column). We note that, while the resting state times could be decoded reliably, the accuracies were only very slightly above chance. We speculate that the decoders might have picked up on attentional drift, boredom, or tiredness; we hypothesize that these all increased throughout the resting state scan. The decoders might be picking up on aspects of these loosely defined cognitive states that are common across individuals. The PCA-based approach achieved the highest resting state decoding accuracy using order zero features (non-correlational, activation-based), whereas the eigenvector centrality-based approach achieved the highest resting state decoding accuracy using second-order correlations. Taken together, these analyses indicate that high-level cognitive processing (while listening to the intact story) is reflected in the dynamics of high-order correlations in brain activity, whereas lower-level cognitive processing (while listening to scrambled versions of the story that lack rich meaning) is reflected in the dynamics of lower-order correlations and non-correlational activity dynamics. Further, these patterns are associated both with the underlying activity patterns (characterized using PCA) and also with the changing relative positions that different brain areas occupy in their associated networks (characterized using eigenvector centrality).

Fig. 5: Statistical summary of decoding accuracies for different neural features.
figure 5

Each column of matrices displays decoding results for one experimental condition (intact, paragraph, word, and rest). We considered dynamic activity patterns (order 0) and dynamic correlations at different orders (order > 0). We used two-tailed t-tests to compare the distributions of decoding accuracies obtained using each pair of features. The distributions for each feature reflect the set of average decoding accuracies (across all kernel parameters), obtained for each random assignment of training and test groups. In the upper triangles of each matrix, warmer colors (positive t-values) indicate that the neural feature indicated in the given row yielded higher accuracy than the feature indicated in the given column. Cooler colors (negative t-values) indicate that the feature in the given row yielded lower decoding accuracy than the feature in the given column. The lower triangles of each map denote the corresponding p-values for the t-tests. The diagonal entries display the relative average optimized weight given to each type of feature in a decoder that included all feature types (see Feature weighting and testing). Source data are provided as a Source Data file.

Having established that patterns of high-order correlations are informative to decoders, we next wondered which specific networks of brain regions contributed most to these patterns. As a representative example, we selected the kernel parameters that yielded decoding accuracies that were the most strongly correlated (across conditions and orders) with the average accuracies across all of the kernel parameters we examined. Using Fig. 4c as a template, the best-matching kernel was a Laplace kernel with a width of 50 (see Kernel-based approach for computing dynamic correlations and Fig. S9). We used this kernel to compute a single K by K nth-order DISFC matrix for each experimental condition. We then used Neurosynth38 to compute the terms most highly associated with the most strongly correlated pairs of regions in each of these matrices (Fig. 6; see Reverse inference).

Fig. 6: Top terms associated with the most strongly correlated nodes at each order.
figure 6

Each color corresponds to one order of inter-subject functional correlations. To calculate the dynamic correlations, eigenvector centrality has been used to project the high-dimensional patterns of dynamic correlations onto a lower-dimensional space at each previous order, which allows us to map the brain regions at each order by retaining the features of the original space. The inflated brain plots display the locations of the endpoints of the 10 strongest (absolute value) correlations at each order, thresholded at 0.999, and projected onto the cortical surface91. The lists of terms on the right display the top five Neurosynth terms38 decoded from the corresponding brain maps for each order. Each row displays data from a different experimental condition. Additional maps and their corresponding Neurosynth terms may be found in the Supplementary materials (intact: Fig. S5; paragraph: Fig. S6; word: Fig. S7; rest: Fig. S8). Source data are provided as a Source Data file.

For all of the story listening conditions (intact, paragraph, and word; top three rows of Fig. 6), we found that first- and second-order correlations were most strongly associated with auditory and speech processing areas. During intact story listening, third-order correlations reflected integration with visual areas, and fourth-order correlations reflected integration with areas associated with high-level cognition and cognitive control, such as the ventrolateral prefrontal cortex. However, when participants listened to temporally scrambled stories, these higher-order correlations instead involved interactions with additional regions associated with speech and semantic processing (second and third rows of Fig. 6). By contrast, we found a much different set of patterns in the resting state data (Fig. 6, bottom row). First-order resting state correlations were most strongly associated with regions involved in counting and numerical understanding. Second-order resting state correlations were strongest in visual areas; third-order correlations were strongest in task-positive areas; and fourth-order correlations were strongest in regions associated with autobiographical and episodic memory. We carried out analogous analyses to create maps (and decode the top associated Neurosynth terms) for up to 15th-order correlations (Figs. S5–S8). Of note, examining 15th-order correlations between 700 nodes using conventional methods would have required storing roughly \(\frac{700^{2\times 15}}{2}\approx 1.13\times 10^{85}\) floating point numbers; assuming single-precision (32 bits each), this would require roughly 32 times as many bits as there are molecules in the known universe! Although these 15th-order correlations do appear (visually) to have some well-formed structure, we provide this latter example primarily as a demonstration of the efficiency and scalability of our approach.

Discussion

We tested the hypothesis that high-level cognition is reflected in high-order brain network dynamics19,26. We examined high-order network dynamics in functional neuroimaging data collected during a story listening experiment. When participants listened to an auditory recording of the intact story, they exhibited high-order brain network dynamics that were similar across individuals. By contrast, when participants instead listened to temporally scrambled recordings of the story, only lower-order brain network dynamics were similar across participants. Our results indicate that higher orders of network interactions support higher-level aspects of cognitive processing (Fig. 7).

Fig. 7: Proposed high-order network dynamics underlying high-level cognition during story listening.
figure 7

Schematic depicts higher orders of network interactions supporting higher-level aspects of cognitive processing. When tasks evoke richer, deeper, and/or higher-level processing, this is reflected in higher-order network interactions.

The notion that cognition is reflected in (and possibly mediated by) patterns of first-order network dynamics has been proposed in myriad empirical studies and reviews11,12,17,18,20,21,22,24,25,32,39,40,41,42. Our study extends this line of work by identifying cognitively relevant higher-order network dynamics that track ongoing cognition. Our findings also complement other work that uses graph theory and topology to characterize how brain networks reconfigure during cognition16,26,30,31,34,35,43.

An open question not addressed by our study pertains to how different structures integrate incoming information with different time constants. For example, one line of work suggests that the cortical surface comprises a structured map such that nearby brain structures process incoming information at similar timescales. Low-level sensory areas integrate information relatively quickly, whereas higher-level regions integrate information relatively slowly37,44,45,46,47,48,49. A similar hierarchy appears to play a role in predicting future events50. Other related work in human and mouse brains indicates that the temporal response profile of a given brain structure may relate to how strongly connected that structure is with other brain areas51. Further study is needed to understand the role of temporal integration at different scales of network interaction, and across different anatomical structures. Importantly, our analyses do not speak to the physiological basis of higher-order dynamics, which could reflect nonlinearities, chaotic patterns, non-stationarities, and/or multistability. However, our decoding analyses do indicate that higher-order dynamics are consistent across individuals, and are therefore unlikely to reflect non-stimulus-driven dynamics, which would not be expected to be similar across individuals.

One limitation of our approach relates to how noise propagates in our estimation procedure. Specifically, our procedure for estimating high-order dynamic correlations depends on estimates of lower-order dynamic correlations. This means that our measures of which higher-order patterns are reliable and stable across experimental conditions are partially confounded with the stability of lower-order patterns. Prior work suggests that the stability of what we refer to here as first-order dynamics likely varies across the experimental conditions we examined36. Therefore a caveat to our claim that richer stimuli evoke more stable higher-order dynamics is that our approach assumes that those high-order dynamics reflect relations or interactions between lower-order features.

Another potential limitation of our approach relates to recent work suggesting that the brain undergoes rapid state changes, for example across event boundaries44,52. Related work has used hidden semi-Markov models to estimate state-specific network dynamics53. Our general approach might be extended by considering putative state transitions. For example, rather than weighting all timepoints using a similar kernel (Eq. (4)), the kernel function could adapt on a timepoint-by-timepoint basis such that only timepoints determined to be in the same “state” were given non-zero weight.

Identifying high-order network dynamics associated with high-level cognition required several important methods advances. First, we used kernel-based dynamic correlations to extend the notion of (static) inter-subject functional connectivity36 to a DISFC that does not rely on sliding windows11, and that may be computed at individual timepoints. This allowed us to precisely characterize stimulus-evoked network dynamics that were similar across individuals. Second, we developed a computational framework for efficiently and scalably estimating high-order dynamic correlations. Our approach uses dimensionality reduction algorithms and graph measures to obtain low-dimensional embeddings of patterns of network dynamics. Third, we developed an analysis framework for identifying robust decoding results by carrying out our analyses using a range of parameter values and identifying which results were robust to specific parameter choices. By showing that high-level cognition is reflected in high-order network dynamics, we have elucidated the next step on the path towards understanding the neural basis of cognition.

Methods

Our general approach to efficiently estimating high-order dynamic correlations comprises four steps (Fig. 8). First, we derive a kernel-based approach to computing dynamic pairwise correlations in a T (timepoints) by K (features) multivariate timeseries, X0. This yields a T by \(\mathcal{O}(K^2)\) matrix of dynamic correlations, Y1, where each row comprises the upper triangle and diagonal of the correlation matrix at a single timepoint, reshaped into a row vector (this reshaped vector is \(\left(\frac{K^2-K}{2}+K\right)\)-dimensional). Second, we apply a dimensionality reduction step to project the matrix of dynamic correlations back onto a K-dimensional space. This yields a T by K matrix, X1, that reflects an approximation of the dynamic correlations reflected in the original data. Third, we use repeated applications of the kernel-based dynamic correlation step to Xn and the dimensionality reduction step to the resulting Yn+1 to estimate high-order dynamic correlations. Each application of these steps to a T by K timeseries Xn yields a T by K matrix, Xn+1, that reflects the dynamic correlations between the columns of Xn. In this way, we refer to n as the order of the timeseries, where X0 (order 0) denotes the original data and Xn denotes (approximated) nth-order dynamic correlations between the columns of X0. Finally, we use a cross-validation-based decoding approach to evaluate how well information contained in a given order (or weighted mixture of orders) may be used to decode relevant cognitive states. If including a given Xn in the feature set yields higher classification accuracy on held-out data, we interpret this as evidence that the given cognitive states are reflected in patterns of nth-order correlations.
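Schematically, these four steps amount to a short loop over orders. In the sketch below, the helper names compute_dynamic_corrs and reduce_dims are hypothetical placeholders for the operations defined in the following subsections (the released timecorr toolbox provides tested implementations):

```python
# Schematic of the estimation pipeline (hypothetical helper names; see the
# following subsections and the timecorr toolbox for the actual operations).
import numpy as np

def estimate_higher_orders(X0, max_order, compute_dynamic_corrs, reduce_dims):
    """X0: T x K order-0 data. Returns [X0, X1, ..., X_max_order], where each
    Xn is a T x K approximation of the nth-order dynamic correlations of X0."""
    Xs = [np.asarray(X0)]
    K = Xs[0].shape[1]
    for _ in range(max_order):
        Y = compute_dynamic_corrs(Xs[-1])   # T x (K*(K - 1)/2 + K), Eq. (4)
        Xs.append(reduce_dims(Y, K))        # project back onto T x K
    return Xs
```

The decoding step then weights and concatenates these per-order features (see Feature weighting and testing).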

Fig. 8: Estimating dynamic high-order correlations.
figure 8

Given a T by K matrix of multivariate timeseries data, Xn (where \(n\in {\mathbb{N}},n\ge 0\)), we use Eq. (4) to compute a timeseries of K by K correlation matrices, Yn+1. We then approximate Yn+1 with the T by K matrix, Xn+1. This process may be repeated to scalably estimate iteratively higher-order correlations in the data. Note that the transposes of Xn and Xn+1 are displayed in the figure for compactness.

All of the code used to produce the figures and results in this manuscript, along with links to the corresponding datasets, may be found at github.com/ContextLab/timecorr-paper. In addition, we have released a Python toolbox for computing dynamic high-order correlations in timeseries data; our toolbox may be found at timecorr.readthedocs.io.

Kernel-based approach for computing dynamic correlations

Given a T by K matrix of observations, X, we can compute the (static) Pearson's correlation between any pair of columns, X(·, i) and X(·, j), using29:

$$\mathrm{corr}(\mathbf{X}(\cdot,i),\mathbf{X}(\cdot,j))=\frac{\sum_{t=1}^{T}\left(\mathbf{X}(t,i)-\bar{\mathbf{X}}(\cdot,i)\right)\left(\mathbf{X}(t,j)-\bar{\mathbf{X}}(\cdot,j)\right)}{\sqrt{\sum_{t=1}^{T}\sigma_{\mathbf{X}(\cdot,i)}^{2}\,\sigma_{\mathbf{X}(\cdot,j)}^{2}}},\ \mathrm{where}$$
(1)
$$\bar{\mathbf{X}}(\cdot,k)=\frac{1}{T}\sum_{t=1}^{T}\mathbf{X}(t,k),\ \mathrm{and}$$
(2)
$$\sigma_{\mathbf{X}(\cdot,k)}^{2}=\frac{1}{T}\sum_{t=1}^{T}\left(\mathbf{X}(t,k)-\bar{\mathbf{X}}(\cdot,k)\right)^{2}.$$
(3)

We can generalize this formula to compute time-varying correlations by incorporating a kernel function that takes a time t as input, and returns how much the observed data at each timepoint \(\tau \in \left[-\infty ,\infty \right]\) contributes to the estimated instantaneous correlation54 at time t (Fig. 9).

Fig. 9: Examples of kernel functions.
figure 9

Each panel displays per-timepoint weights for a kernel centered at t = 50, evaluated at 100 timepoints (\(\tau \in \left[1,...,100\right]\)). a Uniform kernel. The weights are timepoint-invariant; observations at all timepoints are weighted equally, and do not change as a function of τ. This is a special case kernel function that reduces dynamic correlations to static correlations. b Dirac δ kernel. Only the observation at timepoint t is given a non-zero weight (of 1). c Gaussian kernels. Each kernel’s weights fall off in time according to a Gaussian probability density function centered on time t. Weights derived using several different example width parameters (σ2) are displayed. d Laplace kernels. Each kernel’s weights fall off in time according to a Laplace probability density function centered on time t. Weights derived using several different example width parameters (b) are displayed. e Mexican hat (Ricker wavelet) kernels. Each kernel’s weights fall off in time according to a Ricker wavelet centered on time t. This function highlights the contrasts between local versus surrounding activity patterns in estimating dynamic correlations. Weights derived using several different example width parameters (σ) are displayed.
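For reference, the Gaussian, Laplace, Dirac δ, and Mexican hat kernels can be written as simple per-timepoint weight functions. The sketch below is ours; as a convention we normalize the monotonic kernels to sum to 1 (the Mexican hat weights sum to approximately zero by construction), which may differ from the toolbox's internal conventions.

```python
# Example kernel weight functions, evaluated at timepoints tau = 0..T-1 and
# centered on timepoint t (a sketch; normalization conventions are assumptions).
import numpy as np

def gaussian_kernel(t, T, sigma):
    tau = np.arange(T)
    w = np.exp(-((tau - t) ** 2) / (2 * sigma ** 2))
    return w / w.sum()

def laplace_kernel(t, T, b):
    tau = np.arange(T)
    w = np.exp(-np.abs(tau - t) / b)
    return w / w.sum()

def delta_kernel(t, T):
    w = np.zeros(T)
    w[t] = 1.0                     # only timepoint t contributes
    return w

def mexican_hat_kernel(t, T, sigma):
    tau = np.arange(T) - t         # Ricker wavelet: local versus surround contrast
    return (1 - (tau / sigma) ** 2) * np.exp(-(tau ** 2) / (2 * sigma ** 2))
```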

Given a kernel function \(\kappa_t(\cdot)\) for timepoint t, evaluated at timepoints \(\tau \in \left[1,...,T\right]\), we can update the static correlation formula in Eq. (1) to estimate the instantaneous correlation at timepoint t:

$$\mathrm{timecorr}_{\kappa_t}\left(\mathbf{X}(\cdot,i),\mathbf{X}(\cdot,j)\right)=\frac{\sum_{\tau=1}^{T}\left(\mathbf{X}(\tau,i)-\widetilde{\mathbf{X}}_{\kappa_t}(\cdot,i)\right)\left(\mathbf{X}(\tau,j)-\widetilde{\mathbf{X}}_{\kappa_t}(\cdot,j)\right)}{\sqrt{\sum_{\tau=1}^{T}\widetilde{\sigma}_{\kappa_t}^{2}(\mathbf{X}(\cdot,i))\,\widetilde{\sigma}_{\kappa_t}^{2}(\mathbf{X}(\cdot,j))}},\ \mathrm{where}$$
(4)
$$\widetilde{\mathbf{X}}_{\kappa_t}(\cdot,k)=\sum_{\tau=1}^{T}\kappa_t(\tau)\,\mathbf{X}(\tau,k),$$
(5)
$$\widetilde{\sigma}_{\kappa_t}^{2}(\mathbf{X}(\cdot,k))=\sum_{\tau=1}^{T}\left(\mathbf{X}(\tau,k)-\widetilde{\mathbf{X}}_{\kappa_t}(\cdot,k)\right)^{2}.$$
(6)

Here \(\mathrm{timecorr}_{\kappa_t}(\mathbf{X}(\cdot,i),\mathbf{X}(\cdot,j))\) reflects the correlation at time t between columns i and j of X, estimated using the kernel \(\kappa_t\). We evaluate Eq. (4) in turn for each pair of columns in X, using kernels centered on each timepoint in the timeseries, to obtain a T by K by K timeseries of dynamic correlations, Y. For convenience, we then reshape the upper triangle and diagonal of each timepoint’s symmetric correlation matrix into a row vector to obtain an equivalent T by \(\left(\frac{K^2-K}{2}+K\right)\) matrix.
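A direct (unoptimized) NumPy translation of Eqs. (4)–(6) is sketched below; variable names are ours, and the released timecorr package should be treated as the reference implementation:

```python
# Literal translation of Eqs. (4)-(6); a sketch, not the timecorr package API.
import numpy as np

def timecorr_at_t(X, kappa_t):
    """Dynamic correlation matrix at a single timepoint.

    X       : T x K data matrix.
    kappa_t : length-T vector of kernel weights for the timepoint of interest
              (assumed non-negative and summing to 1).
    Returns a K x K matrix of instantaneous correlations (Eq. (4)).
    """
    X_tilde = kappa_t @ X                  # kernel-weighted means, Eq. (5)
    dev = X - X_tilde                      # deviations from the weighted means
    var = (dev ** 2).sum(axis=0)           # Eq. (6)
    num = dev.T @ dev                      # numerator of Eq. (4) for all pairs (i, j)
    return num / np.sqrt(np.outer(var, var))

def timecorr(X, kernel):
    """Evaluate Eq. (4) at every timepoint; `kernel(t, T)` should return a
    length-T weight vector (e.g., a Laplace kernel of width 20). Returns a
    T x K x K array of dynamic correlations."""
    T = X.shape[0]
    return np.stack([timecorr_at_t(X, kernel(t, T)) for t in range(T)])
```

Reshaping the upper triangle and diagonal of each K × K slice into a row vector then yields the T by (K² − K)/2 + K matrix described above.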

Dynamic inter-subject functional connectivity

Equation (4) provides a means of taking a single observation matrix, Xn, and estimating the dynamic correlations from moment to moment, Yn+1. Suppose that one has access to a set of multiple observation matrices that reflect the same phenomenon. For example, one might collect neuroimaging data from several experimental participants, as each participant performs the same task (or sequence of tasks). Let \(\mathbf{X}_{n}^{1}\), \(\mathbf{X}_{n}^{2}\), ..., \(\mathbf{X}_{n}^{P}\) reflect the T by K observation matrices (n = 0) or reduced correlation matrices (n > 0) for each of P participants in an experiment. We can use inter-subject functional connectivity36,55 (ISFC) to compute the stimulus-driven correlations reflected in the multi-participant dataset at a given timepoint t using:

$$\bar{\mathbf{C}}(t)=M\left(R\left(\frac{1}{2P}\sum_{p=1}^{P}\left[Z{\left(\mathbf{Y}_{n+1}^{p}(t)\right)}^{\top}+Z\left(\mathbf{Y}_{n+1}^{p}(t)\right)\right]\right)\right),$$
(7)

where M extracts and vectorizes the upper triangle and diagonal of a symmetric matrix, Z is the Fisher z-transformation56:

$$Z(r)=\frac{\log(1+r)-\log(1-r)}{2},$$
(8)

R is the inverse of Z:

$$R(z)=\frac{\exp(2z)-1}{\exp(2z)+1},$$
(9)

and \(\mathbf{Y}_{n+1}^{p}(t)\) denotes the correlation matrix at timepoint t (Eq. (4)) between each column of \(\mathbf{X}_{n}^{p}\) and each column of the average Xn from all other participants, \(\bar{\mathbf{X}}_{n}^{\backslash p}\):

$$\bar{\mathbf{X}}_{n}^{\backslash p}=\frac{1}{P-1}\sum_{q\in \backslash p}\mathbf{X}_{n}^{q},$$
(10)

where \(\backslash p\) denotes the set of all participants other than participant p. In this way, the T by \(\left(\frac{K^2-K}{2}+K\right)\) DISFC matrix \(\bar{\mathbf{C}}\) provides a time-varying extension of the ISFC approach introduced in ref. 36.
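A sketch of the DISFC computation in Eqs. (7)–(9) at a single timepoint is shown below; it assumes that each participant's correlation matrix against the average of the other participants' data (Eq. (10)) has already been computed, and all names are ours:

```python
# Sketch of Eqs. (7)-(9); Y holds each participant's K x K correlation matrix
# (computed against the average of the other participants' data, Eq. (10)).
import numpy as np

def z_transform(r):
    return 0.5 * (np.log(1 + r) - np.log(1 - r))        # Fisher z, Eq. (8)

def r_transform(z):
    return (np.exp(2 * z) - 1) / (np.exp(2 * z) + 1)     # inverse of Eq. (8), Eq. (9)

def disfc_at_t(Y):
    """Y: P x K x K array of per-participant correlation matrices at time t.
    Returns the vectorized upper triangle and diagonal of Eq. (7)."""
    P = Y.shape[0]
    Z = z_transform(Y)
    avg = (Z.transpose(0, 2, 1) + Z).sum(axis=0) / (2 * P)   # symmetrize, average
    C = r_transform(avg)
    rows, cols = np.triu_indices(C.shape[0])                 # the operator M(.)
    return C[rows, cols]
```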

Low-dimensional representations of dynamic correlations

Given a T by \(\left(\frac{K^2-K}{2}+K\right)\) matrix of nth-order dynamic correlations, Yn, we propose two general approaches to computing a T by K low-dimensional representation of those correlations, Xn. The first approach uses dimensionality reduction algorithms to project Yn onto a K-dimensional space. The second approach uses graph measures to characterize the relative position of each feature (\(k \in [1, ..., K]\)) in the network defined by the correlation matrix at each timepoint.

Dimensionality reduction-based approaches to computing Xn

The modern toolkit of dimensionality reduction algorithms includes Principal Components Analysis29 (PCA), Probabilistic PCA57 (PPCA), Exploratory Factor Analysis58 (EFA), Independent Components Analysis59,60 (ICA), t-Stochastic Neighbor Embedding61 (t-SNE), Uniform Manifold Approximation and Projection62 (UMAP), non-negative matrix factorization63 (NMF), Topographic Factor Analysis64 (TFA), Hierarchical Topographic Factor Analysis11 (HTFA), Topographic Latent Source Analysis65 (TLSA), dictionary learning66,67, and deep auto-encoders68, among others. While complete characterizations of each of these algorithms are beyond the scope of the present manuscript, the general intuition driving these approaches is to compute the T by K matrix, X, that is closest to the original T by J matrix, Y, where (typically) K ≪ J. The different approaches place different constraints on what properties X must satisfy and which aspects of the data are compared (and how) in order to optimize how well X approximates Y.

Applying dimensionality reduction algorithms to Y yields an X whose columns reflect weighted combinations (or nonlinear transformations) of the original columns of Y. This has two main consequences. First, with each repeated dimensionality reduction, the resulting Xn has lower and lower fidelity (with respect to what the “true” Yn might have looked like without using dimensionality reduction to maintain tractability). In other words, computing Xn is a lossy operation. Second, whereas each column of Yn may be mapped directly onto specific pairs of columns of Xn−1, the columns of Xn reflect weighted combinations and/or nonlinear transformations of the columns of Yn. Many dimensionality reduction algorithms are invertible (or approximately invertible). However, attempting to map a given Xn back onto the original feature space of X0 will usually require \(\mathcal{O}(TK^{2^n})\) space and therefore becomes intractable as n or K grow large.
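For example, the PCA-based variant of this step might look like the following scikit-learn sketch (ours); the inverse_transform call illustrates the approximate invertibility noted above:

```python
# PCA-based reduction of a T x J matrix of vectorized dynamic correlations (Y)
# to a T x K embedding (X); a sketch using scikit-learn, not the toolbox API.
from sklearn.decomposition import PCA

def reduce_with_pca(Y, K):
    pca = PCA(n_components=K)         # requires K <= min(T, J)
    X = pca.fit_transform(Y)          # T x K low-dimensional embedding
    Y_hat = pca.inverse_transform(X)  # lossy reconstruction of the original Y
    return X, Y_hat
```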

Graph measure approaches to computing Xn

The above dimensionality reduction approaches to approximating a given Yn with a lower-dimensional Xn preserve a (potentially recombined and transformed) mapping back to the original data in X0. We also explore graph measures that instead characterize each feature’s relative position in the broader network of interactions and connections. To illustrate the distinction between the two general approaches we explore, suppose a network comprises nodes A and B, along with several other nodes. If A and B exhibit uncorrelated activity patterns, then by definition the functional connection (correlation) between them will be close to 0. However, if A and B each interact with other nodes in similar ways, we might attempt to capture those similarities between A’s and B’s interactions with those other members of the network.

In general, graph measures take as input a matrix of interactions (e.g., using the above notation, a K by K correlation matrix or binarized correlation matrix reconstituted from a single timepoint’s row of Y), and return as output a set of K measures describing how each node (feature) sits within that correlation matrix with respect to the rest of the population. Widely used measures include betweenness centrality (the proportion of shortest paths between each pair of nodes in the population that involves the given node in question69,70,71,72,73); diversity and dissimilarity (characterizations of how differently connected a given node is from others in the population74,75,76); eigenvector centrality and pagerank centrality (measures of how influential a given node is within the broader network77,78,79,80); transfer entropy and flow coefficients (a measure of how much information is flowing from a given node to other nodes in the network81,82); k-coreness centrality (a measure of the connectivity of a node within its local subgraph83,84); within-module degree (a measure of how many connections a node has to its close neighbors in the network85); participation coefficient (a measure of the diversity of a node’s connections to different subgraphs in the network85); and subgraph centrality (a measure of a node’s participation in all of the network’s subgraphs86); among others.

For a given graph measure, \(\eta: {\mathbb{R}}^{K\times K}\to {\mathbb{R}}^{K}\), we can use η to transform each row of Yn in a way that characterizes the corresponding graph properties of each column. This results in a new T by K matrix, Xn, that reflects how the features in the columns of Xn−1 participate in the network during each timepoint (row).
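As one concrete instance of η, the sketch below (ours) computes per-timepoint eigenvector centrality from each timepoint's K × K correlation matrix, using the leading eigenvector of the absolute correlation matrix; the exact sign and normalization conventions are our assumptions:

```python
# Eigenvector-centrality summary of a timeseries of correlation matrices
# (a sketch; sign and normalization conventions are our assumptions).
import numpy as np

def eigenvector_centrality(C):
    """C: K x K symmetric correlation matrix. Returns K centrality scores."""
    vals, vecs = np.linalg.eigh(np.abs(C))   # eigendecomposition of |C|
    v = np.abs(vecs[:, np.argmax(vals)])     # leading eigenvector, sign-fixed
    return v / np.linalg.norm(v)

def graph_reduce(corrs):
    """corrs: T x K x K array of dynamic correlations (e.g., from Eq. (4)).
    Returns a T x K matrix whose rows give each node's centrality over time."""
    return np.stack([eigenvector_centrality(C) for C in corrs])
```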

Dynamic higher-order correlations

Because Xn has the same shape as the original data X0, approximating Yn with a lower-dimensional Xn enables us to estimate high-order dynamic correlations in a scalable way. Given a T by K input matrix, the output of Eq. (4) requires \(\mathcal{O}(TK^2)\) space to store. Repeated applications of Eq. (4) (i.e., computing dynamic correlations between the columns of the outputted dynamic correlation matrix) each require exponentially more space; in general the nth-order dynamic correlations of a T by K timeseries occupy \(\mathcal{O}(TK^{2^n})\) space. However, when we approximate or summarize the output of Eq. (4) with a T by K matrix (as described above), it becomes feasible to compute even very high-order correlations in high-dimensional data. Specifically, approximating the nth-order dynamic correlations of a T by K timeseries requires only \(\mathcal{O}(TK^2)\) additional space, the same as would be required to compute first-order dynamic correlations. In other words, the space required to store n + 1 multivariate timeseries reflecting up to nth-order correlations in the original data scales linearly with n using our approach (Fig. 8).

Data

We examined two types of data: synthetic data and human functional neuroimaging data. We constructed and leveraged the synthetic data to evaluate our general approach87. Specifically, we tested how well Eq. (4) could be used to recover known dynamic correlations using different choices of kernel (κ; Fig. 9), for each of several synthetic datasets that exhibited different temporal properties. We also simulated higher-order correlations and tested how well Eq. (4) could recover these correlations using the best kernel from the previous synthetic data analyses. We then applied our approach to a functional neuroimaging dataset to test the hypothesis that ongoing cognitive processing is reflected in high-order dynamic correlations. We used an across-participant classification test to estimate whether dynamic correlations of different orders contain information about which timepoint in a story participants were listening to.

Synthetic data: simulating dynamic first-order correlations

We constructed a total of 400 different multivariate timeseries, collectively reflecting four qualitatively different patterns of dynamic first-order correlations (i.e., 100 datasets reflecting each type of dynamic pattern). Each timeseries comprised 50 features (dimensions) that varied over 300 timepoints. The observations at each timepoint were drawn from a zero-mean multivariate Gaussian distribution with a covariance matrix defined for each timepoint as described below. We drew the observations at each timepoint independently from the draws at all other timepoints; in other words, for each observation \(s_t \sim \mathcal{N}\left(\mathbf{0}, \boldsymbol{\Sigma}_t\right)\) at timepoint t, \(p(s_t \mid s_{\backslash t}) = p(s_t)\).

Constant: we generated data with stable underlying correlations to evaluate how Eq. (4) characterized correlation “dynamics” when the ground truth correlations were static. We constructed 100 multivariate timeseries whose observations were each drawn from a single (stable) Gaussian distribution. For each dataset (indexed by m), we constructed a random covariance matrix, Σm:

$$\boldsymbol{\Sigma}_{m}=\mathbf{C}{\mathbf{C}}^{\top},\ \mathrm{where}$$
(11)
$$\mathbf{C}(i,j)\sim \mathcal{N}(0,1),\ \mathrm{and}\ \mathrm{where}$$
(12)

\(i,j\in \left[1,2,...,50\right]\). In other words, all of the observations (for each of the 300 timepoints) within each dataset were drawn from a multivariate Gaussian distribution with the same covariance matrix, and the 100 datasets each used a different covariance matrix.

Random: we generated a second set of 100 synthetic datasets whose observations at each timepoint were drawn from a Gaussian distribution with a new randomly constructed (using Eq. (11)) covariance matrix. Because each timepoint’s covariance matrix was drawn independently from the covariance matrices for all other timepoints, these datasets provided a test of reconstruction accuracy in the absence of any meaningful underlying temporal structure in the dynamic correlations underlying the data.

Ramping: we generated a third set of 100 synthetic datasets whose underlying correlations changed gradually over time. For each dataset, we constructed two “anchor” covariance matrices using Eq. (11), Σstart and Σend. For each of the 300 timepoints in each dataset, we drew the observations from a multivariate Gaussian distribution whose covariance matrix at each timepoint \(t\in \left[0,...,299\right]\) was given by

$$\boldsymbol{\Sigma}_{t}=\left(1-\frac{t}{299}\right)\boldsymbol{\Sigma}_{\mathrm{start}}+\frac{t}{299}\,\boldsymbol{\Sigma}_{\mathrm{end}}.$$
(13)

The gradually changing correlations underlying these datasets allow us to evaluate the recovery of dynamic correlations when each timepoint’s correlation matrix is unique (as in the random datasets), but where the correlation dynamics are structured and exhibit first-order autocorrelations (as in the constant datasets).

Event: we generated a fourth set of 100 synthetic datasets whose underlying correlation matrices exhibited prolonged intervals of stability, interspersed with abrupt changes. For each dataset, we used Eq. (11) to generate five random covariance matrices. We constructed a timeseries where each set of 60 consecutive samples was drawn from a Gaussian with the same covariance matrix. These datasets were intended to simulate a system that exhibits periods of stability punctuated by occasional abrupt state changes.
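The following NumPy sketch (ours) illustrates the covariance construction of Eqs. (11) and (12) and, as one example, the ramping interpolation of Eq. (13):

```python
# Generating a synthetic dataset with known dynamic first-order correlations
# (a sketch of Eqs. (11)-(13); variable names are ours).
import numpy as np

rng = np.random.default_rng(0)
K, T = 50, 300

def random_cov(K):
    C = rng.standard_normal((K, K))   # C(i, j) ~ N(0, 1), Eq. (12)
    return C @ C.T                    # Eq. (11)

# Ramping correlations: interpolate between two "anchor" covariance matrices.
sigma_start, sigma_end = random_cov(K), random_cov(K)
data = np.empty((T, K))
for t in range(T):
    sigma_t = (1 - t / (T - 1)) * sigma_start + (t / (T - 1)) * sigma_end  # Eq. (13)
    data[t] = rng.multivariate_normal(np.zeros(K), sigma_t)  # independent draw
```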

Synthetic data: simulating dynamic high-order correlations

We developed an iterative procedure for constructing timeseries data that exhibit known dynamic high-order correlations. The procedure builds on our approach to generating dynamic first-order correlations. Essentially, once we generate a timeseries with known first-order correlations, we can use the known first-order correlations as a template to generate a new timeseries of second-order correlations. In turn, we can generate a timeseries of third-order correlations from the second-order correlations, and so on. In general, we can generate order n correlations given a timeseries of order n − 1 correlations, for any n > 1. Finally, given the order n timeseries, we can reverse the preceding process to generate an order n − 1 timeseries, an order n − 2 timeseries, and so on, until we obtain an order 0 timeseries of simulated data that reflects the chosen high-order dynamics.

The central mathematical operation in our procedure is the Kronecker product (⊗). The Kronecker product of a K × K matrix, m1, with itself (i.e., m1 ⊗ m1) produces a new \(K^2 \times K^2\) matrix, m2, whose entries reflect a scaled tiling of the entries in m1. If these tilings (scaled copies of m1) are indexed by row and column, then the tile in the ith row and jth column contains the entries of m1, multiplied by m1(i, j). Following this pattern, the Kronecker product m2 ⊗ m2 yields the \(K^4 \times K^4\) matrix m3 whose tiles are scaled copies of m2. In general, repeated applications of the Kronecker self-product may be used to generate mn+1 = mn ⊗ mn for n ≥ 1, where mn+1 is a \(K^{2^n} \times K^{2^n}\) matrix. After generating a first-order timeseries of dynamic correlations (see Synthetic data: simulating dynamic first-order correlations), we use this procedure (applied independently at each timepoint) to transform it into a timeseries of nth-order correlations. When mn+1 is generated in this way, the temporal structure of the full timeseries (i.e., constant, random, ramping, event) is preserved, since changes in the original first-order timeseries are also reflected in the scaled tilings of itself that comprise the higher-order matrices.
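
The tiling property is easy to verify numerically with NumPy's np.kron; the dimensions below are chosen only for illustration:

```python
import numpy as np

K = 3
m1 = np.arange(1.0, K * K + 1).reshape(K, K)

m2 = np.kron(m1, m1)   # K^2 x K^2; tile (i, j) equals m1 scaled by m1[i, j]
m3 = np.kron(m2, m2)   # K^4 x K^4; tiles are scaled copies of m2

# Verify one tile of m2: rows/columns K..2K-1 form tile (1, 1), which should
# equal m1 multiplied by m1[1, 1].
assert np.allclose(m2[K:2 * K, K:2 * K], m1 * m1[1, 1])
```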

Given a timeseries of nth-order correlations, we then need to work “backwards” to generate the order-zero timeseries. If the nth-order correlation matrix at a given timepoint is mn, then we can generate an order n − 1 correlation matrix (for n > 1) by taking a draw from \(\mathcal{N}\left(\mathbf{0}, m_n\right)\) and reshaping the resulting vector into a square matrix. To force the resulting matrix to be symmetric, we remove its lower triangle and replace it with (a reflected version of) its upper triangle. Intuitively, the reshaped matrix will look like a noisy (but symmetric) version of the template matrix, mn−1. (When n = 1, no reshaping is needed; the resulting K-dimensional vector may be used as the observation at the given timepoint.) After independently drawing each timepoint’s order n − 1 correlation matrix from that timepoint’s order n correlation matrix, this process can be applied repeatedly until n = 0. This results in a K-dimensional timeseries of T observations containing the specified high-order correlations at orders 1 through n. Following our approach to generating synthetic data exhibiting known first-order correlations, we constructed 400 additional multivariate timeseries, collectively reflecting four qualitatively different patterns of dynamic correlations (i.e., 100 datasets reflecting each type of dynamic pattern: constant, random, ramping, and event). Each timeseries comprised 10 zero-order features (dimensions) that varied over 300 timepoints. After applying our dynamic correlation estimation procedure, this yielded a 100-dimensional timeseries of first-order features that could then be used to estimate dynamic second-order correlations. (We chose to use K = 10 zero-order features for our higher-order simulations in order to put the accuracy computations displayed in Figs. 2 and 3 on a roughly even footing.)
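
A minimal sketch of one backwards step of the procedure described above (drawing an order n − 1 matrix, or an order-0 observation, from an order n correlation matrix); the function name and interface are ours:

```python
import numpy as np


def step_down(m_n, n, rng):
    """Draw from N(0, m_n). For n > 1, reshape the draw into a square matrix
    and symmetrize it (upper triangle reflected into the lower triangle); for
    n = 1, return the draw directly as a K-dimensional order-0 observation."""
    draw = rng.multivariate_normal(np.zeros(m_n.shape[0]), m_n)
    if n == 1:
        return draw
    side = int(round(np.sqrt(draw.size)))
    square = draw.reshape(side, side)
    return np.triu(square) + np.triu(square, k=1).T
```

Applying a step like this independently at each timepoint, and repeating from order n down to order 0, yields the simulated K-dimensional timeseries.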

Functional neuroimaging data collected during story listening

We examined an fMRI dataset collected by Simony et al.36 that the authors have made publicly available at arks.princeton.edu/ark:/88435/dsp015d86p269k. The dataset comprises neuroimaging data collected as participants listened to an audio recording of a story (intact condition; 36 participants), listened to temporally scrambled recordings of the same story (17 participants in the paragraph-scrambled condition listened to the paragraphs in a randomized order and 36 in the word-scrambled condition listened to the words in a randomized order), or lay resting with their eyes open in the scanner (rest condition; 36 participants). Full neuroimaging details may be found in the original paper for which the data were collected36. Procedures were approved by the Princeton University Committee on Activities Involving Human Subjects, and by the Western Institutional Review Board (Puyallup, WA). All subjects were native English speakers with normal hearing and provided written informed consent.

Hierarchical topographic factor analysis (HTFA): following our prior related work, we used HTFA11 to derive a compact representation of the neuroimaging data. In brief, this approach approximates the timeseries of voxel activations (44,415 voxels) using a much smaller number of radial basis function (RBF) nodes (in this case, 700 nodes, as determined by an optimization procedure11). This provides a convenient representation for examining full-brain network dynamics. All of the analyses we carried out on the neuroimaging dataset were performed in this lower-dimensional space. In other words, each participant’s data matrix, X0, was a number-of-timepoints by 700 matrix of HTFA-derived factor weights (where the row and column labels were matched across participants). Code for carrying out HTFA on fMRI data may be found as part of the BrainIAK toolbox88, which may be downloaded at brainiak.org.

Temporal decoding

We sought to identify neural patterns that reflected participants’ ongoing cognitive processing of incoming stimulus information. As reviewed by Simony et al.36, one way of homing in on these stimulus-driven neural patterns is to compare activity patterns across individuals (e.g., using ISFC analyses). In particular, neural patterns will be similar across individuals to the extent that the neural patterns under consideration are stimulus-driven, and to the extent that the corresponding cognitive representations are reflected in similar spatial patterns across people55. Following this logic, we used an across-participant temporal decoding test developed in our prior work11 to assess the degree to which different neural patterns reflected ongoing stimulus-driven cognitive processing across people (Fig. 10). The approach entails using a subset of the data to train a classifier to decode stimulus timepoints (i.e., moments in the story participants listened to) from neural patterns. We used decoding (forward inference) accuracy on held-out data, from held-out participants, as a proxy for the extent to which the inputted neural patterns reflected stimulus-driven cognitive processing in a similar way across individuals.

Fig. 10: Decoding analysis pipeline. a. Computing dynamic correlations from timeseries data.
figure 10

Given a timeseries of observations as a T × K matrix (or a set of S such matrices), we use Eq. (4) to compute each participant’s DISFC (relative to other participants in the training or test sub-group, as appropriate). We repeat this process twice: once using the analysis kernel (shown here as a Gaussian in the upper row of the panel), and once using a δ function kernel (lower row of the panel). b. Projecting dynamic correlations into a lower-dimensional space. We project the training and test data into K-dimensional spaces to create compact representations of dynamic correlations at the given order (estimated using the analysis kernel). c. Kernel trick. We project the dynamic correlations computed using a δ function kernel into a common K-dimensional space. These low-dimensional embeddings are fed back through the analysis pipeline in order to compute features at the next-highest order. d. Decoding analysis. We split the training data into two equal groups, and optimize the feature weights (i.e., dynamic correlations at each order) to maximize decoding accuracy. We then apply the trained classifier to the (held-out) test data.

Forward inference and decoding accuracy

We used an across-participant correlation-based classifier to decode which stimulus timepoint matched each timepoint’s neural pattern (Fig. 10). We first divided the participants into two groups: a template group, \(\mathcal{G}_{\mathrm{template}}\) (i.e., training data), and a to-be-decoded group, \(\mathcal{G}_{\mathrm{decode}}\) (i.e., test data). We used Eq. (7) to compute a DISFC matrix for each group (\(\bar{\mathbf{C}}_{\mathrm{template}}\) and \(\bar{\mathbf{C}}_{\mathrm{decode}}\), respectively). We then correlated the rows of \(\bar{\mathbf{C}}_{\mathrm{template}}\) and \(\bar{\mathbf{C}}_{\mathrm{decode}}\) to form a number-of-timepoints by number-of-timepoints decoding matrix, Λ. In this way, the rows of Λ reflected timepoints from the template group, while the columns reflected timepoints from the to-be-decoded group. We used Λ to assign a temporal label to each row of \(\bar{\mathbf{C}}_{\mathrm{decode}}\), using the row of \(\bar{\mathbf{C}}_{\mathrm{template}}\) with which it was most highly correlated. We then repeated this decoding procedure, but using \(\mathcal{G}_{\mathrm{decode}}\) as the template group and \(\mathcal{G}_{\mathrm{template}}\) as the to-be-decoded group. Given the true timepoint labels (for each group), we defined the decoding accuracy as the average proportion of correctly decoded timepoints across both groups. We defined the relative decoding accuracy as the difference between the decoding accuracy and chance accuracy (i.e., \(\frac{1}{T}\)).
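
A schematic implementation of this decoder is sketched below. It takes two timepoint-by-feature matrices (one per group), builds the timepoint-by-timepoint correlation matrix Λ, and labels each to-be-decoded timepoint with its most-correlated template timepoint; the full procedure additionally repeats the test with the groups’ roles swapped and averages the two accuracies. The function name and interface are ours:

```python
import numpy as np


def decode_timepoints(C_template, C_decode):
    """Correlation-based temporal decoding.

    C_template, C_decode: T x F matrices whose rows are the (vectorized) DISFC
    patterns at each timepoint for the template and to-be-decoded groups.
    Returns (accuracy, relative accuracy)."""
    T = C_template.shape[0]
    # Lambda[i, j]: correlation between template timepoint i and decode timepoint j
    Lam = np.corrcoef(C_template, C_decode)[:T, T:]
    predicted = np.argmax(Lam, axis=0)              # best-matching template timepoint
    accuracy = np.mean(predicted == np.arange(T))
    return accuracy, accuracy - 1.0 / T
```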

Feature weighting and testing

We sought to examine which types of neural features (i.e., activations, first-order dynamic correlations, and higher-order dynamic correlations) were informative to the temporal decoders. Using the notation above, these features correspond to X0, X1, X2, X3, and so on.

One challenge to fairly evaluating high-order correlations is that if the kernel used in Eq. (4) is wider than a single timepoint, each repeated application of the equation will result in further temporal blur. Because our primary assessment metric is temporal decoding accuracy, this unfairly biases against detecting meaningful signal in higher-order correlations (relative to lower-order correlations). We attempted to mitigate temporal blur in estimating each Xn by using a Dirac δ function kernel (which places all of its mass over a single timepoint; Figs. 9b and 10a) to compute each lower-order correlation (X1, X2, ..., Xn−1). We then used a new (potentially wider, as described below) kernel to compute Xn from Xn−1. In this way, temporal blurring was applied only in the last step of computing Xn. We note that, because each Xn is a low-dimensional representation of the corresponding Yn, the higher-order correlations we estimated reflect true correlations in the data with lower fidelity than estimates of lower-order correlations. Therefore, even after correcting for temporal blurring, our approach is still biased against finding meaningful signal in higher-order correlations.

After computing each of X1, X2, ..., Xn−1 for each participant, we divided participants into two equally sized groups (±1 for odd numbers of participants): \(\mathcal{G}_{\mathrm{train}}\) and \(\mathcal{G}_{\mathrm{test}}\). We then further subdivided \(\mathcal{G}_{\mathrm{train}}\) into \(\mathcal{G}_{\mathrm{train}_1}\) and \(\mathcal{G}_{\mathrm{train}_2}\). We then computed Λ (temporal correlation) matrices for each type of neural feature, using \(\mathcal{G}_{\mathrm{train}_1}\) and \(\mathcal{G}_{\mathrm{train}_2}\). This resulted in n + 1 Λ matrices (one for the original timeseries of neural activations, and one for each of n orders of dynamic correlations). Our objective was to find a set of weights for each of these Λ matrices such that the weighted average of the n + 1 matrices yielded the highest decoding accuracy. We used quasi-Newton gradient ascent89, with decoding accuracy (for \(\mathcal{G}_{\mathrm{train}_1}\) and \(\mathcal{G}_{\mathrm{train}_2}\)) as the objective function to be maximized, to find an optimal set of training data-derived weights, ϕ0,1,...,n, where \(\sum_{i=0}^{n} \phi_i = 1\) and \(\phi_i \ge 0\ \forall\ i \in \left[0, 1, ..., n\right]\).
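
The weight optimization can be sketched as follows. For simplicity, this stand-in evaluates the decoding objective over random draws from the simplex (non-negative weights summing to 1) rather than using the quasi-Newton procedure described above; lambdas is assumed to be a list of the n + 1 per-order Λ matrices computed between the two training subgroups. Names and the sampling strategy are ours:

```python
import numpy as np


def weighted_accuracy(lambdas, phi):
    """Decoding accuracy obtained from a weighted average of the Lambda matrices."""
    Lam = sum(w * L for w, L in zip(phi, lambdas))
    T = Lam.shape[0]
    return np.mean(np.argmax(Lam, axis=0) == np.arange(T))


def fit_weights(lambdas, n_draws=5000, seed=0):
    """Simplified stand-in for the constrained optimization: sample candidate
    weight vectors uniformly from the simplex and keep the best-scoring one."""
    rng = np.random.default_rng(seed)
    best_phi, best_acc = None, -np.inf
    for _ in range(n_draws):
        phi = rng.dirichlet(np.ones(len(lambdas)))   # non-negative, sums to 1
        acc = weighted_accuracy(lambdas, phi)
        if acc > best_acc:
            best_phi, best_acc = phi, acc
    return best_phi, best_acc
```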

After estimating an optimal set of weights, we computed a new set of n + 1 Λ matrices correlating the DISFC patterns from \(\mathcal{G}_{\mathrm{train}}\) and \(\mathcal{G}_{\mathrm{test}}\) at each timepoint. We used the resulting decoding accuracy on \(\mathcal{G}_{\mathrm{test}}\) timepoints (using the weights in ϕ0,1,...,n to average the Λ matrices) to estimate how informative the set of neural features containing up to nth-order correlations was.

We used a permutation-based procedure to form stable estimates of decoding accuracy for each set of neural features. In particular, we computed the decoding accuracy for each of 10 random group assignments of \({{{{{{{{\mathcal{G}}}}}}}}}_{{{{{{{{\rm{train}}}}}}}}}\) and \({{{{{{{{\mathcal{G}}}}}}}}}_{{{{{{{{\rm{test}}}}}}}}}\). We report the mean accuracy (along with 95% confidence intervals) for each set of neural features.

Identifying robust decoding results

The temporal decoding procedure we use to estimate which neural features support ongoing cognitive processing is governed by several parameters. In particular, Eq. (4) requires defining a kernel function, which can take on different shapes and widths. For a fixed set of neural features, each of these parameters can yield different decoding accuracies. Further, the best decoding accuracy for a given timepoint may be reliably achieved by one set of parameters, whereas the best decoding accuracy for another timepoint might be reliably achieved by a different set of parameters, and the best decoding accuracy across all timepoints might be reliably achieved by still another set of parameters. Rather than attempting to maximize decoding accuracy, we sought to discover the trends in the data that were robust to classifier parameter choices. Specifically, we sought to characterize how decoding accuracy varied (under different experimental conditions) as a function of which neural features were considered.

To identify decoding results that were robust to specific classifier parameter choices, we repeated our decoding analyses after substituting into Eq. (4) each of a variety of kernel shapes and widths. We examined Gaussian (Fig. 9c), Laplace (Fig. 9d), and Mexican Hat (Fig. 9e) kernels, each with widths of 5, 10, 20, and 50 samples. We then report the average decoding accuracies across all of these parameter choices. This enabled us to (partially) factor out performance characteristics that were parameter-dependent, within the set of parameters we examined.
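
For reference, a sketch of the kernel families used in this sweep is given below; the exact parameterizations and the normalization are our own choices for illustration, and the δ function kernel from the blur-mitigation step above is included for completeness:

```python
import numpy as np


def kernel_weights(shape, width, radius=150):
    """Per-sample weights for a kernel centered at offset 0.

    shape: 'gaussian', 'laplace', 'mexican_hat', or 'delta'
    width: scale parameter in samples (e.g., 5, 10, 20, or 50)"""
    t = np.arange(-radius, radius + 1, dtype=float)
    if shape == 'gaussian':
        w = np.exp(-0.5 * (t / width) ** 2)
    elif shape == 'laplace':
        w = np.exp(-np.abs(t) / width)
    elif shape == 'mexican_hat':                       # Ricker wavelet
        w = (1 - (t / width) ** 2) * np.exp(-0.5 * (t / width) ** 2)
    elif shape == 'delta':                             # all mass on a single timepoint
        w = (t == 0).astype(float)
    else:
        raise ValueError(f"unknown kernel shape: {shape}")
    return w / np.abs(w).sum()   # one possible normalization choice
```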

Reverse inference

The dynamic patterns we examined comprise high-dimensional correlation patterns at each timepoint. To help interpret the resulting patterns in the context of other studies, we created summary maps by computing the across-timepoint average pairwise correlations at each order of analysis (first order, second order, etc.). We selected the 10 strongest (absolute value) correlations at each order. Each correlation is between the dynamic activity patterns (or patterns of dynamic high-order correlations) measured at two RBF nodes (see Hierarchical Topographic Factor Analysis). Therefore, the 10 strongest correlations involved up to 20 RBF nodes. Each RBF defines a spatial function whose activations range from 0 to 1. We constructed a map of RBF components that denoted the endpoints of the 10 strongest correlations (we set each RBF to have a maximum value of 1). We then carried out a meta-analysis using Neurosynth38 to identify the 10 terms most commonly associated with the given map. This resulted in a set of 10 terms associated with the average dynamic correlation patterns at each order.
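
A sketch of the node-selection step (picking the 10 strongest absolute correlations from an across-timepoint average correlation matrix and collecting the RBF nodes they connect); the subsequent Neurosynth queries are performed with that package’s own tools and are not shown here. The function name and return format are ours:

```python
import numpy as np


def strongest_pairs(avg_corr, n_pairs=10):
    """Return the n_pairs strongest (absolute) off-diagonal correlations and
    the set of RBF nodes (up to 2 * n_pairs of them) that they involve."""
    rows, cols = np.triu_indices(avg_corr.shape[0], k=1)   # unique node pairs
    order = np.argsort(np.abs(avg_corr[rows, cols]))[::-1][:n_pairs]
    pairs = [(rows[i], cols[i], avg_corr[rows[i], cols[i]]) for i in order]
    nodes = sorted({i for i, j, _ in pairs} | {j for _, j, _ in pairs})
    return pairs, nodes
```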

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.